
Specialized language identification

  • US 10,216,721 B2
  • Filed: 09/30/2014
  • Issued: 02/26/2019
  • Est. Priority Date: 09/30/2014
  • Status: Active Grant
First Claim

1. A system comprising:

  • multiple engines that are each to produce output representative of a summary of the document, wherein each one of the multiple engines applies a different type of engine selected from a group of engines comprising an extractive type of engine, an abstractive type of engine, and a frequency type of engine, wherein the output from each of the multiple engines varies between the multiple engines in accordance with a respective type of engine;

  • a composite engine to generate a filtered set of content in a single output to reduce a size of the output produced by the multiple engines, wherein the filtered set of content comprises different combinations of the output from the multiple engines that have different densities of specialized word usage;

  • an identification engine to:

    • apply a weighting mechanism to the different combinations of the output in the filtered set of content;

    • obtain a value corresponding to the different combinations of the output in the filtered set of content;

    • identify specialized language from the different combinations of the output in the filtered set of content, wherein the value corresponding to the different combinations of the output in the filtered set of content reaching at least a particular threshold indicates specialized language within that output; and

    • index the document based on the specialized language that is identified to identify other documents salient to the document based on the specialized language.
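
The flow recited in the claim (combine engine outputs, weight them, compare a value against a threshold, then index) can be illustrated with a minimal Python sketch. It is not the patented implementation. Everything in it is an assumption made for illustration: the fixed `SPECIALIZED_TERMS` vocabulary, the per-engine `weights`, the use of word-level density as the "value", and all function names are hypothetical, and the claim does not specify any of them. The size-reducing filter of the composite engine is not modeled.

```python
from itertools import combinations

# Hypothetical vocabulary: the claim does not say how specialized words are
# recognized, so a fixed set stands in for that step.
SPECIALIZED_TERMS = {"tokenizer", "corpus", "lexicon"}


def combine_outputs(outputs):
    """Return every combination of one or more engine outputs, keyed by engine names."""
    names = sorted(outputs)
    combos = {}
    for size in range(1, len(names) + 1):
        for group in combinations(names, size):
            combos[group] = " ".join(outputs[name] for name in group)
    return combos


def weighted_value(group, text, weights):
    """Density of specialized words in text, scaled by the mean weight of the engines in group."""
    words = text.lower().split()
    if not words:
        return 0.0
    density = sum(word in SPECIALIZED_TERMS for word in words) / len(words)
    weight = sum(weights[name] for name in group) / len(group)
    return weight * density


def identify_specialized(outputs, weights, threshold):
    """Collect specialized words from every combination whose value reaches the threshold."""
    found = set()
    for group, text in combine_outputs(outputs).items():
        if weighted_value(group, text, weights) >= threshold:
            found.update(w for w in text.lower().split() if w in SPECIALIZED_TERMS)
    return found


def index_document(index, doc_id, terms):
    """Record doc_id under each identified term so documents sharing a term can be looked up."""
    for term in terms:
        index.setdefault(term, set()).add(doc_id)


# Example inputs (made up): one summary string per engine type named in the claim.
outputs = {
    "extractive": "the tokenizer splits the corpus",
    "abstractive": "a short note",
    "frequency": "corpus corpus lexicon",
}
weights = {"extractive": 1.0, "abstractive": 1.0, "frequency": 1.0}

index = {}
index_document(index, "doc-1", identify_specialized(outputs, weights, threshold=0.6))
```

In this sketch, documents that share an identified term end up in the same `index` entry, which is one simple way to read "identify other documents salient to the document"; a real system could use any other similarity or retrieval scheme.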
