×

EXTRACTING TERMS FROM DOCUMENT DATA INCLUDING TEXT SEGMENT

  • US 20130253916A1
  • Filed: 05/21/2013
  • Published: 09/26/2013
  • Est. Priority Date: 10/02/2008
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented system including a memory and a processor communicatively coupled to the memory for extracting terms from electronic document data that includes a text segment, the computer system comprising:

  • a first extraction unit that uses a first text processing information to extract a noun word from the document data;

    a second extraction unit that uses a second text processing information to extract a term candidate in relation to the extracted noun word from the document data or from a corpus that includes text data described in the same language used in the document data;

    a weight assignment unit that, in order to determine which one of a plurality of noun word types the extracted noun word and the extracted term candidate each belong to, uses a third text processing information to select which type to assign a weight from the plurality of types and assigns the weight to the selected type for each of the extracted noun word and the extracted term candidate;

    a determination unit that determines the type to which the extracted noun word and the extracted term candidate each belong, based on the assigned weight; and

    an output unit which follows the determination to output the extracted noun word and the extracted term candidate each in association with the determined type.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×