
System for extracting information from a natural language text

  • US 8,170,867 B2
  • Filed: 07/18/2003
  • Issued: 05/01/2012
  • Est. Priority Date: 07/19/2002
  • Status: Expired due to Fees
First Claim

1. A method implemented by computer of extracting information from a natural-language text of words comprising identifying patterns, wherein the words of the text are encoded by comparing them, using a processor, with the contents of a predefined lexicon containing less than 1000 tool words, said tools being essentially constituted by articles, prepositions, conjunctions and verbal auxiliaries, and in that nominal groups are then identified by searching subsets of the resulting succession of encoded words to look for groups of encoded words that comply with predefined syntactical rules, wherein the words of the text are encoded by evaluating the grammatical function of each word by comparing each word with the contents of said lexicon of tool words, so as to identify the tool words in the text, the grammatical function of said tool words being predefined, and in that the grammatical functions of the other words, which are not recognized as being tool words, are deduced by comparing their locations relative to the words recognized as being tool words.
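The steps recited in the claim (encode words against a small lexicon of tool words, deduce the function of the remaining words from their position relative to the tool words, then search the encoded sequence for nominal groups) can be illustrated with a rough Python sketch. Everything specific in it is an assumption made for illustration and is not taken from the patent: the tiny `TOOL_WORDS` lexicon, the tag names (`ART`, `PREP`, `CONJ`, `AUX`, `NOM`, `VERB`, `UNK`), the positional deduction rules, and the single syntactic rule "optional article followed by one or more nominal words".

```python
import re

# Hypothetical miniature lexicon of tool words with predefined functions.
# The claim only requires fewer than 1000 such words; these entries are
# illustrative assumptions, not the patent's lexicon.
TOOL_WORDS = {
    "the": "ART", "a": "ART", "an": "ART",
    "of": "PREP", "in": "PREP", "on": "PREP", "with": "PREP",
    "and": "CONJ", "or": "CONJ",
    "is": "AUX", "are": "AUX", "was": "AUX", "has": "AUX",
}


def encode(text):
    """Return a list of (word, code) pairs for the words of `text`."""
    words = re.findall(r"[A-Za-z]+", text.lower())
    encoded = []
    prev = None  # code assigned to the previous word
    for word in words:
        code = TOOL_WORDS.get(word)
        if code is None:
            # Not a tool word: deduce a function from its location
            # relative to the preceding word (assumed toy rules).
            if prev in ("ART", "PREP", "NOM"):
                code = "NOM"
            elif prev == "AUX":
                code = "VERB"
            else:
                code = "UNK"
        encoded.append((word, code))
        prev = code
    return encoded


def nominal_groups(encoded):
    """Find groups matching the assumed rule: optional ART, then one or more NOM."""
    groups, current = [], []

    def flush():
        # Keep the candidate only if it contains at least one nominal word.
        if any(code == "NOM" for _, code in current):
            groups.append(" ".join(word for word, _ in current))

    for word, code in encoded:
        if code == "NOM" or (code == "ART" and not current):
            current.append((word, code))
        else:
            flush()
            # An article may open a new candidate group; anything else resets.
            current = [(word, code)] if code == "ART" else []
    flush()
    return groups


sentence = "The black cat is sleeping on the old sofa of the house."
print(nominal_groups(encode(sentence)))
# ['the black cat', 'the old sofa', 'the house']
```

In this sketch no dictionary of nouns, adjectives, or verbs is consulted: "black", "cat", and "sofa" are tagged purely from their position after an article or another nominal word, which is the point the claim makes about relying on a small lexicon of tool words.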
