×

METHOD AND SYSTEM FOR NATURAL LANGUAGE DICTIONARY GENERATION

  • US 20090006078A1
  • Filed: 06/27/2007
  • Published: 01/01/2009
  • Est. Priority Date: 06/27/2007
  • Status: Active Grant
First Claim
Patent Images

1. A method of analyzing a text corpus in a natural language, comprising:

  • identifying each word token in the text corpus;

    applying one or more paradigm rules to each word token in the text corpus;

    generating one or more hypotheses for base forms of each word token;

    searching for other word inflected forms corresponding to the base form of each word token;

    verifying each hypothesis of the one or more hypotheses for each base form of each word token to identify verified hypothesis;

    adding grammatical values and inflection paradigms to each base form of each word token for each verified hypothesis; and

    obtaining information on its morphological descriptions for each word token with verified hypothesis.

View all claims
  • 6 Assignments
Timeline View
Assignment View
    ×
    ×