×

Method and system for natural language dictionary generation

  • US 8,812,296 B2
  • Filed: 06/27/2007
  • Issued: 08/19/2014
  • Est. Priority Date: 06/27/2007
  • Status: Active Grant
First Claim
Patent Images

1. A method for a computer system to create a morphological dictionary for a natural language, the method comprising:

  • identifying a word token in a text corpus;

    applying by the computer system one or more paradigm rules to the word token;

    generating by the computer system one or more hypotheses about a part of speech for a base form of the word token;

    searching by the computer system for one or more word inflected forms corresponding to the base form of the word token;

    verifying by the computer system a hypothesis of the one or more hypotheses for the base form of the word token;

    adding by the computer system at least one grammatical value and at least one inflection paradigm to the base form of the word token based at least in part on the verified hypothesis;

    obtaining by the computer system one or more morphological descriptions for the word token based at least in part on the verified hypothesis; and

    adding the base form of the word token with the one or more morphological descriptions to the morphological dictionary of the natural language.

View all claims
  • 6 Assignments
Timeline View
Assignment View
    ×
    ×