Word Sense Disambiguation Using Emergent Categories

  • US 20100063796A1
  • Filed: 09/05/2008
  • Published: 03/11/2010
  • Est. Priority Date: 09/05/2008
  • Status: Active Grant
First Claim

1. A computer implemented method for word sense disambiguation in a natural language sentence, comprising the steps of:

  • parsing said natural language sentence, comprising the steps of:

    identifying one or more possible parts of speech for each term in the natural language sentence;

    identifying one or more possible phrase structures in the natural language sentence;

    identifying terms comprising one or more linguistic roles in the natural language sentence by generating declared patterns;

    identifying possible sense combinations for said identified terms with said linguistic roles in the natural language sentence, comprising the steps of:

    applying emergent categories to identify possible valid senses for each of the identified terms comprising the linguistic roles in the natural language sentence, wherein said emergent categories identify a set of senses for terms in a dictionary, wherein said senses in one of the emergent categories corresponds to the senses in one of the other emergent categories by a correspondence function, wherein said correspondence function identifies a linguistic correspondence between two senses;

    providing an emergent categories database comprising a plurality of correspondence functions, wherein each of said correspondence functions comprising a given correspondence function type identifies two emergent categories, wherein said correspondence function type specifies a linguistic role pair, wherein said linguistic role pair is a pairing of two linguistic roles, wherein the senses in each of said two emergent categories play one of said two linguistic roles in the correspondence function type;

    identifying linguistic role pairs from among the identified terms with the linguistic roles in the natural language sentence for identifying pair-wise terms using said emergent categories database;

    identifying the correspondence functions in the emergent categories database with correspondence function types matching said identified linguistic role pairs, wherein for each of the linguistic role pairs, the emergent categories identified by the correspondence function are valid for the corresponding linguistic roles, wherein the emergent categories specify one or more senses representing terms matching said identified pair-wise terms in the natural language sentence, wherein each sense in one of the emergent categories in said identified correspondence function is a possible valid pair-wise sense for the term in the natural language sentence when paired with the other emergent categories in the identified correspondence function;

    comparing pair-wise senses for each term with the identified linguistic roles in the natural language sentence to identify said possible sense combinations; and

    inferring possible senses for each term with the identified linguistic roles in the natural language sentence and previous sentences;

    whereby said inference of said possible senses enables word sense disambiguation in the natural language sentence.
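
The pair-wise filtering recited in the claim can be illustrated with a minimal Python sketch. It is not the patented implementation. The class `CorrespondenceFunction`, the functions `pairwise_senses` and `sense_combinations`, the toy `DICTIONARY` and `DATABASE`, and every sense identifier are hypothetical names chosen for illustration. The sketch assumes that parsing has already assigned one term to each linguistic role, and it leaves out the inference over previous sentences.

```python
from dataclasses import dataclass
from itertools import product
from typing import Dict, FrozenSet, List, Set, Tuple


@dataclass(frozen=True)
class CorrespondenceFunction:
    """Hypothetical record: one correspondence function linking two emergent categories."""
    role_pair: Tuple[str, str]       # correspondence function type, e.g. ("subject", "verb")
    first_category: FrozenSet[str]   # senses allowed to play the first role
    second_category: FrozenSet[str]  # senses allowed to play the second role


# Toy dictionary (assumption): each term maps to its candidate senses.
DICTIONARY: Dict[str, Set[str]] = {
    "bank": {"bank.institution", "bank.riverside"},
    "approved": {"approve.authorize"},
    "loan": {"loan.money"},
}

# Toy emergent categories database (assumption): a list of correspondence functions.
DATABASE: List[CorrespondenceFunction] = [
    CorrespondenceFunction(
        ("subject", "verb"),
        frozenset({"bank.institution", "person.adult"}),
        frozenset({"approve.authorize"}),
    ),
    CorrespondenceFunction(
        ("verb", "object"),
        frozenset({"approve.authorize"}),
        frozenset({"loan.money", "plan.scheme"}),
    ),
]


def pairwise_senses(role_a: str, term_a: str, role_b: str, term_b: str) -> Set[Tuple[str, str]]:
    """Collect sense pairs that some matching correspondence function accepts."""
    valid: Set[Tuple[str, str]] = set()
    for cf in DATABASE:
        if cf.role_pair != (role_a, role_b):
            continue  # correspondence function type does not match this role pair
        senses_a = DICTIONARY[term_a] & cf.first_category
        senses_b = DICTIONARY[term_b] & cf.second_category
        valid.update(product(senses_a, senses_b))
    return valid


def sense_combinations(roles: Dict[str, str]) -> List[Dict[str, str]]:
    """Return every role-to-sense assignment consistent with all covered role pairs."""
    names = list(roles)
    # Only role pairs that the database has a correspondence function type for.
    pairs = [
        (a, b)
        for a in names
        for b in names
        if any(cf.role_pair == (a, b) for cf in DATABASE)
    ]
    allowed = {(a, b): pairwise_senses(a, roles[a], b, roles[b]) for a, b in pairs}
    combos: List[Dict[str, str]] = []
    # Enumerate candidate assignments and keep those valid for every pair.
    for senses in product(*(sorted(DICTIONARY[roles[r]]) for r in names)):
        assignment = dict(zip(names, senses))
        if all((assignment[a], assignment[b]) in allowed[(a, b)] for a, b in pairs):
            combos.append(assignment)
    return combos
```

Calling `sense_combinations({"subject": "bank", "verb": "approved", "object": "loan"})` on this toy data keeps only the assignment in which "bank" takes the sense `bank.institution`, because the riverside sense belongs to no emergent category paired with the verb. Enumerating the full product of senses is a simplification that suits short sentences; the claim does not specify a search strategy.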
