×

Text indexing system

  • US 5,708,829 A
  • Filed: 09/25/1996
  • Issued: 01/13/1998
  • Est. Priority Date: 02/01/1991
  • Status: Expired due to Fees
First Claim
Patent Images

1. An apparatus for generating an index for a collection of words, the apparatus comprising:

  • means for selecting an input word from said collection of words;

    means for generating words that are lexically related to said input word wherein a word that is lexically related to said input word is a word having a meaning related to a meaning of said input word, and wherein said input word and said lexically related words form a group of words having related meanings,said means for generating words that are lexically related to said input word comprising a recognition engine includinga means for identifying the underlying lexical form of the input word, the underlying lexical form of the input word being comprised of the lexical morphemes forming the input word,a means for scanning the lexical form of the input word for finding at least one lexical stem within the lexical morphemes forming said lexical form of the input word wherein a lexical stem of a word has a meaning related to the meaning of the input word, anda means for identifying suffixes attached to said lexical stem, wherein said suffixes include ending suffixes and continuation suffixes and said stem finding means and suffix identifying means cooperate to conduct morphological analysis of the input word from the root to the affix and wherein said recognition engine performs inflectional and derivational analysis, wherein each derivational analysis is performed recursively using more than two derivational suffixes within each said input word; and

    an indexing engine for representing the occurrence in said collection of words of any of the members of said group by a single member of said group.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×