Text searching system
First Claim
Patent Images
1. A computer apparatus for searching a collection of words based upon an input word, said computer apparatus comprising:
- a generating means for generating a first set of words containing members that are lexically related to the input word, said first set of words including words that are both regular inflectional nouns and words that are other than regular inflectional nouns, the generating means includinga recognition engine for finding at least one linguistic stem within said input word and identifying at least one suffix attached to each said linguistic stem,generating members of said first set of words by stripping from said input word suffixes selected from a first group of suffixes which do not substantially change the meaning of words when stripped therefrom, andperforming inflectional and derivational analysis upon said input word by generating all inflections and derivations that can be generated from said linguistic stems by the addition of one suffix wherein said derivational analysis employs at least three derivational suffixes; and
a search engine for searching said collection of words to detect the occurrence of words from a group of search words, said group of search words comprising the input word and said first set of words.
8 Assignments
0 Petitions
Accused Products
Abstract
An apparatus for searching a collection of words based upon an input word, the apparatus including means for generating a first set of words containing members that are lexically related to the input word, the first set of words including words that are other than regular inflectional nouns; and also including a search engine for searching the collection of words to detect the occurrence of any of the words from a group of search words, the group of search words including the input word and the first set of words.
352 Citations
35 Claims
-
1. A computer apparatus for searching a collection of words based upon an input word, said computer apparatus comprising:
-
a generating means for generating a first set of words containing members that are lexically related to the input word, said first set of words including words that are both regular inflectional nouns and words that are other than regular inflectional nouns, the generating means including a recognition engine for finding at least one linguistic stem within said input word and identifying at least one suffix attached to each said linguistic stem, generating members of said first set of words by stripping from said input word suffixes selected from a first group of suffixes which do not substantially change the meaning of words when stripped therefrom, and performing inflectional and derivational analysis upon said input word by generating all inflections and derivations that can be generated from said linguistic stems by the addition of one suffix wherein said derivational analysis employs at least three derivational suffixes; and a search engine for searching said collection of words to detect the occurrence of words from a group of search words, said group of search words comprising the input word and said first set of words. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. A computer apparatus comprising a subject expansion system for expanding an input word into a plurality of related words, the computer apparatus comprising:
-
a means for receiving the input word; and a generating means for generating a first group of words from the input word, wherein said generating means generates at least some of said first group of words by stripping from said input word only suffixes selected from a first group of suffixes, the members of said first group of suffixes including only those suffixes which do not substantially change the meaning of words when stripped therefrom, and includes a recognition engine for finding all linguistic stems within said input word and performing inflectional and derivational analysis upon said input word by generating all inflections and derivations that can be generated from said linguistic stems by the addition of one suffix wherein said derivational analysis employs at least three derivational suffixes. - View Dependent Claims (19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30)
-
-
31. A computer apparatus for generating a group of words from an input word, said computer comprising:
-
means for removing suffixes from said input word to generate at least one base word; a thesaurus means adapted to receive said one or more base words and generate a group of synonyms for certain of said at least one base word; and a generating means comprising a recognition engine for generating a base history for said input word, said base history identifying said one or more base words, said generating means including a means for finding a linguistic stem within said input word, a means for identifying suffixes attached to said linguistic stem, wherein said linguistic stem finding means and suffix identifying means cooperate to conduct morphological analysis of the input word from a root to an affix, and a means for performing inflectional and derivational analysis by generating all inflections and derivations than can be generated from said linguistic stems by the addition of one suffix wherein said derivational analysis employs at least three derivational suffixes. - View Dependent Claims (32, 33)
-
-
34. A computer-implemented method for generating a collection of words from an input word, said method comprising:
-
generating, by means of a generating engine, a first set of words containing members that are lexically related to the input word, said first set of words including words that are both regular inflectional nouns and words that are other than regular inflectional nouns, by operation of a recognition engine stripping from said input word only suffixes selected from a first group of suffixes, the members of said first group of suffixes including only those suffixes which do not substantially change the meaning of words when stripped therefrom; and generating, by operation of the generating engine, a second set of words that are lexically related to the input word, said second set of words including words that are both regular inflectional nouns and words that are other than regular inflectional nouns by finding at least one linguistic stem within said input word, identifying suffixes attached to each said linguistic stem, and by operation of the recognition engine, performing inflectional and derivational analysis by generating all inflections and derivations than can be generated from said linguistic stems by the addition of one suffix wherein said derivational analysis employs at least three derivational suffixes. - View Dependent Claims (35)
-
Specification