Full-form lexicon with tagged data and methods of constructing and using the same
First Claim
1. A computer readable storage medium having instructions, which when performed by a computing device, operate as a language processing system having a text analyzer, the text analyzer configured to receive and analyze input text by accessing a lexicon stored on the computer readable storage medium and provide an output in accordance with accessing information from the lexicon, the lexicon comprising a plurality of entries, each entry corresponding to a word entered in the lexicon, wherein each entry comprises:
- a first field comprising spelling information for an entered word;
a second field comprising part of speech information associated with the entered word; and
a third field comprising lemma delta information associated with the entered word, wherein the lemma delta information comprises transformation information associated with the entered word, the transformation information comprising an op code and an argument value, wherein the op code is indicative of an operation to perform upon the entered word based on the argument value in order to convert the entered word into a second word.
3 Assignments
0 Petitions
Accused Products
Abstract
A lexicon stored on a computer readable medium and used by language processing systems. The lexicon can store word information in a plurality of data fields associated with each entered word. The data fields can include information on spelling and grammar, parts of speech, steps that the entered word can be transformed into another word, a word description, and a segmentation for a compound word. Information that cannot be stored in the lexicon can be stored in an intermediate indexes table. Associated methods of constructing, updating and using the lexicon are introduced.
-
Citations
36 Claims
-
1. A computer readable storage medium having instructions, which when performed by a computing device, operate as a language processing system having a text analyzer, the text analyzer configured to receive and analyze input text by accessing a lexicon stored on the computer readable storage medium and provide an output in accordance with accessing information from the lexicon, the lexicon comprising a plurality of entries, each entry corresponding to a word entered in the lexicon, wherein each entry comprises:
-
a first field comprising spelling information for an entered word; a second field comprising part of speech information associated with the entered word; and a third field comprising lemma delta information associated with the entered word, wherein the lemma delta information comprises transformation information associated with the entered word, the transformation information comprising an op code and an argument value, wherein the op code is indicative of an operation to perform upon the entered word based on the argument value in order to convert the entered word into a second word. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A computer readable storage medium having instructions, which when performed by a computng device, operate as a language processing em having text analyzer, the text analyzer configured to receive and analyze input text by accessing a lexicon stored on the computer readable storage medium and provide an output in accordance with accessing information from the lexicon, the lexicon comprising a plurality of entries, each entry corresponding to a word entered in the lexicon, wherein each entry comprises:
-
a first field comprising spelling information for an entered word; a second field comprising part of speech information associated with the entered word; and a third field comprising lemma delta information associated with the entered word;
a fourth field comprising description information associated with the entered word; and a fifth field comprising static segmentation mask information associated with the entered word. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29)
-
-
30. A computer readable storage medium having instructions, which when performed by a computing device, operate as a language processing system having a text analyzer, the text analyzer configured to receive and analyze input text by accessing a lexicon stored on the computer readable storage medium and provide an output in accordance with accessing information from the lexicon, the lexicon comprising information for entered words, wherein for each entered word, corresponding word information is stored in data fields, the data fields comprising:
-
a spelling and dynamic segmentation field related to the entered word; a part of speech field related to the entered word; a lemma delta field related to the entered word; a description field for the entered word; and a static segmentation mask field for the entered word. - View Dependent Claims (31)
-
-
32. A method of constructing a lexicon comprising information about words, for each word, the method comprising steps of:
-
storing spelling and dynamic segmentation information; storing part of speech information; storing lemma delta information; storing description information for each word; and storing static segmentation mask information for words that are compound terms. - View Dependent Claims (33, 34, 35, 36)
-
Specification