Lexicon with sectionalized data and method of using the same
Abstract
A data structure for a word lexicon includes a plurality of separate data sections for storing information related to word entries. An indices section includes pointers indicating the location of the information. The location of the associated pointers for each word entry is obtained as a function of a list of the word entries.
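As a rough illustration of the layout the abstract describes, the following sketch models a lexicon as three separate parts: a word list, an indices section holding pointers, and the data sections themselves. The class name `Lexicon`, the method `pointers_for`, and the section names `pos` and `pronunciation` are hypothetical and chosen only for illustration; the abstract does not prescribe this representation.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Lexicon:
    # Word list: maps each word entry to a row of the indices section.
    word_list: Dict[str, int] = field(default_factory=dict)
    # Indices section: per word, section name -> offset into that data section.
    indices: List[Dict[str, int]] = field(default_factory=list)
    # Data sections: each stores one kind of information, apart from the pointers.
    sections: Dict[str, List[object]] = field(default_factory=dict)

    def pointers_for(self, word: str) -> Dict[str, int]:
        """Find a word's pointers as a function of the word list."""
        return self.indices[self.word_list[word]]


# Illustrative data only (assumed, not from the patent).
lex = Lexicon(
    word_list={"run": 0},
    indices=[{"pos": 0, "pronunciation": 0}],
    sections={"pos": [["noun", "verb"]], "pronunciation": ["R AH N"]},
)
ptrs = lex.pointers_for("run")
pos_info = lex.sections["pos"][ptrs["pos"]]
```

Keeping the pointers in their own table means a caller can follow only the pointers it needs without reading the other data sections.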
288 Citations
44 Claims
1. A computer readable storage medium having a lexicon for storing word information and adapted for use with a text analyzer in a language processing system, wherein the lexicon is adapted to be used in a plurality of language processing tasks, the lexicon comprising:
a word list section for storing a list of words;
a set of data sections corresponding with each word on the word list, wherein the data sections store substantially different selected information about the corresponding word in the word list; and
for each word on the word list, a plurality of pointers stored in an indices table apart from the sets of data sections, each of the pointers pointing to a different data section related to different information about the corresponding word, wherein the plurality of pointers comprises a first set and a second set of the pointers, the first set used to access information related to a first natural language processing task and the second set used to access information related to a second natural language processing task, wherein the first set of the pointers is not the same as the second set of the pointers.
Dependent claims: 2-25
26. A computer-implemented method for obtaining word information by accessing a lexicon that is adapted for use in a plurality of different natural language processing tasks, wherein the lexicon is adapted to be used with a text analyzer in a language processing system, and wherein the lexicon stores word information pertaining to a plurality of words, the lexicon comprising:
a word list section storing the plurality of words;
sets of data sections, wherein each set of data sections corresponds with an individual word in the word list section, each data section among a set of data sections storing different selected information about the corresponding word in the word list; and
an indices section storing a plurality of pointers apart from the sets of data sections, wherein each plurality of pointers corresponds with an individual word and comprises a first set of pointers associated with a natural language processing task and a second set of pointers associated with a different natural language processing task, wherein the first set of pointers is different from the second set of pointers, each of the sets of pointers pointing to data in a data section,
the method comprising:
accessing the word list section as a function of said word to ascertain a pointer identification for the indices section;
using the pointer identification to obtain one of the first or second sets of pointers in the indices section based on the natural language processing task to be performed;
using one of the first or second sets of pointers to obtain information from only some data sections of the set of data sections, the only some data sections having information about said word necessary to perform the natural language processing task.
Dependent claims: 27-32, 37
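The three steps of the method in claim 26 can be sketched as a small lookup function. This is a minimal sketch under stated assumptions, not the claimed implementation: the task names `spell_check` and `parsing`, the section names, the function name `lookup`, and all stored values are hypothetical.

```python
# Hypothetical toy lexicon; every name and value here is illustrative only.
WORD_LIST = {"run": 0}  # word -> pointer identification
INDICES = [
    {
        # Two different pointer sets for the same word, one per task.
        "spell_check": {"spelling": 0},
        "parsing": {"pos": 0, "inflection": 0},
    }
]
DATA_SECTIONS = {
    "spelling": [{"restricted": False}],
    "pos": [["noun", "verb"]],
    "inflection": [["runs", "ran", "running"]],
}


def lookup(word, task):
    # Step 1: access the word list to get the pointer identification.
    pointer_id = WORD_LIST[word]
    # Step 2: pick the pointer set that matches the task to be performed.
    pointer_set = INDICES[pointer_id][task]
    # Step 3: follow only those pointers, so only some data sections are read.
    return {name: DATA_SECTIONS[name][offset] for name, offset in pointer_set.items()}


parsing_info = lookup("run", "parsing")
spelling_info = lookup("run", "spell_check")
```

In this sketch a parsing lookup never touches the `spelling` section, which is the point of keeping a separate pointer set per task.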
33. A computer-implemented method for storing word information in a lexicon adapted for use in a plurality of natural language processing tasks, wherein the lexicon is adapted for use with a text analyzer in a language processing system, the method comprising:
storing word information in sets of data sections, each set of data sections corresponding with an individual word in a word list, each data section of a set of data sections storing substantially different selected information about the corresponding word in the word list;
storing sets of pointers in an indices section separate from the sets of data sections, each set of pointers corresponding with an individual word in the word list, wherein some words comprise a first set of pointers associated with a natural language processing task and a second set of pointers associated with a different natural language processing task, wherein the first set of pointers is different from the second set of pointers, and wherein each of the pointers points to selected data in the sets of data sections; and
storing the word list in a word list section separate from the sets of data sections and the indices section, the word list having information for identifying the corresponding set of pointers corresponding to an individual word.
Dependent claims: 34-36
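The storing method of claim 33 can be illustrated with a short builder that writes to the three parts in turn. This is only a sketch under assumed names: `new_lexicon`, `store_word`, the dictionary keys, and the sample tasks and sections are hypothetical and do not come from the patent.

```python
def new_lexicon():
    # Three separate parts: word list, indices section, data sections.
    return {"word_list": {}, "indices": [], "sections": {}}


def store_word(lexicon, word, info, task_sections):
    """Store one word.

    info: section name -> data for this word (assumed shape).
    task_sections: task name -> section names that task needs (assumed shape).
    """
    # Store the word information in the data sections, recording each offset.
    offsets = {}
    for name, value in info.items():
        section = lexicon["sections"].setdefault(name, [])
        offsets[name] = len(section)
        section.append(value)
    # Store one pointer set per task in the separate indices section.
    pointer_sets = {
        task: {name: offsets[name] for name in names}
        for task, names in task_sections.items()
    }
    lexicon["indices"].append(pointer_sets)
    # Store the word with the information that identifies its pointer sets.
    lexicon["word_list"][word] = len(lexicon["indices"]) - 1
    return lexicon


lexicon = new_lexicon()
store_word(
    lexicon,
    "run",
    info={"spelling": {"restricted": False}, "pos": ["noun", "verb"]},
    task_sections={"spell_check": ["spelling"], "parsing": ["pos"]},
)
```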
38. A computer-implemented method for obtaining word information from a plurality of lexicons adapted for use in a plurality of different language processing tasks, the plurality of lexicons adapted for use with a text analyzer of a language processing system, each lexicon comprising:
a word list section storing a plurality of words;
sets of data sections, wherein each set of data sections corresponds with an individual word in the word list, each data section of a set of data sections storing different selected information about the corresponding word in the word list; and
an indices section storing a plurality of pointers for each individual word in the word list, the indices section stored apart from the sets of data sections, wherein each plurality of pointers comprises a first set of pointers associated with a natural language processing task and a second set of pointers associated with a different natural language processing task, wherein the first set of pointers is different from the second set of pointers, each of the pointers pointing to data in the sets of data sections,
the method comprising:
selectively accessing the sets of data sections of each of the lexicons having similar information and obtaining information therefrom using the first set or second set of pointers based on the particular language processing task performed.
Dependent claims: 39-44
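One way to picture the multi-lexicon access of claim 38 is a loop that applies the same task-specific lookup to each lexicon in turn. The sketch below assumes two lexicons, here called `USER` and `CORE`; those names, the function `lookup_all`, and all sample data are hypothetical, and the claim itself does not say how many lexicons there are or how their results are combined.

```python
# Two hypothetical lexicons with the same three-part layout.
CORE = {
    "word_list": {"run": 0},
    "indices": [{"parsing": {"pos": 0}, "spell_check": {"spelling": 0}}],
    "sections": {"pos": [["noun", "verb"]], "spelling": [{"restricted": False}]},
}
USER = {
    "word_list": {"run": 0},
    "indices": [{"parsing": {"pos": 0}}],
    "sections": {"pos": [["adjective"]]},
}


def lookup_all(lexicons, word, task):
    """Collect task-specific information for a word from each lexicon."""
    results = []
    for lex in lexicons:
        pointer_id = lex["word_list"].get(word)
        if pointer_id is None:
            continue  # this lexicon does not list the word
        # Choose the pointer set for the task being performed, if any.
        pointer_set = lex["indices"][pointer_id].get(task, {})
        if pointer_set:
            results.append(
                {name: lex["sections"][name][off] for name, off in pointer_set.items()}
            )
    return results


parsing_results = lookup_all([USER, CORE], "run", "parsing")
spelling_results = lookup_all([USER, CORE], "run", "spell_check")
```

Because every lexicon shares the same section names in this sketch, the same pointer-following step reaches the sections holding similar information in each one.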
Specification