Methods and apparatus for understanding machine vocabulary
First Claim
1. A method for representing text in a language independent format of multiple bits and fields of bits comprising:
referencing a word from a set of words into a dictionary of representations of words, in said format of multiple bits and fields of bits, to find a corresponding dictionary term, the set of words derived from a natural language usage defining a natural language context of the words in the set of words;
mapping the referenced dictionary term to at least one definition element indicative of usage of the word in at least one context, the definition elements defining a record of fields;
comparing the mapped definition element to the corresponding fields in the definition elements of other words in the set of words, the comparison operative to identify similar contexts between the definition elements;
disambiguating the referenced words by analyzing each of the definition elements of the referenced words with the definition elements of the other referenced words, the analysis operable to determine a particular definition for each of the referenced words in the context of the set of words, disambiguating including;
performing, with a processor, bitwise operations on at least a subset of the fields in the definition elements with corresponding fields in the other definition elements in the set of words, each of the definition elements indicative of a particular context, the operations for identifying a particular definition element based on the context in the set of words, the definition elements including bit fields of class, method, and category in the high bits, the method field determining a structure of fields in lower order bits, wherein disambiguating is performed on definition elements of equal category and method fields; and
identifying, from the comparing, a definition element corresponding to the usage of the word in a context of the set of words.
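The field layout and the equal-field restriction recited in claim 1 can be sketched as follows. This is only an illustration under stated assumptions: the 32-bit width, the field widths and order (class 4 bits, method 4 bits, category 8 bits, 16 low-order bits), and the names `pack`, `comparable`, `overlap`, and `disambiguate` are hypothetical. The claim says only that class, method, and category occupy the high bits and that bitwise operations are applied to elements of equal category and method fields; the overlap-count scoring below is an assumed stand-in for whatever comparison an implementation would use.

```python
# Assumed layout (not from the claim): [class:4][method:4][category:8][low:16]
CLASS_SHIFT, CLASS_BITS = 28, 4
METHOD_SHIFT, METHOD_BITS = 24, 4
CATEGORY_SHIFT, CATEGORY_BITS = 16, 8
LOW_MASK = 0xFFFF  # low-order bits; their structure would depend on the method field


def pack(cls: int, method: int, category: int, low: int) -> int:
    """Pack the four fields into one integer definition element."""
    return (
        (cls & 0xF) << CLASS_SHIFT
        | (method & 0xF) << METHOD_SHIFT
        | (category & 0xFF) << CATEGORY_SHIFT
        | (low & LOW_MASK)
    )


def comparable(a: int, b: int) -> bool:
    """True when a and b have equal method and category fields."""
    selector = (
        ((1 << METHOD_BITS) - 1) << METHOD_SHIFT
        | ((1 << CATEGORY_BITS) - 1) << CATEGORY_SHIFT
    )
    return ((a ^ b) & selector) == 0


def overlap(a: int, b: int) -> int:
    """Count low-order bits set in both elements; 0 if they are not comparable."""
    if not comparable(a, b):
        return 0
    return bin(a & b & LOW_MASK).count("1")


def disambiguate(candidates: list[list[int]]) -> list[int]:
    """For each word, pick the candidate element that overlaps most with
    the candidate elements of the other words in the set."""
    chosen = []
    for i, own in enumerate(candidates):
        others = [e for j, c in enumerate(candidates) if j != i for e in c]
        chosen.append(max(own, key=lambda e: sum(overlap(e, o) for o in others)))
    return chosen


# Illustrative values only: "bank" has two candidate definitions, "water" has one.
bank_money = pack(1, 1, 1, 0b0011)
bank_river = pack(1, 1, 2, 0b1100)
water = pack(1, 1, 2, 0b0100)
selected = disambiguate([[bank_money, bank_river], [water]])
```

With these made-up values, `selected` holds `bank_river` for the first word, because only that candidate shares its category and method fields, and a low-order bit, with `water`.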
Abstract
Configurations herein provide a language processing mechanism operable to define a machine vocabulary and identify a machine language version of the words that preserves context and identifies the proper definition of the words by identifying and preserving context of a particular set of words, such as a sentence or paragraph. The machine vocabulary includes a definition section for each definition of a word. Each definition section includes a set of one or more definition elements. The definition elements include a predetermined format of definition fields, and each has a corresponding mask indicative of significant definition fields. The set of definition elements corresponding to a particular definition describe the usage of the word in a context matching that particular definition. Each definition element captures a characteristic of the definition according to fuzzy logic such that the definition elements collectively capture the context.
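The role of the per-element mask can be illustrated with a short sketch. The 16-bit width, the sample values, and the names `DefinitionElement` and `matches` are assumptions made for illustration; the abstract states only that each definition element has a corresponding mask indicating which definition fields are significant.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DefinitionElement:
    """One definition element plus its mask (hypothetical 16-bit layout)."""
    bits: int  # packed definition fields
    mask: int  # 1-bits mark the positions that are significant


def matches(a: DefinitionElement, b: DefinitionElement) -> bool:
    """True when a and b agree on every bit that both treat as significant."""
    shared = a.mask & b.mask
    return ((a.bits ^ b.bits) & shared) == 0


# Illustrative values only: the low byte is significant for `river` alone.
river = DefinitionElement(bits=0x12A5, mask=0xFFFF)
water = DefinitionElement(bits=0x1200, mask=0xFF00)
money = DefinitionElement(bits=0x3400, mask=0xFF00)
```

Here `matches(river, water)` holds because the two elements differ only in bits that `water` marks as insignificant, while `matches(river, money)` fails on the high byte that both masks cover.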
26 Claims
1. A method for representing text in a language independent format of multiple bits and fields of bits (recited in full under First Claim above).
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18
19. A method for processing language elements comprising:
receiving a series of words in a natural language;
parsing the series of words to derive an ordered set of words in the natural language, the set of words derived from a natural language usage defining a natural language context of the words in the set of words;
mapping each of the parsed words to a corresponding term in a dictionary of representations of multiple bits and fields of bits, of words in the particular natural language;
mapping each of the identified dictionary terms to at least one definition element, the definition elements defining a record of fields and indicative of the word in context with other words;
processing the mapped definition elements with corresponding fields in the other mapped definition elements from the ordered set of words to identify corresponding contexts of words in the ordered set of words, each of the definition elements identifying a particular definition of a word in the context defined by proximate words, each definition element having a context specific definition;
disambiguating the parsed words by analyzing each of the definition elements of the parsed words with the definition elements of the other parsed words, the analysis operable to determine a particular definition for each of the parsed words in a context of the series of words, disambiguating including;
performing, with a processor, bitwise operations on at least a subset of the fields in the definition elements with corresponding fields in the other definition elements in the word sequence, each of the definition elements indicative of a particular context, the operations for identifying a particular definition element based on the context in the word sequence, the definition elements including bit fields of class, method, and category in the high bits, the method field determining a structure of fields in lower order bits, wherein disambiguating is performed on definition elements of equal category and method fields; and
identifying, from the processing, a particular definition element for each of the words applicable to the word in the ordered set of words.
Dependent claims: 20, 21, 22, 23
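The sequence of steps in claim 19 (receive, parse, map words to dictionary terms, map terms to definition elements, then select one element per word) can be sketched end to end. Everything concrete here is hypothetical: the toy `TERMS` and `DEFINITIONS` tables, the 16-bit elements whose high byte stands in for the method and category fields, the shared-bit score, and the choice to skip words missing from the dictionary. None of these details come from the claim.

```python
import re

# Hypothetical toy data: word -> dictionary term id.
TERMS = {"bank": 0x01, "river": 0x02, "loan": 0x03}

# Term id -> candidate (gloss, element) pairs. In this assumed layout the
# high byte must be equal for two elements to be compared, and the low byte
# holds context bits.
DEFINITIONS = {
    0x01: [("financial institution", 0x1003), ("river edge", 0x200C)],
    0x02: [("stream of water", 0x2004)],
    0x03: [("money lent", 0x1001)],
}


def parse(text: str) -> list[str]:
    """Split the received text into an ordered list of lowercase words."""
    return re.findall(r"[a-z]+", text.lower())


def score(a: int, b: int) -> int:
    """Shared low-byte bits, counted only when the high bytes are equal."""
    if (a ^ b) & 0xFF00:
        return 0
    return bin(a & b & 0xFF).count("1")


def process(text: str) -> list[tuple[str, str]]:
    """Return (word, chosen gloss) pairs for every word found in TERMS."""
    words = [w for w in parse(text) if w in TERMS]  # unknown words are skipped
    elements = [DEFINITIONS[TERMS[w]] for w in words]
    result = []
    for i, (word, own) in enumerate(zip(words, elements)):
        context = [e for j, c in enumerate(elements) if j != i for _, e in c]
        gloss, _ = max(own, key=lambda d: sum(score(d[1], o) for o in context))
        result.append((word, gloss))
    return result
```

With this toy data, `process("The river bank")` selects the "river edge" sense of "bank", and `process("a bank loan")` selects "financial institution", since each sense shares a high byte and a context bit with a different neighbouring word.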
24. A data processing device for representing text in a language independent format of multiple bits and fields of bits, comprising:
a processor operable to execute instructions;
a memory responsive to the processor operable to store the instructions;
an interface operable to transfer data between a user device, the processor and the memory, the instructions operable to;
reference a word from a set of words into a dictionary of bitwise representations of words to find a corresponding dictionary term, the set of words derived from a natural language usage defining a natural language context of the words in the set of words;
map the referenced dictionary term to at least one definition element indicative of usage of the word in a specific context, the definition elements defining a record of fields, each of the definition elements identifying a particular definition of a word in the context defined by proximate words, each definition element having a context specific definition;
compare the mapped definition element to the corresponding fields in the definition elements of other words in the set of words, the comparison operative to identify similar contexts between the definition elements;
disambiguate the referenced words by analyzing each of the definition elements of the referenced words with the definition elements of the other referenced words, the analysis operable to determine a particular definition for each of the referenced words in a context of the series of words, disambiguating including;
performing bitwise operations on at least a subset of the fields in the definition elements with corresponding fields in the other definition elements in the set of words, each of the definition elements indicative of a particular context, the operations for identifying a particular definition element based on the context in the set of words, the definition element including bit fields of class, method, and category in the high bits, the method field determining a structure of fields in lower order bits, wherein disambiguating is performed on definition elements of equal category and method fields; and
identify, from the comparing, a definition element corresponding to the usage of the word in a context of the set of words.
25. A computer program product having a non-transitory computer readable medium operable to store computer program logic embodied in computer program code encoded thereon for processing language elements comprising:
computer program code for receiving a series of words in a natural language;
computer program code for parsing the series of words to derive an ordered set of words in the natural language, the set of words derived from a natural language usage defining a natural language context of the words in the set of words;
computer program code for mapping each of the parsed words to a corresponding term in a dictionary of bitwise representations of words in the particular natural language;
computer program code for mapping each of the identified dictionary terms to at least one definition element, the definition elements indicative of the word in context with other words, each of the definition elements defining a record of fields and identifying a particular definition of a word in the context defined by proximate words, each definition element having a context specific definition;
computer program code for processing the mapped definition elements with corresponding fields in the other mapped definition elements from the ordered set of words to identify corresponding contexts of words in the ordered set of words;
computer program code for identifying, from the processing, a particular definition element for each of the words applicable to the word in the ordered set of words;
computer program code for comparing the referenced words to the dictionary, the dictionary indicative of definitions of the referenced words, each word having at least one definition, the definitions corresponding to different contexts of the referenced words; and
computer program code for disambiguating the referenced words by analyzing each of the definition elements of the referenced words with the definition elements of the other referenced words, the analysis operable to determine a particular definition for each of the referenced words in a context of the set of words, disambiguating including;
performing bitwise operations on at least a subset of the fields in the definition elements with corresponding fields in the other definition elements in the word sequence, each of the definition elements indicative of a particular context, the operations for identifying a particular definition element based on the context in the set of words, the definition element including bit fields of class, method, and category in the high bits, the method field determining a structure of fields in lower order bits, wherein disambiguating is performed on definition elements of equal category and method fields; and
computer program code for identifying, from the comparing, a particular definition element for each of the words applicable to the word in the ordered set of words.
26. A machine vocabulary processor for translation independent of a native natural language comprising:
means for generating a bitwise language representation by classifying language elements according to a predetermined set of rules, the bitwise language representation having a plurality of bitwise definition elements corresponding to words in a particular usage context;
means for parsing the words to identify the definition elements, each of the definition elements having fields;
means for receiving a natural language sentence in a first natural language, the natural language sentence comprising a series of words, the series of words derived from a natural language usage defining a natural language context of the words in the set of words;
means for disambiguating the referenced words by analyzing each of the definition elements of the referenced words with the definition elements of the other referenced words, the analysis operable to determine a particular definition for each of the referenced words in the context of the series of words, disambiguating including;
performing bitwise operations on at least a subset of the fields in the definition elements with corresponding fields in the other definition elements in the set of words, each of the definition elements indicative of a particular context, the operations for identifying a particular definition element based on the context in the set of words, the definition element includes bit fields of class, method, and category in the high bits, the method field determining a structure of fields in lower order bits, wherein disambiguating is performed on definition elements of equal category and method fields;
means for computing a series of definition elements corresponding to each of the words in the first natural language sentence, the definition elements indicative of each of the words in the context of the received natural language sentence; and
means for determining, from the computed series of definition elements, a second set of language elements in a second natural language, the definition elements selected in a manner that preserves the specific context from the first natural language sentence, each of the definition elements identifying a particular definition of a word in the context defined by proximate words, each definition element having a context specific definition.
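The final "means for determining" step of claim 26 treats the disambiguated definition elements as a language-independent pivot. A minimal sketch, assuming disambiguation has already produced one element per word: the `LEXICON` table, its element values, and the `render` function are hypothetical, and the sketch does not attempt word order or grammar in the second language.

```python
# Hypothetical pivot table: definition element -> surface word per language.
LEXICON = {
    0x1003: {"en": "bank", "es": "banco"},    # financial institution
    0x200C: {"en": "bank", "es": "orilla"},   # river edge
    0x2004: {"en": "river", "es": "río"},     # stream of water
}


def render(elements: list[int], language: str) -> list[str]:
    """Map each disambiguated definition element to a word in `language`."""
    return [LEXICON[element][language] for element in elements]
```

Because the two senses of English "bank" are distinct elements, `render([0x2004, 0x200C], "es")` produces "orilla" rather than "banco", which is the context-preserving behaviour the claim describes.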
Specification