Data disambiguation systems and methods
First Claim
1. A computer-implemented method comprising:
receiving text with a computer system comprising a computer-readable medium configured with a functional presence engine, the functional presence engine configured as a probabilistic parser;
performing, with the computer system, lexical analysis on the text effective to tokenize text portions to produce tokenized content in a format specified in one or more interpreted lexical files specifying one or more matching rules and corresponding output symbols; and
with a computer system configured with a knowledge base component operably associated with the functional presence engine, defining:
cases of text matchable to text received by the functional presence engine; and
responses that are triggered in an event of a match,
wherein individual lexical files comprise a macro section that specifies macro values that are substitutable for macro names, and a lex section that specifies lexical rewrite rules, and wherein the lex section comprises a main section that contains rules that are executed at a top level of a tokenization process, and a sub-section associated with a rule in the main section, the sub-section containing a group of rules that get executed only if the associated main section rule produces the best match.
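As an illustration only, the lexical-file structure recited in the claim (a macro section, a main section, and a sub-section tied to one main rule) might be sketched as follows. The names `Rule`, `expand_macros`, `best_match`, and `tokenize`, the `{NAME}` macro syntax, and the sample rules are all hypothetical. The sketch also assumes that "best match" means the longest match at the current position, which the claim does not state; a probabilistic parser could score candidates differently.

```python
import re
from dataclasses import dataclass, field


@dataclass
class Rule:
    pattern: str   # regular expression; may reference macros as {NAME} (assumed syntax)
    symbol: str    # output symbol emitted when the rule matches
    sub_rules: list = field(default_factory=list)  # sub-section tied to this rule


def expand_macros(pattern, macros):
    """Substitute each macro name written as {NAME} with its macro value."""
    for name, value in macros.items():
        pattern = pattern.replace("{" + name + "}", value)
    return pattern


def best_match(rules, macros, text, pos):
    """Return (rule, match) for the longest non-empty match at pos, or None.

    Longest-match is an assumption standing in for "best match".
    Ties go to the rule listed first.
    """
    best = None
    for rule in rules:
        m = re.compile(expand_macros(rule.pattern, macros)).match(text, pos)
        if m and m.end() > pos and (best is None or m.end() > best[1].end()):
            best = (rule, m)
    return best


def tokenize(text, macros, main_rules):
    """Run main-section rules at the top level; run a rule's sub-section
    only when that main rule produced the best match."""
    tokens = []
    pos = 0
    while pos < len(text):
        hit = best_match(main_rules, macros, text, pos)
        if hit is None:
            pos += 1  # skip characters no rule recognizes
            continue
        rule, m = hit
        tokens.append((rule.symbol, m.group()))
        # Sub-section rules see only the text matched by the winning main rule.
        inner, ipos = m.group(), 0
        while ipos < len(inner):
            sub = best_match(rule.sub_rules, macros, inner, ipos)
            if sub is None:
                ipos += 1
                continue
            tokens.append((sub[0].symbol, sub[1].group()))
            ipos = sub[1].end()
        pos = m.end()
    return tokens


# Hypothetical macro section and lex section.
MACROS = {"DIGIT": r"[0-9]", "WORD": r"[A-Za-z]+"}
MAIN_RULES = [
    Rule(r"{DIGIT}+:{DIGIT}+", "TIME", sub_rules=[Rule(r"{DIGIT}+", "NUMBER")]),
    Rule(r"{DIGIT}+", "NUMBER"),
    Rule(r"{WORD}", "WORD"),
]

print(tokenize("meet at 10:30", MACROS, MAIN_RULES))
```

In this sketch, both `TIME` and the top-level `NUMBER` rule match at `10:30`, but `TIME` wins as the longer match, so only its sub-section runs and breaks the time into its two numeric parts.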
Abstract
Various embodiments provide a state-based, regular expression parser in which data, such as generally unstructured text, is received into the system and undergoes a tokenization process which permits structure to be imparted to the data. Tokenization of the data effectively enables various patterns in the data to be identified. In some embodiments, one or more components can utilize stimulus/response paradigms to recognize and react to patterns in the data.
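The stimulus/response paradigm mentioned in the abstract can be pictured with a minimal sketch, assuming a knowledge base that stores cases as regular expressions and responses as callables. The `KnowledgeBase` class, its methods, and the first-matching-case-wins policy are hypothetical and are not taken from the patent.

```python
import re


class KnowledgeBase:
    """Hypothetical store of cases (stimuli) and the responses they trigger."""

    def __init__(self):
        self._cases = []  # list of (compiled pattern, response callable)

    def add_case(self, pattern, response):
        """Register a case of matchable text and the response it triggers."""
        self._cases.append((re.compile(pattern, re.IGNORECASE), response))

    def react(self, text):
        """Return the response of the first case that matches, or None.

        First-match-wins is an assumption made for this sketch.
        """
        for pattern, response in self._cases:
            m = pattern.search(text)
            if m:
                return response(m)
        return None


kb = KnowledgeBase()
kb.add_case(r"\bweather in (\w+)", lambda m: "lookup_weather:" + m.group(1))
kb.add_case(r"\bhello\b", lambda m: "greet")

print(kb.react("What is the weather in Paris?"))
print(kb.react("Hello there"))
```

Here each registered case plays the role of a stimulus pattern, and the callable stands in for whatever response an embodiment would trigger on a match.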
12 Claims
7. A computer readable medium having instructions stored thereon which when executed by a processor cause the processor to:
receive text with a computer system configured with a functional presence engine, the functional presence engine configured as a probabilistic parser;
perform lexical analysis on the text effective to tokenize text portions to produce tokenized content in a format specified in one or more interpreted lexical files specifying one or more matching rules and corresponding output symbols; and
with a knowledge base component operably associated with the functional presence engine, define:
cases of text matchable to text received by the functional presence engine; and
responses that are triggered in an event of a match,
wherein individual lexical files comprise a macro section that specifies macro values that are substitutable for macro names, and a lex section that specifies lexical rewrite rules, and wherein the lex section comprises a main section that contains rules that are executed at a top level of a tokenization process, and a sub-section associated with a rule in the main section, the sub-section containing a group of rules that get executed only if the associated main section rule produces the best match.
Specification