DATA DISAMBIGUATION SYSTEMS AND METHODS
Abstract
Various embodiments provide a state-based, regular expression parser in which data, such as generally unstructured text, is received into the system and undergoes a tokenization process which permits structure to be imparted to the data. Tokenization of the data effectively enables various patterns in the data to be identified. In some embodiments, one or more components can utilize stimulus/response paradigms to recognize and react to patterns in the data.
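As a rough illustration of the tokenization step described in the abstract, the sketch below uses regular expressions to impose a simple structure on unstructured text. The token categories (NUMBER, WORD, PUNCT) and the function name `tokenize` are assumptions made for this example; the abstract does not enumerate token types.

```python
import re

# Hypothetical token categories; the source does not list them.
TOKEN_SPEC = [
    ("NUMBER", r"\d+(?:\.\d+)?"),
    ("WORD", r"[A-Za-z']+"),
    ("PUNCT", r"[^\sA-Za-z0-9]"),
]
# One alternation of named groups; the group that matches names the token kind.
TOKEN_RE = re.compile("|".join(f"(?P<{name}>{pattern})" for name, pattern in TOKEN_SPEC))


def tokenize(text):
    """Return a list of (kind, value) pairs for each token found in text."""
    return [(m.lastgroup, m.group()) for m in TOKEN_RE.finditer(text)]


if __name__ == "__main__":
    print(tokenize("What is 2 + 2?"))
```

Once text is reduced to typed tokens like these, later stages can look for patterns over token kinds rather than raw characters.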
123 Claims
1-103. (canceled)
104. A computer-based system for determining a response to an input text string, the system comprising:
a server that receives the input text string via a computer data network, wherein the server executes software instructions stored on a computer readable medium, wherein the server is programmed to:
  tokenize the input text string by parsing the input text string to define one or more recognizable patterns in the input text string;
  comparing the one or more recognizable patterns to a plurality of cases of text to determine whether the one or more recognizable patterns match one or more of the plurality of cases, wherein each of the plurality of cases define a response to be taken in the event of a case match, wherein the cases of text are stored in a knowledge base and defined using a hierarchical tag-based markup language;
  when a recognizable pattern matches only one case, perform the response for the case; and
  when a recognizable pattern matches two or more cases:
    score the two or more cases to determine the case with the highest probability match based on a scoring function; and
    perform the response for the case with the highest probability match.
Dependent claims: 105, 106, 107, 108, 109, 110, 111
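The match-then-score flow recited in claim 104 can be sketched as follows. This is a minimal illustration under stated assumptions, not the claimed implementation: the `Case` class, the `respond` function, and the scoring function (the fraction of the input covered by the match) are all hypothetical, and the sketch compares the raw input directly against each case rather than tokenizing it first.

```python
import re
from dataclasses import dataclass


@dataclass
class Case:
    name: str
    pattern: str   # regular expression the input is compared against
    response: str  # stand-in for the action performed on a match


def score(case, text):
    """Assumed scoring function: fraction of the input covered by the match."""
    m = re.search(case.pattern, text, re.IGNORECASE)
    return len(m.group()) / len(text) if m and text else 0.0


def respond(text, cases):
    """Return the response of the best-matching case, or None if none match."""
    matches = [c for c in cases if re.search(c.pattern, text, re.IGNORECASE)]
    if not matches:
        return None
    if len(matches) == 1:
        # Exactly one case matched: perform its response directly.
        return matches[0].response
    # Two or more cases matched: pick the highest-scoring one.
    return max(matches, key=lambda c: score(c, text)).response


if __name__ == "__main__":
    cases = [
        Case("greeting", r"\bhello\b", "Hi there!"),
        Case("greeting_bot", r"\bhello,? bot\b", "Hello, human!"),
        Case("farewell", r"\bbye\b", "Goodbye!"),
    ]
    print(respond("hello bot", cases))
```

With the sample cases, "hello bot" matches both greeting cases, and the longer, more specific match wins under the assumed scoring function.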
112. A computer-based system for determining a response to an input text string, the system comprising:
a server that receives the input text string via a computer data network, wherein the server executes software instructions stored on a computer readable medium, wherein the server comprises:
  one or more knowledge bases that store a plurality of cases of text, each case defining a response to be taken in the event of a case match, where the cases are defined using a hierarchical tag-based markup language; and
  a functional presence engine that:
    tokenizes the input text string by parsing the input text string to define one or more recognizable patterns in the input text string;
    compares the one or more recognizable patterns to the plurality of cases of text in the one or more knowledge bases to determine whether the one or more recognizable patterns match one or more of the plurality of cases;
    when a recognizable pattern matches only one case, performs the response for the case; and
    when a recognizable pattern matches two or more cases:
      scores the two or more cases to determine the case with the highest probability match based on a scoring function; and
      performs the response for the case with the highest probability match.
Dependent claims: 113, 114, 115, 116, 117, 118, 119
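Claim 112 recites a knowledge base whose cases are defined in a hierarchical tag-based markup language. The sketch below assumes XML as that language and loads cases with Python's standard `xml.etree.ElementTree`. The element and attribute names (`knowledgebase`, `topic`, `case`, `pattern`, `response`) and the `load_cases` helper are hypothetical; the claim does not specify a schema.

```python
import xml.etree.ElementTree as ET

# Hypothetical knowledge base; element and attribute names are illustrative only.
KNOWLEDGE_BASE = """
<knowledgebase>
  <topic name="greetings">
    <case id="hello">
      <pattern>hello</pattern>
      <response>Hi there!</response>
    </case>
    <case id="bye">
      <pattern>goodbye</pattern>
      <response>See you later.</response>
    </case>
  </topic>
</knowledgebase>
"""


def load_cases(xml_text):
    """Parse the markup into a list of dicts, one per <case> element."""
    root = ET.fromstring(xml_text)
    cases = []
    for topic in root.iter("topic"):
        for case in topic.iter("case"):
            cases.append({
                "topic": topic.get("name"),
                "id": case.get("id"),
                "pattern": case.findtext("pattern"),
                "response": case.findtext("response"),
            })
    return cases


if __name__ == "__main__":
    for entry in load_cases(KNOWLEDGE_BASE):
        print(entry)
```

Nesting cases under topics is one way a hierarchy could group related cases; each loaded entry pairs a pattern with the response to perform on a match.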
120. A computer-implemented method comprising:
receiving a text string by a computer system;
tokenizing the text string by the computer system, wherein the tokenizing comprises parsing the input text string to define one or more recognizable patterns in the input text string;
comparing by the computer system the one or more recognizable patterns to a plurality of cases of text;
determining by the computer system whether the one or more recognizable patterns matches one or more of the plurality of cases, wherein each of the plurality of cases define a response to be taken in the event of a case match, wherein the cases of text are stored in a knowledge base and defined using a hierarchical tag-based markup language;
when a recognizable pattern matches only one case, performing the response for the case;
when a recognizable pattern matches two or more cases, scoring by the computer system the two or more cases to determine the case with the highest probability match based on a scoring function, and performing the response for the case with the highest probability match.
Dependent claims: 121, 122, 123
Specification