SYSTEM AND METHOD FOR RECOGNIZING STRUCTURE IN TEXT
First Claim
1. A method for processing information embedded in a text file with a grammar programming language, including:
receiving a text file, the text file including a plurality of input values;
parsing each of the plurality of input values according to a set of rules;
compiling a script so as to produce a plurality of candidate textual shapes, each of the plurality of candidate textual shapes corresponding to a potential interpretation of the plurality of input values; and
providing an output, the output including at least one of;
a processed value, the processed value corresponding to a particular textual shape, the particular textual shape selected from the plurality of candidate textual shapes;
or a textual representation of the text file, the textual representation including a plurality of generic data structures that facilitate providing any of the plurality of candidate textual shapes, the generic data structures being a function of the set of rules.
Abstract
A method, system, and computer product for processing information embedded in a text file with a grammar programming language are provided. A text file is parsed according to a set of rules, and candidate textual shapes corresponding to potential interpretations of the text file are produced by compiling a script. An output is provided, which may include either a processed value corresponding to a particular textual shape or a textual representation of the text file. The textual representation includes generic data structures that facilitate providing any of the candidate textual shapes, where the generic data structures are a function of the set of rules.
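The flow summarized in the abstract can be illustrated with a minimal Python sketch. It is not the patented grammar programming language: the rule script syntax (`name = regex`), the rule names `us_date`, `eu_date`, and `integer`, and the helpers `compile_script`, `parse_value`, and `candidates_for` are all hypothetical, chosen only to show how one input value can yield several candidate textual shapes that are kept in a generic structure.

```python
import re
from dataclasses import dataclass


@dataclass
class Candidate:
    shape: str    # name of the rule that matched (one candidate "textual shape")
    fields: dict  # generic key/value structure extracted by that rule


# Hypothetical rule script: one "name = regex" rule per line.
SCRIPT = r"""
us_date = (?P<month>\d{1,2})/(?P<day>\d{1,2})/(?P<year>\d{4})
eu_date = (?P<day>\d{1,2})/(?P<month>\d{1,2})/(?P<year>\d{4})
integer = (?P<value>\d+)
"""


def compile_script(script):
    """Turn the rule script into a list of (name, compiled regex) pairs."""
    rules = []
    for line in script.strip().splitlines():
        name, pattern = line.split("=", 1)
        rules.append((name.strip(), re.compile(pattern.strip())))
    return rules


def plausible(fields):
    """Reject date readings whose month or day is out of range."""
    month, day = fields.get("month"), fields.get("day")
    if month is not None and not 1 <= int(month) <= 12:
        return False
    if day is not None and not 1 <= int(day) <= 31:
        return False
    return True


def parse_value(value, rules):
    """Return every candidate shape that fits one input value."""
    found = []
    for name, rx in rules:
        m = rx.fullmatch(value)
        if m and plausible(m.groupdict()):
            found.append(Candidate(name, m.groupdict()))
    return found


def candidates_for(text, rules):
    """Map each whitespace-separated input value to its candidate shapes."""
    return {value: parse_value(value, rules) for value in text.split()}


rules = compile_script(SCRIPT)
result = candidates_for("01/02/2003 13/02/2003 42", rules)
for value, shapes in result.items():
    print(value, [c.shape for c in shapes])
```

In this sketch `01/02/2003` keeps two candidate shapes, while `13/02/2003` keeps only the day-first reading because 13 is not a valid month. The `result` dictionary plays the role of the generic textual representation; picking one `Candidate` from it corresponds to providing a processed value.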
50 Citations
23 Claims
1. A method for processing information embedded in a text file with a grammar programming language, including:
receiving a text file, the text file including a plurality of input values;
parsing each of the plurality of input values according to a set of rules;
compiling a script so as to produce a plurality of candidate textual shapes, each of the plurality of candidate textual shapes corresponding to a potential interpretation of the plurality of input values; and
providing an output, the output including at least one of;
a processed value, the processed value corresponding to a particular textual shape, the particular textual shape selected from the plurality of candidate textual shapes;
or a textual representation of the text file, the textual representation including a plurality of generic data structures that facilitate providing any of the plurality of candidate textual shapes, the generic data structures being a function of the set of rules.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11
12. A computer-readable storage medium comprising instructions for facilitating processing information embedded in a text file with a grammar programming language, including:
a first module, the first module including instructions for receiving the text file as an input, the text file including a plurality of input values;
a second module, the second module including instructions for providing a library, the library including a plurality of constructs for interpreting a textual shape of the text file;
a third module, the third module including instructions for providing a script editor, the script editor configured to facilitate generating a script of the grammar programming language, the script including at least one of the plurality of constructs;
a fourth module, the fourth module including instructions for compiling the script as a function of the text file, the compiling instructions facilitating generating a plurality of candidate textual shapes, each of the plurality of candidate textual shapes corresponding to a potential interpretation of the plurality of input values; and
a fifth module, the fifth module including instructions for providing an output, the output including at least one of;
a processed value, the processed value corresponding to a particular textual shape, the particular textual shape selected from the plurality of candidate textual shapes;
or a textual representation of the text file, the textual representation including a plurality of generic data structures that facilitate providing any of the plurality of candidate textual shapes, the generic data structures being a function of the script.
Dependent claims: 13, 14, 15, 16, 17, 18, 19, 20, 21, 22
23. A system executed by one or more processors for facilitating processing information embedded in a text file with a grammar programming language, including:
means for receiving a text file, the text file including a plurality of input values;
means for parsing each of the plurality of input values according to a set of rules;
means for identifying at least one syntactical ambiguity;
means for identifying at least one token ambiguity;
means for prioritizing a plurality of candidate textual shapes, the plurality of candidate textual shapes including at least one candidate resolution to the at least one syntactical ambiguity;
means for resolving the at least one token ambiguity;
means for compiling a script so as to produce the plurality of candidate textual shapes, each of the plurality of candidate textual shapes corresponding to a potential interpretation of the plurality of input values; and
means for providing an output, the output including at least one of;
a processed value, the processed value corresponding to a particular textual shape, the particular textual shape selected from the plurality of candidate textual shapes;
or a textual representation of the text file, the textual representation including a plurality of generic data structures that facilitate providing any of the plurality of candidate textual shapes, the generic data structures being a function of the set of rules.
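Claim 23 distinguishes token ambiguity from syntactical ambiguity and adds a prioritizing step. The Python sketch below is one possible reading, not the claimed implementation. It assumes a toy ambiguous grammar `E -> E '-' E | NUMBER`, a hypothetical ordered list `TOKEN_RULES` in which earlier rules win, and a hypothetical ranking that prefers left-associative parses.

```python
import re

# Hypothetical token rules, listed from highest to lowest priority.
TOKEN_RULES = [
    ("NUMBER", re.compile(r"\d+")),
    ("MINUS", re.compile(r"-")),
    ("WORD", re.compile(r"\w+")),
]


def token_kinds(tok):
    """All token kinds that fit; more than one means a token ambiguity."""
    return [kind for kind, rx in TOKEN_RULES if rx.fullmatch(tok)]


def resolve_token(tok):
    """Resolve a token ambiguity by taking the highest-priority kind."""
    kinds = token_kinds(tok)
    return kinds[0] if kinds else None


def parse_trees(tokens):
    """Enumerate every parse of E -> E '-' E | NUMBER over the tokens."""
    if len(tokens) == 1:
        is_number = resolve_token(tokens[0]) == "NUMBER"
        return [int(tokens[0])] if is_number else []
    trees = []
    for i in range(1, len(tokens) - 1, 2):  # each '-' may be the root
        if tokens[i] != "-":
            continue
        for left in parse_trees(tokens[:i]):
            for right in parse_trees(tokens[i + 1:]):
                trees.append((left, "-", right))
    return trees


def right_depth(tree):
    """Count nesting along the right spine; lower means more left-associative."""
    return 0 if isinstance(tree, int) else 1 + right_depth(tree[2])


def prioritize(trees):
    """Order candidate shapes so the preferred interpretation comes first."""
    return sorted(trees, key=right_depth)


def evaluate(tree):
    """Compute the processed value for one selected shape."""
    if isinstance(tree, int):
        return tree
    return evaluate(tree[0]) - evaluate(tree[2])


trees = prioritize(parse_trees("1 - 2 - 3".split()))
for tree in trees:
    print(tree, "=", evaluate(tree))
```

Here the token `42` would fit both `NUMBER` and `WORD`, which is the token ambiguity, and `1 - 2 - 3` has two parse trees, which is the syntactical ambiguity. The two trees evaluate to different values, so the ordering chosen in `prioritize` determines which processed value is output.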
Specification