Method and system for reducing lexical ambiguity
First Claim
Patent Images
1. A method for reducing lexical ambiguity in an input stream, comprising:
- breaking the input stream into at least two tokens;
compiling lexical grammar rules to generate lexical functions;
creating a plurality of segments from the at least two tokens based upon lexical information and the lexical functions, defining a plurality of paths using the plurality of segments;
assigning a cost to each of the plurality of paths;
defining at least one best path based upon a corresponding cost to generate an output graph; and
providing the output graph to a syntactic analysis module for subsequent syntactic analysis of the input stream to reduce lexical ambiguity.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and system for reducing lexical ambiguity in an input stream are described. In one embodiment, the input stream is broken into tokens. The tokens are used to create a connection graph comprising a number of paths. Each of the paths is assigned a cost. At least one best path is defined based upon a corresponding cost to generate an output graph. The generated output graph is provided to reduce lexical ambiguity.
133 Citations
27 Claims
-
1. A method for reducing lexical ambiguity in an input stream, comprising:
-
breaking the input stream into at least two tokens;
compiling lexical grammar rules to generate lexical functions;
creating a plurality of segments from the at least two tokens based upon lexical information and the lexical functions, defining a plurality of paths using the plurality of segments;
assigning a cost to each of the plurality of paths;
defining at least one best path based upon a corresponding cost to generate an output graph; and
providing the output graph to a syntactic analysis module for subsequent syntactic analysis of the input stream to reduce lexical ambiguity. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. An apparatus for reducing lexical ambiguity in an input stream, comprising:
-
means for breaking the input stream into at least two tokens;
means for compiling lexical grammar rules to generate lexical functions;
means for creating a plurality of segments from the at least two tokens based upon lexical information and the lexical functions, means for defining a plurality of paths using the plurality of segments;
means for assigning a cost to each of the plurality of paths;
means for defining at least one best path based upon a corresponding cost to generate an output graph; and
means for providing the output graph to a syntactic analysis module for subsequent syntactic analysis of the input stream to reduce lexical ambiguity. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. An apparatus for reducing lexical ambiguity in an input stream, comprising:
-
a tokenizer to break the input stream into at least two tokens;
a grammar programming language (GPL) compiler to compile lexical grammar rules to generate lexical functions;
a segmentation engine to create a plurality of segments from the at least two tokens based upon lexical information and the lexical functions;
a path designator to define a plurality of paths using the plurality of segments;
a cost assignor to assign a cost to each of the plurality of paths;
a path calculator to define at least one best path based upon a corresponding cost to generate an output graph; and
a graph provider to provide the output graph to a syntactic analysis module for subsequent syntactic analysis of the input stream to reduce lexical ambiguity. - View Dependent Claims (16, 17, 18, 19, 20, 21)
-
-
22. A system for reducing lexical ambiguity, comprising:
-
a processor;
an input coupled to the processor, the input capable of receiving an input stream, the processor configured to break the input stream into at least two tokens, compile lexical grammar rules to generate lexical functions, create a plurality of segments from the at least two tokens based upon lexical information and the lexical functions, define a plurality of paths using the plurality of segments, assign a cost to each of the plurality of paths, and define at least one best path based upon a corresponding cost to generate an output graph; and
an output coupled to the processor, the output capable of providing the output graph to a syntactic analysis module for subsequent syntactic analysis of the input stream to reduce lexical ambiguity.
-
-
23. A computer readable medium comprising instructions, which when executed on a processor, perform method for reducing lexical ambiguity in an input stream, comprising:
-
breaking an input stream into at least two tokens;
compiling lexical grammar rules to generate lexical functions;
creating a plurality of segments from the at least two tokens based upon lexical information and the lexical functions;
defining a plurality of paths using the plurality of segments;
assigning a cost to each of the plurality of paths;
defining at least one best path based upon a corresponding cost to generate an output graph; and
providing the output graph to a syntactic analysis module for subsequent syntactic analysis of the input stream to reduce lexical ambiguity. - View Dependent Claims (24, 25, 26, 27)
-
Specification