Natural language query processing
Abstract
An enhanced natural language information retrieval technique tokenizes an incoming query and compares the tokenized representation against a collection of query templates. Each query template includes a first portion having one or more query patterns that represent a query in a form suitable for matching against the tokenized representation of an incoming query. Each query template also includes one or more information retrieval commands designed to return information relevant to the query patterns in its first portion. The technique selects the query templates determined to be most relevant to the incoming query (via its tokenized representation) and initiates one or more information retrieval commands associated with the selected query templates.
60 Claims
1. A natural language information retrieval method, comprising:
receiving a query;
tokenizing the query;
selecting a query template based on a match between the tokenized query and one or more sequence patterns associated with a first portion of the query template;
initiating an information retrieval command associated with a second portion of the selected query template;
receiving results based on the initiated information retrieval command; and
returning at least a portion of the received results.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23
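The steps of claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the `QueryTemplate` class, the in-order subsequence matching rule, the `search_weather` command, and the toy `DOCS` data are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class QueryTemplate:
    # First portion: sequence patterns (hypothetical representation as token tuples).
    patterns: list[tuple[str, ...]]
    # Second portion: the information retrieval command (hypothetical callable).
    command: Callable[[list[str]], list[str]]


def tokenize(query: str) -> list[str]:
    # Assumption: lowercase, whitespace-split tokens with edge punctuation stripped.
    words = (w.strip("?.,!").lower() for w in query.split())
    return [w for w in words if w]


def matches(pattern: tuple[str, ...], tokens: list[str]) -> bool:
    # True when the pattern's tokens appear in order (gaps allowed) in the query.
    remaining = iter(tokens)
    return all(p in remaining for p in pattern)


def select_template(tokens: list[str],
                    templates: list[QueryTemplate]) -> Optional[QueryTemplate]:
    # Pick the first template with any matching sequence pattern.
    for template in templates:
        if any(matches(p, tokens) for p in template.patterns):
            return template
    return None


def answer(query: str, templates: list[QueryTemplate], limit: int = 3) -> list[str]:
    tokens = tokenize(query)                       # tokenize the received query
    template = select_template(tokens, templates)  # select a query template
    if template is None:
        return []
    results = template.command(tokens)             # initiate command, receive results
    return results[:limit]                         # return a portion of the results


# Toy data standing in for a real retrieval back end.
DOCS = {"weather": ["Forecast for today", "Weekly outlook", "Radar map", "Climate archive"]}


def search_weather(tokens: list[str]) -> list[str]:
    return DOCS["weather"]


TEMPLATES = [
    QueryTemplate(patterns=[("what", "weather"), ("weather", "forecast")],
                  command=search_weather),
]

print(answer("What is the weather today?", TEMPLATES))
```

Here the template is chosen by the first pattern that matches; a fuller system would presumably rank competing templates by relevance, as the abstract suggests.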
24. A natural language method to tokenize a query, comprising:
identifying semantic units in a query;
associating a token with each uniquely identified semantic unit;
identifying stems for one or more of the tokens;
identifying lexical equivalents for one or more of the tokens;
representing the query as an ordered combination of the identified stems and tokens.
Dependent claims: 25, 26, 27, 28, 29, 30, 31
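A rough Python sketch of the tokenization steps in claim 24 follows. The `EQUIVALENTS` synonym table, the naive suffix-stripping stemmer, and the choice to treat each word as a semantic unit are illustrative assumptions, not details taken from the patent.

```python
import re

# Hypothetical lexical-equivalence table mapping stems to a canonical form.
EQUIVALENTS = {"automobile": "car", "purchase": "buy"}
# Naive suffix stripping, for illustration only; a real system would use a proper stemmer.
SUFFIXES = ("ing", "s")


def semantic_units(query: str) -> list[str]:
    # Assumption: each lowercase alphanumeric word is one semantic unit.
    return re.findall(r"[a-z0-9]+", query.lower())


def stem(word: str) -> str:
    for suffix in SUFFIXES:
        # Keep at least three characters so short words are left alone.
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def tokenize(query: str) -> list[tuple[int, str]]:
    token_ids: dict[str, int] = {}   # one token id per unique semantic unit
    ordered = []
    for unit in semantic_units(query):
        token_id = token_ids.setdefault(unit, len(token_ids))
        stemmed = stem(unit)
        canonical = EQUIVALENTS.get(stemmed, stemmed)  # apply lexical equivalents
        ordered.append((token_id, canonical))
    return ordered                   # ordered combination of tokens and stems


print(tokenize("Who buys automobiles and who purchases cars"))
```

Repeated units such as "who" share one token id, while "automobiles" and "cars" keep distinct ids but reduce to the same canonical form.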
33. A natural language query template accessible by a program being executed on a programmable control device, comprising:
a first portion having a sequence pattern representing a query, wherein the sequence pattern includes an ordered sequence of one or more required elements and zero or more optional elements; and
a second portion having a command sequence for generating a response to the query.
Dependent claims: 32, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51
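One way to picture the two-part template of claim 33 is the Python sketch below. The `Element` and `QueryTemplate` classes, the whole-query matching rule, and the command string are hypothetical; the patent does not prescribe this data layout.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Element:
    token: str
    required: bool = True   # optional elements may be absent from the query


@dataclass
class QueryTemplate:
    pattern: list[Element]  # first portion: ordered required/optional elements
    commands: list[str]     # second portion: command sequence (opaque strings here)


def pattern_matches(pattern: list[Element], tokens: list[str]) -> bool:
    """Return True if tokens match the whole pattern in order, skipping optional elements as needed."""
    if not pattern:
        return not tokens
    head, rest = pattern[0], pattern[1:]
    # Try consuming the next token with this element.
    if tokens and tokens[0] == head.token and pattern_matches(rest, tokens[1:]):
        return True
    # Otherwise the element may be skipped only if it is optional.
    return (not head.required) and pattern_matches(rest, tokens)


price_template = QueryTemplate(
    pattern=[
        Element("what"),
        Element("is"),
        Element("the", required=False),
        Element("price"),
    ],
    commands=["SEARCH catalog FIELD price"],  # hypothetical command syntax
)

print(pattern_matches(price_template.pattern, ["what", "is", "the", "price"]))
print(pattern_matches(price_template.pattern, ["what", "is", "price"]))
print(pattern_matches(price_template.pattern, ["what", "the", "price"]))
```

The recursive matcher backtracks, so an optional element followed by a required element with the same token is still handled correctly.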
52. A method to index data, comprising:
identifying semantic units in each of a plurality of data units;
associating an identified semantic unit with a location in each of the data units in which the identified semantic unit is found;
associating tokens with each uniquely identified semantic unit; and
indexing the tokens.
Dependent claims: 53, 54, 55, 56, 57, 58, 59, 60
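The indexing steps of claim 52 resemble a positional inverted index, which might be sketched in Python as follows. The `build_index` function, the word-level notion of a semantic unit, and the sample documents are assumptions for illustration only.

```python
import re
from collections import defaultdict


def build_index(data_units: dict[str, str]):
    token_ids: dict[str, int] = {}   # unique semantic unit -> token id
    index = defaultdict(list)        # token id -> [(data unit id, position), ...]
    for unit_id, text in data_units.items():
        # Assumption: each lowercase alphanumeric word is one semantic unit.
        words = re.findall(r"[a-z0-9]+", text.lower())
        for position, word in enumerate(words):
            token = token_ids.setdefault(word, len(token_ids))
            index[token].append((unit_id, position))  # record where the unit was found
    return token_ids, dict(index)


documents = {"d1": "red car", "d2": "car wash"}
token_ids, index = build_index(documents)
print(token_ids)
print(index[token_ids["car"]])
```

Looking up the token for "car" returns every data unit and word position where it occurs, which is the information a template's retrieval command would need.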
Specification