GENERATING A QUERY STATEMENT BASED ON UNSTRUCTURED INPUT
First Claim
1. A computer-implemented method comprising:
- identifying words from a natural language request to search a dataset;
associating each of the words with an identifier that identifies a part of speech type of the corresponding word;
generating groupings of identifiers, each of the groupings comprising one or more of the identifiers;
providing the words and the groupings to a machine learning classifier to generate a structured query to the dataset, the machine learning classifier configured to select a structure of the structured query based on the groupings and to use the words and the structure to generate the structured query; and
querying the dataset based on the structured query provided from the machine learning classifier.
2 Assignments
0 Petitions
Accused Products
Abstract
Techniques for generating a query statement to query a dataset may be provided. For example, the query statement can be generated from natural language input, such as a natural language utterance. To do so, the input can be analyzed to detect a sentence, identify words in the sentence, and tag the words with the corresponding word types (e.g., nouns, verbs, adjectives, etc.). Expressions using the tags can be generated. Data about the expressions can be inputted to a classifier. Based on a detected pattern associated with the expressions, the classifier can predict a structure of the query statement, such as what expressions correspond to what clauses of the query statement. Based on this prediction, words associated with the expressions can be added to the clauses to generate the query statement and accordingly query the dataset.
-
Citations
20 Claims
-
1. A computer-implemented method comprising:
-
identifying words from a natural language request to search a dataset; associating each of the words with an identifier that identifies a part of speech type of the corresponding word; generating groupings of identifiers, each of the groupings comprising one or more of the identifiers; providing the words and the groupings to a machine learning classifier to generate a structured query to the dataset, the machine learning classifier configured to select a structure of the structured query based on the groupings and to use the words and the structure to generate the structured query; and querying the dataset based on the structured query provided from the machine learning classifier. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system comprising:
-
a processor; a memory communicatively coupled to the processor and bearing instructions that, upon execution by the processor, cause the system to at least perform operations comprising; receiving natural language text comprising words; generating expressions from the natural language text based on the words, each of the expressions comprising at least an identifier of a part of speech type corresponding to a word from the words; predicting at least a clause of a query statement based on the expressions, the clause associated with the identifier; and generating the query statement based on adding the corresponding word to the clause. - View Dependent Claims (12, 13, 14, 15, 16, 17)
-
-
18. A computer-readable storage medium storing instructions that, when executed on a computing device, configure the computing device to perform operations comprising:
-
receiving natural language text; classifying one or more words from the natural language text as a noun; associating the one or more words with a noun tag based on the classifying; generating an expression comprising the noun tag; predicting a clause of a query statement based on a pattern associated with the expression and other expressions, the other expressions generated based on tags associated with classified words or terms from the natural language text; and generating the query statement by adding the one or more words to the clause. - View Dependent Claims (19, 20)
-
Specification