Generating a query statement based on unstructured input
First Claim
1. A computer-implemented method comprising:
- identifying words from a natural language request to search a dataset;
associating each of the words with an identifier that identifies a part of speech type of the corresponding word;
generating groupings of identifiers of the words, each of the groupings comprising a respective set of the identifiers associated with a set of the words, and each of the groupings corresponding to a particular part of speech type, the identifiers comprising a noun tag;
providing the words and the groupings to a machine learning classifier to generate a structured query to the dataset, the machine learning classifier configured to;
select a structure of the structured query for a noun grouping based on the groupings by at least predicting that a word associated with the noun tag in the noun grouping belongs to a select clause or a where clause of the structure, andgenerate the structured query by using, in the structure, the set of words associated with the set of the identifiers of the noun grouping, wherein using the set of words comprises adding the word to the select clause or the where clause; and
querying the dataset based on the structured query provided from the machine learning classifier.
2 Assignments
0 Petitions
Accused Products
Abstract
Techniques for generating a query statement to query a dataset may be provided. For example, the query statement can be generated from natural language input, such as a natural language utterance. To do so, the input can be analyzed to detect a sentence, identify words in the sentence, and tag the words with the corresponding word types (e.g., nouns, verbs, adjectives, etc.). Expressions using the tags can be generated. Data about the expressions can be inputted to a classifier. Based on a detected pattern associated with the expressions, the classifier can predict a structure of the query statement, such as what expressions correspond to what clauses of the query statement. Based on this prediction, words associated with the expressions can be added to the clauses to generate the query statement and accordingly query the dataset.
-
Citations
20 Claims
-
1. A computer-implemented method comprising:
-
identifying words from a natural language request to search a dataset; associating each of the words with an identifier that identifies a part of speech type of the corresponding word; generating groupings of identifiers of the words, each of the groupings comprising a respective set of the identifiers associated with a set of the words, and each of the groupings corresponding to a particular part of speech type, the identifiers comprising a noun tag; providing the words and the groupings to a machine learning classifier to generate a structured query to the dataset, the machine learning classifier configured to; select a structure of the structured query for a noun grouping based on the groupings by at least predicting that a word associated with the noun tag in the noun grouping belongs to a select clause or a where clause of the structure, and generate the structured query by using, in the structure, the set of words associated with the set of the identifiers of the noun grouping, wherein using the set of words comprises adding the word to the select clause or the where clause; and querying the dataset based on the structured query provided from the machine learning classifier. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system comprising:
-
a processor; and a memory communicatively coupled to the processor and bearing instructions that, upon execution by the processor, cause the system to at least perform operations comprising; receiving natural language text comprising words; generating expressions from the natural language text based on the words, each of the expressions comprising at least an identifier of a part of speech type corresponding to a word from the words, and each of the expressions corresponding to a particular part of speech type, the expression comprising a noun expression that includes a noun tag identifying a noun as one of the words; and predicting, by using a machine learning classifier, at least a clause of a query statement for the noun expression based on the expressions, and wherein the machine learning classifier is configured to; select a structure of the query statement by at least predicting that a word associated with the noun tag in the noun expression belongs to a select clause or a where clause of the structure, and generate the query statement based on adding the word to the select clause or the where clause. - View Dependent Claims (12, 13, 14, 15, 16, 17)
-
-
18. A non-transitory computer-readable storage medium storing instructions that, when executed on a computing device, configure the computing device to perform operations comprising:
-
receiving natural language text; classifying one or more words from the natural language text as a noun; associating the one or more words with a noun tag based on the classifying; generating an expression comprising the noun tag; and predicting, by using a machine learning classifier, a clause of a query statement for the expression based on a pattern associated with the expression and other expressions, the other expressions generated based on tags associated with classified words or terms from the natural language text, wherein the machine learning classifier is configured to; select a structure of the query statement by at least predicting that the one or more words associated with the noun tag in the expression belongs to a select clause or a where clause of the structure, and generate the query statement by adding the one or more words associated with the noun tag of the expression to the select clause or the where clause. - View Dependent Claims (19, 20)
-
Specification