Discriminating between natural language and keyword language items
First Claim
1. A computer system comprising:
- one or more processing devices; and
one or more storage resources storing instructions which, when executed by the one or more processing devices, cause the one or more processing devices to;
receive an input linguistic item;
use a classification model to produce a label which identifies the input linguistic item as;
a natural language linguistic item, which expresses an intent using a natural language, ora keyword language linguistic item, which expresses the intent using one or more keywords; and
perform an action on the input linguistic item based at least on the label,the classification model being trained using training sets, individual training sets comprising an individual natural language linguistic item and an individual keyword language linguistic item that is identified as being related to the individual natural language linguistic item based at least on user behavior as reflected in a query click log.
2 Assignments
0 Petitions
Accused Products
Abstract
This disclosure pertains to a classification model, and to functionality for producing and applying the classification model. The classification model is configured to discriminate whether an input linguistic item (such as a query) corresponding to either a natural language (NL) linguistic item or a keyword language (KL) linguistic item. An NL linguistic item expresses an intent using a natural language, while a KL linguistic item expresses the intent using one or more keywords. In a training phase, the functionality produces the classification model based on query click log data or the like. In an application phase, the functionality may, among other uses, use the classification model to filter a subset of NL linguistic items from a larger set of items, and then use the subset of NL linguistic items to train a natural language interpretation model, such as a spoken language understanding model.
51 Citations
20 Claims
-
1. A computer system comprising:
-
one or more processing devices; and one or more storage resources storing instructions which, when executed by the one or more processing devices, cause the one or more processing devices to; receive an input linguistic item; use a classification model to produce a label which identifies the input linguistic item as; a natural language linguistic item, which expresses an intent using a natural language, or a keyword language linguistic item, which expresses the intent using one or more keywords; and perform an action on the input linguistic item based at least on the label, the classification model being trained using training sets, individual training sets comprising an individual natural language linguistic item and an individual keyword language linguistic item that is identified as being related to the individual natural language linguistic item based at least on user behavior as reflected in a query click log. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A method performed by one or more processing devices, the method comprising:
-
identifying training sets comprising natural language linguistic items and related keyword language linguistic items, the training sets being identified based at least on similarity of user behavior by users that submit the natural language linguistic items and the related keyword language linguistic items as queries to a search engine; training a classification model to distinguish between natural language input and keyword language input, the classification model being trained using the training sets; receiving an input linguistic item; and using the trained classification model to produce a label which identifies the input linguistic item as; an input natural language linguistic item, which expresses an intent using natural language;
oran input query language linguistic item, which expresses the intent using one or more keywords.
-
-
12. A method implemented using one or more computing devices, the method comprising:
-
producing and storing a set of pair items based at least on relative similarity of the pair items as determined using a query click log, each pair item in the set of pair items including; (a) a natural language (NL) linguistic item, which expresses an intent using natural language; and (b) a keyword language (KL) linguistic item, which expresses the intent using one or more keywords, and is related to the NL linguistic item; and using a machine learning technique to generate and store an NL-KL classification model based, in part, on the set of pair items, the NL-KL classification model being configured to identify whether an input linguistic item corresponds to either an NL linguistic item or a KL linguistic item. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20)
-
Specification