Query Language Identification
2 Assignments
0 Petitions
Accused Products
Abstract
Methods, systems, and apparatus, including computer program products, for identifying the language of a search query. In one embodiment, the language of each term of a query is determined from the query terms and the language of the user interface a user used to enter the query. In another embodiment, an automatic interface language classifier is generated from a collection of past queries each submitted by a user. In some embodiments, a score is determined for each of multiple languages, each score indicating a likelihood that the query language is the corresponding one of the multiple languages.
102 Citations
39 Claims
-
1-27. -27. (canceled)
-
28. A computer implemented method, comprising:
-
receiving, in a system comprising one or more computers, through a user interface, a search query comprising one or more query terms, the user interface having an interface language, where the interface language is a natural language in which the user interface presents information; and determining, by the system, a query language of the query from the query, the interface language, and a plurality of query records stored by the system, the query language being a natural language; where determining the query language of the query further comprises; for each of multiple languages, calculating a first score for each query term and the respective language, each first score indicating the likelihood that the respective query term is in the respective language, where the first score is calculated based on a plurality of documents, each document having an associated natural language, calculating a second score for the respective language, the second score indicating the likelihood that the query is in the respective language given the user interface through which the query was received, where the second score is calculated based on the plurality of query records, and calculating a third score for the respective language, the third score being a combination of the first scores for the respective language and the second score for the respective language; and selecting the query language based on the third scores for the multiple languages. - View Dependent Claims (29, 30, 31)
-
-
32. A computer program product, encoded on a machine-readable storage device, operable to cause data processing apparatus to perform operations comprising:
-
receiving, in a system comprising one or more computers, through a user interface, a search query comprising one or more query terms, the user interface having an interface language, where the interface language is a natural language in which the user interface presents information; and determining, by the system, a query language of the query from the query, the interface language, and a plurality of query records stored by the system, the query language being a natural language; where determining the query language of the query further comprises; for each of multiple languages, calculating a first score for each query term and the respective language, each first score indicating the likelihood that the respective query term is in the respective language, where the first score is calculated based on a plurality of documents, each document having an associated natural language, calculating a second score for the respective language, the second score indicating the likelihood that the query is in the respective language given the user interface through which the query was received, where the second score is calculated based on the plurality of query records, and calculating a third score for the respective language, the third score being a combination of the first scores for the respective language and the second score for the respective language; and selecting the query language based on the third scores for the multiple languages. - View Dependent Claims (33, 34, 35)
-
-
36. A system comprising:
-
one or more computers configured to perform operations comprising; receiving, in a system comprising one or more computers, through a user interface, a search query comprising one or more query terms, the user interface having an interface language, where the interface language is a natural language in which the user interface presents information; and determining, by the system, a query language of the query from the query, the interface language, and a plurality of query records stored by the system, the query language being a natural language; where determining the query language of the query further comprises; for each of multiple languages, calculating a first score for each query term and the respective language, each first score indicating the likelihood that the respective query term is in the respective language, where the first score is calculated based on a plurality of documents, each document having an associated natural language, calculating a second score for the respective language, the second score indicating the likelihood that the query is in the respective language given the user interface through which the query was received, where the second score is calculated based on the plurality of query records, and calculating a third score for the respective language, the third score being a combination of the first scores for the respective language and the second score for the respective language; and selecting the query language based on the third scores for the multiple languages. - View Dependent Claims (37, 38, 39)
-
Specification