Online de-compounding of query terms
Abstract
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for query synonym expansion. One method includes receiving a query including a first compound term, and in response to receiving the query, performing the following operations before search results responsive to the query are identified: generating one or more splits of the first compound term, wherein each split divides the compound term into two or more subterms, assigning a score to each subterm of each split, determining an overall score for each split from the scores for the subterms of the split, selecting one or more of the one or more splits according to the overall score for each split, and augmenting the query with the subterms of each selected split.
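The operations listed in the abstract can be sketched in a few lines of Python. This is only an illustrative sketch, not the claimed implementation: the function names, the `min_len` and `max_parts` limits, the geometric-mean combination, and the `threshold` cutoff are all assumptions made for the example, and `subterm_score` stands in for whatever scoring source is used.

```python
from itertools import combinations
from math import prod


def generate_splits(term, min_len=2, max_parts=3):
    """Yield every way to cut `term` into 2..max_parts contiguous subterms.

    min_len and max_parts are hypothetical limits that keep the search small.
    """
    n = len(term)
    for parts in range(2, max_parts + 1):
        for cuts in combinations(range(1, n), parts - 1):
            bounds = (0, *cuts, n)
            subterms = tuple(term[a:b] for a, b in zip(bounds, bounds[1:]))
            if all(len(s) >= min_len for s in subterms):
                yield subterms


def overall_score(split, subterm_score):
    """Combine per-subterm scores into one score for the split.

    Assumption: a geometric mean, so one zero-scored subterm zeroes the split.
    """
    scores = [subterm_score(s) for s in split]
    return prod(scores) ** (1 / len(scores))


def decompound_query(query, subterm_score, threshold=0.5):
    """Return the query terms plus the subterms of every selected split."""
    terms = query.split()
    augmented = list(terms)
    for term in terms:
        for split in generate_splits(term):
            # Assumption: a split is selected when its score clears a threshold.
            if overall_score(split, subterm_score) >= threshold:
                augmented.extend(split)
    return augmented
```

With a toy scoring function that returns 0.9 for "basket", 0.8 for "ball", and 0.0 for everything else, `decompound_query("basketball shoes", ...)` keeps the original terms and appends "basket" and "ball", because no other split has a non-zero overall score.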
47 Claims
1. A system comprising:
one or more processors; and
a computer-readable storage device storing instructions that, when executed by the one or more processors, cause the one or more processors to perform operations comprising:
receiving a query submitted by a user to a search engine, wherein the query includes a first compound term; and
in response to receiving the query, performing the following operations:
generating one or more splits of the first compound term, wherein each split divides the compound term into two or more subterms, wherein at least one subterm is a term in a dictionary that associates terms with scores derived from a respective frequency of use of the subterm;
assigning a score to one or more subterms of each split that are in the dictionary, wherein the score for a subterm is the score stored in the dictionary for the subterm;
determining an overall score for each split from the scores for the subterms of the split;
selecting one or more of the one or more splits according to the overall score for each split; and
augmenting the query with the subterms of each selected split.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16
17. A non-transitory computer-readable storage device storing instructions that, when executed by data processing apparatus, cause the data processing apparatus to perform operations comprising:
receiving a query submitted by a user to a search engine, wherein the query includes a first compound term; and
in response to receiving the query, performing the following operations:
generating one or more splits of the first compound term, wherein each split divides the compound term into two or more subterms, wherein at least one subterm is a term in a dictionary that associates terms with scores derived from a respective frequency of use of the subterm;
assigning a score to one or more subterms of each split that are in the dictionary, wherein the score for a subterm is the score stored in the dictionary for the subterm;
determining an overall score for each split from the scores for the subterms of the split;
selecting one or more of the one or more splits according to the overall score for each split; and
augmenting the query with the subterms of each selected split.
Dependent claims: 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31
32. A computer implemented method, comprising:
receiving a query submitted by a user to a search engine, wherein the query includes a first compound term; and
in response to receiving the query, performing the following operations before search results responsive to the query are identified:
generating one or more splits of the first compound term, wherein each split divides the compound term into two or more subterms, wherein at least one subterm is a term in a dictionary that associates terms with scores derived from a respective frequency of use of the subterm;
assigning a score to one or more subterms of each split that are in the dictionary, wherein the score for a subterm is the score stored in the dictionary for the subterm;
determining an overall score for each split from the scores for the subterms of the split;
selecting one or more of the one or more splits according to the overall score for each split; and
augmenting the query with the subterms of each selected split.
Dependent claims: 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47
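The independent claims add two details that the abstract leaves out: the dictionary stores scores derived from frequency of use, and only subterms found in that dictionary receive a score. The Python sketch below illustrates one way this could look. It is a hedged example, not the patented method: the log-scaled frequency score, the averaging rule that treats unknown subterms as contributing zero, and the names `build_dictionary`, `score_split`, and `select_splits` are all assumptions.

```python
import math
from collections import Counter


def build_dictionary(corpus_terms):
    """Map each term to a score derived from its frequency of use.

    Assumption: score = log(1 + count) / log(1 + highest count),
    which places every score in the range (0, 1].
    """
    counts = Counter(corpus_terms)
    top = max(counts.values())
    return {t: math.log1p(c) / math.log1p(top) for t, c in counts.items()}


def score_split(split, dictionary):
    """Return an overall score, or None if no subterm is in the dictionary."""
    known = [dictionary[s] for s in split if s in dictionary]
    if not known:
        return None  # the split has no dictionary subterm, so it is discarded
    # Assumption: average over all subterms, so unknown subterms add zero.
    return sum(known) / len(split)


def select_splits(splits, dictionary, top_k=1):
    """Keep the top_k highest-scoring splits that have a score."""
    scored = [(score_split(s, dictionary), s) for s in splits]
    scored = [(sc, s) for sc, s in scored if sc is not None]
    return [s for _, s in sorted(scored, reverse=True)[:top_k]]
```

For example, a dictionary built from a corpus in which "ball" occurs three times and "foot" once would rank the split ("foot", "ball") above ("footb", "all"), since the latter contains no dictionary term and is never scored.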
Specification