Synonym generation using online decompounding and transitivity
First Claim
Patent Images
1. A system comprising:
- one or more processors; and
a computer-readable storage device storing instructions that, when executed by the one or more processors, cause the one or more processors to perform operations comprising;
receiving a query submitted by a user to a search engine, wherein the query includes a first compound term; and
in response to receiving the query, performing the following operations;
generating one or more splits of the first compound term, wherein each split divides the compound term into two or more subterms, wherein at least one subterm is a term in a dictionary that associates terms with scores derived from a respective frequency of use of the subterm;
assigning a score to one or more subterms of each split that are in the dictionary, wherein the score for a subterm is the score stored in the dictionary for the subterm;
determining an overall score for each split from the scores for the subterms of the split;
selecting a first split from the one or more splits according to the overall score for each split; and
augmenting the query with a first synonym phrase, wherein the first synonym phrase is a synonym of a first subterm of the first split.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for query synonym expansion. One method includes receiving a query including a first compound term, and in response to receiving the query, performing the following operations: generating one or more splits of the first compound term, assigning a score to each subterm of each split, determining an overall score for each split from the scores for the subterms of the split, selecting a first split from the one or more splits according to the overall score for each split, and augmenting the query with a first synonym phrase that is a synonym of a first subterm of the first split.
-
Citations
36 Claims
-
1. A system comprising:
-
one or more processors; and a computer-readable storage device storing instructions that, when executed by the one or more processors, cause the one or more processors to perform operations comprising; receiving a query submitted by a user to a search engine, wherein the query includes a first compound term; and in response to receiving the query, performing the following operations; generating one or more splits of the first compound term, wherein each split divides the compound term into two or more subterms, wherein at least one subterm is a term in a dictionary that associates terms with scores derived from a respective frequency of use of the subterm; assigning a score to one or more subterms of each split that are in the dictionary, wherein the score for a subterm is the score stored in the dictionary for the subterm; determining an overall score for each split from the scores for the subterms of the split; selecting a first split from the one or more splits according to the overall score for each split; and augmenting the query with a first synonym phrase, wherein the first synonym phrase is a synonym of a first subterm of the first split. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A non-transitory computer-readable storage device storing instructions that, when executed by data processing apparatus, cause the data processing apparatus to perform operations comprising:
-
receiving a query submitted by a user to a search engine, wherein the query includes a first compound term; and in response to receiving the query, performing the following operations; generating one or more splits of the first compound term, wherein each split divides the compound term into two or more subterms, wherein at least one subterm is a term in a dictionary that associates terms with scores derived from a respective frequency of use of the subterm; assigning a score to one or more subterms of each split that are in the dictionary, wherein the score for a subterm is the score stored in the dictionary for the subterm; determining an overall score for each split from the scores for the subterms of the split; selecting a first split from the one or more splits according to the overall score for each split; and augmenting the query with a first synonym phrase, wherein the first synonym phrase is a synonym of a first subterm of the first split. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22, 33, 34)
-
-
23. A computer-implemented method, comprising:
-
receiving a query submitted by a user to a search engine, wherein the query includes a first compound term; and in response to receiving the query, performing the following operations; generating one or more splits of the first compound term, wherein each split divides the compound term into two or more subterms, wherein at least one subterm is a term in a dictionary that associates terms with scores derived from a respective frequency of use of the subterm; assigning a score to one or more subterms of each split that are in the dictionary, wherein the score for a subterm is the score stored in the dictionary for the subterm; determining an overall score for each split from the scores for the subterms of the split; selecting a first split from the one or more splits according to the overall score for each split; and augmenting the query with a first synonym phrase, wherein the first synonym phrase is a synonym of a first subterm of the first split. - View Dependent Claims (24, 25, 26, 27, 28, 29, 30, 31, 32, 35, 36)
-
Specification