Identifying common co-occurring elements in lists
First Claim
Patent Images
1. A computer-implemented method comprising:
- receiving a pair of terms that includes a term and a candidate synonym of the term;
determining that the terms of the pair have been identified as being potentially non-synonymous;
responsive to determining that the terms of the pair have been identified as being potentially non-synonymous, applying a penalty to the pair of terms, wherein application of the penalty comprises;
determining a first quantity, corresponding to a number of times that users of a search engine have submitted queries that include the term,determining a second quantity, corresponding to a number of times that the users have switched from a query that includes the term, to an alternative query that includes the candidate synonym and that does not include the term, andadjusting, based on a relationship between the first quantity and the second quantity, a value that is used to determine whether the terms of the pair are synonyms; and
determining, by one or more computers, whether the terms of the pair are synonyms using the adjusted value.
2 Assignments
0 Petitions
Accused Products
Abstract
One embodiment of the present invention provides a system for detecting correlations between terms. During operation, the system identifies one or more lists contained in one or more documents and identifies two terms co-occurring in the lists. The system further determines a correlation between the co-occurring terms, and places the co-occurring terms in a correlated-pair list based on the correlation.
50 Citations
24 Claims
-
1. A computer-implemented method comprising:
-
receiving a pair of terms that includes a term and a candidate synonym of the term; determining that the terms of the pair have been identified as being potentially non-synonymous; responsive to determining that the terms of the pair have been identified as being potentially non-synonymous, applying a penalty to the pair of terms, wherein application of the penalty comprises; determining a first quantity, corresponding to a number of times that users of a search engine have submitted queries that include the term, determining a second quantity, corresponding to a number of times that the users have switched from a query that includes the term, to an alternative query that includes the candidate synonym and that does not include the term, and adjusting, based on a relationship between the first quantity and the second quantity, a value that is used to determine whether the terms of the pair are synonyms; and determining, by one or more computers, whether the terms of the pair are synonyms using the adjusted value. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A non-transitory computer storage medium encoded with a computer program, the program comprising instructions that when executed by one or more computers cause the one or more computers to perform operations comprising:
-
receiving a pair of terms that includes a term and a candidate synonym of the term; determining that the terms of the pair have been identified as being potentially non-synonymous; responsive to determining that the terms of the pair have been identified as being potentially non-synonymous, applying a penalty to the pair of terms, wherein application of the penalty comprises; determining a first quantity, corresponding to a number of times that users of a search engine have submitted queries that include the term, determining a second quantity, corresponding to a number of times that the users have switched from a query that includes the term, to an alternative query that includes the candidate synonym and that does not include the term, and adjusting, based on a relationship between the first quantity and the second quantity, a value that is used to determine whether the terms of the pair are synonyms; and determining, by one or more computers, whether the terms of the pair are synonyms using the adjusted value. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A system comprising:
-
one or more computers; and a non-transitory computer-readable storage medium coupled to the one or more computers having instructions stored thereon which, when executed by the one or more computers, cause the one or more computers to perform operations comprising; receiving a pair of terms that includes a term and a candidate synonym of the term; determining that the terms of the pair have been identified as being potentially non-synonymous; responsive to determining that the terms of the pair have been identified as being potentially non-synonymous, applying a penalty to the pair of terms, wherein application of the penalty comprises; determining a first quantity, corresponding to a number of times that users of a search engine have submitted queries that include the term, determining a second quantity, corresponding to a number of times that the users have switched from a query that includes the term, to an alternative query that includes the candidate synonym and that does not include the term, and adjusting, based on a relationship between the first quantity and the second quantity, a value that is used to determine whether the terms of the pair are synonyms; and determining, by one or more computers, whether the terms of the pair are synonyms using the adjusted value. - View Dependent Claims (18, 19, 20, 21, 22, 23, 24)
-
Specification