Method and apparatus for automatically identifying compunds
First Claim
Patent Images
1. A computer-implemented method comprising:
- generating a candidate compound from a search query, the candidate compound comprising two or more consecutive terms in the search query;
determining a number of times search results responsive to the search query were previously selected by a user;
determining a number of times search results responsive to the search query and that contain the candidate compound were previously selected by the user; and
determining a likelihood that the candidate compound is a compound using a value produced by dividing the determined number of times search results responsive to the search query and that contain the candidate compound were previously selected by the user by the determined number of times search results responsive to the search query were previously selected by the user.
2 Assignments
0 Petitions
Accused Products
Abstract
One embodiment of the present invention provides a system that automatically identifies compounds, such as bigrams or n-grams. During operation, the system obtains selections of search results which were selected by one or more users, wherein the search results were previously generated by a search engine in response to queries containing search terms. Next, the system forms a set of candidate compounds from the queries, wherein each candidate compound comprises n consecutive terms from a query. Then, for each candidate compound in the set, the system analyzes the selections of search results to calculate a likelihood that the candidate compound is a compound.
44 Citations
19 Claims
-
1. A computer-implemented method comprising:
-
generating a candidate compound from a search query, the candidate compound comprising two or more consecutive terms in the search query; determining a number of times search results responsive to the search query were previously selected by a user; determining a number of times search results responsive to the search query and that contain the candidate compound were previously selected by the user; and determining a likelihood that the candidate compound is a compound using a value produced by dividing the determined number of times search results responsive to the search query and that contain the candidate compound were previously selected by the user by the determined number of times search results responsive to the search query were previously selected by the user. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. A system comprising:
one or more computers operable to perform operations comprising; generating a candidate compound from a search query, the candidate compound comprising two or more consecutive terms in the search query; determining a number of times search results responsive to the search query were previously selected by a user; determining a number of times search results responsive to the search query and that contain the candidate compound were previously selected by the user; and determining a likelihood that the candidate compound is a compound using a value produced by dividing the determined number of times search results responsive to the search query and that contain the candidate compound were previously selected by the user by the determined number of times search results responsive to the search query were previously selected by the user.
-
19. A computer-readable storage medium storing instructions that, when executed by one or more computers, cause the one or more computers to perform operations comprising:
-
generating a candidate compound from a search query, the candidate compound comprising two or more consecutive terms in the search query; determining a number of times search results responsive to the search query were previously selected by a user; determining a number of times search results responsive to the search query and that contain the candidate compound were previously selected by the user; and determining a likelihood that the candidate compound is a compound using a value produced by dividing the determined number of times search results responsive to the search query and that contain the candidate compound were previously selected by the user by the number of times search results responsive to the search query were previously selected by the user.
-
Specification