Search queries improved based on query semantic information
First Claim
1. A computer-implemented method comprising:
- receiving, by a search system, a search query;
generating, by the search system, a potential substitute term that is related to a query term of the search query;
identifying, by the search system, an original set of documents that are responsive to the search query;
weighting each potential substitute term that appears in a document in the original set based on a prevalence of the potential substitute term in the original set of documents;
producing a pruned set of terms whose weight satisfies a condition;
determining that the potential substitute term is a member of the pruned set of terms; and
in response to determining that the potential substitute term is a member of the pruned set of terms, modifying, by the search system, the search query to include the potential substitute term.
3 Assignments
0 Petitions
Accused Products
Abstract
A search query for a search engine may be improved by incorporating alternate terms into the search query that are semantically similar to terms of the search query, taking into account information derived from the search query. An initial set of alternate terms that may be semantically similar to the original terms in the search query is generated. The initial set of alternate terms may be compared to information derived from the original search query. One example of such information is a set of documents retrieved in response to a search performed using the initial search query. One or more of the alternate terms may be added to the original search query based on their relationship to the information derived from the original search query.
77 Citations
21 Claims
-
1. A computer-implemented method comprising:
-
receiving, by a search system, a search query; generating, by the search system, a potential substitute term that is related to a query term of the search query; identifying, by the search system, an original set of documents that are responsive to the search query; weighting each potential substitute term that appears in a document in the original set based on a prevalence of the potential substitute term in the original set of documents; producing a pruned set of terms whose weight satisfies a condition; determining that the potential substitute term is a member of the pruned set of terms; and in response to determining that the potential substitute term is a member of the pruned set of terms, modifying, by the search system, the search query to include the potential substitute term. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system comprising:
-
one or more computers and one or more storage devices storing instructions that are configured, when executed by the one or more computers, to cause the one or more computers to perform operations comprising; receiving a search query; generating a potential substitute term that is related to a query term of the search query; identifying an original set of documents that are responsive to the search query; weighting each potential substitute term that appears in a document in the original set based on a prevalence of the potential substitute term in the original set of documents; producing a pruned set of terms whose weight satisfies a condition; determining that the potential substitute term is a member of the pruned set of terms; and in response to determining that the potential substitute term is a member of the pruned set of terms, modifying the search query to include the potential substitute term. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A non-transitory computer-readable storage device storing software comprising instructions executable by one or more computers which, upon such execution, cause the one or more computers to perform operations comprising:
-
receiving a search query; generating a potential substitute term that is related to a query term of the search query; identifying an original set of documents that are responsive to the search query; weighting each potential substitute term that appears in a document in the original set based on a prevalence of the potential substitute term in the original set of documents; producing a pruned set of terms whose weight satisfies a condition; determining that the potential substitute term is a member of the pruned set of terms; and in response to determining that the potential substitute term is a member of the pruned set of terms, modifying the search query to include the potential substitute term. - View Dependent Claims (16, 17, 18, 19, 20, 21)
-
Specification