Determining related terms based on link annotations of documents belonging to search result sets
First Claim
1. A computer-implemented method comprising:
receiving first query terms from a user;
automatically selecting, based at least in part on the first query terms, and from among a set of documents, first documents;
automatically generating second query terms based at least in part on anchor text that is contained within links that occur within the first documents;
automatically selecting, based at least in part on the second query terms, and from among the set of documents, second documents;
wherein the first documents differ from the second documents;
automatically generating third query terms based at least in part on anchor text that is contained within links that occur within the second documents, and not based on any input from said user;
and automatically selecting, based at least in part on the third query terms, and from among the set of documents, third documents;
wherein the third documents differ from the first documents and the second documents;
determining a first number that is a number of documents in an intersection of (a) a set consisting of the first documents and (b) a set consisting of the second documents;
determining a second number that is the first number divided by a number of documents in the set consisting of the second documents;
determining whether the second number is at least as great as a specified percentage;
and in response to determining that the second number is not at least as great as the specified percentage, automatically generating the third query terms based at least in part on anchor text that is contained within links that occur within the second documents;
wherein the step of generating the third query terms is performed by one or more computing devices;
and wherein said first and second documents are not displayed to said user.
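The overlap test in the closing steps of the claim (the "first number", the "second number", and the comparison against a specified percentage) can be sketched as follows. This is a minimal illustration, not the patented implementation: the function names, the use of document identifiers as set members, the treatment of an empty second set, and the default threshold of 0.8 are all assumptions made for the example.

```python
def overlap_ratio(first_docs, second_docs):
    """Return the share of the second result set that also appears in the first.

    "first number"  = size of the intersection of the two sets
    "second number" = first number / size of the second set
    """
    first, second = set(first_docs), set(second_docs)
    if not second:
        # Assumption: the claim does not address an empty second set;
        # this sketch treats it as zero overlap.
        return 0.0
    first_number = len(first & second)
    return first_number / len(second)


def should_refine_again(first_docs, second_docs, threshold=0.8):
    """True when the second number is NOT at least as great as the threshold,
    which is the condition under which third query terms are generated.

    The threshold is expressed as a fraction; 0.8 is an arbitrary example value.
    """
    return overlap_ratio(first_docs, second_docs) < threshold
```

For instance, with first documents `a, b, c` and second documents `b, c, d, e`, the intersection holds two documents and the second set holds four, so the ratio is 0.5. Against a threshold of 0.8 the sketch reports that another round of term generation is warranted.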
Abstract
Techniques for automatically focusing searches conducted by a search engine are provided. According to one aspect, revised query terms are automatically generated based on text in links that are in incoming (and/or outgoing) link lists associated with documents that are referenced in initial search results generated based on initial query terms. For example, some of the phrases that appear in incoming (and/or outgoing) links associated with a result document may be selected. The selected phrases may be added to the initial query terms to generate revised query terms. These revised query terms may be submitted automatically to the search engine in order to produce a more focused list of revised search results. This process may be performed repeatedly, each iteration revising query terms generated by the previous iteration, until specified criteria are satisfied, at which point the final revised search results may be presented to a user.
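The iterative loop described in the abstract can be sketched roughly as below. Everything here is hypothetical scaffolding: the in-memory `CORPUS`, the substring-matching `search` function, the choice to add the single most frequent anchor phrase per round, and the stopping rules (no new phrase, an empty result set, or a round limit) are assumptions chosen to keep the example small. The abstract only says that iteration continues until specified criteria are satisfied.

```python
from collections import Counter

# Hypothetical toy corpus: doc id -> (body text, anchor texts of links
# associated with the document). Not taken from the patent.
CORPUS = {
    "d1": ("jaguar big cat of the americas", ["big cat", "rainforest"]),
    "d2": ("jaguar car dealership", ["luxury car"]),
    "d3": ("jaguar big cat rainforest habitat", ["big cat", "rainforest"]),
    "d4": ("jaguar big cat conservation", ["big cat"]),
}


def search(terms):
    """Toy stand-in for a search engine: ids of docs whose body contains every term."""
    return [doc_id for doc_id, (body, _) in CORPUS.items()
            if all(term in body for term in terms)]


def revise(terms, results, per_round=1):
    """Add the most frequent anchor phrases from the result docs to the query terms."""
    counts = Counter(phrase for doc_id in results for phrase in CORPUS[doc_id][1])
    new = [phrase for phrase, _ in counts.most_common() if phrase not in terms]
    return terms + new[:per_round]


def focused_search(initial_terms, max_rounds=3):
    """Repeat search -> revise -> search, each round building on the previous one."""
    terms = list(initial_terms)
    results = search(terms)
    for _ in range(max_rounds):
        revised = revise(terms, results)
        if revised == terms:
            break  # assumed criterion: no new anchor phrase to add
        revised_results = search(revised)
        if not revised_results:
            break  # assumed criterion: keep the last non-empty result list
        terms, results = revised, revised_results
    return terms, results
```

Starting from the single term `jaguar`, this toy run first adds `big cat` (dropping the car dealership page), then adds `rainforest`, and ends with `d3` as the only remaining result. Only that final list would be shown to the user; the intermediate result sets are used solely to generate revised terms.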
20 Claims
1. A computer-implemented method comprising:
receiving first query terms from a user;
automatically selecting, based at least in part on the first query terms, and from among a set of documents, first documents;
automatically generating second query terms based at least in part on anchor text that is contained within links that occur within the first documents;
automatically selecting, based at least in part on the second query terms, and from among the set of documents, second documents;
wherein the first documents differ from the second documents;
automatically generating third query terms based at least in part on anchor text that is contained within links that occur within the second documents, and not based on any input from said user;
and automatically selecting, based at least in part on the third query terms, and from among the set of documents, third documents;
wherein the third documents differ from the first documents and the second documents;
determining a first number that is a number of documents in an intersection of (a) a set consisting of the first documents and (b) a set consisting of the second documents;
determining a second number that is the first number divided by a number of documents in the set consisting of the second documents;
determining whether the second number is at least as great as a specified percentage;
and in response to determining that the second number is not at least as great as the specified percentage, automatically generating the third query terms based at least in part on anchor text that is contained within links that occur within the second documents;
wherein the step of generating the third query terms is performed by one or more computing devices;
and wherein said first and second documents are not displayed to said user.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. A volatile or non-volatile computer-readable storage medium storing instructions which, when executed by one or more processors, cause the one or more processors to perform steps comprising:
receiving first query terms from a user;
automatically selecting, based at least in part on the first query terms, and from among a set of documents, first documents;
automatically generating second query terms based at least in part on anchor text that is contained within links that occur within the first documents;
automatically selecting, based at least in part on the second query terms, and from among the set of documents, second documents;
wherein the first documents differ from the second documents;
automatically generating third query terms based at least in part on anchor text that is contained within links that occur within the second documents, and not based on any input from said user;
and automatically selecting, based at least in part on the third query terms, and from among the set of documents, third documents;
wherein the third documents differ from the first documents and the second documents;
determining a first number that is a number of documents in an intersection of (a) a set consisting of the first documents and (b) a set consisting of the second documents;
determining a second number that is the first number divided by a number of documents in the set consisting of the second documents;
determining whether the second number is at least as great as a specified percentage;
and in response to determining that the second number is not at least as great as the specified percentage, automatically generating the third query terms based at least in part on anchor text that is contained within links that occur within the second documents;
wherein said first and second documents are not displayed to said user.
Dependent claims: 12, 13, 14, 15, 16, 17, 18, 19, 20
Specification