Techniques for web site integration
First Claim
Patent Images
1. A method, comprising:
- providing, by operation of a computer system, a first document from a web site to a user, wherein the first document includes a plurality of terms;
automatically, by operation of the computer system, generating a search query from terms included in the first document, wherein the search query includes multiple terms from the first document, and wherein generating the search query comprises;
determining a respective first ratio for each of the multiple terms in the search query from a number of occurrences of the term in the first document and a total number of term occurrences in the first document,determining a respective second ratio for each of the multiple terms in the search query from a number of occurrences of the term in the web site and a total number of term occurrences in the web site,computing a respective weight for each of the multiple terms from the first ratio for the term and the second ratio for the term, andassigning the respective weight for each of the multiple terms in the search query to the term;
using the search query to determine a respective score for each of a plurality of documents in the web site, wherein the respective score for each document is based upon occurrences in the document of terms in the search query and on the respective weights assigned to the terms in the search query; and
identifying a set of documents from the plurality of documents in the web site based on the respective scores.
5 Assignments
0 Petitions
Accused Products
Abstract
Disclosed is a method and device for finding documents, such as Web pages, for presentation to a user, automatically or in response to a user expression of interest, which documents are part of a Web site being accessed by the user, and which documents relate to a document, such as a Web page, being accessed in the Web site. The method takes advantage of information retrieval techniques. The method generates the search query to use to find documents by reference to the text of the document in the Web site being accessed by the user. The method further uses a weighting function to weigh the terms used in the search query.
178 Citations
19 Claims
-
1. A method, comprising:
-
providing, by operation of a computer system, a first document from a web site to a user, wherein the first document includes a plurality of terms; automatically, by operation of the computer system, generating a search query from terms included in the first document, wherein the search query includes multiple terms from the first document, and wherein generating the search query comprises; determining a respective first ratio for each of the multiple terms in the search query from a number of occurrences of the term in the first document and a total number of term occurrences in the first document, determining a respective second ratio for each of the multiple terms in the search query from a number of occurrences of the term in the web site and a total number of term occurrences in the web site, computing a respective weight for each of the multiple terms from the first ratio for the term and the second ratio for the term, and assigning the respective weight for each of the multiple terms in the search query to the term; using the search query to determine a respective score for each of a plurality of documents in the web site, wherein the respective score for each document is based upon occurrences in the document of terms in the search query and on the respective weights assigned to the terms in the search query; and identifying a set of documents from the plurality of documents in the web site based on the respective scores. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system, comprising:
a computer system and non-transitory media containing a computer program, the computer program programming the computer system to perform operations comprising; providing, by operation of a computer system, a first document from a web site to a user, wherein the first document includes a plurality of terms; automatically, by operation of the computer system, generating a search query from terms included in the first document, wherein the search query includes multiple terms from the first document, and wherein generating the search query comprises; determining a respective first ratio for each of the multiple terms in the search query from a number of occurrences of the term in the first document and a total number of term occurrences in the first document, determining a respective second ratio for each of the multiple terms in the search query from a number of occurrences of the term in the web site and a total number of term occurrences in the web site, computing a respective weight for each of the multiple terms from the first ratio for the term and the second ratio for the term, and assigning the respective weight for each of the multiple terms in the search query to the term; using the search query to determine a respective score for each of a plurality of documents in the web site, wherein the respective score for each document is based upon occurrences in the document of terms in the search query and on the respective weights assigned to the terms in the search query; and identifying a set of documents from the plurality of documents in the web site based on the respective scores. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19)
Specification