Search engine system and associated content analysis methods for locating web pages with product offerings
First Claim
Patent Images
1. A computer-implemented method of analyzing web page content, the method comprising:
- retrieving a web page located by a crawler program;
programmatically analyzing content of the web page to evaluate whether the web page includes a product offering; and
generating, based at least in part on the programmatic analysis of the web page, a score that reflects a likelihood that the web page includes a product offering.
1 Assignment
0 Petitions
Accused Products
Abstract
A search engine system assists users in locating web pages from which user-specified products can be purchased. Web pages located by a crawler program are scored, based on a set of criteria, according to likelihood of including a product offering. A query server accesses an index of the scored web pages to locate pages that are both responsive to a user'"'"'s search query and likely to include a product offering. In one embodiment, the responsive web pages are listed on a composite search results page together with products that satisfy the query.
224 Citations
53 Claims
-
1. A computer-implemented method of analyzing web page content, the method comprising:
-
retrieving a web page located by a crawler program; programmatically analyzing content of the web page to evaluate whether the web page includes a product offering; and generating, based at least in part on the programmatic analysis of the web page, a score that reflects a likelihood that the web page includes a product offering. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 49, 50, 51)
-
-
18. A computer-implemented method of analyzing and indexing web pages, the method comprising:
-
retrieving a web page located by a crawler program; programmatically analyzing content of at least the web page to evaluate whether the web page includes a product offering; and if a product offering is detected in the web page as a result of the programmatic analysis, storing a representation of the web page in an index used by a search engine to provide functionality for users to substantially limit their searches to web pages that include product offerings. - View Dependent Claims (19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29)
-
-
30. A computer-implemented method of analyzing web pages, the method comprising:
-
retrieving a plurality of web pages of a web site; programmatically analyzing the plurality of web pages to check for predefined indicia of a product offering within content of the web pages; and generating a score for the web site, said score reflecting a result of the programmatic analysis of the plurality of web pages. - View Dependent Claims (31, 32, 33, 34, 35, 36, 37, 38, 39, 40)
-
-
41. A method of processing search queries, comprising:
-
receiving a search query specified by a user; identifying a set of web pages that both (a) are responsive to the search query, and (b) have respective scores that satisfy a threshold, said scores being based on an automated analysis of web page content and representing likelihoods that the web pages include product offerings; and generating a search results page that is responsive to the search query, said search results page including a listing of at least some of the web pages in the set. - View Dependent Claims (42, 43, 44, 45, 46, 47, 48)
-
-
52. A product-oriented search engine system, comprising:
-
a first data repository that contains product data of known sellers of products; a second data repository that identifies web pages that have been programmatically determined, within a selected level of confidence, to include a product offering; and a query server that is responsive to a search query specified by a user by checking the first data repository for products that are responsive to the search query, and by checking the second data repository for web pages that both are responsive to the search query and have been determined, within said confidence level, to include a product offering. - View Dependent Claims (53)
-
Specification