UTILIZATION OF FEATURES EXTRACTED FROM STRUCTURED DOCUMENTS TO IMPROVE SEARCH RELEVANCE
First Claim
1. A method that facilitates ranking documents with respect to a received query, the method comprising:
- at a general purpose search engine, receiving the query from a user that is configured to retrieve at least one document that is indexed by the general purpose search engine; and
causing a processor to output a ranked list of documents to the user based at least in part upon the query, the ranked list of documents comprising a semi-structured web page, a position of the semi-structured web page in the ranked list of documents based at least in part upon a value of a feature that is extracted from the semi-structured document at a learned location in the semi-structured document that is known to include the feature, wherein the position of the semi-structured document in the ranked list of documents is independent of any correlation between text of the query and the value of the feature.
2 Assignments
0 Petitions
Accused Products
Abstract
Features automatically extracted from semi-structured web pages are utilized by a search engine to rank documents that include semi-structured web pages. These features include, but are not limited to, a number of reviews, a number of positive reviews, and/or a number of negative reviews from a web page that includes user reviews. These features also include a number of views of a video that is viewable by way of a semi-structured web page. The features also include a number of subscribers to broadcasts of an individual from a social networking web page and a number of contacts of an individual listed on a social networking web page.
28 Citations
20 Claims
-
1. A method that facilitates ranking documents with respect to a received query, the method comprising:
-
at a general purpose search engine, receiving the query from a user that is configured to retrieve at least one document that is indexed by the general purpose search engine; and causing a processor to output a ranked list of documents to the user based at least in part upon the query, the ranked list of documents comprising a semi-structured web page, a position of the semi-structured web page in the ranked list of documents based at least in part upon a value of a feature that is extracted from the semi-structured document at a learned location in the semi-structured document that is known to include the feature, wherein the position of the semi-structured document in the ranked list of documents is independent of any correlation between text of the query and the value of the feature. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system that facilitates outputting a ranked list of search results responsive to receipt of a query, the system comprising:
-
a receiver component that receives the query from a user; and a ranker component that outputs a ranked list of documents responsive to receipt of the query, the ranked list of documents comprising a semi-structured web page at a position amongst the ranked list of documents, the position amongst the ranked list of documents based at least in part upon a value of a feature that is at a learned location in the semi-structured web page, the position of the semi-structured web page amongst the ranked list of documents being independent of any correlation between the query and the value of the feature. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A computer-readable medium comprising instructions that, when executed by a processor, causes the processor to perform acts comprising:
-
receiving a query from a user; extracting a value of a feature from a semi-structured web page independent of content of the query, the feature being one of; a number of reviews posted on the semi-structured web page by purchasers of a product that is displayed on the web page; a number of positive reviews posted on the semi-structured web page by purchasers of the product that is displayed on the web page; a number of negative reviews posted on the semi-structured web page by purchasers of the product that is displayed on the web page; a number of views of a video that is embedded on the web page; a number of contacts of an entity whose profile is included on the web page; a number of subscribers of an entity that broadcasts messages that are displayed on the web page; responsive to receiving the query, providing to the user a ranked list of search results, the ranked list or search results comprising a plurality of documents displayed in a particular order, the plurality of documents comprising the semi-structured web page that is at a certain position in the particular order, wherein the certain position in the particular order is based at least in part upon the value of the feature extracted from the semi-structured web page, and wherein the certain position in the particular order is independent of any correlation between the value of the feature and the query.
-
Specification