DOMAIN-AWARE SNIPPETS FOR SEARCH RESULTS
First Claim
1. One or more computer-readable media having computer-usable instructions stored thereon for performing a method of providing a domain-aware snippet for a search result, the method comprising:
- identifying source code for a plurality of web pages of a domain;
identifying one or more tag patterns of one or more sections within the source code of the plurality of web pages, wherein the plurality of web pages share at least one identical tag pattern;
extracting a template of the plurality of web pages based on the identified one or more tag patterns;
associating the template and content of the plurality web pages related to the template with a Uniform Resource Locator pattern of the domain;
storing the association of the template, the related content, and the Uniform Resource Locator pattern in a database.
2 Assignments
0 Petitions
Accused Products
Abstract
Techniques are disclosed for providing a domain-aware snippet for a search result. With such techniques, a domain classification component is provided for identifying a template used to generate a plurality of web pages of a domain, associating the template and content of the web pages related to the template with a Uniform Resource Locator pattern of the plurality of web pages, and storing the associated template, the related content, and the Uniform Resource Locator pattern in a database. A snippet extraction component is also provided for extracting text from a section of a web page of the plurality of web pages for a snippet of a search result corresponding to a search query, wherein the extracted text is based on a ranking value of the section and the relevance of the extracted text to the search query.
73 Citations
20 Claims
-
1. One or more computer-readable media having computer-usable instructions stored thereon for performing a method of providing a domain-aware snippet for a search result, the method comprising:
-
identifying source code for a plurality of web pages of a domain; identifying one or more tag patterns of one or more sections within the source code of the plurality of web pages, wherein the plurality of web pages share at least one identical tag pattern; extracting a template of the plurality of web pages based on the identified one or more tag patterns; associating the template and content of the plurality web pages related to the template with a Uniform Resource Locator pattern of the domain; storing the association of the template, the related content, and the Uniform Resource Locator pattern in a database. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. One or more computer-readable media having computer-usable instructions stored thereon for performing a method of providing a domain-aware snippet for a search result, the method comprising:
-
receiving a search query; obtaining one or more search results for the search query; identifying a Uniform Resource Locator for at least one of the one or more search results; determining the Uniform Resource Locator corresponds to a domain that uses a template to generate a plurality of web pages for the domain; identifying a section of at least one web page, from the plurality of web pages, that is relevant to the search query; and providing a snippet to a user for the at least one search result, wherein the snippet includes at least a portion of text from the identified section. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. A system for providing a domain-aware snippet for a search result, the system comprising:
-
a domain classification component for identifying a template used to generate a plurality of web pages of a domain, associating the template and content of the plurality of web pages related to the template with a Uniform Resource Locator pattern of the plurality of web pages, and storing the associated template, the related content, and the Uniform Resource Locator pattern in a database; and a snippet extraction component for extracting text from a section of at least one web page of the plurality for a snippet of a search result corresponding to a search query, wherein the extracted text is based on a ranking value of the section and a relevance value of the extracted text to the search query. - View Dependent Claims (18, 19, 20)
-
Specification