Use of off-page content to enhance captions with additional relevant information
Abstract
Architecture that uses content from off-page data sources such as feeds (e.g., yellow pages, coupons, social networks, and commerce) to present additional, relevant information in association with search results. The additional information is directly relevant to the implicit task the user is trying to accomplish. The architecture includes online and offline mechanisms that identify an entity represented on a web page and look up information related to that entity in disparate data sources. Relevance heuristics are employed to determine which of the available entity data to show in the caption given the user query, the web page, and the underlying user task (other known information about the user, such as geographic location).
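As an illustration only, the selection step described in the abstract can be sketched in Python. The abstract does not specify any particular heuristic, so everything here is an assumption made for the example: the `EntityFact` record, the `score` and `build_caption` names, the term-overlap scoring, and the 0.5 location bonus are all hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional, Set


@dataclass
class EntityFact:
    """One piece of off-page data about an entity (hypothetical record shape)."""
    source: str                 # assumed feed name, e.g. "coupons"
    kind: str                   # assumed fact type, e.g. "phone", "coupon"
    text: str
    city: Optional[str] = None  # location the fact applies to, if known


def score(fact: EntityFact, query_terms: Set[str], user_city: Optional[str]) -> float:
    """Toy relevance heuristic: query-term overlap plus a small location bonus."""
    words = set(fact.text.lower().split()) | {fact.kind.lower()}
    value = float(len(words & query_terms))
    if fact.city and user_city and fact.city.lower() == user_city.lower():
        value += 0.5  # assumed bonus for matching the user's known location
    return value


def build_caption(snippet: str, facts: List[EntityFact], query: str,
                  user_city: Optional[str] = None, max_facts: int = 2) -> str:
    """Append the highest-scoring off-page facts to the on-page snippet."""
    terms = set(query.lower().split())
    scored = [(score(f, terms, user_city), f) for f in facts]
    relevant = [pair for pair in scored if pair[0] > 0]
    relevant.sort(key=lambda pair: pair[0], reverse=True)
    chosen = [f for _, f in relevant[:max_facts]]
    return " | ".join([snippet] + [f"{f.kind}: {f.text}" for f in chosen])


facts = [
    EntityFact("yellow_pages", "phone", "555-0100", "Seattle"),
    EntityFact("coupons", "coupon", "10% off pizza", "Seattle"),
    EntityFact("social", "review", "great pasta", "Portland"),
]
caption = build_caption("Family-owned pizzeria", facts, "pizza coupon", "Seattle")
```

In this sketch the coupon ranks first because it matches the query terms, the phone number is kept only because of the location match, and the review from another city is dropped.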
20 Claims
1. A computer-implemented system, comprising:
a processor coupled to a memory, the processor configured to execute computer-executable instructions associated with at least a data component;
a search engine configured to receive and process a query related to an underlying user task of a user for obtaining a search result comprising links to target webpages;
the data component configured to respond to processing of the query and operates to identify on-page data content elements on the target webpages related to the search result corresponding to the query, the data component obtains additional content element data comprising at least one of concepts or content, determined to be relevant to the content elements, from disparate off-page data from sources other than the target webpages to use in the search result, wherein the data component further comprises;
an offline component configured to collect content element data of an identifier associated with content elements from the off-page data sources, and determine size of the collected content element data of the identifier;
if the size of the collected content element data is below a threshold, all the collected content element data is stored in a network content index, and if the size of the collected content element data is not below the threshold, the collected content element data is injected into the network content index; and
an online component configured to retrieve collected content element data from the off-page data sources and applies a relevance heuristic to determine on-page data and off-page data to use in association with the search result, wherein the online component retrieves the collected content element data at runtime and sends a realtime call to an appropriate data source for additional content element data of a content element; and
a presentation component configured to present the on-page data content elements and the additional content element data from the off-page data sources in a caption accompanying the links to the target webpages in the search result.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. A computer-implemented method, comprising steps of:
processing a search query related to an underlying user task of a user, to obtain a search result comprising links to target web pages;
identifying on-page content elements in corresponding target webpages based on the search query;
retrieving off-page entity data to use in the search result, from off-page data sources different from the target webpages, the off-page entity data comprising at least one of concepts or content determined to be related to the on-page content elements;
collecting, by an offline component, content element data of an identifier associated with content elements from the off-page data sources;
determining size of the collected content element data of the identifier, and if the size of the collected content element data is below a threshold, storing all the collected content element data in a web content index, and if the size of the collected content element data is not below the threshold, injecting the collected content element data into the web content index;
retrieving, by an online component, the collected content element data from the off-page data sources, wherein the online component retrieves the collected content element data at runtime and sends a realtime call to an appropriate data source for additional content element data of a content element;
applying an importance heuristic to the off-page content element data and on-page content element data to determine importance criteria of the off-page content element data and on-page content element data to present; and
presenting the on-page data content element data and the additional content element data from the off-page data sources in a caption accompanying the links to the target web pages in the search result of the query.
Dependent claims: 11, 12, 13, 14, 15
16. A computer-implemented method, comprising steps of:
processing a query related to an underlying user task of a user, to obtain a search result comprising links to target webpages;
identifying a content element represented in each corresponding target webpage of the target webpages related to the search result based on the query;
retrieving off-page content element data related to the content element and on-page content element data related to the content element, the off-page data being retrieved from off-page data sources different from the target webpages;
collecting, by an offline component, content element data of an identifier associated with content elements from the off-page data sources;
determining size of the collected content element data of the identifier, and if the size of the collected content element data is below a threshold, storing all the collected content element data in a network content index, and if the size of the collected content element data is not below the threshold, injecting the collected content element data into the network content index;
retrieving, by an online component, the collected content element data from the off-page data sources, wherein the online component retrieves the collected content element data at runtime and sends a realtime call to an appropriate data source for additional content element data of a content element;
applying heuristics to the off-page content element data and on-page content element data to determine content element data to present; and
presenting the on-page data content element data and the additional content element data from the off-page data sources in a caption accompanying the links to the target webpages in the search result.
Dependent claims: 17, 18, 19, 20
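The offline and online steps recited in the independent claims can be pictured with a short Python sketch. It is one possible reading, not the claimed implementation. The claims do not say how "storing all" the data differs from "injecting" it, so the sketch assumes that over-threshold data is indexed only as a bounded slice and that the realtime call supplies the rest at query time. The names `offline_collect`, `online_retrieve`, and `SIZE_THRESHOLD`, and the dictionary-based index, are hypothetical.

```python
from typing import Callable, Dict, List

Source = Callable[[str], List[str]]  # assumed: identifier -> data items from one feed

SIZE_THRESHOLD = 3  # hypothetical limit, counted here in items per identifier


def offline_collect(identifier: str, sources: Dict[str, Source],
                    index: Dict[str, dict]) -> None:
    """Collect data for one identifier from every off-page source and index it."""
    collected: List[str] = []
    for fetch in sources.values():
        collected.extend(fetch(identifier))

    if len(collected) < SIZE_THRESHOLD:
        # Below the threshold: everything collected goes into the index.
        index[identifier] = {"items": collected, "complete": True}
    else:
        # ASSUMPTION: at or above the threshold, only a bounded slice is
        # injected; the remainder is left for a realtime call at query time.
        index[identifier] = {"items": collected[:SIZE_THRESHOLD], "complete": False}


def online_retrieve(identifier: str, index: Dict[str, dict],
                    realtime_call: Source) -> List[str]:
    """Read indexed data at runtime and top it up with a realtime call if needed."""
    entry = index.get(identifier, {"items": [], "complete": False})
    items = list(entry["items"])
    if not entry["complete"]:
        for item in realtime_call(identifier):
            if item not in items:  # skip data already present in the index
                items.append(item)
    return items


index: Dict[str, dict] = {}
small = {"yellow_pages": lambda _id: ["phone"], "coupons": lambda _id: ["coupon-1"]}
large = {"reviews": lambda _id: ["r1", "r2", "r3", "r4"]}

offline_collect("cafe-42", small, index)
offline_collect("hotel-7", large, index)
hotel_items = online_retrieve("hotel-7", index, lambda _id: ["r4", "r5"])
```

Under these assumptions the small entity is served entirely from the index, while the large one combines its indexed slice with whatever the realtime source returns.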
Specification