System and method for automatic generation of dynamic search abstracts contain metadata by crawler
First Claim
1. A system for automatically generating dynamic search abstracts, comprising:
- a crawler for crawling documents and acquiring metadata and link information from the documents;
a metadata repository for storing the metadata acquired by the crawler;
a link repository for storing link information acquired by the crawler;
an abstract engine for generating abstracts of the documents from the metadata;
an indexing engine for periodically indexing the metadata and the link information;
a search engine for applying a search query to the metadata indexed by the indexing engine, to generate a preliminary result set containing selected abstracts; and
wherein the search engine inquires if the link repository contains new link information about the preliminary result set, and updates the selected abstracts based on the new link information, if any, to generate the dynamic search abstracts.
1 Assignment
0 Petitions
Accused Products
Abstract
A computer program product is provided as an abstract update system that improves Internet searching by automatically generating accurate and dynamic search abstracts. The abstract update system enables search engines to provide the most recent abstract information for a document that matches a search query, even if this document may have been crawled after the metadata was indexed. The dynamic abstracts are generated from the latest crawl metadata and link information stored in a link repository. As a result, the dynamic abstracts contain metadata based primarily on changes to a candidate page itself, and changes to what other sources provide about this candidate page. For example, even if a candidate page has not changed, new pages may point to it; some pages that used to point to the candidate page no longer do so; or existing pages that point to the candidate page change their annotations.
-
Citations
22 Claims
-
1. A system for automatically generating dynamic search abstracts, comprising:
-
a crawler for crawling documents and acquiring metadata and link information from the documents; a metadata repository for storing the metadata acquired by the crawler; a link repository for storing link information acquired by the crawler; an abstract engine for generating abstracts of the documents from the metadata; an indexing engine for periodically indexing the metadata and the link information; a search engine for applying a search query to the metadata indexed by the indexing engine, to generate a preliminary result set containing selected abstracts; and wherein the search engine inquires if the link repository contains new link information about the preliminary result set, and updates the selected abstracts based on the new link information, if any, to generate the dynamic search abstracts. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A computer program product for automatically generating dynamic search abstracts, comprising:
-
a crawler for crawling documents and acquiring metadata and link information from the documents; a metadata repository for storing the metadata acquired by the crawler; a link repository for storing link information acquired by the crawler; an abstract engine for generating abstracts of the documents from the metadata; an indexing engine for periodically indexing the metadata and the link information; a search engine for applying a search query to the metadata indexed by the indexing engine, to generate a preliminary result set containing selected abstracts; and wherein the search engine inquires if the link repository contains new link information about the preliminary result set, and updates the selected abstracts based on the new link information, if any, to generate the dynamic search abstracts. - View Dependent Claims (7, 8, 9, 10)
-
-
11. A method for automatically generating dynamic search abstracts, comprising:
-
crawling documents and acquiring metadata and link information from the documents; storing the metadata acquired by the crawler in a metadata repository; storing link information acquired by the crawler in a link repository; generating abstracts of the documents from the metadata; periodically indexing the metadata and the link information; applying a search query to the metadata to generate a preliminary result set containing selected abstracts; and inquiring if the link repository contains new link information about the preliminary result set, and updating the selected abstracts based on the new link information, if any, to generate the dynamic search abstracts. - View Dependent Claims (12, 13, 14, 15, 16)
-
-
17. A computer program product having instruction codes for automatically generating dynamic search abstracts, comprising:
-
a first set of instruction codes that acquire documents and metadata and link information from the documents; a metadata repository that store the metadata; a link repository that store the link information; a second set of instruction codes that generate abstracts of the documents from the metadata; a third set of instruction codes that periodically index the metadata and the link information; a fourth set of instruction codes that apply a search query to the metadata, to generate a preliminary result set containing selected abstracts; and a fifth set of instruction codes that inquire if the link repository contains new link information about the preliminary result set, and that update the selected abstracts based on the new link information, if any, to generate the dynamic search abstracts. - View Dependent Claims (18, 19, 20, 21, 22)
-
Specification