Aggregating data from different sources
First Claim
1. A computer-implemented method to propagate updates to virtual pages, the method comprising:
- scanning a collection of documents to identify changes based an a hash identifier associated with a virtual page;
generating update notifications that include instructions for implementing update data based on the identified changes;
communicating the update notification to an index storing the virtual page; and
updating the virtual page in accordance with the instruction included in the update notification.
2 Assignments
0 Petitions
Accused Products
Abstract
A method and system that aggregates data associated with one or more entities from different data sources are provided. The data sources include documents, web pages, or images that have information about one or more entities. The information is extracted from the data sources based on criteria that define the entities. The extracted information is utilized to generate a hash identifier that corresponds to each entity and one or more storage locations. The one or more storage locations and associated hash identifiers are utilized to store the extracted information corresponding to the entities, and the extracted information for each entity is structured as a virtual page that is stored in an index having references to the data sources. The index storing the virtual pages is notified or updated when the associated data sources are modified.
-
Citations
20 Claims
-
1. A computer-implemented method to propagate updates to virtual pages, the method comprising:
-
scanning a collection of documents to identify changes based an a hash identifier associated with a virtual page; generating update notifications that include instructions for implementing update data based on the identified changes; communicating the update notification to an index storing the virtual page; and updating the virtual page in accordance with the instruction included in the update notification. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A computer-implemented method to aggregate data from different sources, the method comprising:
-
receiving a collection of criteria associated with an entity; generating a hash identifier corresponding to the entity from the collection of criteria; accessing a corpus of documents referenced in a document index as having data related to the entity; for each accessed document in the corpus of documents having data related to the entity, extracting information from the corpus of documents according to the collection of criteria; generating a virtual page representative of the entity based on the extracted information; storing the virtual page in a storage location locatable according to the hash identifier; and incorporating the virtual page into the document index. - View Dependent Claims (12, 13, 14, 15, 16)
-
-
17. A computing system for aggregating data from different sources, the system comprising:
-
an interface component configured to obtain a collection of criteria associated with an entity; an access component configured to access a corpus of documents referenced in a document index, the corpus of documents having data associated with the entity; a hashing component configured to generate a hash identifier corresponding to the entity from a subset of the collection of criteria associated with the entity; an extraction component configured to extract data from the corpus of documents according to the collection of criteria; an aggregation component configured to aggregate the extracted data into a virtual page and storing the virtual page in a storage location locatable by the hash identifier generated by the hashing component; and an indexing component configured to incorporate a virtual page generated by the aggregation component in the document index. - View Dependent Claims (18, 19, 20)
-
Specification