System and method for data collection
First Claim
1. A method of extracting data from one or more data sources to a central repository storing a data model, the data model comprising a set of components and relationships representing entities in an environment and the relationships between these entities, wherein each of the components and relationships in the data model has a type and a set of properties and the values for the type and set of properties of each of the components and relationships in the data model are based on a first set of data extracted at a first time from each of the one or more data sources, comprising:
extracting a second set of data from the one or more data sources at a second time subsequent to the first time;
normalizing and mapping the second set of data into a format associated with the data model in the central repository;
correlating the second set of data extracted at the second time with the components and relationships of the data model stored in the central repository;
comparing the components and relationships of the data model in the central repository with the second set of data; and
updating the data model in the central repository based on the comparison of the components and relationships of the data model, wherein updating the data model comprises
determining a set of operations to update the data central repository, wherein the set of operations is based on the comparison between the second set of data and the component and relationships in the central repository and the set of operations are configured to alter the types of values of the components or relationships in the central repository or to add components or relationships to the central repository to account for changes in the one or more data sources associated with the components or relationships which occurred between the first time and the second time, and
applying the set of operations to the central repository to update the data model.
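The compare, determine, and apply steps recited in the claim can be illustrated with a minimal Python sketch. It is not the patented implementation: the `Item` class, the row layout (`id`, `kind`, `attrs`), and the function names are hypothetical, and correlation is reduced to a lookup on a normalized identity key. Only the two kinds of operation named in the claim (altering an existing entry and adding a new one) are modeled.

```python
from dataclasses import dataclass, field


@dataclass
class Item:
    """A component or relationship: an identity key, a type, and properties."""
    key: str
    type: str
    properties: dict = field(default_factory=dict)


def normalize(raw_rows):
    """Map raw source rows (hypothetical layout) into Items keyed by identity."""
    items = {}
    for row in raw_rows:
        key = row["id"].strip().lower()  # normalize the identifier
        items[key] = Item(key, row["kind"], dict(row.get("attrs", {})))
    return items


def determine_operations(repository, extracted):
    """Compare stored items with the newly extracted set and list the updates."""
    ops = []
    for key, new in extracted.items():
        old = repository.get(key)  # correlate by identity key
        if old is None:
            ops.append(("add", new))
        elif old.type != new.type or old.properties != new.properties:
            ops.append(("alter", new))
    return ops


def apply_operations(repository, ops):
    """Apply each operation; both kinds write the new item under its key."""
    for _, item in ops:
        repository[item.key] = item


# First extraction populates the repository; the second one updates it.
repo = normalize([{"id": "Srv1", "kind": "Server", "attrs": {"ram": "4GB"}}])
second = normalize([
    {"id": "srv1 ", "kind": "Server", "attrs": {"ram": "8GB"}},
    {"id": "db1", "kind": "Database"},
])
ops = determine_operations(repo, second)
apply_operations(repo, ops)
```

Separating `determine_operations` from `apply_operations` mirrors the claim's wording, in which the set of operations is first determined from the comparison and only then applied to the repository.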
Abstract
Embodiments of the present invention provide methods and systems for extracting data from multiple sources and loading data into a central repository. One embodiment of a method of the present invention includes extracting a first set of data from one or more data sources, correlating the first set of data with a second set of data contained in a central repository, comparing the first set of data with the second set of data, determining a set of operations to be applied to the second set of data and applying the set of operations to the second set of data.
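The correlation step mentioned in the abstract can be sketched as follows. This is an illustrative assumption, not the method disclosed in the specification: records are plain dictionaries, and the identifying properties `serial` and `hostname` are hypothetical examples of values that different sources might share. In the abstract's terms, `extracted` is the first set of data and `repository` holds the second set.

```python
def correlate(extracted, repository, match_keys=("serial", "hostname")):
    """Pair each extracted record with a repository record sharing an identifier."""
    # Index repository records by every identifying property they carry.
    index = {}
    for record in repository:
        for key in match_keys:
            value = record.get(key)
            if value is not None:
                index.setdefault((key, str(value).lower()), record)

    matched, unmatched = [], []
    for record in extracted:
        hit = None
        for key in match_keys:  # try identifiers in priority order
            value = record.get(key)
            if value is not None:
                hit = index.get((key, str(value).lower()))
                if hit is not None:
                    break
        if hit is None:
            unmatched.append(record)  # candidate for an "add" operation
        else:
            matched.append((record, hit))  # candidate for comparison
    return matched, unmatched


repository = [{"serial": "A1", "hostname": "web01", "ram": "4GB"}]
extracted = [
    {"hostname": "WEB01", "ram": "8GB"},
    {"serial": "B2", "hostname": "db01"},
]
matched, unmatched = correlate(extracted, repository)
```

Matched pairs would then feed the comparison step, while unmatched records represent entities not yet present in the repository.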
21 Claims
1. A method of extracting data from one or more data sources to a central repository storing a data model, the data model comprising a set of components and relationships representing entities in an environment and the relationships between these entities, wherein each of the components and relationships in the data model has a type and a set of properties and the values for the type and set of properties of each of the components and relationships in the data model are based on a first set of data extracted at a first time from each of the one or more data sources, comprising:
extracting a second set of data from the one or more data sources at a second time subsequent to the first time;
normalizing and mapping the second set of data into a format associated with the data model in the central repository;
correlating the second set of data extracted at the second time with the components and relationships of the data model stored in the central repository;
comparing the components and relationships of the data model in the central repository with the second set of data; and
updating the data model in the central repository based on the comparison of the components and relationships of the data model, wherein updating the data model comprises
determining a set of operations to update the data central repository, wherein the set of operations is based on the comparison between the second set of data and the component and relationships in the central repository and the set of operations are configured to alter the types of values of the components or relationships in the central repository or to add components or relationships to the central repository to account for changes in the one or more data sources associated with the components or relationships which occurred between the first time and the second time, and
applying the set of operations to the central repository to update the data model.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A non-transitory computer readable medium comprising instructions for extracting data from one or more data sources to a central repository storing a data model, the data model comprising a set of components and relationships representing entities in an environment and the relationships between these entities, wherein each of the components and relationships in the data model has a type and a set of properties and the values for the type and set of properties of each of the components and relationships in the data model are based on a first set of data extracted at a first time from each of the one or more data sources, the instruction including instructions for:
extracting a second set of data from the one or more data sources at a second time subsequent to the first time;
normalizing and mapping the second set of data into a format associated with the data model in the central repository;
correlating the second set of data extracted at the second time with the components and relationships of the data model stored in the central repository;
comparing the components and relationships of the data model in the central repository with the second set of data; and
updating the data model in the central repository based on the comparison of the components and relationships of the data model, wherein updating the data model comprises
determining a set of operations to update the data central repository, wherein the set of operations is based on the comparison between the second set of data and the component and relationships in the central repository and the set of operations are configured to alter the types of values of the components or relationships in the central repository or to add components or relationships to the central repository to account for changes in the one or more data sources associated with the components or relationships which occurred between the first time and the second time, and
applying the set of operations to the central repository to update the data model.
Dependent claims: 9, 10, 11, 12, 13, 14
15. A system, including:
one or more data sources;
a central repository coupled to the one or more data sources, the central repository storing a data model, the data model comprising a set of components and relationships representing entities in an environment and the relationships between these entities, wherein each of the components and relationships in the data model has a type and a set of properties and the values for the type and set of properties of each of the components and relationships in the data model are based on a first set of data extracted at a first time from each of the one or more data sources; and
a non-transitory computer readable medium, including instructions for extracting data from the one or more data sources to the central repository, the instruction including instructions for:
extracting a second set of data from the one or more data sources at a second time subsequent to the first time;
normalizing and mapping the second set of data into a format associated with the data model in the central repository;
correlating the second set of data extracted at the second time with the components and relationships of the data model stored in the central repository;
comparing the components and relationships of the data model in the central repository with the second set of data; and
updating the data model in the central repository based on the comparison of the components and relationships of the data model, wherein updating the data model comprises
determining a set of operations to update the data central repository, wherein the set of operations is based on the comparison between the second set of data and the component and relationships in the central repository and the set of operations are configured to alter the types of values of the components or relationships in the central repository or to add components or relationships to the central repository to account for changes in the one or more data sources associated with the components or relationships which occurred between the first time and the second time, and
applying the set of operations to the central repository to update the data model.
Dependent claims: 16, 17, 18, 19, 20, 21
Specification