TECHNIQUES FOR APPLICATION DATA SCRUBBING, REPORTING, AND ANALYSIS
First Claim
Patent Images
1. A machine-implemented method, comprising:
- acquiring a first schema for a first data source and a second schema for a second data source;
using the first and second schemas to detect data types and patterns for the data types in both the data sources;
matching some first patterns associated with the first data source to other second patterns associated with the second data source in response to matching rules; and
generating a report that identifies the matched first patterns of the first data source to the second patterns of the second source.
9 Assignments
0 Petitions
Accused Products
Abstract
Techniques for application data scrubbing, reporting, and analysis are presented. A plurality of data sources are analyzed in accordance with their schemas and matching rules. Merging rules are applied to merge a number of data types across the data sources together. A report is produced for inspection and a master data source is generated. The processing can be iterated with rules modified in response to the report for purposes of refining the master data source.
39 Citations
25 Claims
-
1. A machine-implemented method, comprising:
-
acquiring a first schema for a first data source and a second schema for a second data source; using the first and second schemas to detect data types and patterns for the data types in both the data sources; matching some first patterns associated with the first data source to other second patterns associated with the second data source in response to matching rules; and generating a report that identifies the matched first patterns of the first data source to the second patterns of the second source. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A machine-implemented method, comprising:
-
interacting with a data analyst via an interface presented to the data analyst; receiving identifiers for data schemas and data sources associated with those data schemas from the data analyst via the interface; acquiring merge rules from the data analyst via the interface, wherein the merge rules identify conditions within the data sources for merging different data types defined in the data schemas together with one another; and parsing the data sources using the data schemas and enforcing the merge rules to produce a merge report and to produce a master data source that combines the data sources together in accordance with the merge rules. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A machine-implemented system, comprising:
-
a data analysis tool implemented in a machine-accessible and computer-readable medium and to execute on a machine; and a data analyzer implemented in a machine-accessible and computer-readable medium and to execute on the machine or another machine of a network; wherein the data analysis tool is adapted to provide an interface to a data analyst that permits the data analyst to identify data sources for analysis, and wherein the data analyzer is to acquire a separate data schema for each of the data sources and uses the data schemas to parse the data sources to identify data types and patterns, and wherein the data analyzer uses merge rules and policies to merge some of the data types and their corresponding data from the data sources together in a master data source. - View Dependent Claims (16, 17, 18, 19, 20)
-
-
21. A machine-implemented system, comprising:
-
applications implemented in a machine-accessible and computer-readable medium and to process on multiple machines of a network; and a data analyzer tool implemented in a machine-accessible and computer-readable medium and to process on a machine of the network; wherein each application produces application data defined by its own schema and wherein the data analyzer parses the application data using the schemas and further uses merge rules and policies to map the application data to a master data source. - View Dependent Claims (22, 23, 24, 25)
-
Specification