Data profiling
First Claim
Patent Images
1. A method for processing data including:
- profiling data from a data source, including reading the data from the data source, computing summary data characterizing the data while reading the data, and storing profile information that is based on the summary data; and
processing the data from the data source, including accessing the stored profile information and processing the data according to the accessed profile information.
4 Assignments
0 Petitions
Accused Products
Abstract
Processing data includes profiling data from a data source, including reading the data from the data source, computing summary data characterizing the data while reading the data, and storing profile information that is based on the summary data. The data is then processed from the data source. This processing includes accessing the stored profile information and processing the data according to the accessed profile information.
-
Citations
29 Claims
-
1. A method for processing data including:
-
profiling data from a data source, including reading the data from the data source, computing summary data characterizing the data while reading the data, and storing profile information that is based on the summary data; and
processing the data from the data source, including accessing the stored profile information and processing the data according to the accessed profile information. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25)
-
-
26. A method for processing data including:
-
profiling data from a data source, including reading the data from the data source, computing summary data characterizing the data while reading the data, and storing profile information that is based on the summary data;
whereinprofiling the data includes profiling said data in parallel, including partitioning the data into parts and processing the parts using separate ones of a first set of parallel components.
-
-
27. Software stored on a computer-readable medium including instructions for causing a computer system to:
-
profile data from a data source by reading the data from the data source, compute summary data characterizing the data while reading the data, and store profile information that is based on the summary data; and
process the data from the data source by accessing the stored profile information and process the data according to the accessed profile information.
-
-
28. A data processing system including:
-
a profiling module configured to read data from a data source, to compute summary data characterizing the data while reading the data, and to store profile information that is based on the summary data; and
a processing module configured to access the stored profile information and to process the data from the data source according to the accessed profile information.
-
-
29. A data processing system including:
-
means for profiling data from a data source, including means for reading the data from the data source, means for computing summary data characterizing the data while reading the data, and means for storing profile information that is based on the summary data; and
means for processing the data from the data source, including means for accessing the stored profile information and means for processing the data according to the accessed profile information.
-
Specification