Profiling data with location information
First Claim
1. A method for profiling data stored in at least one data storage system, the method including:
- accessing at least one collection of records stored in the data storage system over an interface coupled to the data storage system; and
processing the collection of records to generate result information characterizing values appearing in one or more specified fields of the collection of records, the processing including;
generating, for a first set of distinct values appearing in a first set of one or more fields of the records in the collection, corresponding location information that identifies, for each distinct value in the first set of distinct values, every record in which the distinct value appears,identifying, for a second set of one or more fields of the records in the collection different from the first set of one or more fields, a second set of distinct values appearing in the second set of one or more fields, andgenerating the result information characterizing values appearing in the one or more specified fields of the collection of records, based at least in part on;
locating at least one record of the collection of records using the location information for at least one value appearing in the first set of one or more fields, and determining at least one value appearing in the second set of one or more fields of the located record.
3 Assignments
0 Petitions
Accused Products
Abstract
Profiling data includes processing an accessed collection of records, including: generating, for a first set of distinct values appearing in a first set of one or more fields, corresponding location information; generating, for the first set of fields, a corresponding list of entries identifying a distinct value from the first set of distinct values and the location information for the distinct value; generating, for a second set of one or more fields, a corresponding list of entries, with each entry identifying a distinct value from a second set of distinct values appearing in the second set of fields; and generating result information, based at least in part on: locating at least one record of the collection using the location information for at least one value appearing in the first set of fields, and determining at least one value appearing in the second set of fields of the located record.
-
Citations
40 Claims
-
1. A method for profiling data stored in at least one data storage system, the method including:
-
accessing at least one collection of records stored in the data storage system over an interface coupled to the data storage system; and processing the collection of records to generate result information characterizing values appearing in one or more specified fields of the collection of records, the processing including; generating, for a first set of distinct values appearing in a first set of one or more fields of the records in the collection, corresponding location information that identifies, for each distinct value in the first set of distinct values, every record in which the distinct value appears, identifying, for a second set of one or more fields of the records in the collection different from the first set of one or more fields, a second set of distinct values appearing in the second set of one or more fields, and generating the result information characterizing values appearing in the one or more specified fields of the collection of records, based at least in part on;
locating at least one record of the collection of records using the location information for at least one value appearing in the first set of one or more fields, and determining at least one value appearing in the second set of one or more fields of the located record. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A computer program, stored on a computer-readable storage medium, for profiling data stored in at least one data storage system, the computer program including instructions for causing a computing system to:
-
access at least one collection of records stored in the data storage system over an interface coupled to the data storage system; and process the collection of records to generate result information characterizing values appearing in one or more specified fields of the collection of records, the processing including; generating, for a first set of distinct values appearing in a first set of one or more fields of the records in the collection, corresponding location information that identifies, for each distinct value in the first set of distinct values, every record in which the distinct value appears, identifying, for a second set of one or more fields of the records in the collection different from the first set of one or more fields, a second set of distinct values appearing in the second set of one or more fields, and generating the result information characterizing values appearing in the one or more specified fields of the collection of records, based at least in part on;
locating at least one record of the collection of records using the location information for at least one value appearing in the first set of one or more fields, and determining at least one value appearing in the second set of one or more fields of the located record. - View Dependent Claims (17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28)
-
-
15. A computing system for profiling data stored in at least one data storage system, the computing system including:
-
an interface coupled to the data storage system configured to access at least one collection of records stored in the data storage system; and at least one processor configured to process the collection of records to generate result information characterizing values appearing in one or more specified fields of the collection of records, the processing including; generating, for a first set of distinct values appearing in a first set of one or more fields of the records in the collection, corresponding location information that identifies, for each distinct value in the first set of distinct values, every record in which the distinct value appears, identifying, for a second set of one or more fields of the records in the collection different from the first set of one or more fields, a second set of distinct values appearing in the second set of one or more fields, and generating the result information characterizing values appearing in the one or more specified fields of the collection of records, based at least in part on;
locating at least one record of the collection of records using the location information for at least one value appearing in the first set of one or more fields, and determining at least one value appearing in the second set of one or more fields of the located record. - View Dependent Claims (29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40)
-
-
16. A computing system for profiling data stored in at least one data storage system, the computing system including:
-
means for accessing at least one collection of records stored in the data storage system; and means for processing the collection of records to generate result information characterizing values appearing in one or more specified fields of the collection of records, the processing including; generating, for a first set of distinct values appearing in a first set of one or more fields of the records in the collection, corresponding location information that identifies, for each distinct value in the first set of distinct values, every record in which the distinct value appears, identifying, for a second set of one or more fields of the records in the collection different from the first set of one or more fields, a second set of distinct values appearing in the second set of one or more fields, and generating the result information characterizing values appearing in the one or more specified fields of the collection of records, based at least in part on;
locating at least one record of the collection of records using the location information for at least one value appearing in the first set of one or more fields, and determining at least one value appearing in the second set of one or more fields of the located record.
-
Specification