Profiling data with location information
First Claim
1. A method for profiling data stored in at least one data storage system, the method including:
- accessing at least one collection of records stored in the data storage system over an interface coupled to the data storage system; and
processing the collection of records to generate result information characterizing values appearing in a first set of one or more fields of the collection of records based on profile information for one or more values, where profile information for a value appearing in a field that is being profiled summarizes the collection of records in which the value appears in the field that is being profiled, the processing including;
receiving, for a second set of two or more fields of the collection of records, a corresponding list of entries, with each entry identifying (1) a distinct combination of values appearing in the second set of two or more fields, and (2) profile information for the distinct combination of values that includes location information that identifies, for each distinct combination of values, every record in the collection in which the distinct combination of values appears; and
generating the result information characterizing values appearing in the first set of one or more fields of the collection of records, based at least in part on;
combining first profile information from a first entry of the list having a first distinct combination of values, and second profile information for at least one field having at least one value different from the first distinct combination of values, and determining profile information for at least one value appearing in at least one field of the first set of one or more fields based on the combined profile information.
3 Assignments
0 Petitions
Accused Products
Abstract
Profiling data includes processing an accessed collection of records, including: generating, for a first set of distinct values appearing in a first set of one or more fields, corresponding location information; generating, for the first set of fields, a corresponding list of entries identifying a distinct value from the first set of distinct values and the location information for the distinct value; generating, for a second set of one or more fields, a corresponding list of entries, with each entry identifying a distinct value from a second set of distinct values appearing in the second set of fields; and generating result information, based at least in part on: locating at least one record of the collection using the location information for at least one value appearing in the first set of fields, and determining at least one value appearing in the second set of fields of the located record.
72 Citations
73 Claims
-
1. A method for profiling data stored in at least one data storage system, the method including:
-
accessing at least one collection of records stored in the data storage system over an interface coupled to the data storage system; and processing the collection of records to generate result information characterizing values appearing in a first set of one or more fields of the collection of records based on profile information for one or more values, where profile information for a value appearing in a field that is being profiled summarizes the collection of records in which the value appears in the field that is being profiled, the processing including; receiving, for a second set of two or more fields of the collection of records, a corresponding list of entries, with each entry identifying (1) a distinct combination of values appearing in the second set of two or more fields, and (2) profile information for the distinct combination of values that includes location information that identifies, for each distinct combination of values, every record in the collection in which the distinct combination of values appears; and generating the result information characterizing values appearing in the first set of one or more fields of the collection of records, based at least in part on;
combining first profile information from a first entry of the list having a first distinct combination of values, and second profile information for at least one field having at least one value different from the first distinct combination of values, and determining profile information for at least one value appearing in at least one field of the first set of one or more fields based on the combined profile information. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A computer program, stored on a computer-readable storage medium, for profiling data stored in at least one data storage system, the computer program including instructions for causing a computing system to:
-
access at least one collection of records stored in the data storage system over an interface coupled to the data storage system; and process the collection of records to generate result information characterizing values appearing in a first set of one or more fields of the collection of records based on profile information for one or more values, where profile information for a value appearing in a field that is being profiled summarizes the collection of records in which the value appears in the field that is being profiled, the processing including; receiving, for a second set of two or more fields of the collection of records, a corresponding list of entries, with each entry identifying (1) a distinct combination of values appearing in the second set of two or more fields, and (2) profile information for the distinct combination of values that includes location information that identifies, for each distinct combination of values, every record in the collection in which the distinct combination of values appears; and generating the result information characterizing values appearing in the first set of one or more fields of the collection of records, based at least in part on;
combining first profile information from a first entry of the list having a first distinct combination of values, and second profile information for at least one field having at least one value different from the first distinct combination of values, and determining profile information for at least one value appearing in at least one field of the first set of one or more fields based on the combined profile information. - View Dependent Claims (24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42)
-
-
22. A computing system for profiling data stored in at least one data storage system, the computing system including:
-
an interface coupled to the data storage system configured to access at least one collection of records stored in the data storage system; and at least one processor configured to process the collection of records to generate result information characterizing values appearing in a first set of one or more fields of the collection of records based on profile information for one or more values, where profile information for a value appearing in a field that is being profiled summarizes the collection of records in which the value appears in the field that is being profiled, the processing including; receiving, for a second set of two or more fields of the collection of records, a corresponding list of entries, with each entry identifying (1) a distinct combination of values appearing in the second set of two or more fields, and (2) profile information for the distinct combination of values that includes location information that identifies, for each distinct combination of values, every record in the collection in which the distinct combination of values appears; and generating the result information characterizing values appearing in the first set of one or more fields of the collection of records, based at least in part on;
combining first profile information from a first entry of the list having a first distinct combination of values, and second profile information for at least one field having at least one value different from the first distinct combination of values, and determining profile information for at least one value appearing in at least one field of the first set of one or more fields based on the combined profile information. - View Dependent Claims (43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61)
-
-
23. A computing system for profiling data stored in at least one data storage system, the computing system including:
-
means for accessing at least one collection of records stored in the data storage system; and means for processing the collection of records to generate result information characterizing values appearing in a first set of one or more fields of the collection of records based on profile information for one or more values, where profile information for a value appearing in a field that is being profiled summarizes the collection of records in which the value appears in the field that is being profiled, the processing including; receiving, for a second set of two or more fields of the collection of records, a corresponding list of entries, with each entry identifying (1) a distinct combination of values appearing in the second set of two or more fields, and (2) profile information for the distinct combination of values that includes location information that identifies, for each distinct combination of values, every record in the collection in which the distinct combination of values appears; and generating the result information characterizing values appearing in the first set of one or more fields of the collection of records, based at least in part on;
combining first profile information from a first entry of the list having a first distinct combination of values, and second profile information for at least one field having at least one value different from the first distinct combination of values, and determining profile information for at least one value appearing in at least one field of the first set of one or more fields based on the combined profile information.
-
-
62. A method for profiling data stored in at least one data storage system, the method including:
-
accessing at least one collection of records stored in the data storage system over an interface coupled to the data storage system; and processing the collection of records to generate result information characterizing values appearing in a first set of one or more fields of the collection of records based on profile information for one or more values, where profile information for a value appearing in a field that is being profiled summarizes the collection of records in which the value appears in the field that is being profiled, the processing including; receiving, for a second set of two or more fields of the collection of records, a corresponding list of entries, with each entry identifying (1) a distinct combination of values appearing in the second set of two or more fields, and (2) profile information for the distinct combination of values; and generating the result information characterizing values appearing in the first set of one or more fields of the collection of records, based at least in part on;
combining first profile information from a first entry of the list having a first distinct combination of values, and second profile information for at least one field having at least one value different from the first distinct combination of values, and determining profile information for at least one value appearing in at least one field of the first set of one or more fields based on the combined profile information;wherein the list includes an entry for every distinct combination of values appearing in the second set of two or more fields; wherein the first set of one or more fields does not include any fields from the second set of two or more fields; wherein combining first profile information from a first entry of the list having a first distinct combination of values, and second profile information for at least one field having at least one value different from the first distinct combination of values includes;
combining first profile information from a first entry of the list having a first distinct combination of values, and second profile information for at least one field of the first set of one or more fields; andwherein the profile information from the first entry of the list includes location information that identifies every record in the collection in which the first distinct combination of values appears in the second set of two or more fields. - View Dependent Claims (63)
-
-
64. A computer program, stored on a computer-readable storage medium, for profiling data stored in at least one data storage system, the computer program including instructions for causing a computing system to:
-
access at least one collection of records stored in the data storage system over an interface coupled to the data storage system; and process the collection of records to generate result information characterizing values appearing in a first set of one or more fields of the collection of records based on profile information for one or more values, where profile information for a value appearing in a field that is being profiled summarizes the collection of records in which the value appears in the field that is being profiled, the processing including; receiving, for a second set of two or more fields of the collection of records, a corresponding list of entries, with each entry identifying (1) a distinct combination of values appearing in the second set of two or more fields, and (2) profile information for the distinct combination of values; and generating the result information characterizing values appearing in the first set of one or more fields of the collection of records, based at least in part on;
combining first profile information from a first entry of the list having a first distinct combination of values, and second profile information for at least one field having at least one value different from the first distinct combination of values, and determining profile information for at least one value appearing in at least one field of the first set of one or more fields based on the combined profile information;wherein the list includes an entry for every distinct combination of values appearing in the second set of two or more fields; wherein the first set of one or more fields does not include any fields from the second set of two or more fields; wherein combining first profile information from a first entry of the list having a first distinct combination of values, and second profile information for at least one field having at least one value different from the first distinct combination of values includes;
combining first profile information from a first entry of the list having a first distinct combination of values, and second profile information for at least one field of the first set of one or more fields; andwherein the profile information from the first entry of the list includes location information that identifies every record in the collection in which the first distinct combination of values appears in the second set of two or more fields. - View Dependent Claims (65)
-
-
66. A computing system for profiling data stored in at least one data storage system, the computing system including:
-
an interface coupled to the data storage system configured to access at least one collection of records stored in the data storage system; and at least one processor configured to process the collection of records to generate result information characterizing values appearing in a first set of one or more fields of the collection of records based on profile information for one or more values, where profile information for a value appearing in a field that is being profiled summarizes the collection of records in which the value appears in the field that is being profiled, the processing including; receiving, for a second set of two or more fields of the collection of records, a corresponding list of entries, with each entry identifying (1) a distinct combination of values appearing in the second set of two or more fields, and (2) profile information for the distinct combination of values; and generating the result information characterizing values appearing in the first set of one or more fields of the collection of records, based at least in part on;
combining first profile information from a first entry of the list having a first distinct combination of values, and second profile information for at least one field having at least one value different from the first distinct combination of values, and determining profile information for at least one value appearing in at least one field of the first set of one or more fields based on the combined profile information;wherein the list includes an entry for every distinct combination of values appearing in the second set of two or more fields; wherein the first set of one or more fields does not include any fields from the second set of two or more fields; wherein combining first profile information from a first entry of the list having a first distinct combination of values, and second profile information for at least one field having at least one value different from the first distinct combination of values includes;
combining first profile information from a first entry of the list having a first distinct combination of values, and second profile information for at least one field of the first set of one or more fields; andwherein the profile information from the first entry of the list includes location information that identifies every record in the collection in which the first distinct combination of values appears in the second set of two or more fields. - View Dependent Claims (67)
-
-
68. A computing system for profiling data stored in at least one data storage system, the computing system including:
-
means for accessing at least one collection of records stored in the data storage system; and means for processing the collection of records to generate result information characterizing values appearing in a first set of one or more fields of the collection of records based on profile information for one or more values, where profile information for a value appearing in a field that is being profiled summarizes the collection of records in which the value appears in the field that is being profiled, the processing including; receiving, for a second set of two or more fields of the collection of records, a corresponding list of entries, with each entry identifying (1) a distinct combination of values appearing in the second set of two or more fields, and (2) profile information for the distinct combination of values; and generating the result information characterizing values appearing in the first set of one or more fields of the collection of records, based at least in part on;
combining first profile information from a first entry of the list having a first distinct combination of values, and second profile information for at least one field having at least one value different from the first distinct combination of values, and determining profile information for at least one value appearing in at least one field of the first set of one or more fields based on the combined profile information;wherein the list includes an entry for every distinct combination of values appearing in the second set of two or more fields; wherein the first set of one or more fields does not include any fields from the second set of two or more fields; wherein combining first profile information from a first entry of the list having a first distinct combination of values, and second profile information for at least one field having at least one value different from the first distinct combination of values includes;
combining first profile information from a first entry of the list having a first distinct combination of values, and second profile information for at least one field of the first set of one or more fields; andwherein the profile information from the first entry of the list includes location information that identifies every record in the collection in which the first distinct combination of values appears in the second set of two or more fields. - View Dependent Claims (69)
-
-
70. A method for profiling data stored in at least one data storage system, the method including:
-
accessing at least one collection of records stored in the data storage system over an interface coupled to the data storage system; and processing the collection of records to generate result information characterizing values appearing in a first set of one or more fields of the collection of records based on profile information for one or more values, where profile information for a value appearing in a field that is being profiled summarizes the collection of records in which the value appears in the field that is being profiled, the processing including; receiving, for a second set of two or more fields of the collection of records, a corresponding list of entries, with each entry identifying (1) a distinct combination of values appearing in the second set of two or more fields, and (2) profile information for the distinct combination of values; and generating the result information characterizing values appearing in the first set of one or more fields of the collection of records, based at least in part on;
combining first profile information from a first entry of the list having a first distinct combination of values, and second profile information for at least one field having at least one value different from the first distinct combination of values, and determining profile information for at least one value appearing in at least one field of the first set of one or more fields based on the combined profile information;wherein the list includes an entry for every distinct combination of values appearing in the second set of two or more fields; wherein the first set of one or more fields does not include any fields from the second set of two or more fields; wherein combining first profile information from a first entry of the list having a first distinct combination of values, and second profile information for at least one field having at least one value different from the first distinct combination of values includes;
combining first profile information from a first entry of the list having a first distinct combination of values, and second profile information for at least one field of the first set of one or more fields; andwherein combining the first profile information and second profile information includes performing an intersection of the first profile information and second profile information.
-
-
71. A computer program, stored on a computer-readable storage medium, for profiling data stored in at least one data storage system, the computer program including instructions for causing a computing system to:
-
access at least one collection of records stored in the data storage system over an interface coupled to the data storage system; and process the collection of records to generate result information characterizing values appearing in a first set of one or more fields of the collection of records based on profile information for one or more values, where profile information for a value appearing in a field that is being profiled summarizes the collection of records in which the value appears in the field that is being profiled, the processing including; receiving, for a second set of two or more fields of the collection of records, a corresponding list of entries, with each entry identifying (1) a distinct combination of values appearing in the second set of two or more fields, and (2) profile information for the distinct combination of values; and generating the result information characterizing values appearing in the first set of one or more fields of the collection of records, based at least in part on;
combining first profile information from a first entry of the list having a first distinct combination of values, and second profile information for at least one field having at least one value different from the first distinct combination of values, and determining profile information for at least one value appearing in at least one field of the first set of one or more fields based on the combined profile information;wherein the list includes an entry for every distinct combination of values appearing in the second set of two or more fields; wherein the first set of one or more fields does not include any fields from the second set of two or more fields; wherein combining first profile information from a first entry of the list having a first distinct combination of values, and second profile information for at least one field having at least one value different from the first distinct combination of values includes;
combining first profile information from a first entry of the list having a first distinct combination of values, and second profile information for at least one field of the first set of one or more fields; andwherein combining the first profile information and second profile information includes performing an intersection of the first profile information and second profile information.
-
-
72. A computing system for profiling data stored in at least one data storage system, the computing system including:
-
an interface coupled to the data storage system configured to access at least one collection of records stored in the data storage system; and at least one processor configured to process the collection of records to generate result information characterizing values appearing in a first set of one or more fields of the collection of records based on profile information for one or more values, where profile information for a value appearing in a field that is being profiled summarizes the collection of records in which the value appears in the field that is being profiled, the processing including; receiving, for a second set of two or more fields of the collection of records, a corresponding list of entries, with each entry identifying (1) a distinct combination of values appearing in the second set of two or more fields, and (2) profile information for the distinct combination of values; and generating the result information characterizing values appearing in the first set of one or more fields of the collection of records, based at least in part on;
combining first profile information from a first entry of the list having a first distinct combination of values, and second profile information for at least one field having at least one value different from the first distinct combination of values, and determining profile information for at least one value appearing in at least one field of the first set of one or more fields based on the combined profile information;wherein the list includes an entry for every distinct combination of values appearing in the second set of two or more fields; wherein the first set of one or more fields does not include any fields from the second set of two or more fields; wherein combining first profile information from a first entry of the list having a first distinct combination of values, and second profile information for at least one field having at least one value different from the first distinct combination of values includes;
combining first profile information from a first entry of the list having a first distinct combination of values, and second profile information for at least one field of the first set of one or more fields; andwherein combining the first profile information and second profile information includes performing an intersection of the first profile information and second profile information.
-
-
73. A computing system for profiling data stored in at least one data storage system, the computing system including:
-
means for accessing at least one collection of records stored in the data storage system; and means for processing the collection of records to generate result information characterizing values appearing in a first set of one or more fields of the collection of records based on profile information for one or more values, where profile information for a value appearing in a field that is being profiled summarizes the collection of records in which the value appears in the field that is being profiled, the processing including; receiving, for a second set of two or more fields of the collection of records, a corresponding list of entries, with each entry identifying (1) a distinct combination of values appearing in the second set of two or more fields, and (2) profile information for the distinct combination of values; and generating the result information characterizing values appearing in the first set of one or more fields of the collection of records, based at least in part on;
combining first profile information from a first entry of the list having a first distinct combination of values, and second profile information for at least one field having at least one value different from the first distinct combination of values, and determining profile information for at least one value appearing in at least one field of the first set of one or more fields based on the combined profile information;wherein the list includes an entry for every distinct combination of values appearing in the second set of two or more fields; wherein the first set of one or more fields does not include any fields from the second set of two or more fields; wherein combining first profile information from a first entry of the list having a first distinct combination of values, and second profile information for at least one field having at least one value different from the first distinct combination of values includes;
combining first profile information from a first entry of the list having a first distinct combination of values, and second profile information for at least one field of the first set of one or more fields; andwherein combining the first profile information and second profile information includes performing an intersection of the first profile information and second profile information.
-
Specification