Generating data pattern information
First Claim
1. A method, including:
- storing, in a data storage system, at least one dataset including a plurality of records; and
processing, in a data processing system coupled to the data storage system, multiple records of the plurality of records to produce codes representing data patterns in the multiple records, the processing including;
for each of the multiple records in the plurality of records, associating with the record a code encoding a plurality of elements, wherein each element represents a state or property of a corresponding field or combination of fields as one of a set of element values, and, for at least one element of at least a first code, the number of element values in the set is smaller than the total number of data values that occur in the corresponding field or combination of fields over all of the plurality of records in the dataset;
determining one or more data patterns characterizing the multiple records based at least in part on the multiple codes associated with the multiple records; and
processing the multiple records in the plurality of records based on the determined one or more data patterns, including at least one of;
processing one or more subsets of the multiple records based on at least one corresponding determined data pattern from the determined one or more data patterns, or determining at least one correlation between states or properties of different fields based on the determined one or more data patterns characterizing the multiple records.
3 Assignments
0 Petitions
Accused Products
Abstract
A data storage system stores at least one dataset including a plurality of records. A data processing system, coupled to the data storage system, processes the plurality of records to produce codes representing data patterns in the records, the processing including: for each of multiple records in the plurality of records, associating with the record a code encoding one or more elements, wherein each element represents a state or property of a corresponding field or combination of fields as one of a set of element values, and, for at least one element of at least a first code, the number of element values in the set is smaller than the total number of data values that occur in the corresponding field or combination of fields over all of the plurality of records in the dataset.
86 Citations
52 Claims
-
1. A method, including:
-
storing, in a data storage system, at least one dataset including a plurality of records; and processing, in a data processing system coupled to the data storage system, multiple records of the plurality of records to produce codes representing data patterns in the multiple records, the processing including; for each of the multiple records in the plurality of records, associating with the record a code encoding a plurality of elements, wherein each element represents a state or property of a corresponding field or combination of fields as one of a set of element values, and, for at least one element of at least a first code, the number of element values in the set is smaller than the total number of data values that occur in the corresponding field or combination of fields over all of the plurality of records in the dataset; determining one or more data patterns characterizing the multiple records based at least in part on the multiple codes associated with the multiple records; and processing the multiple records in the plurality of records based on the determined one or more data patterns, including at least one of;
processing one or more subsets of the multiple records based on at least one corresponding determined data pattern from the determined one or more data patterns, or determining at least one correlation between states or properties of different fields based on the determined one or more data patterns characterizing the multiple records. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A non-transitory computer-readable medium storing a computer program, the computer program including instructions for causing a computer system to:
-
store, in a data storage system, at least one dataset including a plurality of records; and process, in at least one processor of the computer system coupled to the data storage system, multiple records of the plurality of records to produce codes representing data patterns in the multiple records, the processing including; for each of the multiple records in the plurality of records, associating with the record a code encoding a plurality of elements, wherein each element represents a state or property of a corresponding field or combination of fields as one of a set of element values, and, for at least one element of at least a first code, the number of element values in the set is smaller than the total number of data values that occur in the corresponding field or combination of fields over all of the plurality of records in the dataset; determining one or more data patterns characterizing the multiple records based at least in part on the multiple codes associated with the multiple records; and processing the multiple records in the plurality of records based on the determined one or more data patterns, including at least one of;
processing one or more subsets of the multiple records based on at least one corresponding determined data pattern from the determined one or more data patterns, or determining at least one correlation between states or properties of different fields based on the determined one or more data patterns characterizing the multiple records. - View Dependent Claims (16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28)
-
-
29. A system, the system including:
-
a data storage system configured to store at least one dataset including a plurality of records; and a data processing system, coupled to the data storage system, configured to process multiple records of the plurality of records to produce codes representing data patterns in the multiple records, the processing including; for each of the multiple records in the plurality of records, associating with the record a code encoding a plurality of elements, wherein each element represents a state or property of a corresponding field or combination of fields as one of a set of element values, and, for at least one element of at least a first code, the number of element values in the set is smaller than the total number of data values that occur in the corresponding field or combination of fields over all of the plurality of records in the dataset; determining one or more data patterns characterizing the multiple records based at least in part on the multiple codes associated with the multiple records; and processing the multiple records in the plurality of records based on the determined one or more data patterns, including at least one of;
processing one or more subsets of the multiple records based on at least one corresponding determined data pattern from the determined one or more data patterns, or determining at least one correlation between states or properties of different fields based on the determined one or more data patterns characterizing the multiple records. - View Dependent Claims (30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42)
-
-
43. A system, the system including:
-
means for storing at least one dataset including a plurality of records; and means for processing multiple records of the plurality of records to produce codes representing data patterns in the multiple records, the processing including; for each of the multiple records in the plurality of records, associating with the record a code encoding a plurality of elements, wherein each element represents a state or property of a corresponding field or combination of fields as one of a set of element values, and, for at least one element of at least a first code, the number of element values in the set is smaller than the total number of data values that occur in the corresponding field or combination of fields over all of the plurality of records in the dataset; determining one or more data patterns characterizing the multiple records based at least in part on the multiple codes associated with the multiple records; and processing the multiple records in the plurality of records based on the determined one or more data patterns, including at least one of;
processing one or more subsets of the multiple records based on at least one corresponding determined data pattern from the determined one or more data patterns, or determining at least one correlation between states or properties of different fields based on the determined one or more data patterns characterizing the multiple records. - View Dependent Claims (44, 45, 46, 47, 48, 49, 50, 51, 52)
-
Specification