UNDERSTANDING DATA IN DATA SETS
Abstract
Among other things, there are two or more data sets. Each of the data sets contains data that can be interpreted as records each having data values for data fields. Each of the data sets contains at least some data that is related to data in at least one of the other data sets. The data in different data sets is organized or expressed possibly differently. Each of the data sets is susceptible to a definition of a key for the records of the data set. The data sets are characterized by repetitions of at least one of (a) records, (b) portions of keys, or (c) instances of values for data fields. Information about at least one of the repetitions is provided to a user.
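As an illustration only, the three kinds of repetition named in the abstract can be sketched in Python. This is a minimal sketch, not the method of the specification: the function name `profile_repetitions`, the sample `orders` data set, the choice of `("cust", "order")` as its key, and the treatment of a "portion of a key" as a leading prefix of the key fields are all assumptions made for the example.

```python
from collections import Counter


def profile_repetitions(records, key_fields):
    """Report repeated records, repeated key prefixes, and repeated field values.

    records    -- list of dicts, one dict per record (hypothetical layout)
    key_fields -- ordered field names assumed to form the key
    """
    record_counts = Counter(tuple(sorted(r.items())) for r in records)
    key_part_counts = Counter()
    value_counts = {}
    for r in records:
        key = tuple(r.get(f) for f in key_fields)
        # Assumption: a "portion" of a key is any leading prefix of it.
        for i in range(1, len(key) + 1):
            key_part_counts[key[:i]] += 1
        for field, value in r.items():
            value_counts.setdefault(field, Counter())[value] += 1

    def repeated(counter):
        # Keep only entries that occur more than once.
        return {k: n for k, n in counter.items() if n > 1}

    values = {f: repeated(c) for f, c in value_counts.items()}
    return {
        "records": repeated(record_counts),
        "key_portions": repeated(key_part_counts),
        "values": {f: v for f, v in values.items() if v},
    }


# Hypothetical sample data set; the last two records are exact duplicates.
orders = [
    {"cust": "C1", "order": "O1", "city": "Boston"},
    {"cust": "C1", "order": "O2", "city": "Boston"},
    {"cust": "C2", "order": "O1", "city": "Austin"},
    {"cust": "C2", "order": "O1", "city": "Austin"},
]
report = profile_repetitions(orders, ("cust", "order"))
print(report["key_portions"])
```

The returned dictionary is the "information about at least one of the repetitions" that would be shown to a user; running the same function over each received data set gives one report per set.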
32 Claims
1. A computer-implemented method comprising
receiving two or more data sets,
each of the data sets containing data that can be interpreted as records each having data values for data fields,
each of the data sets containing at least some data that is related to data in at least one of the other data sets,
the data in different ones of the data sets being organized or expressed possibly differently,
each of the data sets being susceptible to a definition of a key for the records of the data set,
the data sets being characterized by repetitions of at least one of (a) records, (b) portions of keys, or (c) instances of values for data fields, and
providing to a user information about at least one of the repetitions.
24. A computer-implemented method comprising
receiving a data set containing data that can be interpreted as records each having data values for data fields,
the data set being characterized by any arbitrary number of repetitions of instances of values for at least one of the data fields, and
providing to a user information about at least one of the repetitions.
32. A medium bearing an integrated file of data records, a key for the records,
each of the records containing at least one data value for at least one data field,
the data records containing information that represents data of at least two data sets,
each of the data sets containing data that can be interpreted as records each having data values for data fields,
each of the data sets containing at least some data that is related to data in at least one of the other data sets,
the data in different ones of the data sets being organized or expressed possibly differently,
each of the data sets being susceptible to a definition of a key for the records of the data set,
the data sets being characterized by repetitions of at least one of (a) records, (b) portions of keys, or (c) instances of values for data fields,
the integrated file including information that identifies the repetitions.
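One possible reading of the integrated file of claim 32 can be sketched as follows. This is a hedged illustration under stated assumptions, not the structure defined in the specification: the function `integrate`, the row layout (`key`, `source`, `data`, `repetitions`), and the sample `customers` and `orders` data sets are hypothetical, and "information that identifies the repetitions" is modeled simply as a per-key occurrence count.

```python
from collections import Counter


def integrate(data_sets, key_of):
    """Merge several data sets into one list of keyed rows with repetition counts.

    data_sets -- dict mapping a data set name to its list of record dicts
    key_of    -- dict mapping a data set name to a function record -> key
    """
    rows = []
    for name, records in data_sets.items():
        for record in records:
            rows.append({"key": key_of[name](record), "source": name, "data": record})
    # Count how often each key occurs across all contributing data sets.
    counts = Counter(row["key"] for row in rows)
    for row in rows:
        row["repetitions"] = counts[row["key"]]
    return rows


# Hypothetical related data sets that express the customer id differently.
customers = [{"id": "C1", "name": "Ann"}, {"id": "C2", "name": "Bo"}]
orders = [{"cust": "C1", "order": "O1"}, {"cust": "C1", "order": "O2"}]

integrated = integrate(
    {"customers": customers, "orders": orders},
    {"customers": lambda r: r["id"], "orders": lambda r: r["cust"]},
)
for row in integrated:
    print(row["key"], row["source"], row["repetitions"])
```

Supplying a separate key function per data set is one way to accommodate data that is "organized or expressed possibly differently" while still producing a single key for the integrated records.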
Specification