MANAGING RECORD FORMAT INFORMATION
First Claim
1. A method for preparing data for processing in a data processing system based on format information in a data storage system, the method including:
receiving data that includes records that each have one or more values for respective fields over an input device or port; and
determining a target record format for processing the data in the data processing system, including
analyzing multiple records in the data according to multiple validation tests to determine whether the data matches one or more candidate record formats stored in the data storage system, each candidate record format specifying a format for each field of a group of one or more fields, and each validation test corresponding to at least one candidate record format stored in the data storage system, and
in response to receiving results of the validation tests, associating the target record format with the data based on at least one of:
a selected candidate record format for which at least a partial match was determined according to at least one validation test corresponding to the selected candidate record format,
a parsed record format generated by a parser selected according to a known data type associated with the data, and
a constructed record format generated from an analysis of characteristics of the data.
Abstract
Data is prepared for processing in a data processing system using format information. Data that includes records having values for fields is received over an input device or port, and a target record format for processing the data is determined. Multiple records are analyzed according to validation tests to determine whether the data matches candidate record formats. Each candidate record format specifies a format for each field, and each validation test corresponds to at least one candidate record format. In response to receiving results of the validation tests, the target record format is associated with the data based on at least one of: a candidate record format for which at least a partial match was determined according to at least one validation test, a parsed record format selected according to a data type associated with the data, and a constructed record format generated from an analysis of data characteristics.
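The flow summarized in the abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the names `RecordFormat`, `CANDIDATES`, `determine_target_format`, and `construct_format`, the hypothetical "orders" candidate format, the representation of records as lists of strings, the 0.8 threshold used to decide what counts as a partial match, and the order in which the three options are tried (the claims only require that the association be based on at least one of them).

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class RecordFormat:
    """Hypothetical record format: a name plus (field name, type) pairs."""
    name: str
    fields: tuple


def _matches_orders(values):
    # Hypothetical validation test for one candidate format: id, date, amount.
    return (
        len(values) == 3
        and values[0].isdigit()
        and re.fullmatch(r"\d{4}-\d{2}-\d{2}", values[1]) is not None
        and re.fullmatch(r"-?\d+\.\d+", values[2]) is not None
    )


# Stand-in for candidate formats kept in a data storage system,
# each paired with the validation test that corresponds to it.
CANDIDATES = [
    (
        RecordFormat(
            "orders",
            (("id", "integer"), ("date", "date"), ("amount", "decimal")),
        ),
        _matches_orders,
    ),
]


def infer_type(column):
    # Very rough per-field type guess from the observed values.
    if all(re.fullmatch(r"-?\d+", v) for v in column):
        return "integer"
    if all(re.fullmatch(r"-?\d+\.\d+", v) for v in column):
        return "decimal"
    return "string"


def construct_format(rows):
    # Build a format from data characteristics (assumes equal-width rows).
    columns = list(zip(*rows))
    fields = tuple(
        (f"field_{i}", infer_type(col)) for i, col in enumerate(columns)
    )
    return RecordFormat("constructed", fields)


def determine_target_format(rows, known_type=None, parsers=None, threshold=0.8):
    if not rows:
        raise ValueError("at least one record is required")

    # 1. Run each validation test over multiple records and score the match.
    best, best_score = None, 0.0
    for fmt, test in CANDIDATES:
        score = sum(1 for r in rows if test(r)) / len(rows)
        if score > best_score:
            best, best_score = fmt, score
    if best is not None and best_score >= threshold:
        return best  # candidate with at least a partial match

    # 2. Otherwise use a parser selected by a known data type, if any.
    if parsers and known_type in parsers:
        return parsers[known_type](rows)

    # 3. Otherwise construct a format from the data itself.
    return construct_format(rows)
```

Calling `determine_target_format([["1", "2020-01-02", "9.50"]])` returns the stored "orders" candidate in this sketch, while rows that match no candidate and have no known data type fall through to `construct_format`. Scoring each candidate by the fraction of records that pass its test is one simple way to model a partial match; other scoring rules would fit the same structure.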
16 Claims
1. A method for preparing data for processing in a data processing system based on format information in a data storage system, the method including:
receiving data that includes records that each have one or more values for respective fields over an input device or port; and
determining a target record format for processing the data in the data processing system, including
analyzing multiple records in the data according to multiple validation tests to determine whether the data matches one or more candidate record formats stored in the data storage system, each candidate record format specifying a format for each field of a group of one or more fields, and each validation test corresponding to at least one candidate record format stored in the data storage system, and
in response to receiving results of the validation tests, associating the target record format with the data based on at least one of:
a selected candidate record format for which at least a partial match was determined according to at least one validation test corresponding to the selected candidate record format,
a parsed record format generated by a parser selected according to a known data type associated with the data, and
a constructed record format generated from an analysis of characteristics of the data.
15. A system for preparing data for processing in a data processing system based on format information in a data storage system, the system including:
means for receiving data that includes records that each have one or more values for respective fields over an input device or port; and
means for determining a target record format for processing the data in the data processing system, including
analyzing multiple records in the data according to multiple validation tests to determine whether the data matches one or more candidate record formats stored in the data storage system, each candidate record format specifying a format for each field of a group of one or more fields, and each validation test corresponding to at least one candidate record format stored in the data storage system, and
in response to receiving results of the validation tests, associating the target record format with the data based on at least one of:
a selected candidate record format for which at least a partial match was determined according to at least one validation test corresponding to the selected candidate record format,
a parsed record format generated by a parser selected according to a known data type associated with the data, and
a constructed record format generated from an analysis of characteristics of the data.
16. A computer-readable medium storing a computer program for preparing data for processing in a data processing system based on format information in a data storage system, the computer program including instructions for causing a computer to:
receive data that includes records that each have one or more values for respective fields over an input device or port; and
determine a target record format for processing the data in the data processing system, including
analyzing multiple records in the data according to multiple validation tests to determine whether the data matches one or more candidate record formats stored in the data storage system, each candidate record format specifying a format for each field of a group of one or more fields, and each validation test corresponding to at least one candidate record format stored in the data storage system, and
in response to receiving results of the validation tests, associating the target record format with the data based on at least one of:
a selected candidate record format for which at least a partial match was determined according to at least one validation test corresponding to the selected candidate record format,
a parsed record format generated by a parser selected according to a known data type associated with the data, and
a constructed record format generated from an analysis of characteristics of the data.
Specification