×

USING A DATA MINING ALGORITHM TO GENERATE FORMAT RULES USED TO VALIDATE DATA SETS

  • US 20090006283A1
  • Filed: 06/27/2007
  • Published: 01/01/2009
  • Est. Priority Date: 06/27/2007
  • Status: Active Grant
First Claim
Patent Images

1. An article of manufacture having code for causing operations to be performed, the operations comprising:

  • processing a data set having a plurality of columns and records providing data for each of the columns;

    receiving selection of at least one format column for which format rules are to be generated;

    receiving selection of at least one predictor column;

    generating a format mask column for each selected format column;

    for records in the data set, converting a value in the at least one format column to a format mask representing a format of the value in the format column and storing the format mask in the format mask column in the record for which the format mask was generated; and

    processing the at least one predictor column and the at least one format mask column to generate at least one format rule, wherein each format rule specifies a format mask associated with at least one condition in the at least one predictor column.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×