CLASSIFICATION OF DATA
First Claim
1. A method of reducing false positives in the classification of data, wherein the data can be categorized into columns, comprising:
- creating an assertion table or assessing an existing assertion table for the data whereby the data is placed into categories and each category is assigned one or more classifications;
setting a positive and/or negative assertion ratio for each category;
determining the accuracy of each classification by assessing a percentage of the data in each category to see if the data is correctly identified by the classification;
if the positive assertion ratio is reached, maintaining the classification for each category of data;
if the negative assertion ratio is reached, de-asserting the classification.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and system for reducing false positives in the classification of data is provided, wherein the data can be categorized into fields, including creating an assertion table or assessing an existing assertion table for the data whereby the data is placed into categories and each category is assigned one or more classifications, setting a positive and/or negative assertion ratio for each category, determining the accuracy of each classification by assessing a percentage of the data in each category to see if the data is correctly identified by the classification, if the positive assertion ratio is reached, maintaining the classification for each category of data, if the negative assertion ratio is reached, de-asserting the classification.
18 Citations
20 Claims
-
1. A method of reducing false positives in the classification of data, wherein the data can be categorized into columns, comprising:
-
creating an assertion table or assessing an existing assertion table for the data whereby the data is placed into categories and each category is assigned one or more classifications; setting a positive and/or negative assertion ratio for each category; determining the accuracy of each classification by assessing a percentage of the data in each category to see if the data is correctly identified by the classification; if the positive assertion ratio is reached, maintaining the classification for each category of data; if the negative assertion ratio is reached, de-asserting the classification. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A system for reducing false positives in the classification of data comprising:
-
at least one database having one or more files of data, wherein the data can be categorized into fields; a user interface linked to the database; and a validation manager for creating an assertion table or assessing an existing assertion table for the data whereby the data is placed into categories and each category is assigned one or more classifications; setting a positive and/or negative assertion ratio for each category; determining the accuracy of each classification by assessing a percentage of the data in each category to see if the data is correctly identified by the classification; if the positive assertion ratio is reached, maintaining the classification for each category of data; if the negative assertion ratio is reached, de-asserting the classification. - View Dependent Claims (14, 15, 16)
-
-
17. A computer program product encoded in a computer readable medium for instructing a system to reducing false positives in the classification of data, wherein the data can be categorized into fields, the program code configured to cause the computer to perform the method comprising:
-
creating an assertion table or assessing an existing assertion table for the data whereby the data is placed into categories and each category is assigned one or more classifications; setting a positive and/or negative assertion ratio for each category; determining the accuracy of each classification by assessing a percentage of the data in each category to see if the data is correctly identified by the classification; if the positive assertion ratio is reached, maintaining the classification for each category of data; if the negative assertion ratio is reached, de-asserting the classification. - View Dependent Claims (18, 19, 20)
-
Specification