METHOD FOR REVEALING A TYPE OF DATA
First Claim
Patent Images
1. A method for revealing a type of data, the method comprising the steps of:
- receiving at least one training data set of objects to create a signature for that data type;
creating the signature, by;
(i) scanning all objects in each training data set, to create a list of unique N-Grams of any size and their statistics; and
(ii) adding each N-Gram in the list in case it appears in a minimum of predefined threshold objects to a repository,wherein each signature includes at least one N-Gram, wherein size of the at least one N-Gram is variable having a minimum limitation,wherein a temp maximum value is specified for the size of the at least one N-Gram, andwherein the temp maximum value for the size of the at least one N-Gram is increased after the creating of the signature when at least one identified N-Gram in the creation process reach the temp maximum value, andreceiving data of unknown type for examination;
scanning said received data of unknown type to score each N-Gram in the signature of each object type when it is found in the scanned data; and
determining the type of the unknown type data according to the signature of the object type that accumulated highest score by the N-Grams of the signature.
0 Assignments
0 Petitions
Accused Products
Abstract
A method for revealing a type of data is provided herein. The method is comprising
- the following steps (i) receiving at least one training data set of objects to create a signature for that data type; (ii) creating the signature, by: (a) scanning all objects in
- each training data set, to create a list of unique N-Grams of any size and their statistics; and (b) adding each N-Gram in the list in case it appears in a minimum of predefined threshold objects to a repository; (iii) receiving data of unknown type for examination; (iv) scanning said received data of unknown type to score each N-Gram
- in the signature of each object type when it is found in the scanned data; and (v) determining the type of the unknown type data according to the signature of the object type that accumulated highest score by the N-Grams of the signature.
- the following steps (i) receiving at least one training data set of objects to create a signature for that data type; (ii) creating the signature, by: (a) scanning all objects in
11 Citations
17 Claims
-
1. A method for revealing a type of data, the method comprising the steps of:
-
receiving at least one training data set of objects to create a signature for that data type; creating the signature, by;
(i) scanning all objects in each training data set, to create a list of unique N-Grams of any size and their statistics; and
(ii) adding each N-Gram in the list in case it appears in a minimum of predefined threshold objects to a repository,wherein each signature includes at least one N-Gram, wherein size of the at least one N-Gram is variable having a minimum limitation, wherein a temp maximum value is specified for the size of the at least one N-Gram, and wherein the temp maximum value for the size of the at least one N-Gram is increased after the creating of the signature when at least one identified N-Gram in the creation process reach the temp maximum value, and receiving data of unknown type for examination; scanning said received data of unknown type to score each N-Gram in the signature of each object type when it is found in the scanned data; and determining the type of the unknown type data according to the signature of the object type that accumulated highest score by the N-Grams of the signature. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
Specification