Methods for categorizing input data
First Claim
Patent Images
1. A method of categorizing random input data, comprising:
- a. analyzing at least a portion of the input data to derive a statistical test result for said portion which is indicative of a degree of randomness of the input data; and
b. categorizing said input data as either compressed or encrypted based on said statistical test result.
5 Assignments
0 Petitions
Accused Products
Abstract
Methods are provided for categorizing input data into a selected data type category. Exemplary embodiments are directed to the categorization of binary input data, for example random input data, as either compressed or encrypted based on statistical analysis. To this end, at least a portion of the input data is analyzed to derive a statistical test result for the portion that is indicative of a degree of randomness of the data. The data is then categorized as either compressed or encrypted based on the statistical test result.
39 Citations
25 Claims
-
1. A method of categorizing random input data, comprising:
-
a. analyzing at least a portion of the input data to derive a statistical test result for said portion which is indicative of a degree of randomness of the input data; and
b. categorizing said input data as either compressed or encrypted based on said statistical test result. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A data categorization method, comprising:
-
a. establishing a data categorization schema;
b. receiving binary input data of an unknown data type;
c. conducting a plurality of different statistical tests on the input data, each of which analyzes one or more distinct characteristics thereof to derive an associated statistical test result which is indicative of a degree of randomness for the input data; and
d. categorizing the input data into a selected data type category based on said statistical test results and said data categorization schema. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A method of categorizing binary input data as either compressed or encrypted, comprising:
-
a. predetermining an associated χ
2 threshold range for each of a plurality of statistical tests which may be employed to analyze test data, such that, for each statistical test a given χ
2 test result is;
(1) indicative of data compression if it falls within the test'"'"'s associated χ
2 threshold range, or(2) indicative of data encryption if it falls outside the associated χ
2 threshold range; and
b. pre-establishing a schema for categorizing test data as either compressed or encrypted based on said threshold ranges and on statistical test results to be obtained when conducted against test data;
c. receiving input data for testing, wherein said input data is one of;
(1) a data file; and
(2) a contiguous data stream;
d. conducting each said statistical test on an associated portion of the input data to derive a set of actual χ
2 test results; and
e. categorizing said input data as either compressed or encrypted based on whether said actual χ
2 test results satisfy or fail to satisfy said schema. - View Dependent Claims (22, 23, 24, 25)
-
Specification