System and methods for analysis of data
First Claim
Patent Images
1. A computer method for analyzing two or more data sets using algorithmic components, comprising the steps of:
- generating a first sample data set from a hidden stochastic source including the steps of;
generating an independent stream ω
0 from flat white noise;
reading a first symbol σ
1 from a first symbol stream s1 and a second symbol σ
2 from the independent stream ω
0, wherein each of the first symbol σ
1 and the second symbol σ
2 are one selected from the group comprising;
a letter, a number, a digit, a character, a sign, a figure, a mark, an icon, an image, a vector, a matrix, and a polynomia;
writing the first symbol stream s1 to an output inverted stream s′
if the first symbol σ
1 is equal to the second symbol σ
2;
generating a second sample data set from an inverse hidden stochastic source including the steps of;
generating a number of |Σ
|−
1 independent copies of the first symbol stream s1, wherein |Σ
| is a binary alphabet size consisting of the numbers 0 and 1;
reading a current symbol σ
i from an inverse symbol stream si, wherein i=1, . . . , |Σ
|−
1;
writing
2 Assignments
0 Petitions
Accused Products
Abstract
Data processing including a universal metric to quantify and estimate the similarity and dissimilarity between data sets. Data streams are perfectly annihilated by a correct realization of their anti-streams. Any deviation of the collision product from a baseline, for example flat white noise, quantifies statistical dissimilarity. The invention relates generally to data mining. More specifically, the invention relates to the analysis of data using a universal metric to quantify and estimate the similarity and dissimilarity between sets of data.
8 Citations
5 Claims
-
1. A computer method for analyzing two or more data sets using algorithmic components, comprising the steps of:
-
generating a first sample data set from a hidden stochastic source including the steps of; generating an independent stream ω
0 from flat white noise;reading a first symbol σ
1 from a first symbol stream s1 and a second symbol σ
2 from the independent stream ω
0, wherein each of the first symbol σ
1 and the second symbol σ
2 are one selected from the group comprising;
a letter, a number, a digit, a character, a sign, a figure, a mark, an icon, an image, a vector, a matrix, and a polynomia;writing the first symbol stream s1 to an output inverted stream s′
if the first symbol σ
1 is equal to the second symbol σ
2;generating a second sample data set from an inverse hidden stochastic source including the steps of; generating a number of |Σ
|−
1 independent copies of the first symbol stream s1, wherein |Σ
| is a binary alphabet size consisting of the numbers 0 and 1;reading a current symbol σ
i from an inverse symbol stream si, wherein i=1, . . . , |Σ
|−
1;writing - View Dependent Claims (2, 3, 4, 5)
-
Specification