Method for searching content particularly for extracts common to two computer files
First Claim
1. A method implemented by computer means of searching content, for at least one extract common to a first file and to a second file, in the form of binary data, being searched for, wherein the method comprises:
- a prior preparation of the first file at least, comprising the following steps;
a) segmenting the first file into a succession of data packets, of chosen size, and identifying addresses of packets in said file,b) associating with the address of each packet a digital signature defining a fuzzy logic state from among at least three states;
“
true”
, “
false” and
“
undetermined”
, said signature resulting from a combinatorial calculation on data emanating from said file,and the search for common extract, itself, comprising the following steps;
c) comparing the fuzzy logic states associated with each packet address of the first file, with fuzzy logic states determined on the basis of data emanating from the second file,d) eliminating from said search for common extract, pairs of respective addresses of the first and second files whose respective logic states are “
true” and
“
false”
or “
false” and
“
true”
, and preserving the other pairs of addresses identifying data packets liable to comprise said common extract.
1 Assignment
0 Petitions
Accused Products
Abstract
The present invention relates to searching content, particularly for at least one extract common to a first data file and a second data file. The method comprises a preliminary step of preparing at least the first file by (a) dividing the first file into a series of data packets having a predetermined size, and identifying packet addresses in said file, (b) combining each packet address with a digital signature that defines one of three fuzzy logic states, namely true, false and indeterminate, and is the result of a combinatorial computation on data from said file; whereafter said method comprises performing an actual search for a common extract by (c) comparing the fuzzy logic states combined with each packet address of the first file with fuzzy logic states determined on the basis of data from the second file, and (d) removing from said common extract search the respective address pairs from the first and second files that have the respective logic states true and false or false and true, and retaining the other address pairs that identify data packets that may comprise said common extract.
-
Citations
30 Claims
-
1. A method implemented by computer means of searching content, for at least one extract common to a first file and to a second file, in the form of binary data, being searched for, wherein the method comprises:
-
a prior preparation of the first file at least, comprising the following steps; a) segmenting the first file into a succession of data packets, of chosen size, and identifying addresses of packets in said file, b) associating with the address of each packet a digital signature defining a fuzzy logic state from among at least three states;
“
true”
, “
false” and
“
undetermined”
, said signature resulting from a combinatorial calculation on data emanating from said file,and the search for common extract, itself, comprising the following steps; c) comparing the fuzzy logic states associated with each packet address of the first file, with fuzzy logic states determined on the basis of data emanating from the second file, d) eliminating from said search for common extract, pairs of respective addresses of the first and second files whose respective logic states are “
true” and
“
false”
or “
false” and
“
true”
, and preserving the other pairs of addresses identifying data packets liable to comprise said common extract. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25)
-
-
26. A computer program product, stored in a memory of a central unit of a computer or on a removable medium to cooperate with a reader of said central unit, the program comprising instructions for conducting steps for searching at least one extract common to a first file and to a second file, in the form of binary data, said steps comprising:
-
a prior preparation of the first file at least, including the following steps; a) segmenting the first file into a succession of data packets, of chosen size, and identifying addresses of packets in said file, b) associating with the address of each packet a digital signature defining a fuzzy logic state from among at least three states;
“
true”
, “
false” and
“
undetermined”
, said signature resulting from a combinatorial calculation on data emanating from said file,and the search for common extract, itself, including the following steps; c) comparing the fuzzy logic states associated with each packet address of the first file, with fuzzy logic states determined on the basis of data emanating from the second file, d) eliminating from said search for common extract, pairs of respective addresses of the first and second files whose respective logic states are “
true” and
“
false”
or “
false” and
“
true”
, and preserving the other pairs of addresses identifying data packets liable to comprise said common extract. - View Dependent Claims (27, 28, 29)
-
-
30. A data structure stored on a computer-readable storage medium used for a search of at least one extract common to a first and a second file, the data structure being representative of the first file, the data structure being obtained by the implementation of the following steps:
-
a) segmenting the first file into a succession of data packets of chosen size, and identifying addresses of packets in said file, b) associating with the address of each packet a digital signature defining a fuzzy logic state from among at least three states;
“
true”
, “
false” and
“
undetermined”
, said signature resulting from a combinatorial calculation on data emanating from said file,the data structure thus comprising a succession of addresses identifying addresses of the first file, a fuzzy logic state from among the states;
“
true”
, “
false” and
“
undetermined”
, being assigned to each of said addresses of the data structure,c) comparing the fuzzy logic states associated with each packet address of the first file, with fuzzy logic states determined on the basis of data emanating from the second file, d) eliminating from said search for common extract, pairs of respective addresses of the first and second files whose respective logic states are “
true” and
“
false”
or “
false” and
“
true”
, and preserving the other pairs of addresses identifying data packets liable to comprise said common extract.
-
Specification