Generating and matching hashes of multimedia content
First Claim
1. A method to identify multimedia content, the method comprising:
- receiving an input block of hash words, the input block representing at least a part of an information signal;
interrogating a look-up table with a selected hash word from the input block to obtain a found hash word;
comparing, by a computer, the input block and a stored block of hash words in which the found hash word has the same position as the selected hash word;
selectively identifying the stored block of hash words as a matching reference signal based on the outcome of the comparing;
wherein the selected hash word comprises a plurality of reliable hash bits and unreliable hash bits; and
the interrogating comprises interrogating the look-up table with reliable bits of the selected hash word.
7 Assignments
0 Petitions
Accused Products
Abstract
Hashes are short summaries or signatures of data files which can be used to identify the file. The disclosed method generates robust hashes for multimedia content, for example, audio clips. The audio clip is divided into successive (preferably overlapping) frames. For each frame, the frequency spectrum is divided into bands. A robust property of each band (e.g. energy) is computed and represented by a respective hash bit. An audio clip is thus represented by a concatenation of binary hash words, one for each frame. To identify a possibly compressed audio signal, a block of hash words derived therefrom is matched by a computer with a large database. Such matching strategies are also disclosed. In an advantageous embodiment, the extraction process also provides information as to which of the hash bits are the least reliable. Flipping these bits considerably improves the speed and performance of the matching process.
-
Citations
9 Claims
-
1. A method to identify multimedia content, the method comprising:
-
receiving an input block of hash words, the input block representing at least a part of an information signal; interrogating a look-up table with a selected hash word from the input block to obtain a found hash word; comparing, by a computer, the input block and a stored block of hash words in which the found hash word has the same position as the selected hash word; selectively identifying the stored block of hash words as a matching reference signal based on the outcome of the comparing; wherein the selected hash word comprises a plurality of reliable hash bits and unreliable hash bits; and the interrogating comprises interrogating the look-up table with reliable bits of the selected hash word. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A method to identify subject content, the method comprising:
-
receiving a hash value associated with an information signal, the hash value comprising one or more reliable hash bits and one or more unreliable hash bits; interrogating a look up table with the one or more reliable bits to determine one or more matching stored hash values; for each of the one or more matching stored hash values, calculating, by a computer, a bit error rate, the bit error rate representing a relationship between the one or more bits of the hash value and the corresponding bits of a matching stored hash value from the one or more matching stored hash values; selecting a matching stored hash value from the one or more matching stored hash values, for which the bit error rate is minimal; and identifying the matching stored hash value, for which the bit error rate is minimal, as the matching stored hash value associated with the subject content. - View Dependent Claims (8, 9)
-
Specification