Audio fingerprinting
Abstract
A system and methods for the creation, management, and distribution of media entity fingerprints are provided. In connection with a system that convergently merges perceptual and digital signal processing analysis of media entities for purposes of classifying the media entities, various means are provided to a user for automatically processing fingerprints for media entities for distribution to participating users. Techniques for providing efficient calculation and distribution of fingerprints for use in satisfying copyright regulations and in facilitating the association of metadata to media entities are included. In an illustrative implementation, the fingerprints may be generated and stored, allowing for persistence of media from experience to experience.
17 Claims
1. A method to calculate a fingerprint for media entities, comprising the steps of:
reading a predefined amount of data from an input media entity data file, the predefined amount of data corresponding to a specified position in said media entity data file;
windowing said predefined amount of data into a plurality of sequential chunks;
for each chunk of said plurality of sequential chunks, calculating a set of psycho-acoustic spectral coefficients;
preserving a set of energetic coefficients of the set of psycho-acoustic spectral coefficients according to at least one pre-defined criterion;
calculating the inverse Discrete Fourier Transform (DFT) to generate an estimate of the salient coefficients of the set of most energetic coefficients; and
storing the results of the DFT for the plurality of sequential chunks into a matrix F, wherein a first axis of said matrix F corresponds to a slice of time of said media entities and a second axis of said matrix F corresponds to a frequency band of the psycho-acoustic frequency scale.
Dependent claims: 2, 3, 4, 5, 6, 7
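The steps of claim 1 can be read as a small signal-processing pipeline. The following Python sketch is one possible interpretation, not the patented implementation. Several choices are assumptions that the claim does not specify: the Hann window, the mel scale standing in for the "psycho-acoustic frequency scale", the top-k rule standing in for the "pre-defined criterion", and all function and parameter names (`fingerprint`, `n_bands`, `keep`, and so on), which are hypothetical.

```python
import cmath
import math


def dft(x, inverse=False):
    """Naive O(n^2) DFT; inverse=True applies the 1/n-scaled inverse transform."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = sum(x[t] * cmath.exp(sign * 2j * math.pi * k * t / n) for t in range(n))
        out.append(acc / n if inverse else acc)
    return out


def hz_to_mel(f):
    """Mel scale, used here as an assumed psycho-acoustic frequency scale."""
    return 2595.0 * math.log10(1.0 + f / 700.0)


def band_energies(chunk, sample_rate, n_bands):
    """Pool the power spectrum of one chunk into n_bands equal-width mel bands."""
    n = len(chunk)
    # Hann window: an assumption, the claim only says "windowing".
    windowed = [s * (0.5 - 0.5 * math.cos(2 * math.pi * i / (n - 1)))
                for i, s in enumerate(chunk)]
    spectrum = dft(windowed)
    top_mel = hz_to_mel(sample_rate / 2)
    bands = [0.0] * n_bands
    for k in range(n // 2 + 1):
        freq = k * sample_rate / n
        b = min(int(hz_to_mel(freq) / top_mel * n_bands), n_bands - 1)
        bands[b] += abs(spectrum[k]) ** 2
    return bands


def keep_most_energetic(bands, keep):
    """Zero every band except the `keep` most energetic ones (assumed criterion)."""
    order = sorted(range(len(bands)), key=lambda i: bands[i], reverse=True)
    strongest = set(order[:keep])
    return [e if i in strongest else 0.0 for i, e in enumerate(bands)]


def fingerprint(samples, sample_rate, start, length, chunk_size, n_bands=8, keep=4):
    """Build matrix F: one row per time slice, one column per frequency band."""
    segment = samples[start:start + length]      # predefined amount at a position
    F = []
    for off in range(0, len(segment) - chunk_size + 1, chunk_size):
        bands = band_energies(segment[off:off + chunk_size], sample_rate, n_bands)
        kept = keep_most_energetic(bands, keep)
        # Real part of the inverse DFT of the retained coefficients.
        F.append([c.real for c in dft(kept, inverse=True)])
    return F
```

For example, a 256-sample segment split into 64-sample chunks with the default 8 bands yields a matrix F with 4 rows (time slices) and 8 columns (bands). The naive `dft` keeps the sketch dependency-free; a real system would presumably use an FFT.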
8. A method for identifying an unknown media entity by employing media entity fingerprints of a plurality of media entities, comprising the steps of:
calculating a fingerprint for at least one media entity of said plurality of media entities, including:
reading a predefined amount of data from said at least one media entity, the predefined amount of data corresponding to a specified position in said at least one media entity;
windowing said predefined amount of data into a plurality of sequential chunks;
for each chunk of said plurality of sequential chunks, calculating a set of psycho-acoustic spectral coefficients;
preserving a set of energetic coefficients of the set of psycho-acoustic spectral coefficients according to at least one pre-defined criterion;
calculating the inverse Discrete Fourier Transform (DFT) to generate an estimate of the salient coefficients of the set of most energetic coefficients;
storing the results of the DFT for the plurality of sequential chunks into a matrix F, wherein a first axis of said matrix F corresponds to a slice of time of said media entities and a second axis of said matrix F corresponds to a frequency band of the psycho-acoustic frequency scale;
based upon the calculating of the fingerprint of the at least one media entity, obtaining a sequence having length L of n random bits representing said calculated fingerprint;
obtaining a sequence having a length L of N random bits of said unknown media entity for identification;
comparing said n bits with said N bits; and
evaluating the results of said comparing to determine an estimate of similarity.
Dependent claims: 9, 10, 11
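The comparing and evaluating steps of claim 8 can be illustrated with a bitwise comparison. The sketch below assumes both fingerprints have already been reduced to bit sequences of the same length L; how those bits are derived from matrix F is not stated in the claim and is left out. The fraction of matching positions (one minus the normalized Hamming distance) is an assumed similarity estimate, and `bit_similarity`, `best_match`, and `catalog` are hypothetical names.

```python
def bit_similarity(known_bits, unknown_bits):
    """Fraction of positions at which two equal-length bit sequences agree."""
    if len(known_bits) != len(unknown_bits):
        raise ValueError("bit sequences must have the same length L")
    if not known_bits:
        raise ValueError("bit sequences must not be empty")
    matches = sum(1 for a, b in zip(known_bits, unknown_bits) if a == b)
    return matches / len(known_bits)


def best_match(unknown_bits, catalog):
    """Return (name, similarity) for the catalog entry closest to unknown_bits.

    `catalog` maps a media entity name to its stored fingerprint bits.
    """
    scored = ((name, bit_similarity(bits, unknown_bits))
              for name, bits in catalog.items())
    return max(scored, key=lambda pair: pair[1])
```

A caller would typically accept the best match only when its similarity exceeds some threshold chosen for the application; the claim does not state a value.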
12. A system for calculating a fingerprint for media entities, comprising:
means for reading a predefined amount of data from an input media entity data file, the predefined amount of data corresponding to a specified position in said media entity data file;
means for windowing said predefined amount of data into a plurality of sequential chunks;
means for calculating a set of psycho-acoustic spectral coefficients for each chunk of said plurality of sequential chunks;
means for preserving a set of energetic coefficients of the set of psycho-acoustic spectral coefficients according to at least one pre-defined criterion;
means for calculating the inverse Discrete Fourier Transform (DFT) to generate an estimate of the salient coefficients of the set of most energetic coefficients; and
means for storing the results of the DFT for the plurality of sequential chunks into a matrix F, wherein a first axis of said matrix F corresponds to a slice of time of said media entities and a second axis of said matrix F corresponds to a frequency band of the psycho-acoustic frequency scale.
Dependent claims: 13, 14, 15, 16, 17
Specification