Indexing media content library using audio track fingerprinting
Abstract
A computer-implemented video classification system includes a storage device configured to ingest and store video files having respective audio tracks. Processors in the system are configured with instructions to perform a perceptual hash algorithm on each of the audio tracks to generate one or more audio fingerprints for each audio track. The audio fingerprints are associated with the respective video files. The video files are indexed based upon the audio fingerprints. Audio tracks from unknown video files are hashed and compared to audio fingerprints of known video files in a repository to identify the unknown video files.
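The hash-then-index flow described in the abstract can be illustrated with a minimal sketch. The text above does not say which perceptual hash algorithm is used, so the energy-difference hash below is a stand-in chosen for brevity, not the patented method. The names `audio_fingerprint`, `build_index`, and `synthetic_track`, the frame size, and the file names are all hypothetical, and the audio tracks are assumed to be already decoded into lists of float samples.

```python
import math
from collections import defaultdict


def audio_fingerprint(samples, frame_size=256):
    """Toy perceptual hash (an assumption, not the patent's algorithm).

    Emits one bit per pair of adjacent frames: 1 if the frame energy rises,
    0 otherwise. Because only the direction of change is kept, uniformly
    scaling the volume leaves the fingerprint unchanged.
    """
    energies = [
        sum(s * s for s in samples[i:i + frame_size])
        for i in range(0, len(samples) - frame_size + 1, frame_size)
    ]
    return tuple(int(b > a) for a, b in zip(energies, energies[1:]))


def build_index(library):
    """Map each fingerprint to the video files whose audio track produced it."""
    index = defaultdict(list)
    for name, samples in library.items():
        index[audio_fingerprint(samples)].append(name)
    return index


def synthetic_track(envelope_hz, n=4096, rate=8000):
    """Hypothetical test signal: a 440 Hz tone with a slow loudness envelope."""
    return [
        (1 + 0.5 * math.sin(2 * math.pi * envelope_hz * t / rate))
        * math.sin(2 * math.pi * 440 * t / rate)
        for t in range(n)
    ]


# Hypothetical library: two encodings of the same audio plus a different track.
library = {
    "movie_a.mp4": synthetic_track(3),
    "movie_a_quiet.mp4": [0.5 * s for s in synthetic_track(3)],
    "movie_b.mp4": synthetic_track(7),
}
index = build_index(library)
```

In this sketch the quieter encoding lands in the same index bucket as the original, which is the property that makes a perceptual hash more useful than a cryptographic one for this purpose. An exact-match dictionary is the simplest possible index; tolerant matching would need a comparison step on top of it.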
13 Claims
1. A computer-implemented video classification system comprising
a storage device configured to ingest and store one or more video files thereon, wherein the one or more video files comprise one or more respective audio tracks;
one or more processors configured with instructions to
perform a perceptual hash algorithm on each of the one or more audio tracks;
generate one or more audio fingerprints for the respective one or more audio tracks;
associate the one or more audio fingerprints with the respective one or more video files; and
index the one or more video files based upon the one or more respective audio fingerprints.
Dependent claims: 2, 3, 4, 5, 6, 7, 8.

9. A method implemented in a computer system for identifying video files in a media library, wherein one or more processors in the computer system is particularly configured to perform a number of processing steps comprising
performing an audio perceptual hash of a plurality of audio tracks associated with a plurality of video files in a media library to produce a plurality of audio fingerprints;
associating each audio fingerprint with a respective video file in the media library;
creating a media library of identifiable video files based upon the respective audio fingerprint;
comparing the plurality of audio fingerprints to determine whether any of the plurality of audio fingerprints match within a threshold matching rate; and
if there is a match between two or more audio fingerprints within the threshold matching rate, identifying the respective video files associated with the two or more audio fingerprints as different versions of the same video file.

10. The method of claim 9, wherein the threshold match level is within a range of less than about 95% and greater than about 50%.

11. A non-transitory computer readable storage medium containing instructions for instantiating a special purpose computer to index an unknown video file in a media library of known video files, wherein the instructions implement a computer process comprising the steps of
ingesting and storing one or more known video files on a storage device within a computer system, wherein the one or more known video files are associated with one or more hashed audio tracks with one or more associated audio fingerprints;
receiving one or more unknown video files;
performing audio perceptual hashing of one or more audio tracks associated with the one or more unknown video files to produce one or more associated audio fingerprints of the one or more unknown video files;
comparing the one or more audio fingerprints of the one or more unknown video files to the one or more audio fingerprints of the one or more known video files; and
determining a match level for each of the one or more audio fingerprints of the one or more unknown video files to each of the one or more audio fingerprints of the one or more known video files; and
identifying and indexing the one or more unknown video files based upon the match level to each known video file.

12. The non-transitory computer readable storage medium of claim 11, wherein the identifying and indexing step further comprises determining whether the match level is less than about 50%, is within a threshold matching range of greater than about 50% and less than about 95%, or is greater than about 95%.

13. The non-transitory computer readable storage medium of claim 12, wherein
if the match level is 0%, determining that the one or more unknown video files are not associated with one or more known video files and identifying and indexing the one or more unknown video files as new video files;
if the match level is within the threshold matching range of greater than about 50% to less than about 95%, determining the one or more unknown video files are associated with one or more known video files and identifying and indexing the one or more unknown video files as new versions of the one or more known video files; and
if the match level is greater than about 95%, determining the one or more unknown video files are associated with the one or more known video files and identifying and indexing the one or more unknown video files as copies of the one or more known video files.

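The three-way decision recited in the last two claims can be sketched as follows. The claims do not define how a match level is computed, so this example assumes fingerprints are equal-length bit sequences and takes the match level to be the fraction of agreeing bits. The names `match_level`, `classify`, and `identify` are hypothetical. Two further assumptions are made for illustration only: any level at or below the lower bound is treated as a new video (the final claim as printed addresses only 0%), and the "about" in the claim thresholds is reduced to strict comparisons against 0.50 and 0.95.

```python
from enum import Enum


class Classification(Enum):
    NEW_VIDEO = "new video"
    NEW_VERSION = "new version of a known video"
    COPY = "copy of a known video"


def match_level(fp_a, fp_b):
    """Fraction of positions at which two equal-length bit fingerprints agree.

    This similarity measure is an assumption; the claims leave it unspecified.
    """
    if len(fp_a) != len(fp_b) or not fp_a:
        raise ValueError("fingerprints must be non-empty and of equal length")
    return sum(a == b for a, b in zip(fp_a, fp_b)) / len(fp_a)


def classify(level, low=0.50, high=0.95):
    """Apply the claimed ranges; boundary handling here is an assumption."""
    if level > high:
        return Classification.COPY
    if level > low:
        return Classification.NEW_VERSION
    return Classification.NEW_VIDEO


def identify(unknown_fp, known):
    """Compare an unknown fingerprint with every known one; keep the best match.

    `known` maps a video file name to its fingerprint. Returns a tuple of
    (best matching name or None, match level, classification).
    """
    best_name, best_level = max(
        ((name, match_level(unknown_fp, fp)) for name, fp in known.items()),
        key=lambda pair: pair[1],
        default=(None, 0.0),
    )
    return best_name, best_level, classify(best_level)


# Hypothetical repository holding one known 20-bit fingerprint.
known = {"film.mp4": (1, 0) * 10}

exact = (1, 0) * 10                  # all 20 bits agree
edited = (0, 1) * 2 + (1, 0) * 8     # 16 of 20 bits agree
unrelated = (0, 1) * 10              # no bits agree
```

Calling `identify(edited, known)` yields a match level of 0.8 against `film.mp4`, which falls in the middle range and is therefore labeled a new version, while `exact` is labeled a copy and `unrelated` a new video.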
Specification