Method and apparatus for detecting near duplicate videos using perceptual video signatures
First Claim
1. A method of identifying a video signal comprising the steps of:
receiving a first video signal at a processing device from an input device operably connected to the processing device;
executing the following steps on a processing device;
extracting at least one perceptual feature from a plurality of frames of the first video signal;
assigning a weighting value to each perceptual feature;
selecting at least a portion of the perceptual features according to the weighting value;
extracting at least one additional feature from the plurality of frames of the first video signal;
creating a digital fingerprint from the selected perceptual features and from the at least one additional feature;
creating a first digital signature from a plurality of the digital fingerprints;
storing the first digital signature which identifies the video signal according to the sorted perceptual features in a database;
assigning a cost value to a plurality of edit operations;
comparing the first digital signature to a second digital signature to identify each edit operation required to transform the first digital signature to the second digital signature;
adding a total cost from the cost value for each of the edit operations; and
comparing the total cost of the edit operations against a predetermined level to determine whether the first and the second digital signatures identify a near-duplicate video signal.
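The final four steps of claim 1 can be read as a weighted edit distance between two fingerprint sequences. The following is a minimal Python sketch under that reading, not the patented implementation. It assumes a signature is a list of hashable fingerprints, that the edit operations are insert, delete, and substitute, and that a total cost at or below the predetermined level indicates a near-duplicate. The cost constants and the names `video_edit_distance` and `is_near_duplicate` are hypothetical.

```python
from typing import Hashable, List, Sequence

# Hypothetical cost values for the three assumed edit operations.
INSERT_COST = 1.0
DELETE_COST = 1.0
SUBSTITUTE_COST = 1.5


def video_edit_distance(first: Sequence[Hashable],
                        second: Sequence[Hashable]) -> float:
    """Minimum total cost of the edits that transform `first` into `second`."""
    rows, cols = len(first) + 1, len(second) + 1
    # cost[i][j] is the cheapest way to turn first[:i] into second[:j].
    cost: List[List[float]] = [[0.0] * cols for _ in range(rows)]
    for i in range(1, rows):
        cost[i][0] = i * DELETE_COST
    for j in range(1, cols):
        cost[0][j] = j * INSERT_COST
    for i in range(1, rows):
        for j in range(1, cols):
            same = first[i - 1] == second[j - 1]
            cost[i][j] = min(
                cost[i - 1][j] + DELETE_COST,      # drop a fingerprint
                cost[i][j - 1] + INSERT_COST,      # add a fingerprint
                cost[i - 1][j - 1] + (0.0 if same else SUBSTITUTE_COST),
            )
    return cost[-1][-1]


def is_near_duplicate(first: Sequence[Hashable],
                      second: Sequence[Hashable],
                      max_cost: float) -> bool:
    """Assumed decision rule: near-duplicate when total cost <= max_cost."""
    return video_edit_distance(first, second) <= max_cost
```

With these illustrative costs, two signatures that differ in a single fingerprint have a distance of 1.5, so they count as near-duplicates for any `max_cost` of 1.5 or more.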
Abstract
Methods and apparatus for detection and identification of duplicate or near-duplicate videos using a perceptual video signature are disclosed. The disclosed apparatus and methods (i) extract perceptual video features, (ii) identify unique and distinguishing perceptual features to generate a perceptual video signature, (iii) compute a perceptual video similarity measure based on the video edit distance, and (iv) search and detect duplicate and near-duplicate videos. Also disclosed is a complete framework for detecting unauthorized copying of videos on the Internet using the perceptual video signature.
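Steps (i) and (ii) of the abstract can be pictured as ranking extracted features by an assigned weight and keeping only the strongest ones. The sketch below is one possible reading in Python, not the disclosed method. It assumes each feature arrives already extracted as a name, a quantized value, and a weight. The `Feature` type, the `top_k` parameter, the function names, and the feature labels used in the example are all hypothetical.

```python
from typing import List, NamedTuple, Tuple


class Feature(NamedTuple):
    name: str      # illustrative label, e.g. "motion" or "color"
    value: int     # quantized feature value (assumed precomputed)
    weight: float  # weighting value assigned to this feature


Fingerprint = Tuple[Tuple[str, int], ...]


def make_fingerprint(features: List[Feature], top_k: int = 2) -> Fingerprint:
    """Keep the top_k highest-weighted features, ordered by weight."""
    ranked = sorted(features, key=lambda f: f.weight, reverse=True)
    return tuple((f.name, f.value) for f in ranked[:top_k])


def make_signature(per_segment: List[List[Feature]],
                   top_k: int = 2) -> List[Fingerprint]:
    """A signature here is the ordered list of per-segment fingerprints."""
    return [make_fingerprint(features, top_k) for features in per_segment]


# Example: three candidate features, of which the two heaviest are kept.
segment = [
    Feature("color", 7, 0.2),
    Feature("motion", 3, 0.9),
    Feature("texture", 5, 0.5),
]
fingerprint = make_fingerprint(segment)
```

Because the fingerprints are plain tuples, they are hashable and can be compared for equality, which is what a sequence-level similarity measure such as an edit distance needs.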
18 Claims
1. A method of identifying a video signal comprising the steps of:
(the steps of claim 1 are recited in full under First Claim above)
Dependent claims: 2, 3, 4, 5, 6, 7, 9, 10
8. A method of identifying a video signal comprising the steps of:
receiving a video signal at a processing device from an input device operably connected to the processing device;
executing the following steps on a processing device;
extracting at least one perceptual feature from a plurality of frames of the video signal;
assigning a weighting value to each perceptual feature;
selecting at least a portion of the perceptual features according to the weighting value, wherein each perceptual feature is one of a motion and a color;
extracting at least one additional feature from the plurality of frames of the video signal, wherein each additional feature is one of a scene change and an object displayed in the video signal;
creating a digital fingerprint from the selected perceptual features and from the at least one additional feature;
storing the digital fingerprint which identifies the video signal according to the sorted perceptual features in a database;
segmenting the frames into a plurality of regions prior to extracting the perceptual feature;
selecting at least one region to include in the digital fingerprint according to a magnitude of motion energy in the region; and
including the magnitude and a direction of the motion energy and a centroid and a size of each selected region in the digital fingerprint, wherein the weighting value is the magnitude and the direction of the motion energy identified in each of the regions.
Dependent claims: 11, 12, 13
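The region-based steps of claim 8 (segmenting, selecting regions by motion energy, and recording magnitude, direction, centroid, and size) can be illustrated with a short Python sketch. This is an assumption-laden illustration, not the patented procedure. It assumes segmentation has already produced non-empty regions, that each region comes with a mean motion vector, and that "motion energy" can be approximated by the length of that vector. The `Region` type, `describe`, `select_regions`, and the `min_magnitude` cutoff are hypothetical.

```python
import math
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class Region:
    pixels: List[Tuple[int, int]]  # (x, y) coordinates in the region, non-empty
    motion: Tuple[float, float]    # mean motion vector (dx, dy), assumed given


def describe(region: Region) -> Dict[str, object]:
    """Compute the four quantities the claim places in the fingerprint."""
    dx, dy = region.motion
    count = len(region.pixels)
    return {
        "magnitude": math.hypot(dx, dy),   # stand-in for motion energy
        "direction": math.atan2(dy, dx),   # angle in radians
        "centroid": (
            sum(x for x, _ in region.pixels) / count,
            sum(y for _, y in region.pixels) / count,
        ),
        "size": count,                     # region size in pixels
    }


def select_regions(regions: List[Region],
                   min_magnitude: float) -> List[Dict[str, object]]:
    """Keep regions whose motion magnitude reaches the cutoff, strongest first."""
    described = [describe(region) for region in regions]
    kept = [d for d in described if d["magnitude"] >= min_magnitude]
    return sorted(kept, key=lambda d: d["magnitude"], reverse=True)


# Example: one strongly moving 4-pixel region and one nearly static pixel.
regions = [
    Region(pixels=[(0, 0), (2, 0), (0, 2), (2, 2)], motion=(3.0, 4.0)),
    Region(pixels=[(5, 5)], motion=(0.1, 0.0)),
]
selected = select_regions(regions, min_magnitude=1.0)
```

In this example only the first region survives selection, since its motion vector has length 5.0 while the second falls below the cutoff.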
14. A system for comparing a first video signal to a second video signal for the purpose of identifying near-duplicate videos comprising:
a processing device that receives the first video signal and that calculates a first perceptual digital signature of the first video signal, the first perceptual digital signature comprising a plurality of perceptual digital fingerprints; and
a database, operably connected to the processing device and storing a plurality of additional perceptual digital signatures, each additional perceptual digital signature comprising a plurality of perceptual digital fingerprints;
wherein the processing device;
divides the first perceptual digital signature into a plurality of segments for comparison to the additional perceptual digital signatures using a video edit distance;
compares the first perceptual digital signature to at least a portion of the additional perceptual digital signatures to identify near-duplicate videos; and
identifies each segment of the first perceptual digital signature as a partial match of the additional digital signature if the video edit distance between at least three fingerprints of the first perceptual digital signature and three fingerprints of the additional perceptual digital signatures is zero.
Dependent claims: 15, 16, 17, 18
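The partial-match test in claim 14 can be sketched in Python under a simplifying assumption: if every edit operation has a positive cost, an edit distance of zero between two runs of three fingerprints means the runs are identical. The sketch further assumes that the three fingerprints are consecutive and that segments have a fixed length, neither of which the claim states. The names `split_into_segments`, `runs`, and `partial_matches` are hypothetical.

```python
from typing import Hashable, List, Sequence, Set, Tuple

RUN_LENGTH = 3  # the claim requires at least three matching fingerprints


def split_into_segments(signature: Sequence[Hashable],
                        segment_len: int) -> List[List[Hashable]]:
    """Cut a signature into fixed-length segments (the last may be shorter)."""
    return [list(signature[i:i + segment_len])
            for i in range(0, len(signature), segment_len)]


def runs(seq: Sequence[Hashable], n: int = RUN_LENGTH) -> Set[Tuple]:
    """All runs of n consecutive fingerprints; empty if seq is shorter than n."""
    return {tuple(seq[i:i + n]) for i in range(len(seq) - n + 1)}


def partial_matches(query: Sequence[Hashable],
                    stored: Sequence[Hashable],
                    segment_len: int) -> List[bool]:
    """For each query segment, report whether it shares an identical run
    of RUN_LENGTH fingerprints with the stored signature."""
    stored_runs = runs(stored)
    return [bool(runs(segment) & stored_runs)
            for segment in split_into_segments(query, segment_len)]


# Example: letters stand in for fingerprints.
query = list("abcdefgh")
stored = list("xxbcdyyy")
flags = partial_matches(query, stored, segment_len=4)
```

Here the first segment (`a b c d`) shares the run `b c d` with the stored signature and is flagged as a partial match, while the second segment (`e f g h`) is not.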
Specification