METHOD AND APPARATUS FOR PROVIDING MULTIMEDIA CONTENT OPTIMIZATION
First Claim
Patent Images
1. A method for detecting duplicate content in a pair of media files prior to publication on a webpage, the method comprising:
- generating fingerprint for the content of each of the pair of media files, the fingerprint defining a feature set for each media file;
comparing the fingerprint of the pair of media files to obtain a similarity score; and
declaring the media files as substantial duplicates when the similarity score exceeds an established threshold.
3 Assignments
0 Petitions
Accused Products
Abstract
Methods, system and computer readable medium for detecting duplicate content in a pair of media files prior to publication on a webpage include generating fingerprints for the contents of each of the pair of media files. The fingerprints of one of the pair of media file are then compared with the fingerprints of another of the pair of media files to compute a similarity score. The similarity score is compared against an established threshold. If the similarity score exceeds the established threshold, it is determined that the two media files are substantial duplicate of one another.
64 Citations
21 Claims
-
1. A method for detecting duplicate content in a pair of media files prior to publication on a webpage, the method comprising:
-
generating fingerprint for the content of each of the pair of media files, the fingerprint defining a feature set for each media file; comparing the fingerprint of the pair of media files to obtain a similarity score; and declaring the media files as substantial duplicates when the similarity score exceeds an established threshold. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A method for publishing media files on a webpage, comprising:
-
identifying media files to be published on the webpage; computing similarity scores for the identified media files based on contents of the media files; determining a containment percentage score for the identified media files, the containment percentage score is associated with the content of the media files; ranking the media files based on pre-defined metric using the similarity scores and containment scores of the identified media files, the pre-defined metric including one or more established thresholds for comparing the similarity scores and containment scores of the media files to determine the ranking of the media files; and publishing the appropriate content of the media files based on the ranking of the media files. - View Dependent Claims (13, 14, 15, 16)
-
-
17. A system for detecting duplicate content in a pair of multimedia files prior to publication on a webpage, the system comprising:
-
a backend server to receive the multimedia files from a plurality of content providers, the backend server receiving the multimedia files over a communication network; a duplicate detection software module available to the backend server, the duplicate detection software module configured to compute a similarity score for the content of the media files received from the plurality of content providers; and a publish server to publish the multimedia files over the internet as a webpage. - View Dependent Claims (18)
-
-
19. A computer readable medium in which program instructions are stored, the program instructions when read by a server of a computing system, cause the server to perform a method for detecting duplicate content in a pair of media files prior to publication on a webpage, the method comprising:
-
generating fingerprint for the content of each of the pair of media files, the fingerprint defining a feature set for each media file; comparing the fingerprint of the pair of media files to obtain a similarity score; and declaring the media files as substantial duplicates when the similarity score exceeds an established threshold. - View Dependent Claims (20, 21)
-
Specification