Method and system for determining media file identifiers and likelihood of media file relationships
Abstract
A method and system for determining the likelihood or similarity ratio that a selected media file of interest is related to one or more predetermined media files is provided that utilizes, combines, analyzes, and evaluates different categories of data and metadata extracted from each media file to generate a media file identifier for each media file that can then be used as a basis to compare any two media files to each other.
20 Claims
1. A method for determining a similarity ratio that a selected media file of interest is a derivative work of or is derived from one or more predetermined media files comprising:
- providing a selected media file of interest and one or more predetermined media files to be compared with the selected media file of interest,
- obtaining media type classifications for the selected media file of interest and for each of the one or more predetermined media files,
- extracting data or metadata from the selected media file of interest and the one or more predetermined files, wherein at least two different categories of data or metadata are extracted from the selected media file of interest and the one or more predetermined media files by engaging at least two extraction engines,
- harvesting the data or metadata extracted from the selected media file of interest and the one or more predetermined media files,
- storing the data or metadata extracted from the selected media file of interest and the one or more predetermined media files,
- selecting, based on the media type classifications, two or more ranked categories of data or metadata to be used for generating media file identifiers for the selected media file of interest and for each of the one or more predetermined media files,
- generating, based on the selected two or more ranked categories of data or metadata, the media file identifiers for the selected media file of interest and for each of the one or more predetermined media files,
- storing the media file identifier generated for the selected media file of interest and the media file identifiers generated for each of the one or more predetermined media files,
- comparing the media file identifier generated for the selected media file of interest to the media file identifier generated for each of the one or more predetermined media files, and
- determining a similarity ratio that the selected media file of interest is a derivative work of or is derived from each of the one or more predetermined media files based on comparing the media file identifier generated for the selected media file of interest to the media file identifiers generated for each of the one or more predetermined media files, said similarity ratio indicative of whether the selected media file of interest is derived from one or more predetermined media files without regard to the media type classifications of selected media file of interest and the one or more predetermined media files.

Dependent claims: 2, 3, 4, 5, 6
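The selecting, generating, comparing, and determining steps of claim 1 can be illustrated with a minimal Python sketch. Everything concrete in it is an assumption made for illustration and is not taken from the patent: the category names (`pixel_hash`, `exif`, `text_shingles`, `author`), the `RANKED_CATEGORIES` table, the 1/rank weighting, and the use of a weighted Jaccard overlap as the similarity ratio. The claim itself does not say how identifiers are encoded or how the ratio is computed.

```python
from dataclasses import dataclass, field

# Hypothetical ranking of metadata categories per media type (best first).
# Claim 1 only requires two or more ranked categories chosen by media type.
RANKED_CATEGORIES = {
    "image": ["pixel_hash", "exif"],
    "document": ["text_shingles", "author"],
}
DEFAULT_CATEGORIES = ["pixel_hash", "text_shingles"]  # assumed fallback


@dataclass
class MediaFile:
    name: str
    media_type: str
    # category -> set of values already extracted and harvested for this file
    extracted: dict = field(default_factory=dict)


def generate_identifier(media: MediaFile) -> dict:
    """Build an identifier: category -> (weight, frozen set of values).

    The weight 1/rank is an assumed scheme, not something the claim states.
    """
    categories = RANKED_CATEGORIES.get(media.media_type, DEFAULT_CATEGORIES)
    return {
        category: (1.0 / rank, frozenset(media.extracted.get(category, ())))
        for rank, category in enumerate(categories, start=1)
    }


def similarity_ratio(id_a: dict, id_b: dict) -> float:
    """Weighted Jaccard overlap of two identifiers, in the range 0.0 to 1.0.

    The function never inspects the media type, so files of different
    types are compared with the same logic.
    """
    empty = (0.0, frozenset())
    score = 0.0
    total_weight = 0.0
    for category in set(id_a) | set(id_b):
        weight_a, values_a = id_a.get(category, empty)
        weight_b, values_b = id_b.get(category, empty)
        weight = max(weight_a, weight_b)
        union = values_a | values_b
        overlap = len(values_a & values_b) / len(union) if union else 0.0
        score += weight * overlap
        total_weight += weight
    return score / total_weight if total_weight else 0.0


original = MediaFile("original.png", "image",
                     {"pixel_hash": {"a", "b", "c", "d"}, "exif": {"cam1"}})
edited = MediaFile("edited.png", "image",
                   {"pixel_hash": {"a", "b", "c", "e"}, "exif": {"cam1"}})
ratio = similarity_ratio(generate_identifier(original), generate_identifier(edited))
print(round(ratio, 3))
```

In this sketch the higher-ranked category contributes more to the ratio, so a change in `pixel_hash` values lowers the score more than the same change in `exif` would.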
7. A system for determining a similarity ratio that a selected media file of interest is a derivative work of or is derived from one or more predetermined media files comprising:
- a data receiving and input device for receiving a selected media file of interest and one or more predetermined media files to be compared with the selected media file of interest;
- a data receiving and output device for providing similarity ratios quantifying how similar a selected media file of interest is to each of the one or more predetermined media files; and
- a data and metadata harvesting, extraction, analysis, evaluation, and storage system, the data and metadata harvesting, extraction, analysis, evaluation, and storage system further comprising:
  - at least two data extraction engines configured to extract different categories of data or metadata from the selected media file of interest and from each of the one or more predetermined media files;
  - a data and metadata harvesting engine configured to manage the at least two data extraction engines and to collect and harvest data or metadata extracted from the at least two data extraction engines, wherein the harvested data or metadata is stored in a data store as data or metadata subsets within each category of data or metadata extracted from the selected media file of interest and each of the one or more predetermined media files;
  - an analysis engine configured to:
    - obtain media type classifications for the selected media file of interest and for each of the one or more predetermined media files;
    - select, based on the media type classifications, at least one category of data or metadata to be used for generating media file identifiers for the selected media file of interest and for each of the one or more predetermined media files;
    - generate, based on the selected at least one category of data or metadata, the media file identifiers for the selected media file of interest and for each of the one or more predetermined media files;
    - compare the generated media file identifier for the selected media file of interest with each of the generated media file identifiers for each of the one or more predetermined media files; and
    - determine similarity ratios that a selected media file of interest is a derivative work of or is derived from one or more predetermined media files based on the comparison of the media file identifier for the selected media file of interest with each of the generated media file identifiers for each of the one or more predetermined media files, said similarity ratio indicative of whether the selected media file of interest is derived from one or more predetermined media files without regard to the media type classifications of selected media file of interest and the one or more predetermined media files; and
  - a user interface configured to provide a user access to a set of features and functionality of the system and to enable the user to select and rank one or more harvested subsets of data or metadata.

Dependent claims: 8, 9, 10, 11, 12, 13, 14, 15, 16
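The harvesting engine of claim 7, which manages at least two extraction engines and stores their output as subsets within each category, might be organized roughly as follows. This is a hedged sketch only: the class name `HarvestingEngine`, the two toy engines, the category labels `structure` and `content`, and the in-memory dictionary standing in for the data store are all hypothetical and do not come from the patent.

```python
import hashlib
from typing import Callable, Dict

# An extraction engine is modelled as a function from raw bytes to a
# dictionary of named subsets (subset name -> value). This shape is assumed.
Engine = Callable[[bytes], Dict[str, str]]


def size_engine(data: bytes) -> Dict[str, str]:
    """Toy engine: reports a structural property of the file."""
    return {"byte_length": str(len(data))}


def hash_engine(data: bytes) -> Dict[str, str]:
    """Toy engine: reports a content digest of the file."""
    return {"sha256": hashlib.sha256(data).hexdigest()}


class HarvestingEngine:
    """Runs every registered engine and stores results per file and category."""

    def __init__(self, engines: Dict[str, Engine]):
        # Claim 7 recites at least two extraction engines.
        if len(engines) < 2:
            raise ValueError("at least two extraction engines are required")
        self.engines = engines
        # file id -> category -> subset name -> value (stand-in data store)
        self.store: Dict[str, Dict[str, Dict[str, str]]] = {}

    def harvest(self, file_id: str, data: bytes) -> None:
        self.store[file_id] = {
            category: engine(data) for category, engine in self.engines.items()
        }


harvester = HarvestingEngine({"structure": size_engine, "content": hash_engine})
harvester.harvest("file-of-interest", b"abc")
print(harvester.store["file-of-interest"]["structure"])
```

Keeping the store keyed by category and then by subset mirrors the claim's wording, and would let a user interface list the harvested subsets so a user can select and rank them.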
17. Non-transitory computer-readable storage media encoded with a computer program including instructions executable by a processor for determining a similarity ratio that a selected media file of interest is a derivative work or is derived from one or more predetermined media files, the media comprising:
- a database, recorded on the media, comprising different types of data and metadata extracted from each of the selected media file of interest and the one or more predetermined media files;
- an evaluation software module comprising instructions for:
  - obtaining media type classifications for the selected media file of interest and for each of the one or more predetermined media files,
  - extracting data or metadata from the selected media file of interest and the one or more predetermined files, wherein at least two different categories of data or metadata are extracted from the selected media file of interest and the one or more predetermined media files by engaging at least two extraction engines,
  - harvesting the data or metadata extracted from the selected media file of interest and the one or more predetermined media files,
  - storing the data or metadata extracted from the selected media file of interest and the one or more predetermined media files,
  - selecting, based on the media type classifications, two or more ranked categories of data or metadata to be used for generating media file identifiers for the selected media file of interest and for each of the one or more predetermined media files,
  - generating, based on the selected two or more ranked categories of data or metadata, the media file identifiers for the selected media file of interest and for each of the one or more predetermined media files,
  - storing the media file identifiers generated for the selected media file of interest and for each of the one or more predetermined media files,
  - comparing the media file identifier generated for the selected media file of interest to each of the media file identifier generated for each of the one or more predetermined media files, and
  - determining a similarity ratio that the selected media file of interest is a derivative work of or is derived from each of the one or more predetermined media files based on comparing the media file identifier generated for the selected media file of interest to each of the media file identifiers generated for each of the one or more predetermined media files, said similarity ratio indicative of whether the selected media file of interest is derived from one or more predetermined media files without regard to the media type classifications of selected media file of interest and the one or more predetermined media files.

Dependent claims: 18, 19, 20
Specification