METHOD AND SYSTEM TO PROVIDE REFERENCE DATA FOR IDENTIFICATION OF DIGITAL CONTENT
First Claim
Patent Images
1. A method comprising:
- accessing identifiers of a content portion of digital content, the identifiers usable to identify the content portion and associated with multiple different sources of the content portion; and
defining reference data for the content portion by clustering the accessed identifiers, the reference data usable to identify the content portion.
1 Assignment
0 Petitions
Accused Products
Abstract
Source data is accessed for a content portion of digital content. The source data is usable to identify the content portion. The reference data is defined for the content portion by clustering the accessed source data. The reference data is usable to identify the content portion.
-
Citations
24 Claims
-
1. A method comprising:
-
accessing identifiers of a content portion of digital content, the identifiers usable to identify the content portion and associated with multiple different sources of the content portion; and
defining reference data for the content portion by clustering the accessed identifiers, the reference data usable to identify the content portion. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A method comprising:
-
selecting a representative fingerprint from a set of fingerprints for a digital audio track by clustering; and
indexing the representative fingerprint for search queries. - View Dependent Claims (8, 9, 10, 11, 12, 13, 14, 15)
-
-
16. A machine-readable medium comprising instructions, which when executed by a machine, cause the machine to:
-
access source data for a content portion of digital content, the source data usable to identify the content portion; and
define reference data for the content portion by clustering the accessed source data, the reference data usable to identify the content portion.
-
-
17. A machine-readable medium comprising instructions, which when executed by a machine, cause the machine to:
-
select a representative fingerprint from a set of fingerprints for a digital audio track by clustering; and
index the representative fingerprint for search queries.
-
-
18. A machine-readable medium comprising instructions, which when executed by a machine, cause the machine to:
-
compute distance values between each fingerprint of a set of fingerprints by use of a distance function;
calculate a number of matches by computing a number of the distance values below a distance threshold for each of the fingerprints of the set of fingerprints; and
select the fingerprint with a largest number of matches as a representative fingerprint.
-
-
19. A machine-readable medium comprising instructions, which when executed by a machine, cause the machine to:
-
calculate a number of matches by determining a number of the distance values below a distance threshold for each fingerprint of a set of fingerprints; and
select one or more of the fingerprints with a largest number of matches;
calculate an average distance for each of the fingerprints from the fingerprints matched; and
select a fingerprint with a lowest average distance from the one or more of the fingerprints with a largest number of matches as a representative fingerprint.
-
-
20. An apparatus comprising:
-
means for accessing identifiers of a content portion of digital content, the identifiers usable to identify the content portion and associated with multiple different sources of the content portion; and
means for defining reference data for the content portion by clustering the accessed identifiers, the reference data usable to identify the content portion.
-
-
21. An apparatus comprising:
-
a reference fingerprint collection comprising a representative set of fingerprints selected from a master fingerprint collection by clustering;
numerical identifiers to individually identify fingerprints among the representative set of fingerprints; and
text metadata to provide information regarding digital content associated with the representative set of fingerprints. - View Dependent Claims (22, 23)
-
-
24. A method of providing identifiers associated with known digital content items, the method comprising:
for each known digital content item of a plurality of content items, generating a plurality of identifiers associated with the known digital content item;
identifying at least two similar identifiers among the plurality of identifiers; and
storing a reference set of identifiers that excludes at least one similar identifier.
Specification