Method and system for determining content treatment
Abstract
Metadata determines treatment of content by automated systems, including “user generated content” web sites. The metadata may be conveyed with the content or may be determined by other techniques, including systems based on digital watermarking or content fingerprinting. In some arrangements, treatment depends on the temporal length of a content excerpt that matches a pre-existing work. In others, treatment depends on popularity—either of the content, or a pre-existing work from which it was derived. A great variety of other factors and contexts can also be considered. Automated tools to preliminarily identify possible “fair use” can be realized; further determination may be made by human evaluators (including crowd-source approaches).
34 Claims
1. A method comprising, for each of plural items of audio-visual content uploaded to a web site that distributes user-generated content for possible distribution of the uploaded content therefrom, each of said plural items of audio-visual content including both audio and video:
at a time such audio-visual content is ingested by the web site, conducting a fingerprint-based analysis of the audio of the uploaded content item, to identify one or more portions of said content item that have been derived from one or more pre-existing works, said analysis being performed by a software-configured processor, said analysis including transforming plural partially overlapping temporal excerpts of the audio into spectral band data to determine, for each such excerpt, relative energy within plural different frequency bands;
repeating said transforming plural times for each second of the audio;
forming fingerprints using the spectral band data for said plural partially overlapping temporal excerpts; and
matching said fingerprints with reference data to identify one or more portions of the content;
wherein said fingerprints enable identification of content portions with a granularity of three seconds in length;
determining a length of an identified content portion, based on the matched fingerprints;
consulting stored rules to determine how to treat said uploaded content item, wherein treatment of the content item indicated by at least one of said stored rules depends on said length of the identified content portion; and
controlling distribution of the uploaded content item in accordance with a rule that depends on said length of the identified content portion;
wherein for a first content item, for which the identified portion is of a first length, said controlling comprises treating the uploaded first content item in accordance with a first stored rule that prohibits distribution of the uploaded content, and
wherein for a second content item, for which the identified portion is of a second, different, length, said controlling comprises treating the uploaded second content item in accordance with a second, different, stored rule that permits distribution of the uploaded content and provides a majority share of ad revenue associated therewith to a rights-holder of a pre-existing work from which at least a portion of the uploaded second content item was derived.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
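The fingerprint-forming acts recited in claim 1 (overlapping excerpts, spectral band data, relative band energy, several transforms per second) can be illustrated with a minimal sketch. The claim does not say how fingerprints are derived from the band data, so the bit scheme below, which takes the sign of energy differences across adjacent bands and adjacent excerpts, is an assumption borrowed from a well-known style of audio fingerprinting. The function names, the naive DFT, the equal-width bands, and all parameter values are likewise hypothetical.

```python
import cmath
import math


def band_energies(frame, n_bands):
    """Relative energy of one temporal excerpt in n_bands equal-width bands."""
    n = len(frame)
    # Naive DFT power spectrum over the positive-frequency bins (DC skipped).
    power = []
    for k in range(1, n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                  for i, x in enumerate(frame))
        power.append(abs(acc) ** 2)
    width = len(power) // n_bands
    bands = [sum(power[b * width:(b + 1) * width]) for b in range(n_bands)]
    total = sum(bands) or 1.0  # avoid dividing by zero on silence
    return [e / total for e in bands]


def fingerprints(samples, rate, frame_len, hops_per_second, n_bands=8):
    """One (n_bands - 1)-bit fingerprint per pair of consecutive excerpts."""
    hop = rate // hops_per_second
    if hops_per_second < 2 or hop >= frame_len:
        # The claim calls for plural transforms per second and for
        # partially overlapping excerpts.
        raise ValueError("need >= 2 hops per second and hop < frame_len")
    out, prev = [], None
    for start in range(0, len(samples) - frame_len + 1, hop):
        cur = band_energies(samples[start:start + frame_len], n_bands)
        if prev is not None:
            bits = 0
            for b in range(n_bands - 1):
                # Assumed scheme: sign of the band-to-band energy difference,
                # compared against the previous excerpt.
                diff = (cur[b] - cur[b + 1]) - (prev[b] - prev[b + 1])
                bits = (bits << 1) | int(diff > 0)
            out.append(bits)
        prev = cur
    return out
```

With a hypothetical 800 Hz sample rate, 200-sample excerpts, and 8 transforms per second, consecutive excerpts overlap by half, and two seconds of audio yield 14 fingerprints.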
13. A system comprising:
a server that receives items of audio-video content uploaded by users, for possible distribution;
a fingerprint processor that identifies, at a time of ingest, for each of plural items of said audio-video content, and by reference to audio fingerprint data derived from the uploaded audio-content item, one or more portions of said uploaded audio-video content item that have been derived from one or more pre-existing works, said processor performing said identification by analysis acts including:
transforming plural partially overlapping temporal excerpts of the audio into spectral band data to determine, for each such excerpt, relative energy within plural frequency bands;
repeating such transforming plural times for each second of audio;
forming fingerprints using the spectral band data for said plural partially overlapping temporal excerpts; and
matching the fingerprints with reference data to identify portions of the content, said fingerprints enabling identification of content portions with a granularity of three seconds in length;
a database, the database containing rules specifying how uploaded content items should be treated; and
a processor that controls distribution of uploaded content items in accordance with said rules;
wherein at least one of said rules specifies treatment of content items based on a length of a portion identified as having been derived from a pre-existing work, wherein for a first item of content, for which the identified portion is of a first length, said rules cause the processor to prohibit distribution of said first item of content; and
for a second item of content, for which the identified portion is of a second, different, length, said rules cause the processor to permit distribution of said second content item, with a majority of ad revenue associated therewith shared with a rights-holder of a pre-existing work from which at least a portion of said second item of content was derived.
Dependent claims: 14, 15, 16, 17, 18, 19, 20, 21, 22, 23
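The rules database and distribution-controlling processor of claim 13 can be pictured as a small lookup keyed on the length of the identified portion. The sketch below is illustrative only. The claim requires that two different lengths lead to two different treatments, but it does not say which length is longer or where any threshold lies, so the 3-second and 30-second boundaries, the 60% share, and the names `Rule`, `RULES`, and `select_rule` are all assumptions.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Rule:
    min_seconds: float             # inclusive lower bound on matched length
    max_seconds: Optional[float]   # exclusive upper bound; None = unbounded
    distribute: bool               # whether the upload may be distributed
    rights_holder_ad_share: float  # fraction of ad revenue to the rights-holder


# Hypothetical rule table; the thresholds and shares are not from the claims.
RULES = [
    # Shorter identified portion: distribute, majority of ad revenue
    # goes to the rights-holder of the pre-existing work.
    Rule(3.0, 30.0, distribute=True, rights_holder_ad_share=0.60),
    # Longer identified portion: distribution prohibited.
    Rule(30.0, None, distribute=False, rights_holder_ad_share=0.0),
]


def select_rule(matched_seconds, rules=RULES):
    """Return the first rule whose length range covers the matched portion."""
    for rule in rules:
        below_max = rule.max_seconds is None or matched_seconds < rule.max_seconds
        if rule.min_seconds <= matched_seconds and below_max:
            return rule
    return None  # no length-based rule applies (e.g. nothing matched)
```

Under these assumed values, `select_rule(10.0)` returns the revenue-sharing rule, `select_rule(45.0)` returns the prohibiting rule, and a match shorter than the assumed three-second floor returns `None`.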
24. A non-transitory computer readable medium comprising instructions stored thereon to cause one or more processors to perform the following:
conducting an audio fingerprint-based analysis of each of plural items of content, to identify one or more portions of said content item that have been derived from one or more pre-existing works, said analysis including transforming plural partially overlapping temporal excerpts of the audio into spectral band data to determine, for each such excerpt, relative energy within plural different frequency bands;
repeating said transforming plural times for each second of the audio;
forming fingerprints using the spectral band data for said plural partially overlapping temporal excerpts; and
matching said fingerprints with reference data to identify portions of the content;
wherein the fingerprints enable identification of content portions with a granularity of three seconds in length;
determining a length of an identified content portion, based on the matched fingerprints;
determining at least one rule that governs treatment of the uploaded content item, based at least in part on said length of the identified content portion; and
controlling distribution of the uploaded content item in accordance with said determined rule;
wherein a first rule prohibits distribution of the uploaded content; and
a second rule permits distribution of the uploaded content, with a majority of ad revenue associated therewith shared with a rights-holder of a pre-existing work from which at least a portion of the uploaded content item was derived.
Dependent claims: 25, 26, 27, 28, 29, 30, 31, 32, 33, 34
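The matching and length-determining acts of claim 24 can be sketched as an index lookup followed by a search for the longest run of consecutive matches at a consistent alignment. This is one possible approach and is not drawn from the claims: it assumes exact fingerprint equality (a practical system would likely tolerate bit errors), approximates length as the run count times the hop interval, and uses the hypothetical names `build_index` and `longest_match`.

```python
from collections import defaultdict


def build_index(reference):
    """Map each fingerprint value to the (work_id, position) pairs it occurs at."""
    index = defaultdict(list)
    for work_id, prints in reference.items():
        for pos, fp in enumerate(prints):
            index[fp].append((work_id, pos))
    return index


def longest_match(query, index, hop_seconds):
    """Return (work_id, seconds) for the longest aligned run of matches."""
    # Group matching query positions by work and alignment offset.
    hits = defaultdict(list)
    for qpos, fp in enumerate(query):
        for work_id, rpos in index.get(fp, ()):
            hits[(work_id, rpos - qpos)].append(qpos)

    best = (None, 0.0)
    for (work_id, _offset), positions in hits.items():
        # Positions are already ascending; find the longest consecutive run.
        run = longest = 1
        for a, b in zip(positions, positions[1:]):
            run = run + 1 if b == a + 1 else 1
            longest = max(longest, run)
        # Approximate length: consecutive fingerprints times hop interval.
        seconds = longest * hop_seconds
        if seconds > best[1]:
            best = (work_id, seconds)
    return best
```

The returned length is what a length-dependent rule would then be chosen against; a caller could also discard any result shorter than the three-second granularity recited in the claim.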
Specification