SYSTEMS AND METHODS FOR AUTOMATICALLY SUGGESTING MEDIA ACCOMPANIMENTS BASED ON IDENTIFIED MEDIA CONTENT
First Claim
1. A method, comprising:
- receiving, at a computing device, one or more images embodying content containing representations of one or more objects;
determining, by the computing device, an identity of at least one object of the one or more objects;
determining, based on the identified at least one object, one or more keywords representative of the one or more images;
determining, based on the one or more keywords, one or more audio tracks for presentation with the one or more images; and
outputting an indication of the determined one or more audio tracks.
2 Assignments
0 Petitions
Accused Products
Abstract
The disclosed technology includes automatically suggesting audio, video, or other media accompaniments to media content based on identified objects in the media content. Media content may include images, audio, video, or a combination. In one implementation, one or more images representative of the media content may be extracted. A visual search may be run across the images to identify objects or characteristics present in or associated with the media content. Keywords may be generated based on the identified objects and characteristics. The keywords may be used to determine suitable audio tracks to accompany the media content, for example by performing a search based on the keywords. The determined tracks may be presented to a user, or automatically arranged to match the media content. In another implementation, an aural search may be run across samples of the audio data to similarly identify objects and characteristics of the media content.
33 Citations
31 Claims
-
1. A method, comprising:
-
receiving, at a computing device, one or more images embodying content containing representations of one or more objects; determining, by the computing device, an identity of at least one object of the one or more objects; determining, based on the identified at least one object, one or more keywords representative of the one or more images; determining, based on the one or more keywords, one or more audio tracks for presentation with the one or more images; and outputting an indication of the determined one or more audio tracks. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A computer-readable medium that stores instructions that, when executed by at least one processor in a system, cause the system to perform a method comprising:
-
receiving, at the at least one processor, a media sequence comprising audio data, the audio data containing representations of one or more objects; determining, by the at least one processor, an identity of at least one object of the one or more objects; determining, based on the identified at least one object, one or more keywords representative of the media sequence; determining, based on the one or more keywords, one or more audio tracks for presentation with the media sequence; and outputting an indication of the determined one or more audio tracks. - View Dependent Claims (16, 17, 18, 19, 20, 21, 22, 23, 24)
-
-
25. A system comprising:
at least one memory operatively coupled to at least one processor and configured for storing data and instructions that, when executed by the at least one processor, cause the system to; receive, at the at least one processor, a media sequence containing representations of one or more objects; determine, by the at least one processor, an identity of at least one object of the one or more objects; determine, based on the identified at least one object, one or more keywords representative of the media sequence; determine, based on the one or more keywords, one or more audio tracks for presentation with the media sequence; and output an indication of the determined one or more audio tracks. - View Dependent Claims (26, 27, 28, 29, 30, 31)
Specification