VOICE DESCRIPTION OF TIME-BASED MEDIA FOR INDEXING AND SEARCHING
First Claim
1. A method of associating a voice description with time-based media, the time-based media including at least one media track, the method comprising:
- enabling a user of a media editing system to record the user'"'"'s voice description of the time-based media while using the media editing system to play back the time-based media;
creating a voice description audio track for storing the voice description; and
storing the recorded voice description in the voice description audio track, wherein the voice description audio track is temporally synchronized with the at least one media track, and wherein the at least one media track and the voice description track are stored within a single media object.
1 Assignment
0 Petitions
Accused Products
Abstract
Methods and systems for time-synchronous voice annotation of video and audio media enable effective searching of time-based media content. A user record one or more types voice annotation onto corresponding named voice annotation tracks, which are stored within a media object comprising the time-based media and the annotations. The one or more annotation tracks can then be selectively searched for content using speech or text search terms. Various workflows enable voice annotation to be performed using media editing systems, or one or more stand alone voice annotations systems that permit multiple annotators to operate in parallel, generating different kinds of annotations, and returning their annotation tracks to a central location for consolidation.
32 Citations
17 Claims
-
1. A method of associating a voice description with time-based media, the time-based media including at least one media track, the method comprising:
-
enabling a user of a media editing system to record the user'"'"'s voice description of the time-based media while using the media editing system to play back the time-based media; creating a voice description audio track for storing the voice description; and storing the recorded voice description in the voice description audio track, wherein the voice description audio track is temporally synchronized with the at least one media track, and wherein the at least one media track and the voice description track are stored within a single media object. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A method of associating a voice description with time-based media, the time-based media including at least one media track, the method comprising:
-
receiving the time-based media at a media annotation system; enabling a user of the media annotation system to record the user'"'"'s voice description of the time-based media while using the media annotation system to play back the time-based media; receiving from the user an identifier for an audio description track for storing the user'"'"'s voice description; creating the audio description track, wherein the audio description track is tagged by the identifier; storing the voice description in the audio description track in association with the at least one media track as a component of a media object comprising the media track and the audio description track, wherein the audio description track is temporally synchronized with the at least one media track; and outputting the media object from the voice annotation system.
-
-
16. A computer system for voice annotation of time-based media, the time-based media including at least one media track, the computer system comprising:
-
an audio input for receiving voice annotation from a user of the voice annotation system; an output for exporting the voice annotation; a processor programmed to; input via the audio input the user'"'"'s voice annotation of the time-based media while using the media annotation system to play back the time-based media; create an audio annotation track for storing the user'"'"'s voice annotation; input an identifier for the audio annotation track; store the voice annotation in the audio annotation track as a component of a media object comprising the at least one media track and the audio annotation track, wherein the audio annotation track is temporally synchronized with the at least one media track; and export the media object from the voice annotation system via the output.
-
-
17. A computer program product comprising:
a computer-readable medium with computer program instructions encoded thereon, wherein the computer program instructions, when processed by a computer, instruct the computer to perform a method of enabling a user to annotate time-based media, wherein the time-based media includes at least one media track, the method comprising; receiving the time-based media at a media annotation system; enabling the user to record voice annotation of the time-based media while the computer is playing back the time-based media, creating an audio annotation track and tagging the audio annotation track with an identifier received from the user; storing the voice annotation in the audio annotation track, wherein the audio annotation track is stored as a component of a media object that comprises the at least one media track and the audio annotation track, and wherein the audio annotation track is temporally synchronized with the at least one media track; and exporting the media object from the media annotation system.
Specification