Video cataloger system with audio track extraction
Abstract
One aspect of the invention is directed to a system and method for video cataloging. The video is cataloged according to predefined or user-definable metadata. The metadata is used to index and then retrieve encoded video.
40 Claims
1. A method of extracting audio for indexing of video, comprising:
receiving video information having embedded audio information and associated time codes;
capturing the embedded audio information in the video information;
extracting a plurality of audio metadata tracks from the audio information, each audio metadata track having selected ones of the time codes indicative at least of start and stop times for the audio metadata track;
encoding the video information; and
accessing the encoded video information with the selected time codes of one of the audio metadata tracks.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
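
As a rough illustration of the final step of claim 1, the sketch below models an audio metadata track as a record carrying start and stop time codes, then converts a matching track into a frame range that could be used to seek into the encoded video. The names AudioMetadataTrack, timecode_to_frames, find_tracks and frame_range, the HH:MM:SS:FF time code layout, and the 30 fps non-drop-frame rate are all assumptions made for this example; the claim does not specify any of them.

```python
from dataclasses import dataclass
from typing import List, Tuple

FPS = 30  # assumed non-drop-frame rate; the claim does not name one


def timecode_to_frames(tc: str, fps: int = FPS) -> int:
    """Convert an assumed 'HH:MM:SS:FF' time code to an absolute frame count."""
    hh, mm, ss, ff = (int(part) for part in tc.split(":"))
    if ff >= fps:
        raise ValueError(f"frame field {ff} out of range for {fps} fps")
    return ((hh * 60 + mm) * 60 + ss) * fps + ff


@dataclass(frozen=True)
class AudioMetadataTrack:
    """One entry of a hypothetical audio metadata track."""
    kind: str    # e.g. "keyword"; the track kinds here are illustrative
    start: str   # start time code
    stop: str    # stop time code
    value: str   # extracted metadata, e.g. a spotted word


def find_tracks(tracks: List[AudioMetadataTrack], kind: str,
                needle: str) -> List[AudioMetadataTrack]:
    """Return entries of the given kind whose value contains the search text."""
    return [t for t in tracks
            if t.kind == kind and needle.lower() in t.value.lower()]


def frame_range(track: AudioMetadataTrack, fps: int = FPS) -> Tuple[int, int]:
    """Map a track entry's time codes to a (start, stop) frame range."""
    start = timecode_to_frames(track.start, fps)
    stop = timecode_to_frames(track.stop, fps)
    if stop < start:
        raise ValueError("stop time code precedes start time code")
    return start, stop
```

A caller would pass the resulting frame range to whatever player or decoder holds the encoded video; that part is outside the sketch.
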
11. An audio engine for extracting metadata tracks, comprising:
an audio signal switch receiving an audio signal;
an audio classification component controlling the audio signal switch according to whether the audio signal is classified as speech; and
a plurality of audio metadata track extraction components in data communication with the output of the switch, wherein each audio metadata track extraction component provides an audio metadata track associated with speech.
Dependent claims: 12, 13, 14, 15, 16, 17, 18, 19
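
The arrangement in claim 11 (a classifier gating a switch whose output feeds several extraction components) can be sketched as below. This is only a minimal model under stated assumptions: Segment, TrackEntry and AudioEngine are hypothetical names, and the label and text fields stand in for what a real classifier and real recognizers would derive from the audio signal itself.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass(frozen=True)
class Segment:
    """A slice of the audio signal (toy model, not real audio)."""
    start: float     # seconds
    stop: float      # seconds
    label: str       # stand-in for what a classifier would infer
    text: str = ""   # stand-in for the spoken content


@dataclass(frozen=True)
class TrackEntry:
    track: str
    start: float
    stop: float
    value: str


Extractor = Callable[[Segment], Optional[str]]


class AudioEngine:
    """Classifier-controlled switch feeding several track extractors."""

    def __init__(self, is_speech: Callable[[Segment], bool],
                 extractors: Dict[str, Extractor]) -> None:
        self.is_speech = is_speech
        self.extractors = extractors

    def process(self, segments: List[Segment]) -> Dict[str, List[TrackEntry]]:
        tracks: Dict[str, List[TrackEntry]] = {n: [] for n in self.extractors}
        for seg in segments:
            # The switch: non-speech segments never reach the extractors.
            if not self.is_speech(seg):
                continue
            # Every extractor sees the same switch output.
            for name, extract in self.extractors.items():
                value = extract(seg)
                if value is not None:
                    tracks[name].append(
                        TrackEntry(name, seg.start, seg.stop, value))
        return tracks
```

Each extractor produces its own track, and every entry inherits the start and stop times of the speech segment it came from.
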
20. An audio engine for extracting metadata tracks, comprising:
an audio signal switch receiving an audio signal;
an audio classification component in data communication with and controlling the audio signal switch according to whether the audio signal is classified as speech; and
a plurality of audio metadata track extraction components in data communication with the output of the switch, wherein each audio metadata track extraction component provides an audio metadata track associated with speech.
Dependent claims: 21, 22, 23, 24, 25, 26
27. A method of extracting audio for indexing of video, comprising:
receiving video information having embedded audio information and associated time codes;
capturing the embedded audio information in the video information;
extracting a plurality of audio metadata tracks from the audio information, each audio metadata track being associated with selected ones of the time codes indicative at least of start and stop times for the audio metadata track;
encoding the video information; and
accessing the encoded video information with the selected time codes of one of the audio metadata tracks.
Dependent claims: 28, 29, 30
31. An audio engine for extracting metadata tracks, comprising:
an audio signal switch receiving an audio signal;
an audio classification engine;
an audio class dictionary configured to provide dictionary data indicative of audio classes to the audio classification engine;
an audio class profiler in data communication with the audio classification engine, wherein the audio class profiler receives the audio signal, and wherein the audio class profiler is further in data communication with and controls the audio signal switch according to whether the audio signal is classified as speech; and
a plurality of audio metadata track extraction components in data communication with the output of the switch, wherein each audio metadata track extraction component provides an audio metadata track associated with speech.
Dependent claims: 32, 33, 34, 35, 36, 37, 38, 39, 40
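
Claim 31 splits classification into a dictionary, a classification engine and a profiler that drives the switch. One way to picture that split is sketched below. The claim does not say how classes are represented or compared, so everything here is an assumption made for illustration: the two features (zero-crossing rate and RMS energy), the invented dictionary values, the nearest-neighbour comparison, and the function names profile, classify and switch_closed.

```python
import math
from typing import Dict, List, Tuple

Features = Tuple[float, float]  # (zero-crossing rate, RMS energy)

# Hypothetical dictionary data: one reference feature pair per audio class.
# The numbers are invented for the example, not taken from the patent.
AUDIO_CLASS_DICTIONARY: Dict[str, Features] = {
    "speech": (0.10, 0.30),
    "music": (0.05, 0.60),
    "silence": (0.00, 0.00),
}


def profile(samples: List[float]) -> Features:
    """Profiler stand-in: reduce raw samples to a small feature pair."""
    if not samples:
        return (0.0, 0.0)
    crossings = sum(1 for a, b in zip(samples, samples[1:])
                    if (a < 0) != (b < 0))
    zcr = crossings / max(len(samples) - 1, 1)
    rms = math.sqrt(sum(s * s for s in samples) / len(samples))
    return (zcr, rms)


def classify(features: Features, dictionary: Dict[str, Features]) -> str:
    """Engine stand-in: pick the dictionary class nearest to the features."""
    return min(dictionary,
               key=lambda name: math.dist(features, dictionary[name]))


def switch_closed(samples: List[float],
                  dictionary: Dict[str, Features] = AUDIO_CLASS_DICTIONARY
                  ) -> bool:
    """Close the switch only when the signal is classified as speech."""
    return classify(profile(samples), dictionary) == "speech"
```

Keeping the dictionary as plain data means new audio classes could be added without touching the engine or the profiler, which is one plausible reason for separating the three parts.
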
Specification