System and Method for Digital Video Retrieval Involving Speech Recognition
First Claim
Patent Images
1. A method comprising:
- converting a descriptive audio stream associated with a digital video to text;
performing a non-textual optical analysis of a frame on the digital video, to yield an analysis; and
aligning the text to frames in the digital video based on the analysis, a first bit rate associated with the digital video, and a second bit rate associated with the descriptive audio stream.
4 Assignments
0 Petitions
Accused Products
Abstract
Disclosed are systems, methods, and computer readable media for retrieving digital images. The method embodiment includes converting a descriptive audio stream of a digital video that is provided for the visually impaired to text and then aligning that text to the appropriate segment of the digital video. The system then indexes the converted text from the descriptive audio stream with the text'"'"'s relationship to the digital video. The system enables queries using action words describing a desired scene from a digital video.
7 Citations
20 Claims
-
1. A method comprising:
-
converting a descriptive audio stream associated with a digital video to text; performing a non-textual optical analysis of a frame on the digital video, to yield an analysis; and aligning the text to frames in the digital video based on the analysis, a first bit rate associated with the digital video, and a second bit rate associated with the descriptive audio stream. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system comprising:
-
a processor; and a computer-readable storage medium having instructions stored which, when executed by the processor, result in the processor performing operations comprising; converting a descriptive audio stream associated with a digital video to text; performing a non-textual optical analysis of a frame on the digital video, to yield an analysis; and aligning the text to frames in the digital video based on the analysis, a first bit rate associated with the digital video, and a second bit rate associated with the descriptive audio stream. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A computer-readable storage device having instructions stored which, when executed by a computing device, result in the computing device performing operations comprising:
-
converting a descriptive audio stream associated with a digital video to text; performing a non-textual optical analysis of a frame on the digital video, to yield an analysis; and aligning the text to frames in the digital video based on the analysis, a first bit rate associated with the digital video, and a second bit rate associated with the descriptive audio stream. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification