Method and apparatus for audio-visual speech detection and recognition
First Claim
Patent Images
1. A method of providing speech recognition, the method comprising the steps of:
- processing a video signal associated with an arbitrary content video source;
processing an audio signal associated with the video signal; and
recognizing at least a portion of the processed audio signal, using at least a portion of the processed video signal, to generate an output signal representative of the audio signal.
1 Assignment
0 Petitions
Accused Products
Abstract
Techniques for providing speech recognition comprise the steps of processing a video signal associated with an arbitrary content video source, processing an audio signal associated with the video signal, and recognizing at least a portion of the processed audio signal, using at least a portion of the processed video signal, to generate an output signal representative of the audio signal.
-
Citations
56 Claims
-
1. A method of providing speech recognition, the method comprising the steps of:
-
processing a video signal associated with an arbitrary content video source;
processing an audio signal associated with the video signal; and
recognizing at least a portion of the processed audio signal, using at least a portion of the processed video signal, to generate an output signal representative of the audio signal. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53)
-
-
27. Apparatus for providing speech recognition, the apparatus comprising:
-
at least one processor operable to;
(i) process a video signal associated with an arbitrary content video source;
(ii) process an audio signal associated with the video signal; and
(iii) recognize at least a portion of the processed audio signal, using at least a portion of the processed video signal, to generate an output signal representative of the audio signal; and
memory, coupled to the at least one processor, for storing at least a portion of results associated with at least one of the processing and recognizing operations.
-
-
54. A method of providing speech recognition, the method comprising the steps of:
-
processing an image signal associated with an arbitrary content image source;
processing an audio signal associated with the image signal; and
recognizing at least a portion of the processed audio signal, using at least a portion of the processed image signal, to generate an output signal representative of the audio signal.
-
-
55. Apparatus for providing speech recognition, the apparatus comprising:
-
at least one processor operable to;
(i) process an image signal associated with an arbitrary content image source, (ii) process an audio signal associated with the image signal, and (iii) recognize at least a portion of the processed audio signal, using at least a portion of the processed image signal, to generate an output signal representative of the audio signal; and
memory, coupled to the at least one processor, for storing at least a portion of results associated with at least one of the processing and recognizing operations.
-
-
56. Apparatus for providing speech recognition, the apparatus comprising:
-
means for processing a video signal associated with an arbitrary content video source;
means for processing an audio signal associated with the video signal; and
means for recognizing at least a portion of the processed audio signal, using at least a portion of the processed video signal, to generate an output signal representative of the audio signal.
-
Specification