Method and apparatus for audio-visual speech detection and recognition
First Claim
Patent Images
1. A method of providing speech recognition, the method comprising the steps of:
- processing a video signal associated with an arbitrary content video source;
processing an audio signal associated with the video signal; and
recognizing at least a portion of the processed audio signal, using at least a portion of the processed video signal, to generate an output signal representative of the audio signal.
1 Assignment
0 Petitions
Accused Products
Abstract
Techniques for providing speech recognition comprise the steps of processing a video signal associated with an arbitrary content video source, processing an audio signal associated with the video signal, and recognizing at least a portion of the processed audio signal, using at least a portion of the processed video signal, to generate an output signal representative of the audio signal.
-
Citations
56 Claims
-
1. A method of providing speech recognition, the method comprising the steps of:
-
processing a video signal associated with an arbitrary content video source;
processing an audio signal associated with the video signal; and
recognizing at least a portion of the processed audio signal, using at least a portion of the processed video signal, to generate an output signal representative of the audio signal. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26)
-
-
27. Apparatus for providing speech recognition, the apparatus comprising:
-
at least one processor operable to;
(i) process a video signal associated with an arbitrary content video source;
(ii) process an audio signal associated with the video signal; and
(iii) recognize at least a portion of the processed audio signal, using at least a portion of the processed video signal, to generate an output signal representative of the audio signal; and
memory, coupled to the at least one processor, for storing at least a portion of results associated with at least one of the processing and recognizing operations. - View Dependent Claims (28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53)
-
-
54. A method of providing speech recognition, the method comprising the steps of:
-
processing an image signal associated with an arbitrary content image source;
processing an audio signal associated with the image signal; and
recognizing at least a portion of the processed audio signal, using at least a portion of the processed image signal, to generate an output signal representative of the audio signal.
-
-
55. Apparatus for providing speech recognition, the apparatus comprising:
-
at least one processor operable to;
(i) process an image signal associated with an arbitrary content image source, (ii) process an audio signal associated with the image signal, and (iii) recognize at least a portion of the processed audio signal, using at least a portion of the processed image signal, to generate an output signal representative of the audio signal; and
memory, coupled to the at least one processor, for storing at least a portion of results associated with at least one of the processing and recognizing operations.
-
-
56. Apparatus for providing speech recognition, the apparatus comprising:
-
means for processing a video signal associated with an arbitrary content video source;
means for processing an audio signal associated with the video signal; and
means for recognizing at least a portion of the processed audio signal, using at least a portion of the processed video signal, to generate an output signal representative of the audio signal.
-
Specification