Audio-visual codebook dependent cepstral normalization
First Claim
Patent Images
1. An apparatus for enhancing speech for speech recognition, said apparatus comprising:
- a first input medium which obtains noisy audio-visual features;
a second input medium which obtains noisy audio features related to the noisy audio-visual features; and
a cepstral speech function output arrangement for combining the first and second inputs to yield enhanced audio features that are re-combined with visual features to yield enhanced audio-visual features used for speech recognition.
8 Assignments
0 Petitions
Accused Products
Abstract
An arrangement for yielding enhanced audio features towards the provision of enhanced audio-visual features for speech recognition. Input is provided in the form of noisy audio-visual features and noisy audio features related to the noisy audio-visual features.
20 Citations
21 Claims
-
1. An apparatus for enhancing speech for speech recognition, said apparatus comprising:
-
a first input medium which obtains noisy audio-visual features; a second input medium which obtains noisy audio features related to the noisy audio-visual features; and a cepstral speech function output arrangement for combining the first and second inputs to yield enhanced audio features that are re-combined with visual features to yield enhanced audio-visual features used for speech recognition. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A method of enhancing speech for speech recognition, said method comprisingthe steps of:
-
obtaining noisy audio-visual features; obtaining noisy audio features related to the noisy audio-visual features; and using a cepstral speech function operating on the noisy audio features and the noisy audio-visual features to yield enhanced audio features that are re-combined with visual features to yield enhanced audio-visual features used for speech recognition. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A program storage device readable by machine, tangibly embodying a program of instructions executable by the machine to perform method steps for enhancing speech for speech recognition, said method comprising the steps of:
-
obtaining noisy audio-visual features; obtaining noisy audio features related to the noisy audio-visual features; and using a cepstral speech function operating on the noisy audio features and the noisy audio-visual features to yield enhanced audio features that are re-combined with visual features to yield enhanced audio-visual features used for speech recognition.
-
Specification