Male acoustic model adaptation based on language-independent female speech data
Abstract
A method of generating proxy acoustic models for use in automatic speech recognition includes training acoustic models from speech received via microphone from male speakers of a first language, and adapting the acoustic models in response to language-independent speech data from female speakers of a second language, to generate proxy acoustic models for use during runtime of speech recognition of an utterance from a female speaker of the first language.
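The two steps in the abstract (train on male speech, then adapt with female speech from another language) can be illustrated with a deliberately simplified sketch. Everything here is an assumption made for illustration, not the method described in the specification: the "acoustic model" is reduced to a per-phone pair of mean formant frequencies, the female data is reduced to a list of (F1, F2) measurements, and the adaptation is a single scale factor per formant. The function names `train_male_models`, `estimate_scale`, and `adapt_models` are hypothetical.

```python
from statistics import mean


def train_male_models(samples):
    """Step (a), simplified: average the (F1, F2) observations per phone.

    samples: {phone: [(f1_hz, f2_hz), ...]} from male speakers of the first language.
    Returns {phone: (mean_f1, mean_f2)}.
    """
    return {
        phone: tuple(mean(obs[i] for obs in observations) for i in range(2))
        for phone, observations in samples.items()
    }


def estimate_scale(male_models, female_formants):
    """Assumed adaptation statistic: ratio of the average female formant
    (second-language data) to the average male model formant, per formant index."""
    scales = []
    for i in range(2):
        male_avg = mean(means[i] for means in male_models.values())
        female_avg = mean(f[i] for f in female_formants)
        scales.append(female_avg / male_avg)
    return tuple(scales)


def adapt_models(male_models, scale):
    """Step (b), simplified: scale every male mean to obtain proxy models."""
    return {
        phone: tuple(m * s for m, s in zip(means, scale))
        for phone, means in male_models.items()
    }


# Illustrative numbers only; they are not taken from the patent.
male_samples = {
    "a": [(700, 1200), (740, 1240)],
    "i": [(280, 2200), (300, 2300)],
}
female_second_language = [(600, 2000), (612, 2164)]

male_models = train_male_models(male_samples)
scale = estimate_scale(male_models, female_second_language)
proxy_models = adapt_models(male_models, scale)
```

A real system would adapt full statistical models rather than two numbers per phone; the sketch only shows the data flow from male training data and female second-language data to proxy models.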
15 Claims
1. A method of generating proxy acoustic models for use in automatic speech recognition, comprising the steps of:
(a) training acoustic models from speech received via microphone from male speakers of a first language using an automatic speech recognition (ASR) system comprising the microphone, memory, and a processor; and
(b) adapting the acoustic models trained in step (a) using the ASR system in response to language-independent speech data from female speakers of a second language, to generate proxy acoustic models for use during runtime of speech recognition of an utterance from a female speaker of the first language.
Dependent claims: 2, 3, 4, 5, 6, 7, 8.
9. A method of automatic speech recognition, comprising the steps of:
(a) receiving an utterance via a microphone from a female speaker of a first language;
(b) pre-processing the utterance with an automatic speech recognition pre-processor to generate acoustic feature vectors;
(c) determining at least one formant frequency of the received utterance;
(d) identifying at least one of a plurality of formant frequency bands in speech data from female speakers of a second language that corresponds to the at least one formant frequency determined in step (c); and
(e) adapting acoustic models trained from speech from male speakers of the first language in response to the identifying step (d), to result in proxy acoustic models for the female speaker of the first language,
wherein the method is carried out using an automatic speech recognition (ASR) system comprising the microphone, memory, and a processor.
Dependent claims: 10, 11, 12, 13, 14, 15.
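Steps (c) through (e) of claim 9 can be sketched as follows. This is a minimal illustration under stated assumptions, not the claimed implementation: the formant is approximated by the strongest frequency found in a brute-force scan (a crude stand-in for a real formant tracker), the band table `FEMALE_BANDS` and its warp factors are invented for the example, and the male models are reduced to lists of mean frequencies per phone. All function names are hypothetical.

```python
import math


def dominant_frequency(signal, sample_rate, lo=200, hi=1200, step=10):
    """Step (c), simplified: scan candidate frequencies and return the one
    carrying the most energy in the signal."""
    best_freq, best_energy = lo, -1.0
    for freq in range(lo, hi + 1, step):
        omega = 2.0 * math.pi * freq / sample_rate
        re = sum(x * math.cos(omega * n) for n, x in enumerate(signal))
        im = sum(x * math.sin(omega * n) for n, x in enumerate(signal))
        energy = re * re + im * im
        if energy > best_energy:
            best_freq, best_energy = freq, energy
    return best_freq


# Hypothetical bands from second-language female data: (low_hz, high_hz, warp_factor).
FEMALE_BANDS = [(200, 500, 1.10), (500, 900, 1.18), (900, 1200, 1.22)]


def find_band(formant_hz, bands):
    """Step (d): return the band containing the formant, or None if none matches."""
    for low, high, warp in bands:
        if low <= formant_hz < high:
            return (low, high, warp)
    return None


def adapt_means(male_means, warp):
    """Step (e), simplified: scale the male model means by the band's warp factor."""
    return {phone: [m * warp for m in means] for phone, means in male_means.items()}


# Synthetic "utterance": a 600 Hz tone, 400 samples at 8 kHz (illustrative only).
sample_rate = 8000
utterance = [math.sin(2.0 * math.pi * 600 * n / sample_rate) for n in range(400)]

formant = dominant_frequency(utterance, sample_rate)
band = find_band(formant, FEMALE_BANDS)
proxy_means = adapt_means({"a": [720.0, 1220.0]}, band[2])
```

The lookup in `find_band` is the part that mirrors the claim most directly: the measured formant selects one band from the second-language female data, and that selection drives how the male models are adapted.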
Specification