Rapid speech recognition adaptation using acoustic input
First Claim
1. An apparatus, comprising:
a memory; and
a processor operatively coupled to the memory and configured to:
receive one or more speech recognition parameters prior to issuing a verbal prompt to a user;
issue the verbal prompt to the user;
receive an acoustic input from the user in response to the verbal prompt;
process one or more sequences of phonemes to obtain one or more acoustic representations, wherein the one or more sequences of phonemes are generated from a list of expected responses to the issued verbal prompt;
compare the acoustic input from the user to the one or more acoustic representations to determine an acoustic channel characterization and/or speaker class; and
adjust one or more speech recognition parameters based on the comparison, wherein the adjustment comprises an application of feature space mapping to the acoustic input, and further wherein the one or more adjusted speech recognition parameters are used to adapt a speech recognition module of a speech recognition system to use an acoustic model that is consistent with the acoustic channel characterization and/or speaker class so that the selected acoustic model is used for decoding subsequent acoustic input provided by the user as the conversation progresses.
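The claim names "feature space mapping" without fixing its form. As a minimal sketch, assuming a per-dimension affine map (a scale and an offset for each feature dimension) estimated so that the statistics of the user's input match those of a reference acoustic representation, the adjustment step might look like the following. The function names `estimate_diagonal_map` and `apply_map`, and the choice of a diagonal affine map, are illustrative assumptions and are not taken from the patent.

```python
from statistics import fmean, pstdev


def estimate_diagonal_map(input_frames, reference_frames, eps=1e-8):
    """Estimate a per-dimension (scale, offset) pair.

    Assumption: the mapping is y = scale * x + offset in each dimension,
    chosen so the mapped input has the same mean and standard deviation
    as the reference frames. The patent does not specify this form.
    """
    dims = len(input_frames[0])
    params = []
    for d in range(dims):
        x = [frame[d] for frame in input_frames]
        r = [frame[d] for frame in reference_frames]
        scale = pstdev(r) / max(pstdev(x), eps)  # eps guards against zero variance
        offset = fmean(r) - scale * fmean(x)
        params.append((scale, offset))
    return params


def apply_map(frames, params):
    """Apply the per-dimension affine map to every feature frame."""
    return [[a * v + b for v, (a, b) in zip(frame, params)] for frame in frames]
```

In this sketch the estimated pairs play the role of the adjusted speech recognition parameters: they would be computed once from the response to the prompt and then applied to later input from the same user.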
Abstract
A method includes the following steps. An acoustic input is obtained from a user, including issuing a verbal prompt to the user and receiving the acoustic input from the user in response to the verbal prompt. One or more acoustic representations are obtained, wherein the one or more acoustic representations are generated from a list of expected responses to the issued verbal prompt. The acoustic input from the user is compared to the one or more acoustic representations. One or more speech recognition parameters are adjusted based on the comparison.
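The steps of the abstract can be sketched end to end as follows. This is a toy illustration under stated assumptions, not the patented implementation: the lexicon `LEXICON`, the per-class phoneme templates `TEMPLATES`, the two-dimensional features, the class names, and the use of dynamic time warping as the comparison are all hypothetical choices made for the example.

```python
import math

# Hypothetical pronunciation lexicon for the expected responses.
LEXICON = {"yes": ["y", "eh", "s"], "no": ["n", "ow"]}

# Hypothetical feature template per phoneme, one set per channel/speaker class.
TEMPLATES = {
    "wideband": {"y": [1.0, 0.2], "eh": [0.8, 0.6], "s": [0.1, 0.9],
                 "n": [0.5, 0.1], "ow": [0.9, 0.3]},
    "telephone": {"y": [0.5, 0.1], "eh": [0.4, 0.3], "s": [0.05, 0.45],
                  "n": [0.25, 0.05], "ow": [0.45, 0.15]},
}


def acoustic_representations(expected_responses):
    """Build one frame sequence per (class, expected response) pair."""
    reps = {}
    for cls, table in TEMPLATES.items():
        for response in expected_responses:
            phonemes = LEXICON[response]  # phoneme sequence for this response
            reps[(cls, response)] = [table[p] for p in phonemes]
    return reps


def dtw_distance(a, b):
    """Dynamic time warping cost between two frame sequences."""
    inf = float("inf")
    cost = [[inf] * (len(b) + 1) for _ in range(len(a) + 1)]
    cost[0][0] = 0.0
    for i in range(1, len(a) + 1):
        for j in range(1, len(b) + 1):
            d = math.dist(a[i - 1], b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[len(a)][len(b)]


def adapt(acoustic_input, expected_responses, params):
    """Compare the input to every representation and adjust the parameters."""
    reps = acoustic_representations(expected_responses)
    cls, response = min(reps, key=lambda key: dtw_distance(acoustic_input, reps[key]))
    adjusted = dict(params, acoustic_model=cls)  # select the matching acoustic model
    return adjusted, response
```

For example, calling `adapt` with frames that resemble the "telephone" templates for "no" and with initial parameters `{"acoustic_model": "wideband"}` would switch the selected model to "telephone" for the rest of the conversation.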
19 Claims
1. An apparatus, comprising:
a memory; and
a processor operatively coupled to the memory and configured to:
receive one or more speech recognition parameters prior to issuing a verbal prompt to a user;
issue the verbal prompt to the user;
receive an acoustic input from the user in response to the verbal prompt;
process one or more sequences of phonemes to obtain one or more acoustic representations, wherein the one or more sequences of phonemes are generated from a list of expected responses to the issued verbal prompt;
compare the acoustic input from the user to the one or more acoustic representations to determine an acoustic channel characterization and/or speaker class; and
adjust one or more speech recognition parameters based on the comparison, wherein the adjustment comprises an application of feature space mapping to the acoustic input, and further wherein the one or more adjusted speech recognition parameters are used to adapt a speech recognition module of a speech recognition system to use an acoustic model that is consistent with the acoustic channel characterization and/or speaker class so that the selected acoustic model is used for decoding subsequent acoustic input provided by the user as the conversation progresses.
Dependent claims: 2, 3, 4, 5, 6, 7, 18, 19
8. An article of manufacture comprising a computer readable storage medium for storing computer readable program code which, when executed, causes a computer to:
receive one or more speech recognition parameters prior to issuing a verbal prompt to a user;
issue the verbal prompt to the user;
receive an acoustic input from the user in response to the verbal prompt;
process one or more sequences of phonemes to obtain one or more acoustic representations, wherein the one or more sequences of phonemes are generated from a list of expected responses to the issued verbal prompt;
compare the acoustic input from the user to the one or more acoustic representations to determine an acoustic channel characterization and/or speaker class; and
adjust one or more speech recognition parameters based on the comparison, wherein the adjustment comprises an application of feature space mapping to the acoustic input, and further wherein the one or more adjusted speech recognition parameters are used to adapt a speech recognition module of a speech recognition system to use an acoustic model that is consistent with the acoustic channel characterization and/or speaker class so that the selected acoustic model is used for decoding subsequent acoustic input provided by the user as the conversation progresses.
Dependent claims: 9, 10, 11, 12, 13, 14, 15, 16, 17
Specification