Methods, apparatus and computer programs for automatic speech recognition
Abstract
An automatic speech recognition (ASR) system includes a speech-responsive application and a recognition engine. The ASR system generates user prompts to elicit certain spoken inputs, and the speech-responsive application performs operations when the spoken inputs are recognised. The recognition engine compares sounds within an input audio signal with phones within an acoustic model, to identify candidate matching phones. A recognition confidence score is calculated for each candidate matching phone, and the confidence scores are used to help identify one or more likely sequences of matching phones that appear to match a word within the grammar of the speech-responsive application. The per-phone confidence scores are evaluated against predefined confidence score criteria (for example, identifying scores below a ‘low confidence’ threshold) and the results of the evaluation are used to influence subsequent selection of user prompts. One such system uses confidence scores to select prompts for targeted recognition training, encouraging input of sounds identified as having low confidence scores. Another system selects prompts to discourage input of sounds that were not easily recognised.
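The threshold evaluation and the two prompt-selection strategies described above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the threshold value of 0.5, the `Prompt` type, the function names `low_confidence_phones` and `select_prompt`, the mode names, and the example phone labels and prompt texts are all hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, FrozenSet, List, Set

# Assumed 'low confidence' threshold; the source does not give a value.
LOW_CONFIDENCE_THRESHOLD = 0.5


@dataclass(frozen=True)
class Prompt:
    text: str
    # Phones that the expected spoken reply is likely to contain (hypothetical).
    expected_phones: FrozenSet[str]


def low_confidence_phones(scores: Dict[str, float],
                          threshold: float = LOW_CONFIDENCE_THRESHOLD) -> Set[str]:
    """Return the phones whose confidence score falls below the threshold."""
    return {phone for phone, score in scores.items() if score < threshold}


def select_prompt(prompts: List[Prompt], weak: Set[str], mode: str) -> Prompt:
    """Pick a prompt based on how many weak phones its expected reply contains.

    mode="train": prefer replies that exercise the weak phones.
    mode="avoid": prefer replies that contain as few weak phones as possible.
    """
    def overlap(prompt: Prompt) -> int:
        return len(prompt.expected_phones & weak)

    if mode == "train":
        return max(prompts, key=overlap)
    if mode == "avoid":
        return min(prompts, key=overlap)
    raise ValueError(f"unknown mode: {mode!r}")


# Example: the vowel "eh" was poorly recognised in a previous "yes".
scores = {"y": 0.91, "eh": 0.34, "s": 0.88}
prompts = [
    Prompt("Please say 'yes' or 'no'.", frozenset({"y", "eh", "s", "n", "ow"})),
    Prompt("Please say 'right' or 'wrong'.", frozenset({"r", "ay", "t", "ao", "ng"})),
]
weak = low_confidence_phones(scores)
training_prompt = select_prompt(prompts, weak, "train")
avoiding_prompt = select_prompt(prompts, weak, "avoid")
```

In this example the training strategy picks the yes/no prompt, because its expected reply contains the weak phone, while the avoiding strategy picks the right/wrong prompt, whose expected reply does not.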
13 Claims
1. A method for controlling operation of an automatic speech recognition (ASR) system, comprising the steps of:
comparing sounds within an input audio signal with phones within an acoustic model to identify candidate matching phones;
calculating recognition confidence scores for individual candidate matching phones;
evaluating the recognition confidence scores to identify at least one of the candidate matching phones having a predefined recognition confidence characteristic; and
selecting a user prompt for eliciting a subsequent user input, wherein the selection is dependent on the identified at least one phone and the recognition confidence characteristic of the identified at least one phone.
13. An automatic speech recognition system comprising a speech-responsive application program and a speech recognition engine, the speech recognition system comprising:
program code for comparing an input audio signal with phones within an acoustic model to identify candidate matching phones;
program code for calculating recognition confidence scores for each of the candidate matching phones;
program code for evaluating the recognition confidence scores for the candidate matching phones to identify at least one phone having a predefined recognition confidence characteristic; and
program code, responsive to the identified at least one phone and responsive to the recognition confidence characteristic of the identified at least one phone, for selecting a user prompt to elicit a subsequent user input.
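As a rough illustration of how the four pieces of program code recited in claim 13 might fit together, the sketch below chains them end to end. The claims do not say how a confidence score is computed; purely for illustration, it is assumed here to be a softmax-normalised value over per-phone acoustic log-likelihoods for each audio segment. The function names, the 0.6 threshold, the `TRAINING_PROMPTS` table, and the default prompt are likewise hypothetical.

```python
import math
from typing import Dict, List, Tuple

# Hypothetical table mapping a phone to a prompt whose expected reply contains it.
TRAINING_PROMPTS: Dict[str, str] = {
    "eh": "Please say 'yes'.",
    "ow": "Please say 'no'.",
}
DEFAULT_PROMPT = "What would you like to do next?"


def phone_confidences(log_likelihoods: Dict[str, float]) -> Dict[str, float]:
    """Assumed scoring: softmax over the acoustic log-likelihoods of one segment."""
    peak = max(log_likelihoods.values())  # subtract the peak for numerical stability
    weights = {p: math.exp(v - peak) for p, v in log_likelihoods.items()}
    total = sum(weights.values())
    return {p: w / total for p, w in weights.items()}


def best_candidates(segments: List[Dict[str, float]]) -> List[Tuple[str, float]]:
    """For each segment, return the best-matching phone and its confidence."""
    result = []
    for segment in segments:
        confidences = phone_confidences(segment)
        phone = max(confidences, key=confidences.get)
        result.append((phone, confidences[phone]))
    return result


def choose_prompt(candidates: List[Tuple[str, float]], threshold: float = 0.6) -> str:
    """Select a prompt that depends on the weakest phone below the threshold."""
    weak = [(conf, phone) for phone, conf in candidates if conf < threshold]
    if not weak:
        return DEFAULT_PROMPT
    _, weakest_phone = min(weak)
    return TRAINING_PROMPTS.get(weakest_phone, DEFAULT_PROMPT)


# Three audio segments with made-up log-likelihoods for competing phones.
segments = [
    {"y": -1.0, "w": -4.0},
    {"eh": -2.0, "ih": -2.2},  # close competitors, so confidence is low
    {"s": -0.5, "z": -3.5},
]
candidates = best_candidates(segments)
next_prompt = choose_prompt(candidates)
```

With these inputs the middle segment is matched to "eh" with a confidence of roughly 0.55, which is below the assumed threshold, so the prompt associated with that phone is selected.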
Specification