Speech recognition system having multiple speech recognizers
First Claim
1. A speech recognition system for recognizing an input speech signal, the speech recognition system comprising:
- a first speech recognizer recognizing the input speech signal to generate a first speech text and a first confidence score indicating a level of accuracy of the first speech text;
a second speech recognizer recognizing the input speech signal to generate a second speech text and a second confidence score indicating a level of accuracy of the second speech text;
a computerized decision module coupled to the first speech recognizer and the second speech recognizer for selecting either the first speech text or the second speech text as an output speech text, whereinthe decision module receives external data selected from a group consisting of location information of a speaker of the input speech signal, the accent of the speaker, and the identity of the speaker;
the decision module adjusts the first confidence score to generate a first adjusted confidence score based upon the external data; and
the decision module selects the first speech text if the first adjusted confidence score is higher than the second confidence score.
2 Assignments
0 Petitions
Accused Products
Abstract
A speech recognition system recognizes an input speech signal by using a first speech recognizer and a second speech recognizer each coupled to a decision module. Each of the first and second speech recognizers outputs first and second recognized speech texts and first and second associated confidence scores, respectively, and the decision module selects either the first or the second speech text depending upon which of the first or second confidence score is higher. The decision module may also adjust the first and second confidence scores to generate first and second adjusted confidence scores, respectively, and select either the first or second speech text depending upon which of the first or second adjusted confidence scores is higher. The first and second confidence scores may be adjusted based upon the location of a speaker, the identity or accent of the speaker, the context of the speech, and the like.
-
Citations
26 Claims
-
1. A speech recognition system for recognizing an input speech signal, the speech recognition system comprising:
-
a first speech recognizer recognizing the input speech signal to generate a first speech text and a first confidence score indicating a level of accuracy of the first speech text; a second speech recognizer recognizing the input speech signal to generate a second speech text and a second confidence score indicating a level of accuracy of the second speech text; a computerized decision module coupled to the first speech recognizer and the second speech recognizer for selecting either the first speech text or the second speech text as an output speech text, wherein the decision module receives external data selected from a group consisting of location information of a speaker of the input speech signal, the accent of the speaker, and the identity of the speaker; the decision module adjusts the first confidence score to generate a first adjusted confidence score based upon the external data; and the decision module selects the first speech text if the first adjusted confidence score is higher than the second confidence score. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A computer-implemented method of recognizing an input speech signal to generate an output speech text, the method comprising:
-
recognizing the input speech signal using a first speech recognizer to generate a first speech text and a first confidence score indicating a level of accuracy of the first speech text; recognizing the input speech signal using a second speech recognizer to generate a second speech text and a second confidence score indicating a level of accuracy of the second speech text; receiving external data selected from a group consisting of location information of a speaker of the input speech signal, the accent of the speaker, and the identity of the speaker; adjusting the first confidence score to generate a first adjusted confidence score based upon the external data; and selecting the first speech text as the output speech text if the first adjusted confidence score is higher than the second confidence score. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A computerized decision module for use in a speech recognition system that recognizes an input speech signal to generate an output speech text by using a first speech recognizer and a second speech recognizer, the first speech recognizer recognizing the input speech signal to generate a first speech text and a first confidence score, and the second speech recognizer recognizing the input speech signal to generate a second speech text and a second confidence score, wherein:
-
the computerized decision module is coupled to the first speech recognizer and the second speech recognizer to select either the first speech text or the second speech text as the output speech text; the decision module receives external data selected from a group consisting of location information of a speaker of the input speech signal, the accent of the speaker, and the identity of the speaker; the decision module adjusts the first confidence score to generate a first adjusted confidence score based upon the external data; and the decision module selects the first speech text if the first adjusted confidence score is higher than the second confidence score. - View Dependent Claims (21, 22, 23, 24, 25, 26)
-
Specification