System and method for measuring confusion among words in an adaptive speech recognition system
First Claim
1. A method of measuring confusion between word sequences in a word sequence recognition system, comprising:
- having a new word sequence entered into an electronic device;
creating a new transcription of the new word sequence using a pronunciation-modeling system;
computing a distance between the new transcription and at least one prior transcription of a prior word sequence stored in a database if such a prior transcription exists; and
if the computed distance is less than a predefined threshold, informing a user of a potential confusion between the new word sequence and the prior word sequence.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and method are proposed for measuring confusability or similarity between given entry pairs, including text string pairs and acoustic model pairs, in systems such as speech recognition and synthesis systems. A string edit distance (Levenshiten distance) can be applied to measure distance between any pair of text strings. It also can be used to calculate a confusion measurement between acoustic model pairs of different words and a model-driven method can be used to calculate a HMM model confusion matrix. This model-based approach can be efficiently calculated with low memory and low computational resources. Thus it can improve the speech recognition performance and models trained from text corpus.
68 Citations
20 Claims
-
1. A method of measuring confusion between word sequences in a word sequence recognition system, comprising:
-
having a new word sequence entered into an electronic device;
creating a new transcription of the new word sequence using a pronunciation-modeling system;
computing a distance between the new transcription and at least one prior transcription of a prior word sequence stored in a database if such a prior transcription exists; and
if the computed distance is less than a predefined threshold, informing a user of a potential confusion between the new word sequence and the prior word sequence. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A computer program product for measuring confusion between word sequences in a word sequence recognition system, comprising:
-
computer code for having a new word sequence entered into an electronic device;
computer code for creating a new transcription of the new word sequence using a pronunciation-modeling system;
computer code for computing a distance between the new transcription and at least one prior transcription of a prior word sequence stored in a database if such a prior transcription exists; and
computer code for, if the computed distance is less than a predefined threshold, informing a user of a potential confusion between the new word sequence and the prior word sequence. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. An electronic device, comprising:
-
a processor; and
a memory unit communicatively connected to the processor and including a computer program product for measuring confusion between word sequences in a word sequence recognition system, the computer program product including;
computer code for having a new word sequence entered into the electronic device;
computer code for creating a new transcription of the new word sequence using a pronunciation-modeling system;
computer code for computing a distance between the new transcription and at least one prior transcription of a prior word sequence stored in a database if such a prior transcription exists; and
computer code for, if the computed distance is less than a predefined threshold, informing a user of a potential confusion between the new word sequence and the at least one prior word sequence. - View Dependent Claims (18, 19, 20)
-
Specification