Efficient empirical determination, computation, and use of acoustic confusability measures
First Claim
1. A method for generating an acoustic confusability measure, comprising:
- recognizing, via a speech recognition system, at least one utterance within a corpus of utterances, said corpus comprising utterances with corresponding reliable transcriptions, each reliable transcription having at least one associated reliable phoneme sequence, to yield at least one recognized word sequence, each said recognized word sequence having at least one associated recognized phoneme sequence; and
generating an empirically derived acoustic confusability measure from analysis of pairs of phoneme sequences, each said pair of phoneme sequences comprising one said associated recognized phoneme sequence and one said associated reliable phoneme sequence, wherein constituents of each said pair of phoneme sequences are associated with a common utterance within said corpus of utterances, said empirically derived acoustic confusability measure comprising a family of probability models over a phoneme alphabet, wherein the family of probability models is based in part on Laplace'"'"'s law of succession.
1 Assignment
0 Petitions
Accused Products
Abstract
Efficient empirical determination, computation, and use of an acoustic confusability measure comprises: (1) an empirically derived acoustic confusability measure, comprising a means for determining the acoustic confusability between any two textual phrases in a given language, where the measure of acoustic confusability is empirically derived from examples of the application of a specific speech recognition technology, where the procedure does not require access to the internal computational models of the speech recognition technology, and does not depend upon any particular internal structure or modeling technique, and where the procedure is based upon iterative improvement from an initial estimate; (2) techniques for efficient computation of empirically derived acoustic confusability measure, comprising means for efficient application of an acoustic confusability score, allowing practical application to very large-scale problems; and (3) a method for using acoustic confusability measures to make principled choices about which specific phrases to make recognizable by a speech recognition application.
167 Citations
3 Claims
-
1. A method for generating an acoustic confusability measure, comprising:
-
recognizing, via a speech recognition system, at least one utterance within a corpus of utterances, said corpus comprising utterances with corresponding reliable transcriptions, each reliable transcription having at least one associated reliable phoneme sequence, to yield at least one recognized word sequence, each said recognized word sequence having at least one associated recognized phoneme sequence; and generating an empirically derived acoustic confusability measure from analysis of pairs of phoneme sequences, each said pair of phoneme sequences comprising one said associated recognized phoneme sequence and one said associated reliable phoneme sequence, wherein constituents of each said pair of phoneme sequences are associated with a common utterance within said corpus of utterances, said empirically derived acoustic confusability measure comprising a family of probability models over a phoneme alphabet, wherein the family of probability models is based in part on Laplace'"'"'s law of succession. - View Dependent Claims (2, 3)
-
Specification