Efficient empirical computation and utilization of acoustic confusability
First Claim
1. In a computer implemented method for determining an empirically derived acoustic confusability measure, an iterative method for development of a probability model family Π
- ={p(d|t)}, comprising;
providing a recognized corpus;
establishing a termination condition which depends on any of;
a number of iterations executed; and
closeness of match between a previous and current probability family models;
defining a family of decoding costs;
setting an iteration count to 0;
setting a phoneme pair count to 0;
for each entry in the recognized corpus, performing the following steps;
constructing a lattice;
populating lattice arcs with values drawn from a current family of decoding costs;
applying a Bellman-Ford dynamic programming algorithm, or a Dijkstra'"'"'s shortest path algorithm, to find a shortest path through said lattice, from a source node to a terminal node; and
traversing said determined shortest path, wherein for each arc that is traversed, the phoneme pair count is incremented by 1;
for each transcription, computing a confidence score which is the sum of a phoneme pair value over all transcriptions paired with an utterance;
estimating said probability model family;
if the iteration count exceeds 0, testing said termination condition;
if said termination condition is satisfied, returning a desired probability model family and stopping;
if said termination condition is not satisfied, defining a new family of decoding costs and therefrom a new probability model family; and
incrementing said iteration count and repeating.
1 Assignment
0 Petitions
Accused Products
Abstract
Efficient empirical determination, computation, and use of an acoustic confusability measure comprises: (1) an empirically derived acoustic confusability measure, comprising a means for determining the acoustic confusability between any two textual phrases in a given language, where the measure of acoustic confusability is empirically derived from examples of the application of a specific speech recognition technology, where the procedure does not require access to the internal computational models of the speech recognition technology, and does not depend upon any particular internal structure or modeling technique, and where the procedure is based upon iterative improvement from an initial estimate; (2) techniques for efficient computation of empirically derived acoustic confusability measure, comprising means for efficient application of an acoustic confusability score, allowing practical application to very large-scale problems; and (3) a method for using acoustic confusability measures to make principled choices about which specific phrases to make recognizable by a speech recognition application.
-
Citations
7 Claims
-
1. In a computer implemented method for determining an empirically derived acoustic confusability measure, an iterative method for development of a probability model family Π
- ={p(d|t)}, comprising;
providing a recognized corpus; establishing a termination condition which depends on any of; a number of iterations executed; and closeness of match between a previous and current probability family models; defining a family of decoding costs; setting an iteration count to 0; setting a phoneme pair count to 0; for each entry in the recognized corpus, performing the following steps; constructing a lattice; populating lattice arcs with values drawn from a current family of decoding costs; applying a Bellman-Ford dynamic programming algorithm, or a Dijkstra'"'"'s shortest path algorithm, to find a shortest path through said lattice, from a source node to a terminal node; and traversing said determined shortest path, wherein for each arc that is traversed, the phoneme pair count is incremented by 1; for each transcription, computing a confidence score which is the sum of a phoneme pair value over all transcriptions paired with an utterance; estimating said probability model family; if the iteration count exceeds 0, testing said termination condition; if said termination condition is satisfied, returning a desired probability model family and stopping; if said termination condition is not satisfied, defining a new family of decoding costs and therefrom a new probability model family; and incrementing said iteration count and repeating. - View Dependent Claims (2, 3, 4, 5, 6, 7)
- ={p(d|t)}, comprising;
Specification