Scoring and re-scoring dynamic time warping of speech
First Claim
1. A method comprisingmeasuring first distances between (a) vectors belonging to a set of vectors that represent an utterance and (b) vectors belonging to a set of vectors that represent a template, the measuring being done in accordance with a first order of the utterance vectors and a first order of the template vectors, andmeasuring second distances between (a) vectors belonging to the set of vectors that represent the utterance and (b) vectors belonging to the set of vectors that represent the template, the measuring being done in accordance with a second order of the utterance vectors and a second order of the template vectors,in which the first template vector order and the second template vector order are different and/or the first utterance vector order and the second utterance vector order are different, andin which the measure of first distances is outside of a pre-specifled threshold, and the measure of second distances indicates that the template represents an out-of-vocabulary utterance.
4 Assignments
0 Petitions
Accused Products
Abstract
A method includes (i) measuring first distances between (a) vectors belonging to a set of vectors that represent an utterance and (b) vectors belonging to a set of vectors that represent a template, the measuring being done in accordance with a first order of the utterance vectors a first order of the template vectors, and (ii) measuring second distances between (a) individual vectors belonging to the set of vectors that represent the utterance and (b) individual vectors belonging to the set of vectors that represent the template, the measuring being done in accordance with a second order of the utterance vectors and a second order of the template vectors, and (iii) in which the first template vector order and the second template vector order are different and/or the first utterance vector order and the second utterance vector order are different. In another aspect, a method includes measuring distances between vectors that represent an utterance and vectors that represent a template, generating information indicative of how well the vectors of the utterance match the vectors of the template, and making a matching decision based on the measured distances and on the generated information.
-
Citations
19 Claims
-
1. A method comprising
measuring first distances between (a) vectors belonging to a set of vectors that represent an utterance and (b) vectors belonging to a set of vectors that represent a template, the measuring being done in accordance with a first order of the utterance vectors and a first order of the template vectors, and measuring second distances between (a) vectors belonging to the set of vectors that represent the utterance and (b) vectors belonging to the set of vectors that represent the template, the measuring being done in accordance with a second order of the utterance vectors and a second order of the template vectors, in which the first template vector order and the second template vector order are different and/or the first utterance vector order and the second utterance vector order are different, and in which the measure of first distances is outside of a pre-specifled threshold, and the measure of second distances indicates that the template represents an out-of-vocabulary utterance.
-
8. A method comprising
measuring first distances between (a) vectors belonging to a set of vectors that represent an utterance and (b) vectors belonging to a set of vectors that represent a template, the measuring being done in accordance with a first order of the utterance vectors and a first order of the template vectors, and measuring second distances between (a) vectors belonging to the set of vectors that represent the utterance and (b) vectors belonging to the set of vectors that represent the template, the measuring being done in accordance with a second order of the utterance vectors and a second order of the template vectors, in which the first template vector order and the second template vector order are different and/or the first utterance vector order and the second utterance vector order are different, and in which the method also includes using the measured second distances to reject the template as a match for the utterance even though the first distances imply a match.
-
9. A method comprising
measuring first distances between (a) vectors belonging to a set of vectors that represent an utterance and (b) vectors belonging to a set of vectors that represent a template, the measuring being done in accordance with a first order of the utterance vectors and a first order of the template vectors, and measuring second distances between (a) vectors belonging to the set of vectors that represent the utterance and (b) vectors belonging to the set of vectors that represent the template, the measuring being done in accordance with a second order of the utterance vectors and a second order of the template vectors, in which the first template vector order and the second template vector order are different and/or the first utterance vector order and the second utterance vector order are different, and in which the measuring of the first distances is done in parallel with the measuring of the second distances.
-
10. A method comprising
measuring first distances between (a) vectors belonging to a set of vectors that represent an utterance and (b) vectors belonging to a set of vectors that represent a template, the measuring being done in accordance with a first order of the utterance vectors and a first order of the template vectors, and measuring second distances between (a) vectors belonging to the set of vectors that represent the utterance and (b) vectors belonging to the set of vectors that represent the template, the measuring being done in accordance with a second order of the utterance vectors and a second order of the template vectors, in which the first template vector order and the second template vector order are different and/or the first utterance vector order and the second utterance vector order are different, the method also including measuring third distances between (a) individual vectors belonging to the set of vectors that represent the utterance and (b) individual vectors belonging to the set of vectors that represent the template, the measuring being done in accordance with a third order of the utterance vectors and a third order of the template vectors.
-
12. A method comprising
measuring first distances between (a) vectors belonging to a set of vectors that represent an utterance and (b) vectors belonging to a set of vectors that represent a template, the measuring being done in accordance with a first order of the utterance vectors and a first order of the template vectors, and measuring second distances between (a) vectors belonging to the set of vectors that represent the utterance and (b) vectors belonging to the set of vectors that represent the template, the measuring being done in accordance with a second order of the utterance vectors and a second order of the template vectors, in which the first template vector order and the second template vector order are different and/or the first utterance vector order and the second utterance vector order are different, generating information indicative of how well the vectors of the utterance match the vectors of the template, including producing a distribution of values associated with the vectors of the template, the values indicating the frequency with which reference template vectors produce a minimum local distance in matching with vectors of the utterance, making a matching decision based on the measured distances and on the generated information.
-
16. A method comprising
measuring first distances between (a) vectors belonging to a set of vectors that represent an utterance and (b) vectors belonging to a set of vectors that represent a template, the measuring being done in accordance with a first order of the utterance vectors and a first order of the template vectors, and measuring second distances between (a) vectors belonging to the set of vectors that represent the utterance and (b) vectors belonging to the set of vectors that represent the template, the measuring being done in accordance with a second order of the utterance vectors and a second order of the template vectors, in which the first template vector order and the second template vector order are different and/or the first utterance vector order and the second utterance vector order are different, generating information indicative of how well the vectors of the utterance match the vectors of the template, and making a matching decision based on the measured distances and on the generated information in which the measuring of distances includes generating a raw dynamic time warping score and rescoring the score based on the information indicative of how well the vectors of the utterance match the vectors of the template.
-
18. A method comprising
measuring first distances between (a) vectors belonging to a set of vectors that represent an utterance and (b) vectors belonging to a set of vectors that represent a template, the measuring being done in accordance with a first order of the utterance vectors and a first order of the template vectors, and measuring second distances between (a) vectors belonging to the set of vectors that represent the utterance and (b) vectors belonging to the set of vectors that represent the template, the measuring being done in accordance with a second order of the utterance vectors and a second order of the template vectors, in which the first template vector order and the second template vector order are different and/or the first utterance vector order and the second utterance vector order are different, producing a distribution of values associated with the vectors of the template, the values indicating the frequency with which reference template vectors produce a minimum local distance in matching with vectors of the utterance, and rescoring a score based on a spikiness of the distribution and on how well the entire set of vectors representing the template are used in the matching.
-
19. A method comprising receiving an utterance, forming a time-ordered series of vectors representing the utterance,
using dynamic time warping to score the utterance vectors against time-ordered series of vectors representing templates of possible utterances, the scoring being done using the template vectors and utterance vectors in their time order, using dynamic time warping to additionally score the utterance vectors against the vectors representing templates, the additional scoring being done using a different time-order of vectors of one or both of the template vectors and utterance vectors, and selecting one of the templates as a match for the utterance based on both the scoring and the additional scoring, in which the selecting includes removing from consideration templates for which the scoring indicates a match and the additional scoring indicates that the matching template is out of vocabulary.
Specification