Dynamic time warping using frequency distributed distance measures
First Claim
1. A method comprisingmeasuring distances between vectors that represent an utterance and vectors that represent a template, generating additional information including a frequency measure based on the measured distances, the additional information indicative of how well the vectors of the utterance match the vectors of the template, and making a matching decision based on the measured distances and on the generated additional information.
8 Assignments
0 Petitions
Accused Products
Abstract
Distances are measured between vectors representing speech and a stored reference template. Frequency distributions of the distance measurements are generated by counting how many times a particular reference template resulted in the lowest local distance. The numbers in the counters indicate regions (successive vectors) in a reference template that are good matches for speech input.
23 Citations
30 Claims
-
1. A method comprising
measuring distances between vectors that represent an utterance and vectors that represent a template, generating additional information including a frequency measure based on the measured distances, the additional information indicative of how well the vectors of the utterance match the vectors of the template, and making a matching decision based on the measured distances and on the generated additional information.
-
8. A method comprising
using dynamic time warping to generate a score representing distances between vectors that represent an utterance and vectors that represent a template, producing a distribution of values associated with the vectors of the template, the values indicating the frequency with which reference template vectors produce a minimum local distance in matching with vectors of the utterance, and rescoring the score based on a spikiness of the distribution and on how well the entire set of vectors representing the template are used in the matching.
-
9. A medium carrying instructions adapted to cause a machine to
measure distances between vectors that represent an utterance and vectors that represent a template, generate additional information including a frequency measure based on the measured distances, the additional information indicative of how well the vectors of the utterance match the vectors of the template, and make a matching decision based on the measured distances and on the generated additional information.
-
10. Apparatus comprising
an input port connected to receive digitized speech, and a speech recognizer configured to measure distances between vectors that represent an utterance and vectors that represent a template, generate additional information including a frequency measure based on the measured distances, the additional information indicative of how well the vectors of the utterance match the vectors of the template, and make a matching decision based on the measured distances and on the generated additional information.
-
14. Apparatus comprising
means for measuring distances between vectors that represent an utterance and vectors that represent a template, means for generating additional information including a frequency measure based on the measured distances, the additional information indicative of how well the vectors of the utterance match the vectors of the template, and means for making a matching decision based on the measured distances and on the generated additional information.
-
21. A method comprising
measuring distances between vectors that represent an utterance and vectors that represent a template, and generating additional information including a distribution of values associated with the vectors of the template, the values indicating the frequency with which reference template vectors produce a minimum local distance in matching with vectors of the utterance, the information indicative of how well the vectors of the utterance match the vectors of the template.
-
28. A method comprising
using dynamic time warping to generate a score representing distances between vectors that represent an utterance and vectors that represent a template, producing a distribution of values associated with the distances between vectors that represent an utterance and vectors that represent a template, and rescoring the score based on a spikiness of the distribution and on how well the entire set of vectors representing the template are used in the matching.
-
30. A medium adapted to carry instructions causing a machine to:
-
use dynamic time warping to generate a score representing distances between vectors that represent an utterance and vectors that represent a template, produce a distribution of values associated with the vectors of the template, the values indicating the frequency with which reference template vectors produce a minimum local distance in matching with vectors of the utterance, and rescore the score based on a spikiness of the distribution and on how well the entire set of vectors representing the template are used in the matching.
-
Specification