PHONETIC DISTANCE MEASUREMENT SYSTEM AND RELATED METHODS
First Claim
Patent Images
1. A method of generating a phonetic distance matrix comprising:
- determining a plurality of error occurrences by comparing a recognized speech file with a reference file;
determining a plurality of error rates corresponding to the plurality of error occurrences;
determining a plurality of phonetic distances as a function of the plurality of error rates; and
outputting a phonetic distance matrix based on the plurality of phonetic distances.
2 Assignments
0 Petitions
Accused Products
Abstract
Phonetic distances are empirically measured as a function of speech recognition engine recognition error rates. The error rates are determined by comparing a recognized speech file with a reference file. The phonetic distances can be normalized to earlier measurements. The phonetic distances/error rates can also be used to improve speech recognition engine grammar selection, as an aid in language training and evaluation, and in other applications.
-
Citations
23 Claims
-
1. A method of generating a phonetic distance matrix comprising:
-
determining a plurality of error occurrences by comparing a recognized speech file with a reference file; determining a plurality of error rates corresponding to the plurality of error occurrences; determining a plurality of phonetic distances as a function of the plurality of error rates; and outputting a phonetic distance matrix based on the plurality of phonetic distances. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A phonetic distance measurement system comprising:
-
a reference file; a recognized speech file; a comparison module configured to determine a plurality of error occurrences by comparing the recognized speech file and the reference file; an error rate module configured to determine a plurality of error rates corresponding to the plurality of error occurrences; and a measurement module configured to determine a plurality of phonetic distances as a function of the plurality of error rates. - View Dependent Claims (14, 15, 16, 17)
-
-
18. A grammar development method for a speech recognition engine, the method comprising:
-
generating a plurality of recognized speech files by processing a plurality of audio files of recorded speech with the speech recognition engine; determining a plurality of substitution, insertion and deletion error occurrences by comparing the plurality of recognized speech files with a plurality of corresponding reference files; determining a plurality of substitution, insertion and deletion error rates from the plurality substitution, insertion and deletion error occurrences; and editing the grammar based on the plurality of error rates. - View Dependent Claims (19)
-
-
20. A language training and evaluation method comprising:
-
generating an audio file of a speaker; generating a recognized speech file by processing the audio file with a speech recognition engine; determining a plurality of substitution error occurrences for a plurality of phonetic element pairs by comparing the recognized speech file with a reference file corresponding to the recognized speech file; determining a plurality of error rates based on the plurality of substitution error occurrences; comparing the plurality of error rates with optimal values; and identifying phonetic element pairs requiring improvement based on a set of results of comparing the plurality of error rates with the optimal values. - View Dependent Claims (21, 22, 23)
-
Specification