Computer-Implemented Systems and Methods for Determining an Intelligibility Score for Speech
First Claim
1. A computer-implemented method of generating an intelligibility score for speech of a non-native speaker, comprising:
- receiving a recording of speech of a non-native speaker at a processing system;
identifying words in the speech recording using a computerized automated speech recognizer, wherein the automated speech recognizer provides a string of words identified in the speech recording based on a computerized acoustic model, and wherein the automated speech recognizer further provides an acoustic model likelihood score for each word in the string of words;
for a particular word in the string of words, determining a context metric value with the processing system based upon a usage of the particular word within the string of words;
determining an acoustic score with the processing system for the particular word based on the acoustic model likelihood score for the particular word from the automated speech recognizer;
determining an intelligibility score with the processing system for the particular word based on the acoustic score for the particular word and the context metric value for the particular word; and
determining an overall intelligibility score with the processing system for the string of words based on the intelligibility score for the particular word and intelligibility scores for other words in the string of words.
1 Assignment
0 Petitions
Accused Products
Abstract
Systems and methods are provided for generating an intelligibility score for speech of a non-native speaker. Words in a speech recording are identified using an automated speech recognizer, where the automated speech recognizer provides a string of words identified in the speech recording, and where the automated speech recognizer further provides an acoustic model likelihood score for each word in the string of words. For a particular word in the string of words, a context metric value is determined based upon a usage of the particular word within the string of words. An acoustic score for the particular word is determined based on the acoustic model likelihood score for the particular word from the automated speech recognizer. An intelligibility score is determined for the particular word based on the acoustic score for the particular word and the context metric value for the particular word.
-
Citations
23 Claims
-
1. A computer-implemented method of generating an intelligibility score for speech of a non-native speaker, comprising:
-
receiving a recording of speech of a non-native speaker at a processing system; identifying words in the speech recording using a computerized automated speech recognizer, wherein the automated speech recognizer provides a string of words identified in the speech recording based on a computerized acoustic model, and wherein the automated speech recognizer further provides an acoustic model likelihood score for each word in the string of words; for a particular word in the string of words, determining a context metric value with the processing system based upon a usage of the particular word within the string of words; determining an acoustic score with the processing system for the particular word based on the acoustic model likelihood score for the particular word from the automated speech recognizer; determining an intelligibility score with the processing system for the particular word based on the acoustic score for the particular word and the context metric value for the particular word; and determining an overall intelligibility score with the processing system for the string of words based on the intelligibility score for the particular word and intelligibility scores for other words in the string of words. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. A computer-implemented system for generating an intelligibility score for speech of a non-native speaker, comprising:
-
a processing system; a non-transitory computer-readable medium encoded to contain; a recording of speech of a non-native speaker; an intelligibility score data structure comprising records associated with each word of a string of words, wherein a record for a particular word in the string of words includes fields for storing an acoustic score for the particular word, a context metric value for the particular word, and an intelligibility score for the particular word; instructions for commanding the processing system to execute steps comprising; identifying words in the speech recording using an automated speech recognizer, wherein the automated speech recognizer provides a string of words identified in the speech recording, and wherein the automated speech recognizer further provides an acoustic model likelihood score for each word in the string of words; for the particular word, determining a context metric value based upon a usage of the particular word within the string of words and storing the context metric value in the intelligibility score data structure record for the particular word; determining an acoustic score for the particular word based on the acoustic model likelihood score for the particular word from the automated speech recognizer and storing the acoustic score in the intelligibility score data structure record for the particular word; determining an intelligibility score for the particular word based on the acoustic score for the particular word and the context metric value for the particular word and storing the intelligibility score in the intelligibility score data structure record for the particular word; and determining an overall intelligibility score for the string of words based on the intelligibility score for the particular word and intelligibility scores for other words in the string of words. - View Dependent Claims (19, 20, 21, 22)
-
-
23. A non-transitory computer-readable medium encoded with instructions for commanding a processing system to execute a method of generating an intelligibility score for speech of a non-native speaker, comprising:
-
receiving a recording of speech of a non-native speaker; identifying words in the speech recording using an automated speech recognizer, wherein the automated speech recognizer provides a string of words identified in the speech recording, and wherein the automated speech recognizer further provides an acoustic model likelihood score for each word in the string of words; for a particular word in the string of words, determining a context metric value based upon a usage of the particular word within the string of words; determining an acoustic score for the particular word based on the acoustic model likelihood score for the particular word from the automated speech recognizer; determining an intelligibility score for the particular word based on the acoustic score for the particular word and the context metric value for the particular word; and determining an overall intelligibility score for the string of words based on the intelligibility score for the particular word and intelligibility scores for other words in the string of words.
-
Specification