Systems and methods for assessment of non-native spontaneous speech
First Claim
1. A computer-implemented method of assessing speech pronunciation, comprising:
- receiving speech for analysis via a computer-readable storage medium;
performing automatic speech recognition on speech using a processor to generate word hypotheses for the speech, the word hypotheses identifying a set words recognized by an automated speech recognizer in the speech using one or more data processors;
performing time alignment between the speech and the word hypotheses using the automatic speech recognizer to associate the word hypotheses with corresponding sounds of the speech;
calculating statistics regarding individual words and phonemes of the word hypotheses using the processor based on said alignment;
calculating a plurality of features for use in assessing pronunciation of the speech based on the statistics using the processor; and
calculating an assessment score based on one or more of the calculated features.
1 Assignment
0 Petitions
Accused Products
Abstract
Computer-implemented systems and methods are provided for assessing non-native spontaneous speech pronunciation. Speech recognition on digitized speech is performed using a non-native acoustic model trained with non-native speech to generate word hypotheses for the digitized speech. Time alignment is performed between the digitized speech and the word hypotheses using a reference acoustic model trained with native-quality speech. Statistics are calculated regarding individual words and phonemes in the word hypotheses based on the alignment. A plurality of features for use in assessing pronunciation of the speech are calculated based on the statistics, an assessment score is calculated based on one or more of the calculated features, and the assessment score is stored in a computer-readable memory.
-
Citations
33 Claims
-
1. A computer-implemented method of assessing speech pronunciation, comprising:
-
receiving speech for analysis via a computer-readable storage medium; performing automatic speech recognition on speech using a processor to generate word hypotheses for the speech, the word hypotheses identifying a set words recognized by an automated speech recognizer in the speech using one or more data processors; performing time alignment between the speech and the word hypotheses using the automatic speech recognizer to associate the word hypotheses with corresponding sounds of the speech; calculating statistics regarding individual words and phonemes of the word hypotheses using the processor based on said alignment; calculating a plurality of features for use in assessing pronunciation of the speech based on the statistics using the processor; and calculating an assessment score based on one or more of the calculated features. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A computer-implemented system for assessing speech pronunciation, comprising:
-
a processor; a non-transitory computer-readable memory comprising instructions for causing the processor to perform steps including; receiving speech for analysis via a computer-readable storage medium; performing automatic speech recognition on speech using a processor to generate word hypotheses for the speech, the word hypotheses identifying a set words recognized by an automated speech recognizer in the speech using one or more data processors; performing time alignment between the speech and the word hypotheses using the automatic speech recognizer to associate the word hypotheses with corresponding sounds of the speech; calculating statistics regarding individual words and phonemes of the word hypotheses using the processor based on said alignment; calculating a plurality of features for use in assessing pronunciation of the speech based on the statistics using the processor; and calculating an assessment score based on one or more of the calculated features. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
-
-
23. A non-transitory computer-readable memory comprising computer-readable instructions, which when executed cause a processor to perform steps comprising:
-
receiving speech for analysis via a computer-readable storage medium; performing automatic speech recognition on speech using a processor to generate word hypotheses for the speech, the word hypotheses identifying a set words recognized by an automated speech recognizer in the speech using one or more data processors; performing time alignment between the speech and the word hypotheses using the automatic speech recognizer to associate the word hypotheses with corresponding sounds of the speech; calculating statistics regarding individual words and phonemes of the word hypotheses using the processor based on said alignment; calculating a plurality of features for use in assessing pronunciation of the speech based on the statistics using the processor; and calculating an assessment score based on one or more of the calculated features. - View Dependent Claims (24, 25, 26, 27, 28, 29, 30, 31, 32, 33)
-
Specification