Automatic reading tutoring with parallel polarized language modeling
First Claim
1. A computer-implemented method comprising:
- providing a user option to tune a weighting parameter, and responding to tuning of the weighting parameter by raising or lowering criteria for a general-domain garbage language model to adjust a miscue detection rate and a rate of false alarms;
displaying a text output having target words;
dynamically generating a domain-specific target language model for the text output, the target language model being specific to the text output having the target words and including a language score for the target words of the text output;
receiving an acoustic input;
modeling, using a processor of a computer, the acoustic input with the dynamically generated domain-specific target language model, comprising calculating an acoustic score for the target words with reference to the acoustic input;
further modeling the acoustic input with the general-domain garbage language model to identify an element of the acoustic input as a miscue that does not correspond properly to the target words of the text output; and
providing user-perceptible feedback.
2 Assignments
0 Petitions
Accused Products
Abstract
A novel system for automatic reading tutoring provides effective error detection and reduced false alarms combined with low processing time burdens and response times short enough to maintain a natural, engaging flow of interaction. According to one illustrative embodiment, an automatic reading tutoring method includes displaying a text output and receiving an acoustic input. The acoustic input is modeled with a domain-specific target language model specific to the text output, and with a general-domain garbage language model, both of which may be efficiently constructed as context-free grammars. The domain-specific target language model may be built dynamically or “on-the-fly” based on the currently displayed text (e.g. the story to be read by the user), while the general-domain garbage language model is shared among all different text outputs. User-perceptible tutoring feedback is provided based on the target language model and the garbage language model.
46 Citations
20 Claims
-
1. A computer-implemented method comprising:
-
providing a user option to tune a weighting parameter, and responding to tuning of the weighting parameter by raising or lowering criteria for a general-domain garbage language model to adjust a miscue detection rate and a rate of false alarms; displaying a text output having target words; dynamically generating a domain-specific target language model for the text output, the target language model being specific to the text output having the target words and including a language score for the target words of the text output; receiving an acoustic input; modeling, using a processor of a computer, the acoustic input with the dynamically generated domain-specific target language model, comprising calculating an acoustic score for the target words with reference to the acoustic input; further modeling the acoustic input with the general-domain garbage language model to identify an element of the acoustic input as a miscue that does not correspond properly to the target words of the text output; and providing user-perceptible feedback. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A computer-implemented method comprising:
-
receiving a user input indicative of a user-selected order of N-grams; accessing a text sample; identifying a first portion of the text sample, the first portion comprising at least a first phrase of the text sample; displaying the first portion of the text sample to the user and dynamically assembling a domain-specific target language model for the first portion of the text sample while the first portion of the text sample is being displayed to the user, wherein assembling the domain-specific target language model comprises; selecting an order of N-grams based on the user input; and constructing the domain-specific target language model based on the selected order of N-grams; receiving an acoustic input from the user; modeling the acoustic input, using a processor of a computer, with the domain-specific target language model and a general-domain garbage language model to identify elements of the acoustic input as miscues that do not correspond properly to the first portion of the text output; and providing user-perceptible feedback based on modeling the acoustic input with the target language model and modeling the acoustic input with the garbage language model. - View Dependent Claims (13, 14, 15, 16, 17)
-
-
18. A computer-implemented method comprising:
-
retrieving an indication of a text having target words from a data store; calculating a language model score for the target words using a domain-specific target model that is specific to the text having the target words; receiving an acoustic signal via a user input; calculating an acoustic score for the target words with reference to the acoustic signal using the domain-specific target model; obtaining a general-domain garbage model indicative of a set of garbage words including common words in a general domain; obtaining an acoustic score and a language model score for the set of garbage words from the general-domain garbage model; evaluating whether the acoustic signal comprises a mispronunciation with reference to the target words of the text based on a weighted comparison of the acoustic score and the language model score for the target words with the acoustic score and the language model score for the set of garbage words; providing user-perceptible feedback based on the evaluation; and providing a user option to tune a weighting parameter, and responding to tuning of the weighting parameter by raising or lowering criteria for the garbage model to adjust a miscue detection rate and a rate of false alarms. - View Dependent Claims (19, 20)
-
Specification