System and method for accented modification of a language model
First Claim
Patent Images
1. A method for modifying a language model, the method comprising the steps of:
- identifying accented speech pronunciations of words of a language;
identifying pronunciation differences between customary speech pronunciations and the accented speech pronunciations;
identifying, for each of said pronunciation differences, a first list of words in the language model that instantiate said pronunciation differences;
selectively adding the first list of words and their accented speech pronunciations to an accented speech file; and
modifying the language model according to the accent speech file.
8 Assignments
0 Petitions
Accused Products
Abstract
A system and method for a speech recognition technology that allows language models for a particular language to be customized through the addition of alternate pronunciations that are specific to the accent of the dictator, for a subset of the words in the language model. The system includes the steps of identifying the pronunciation differences that are best handled by modifying the pronunciations of the language model, identifying target words in the language model for pronunciation modification, and creating a accented speech file used to modify the language model.
-
Citations
29 Claims
-
1. A method for modifying a language model, the method comprising the steps of:
-
identifying accented speech pronunciations of words of a language;
identifying pronunciation differences between customary speech pronunciations and the accented speech pronunciations;
identifying, for each of said pronunciation differences, a first list of words in the language model that instantiate said pronunciation differences;
selectively adding the first list of words and their accented speech pronunciations to an accented speech file; and
modifying the language model according to the accent speech file. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method for modifying a language model, the method comprising the steps of:
-
identifying accented speech pronunciations of a language;
identifying pronunciation differences between customary speech pronunciations and the accented speech pronunciations;
identifying, for each of said pronunciation differences, words in the language model that instantiate said pronunciation differences;
adding said words and said accented speech pronunciations corresponding to said words to an accented speech file according to a predetermined category; and
modifying the language model according to the accent speech file. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A method for customizing a language model for accented speakers, the method comprising the steps of:
-
identifying an accent;
determining pronunciation differences between the identified accent and the language model;
selecting a first subset of the pronunciation differences based on a first set of pre-determined criteria;
listing a first set of instantiations based on said first subset;
compiling an accent speech word list from the first set of instantiations;
determining accent-specific pronunciations corresponding to words in the accent speech word list; and
applying the accented speech word list and the accent-specific pronunciations to the language model. - View Dependent Claims (21, 22, 23, 24, 25, 26, 27, 28)
-
-
29. A method for modifying a language model, the method comprising the steps of:
-
identifying accented speech pronunciations of words of a language;
identifying pronunciation differences between customary speech pronunciations and the accented speech pronunciations;
identifying, for each of said pronunciation differences, a first list of words in the language model that instantiate said pronunciation differences;
selectively adding the first list of words and their accented speech pronunciations to an accented speech file;
selectively reducing the first list to a second list of words that are most frequently used in the language model;
selectively adding the second list of words and their accented speech pronunciations to the accented speech file;
selectively reducing the second list to a third list of words, wherein said third list includes words that intrude on other words if they are not given accented speech pronunciations;
selectively adding the third list of words and their accented speech pronunciations to the accented speech file;
selectively reducing the third list to a forth list of short words;
selectively adding the fourth list of words and their accented speech pronunciations to the accented speech file;
selectively reducing the fourth list to a fifth list of words with unrecognizable accented speech pronunciations;
selectively adding the fifth list of words and their accented speech pronunciations to the accented speech file; and
modifying the language model according to the accented speech file.
-
Specification