System and methods for accent classification and adaptation
Abstract
Speech is processed that may be colored by speech accent. A method for recognizing speech includes maintaining a model of speech accent that is established based on training speech data, wherein the training speech data includes at least a first set of training speech data, and wherein establishing the model of speech accent includes not using any phone or phone-class transcription of the first set of training speech data. Related systems are also presented. A system for recognizing speech includes an accent identification module that is configured to identify accent of the speech to be recognized; and a recognizer that is configured to use models to recognize the speech to be recognized, wherein the models include at least an acoustic model that has been adapted for the identified accent using training speech data of a language, other than primary language of the speech to be recognized, that is associated with the identified accent. Related methods are also presented.
34 Claims
1. In an information processing system, a method for recognizing speech to be recognized, the method comprising the steps of:
maintaining a model of speech accent that is established based on training speech data, wherein the training speech data includes at least a first set of training speech data, and wherein establishing the model of speech accent includes not using any phone or phone-class transcription of the first set of training speech data;
deriving features from the speech to be recognized, the features hereinafter referred to as features for identifying accent;
identifying accent of the speech to be recognized based on the features for identifying accent and on the model of speech accent; and
recognizing the speech to be recognized based at least in part on the identified accent of the speech.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14
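As a rough illustration of the first three steps of claim 1, the sketch below fits one diagonal Gaussian per accent to untranscribed feature frames (only an accent label per training set, no phone or phone-class transcription) and then picks the accent whose model gives the highest average log-likelihood. The Gaussian model family, the function names, and the toy two-dimensional features are assumptions made for illustration; the claim does not specify any of them.

```python
import math
from typing import Dict, List, Sequence, Tuple

Frame = Sequence[float]
# Hypothetical accent model: (per-dimension means, per-dimension variances).
AccentModel = Tuple[List[float], List[float]]


def train_accent_model(frames: List[Frame], var_floor: float = 1e-3) -> AccentModel:
    """Fit a diagonal Gaussian to feature frames; no transcription is used."""
    n = len(frames)
    dim = len(frames[0])
    means = [sum(f[d] for f in frames) / n for d in range(dim)]
    variances = [
        # Floor the variance so a constant feature cannot cause division by zero.
        max(sum((f[d] - means[d]) ** 2 for f in frames) / n, var_floor)
        for d in range(dim)
    ]
    return means, variances


def avg_log_likelihood(frames: List[Frame], model: AccentModel) -> float:
    """Average per-frame log-likelihood under a diagonal Gaussian."""
    means, variances = model
    total = 0.0
    for f in frames:
        for x, m, v in zip(f, means, variances):
            total += -0.5 * (math.log(2 * math.pi * v) + (x - m) ** 2 / v)
    return total / len(frames)


def identify_accent(frames: List[Frame], models: Dict[str, AccentModel]) -> str:
    """Return the accent label whose model best explains the frames."""
    return max(models, key=lambda accent: avg_log_likelihood(frames, models[accent]))


# Toy training data: feature frames grouped by accent label only (assumed values).
training = {
    "accent_a": [(0.0, 0.1), (0.2, -0.1), (-0.1, 0.0), (0.1, 0.2)],
    "accent_b": [(3.0, 2.9), (3.2, 3.1), (2.8, 3.0), (3.1, 2.8)],
}
models = {accent: train_accent_model(frames) for accent, frames in training.items()}
identified = identify_accent([(2.9, 3.0), (3.1, 3.2)], models)
```

In a fuller pipeline, the label in `identified` would then inform the final recognizing step, for example by choosing which recognition models to apply.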
15. In an information processing system, a method for recognizing speech to be recognized, the method comprising the steps of:
identifying accent of the speech to be recognized based on information derived from the speech to be recognized; and
evaluating features derived from the speech to be recognized using at least an acoustic model that has been adapted for the identified accent using training speech data from a language, other than primary language of the speech to be recognized, that is associated with the identified accent.
Dependent claims: 16, 17, 18
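One way to picture an acoustic model "adapted for the identified accent" using speech data from another language is mean adaptation of a single Gaussian. The sketch below uses a MAP-style interpolation between a primary-language mean and the sample mean of frames from the accent-associated language. This update rule, the relevance factor `tau`, and all names are assumptions chosen for illustration; the claim does not name an adaptation technique.

```python
from typing import List, Sequence


def map_adapt_mean(
    prior_mean: Sequence[float],
    adaptation_frames: List[Sequence[float]],
    tau: float = 10.0,
) -> List[float]:
    """Shift a Gaussian mean toward adaptation data.

    prior_mean: mean trained on the primary language (assumed given).
    adaptation_frames: frames from the language associated with the accent.
    tau: hypothetical relevance factor; larger values trust the prior more.
    """
    n = len(adaptation_frames)
    if n == 0:
        # No adaptation data: keep the primary-language mean unchanged.
        return list(prior_mean)
    dim = len(prior_mean)
    sample_mean = [sum(f[d] for f in adaptation_frames) / n for d in range(dim)]
    weight = n / (n + tau)  # fraction of the estimate taken from the new data
    return [weight * s + (1.0 - weight) * p for s, p in zip(sample_mean, prior_mean)]


# Ten identical adaptation frames with tau=10 give equal weight to both sources.
adapted = map_adapt_mean([0.0, 0.0], [(1.0, 2.0)] * 10, tau=10.0)
```

With more adaptation frames the weight approaches 1 and the adapted mean moves closer to the accent-associated data; with few frames it stays near the primary-language mean.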
19. A system for recognizing speech to be recognized, the system comprising:
an accent identifier that is configured to identify accent of the speech to be recognized, wherein the accent identifier comprises a model of speech accent that is established based at least in part on using certain training speech data without using any phone or phone-class transcription of the certain training speech data; and
a recognizer that is configured to use models, including a model deemed appropriate for the accent identified by the accent identifier, to recognize the speech to be recognized.
Dependent claims: 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30
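The two components of claim 19 can be sketched as a small composition: an accent identifier that returns a label, and a recognizer that looks up the model deemed appropriate for that label. Everything here is hypothetical, including the class names, the string placeholders standing in for acoustic models, the threshold-based identifier, and the fallback to a default model for unknown accents. No actual decoding is performed; the sketch only shows how the model selection is wired.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Sequence, Tuple


@dataclass
class Recognizer:
    # Maps an accent label to a model; strings stand in for real acoustic models.
    models: Dict[str, str]
    default_accent: str

    def select_model(self, accent: str) -> str:
        """Return the model for the accent, or the default one (an assumption)."""
        return self.models.get(accent, self.models[self.default_accent])


@dataclass
class SpeechSystem:
    identify_accent: Callable[[Sequence[float]], str]
    recognizer: Recognizer

    def recognize(self, features: Sequence[float]) -> Tuple[str, str]:
        """Identify the accent, then report which model would be used."""
        accent = self.identify_accent(features)
        return accent, self.recognizer.select_model(accent)


def toy_identifier(features: Sequence[float]) -> str:
    """Hypothetical identifier: thresholds the mean feature value."""
    return "accent_b" if sum(features) / len(features) > 1.0 else "accent_a"


system = SpeechSystem(
    identify_accent=toy_identifier,
    recognizer=Recognizer(
        models={"accent_a": "am_accent_a", "accent_b": "am_accent_b"},
        default_accent="accent_a",
    ),
)
result = system.recognize([2.0, 3.0])
```

Keeping the identifier as a plain callable means the accent model can be swapped without touching the recognizer, which mirrors the separation between the two claimed components.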
31. A system for recognizing speech to be recognized, the system comprising:
an accent identification module that is configured to identify accent of the speech to be recognized; and
a recognizer that is configured to use models to recognize the speech to be recognized, wherein the models include at least an acoustic model that has been adapted for the identified accent using training speech data of a language, other than primary language of the speech to be recognized, that is associated with the identified accent.
Dependent claims: 32, 33, 34
Specification