Generating a task-adapted acoustic model from one or more different corpora
Abstract
The present invention generates a task-dependent acoustic model from a supervised task-independent corpus and further adapts it with an unsupervised task-dependent corpus. The task-independent corpus includes task-independent training data, which has an acoustic representation of words and a sequence of transcribed words corresponding to the acoustic representation. A relevance measure is defined for each of the words in the task-independent data. The relevance measure is used to weight the data associated with each of the words in the task-independent training data. The task-dependent acoustic model is then trained based on the weighted data for the words in the task-independent training data.
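The weighting step described in the abstract can be illustrated with a toy sketch. It is not the patented method: the relevance function (1.0 for task words, a fixed floor otherwise), the per-phone scalar mean standing in for acoustic model parameters, and all names (`relevance`, `train_weighted_means`, `floor`, the sample corpus) are assumptions made only for illustration.

```python
from collections import defaultdict


def relevance(word, task_vocab, floor=0.1):
    """Hypothetical relevance measure: 1.0 for task words, `floor` otherwise."""
    return 1.0 if word in task_vocab else floor


def train_weighted_means(corpus, task_vocab, floor=0.1):
    """Estimate one mean per phone, weighting each sample by its word's relevance.

    corpus: iterable of (word, phone, feature) tuples, where feature is a float.
    This toy statistic stands in for real acoustic model parameters.
    """
    num = defaultdict(float)  # relevance-weighted sum of features per phone
    den = defaultdict(float)  # sum of relevance weights per phone
    for word, phone, feature in corpus:
        w = relevance(word, task_vocab, floor)
        num[phone] += w * feature
        den[phone] += w
    # Skip phones whose total weight is zero (seen only in zero-weight words).
    return {p: num[p] / den[p] for p in num if den[p] > 0}


# Toy task-independent corpus: (transcribed word, phone, scalar feature).
corpus = [
    ("call", "k", 2.0),
    ("cat", "k", 4.0),
    ("call", "ao", 1.0),
]

# "call" is relevant to the task; "cat" contributes with a reduced weight.
td_means = train_weighted_means(corpus, task_vocab={"call"}, floor=0.5)
# With every word equally relevant, the estimate is the plain average.
ti_means = train_weighted_means(corpus, task_vocab={"call", "cat"})
```

In this sketch the estimate for phone `k` moves toward the samples from the task-relevant word as the floor weight for other words shrinks, which is the effect the abstract attributes to weighting the training data.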
21 Claims
-
1. A method of generating a task-dependent acoustic model from a task-independent (TI) training corpus that includes an acoustic representation of an utterance and a sequence of transcribed words corresponding to the acoustic representation, the method comprising:
deriving a task relevance measure for each word in the TI training corpus, indicative of a relevance of the words to a task;
training a task-independent (TI) AM based on the TI training corpus, the TI AM including words from the TI training corpus and associated AM parameters; and
generating a task-dependent (TD) acoustic model (AM) based on the TI AM trained from the TI training corpus and the task relevance measures for the words in the TI training corpus.
(Dependent claims: 5, 6, 7, 8, 9, 10)
-
2. (canceled)
-
3. (canceled)
-
4. (canceled)
-
11. (canceled)
-
12. (canceled)
-
13. (canceled)
-
14. A system for generating a task-dependent (TD) acoustic model (AM) from a task-independent (TI) training corpus, comprising:
a task relevance generator receiving a task input indicative of words relevant to a task and configured to generate a relevance measure for each word in the TI training corpus based on the task input; and
an AM generator, coupled to the TI training corpus and the task relevance generator and configured to generate the TD AM based on the TI training corpus and the relevance measure.
(Dependent claim: 15)
-
16. (canceled)
-
17. (canceled)
-
18. (canceled)
-
19. (canceled)
-
20. (canceled)
-
21. (canceled)
Specification