Generating a task-adapted acoustic model from one or more different corpora

US 20060036444A1
Filed: 09/29/2005
Published: 02/16/2006
Est. Priority Date: 03/20/2002
Status: Active Grant

First Claim

Patent Images

1. A method of generating a task-dependent acoustic model from a task-independent (TI) training corpus that includes an acoustic representation of an utterance and a sequence of transcribed words corresponding to the acoustic representation, the method comprising:

deriving a task relevance measure for each word in the TI training corpus, indicative of a relevance of the words to a task;

training a task-independent (TI) AM based on the TI training corpus, the TI AM including words from the TI training corpus and associated AM parameters; and

generating a task-dependent (TD) acoustic model (AM) based on the TI AM trained from the TI training corpus and the task relevance measures for the words in the TI training corpus.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

The present invention generates a task-dependent acoustic model from a supervised task-independent corpus and further adapted it with an unsupervised task dependent corpus. The task-independent corpus includes task-independent training data which has an acoustic representation of words and a sequence of transcribed words corresponding to the acoustic representation. A relevance measure is defined for each of the words in the task-independent data. The relevance measure is used to weight the data associated with each of the words in the task-independent training data. The task-dependent acoustic model is then trained based on the weighted data for the words in the task-independent training data.

Citations

21 Claims

1. A method of generating a task-dependent acoustic model from a task-independent (TI) training corpus that includes an acoustic representation of an utterance and a sequence of transcribed words corresponding to the acoustic representation, the method comprising:
- deriving a task relevance measure for each word in the TI training corpus, indicative of a relevance of the words to a task;
  
  training a task-independent (TI) AM based on the TI training corpus, the TI AM including words from the TI training corpus and associated AM parameters; and
  
  generating a task-dependent (TD) acoustic model (AM) based on the TI AM trained from the TI training corpus and the task relevance measures for the words in the TI training corpus.
- View Dependent Claims (5, 6, 7, 8, 9, 10)
- - 5. The method of claim 1 wherein generating a TD AM comprises:
    - weighting words in the TI training corpus.
  - 6. The method of claim 5 wherein generating a TD AM comprises:
    - training the TD AM using the weighted words.
  - 7. The method of claim 1 wherein generating a TD AM comprises:
    - extracting relevant data from the TI training corpus based on the task relevance measures.
  - 8. The method of claim 7 wherein generating a TD AM comprises:
    - training the TD AM based on the relevant data.
  - 9. The method of claim 1 wherein deriving a task relevance measure comprises:
    - selecting a word from the TI training corpus; and
      
      defining the task relevance measure for the selected word based on a portion of the selected word that is found in the task.
  - 10. The method of claim 9 wherein defining the task relevance measure for the selected word, comprise:
    - defining the task relevance measure for the selected word based on a number of relevant triphones in the selected word, the number of relevant triphones being triphones in the selected word that are found in the task.

2. (canceled)

3. (canceled)

4. (canceled)

11. (canceled)

12. (canceled)

13. (canceled)

14. A system for generating a task-dependent (TD) acoustic model (AM) from a task-independent (TI) training corpus, comprising:
- a task relevance generator receiving a task input indicative of words relevant to a task and configured to generate a relevance measure for each word in the TI training corpus based on the task input; and
  
  an AM generator, coupled to the TI training corpus and the task relevance generator and configured to generate the TD AM based on the TI training corpus and the relevance measure.
- View Dependent Claims (15)
- - 15. The system of claim 14 wherein the task relevance generator is configured to generate the relevance measure for a selected word based on a number of phonetic units in the selected word that are in the task input.

16. (canceled)

17. (canceled)

18. (canceled)

19. (canceled)

20. (canceled)

21. (canceled)

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Microsoft Technology Licensing LLC (Microsoft Corporation)
Original Assignee
Microsoft Corporation
Inventors
Hwang, Mei Yuh

Granted Patent

US 7,263,487 B2
Time in Patent Office

Days
Field of Search
US Class Current

704/275
CPC Class Codes

G10L 15/063   Training

G10L 15/065   Adaptation

G10L 15/183   using context dependencies,...

Generating a task-adapted acoustic model from one or more different corpora

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

Citations

21 Claims

Specification

Solutions

Use Cases

Quick Links

Generating a task-adapted acoustic model from one or more different corpora

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

21 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links