Creation and Use of Application-Generic Class-Based Statistical Language Models for Automatic Speech Recognition

US 20090055184A1
Filed: 08/24/2007
Published: 02/26/2009
Est. Priority Date: 08/24/2007
Status: Active Grant



Abstract

A method of creating an application-generic class-based SLM includes, for each of a plurality of speech applications, parsing a corpus of utterance transcriptions to produce a first output set, in which expressions identified in the corpus are replaced with corresponding grammar tags from a grammar that is specific to the application. The method further includes, for each of the plurality of speech applications, replacing each of the grammar tags in the first output set with a class identifier of an application-generic class, to produce a second output set. The method further includes processing the resulting second output sets with a statistical language model (SLM) trainer to generate an application-generic class-based SLM.
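
The three steps of the abstract can be sketched end to end as follows. This is a minimal illustration, not the patented implementation: the grammars, the tag names, the `<COLLECTION>` class identifier, and the use of pooled bigram counts as a stand-in for an SLM trainer are all assumptions made for the example.

```python
import re
from collections import Counter

# Hypothetical application-specific grammars: grammar tag -> phrases it covers.
APP_GRAMMARS = {
    "banking": {"<bank_account>": ["checking", "savings"]},
    "travel": {"<travel_city>": ["boston", "montreal"]},
}

# Hypothetical mapping from application-specific tags to generic class identifiers.
TAG_TO_CLASS = {"<bank_account>": "<COLLECTION>", "<travel_city>": "<COLLECTION>"}


def parse_corpus(corpus, grammar):
    """First output set: replace grammar-covered expressions with grammar tags."""
    phrase_to_tag = {p: tag for tag, phrases in grammar.items() for p in phrases}
    # Longest phrases first so multi-word expressions win over their substrings.
    alternatives = sorted(phrase_to_tag, key=len, reverse=True)
    pattern = re.compile(r"\b(" + "|".join(map(re.escape, alternatives)) + r")\b")
    return [pattern.sub(lambda m: phrase_to_tag[m.group(1)], u.lower()) for u in corpus]


def to_generic(tagged, tag_to_class):
    """Second output set: replace each grammar tag with a generic class identifier."""
    return [" ".join(tag_to_class.get(tok, tok) for tok in line.split()) for line in tagged]


def train_bigram_counts(sentences):
    """Stand-in for an SLM trainer: pooled bigram counts with sentence markers."""
    counts = Counter()
    for line in sentences:
        tokens = ["<s>"] + line.split() + ["</s>"]
        counts.update(zip(tokens, tokens[1:]))
    return counts


corpora = {
    "banking": ["I want my savings balance", "transfer from checking"],
    "travel": ["I want to fly to Boston", "leaving from Montreal"],
}

# Process every application, then train on the second output sets collectively.
second_sets = []
for app, corpus in corpora.items():
    first = parse_corpus(corpus, APP_GRAMMARS[app])
    second_sets.extend(to_generic(first, TAG_TO_CLASS))

generic_counts = train_bigram_counts(second_sets)
```

Because both applications map their tags to the same class identifier, the pooled counts contain patterns such as `("from", "<COLLECTION>")` that no single application-specific corpus would support as strongly.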


25 Claims

1. A method comprising:
- for each of a plurality of speech applications, parsing a corpus of terms to produce a first output set, in which expressions identified in the corpus are replaced with corresponding grammar tags from a grammar that is specific to the application;
  
  for each of the plurality of speech applications, replacing each of the grammar tags in the first output set with a class identifier of an application-generic class, to produce a second output set; and
  
  processing collectively the second output sets or data derived from the output sets with a statistical language model (SLM) trainer to generate an application-generic class-based SLM.
- Dependent claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12):
  - 2. A method as recited in claim 1, wherein the application-generic class-based SLM includes one or more of said class identifiers.
  - 3. A method as recited in claim 2, further comprising:
    - creating an application-specific class-based SLM for a target speech application by replacing each said class identifier in the application-generic class-based SLM with a pointer to an application-specific grammar for the target speech application.
  - 4. A method as recited in claim 3, wherein said application-specific grammar is a class of the SLM.
  - 5. A method as recited in claim 1, wherein said parsing comprises:
    - for each identified expression, identifying a type of grammar to which the expression corresponds; and
      
      selecting a grammar tag to replace the expression based on the identified type of grammar.
  - 6. A method as recited in claim 5, wherein said identifying a type of grammar comprises determining whether the expression corresponds to a command grammar or a collection grammar.
  - 7. A method as recited in claim 6, wherein said replacing each of the grammar tags in the first output set with a class identifier of an application-generic class comprises:
    - replacing a grammar tag with a first class identifier if the grammar tag is determined to correspond to a first type of grammar; and
      
      replacing a grammar tag with a second class identifier if the grammar tag is determined to correspond to a second type of grammar.
  - 8. A method as recited in claim 7, wherein the first type of grammar is a command grammar and the second type of grammar is a collection grammar.
  - 9. A method as recited in claim 1, further comprising:
    - prior to said processing, performing on the second output sets collectively at least one operation from the set of operations consisting of:
      
      1) balancing between the second output sets according to a size of the corpus of the corresponding speech applications;
      
      2) filtering the second output sets to remove expressions that are not present in the corpus of at least a predetermined subset of the plurality of speech applications;
      
      3) assigning weights to tokens in the second output sets.
  - 10. A method as recited in claim 1, further comprising:
    - executing an automatic speech recognition (ASR) process to recognize speech represented in a stored set of audio data associated with a target speech application, by using an application-specific grammar for the target speech application in combination with the application-generic class-based SLM, to generate a set of recognition results.
  - 11. A method as recited in claim 10, further comprising:
    - processing at least a portion of the set of recognition results with an SLM trainer to generate a word-based SLM for use in ASR for the target speech application.
  - 12. A method as recited in claim 11, further comprising:
    - using the word-based SLM to perform ASR for the target speech application.
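
The three pre-training operations listed in claim 9 (balancing, filtering, and weighting) might look like the sketch below. The claims do not say how each operation is computed, so the choices here are assumptions: balancing uses a weight of one over the number of lines per application, an "expression" is taken to be a whole line, and token weights are simply the per-application balance weights.

```python
from collections import Counter, defaultdict


def balance_weights(second_sets):
    """Per-application weight so each application contributes equal total line mass."""
    return {app: 1.0 / len(lines) for app, lines in second_sets.items() if lines}


def filter_shared(second_sets, min_apps):
    """Keep only lines that occur in the output of at least `min_apps` applications."""
    apps_per_line = defaultdict(set)
    for app, lines in second_sets.items():
        for line in lines:
            apps_per_line[line].add(app)
    return {
        app: [line for line in lines if len(apps_per_line[line]) >= min_apps]
        for app, lines in second_sets.items()
    }


def weighted_token_counts(second_sets, weights):
    """Accumulate token counts, scaling each token by its application's weight."""
    counts = Counter()
    for app, lines in second_sets.items():
        for line in lines:
            for token in line.split():
                counts[token] += weights[app]
    return counts


# Hypothetical second output sets for two applications.
second_sets = {
    "banking": ["i want <COLLECTION>", "go back", "i want <COLLECTION>", "wire money abroad"],
    "travel": ["i want <COLLECTION>", "go back"],
}

filtered = filter_shared(second_sets, min_apps=2)
weights = balance_weights(filtered)
counts = weighted_token_counts(filtered, weights)
```

In this toy data the banking-only line "wire money abroad" is filtered out, and the larger banking set is down-weighted so that it does not dominate the pooled statistics.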

13. A method of creating a statistical language model (SLM) for automatic speech recognition (ASR), the method comprising:
- for each of a plurality of speech applications, parsing a corpus of utterance transcriptions from the application to produce a first output set, in which expressions identified in the corpus are replaced with corresponding grammar tags from a grammar that is specific to the application, wherein said parsing includes for each identified expression, identifying a type of grammar to which the expression corresponds, including determining whether the expression corresponds to a first grammar or a second grammar, and selecting a grammar tag to replace the expression based on the identified type of grammar;
  
  for each of the plurality of speech applications, replacing each of the grammar tags in the first output set with a class identifier of an application-generic class, to produce a second output set, including replacing the grammar tag with a first class identifier if the grammar tag is determined to correspond to a grammar of the first type, and replacing the grammar tag with a second class identifier if the grammar tag is determined to correspond to a grammar of the second type;
  
  filtering the second output sets collectively based on an algorithm to produce a third output set; and
  
  processing the third output set with an SLM trainer to generate an application-generic class-based SLM for ASR, wherein the application-generic class-based SLM includes one or more of said class identifiers.
- Dependent claims (14, 15, 16, 17, 18, 19):
  - 14. A method as recited in claim 13, wherein the first type of grammar is a command grammar and the second type of grammar is a collection grammar.
  - 15. A method as recited in claim 14, wherein said filtering comprises at least one operation from the set of operations consisting of:
    - 1) balancing between the second output sets according to a size of the corpus of the corresponding speech applications;
      
      2) filtering the second output sets to remove expressions that are not present in the corpus of at least a predetermined subset of the plurality of speech applications;
      
      3) assigning weights to tokens in the second output sets.
  - 16. A method as recited in claim 13, further comprising:
    - creating an application-specific class-based SLM for ASR for a target application by replacing each said class identifier in the application-generic class-based SLM with a reference to an application-specific grammar for the target application.
  - 17. A method as recited in claim 16, wherein said application-specific grammar is a class of the SLM.
  - 18. A method as recited in claim 16, further comprising generating a word-based SLM for use in ASR, for the target speech application, by:
    - executing an ASR process to recognize speech represented in a stored set of audio data associated with the target speech application, by using said application-specific class-based SLM to generate a set of recognition results; and
      
      processing at least a portion of the set of recognition results with an SLM trainer to generate the word-based SLM for use in ASR, for the target speech application.
  - 19. A method as recited in claim 18, further comprising:
    - using the word-based SLM to perform ASR for the target speech application.
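
The type-dependent replacement recited in claims 13 and 14 (and in claims 5 through 8) could be sketched as a lookup from grammar tag to grammar type, followed by a lookup from type to class identifier. The tag names, the `<CMD>` and `<COLL>` identifiers, and the registry itself are hypothetical; the claims only require that command grammars and collection grammars map to different class identifiers.

```python
from enum import Enum


class GrammarType(Enum):
    COMMAND = "command"
    COLLECTION = "collection"


# Hypothetical generic class identifiers, one per grammar type.
CLASS_ID = {GrammarType.COMMAND: "<CMD>", GrammarType.COLLECTION: "<COLL>"}

# Hypothetical registry: application-specific grammar tag -> grammar type.
TAG_TYPES = {
    "<bank_menu>": GrammarType.COMMAND,
    "<bank_account>": GrammarType.COLLECTION,
    "<travel_city>": GrammarType.COLLECTION,
}


def generalize(line, tag_types=TAG_TYPES):
    """Replace each known grammar tag with the class identifier for its type."""
    out = []
    for token in line.split():
        grammar_type = tag_types.get(token)
        # Ordinary words have no registered type and pass through unchanged.
        out.append(CLASS_ID[grammar_type] if grammar_type is not None else token)
    return " ".join(out)


print(generalize("<bank_menu> for my <bank_account>"))
```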

20. A method comprising:
- creating an application-generic class-based statistical language model (SLM); and
  
  creating an application-specific SLM for use in automatic speech recognition for a target speech application, by incorporating into the application-generic class-based SLM an application-specific grammar for the target speech application.
- Dependent claims (21, 22):
  - 21. A method as recited in claim 20, wherein the application-specific grammar is a class of the SLM.
  - 22. A method as recited in claim 21, wherein creating an application-specific SLM comprises replacing a generic class identifier in the application-generic class-based SLM with a reference to an application-specific grammar for the target speech application.
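
Claim 22 (like claims 3 and 16) describes specializing the generic model by swapping each generic class identifier for a reference to a grammar of the target application. A minimal sketch, assuming the model is stored as a dictionary from n-gram tuples to log-probabilities and that grammar references are plain strings (both are illustrative choices, not taken from the patent):

```python
def specialize(generic_slm, class_to_grammar):
    """Return a copy of the model with class identifiers swapped for grammar references."""
    specialized = {}
    for ngram, logprob in generic_slm.items():
        new_ngram = tuple(class_to_grammar.get(tok, tok) for tok in ngram)
        specialized[new_ngram] = logprob
    return specialized


# Hypothetical generic class-based bigram model (values are log-probabilities).
generic_slm = {
    ("i", "want"): -0.3,
    ("want", "<COLL>"): -0.7,
    ("<CMD>", "</s>"): -0.2,
}

# Hypothetical grammar references for a target pizza-ordering application.
pizza_refs = {
    "<COLL>": "grammar:pizza_toppings",
    "<CMD>": "grammar:pizza_menu",
}

pizza_slm = specialize(generic_slm, pizza_refs)
```

The n-gram statistics are untouched; only the class slots change, which is why the same generic model can be reused for a new application that has grammars but little or no transcribed data.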

23. A method comprising:
- inputting a set of audio data associated with a target speech application;
  
  executing an automatic speech recognition (ASR) process to recognize speech represented in the set of audio data, by using an application-generic class-based statistical language model (SLM) in combination with an application-specific grammar for the target speech application, to generate a set of recognition results; and
  
  processing at least a portion of the set of recognition results with an SLM trainer to generate a word-based SLM for the target speech application.
- Dependent claims (24):
  - 24. A method as recited in claim 23, further comprising:
    - executing a second ASR process to recognize speech represented in a second set of audio data associated with the target speech application, by using the word-based SLM.
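
The bootstrapping loop of claims 23 and 24 (first-pass recognition, then training a word-based SLM on the results) can be illustrated as below. The recognizer itself is not shown; the recognition results are hypothetical, and selecting "at least a portion" of them by a confidence threshold is an assumption, as is the unsmoothed maximum-likelihood bigram estimate used in place of a real SLM trainer.

```python
from collections import Counter


def select_results(results, min_confidence):
    """Keep the portion of recognition results at or above a confidence threshold."""
    return [text for text, confidence in results if confidence >= min_confidence]


def train_word_bigram(sentences):
    """Maximum-likelihood bigram estimates P(word | prev) from recognized text."""
    pair_counts = Counter()
    history_counts = Counter()
    for sentence in sentences:
        tokens = ["<s>"] + sentence.split() + ["</s>"]
        for prev, word in zip(tokens, tokens[1:]):
            pair_counts[(prev, word)] += 1
            history_counts[prev] += 1
    return {pair: n / history_counts[pair[0]] for pair, n in pair_counts.items()}


# Hypothetical first-pass results: (hypothesis text, confidence score).
results = [
    ("i want a large pepperoni", 0.92),
    ("i want a small veggie", 0.88),
    ("eye wander large", 0.31),
]

word_slm = train_word_bigram(select_results(results, min_confidence=0.5))
```

The resulting word-based model would then drive the second ASR pass over further audio from the same application.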

25. An automatic speech recognition system comprising:
- an application-generic class-based statistical language model (SLM); and
  
  an automatic speech recognizer to recognize input speech represented in a set of audio data associated with a speech application, by using the application-generic class-based statistical language model (SLM) in combination with an application-specific grammar for the speech application, to generate a set of recognition results, wherein the application-specific grammar is a class, and wherein the application-generic class-based SLM includes a generic class identifier to indicate to the automatic speech recognizer where to apply the application-specific grammar in the SLM.
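
One common way to write the probability that such a recognizer assigns to a word sequence is the standard class-based bigram factorization shown below. The claims do not state this formula, so it is offered only as a sketch under that assumption. Here $c_i$ is either an ordinary word (in which case $P(w_i \mid c_i) = 1$) or a generic class identifier, for which the application-specific grammar supplies $P(w_i \mid c_i)$.

```latex
P(w_1, \dots, w_n) \approx \prod_{i=1}^{n} P(c_i \mid c_{i-1}) \, P(w_i \mid c_i)
```

Under this reading, the generic class identifier marks the positions where the class term is evaluated by the application-specific grammar rather than by the n-gram statistics.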


Current Assignee: Nuance Communications, Inc. (Microsoft Corporation)
Original Assignee: Nuance Communications, Inc. (Microsoft Corporation)
Inventors: Hebert, Matthieu
Granted Patent: US 8,135,578 B2
US Class Current: 704/257
CPC Class Codes

G06F 40/205   Parsing

G10L 15/1815   Semantic context, e.g. disa...

G10L 15/183   using context dependencies,...

G10L 15/193   Formal grammars, e.g. finit...

G10L 15/197   Probabilistic grammars, e.g...

G10L 2015/228   of application context
