System and method for applying dynamic contextual grammars and language models to improve automatic speech recognition accuracy
Abstract
The invention involves the loading and unloading of dynamic section grammars and language models in a speech recognition system. The values of the sections of the structured document are either determined in advance from a collection of documents of the same domain, document type, and speaker; or collected incrementally from documents of the same domain, document type, and speaker; or added incrementally to an already existing set of values. Speech recognition in the context of the given field is constrained to the contents of these dynamic values. If speech recognition fails or produces a poor match within this grammar or section language model, speech recognition against a larger, more general vocabulary that is not constrained to the given section is performed.
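The fallback behavior described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: a rough text hypothesis stands in for the audio, string similarity from Python's `difflib` stands in for the recognizer's match score, and the names `SectionGrammars`, `best_match`, `recognize`, `general_recognizer`, and the `0.8` threshold are all hypothetical.

```python
from collections import defaultdict
from difflib import SequenceMatcher


class SectionGrammars:
    """Hypothetical store of dynamic values, keyed by document section."""

    def __init__(self):
        self._values = defaultdict(set)

    def add(self, section, value):
        # Values may be loaded in advance or added incrementally.
        self._values[section].add(value.lower())

    def values(self, section):
        return self._values.get(section, set())


def best_match(hypothesis, candidates):
    """Return the candidate most similar to the hypothesis, with its score."""
    best, best_score = None, 0.0
    for candidate in sorted(candidates):
        score = SequenceMatcher(None, hypothesis.lower(), candidate).ratio()
        if score > best_score:
            best, best_score = candidate, score
    return best, best_score


def recognize(hypothesis, section, grammars, general_recognizer, threshold=0.8):
    """Try the section-constrained values first; fall back on a poor match."""
    match, score = best_match(hypothesis, grammars.values(section))
    if match is not None and score >= threshold:
        return match, "section"
    # Assumption: the general recognizer is any callable that is not
    # constrained to the section's values.
    return general_recognizer(hypothesis), "general"
```

A caller would populate `SectionGrammars` from earlier documents of the same domain, document type, and speaker, then pass each new utterance through `recognize`. The threshold is where "a poor match" is decided, and a real system would use the recognizer's own confidence score rather than string similarity.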
16 Claims
1. A method for use with an automatic speech recognition system, the method comprising acts of:
analyzing content of a body of speech submitted to a structured document to identify a first section of the structured document to which the body of speech is submitted;
in response to identifying the first section, loading a grammar and/or language model for use in recognizing the speech in the body submitted to the first section; and
performing speech recognition on the speech in the body using the grammar and/or language model.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
13. At least one computer-readable medium having instructions encoded thereon which, when executed in a system comprising at least one automatic speech recognition component, perform a method comprising acts of:
analyzing content of a body of speech submitted to a structured document to identify a first section of the structured document to which the body of speech is submitted;
in response to identifying the first section, loading a grammar and/or language model for use in recognizing the speech in the body submitted to the first section; and
performing speech recognition on the speech in the body using the grammar and/or language model.
Dependent claims: 14
15. A system for use with at least one automatic speech recognition component, the system comprising at least one processor programmed to:
analyze content of a body of speech submitted to a structured document to identify a first section of the structured document to which the body of speech is submitted;
in response to identifying the first section, load a grammar and/or language model for use in recognizing the speech in the body submitted to the first section; and
perform speech recognition on the speech in the body using the grammar and/or language model.
Dependent claims: 16
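The three acts shared by independent claims 1, 13, and 15 (identify the section from the content, load that section's grammar, recognize with it) can be illustrated with a small sketch. Everything here is an assumption made for illustration: the claims do not say how the content is analyzed, so the sketch uses keyword cues over a rough first-pass transcript. The names `SECTION_CUES`, `identify_section`, `load_grammar`, `recognize_body`, and the sample section names are hypothetical.

```python
# Hypothetical cue words that suggest which section a dictated body belongs to.
SECTION_CUES = {
    "medications": {"mg", "tablet", "daily", "dose"},
    "allergies": {"allergic", "allergy", "reaction"},
}


def identify_section(first_pass_text, cues=SECTION_CUES):
    """Pick the section whose cue words overlap most with the content."""
    tokens = set(first_pass_text.lower().split())
    scores = {section: len(tokens & words) for section, words in cues.items()}
    best = max(scores, key=scores.get)
    # No overlap at all means no section could be identified.
    return best if scores[best] > 0 else None


def load_grammar(section, store):
    """Load the grammar for a section; an empty grammar if none is stored."""
    return store.get(section, frozenset())


def recognize_body(first_pass_text, store, recognize_with):
    """Identify the section, load its grammar, then run recognition with it."""
    section = identify_section(first_pass_text)
    grammar = load_grammar(section, store) if section else frozenset()
    # Assumption: recognize_with is any callable taking (input, grammar).
    return section, recognize_with(first_pass_text, grammar)
```

In this sketch the grammar is loaded only after the section is known, mirroring the "in response to identifying the first section" wording of the claims. A production system would pass audio, not text, to the final recognition step.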
Specification