System and method for applying dynamic contextual grammars and language models to improve automatic speech recognition accuracy
First Claim
1. A method for loading and unloading dynamically constructed and identified language model or grammar data in an automatic speech recognition system having a structured report organization, the method comprising the steps of:
- determining sections used for the structured data input;
determining content within said sections for the structured data input;
based on said content, creating a recognition language model data;
determining a section status for said structured section input;
based on said section status, loading a corresponding recognition language model or grammar data into the automatic speech recognition system, and conducting speech recognition of the structured data input using said corresponding recognition language model or grammar data.
4 Assignments
0 Petitions
Accused Products
Abstract
The invention involves the loading and unloading of dynamic section grammars and language models in a speech recognition system. The values of the sections of the structured document are either determined in advance from a collection of documents of the same domain, document type, and speaker; or collected incrementally from documents of the same domain, document type, and speaker; or added incrementally to an already existing set of values. Speech recognition in the context of the given field is constrained to the contents of these dynamic values. If speech recognition fails or produces a poor match within this grammar or section language model, speech recognition against a larger, more general vocabulary that is not constrained to the given section is performed.
-
Citations
19 Claims
-
1. A method for loading and unloading dynamically constructed and identified language model or grammar data in an automatic speech recognition system having a structured report organization, the method comprising the steps of:
-
determining sections used for the structured data input;
determining content within said sections for the structured data input;
based on said content, creating a recognition language model data;
determining a section status for said structured section input;
based on said section status, loading a corresponding recognition language model or grammar data into the automatic speech recognition system, and conducting speech recognition of the structured data input using said corresponding recognition language model or grammar data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
Specification