Integrated Language Model, Related Systems and Methods
First Claim
1. A method of making an integrated language model for a speech recognition engine, the method comprising:
- identifying a first language model format for an upper-level language model component;
identifying a plurality of text elements to be represented by a non-terminal in the upper-level language model component;
generating the upper-level language model component including the non-terminal;
identifying a second language model format for a lower-level language model component to be applied to the non-terminal of the upper-level language model component; and
generating the lower-level language model component.
1 Assignment
0 Petitions
Accused Products
Abstract
An integrated language model includes an upper-level language model component and a lower-level language model component, with the upper-level language model component including a non-terminal and the lower-level language model component being applied to the non-terminal. The upper-level and lower-level language model components can be of the same or different language model formats, including finite state grammar (FSG) and statistical language model (SLM) formats. Systems and methods for making integrated language models allow designation of language model formats for the upper-level and lower-level components and identification of non-terminals. Automatic non-terminal replacement and retention criteria can be used to facilitate the generation of one or both language model components, which can include the modification of existing language models.
-
Citations
32 Claims
-
1. A method of making an integrated language model for a speech recognition engine, the method comprising:
-
identifying a first language model format for an upper-level language model component; identifying a plurality of text elements to be represented by a non-terminal in the upper-level language model component; generating the upper-level language model component including the non-terminal; identifying a second language model format for a lower-level language model component to be applied to the non-terminal of the upper-level language model component; and generating the lower-level language model component. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
-
-
23. A method for identifying text elements to be represented by non-terminals in an integrated language model for a speech recognition engine, the method comprising:
-
determining a text element replacement criterion allowing automatic identification of the text elements to be represented by the non-terminals within an existing language model or textual corpus; and applying the text element replacement criterion to the existing language model or textual corpus. - View Dependent Claims (24, 25, 26, 27, 28, 29, 30)
-
-
31. A system for making an integrated language model, the system comprising at least one processor and machine-readable memory configured to execute:
-
a language model integration control module adapted to receive user inputs regarding language model integration options and to generate language model modification rules and application rules based thereon; a language model generation module adapted to modify existing language models based upon the language model generation rules to generate upper-level and lower-level language model components for the integrated language model. - View Dependent Claims (32)
-
Specification