Method and apparatus for an improved language recognition system
First Claim
1. A method for language recognition, wherein a signal indicative of the language to be recognized is generated, comprising the steps of:
- generating candidate word strings for the signal;
selecting among the candidates using a language model utilizing a plurality of finite state machines, each machine having the ability to recognize a pair of sequences, one sequence scanned leftwards, the other scanned rightwards, and each machine corresponding to a word in a vocabulary of the language model.
3 Assignments
0 Petitions
Accused Products
Abstract
Methods and apparatus for a language model and language recognition systems are disclosed. The method utilizes a plurality of probabilistic finite state machines having the ability to recognize a pair of sequences, one sequence scanned leftwards, the other scanned rightwards. Each word in the lexicon of the language model is associated with one or more such machines which model the semantic relations between the word and other words. Machine transitions create phrases from a set of word string hypotheses, and incrementally calculate costs related to the probability that such phrases represent the language to be recognized. The cascading lexical head machines utilized in the methods and apparatus capture the structural associations implicit in the hierachical organization of a sentence, resulting in a language model and language recognition systems that combine the lexical sensitivity of N-gram models with the structural properties of dependency grammar.
-
Citations
13 Claims
-
1. A method for language recognition, wherein a signal indicative of the language to be recognized is generated, comprising the steps of:
-
generating candidate word strings for the signal; selecting among the candidates using a language model utilizing a plurality of finite state machines, each machine having the ability to recognize a pair of sequences, one sequence scanned leftwards, the other scanned rightwards, and each machine corresponding to a word in a vocabulary of the language model. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
- 10. A computer-readable storage medium comprising encoded computer-readable program instructions for use in conjunction with a programmable computer, which instructions cause the computer to select a language string from a plurality of language string hypotheses, the selected string providing the best correspondence to a signal representative of language, wherein such selection results from the action of a plurality of finite state machines capable of recognizing a pair of sequences, one sequence scanned leftwards, the other scanned rightwards through a data structure based on the plurality of language string hypotheses.
-
13. A method for selecting a word string from a plurality of word string hypotheses, wherein the word string hypotheses are derived from an input signal representative of language and the selected word string best represents the language, comprising the steps of:
-
(a) activating state machines that correspond to the words in the word string hypotheses, wherein the activated state machines are selected from a plurality of such state machines defining a lexicon, wherein each of the activated state machines are capable of recognizing a pair of sequences, one sequence scanned leftwards, the other scanned rightwards, and further wherein each state machine is characterized by an initial state; (b) generating a first plurality of phrase records, wherein, a phrase record is generated for each word in the word string hypotheses and each phrase record is characterized by a word, a state machine, the initial state, and a cost; (c) generating a phrase lattice by forming a data structure comprised of the phrase records of step (b); (d) generating a plurality of extended phrase records, wherein an extended phrase record is formed when a phrase record within the phrase lattice consumes an adjacent phrase record in the phrase lattice by a state machine transition, wherein, the extended phrase record; contains the words of both the phrase record and the adjacent phrase record, and is characterized by the state machine of the consuming phrase record, a new state corresponding to the transition of the state machine, and a new cost, wherein, the new cost is the sum of the costs of the consumed phrase and the consuming phrase, a cost associated with the state machine transition of the consuming phrase and a cost associated with a stop undertaken by the consumed state machine, and a cost pertaining to an association between the words in the consumed and consuming phrase records; (e) adding the extended phrase record to the phrase lattice if the new cost of the extended phrase record is lower than a reference phrase record cost; (f) repeating steps (d) and (e) wherein a phrase record can consume an adjacent phrase record until all phrase records have been fully extended and wherein the reference phrase record cost is updated by the extended phrase records added to the phrase lattice; and (g) selecting the lowest cost phrase record spanning the entire input signal.
-
Specification