Speech recognition system having parallel large vocabulary recognition engines
First Claim
1. A huge vocabulary speech recognition system for recognizing a sequence of spoken words, the system comprisinginput means for receiving a time-sequential input pattern representative of the sequence of spoken words;
- and a large vocabulary speech recognizer operative to recognize the input pattern as a sequence of words from the vocabulary using a large vocabulary recognition model associated with the speech recognizer;
characterized in that the system comprises a plurality of N large vocabulary speech recognizers, each being associated with a respective, different large vocabulary recognition model;
each of the recognition models being targeted to a specific part of the huge vocabulary; and
the system comprises a controller operative to direct the input pattern to a plurality of the speech recognizers and to select a recognized word sequence from the word sequences recognized by the plurality of speech recognizers.
4 Assignments
0 Petitions
Accused Products
Abstract
A huge vocabulary speech recognition system for recognizing a sequence of spoken words, having an input means for receiving a time-sequential input pattern representative of the sequence of spoken words. The system further includes a plurality of large vocabulary speech recognizers each being associated with a respective, different large vocabulary recognition model. Each of the recognition models is targeted to a specific part of the huge vocabulary. The system comprises a controller operative to direct the input pattern to a plurality of the speech recognizers and to select a recognized word sequence from the word sequences recognized by the plurality of speech recognizers.
256 Citations
8 Claims
-
1. A huge vocabulary speech recognition system for recognizing a sequence of spoken words, the system comprising
input means for receiving a time-sequential input pattern representative of the sequence of spoken words; - and
a large vocabulary speech recognizer operative to recognize the input pattern as a sequence of words from the vocabulary using a large vocabulary recognition model associated with the speech recognizer;
characterized in thatthe system comprises a plurality of N large vocabulary speech recognizers, each being associated with a respective, different large vocabulary recognition model;
each of the recognition models being targeted to a specific part of the huge vocabulary; and
the system comprises a controller operative to direct the input pattern to a plurality of the speech recognizers and to select a recognized word sequence from the word sequences recognized by the plurality of speech recognizers. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
select a test recognition model from the N-M recognition models not yet used by one of the recognizers;
control a testing recognizer to recognize at least part of the input pattern with the test recognition model; and
if the recognition result of the testing recognizer is better than the recognition result of one of the recognizers, enable recognition with the test recognition model.
- and
-
7. A system as claimed in claim 1, wherein the recognition models are hierarchically arranged from models with a more generic context to models with a more specific context, and wherein the model selector is operative to enable recognition with a more specific model if recognition with the hierarchically-related more generic model at a higher level in the hierarchy obtains good recognition results compared to results of at least one recognizer associated with another recognition model.
-
8. A system as claimed in claim 1, wherein the system comprises a user station and a server station connected via a network, such as Internet;
- the user station being operative to receive the input pattern from a user and transfer a signal representative of the input pattern to the server station;
the server station comprising the recognizers and controller.
- the user station being operative to receive the input pattern from a user and transfer a signal representative of the input pattern to the server station;
Specification