Incorporating an Exogenous Large-Vocabulary Model into Rule-Based Speech Recognition
First Claim
1. A method for providing speech recognition, the method comprising:
- receiving an audio stream;
performing a rule-based speech recognition of the audio stream;
if a portion of the audio stream is recognized, inserting a recognition result of the recognized portion of the audio stream in a first recognition result;
if a portion of the audio stream is not recognized, inserting a mark-up in the first recognition result specifying the portion of the audio stream that is not recognized; and
sending the first recognition result to a large vocabulary speech recognition system for applying a statistical-based recognition of the portion of the audio stream that is not recognized.
3 Assignments
0 Petitions
Accused Products
Abstract
Incorporation of an exogenous large-vocabulary model into rule-based speech recognition is provided. An audio stream is received by a local small-vocabulary rule-based speech recognition system (SVSRS), and is streamed to a large-vocabulary statistically-modeled speech recognition system (LVSRS). The SVSRS and LVSRS perform recognitions of the audio. If a portion of the audio is not recognized by the SVSRS, a rule is triggered that inserts a mark-up in the recognition result. The recognition result is sent to the LVSRS. If a mark-up is detected, recognition of a specified portion of the audio is performed. The LVSRS result is unified with the SVSRS result and sent as a hybrid response back to the SVSRS. If the hybrid-recognition rule is not triggered, an arbitration algorithm is evoked to determine whether the SVSRS or the LVSRS recognition has a lesser word error rate. The determined recognition is sent as a response to the SVSRS.
18 Citations
20 Claims
-
1. A method for providing speech recognition, the method comprising:
-
receiving an audio stream; performing a rule-based speech recognition of the audio stream; if a portion of the audio stream is recognized, inserting a recognition result of the recognized portion of the audio stream in a first recognition result; if a portion of the audio stream is not recognized, inserting a mark-up in the first recognition result specifying the portion of the audio stream that is not recognized; and sending the first recognition result to a large vocabulary speech recognition system for applying a statistical-based recognition of the portion of the audio stream that is not recognized. - View Dependent Claims (2, 3, 4)
-
-
5. A system for providing incorporation of an exogenous large-vocabulary model into rule-based speech recognition, the system comprising:
-
one or more processors; and a memory coupled to the one or more processors, the one or more processors operable to; receive an audio stream; perform a rule-based speech recognition of the audio stream; if a portion of the audio stream is matched with a rule, insert a recognition result of the rule-matched portion of the audio stream in a first recognition result; if a portion of the audio stream is not matched with a rule, insert a mark-up in the first recognition result specifying the portion of the audio stream that is not matched with a rule; and send the first recognition result to a large vocabulary speech recognition system for applying a statistical-based recognition of the portion of the audio stream that is not recognized. - View Dependent Claims (6, 7, 8)
-
-
9. A method for providing incorporation of an exogenous large-vocabulary model into rule-based speech recognition, the method comprising:
-
receiving an audio stream; receiving a first recognition result; determining if the first recognition result comprises a mark-up specifying a portion of the audio stream is not recognized by a rule-based speech recognition system; if the first recognition result comprises a mark-up specifying a portion of the audio stream is not recognized by a rule-based speech recognition system, performing a statistical model-based recognition of the specified portion of the audio stream; combining a second recognition result with the first recognition result; and sending a combined recognition result to the rule-based speech recognition system. - View Dependent Claims (10, 11, 12, 13, 14)
-
-
15. A system for providing speech recognition, the system comprising:
-
one or more processors; and a memory coupled to the one or more processors, the one or more processors operable to; receive an audio stream; receive a first recognition result; determine if the first recognition result comprises a mark-up specifying a portion of the audio stream is not recognized by a rule-based speech recognition system; if the first recognition result comprises a mark-up specifying a portion of the audio stream is not recognized by a rule-based speech recognition system, performing a statistical model-based recognition of the specified portion of the audio stream; combine a second recognition result with the first recognition result; and send a combined recognition result to the rule-based speech recognition system. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification