Incorporating an exogenous large-vocabulary model into rule-based speech recognition
First Claim
1. A method for providing incorporation of an exogenous large-vocabulary model into rule-based speech recognition, comprising:
- receiving a first recognition result from a rule-based speech recognition system, the first recognition result including a mark-up that specifies a portion of a received audio stream was not recognized by the rule-based speech recognition system;
- performing a statistical model-based recognition of the marked-up portion of the audio stream to create a second recognition result;
- combining the second recognition result with the first recognition result to create a combined recognition result; and
- sending the combined recognition result to the rule-based speech recognition system.
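The combining step of the claim above can be sketched in Python. This is a minimal illustration, not the patented implementation: the claim does not define the mark-up format, so the `Unrecognized` class, the `Segment` type, the `combine` function, and the stand-in recognizer are all hypothetical names introduced here. The sketch assumes the first recognition result arrives as recognized text interleaved with mark-ups that carry the start and end time of the unrecognized audio.

```python
from dataclasses import dataclass
from typing import Callable, List, Union


@dataclass
class Unrecognized:
    """Hypothetical mark-up: a span of audio the rule-based system did not recognize."""
    start_ms: int
    end_ms: int


# Assumption: a first recognition result is recognized text mixed with mark-ups.
Segment = Union[str, Unrecognized]


def combine(first_result: List[Segment],
            recognize_span: Callable[[int, int], str]) -> str:
    """Fill each marked-up span with a statistical-model result and join the pieces."""
    parts = []
    for segment in first_result:
        if isinstance(segment, Unrecognized):
            # Second recognition result, produced for the marked-up portion only.
            parts.append(recognize_span(segment.start_ms, segment.end_ms))
        else:
            # Text the rule-based system already recognized is kept unchanged.
            parts.append(segment)
    return " ".join(parts)


def fake_large_vocab_recognizer(start_ms: int, end_ms: int) -> str:
    # Stand-in for a statistical recognizer; returns canned text for the demo.
    return "running late for the meeting"


first = ["send a message to Bob saying", Unrecognized(1800, 4200)]
combined = combine(first, fake_large_vocab_recognizer)
print(combined)
```

Passing the recognizer in as a callable keeps the merge logic independent of whichever statistical model performs the second recognition.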
Abstract
Incorporation of an exogenous large-vocabulary model into rule-based speech recognition is provided. An audio stream is received by a local small-vocabulary rule-based speech recognition system (SVSRS), and is streamed to a large-vocabulary statistically-modeled speech recognition system (LVSRS). The SVSRS and LVSRS perform recognitions of the audio. If a portion of the audio is not recognized by the SVSRS, a rule is triggered that inserts a mark-up in the recognition result. The recognition result is sent to the LVSRS. If a mark-up is detected, recognition of a specified portion of the audio is performed. The LVSRS result is unified with the SVSRS result and sent as a hybrid response back to the SVSRS. If the hybrid-recognition rule is not triggered, an arbitration algorithm is invoked to determine whether the SVSRS or the LVSRS recognition has the lower word error rate. The determined recognition is sent as a response to the SVSRS.
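The choice between the hybrid response and arbitration described in the abstract can be sketched as follows. This is a rough illustration under stated assumptions: the abstract does not say how word error rate is determined at run time, so the sketch assumes each recognition carries an estimated error score. The `Recognition` class, its `estimated_wer` and `has_markup` fields, the `respond` function, and the `<unrecognized/>` placeholder are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Recognition:
    text: str
    estimated_wer: float      # hypothetical: estimated word error rate, lower is better
    has_markup: bool = False  # hypothetical: the hybrid-recognition rule was triggered


def respond(svsrs: Recognition, lvsrs: Recognition,
            unify: Callable[[Recognition, Recognition], str]) -> str:
    """Return the response that is sent back to the SVSRS."""
    if svsrs.has_markup:
        # Hybrid path: unify the LVSRS result with the SVSRS result.
        return unify(svsrs, lvsrs)
    # Arbitration path: keep whichever recognition has the lower estimated error.
    best = min((svsrs, lvsrs), key=lambda r: r.estimated_wer)
    return best.text


def unify_stub(svsrs: Recognition, lvsrs: Recognition) -> str:
    # Simplification: the LVSRS text stands for the recognized marked-up portion.
    return svsrs.text.replace("<unrecognized/>", lvsrs.text)


hybrid = respond(Recognition("text Bob <unrecognized/>", 0.5, has_markup=True),
                 Recognition("see you at noon", 0.1), unify_stub)
arbitrated = respond(Recognition("call home", 0.05),
                     Recognition("cold foam", 0.30), unify_stub)
print(hybrid)
print(arbitrated)
```

When the two scores tie, `min` returns the first argument, so this sketch prefers the SVSRS result; that tie-breaking choice is an assumption, not something the abstract specifies.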
19 Claims
1. A method for providing incorporation of an exogenous large-vocabulary model into rule-based speech recognition, comprising:
- receiving a first recognition result from a rule-based speech recognition system, the first recognition result including a mark-up that specifies a portion of a received audio stream was not recognized by the rule-based speech recognition system;
- performing a statistical model-based recognition of the marked-up portion of the audio stream to create a second recognition result;
- combining the second recognition result with the first recognition result to create a combined recognition result; and
- sending the combined recognition result to the rule-based speech recognition system.
Dependent claims: 2, 3, 4, 5, 6
7. A system for providing speech recognition, comprising:
- one or more processors; and
- a memory coupled to the one or more processors, the one or more processors operable to:
  - receive a first recognition result for a received audio stream, the first recognition result being received from a rule-based speech recognition system;
  - determine if the first recognition result comprises a mark-up that indicates a portion of the audio stream was not recognized by the rule-based speech recognition system;
  - when it is determined the first recognition result comprises the mark-up, perform a statistical model-based recognition of the marked-up portion of the audio stream to create a second recognition result;
  - combine the second recognition result with the first recognition result to create a combined recognition result; and
  - send the combined recognition result to the rule-based speech recognition system.
Dependent claims: 8, 9, 10, 11, 12
13. A computer-readable storage device encoding computer executable instructions that, when executed by a processing unit, perform a method, comprising:
- receiving a first recognition result of an audio stream from a rule-based speech recognition system, the first recognition result containing a mark-up that indicates a portion of the audio stream was not recognized by the rule-based speech recognition system;
- performing a statistical model-based recognition of the marked-up portion of the audio stream to create a second recognition result;
- combining the second recognition result with the first recognition result to form a combined recognition result; and
- sending the combined recognition result to the rule-based speech recognition system.
Dependent claims: 14, 15, 16, 17, 18, 19
Specification