SOLUTION THAT INTEGRATES VOICE ENROLLMENT WITH OTHER TYPES OF RECOGNITION OPERATIONS PERFORMED BY A SPEECH RECOGNITION ENGINE USING A LAYERED GRAMMAR STACK
Abstract
The present invention can include a speech enrollment system including an ordered stack of grammars and a recognition engine. The ordered stack of grammars can include an application grammars layer, a confusable grammar layer, a personal grammar layer, a phrase enrolled grammar layer, and an enrollment grammar layer. The recognition engine can return recognition results for speech input by processing the input using the ordered stack of grammars. The processing can occur from the topmost layer in the stack to the bottommost layer in the stack. Each layer in the stack can include exit criteria based upon a defined condition. When the exit criteria is satisfied, a result can be returned based upon that layer and lower layers of the ordered stack can be ignored.
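The top-to-bottom processing described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the `Layer` class, the `lookup` scorer, the score thresholds, and the sample grammar entries are all hypothetical, and exact text matching stands in for acoustic matching. The abstract only states that each layer has exit criteria based on a defined condition; modeling that condition as a score threshold is an assumption.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Sequence, Tuple


@dataclass
class Layer:
    name: str
    # Hypothetical scorer: maps an utterance to (best entry, score in [0, 1]).
    score: Callable[[str], Tuple[Optional[str], float]]
    # Exit criterion for this layer, assumed here to be a score threshold.
    threshold: float


def lookup(entries: Sequence[str]) -> Callable[[str], Tuple[Optional[str], float]]:
    """Toy scorer: exact text match stands in for acoustic matching."""
    def score(utterance: str) -> Tuple[Optional[str], float]:
        return (utterance, 1.0) if utterance in entries else (None, 0.0)
    return score


def recognize(stack: Sequence[Layer], utterance: str) -> Optional[Tuple[str, Optional[str]]]:
    """Walk the stack from top to bottom. The first layer whose exit
    criterion is satisfied returns the result; lower layers are ignored."""
    for layer in stack:
        entry, score = layer.score(utterance)
        if score >= layer.threshold:
            return layer.name, entry
    return None


# Layer order follows the abstract; entries and thresholds are made up.
stack = [
    Layer("application", lookup(["call", "dial"]), 0.8),
    Layer("confusable", lookup(["cal"]), 0.8),
    Layer("personal", lookup(["mom"]), 0.8),
    Layer("phrase_enrolled", lookup([]), 0.8),
    # Bottom layer: assumed to accept any input as an enrollment candidate.
    Layer("enrollment", lambda u: (u, 1.0), 0.0),
]

print(recognize(stack, "call"))
print(recognize(stack, "bob"))
```

In this sketch an utterance that matches an application grammar entry never reaches the enrollment layer, while an unknown utterance falls through every upper layer and is handled at the bottom of the stack.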
20 Claims
1. A speech enrollment system comprising:
an ordered stack of grammars, wherein a topmost layer in the stack includes application grammars and wherein the bottommost layer in the stack includes an enrollment grammar;
a recognition engine configured to return results for speech input by processing the input using the ordered stack of grammars, wherein the processing occurs from the topmost layer in the stack to the bottommost layer in the stack, wherein each layer in the stack includes exit criteria based upon a defined condition, wherein when the exit criteria is satisfied, a speech recognition result is returned based upon that layer and lower layers of the ordered stack are ignored, whereby the speech enrollment system supports voice enrollment using the ordered stack without relying upon a voice enrollment specific API.
Dependent claims: 2, 3, 4, 5

6. A method for creating voice-enrolled grammars comprising:
receiving speech input;
using entries in an application grammar to determine whether the speech input matches an entry in the application grammar with a sufficient confidence;
when a sufficient confidence is determined returning a result that indicates a recognition match;
when an insufficient confidence is determined, using entries in at least one of an application grammar and a personal grammar to determine whether the speech input matches an entry in the application grammar with a sufficient clash value;
when a sufficient clash value is determined returning a result that indicates a clash with an existing grammar entry;
when an insufficient clash value is determined performing a voice enrollment consistency detection operation; and
depending upon results of the consistency detection operation, voice enrolling the speech input to generate an acoustic base form corresponding to the speech input.
Dependent claims: 7, 8, 9, 10

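The sequence of checks recited in claim 6 can be sketched as a short decision function. This is an illustration under stated assumptions, not the claimed method itself: `difflib.SequenceMatcher` on text strings stands in for acoustic scoring, the three threshold values are invented, consistency detection is assumed to mean comparing the utterance against a repeated utterance of the same phrase, and appending the text to a list stands in for generating an acoustic base form. The names `similarity`, `best_match`, and `process` are hypothetical.

```python
from difflib import SequenceMatcher
from typing import List, Optional, Sequence, Tuple


def similarity(a: str, b: str) -> float:
    """Text similarity in [0, 1]; a stand-in for an acoustic score."""
    return SequenceMatcher(None, a, b).ratio()


def best_match(utterance: str, entries: Sequence[str]) -> Tuple[Optional[str], float]:
    """Return the closest grammar entry and its score."""
    scored = ((entry, similarity(utterance, entry)) for entry in entries)
    return max(scored, key=lambda pair: pair[1], default=(None, 0.0))


def process(
    utterance: str,
    repeat: str,
    app_grammar: Sequence[str],
    personal_grammar: List[str],
    confidence_min: float = 0.95,   # assumed threshold
    clash_min: float = 0.6,         # assumed threshold
    consistency_min: float = 0.8,   # assumed threshold
) -> Tuple[str, Optional[str]]:
    # Step 1: recognition against the application grammar.
    entry, confidence = best_match(utterance, app_grammar)
    if confidence >= confidence_min:
        return ("match", entry)

    # Step 2: clash detection against application and personal grammars.
    entry, clash = best_match(utterance, list(app_grammar) + personal_grammar)
    if clash >= clash_min:
        return ("clash", entry)

    # Step 3: consistency detection (assumed: compare with a repeated utterance).
    if similarity(utterance, repeat) < consistency_min:
        return ("inconsistent", None)

    # Step 4: enrollment; storing the text stands in for an acoustic base form.
    personal_grammar.append(utterance)
    return ("enrolled", utterance)


personal: List[str] = ["mom"]
print(process("call home", "call home", ["call home"], personal))
print(process("call hone", "call hone", ["call home"], personal))
print(process("bob", "bob", ["call home"], personal))
```

Each step returns as soon as its condition is met, so enrollment is attempted only for input that is neither recognized nor too close to an existing entry.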
11. A method of utilizing a layered grammar stack to integrate voice enrollment operations with other types of recognition operations of a speech recognition engine comprising:
establishing an ordered stack of grammars, wherein a topmost layer in the stack includes at least one application grammar and wherein the bottommost layer in the stack includes an enrollment grammar;
receiving speech input; and
processing the speech input with a speech recognizing engine that utilizes the ordered stack, wherein the processing occurs from the topmost layer in the stack to the bottommost layer in the stack, wherein each layer in the stack includes exit criteria based upon a defined condition, wherein when the exit criteria is satisfied, a speech recognition result is returned based upon that layer, and wherein when a result is returned from a layer, lower layers of the ordered stack are ignored.
Dependent claims: 12, 13, 14, 15, 16, 17, 18, 19, 20

Specification