Processing speech recognition errors in an embedded speech recognition system
First Claim
1. In a remote training system, a method for processing a speech misrecognition generated when converting speech audio to text in an embedded speech recognition system comprising:
receiving from an embedded speech recognition system speech audio and an active acoustic model both associated with a detected speech misrecognition in said embedded speech recognition system;
first presenting a list of valid phrases which were contextually valid when the speech misrecognition occurred, and second presenting a list of words forming a selected one of said first presented contextually valid phrases;
modifying said active acoustic model based on selected ones of said words in said list and said received speech audio; and,
transmitting said modified acoustic model to said embedded speech recognition system.
Abstract
A method and system for processing speech misrecognitions. The system can include an embedded speech recognition system having at least one acoustic model and at least one active grammar, wherein the embedded speech recognition system is configured to convert speech audio to text using the at least one acoustic model and the at least one active grammar; a remote training system for modifying the at least one acoustic model based on corrections to speech misrecognitions detected in the embedded speech recognition system; and, a communications link for communicatively linking the embedded speech recognition system to the remote training system. The embedded speech recognition system can further include a user interface for presenting a dialog for correcting the speech misrecognitions detected in the embedded speech recognition system. Notably, the user interface can be a visual display. Alternatively, the user interface can be an audio user interface. Finally, the user interface can include both a visual display and an audio user interface.
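The three components named in the abstract can be pictured with a minimal Python sketch. Everything here is an illustrative assumption rather than the patent's implementation: the class names `AcousticModel`, `RemoteTrainingSystem`, and `EmbeddedRecognizer` are hypothetical, the acoustic model is a toy counter table, and the communications link is modeled as a direct object reference.

```python
from dataclasses import dataclass, field


@dataclass
class AcousticModel:
    # Toy stand-in for an acoustic model: a version number plus a count of
    # how often each word has been adapted. A real model is far richer.
    version: int = 0
    adaptations: dict = field(default_factory=dict)


class RemoteTrainingSystem:
    """Hypothetical remote trainer that adapts a model from a correction."""

    def retrain(self, model, audio, corrected_words):
        # Work on a copy so the caller decides when to swap models.
        updated = AcousticModel(model.version + 1, dict(model.adaptations))
        for word in corrected_words:
            updated.adaptations[word] = updated.adaptations.get(word, 0) + 1
        return updated


class EmbeddedRecognizer:
    """Hypothetical embedded device with one model and one active grammar."""

    def __init__(self, model, active_grammar, link):
        self.model = model
        self.active_grammar = active_grammar  # phrases valid in this context
        self.link = link  # communications link, modeled as a direct reference

    def report_misrecognition(self, audio, corrected_phrase):
        if corrected_phrase not in self.active_grammar:
            raise ValueError("correction must be a contextually valid phrase")
        # Send audio and model over the link, then install the returned model.
        self.model = self.link.retrain(self.model, audio, corrected_phrase.split())
        return self.model


device = EmbeddedRecognizer(
    AcousticModel(), ["call home", "play radio"], RemoteTrainingSystem()
)
updated = device.report_misrecognition(b"\x00\x01", "call home")
print(updated.version, updated.adaptations)
```

In this sketch the device only forwards the correction; all model changes happen on the remote side, which mirrors the division of labor the abstract describes.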
17 Claims
1. In a remote training system, a method for processing a speech misrecognition generated when converting speech audio to text in an embedded speech recognition system comprising:
receiving from an embedded speech recognition system speech audio and an active acoustic model both associated with a detected speech misrecognition in said embedded speech recognition system;
first presenting a list of valid phrases which were contextually valid when the speech misrecognition occurred, and second presenting a list of words forming a selected one of said first presented contextually valid phrases;
modifying said active acoustic model based on selected ones of said words in said list and said received speech audio; and,
transmitting said modified acoustic model to said embedded speech recognition system.
Dependent claims: 2, 3, 4, 5, 6
7. A machine readable storage, having stored thereon a computer program for processing a speech misrecognition generated when converting speech audio to text in an embedded speech recognition system, said computer program having a plurality of code sections executable by a machine for causing the machine to perform the steps of:
receiving from an embedded speech recognition system speech audio and an active acoustic model both associated with a detected speech misrecognition in said embedded speech recognition system;
first presenting a list of valid phrases which were contextually valid when the speech misrecognition occurred, and second presenting a list of words forming a selected one of said first presented contextually valid phrases;
modifying said active acoustic model based on selected ones of said words in said list and said received speech audio; and,
transmitting said modified acoustic model to said embedded speech recognition system.
Dependent claims: 8, 9, 10, 11, 12, 14, 15, 16, 17
13. A system for processing speech misrecognitions comprising:
an embedded speech recognition system comprising at least one acoustic model and at least one active grammar, said embedded speech recognition system configured to convert speech audio to text using said at least one acoustic model and said at least one active grammar;
a remote training system for modifying said at least one acoustic model based on corrections to speech misrecognitions detected in said embedded speech recognition system; and,
a communications link for communicatively linking said embedded speech recognition system to said remote training system.
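The four steps of claim 1 (receive, present phrases then words, modify, transmit) can be traced in a short Python sketch. It rests on loud assumptions: the function `process_misrecognition` and its callbacks `choose_phrase` and `choose_words` are hypothetical, the acoustic model is a toy dictionary mapping each word to a list of audio samples, and "modifying" the model simply attaches the received audio to the selected words. Real acoustic model adaptation is considerably more involved.

```python
def process_misrecognition(audio, model, valid_phrases, choose_phrase, choose_words):
    """Toy walk-through of the remote-side steps recited in claim 1.

    choose_phrase and choose_words are hypothetical callbacks that stand in
    for the user's selections during the two presentation steps.
    """
    # First presentation: the phrases that were contextually valid
    # when the misrecognition occurred.
    phrase = choose_phrase(list(valid_phrases))
    if phrase not in valid_phrases:
        raise ValueError("selected phrase was not in the presented list")

    # Second presentation: the words forming the selected phrase.
    words = phrase.split()
    selected = [w for w in choose_words(list(words)) if w in words]

    # Modification (toy version): copy the model, then associate the
    # received audio with each selected word.
    modified = {word: list(samples) for word, samples in model.items()}
    for word in selected:
        modified.setdefault(word, []).append(audio)

    # The caller would transmit this back to the embedded system.
    return modified


model = {"call": [b"old"]}
new_model = process_misrecognition(
    audio=b"new",
    model=model,
    valid_phrases=["call home", "call office"],
    choose_phrase=lambda phrases: phrases[0],   # user picks "call home"
    choose_words=lambda words: [words[1]],      # user picks "home"
)
print(new_model)
```

Passing the selections in as callbacks keeps the sketch independent of whether the dialog is visual, audio, or both.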
Specification