Testing a grammar used in speech recognition for reliability in a plurality of operating environments having different background noise
First Claim
1. A system comprising at least one processor configured to:
- analyze digital data representing sounds captured by at least one microphone from an operating environment to compute background noise information associated with the operating environment, wherein;
the at least one processor is configured to match the sounds captured from the operating environment to a background noise from a plurality of background noises, andthe background noise information comprises an identification of the background noise matching the sounds captured from the operating environment;
select, based at least in part on the background noise information associated with the operating environment, a voice dialog from a plurality of voice dialogs, wherein;
the at least one processor is configured to select, based at least in part on the background noise matching the sounds captured from the operating environment, one or more grammars for use in carrying out the voice dialog with a user; and
perform automatic speech recognition, using the one or more grammars, on user speech captured from the operating environment.
8 Assignments
0 Petitions
Accused Products
Abstract
Methods, systems, and products for testing a grammar used in speech recognition for reliability in a plurality of operating environments having different background noise that include: receiving recorded background noise for each of the plurality of operating environments; generating a test speech utterance for recognition by a speech recognition engine using a grammar; mixing the test speech utterance with each recorded background noise, resulting in a plurality of mixed test speech utterances, each mixed test speech utterance having different background noise; performing, for each of the mixed test speech utterances, speech recognition using the grammar and the mixed test speech utterance, resulting in speech recognition results for each of the mixed test speech utterances; and evaluating, for each recorded background noise, speech recognition reliability of the grammar in dependence upon the speech recognition results for the mixed test speech utterance having that recorded background noise.
136 Citations
24 Claims
-
1. A system comprising at least one processor configured to:
-
analyze digital data representing sounds captured by at least one microphone from an operating environment to compute background noise information associated with the operating environment, wherein; the at least one processor is configured to match the sounds captured from the operating environment to a background noise from a plurality of background noises, and the background noise information comprises an identification of the background noise matching the sounds captured from the operating environment; select, based at least in part on the background noise information associated with the operating environment, a voice dialog from a plurality of voice dialogs, wherein; the at least one processor is configured to select, based at least in part on the background noise matching the sounds captured from the operating environment, one or more grammars for use in carrying out the voice dialog with a user; and perform automatic speech recognition, using the one or more grammars, on user speech captured from the operating environment. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A method comprising acts of:
-
analyzing digital data representing sounds captured by at least one microphone from an operating environment to compute background noise information associated with the operating environment, wherein; the act of analyzing comprises matching the sounds captured from the operating environment to a background noise from a plurality of background noises, and the background noise information comprises an identification of the background noise matching the sounds captured from the operating environment; selecting, based at least in part on the background noise information associated with the operating environment, a voice dialog from a plurality of voice dialogs, wherein; the act of selecting the voice dialog comprises selecting, based at least in part on the background noise matching the sounds captured from the operating environment, one or more grammars for use in carrying out the voice dialog with a user; and performing automatic speech recognition, using the one or more grammars, on user speech captured from the operating environment. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. At least one non-transitory computer-readable medium encoded with a plurality of instructions that, when executed, perform a method comprising acts of:
-
analyzing digital data representing sounds captured by at least one microphone from an operating environment to compute background noise information associated with the operating environment, wherein; the act of analyzing comprises matching the sounds captured from the operating environment to a background noise from a plurality of background noises, and the background noise information comprises an identification of the background noise matching the sounds captured from the operating environment; selecting, based at least in part on the background noise information associated with the operating environment, a voice dialog from a plurality of voice dialogs, wherein; the act of selecting the voice dialog comprises selecting, based at least in part on the background noise matching the sounds captured from the operating environment, one or more grammars for use in carrying out the voice dialog with a user; and performing automatic speech recognition, using the one or more grammars, on user speech captured from the operating environment. - View Dependent Claims (18, 19, 20, 21, 22, 23, 24)
-
Specification