Automated testing of voice recognition software
First Claim
Patent Images
1. A method for testing a voice enabled application on a target device, the method comprising conducting one or more interactions with the target device, at least some of the interactions comprising:
- selecting one of a plurality of input modes for sending input to the target device;
presenting an acoustic utterance in an acoustic environment to the target device, including presenting a noise signal to the target device, using the selected input mode;
determining one of a plurality of response modes for responding to an output of the target device;
receiving an output of the target device in response to the acoustic utterance and the noise signal according to the determined response mode; and
comparing the output to an output expected from the acoustic utterance;
wherein the selected input mode and the determined response mode depend on input/output capabilities of the target devicewherein presenting the acoustic utterance further comprises generating the acoustic utterance using an acoustic speaker;
wherein the speaker comprises an artificial human mouth;
wherein the acoustic environment is produced using an acoustic noise source that generates the noise signal, the noise signal representing one or more environmental noises of a natural environment.
8 Assignments
0 Petitions
Accused Products
Abstract
A method and a system for testing a voice enabled application on a target device, the method including conducting one or more interactions with the target device, at least some of the interactions including presenting an acoustic utterance in an acoustic environment to the target device, receiving an output of the target device in response to the acoustic utterance, and comparing the output to an output expected from the acoustic utterance.
26 Citations
45 Claims
-
1. A method for testing a voice enabled application on a target device, the method comprising conducting one or more interactions with the target device, at least some of the interactions comprising:
-
selecting one of a plurality of input modes for sending input to the target device; presenting an acoustic utterance in an acoustic environment to the target device, including presenting a noise signal to the target device, using the selected input mode; determining one of a plurality of response modes for responding to an output of the target device; receiving an output of the target device in response to the acoustic utterance and the noise signal according to the determined response mode; and comparing the output to an output expected from the acoustic utterance; wherein the selected input mode and the determined response mode depend on input/output capabilities of the target device wherein presenting the acoustic utterance further comprises generating the acoustic utterance using an acoustic speaker; wherein the speaker comprises an artificial human mouth; wherein the acoustic environment is produced using an acoustic noise source that generates the noise signal, the noise signal representing one or more environmental noises of a natural environment. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24)
-
-
25. A system for testing a voice enabled application on a target device, the system comprising:
-
the target device; a speaker configured to send sound to the target device; a noise source configured to send a noise signal to the target device; wherein the noise source comprises an acoustic noise source configured to generate the noise signal to produce the acoustic environment, the acoustic noise source replicating one or more noises of a real environment; a computer configured to conduct one or more interactions with the target device, including selecting one of a plurality of input modes for sending input to the target device and determining one of a plurality of response modes for responding to an output of the target device, at least some of the interactions comprising; sending commands to the target device using the selected input mode and receiving communications from the target device using the determined response mode; presenting an acoustic utterance in an acoustic environment to the target device; receiving an output of the target device in response to the acoustic utterance; comparing the output to an output expected from the acoustic utterance wherein the selected input mode and the determined response mode depend on input/output capabilities of the target device; wherein presenting the acoustic utterance further comprises generating the acoustic utterance using the speaker; wherein the speaker is an artificial human mouth. - View Dependent Claims (26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45)
-
Specification