Third-party audio subsystem enhancement
First Claim
1. A system comprising:
- one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising;
receiving, from a mobile device or a mockup of the mobile device and by an automatic speech recognition (ASR) engine that processes voice search queries for a search engine, (i) a voice search query and (ii) data indicating that the voice search query represents audio test data for a test of an audio quality of an audio signal output by a test audio subsystem configuration of the mobile device or the mockup of the mobile device, wherein the voice search query includes a pre-recorded test utterance used for testing mobile devices or mockups of mobile devices;
generating, in response to receiving the data indicating that the voice search query represents audio test data for a test of the audio quality of the audio signal output by a test audio subsystem configuration of the mobile device or on the mockup of the mobile device, one or more audio quality metrics that reflect the audio quality of the audio signal output by the test audio subsystem configuration of the mobile device or the mockup of the mobile device; and
generating a response to the voice search query by the ASR engine, wherein the response references at least one of the one or more audio quality metrics.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing audio subsystem enhancement. In one aspect, a method includes: receiving a voice search query by an automatic speech recognition (ASR) engine that processes voice search queries for a search engine, wherein the voice search query includes an audio signal that corresponds to an utterance, and a test flag that indicates that an audio test is being performed; performing speech recognition on the audio signal to select one or more textual, candidate transcriptions that match the utterance; generating, in response to receiving the test flag, one or more audio quality metrics using the audio signal; and generating a response to the voice search query by the ASR engine, wherein the response references one or more of the candidate transcriptions and one or more of the audio quality metrics.
-
Citations
20 Claims
-
1. A system comprising:
one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising; receiving, from a mobile device or a mockup of the mobile device and by an automatic speech recognition (ASR) engine that processes voice search queries for a search engine, (i) a voice search query and (ii) data indicating that the voice search query represents audio test data for a test of an audio quality of an audio signal output by a test audio subsystem configuration of the mobile device or the mockup of the mobile device, wherein the voice search query includes a pre-recorded test utterance used for testing mobile devices or mockups of mobile devices; generating, in response to receiving the data indicating that the voice search query represents audio test data for a test of the audio quality of the audio signal output by a test audio subsystem configuration of the mobile device or on the mockup of the mobile device, one or more audio quality metrics that reflect the audio quality of the audio signal output by the test audio subsystem configuration of the mobile device or the mockup of the mobile device; and generating a response to the voice search query by the ASR engine, wherein the response references at least one of the one or more audio quality metrics. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
13. A non-transitory computer-readable medium storing software comprising instructions executable by one or more computers which, upon such execution, cause the one or more computers to perform operations comprising:
-
receiving, from a mobile device or a mockup of the mobile device and by an automatic speech recognition (ASR) engine that processes voice search queries for a search engine, (i) a voice search query and (ii) data indicating that the voice search query represents audio test data for a test of an audio quality of an audio signal output by a test audio subsystem configuration of the mobile device or the mockup of the mobile device, wherein the voice search query includes a pre-recorded test utterance used for testing mobile devices or mockups of mobile devices; generating, in response to receiving the data indicating that the voice search query represents audio test data for a test of the audio quality of the audio signal output by a test audio subsystem configuration of the mobile device or on the mockup of the mobile device, one or more audio quality metrics that reflect the audio quality of the audio signal output by the test audio subsystem configuration of the mobile device or the mockup of the mobile device; and generating a response to the voice search query by the ASR engine, wherein the response references at least one of the one or more audio quality metrics. - View Dependent Claims (14, 15, 16)
-
-
17. A computer-implemented method comprising:
-
receiving, from a mobile device or a mockup of the mobile device and by an automatic speech recognition (ASR) engine that processes voice search queries for a search engine, (i) a voice search query and (ii) data indicating that the voice search query represents audio test data for a test of an audio quality of an audio signal output by a test audio subsystem configuration of the mobile device or the mockup of the mobile device, wherein the voice search query includes a pre-recorded test utterance used for testing mobile devices or mockups of mobile devices; generating, in response to receiving the data indicating that the voice search query represents audio test data for a test of the audio quality of the audio signal output by a test audio subsystem configuration of the mobile device or on the mockup of the mobile device, one or more audio quality metrics that reflect the audio quality of the audio signal output by the test audio subsystem configuration of the mobile device or the mockup of the mobile device; and generating a response to the voice search query by the ASR engine, wherein the response references at least one of the one or more audio quality metrics. - View Dependent Claims (18, 19, 20)
-
Specification