Metadata-based weighting of geotagged environmental audio for enhanced speech recognition accuracy
First Claim
1. A system comprising:
- one or more computers; and
a computer-readable medium coupled to the one or more computers having instructions stored thereon which, when executed by the one or more computers, cause the one or more computers to perform operations comprising;
receiving an audio signal that corresponds to an utterance recorded by a mobile device;
determining a geographic location associated with the mobile device;
identifying a set of geotagged audio signals that correspond to environmental audio associated with the geographic location;
weighting each geotagged audio signal of the set of geotagged audio signals based on metadata associated with the respective geotagged audio signal; and
using the set of weighted geotagged audio signals to perform noise compensation on the audio signal that corresponds to the utterance.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for enhancing speech recognition accuracy. In one aspect, a method includes receiving an audio signal that corresponds to an utterance recorded by a mobile device, determining a geographic location associated with the mobile device, identifying a set of geotagged audio signals that correspond to environmental audio associated with the geographic location, weighting each geotagged audio signal of the set of geotagged audio signals based on metadata associated with the respective geotagged audio signal, and using the set of weighted geotagged audio signals to perform noise compensation on the audio signal that corresponds to the utterance.
42 Citations
30 Claims
-
1. A system comprising:
-
one or more computers; and a computer-readable medium coupled to the one or more computers having instructions stored thereon which, when executed by the one or more computers, cause the one or more computers to perform operations comprising; receiving an audio signal that corresponds to an utterance recorded by a mobile device; determining a geographic location associated with the mobile device; identifying a set of geotagged audio signals that correspond to environmental audio associated with the geographic location; weighting each geotagged audio signal of the set of geotagged audio signals based on metadata associated with the respective geotagged audio signal; and using the set of weighted geotagged audio signals to perform noise compensation on the audio signal that corresponds to the utterance. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A computer storage medium encoded with a computer program, the program comprising instructions that when executed by one or more computers cause the one or more computers to perform operations comprising:
receiving an audio signal that corresponds to an utterance recorded by a mobile device; determining a geographic location associated with the mobile device; identifying a set of geotagged audio signals that correspond to environmental audio associated with the geographic location; weighting each geotagged audio signal of the set of geotagged audio signals based on metadata associated with the respective geotagged audio signal; and using the set of weighted geotagged audio signals to perform noise compensation on the audio signal that corresponds to the utterance.
-
20. A computer-implemented method comprising:
-
receiving an audio signal that corresponds to an utterance recorded by a mobile device; determining a geographic location associated with the mobile device; identifying a set of geotagged audio signals that correspond to environmental audio associated with the geographic location; weighting each geotagged audio signal of the set of geotagged audio signals based on metadata associated with the respective geotagged audio signal; and using the set of weighted geotagged audio signals to perform noise compensation on the audio signal that corresponds to the utterance. - View Dependent Claims (21, 22, 23, 24, 25, 26, 27, 28, 29, 30)
-
Specification