Asynchronous transfer of audio data
Abstract
The systems, devices, and processes described herein may asynchronously transfer audio signals from a voice-controlled device to a remote device. The audio signals may correspond to sound that is captured by multiple microphones of the voice-controlled device, which may then process the audio signals. The audio signals may also be transferred to the remote device for processing. Moreover, a determination of whether the voice-controlled device or the remote device is to process the audio signals may be based at least in part on the bandwidth of a network communicatively coupled to the voice-controlled device. The voice-controlled device may also cache and log the audio signals, and then asynchronously stream the audio signals to the remote device after the audio signals are initially processed, which may be based on the bandwidth of the network. The remote device may utilize the unprocessed audio signals to improve subsequent processing of audio signals.
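The bandwidth-based routing and deferred upload described in the abstract can be sketched as follows. This is only an illustrative sketch: the class `AudioRouter`, the threshold `min_remote_kbps`, the per-frame size `frame_kbits`, and the rule that higher bandwidth favors remote processing are all assumptions made for the example; the abstract gives no numeric values and does not fix the direction of the decision.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class AudioRouter:
    """Hypothetical helper: picks a processor and caches raw audio for later upload."""

    # Assumed threshold; the source does not state any numeric value.
    min_remote_kbps: float = 256.0
    cache: deque = field(default_factory=deque)

    def choose_processor(self, bandwidth_kbps: float) -> str:
        # Assumption: enough bandwidth means the remote device processes the audio.
        return "remote" if bandwidth_kbps >= self.min_remote_kbps else "local"

    def handle(self, raw_frame: bytes, bandwidth_kbps: float) -> str:
        # Cache the unprocessed frame so it can be streamed asynchronously later.
        self.cache.append(raw_frame)
        return self.choose_processor(bandwidth_kbps)

    def drain(self, bandwidth_kbps: float, frame_kbits: float = 32.0) -> list:
        """Return the cached frames that fit in one second of available bandwidth."""
        budget = int(bandwidth_kbps // frame_kbits)
        sent = []
        while self.cache and len(sent) < budget:
            sent.append(self.cache.popleft())
        return sent


router = AudioRouter()
first_choice = router.handle(b"frame-a", bandwidth_kbps=100.0)
second_choice = router.handle(b"frame-b", bandwidth_kbps=500.0)
uploaded = router.drain(bandwidth_kbps=64.0)
```

In this sketch the cached frames leave the device only when `drain` is called, which is one way to model streaming the unprocessed audio after the initial processing has finished.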
42 Citations
25 Claims
1. A system comprising:
- a microphone configured to capture sound from a sound source within an environment and generate a first audio signal and a second audio signal based at least in part on data associated with the sound;
- memory;
- one or more processors;
- one or more computer executable instructions stored in the memory and executable by the one or more processors to:
  - at least partially process the first audio signal, wherein partially processing the first audio signal reduces at least one of noise or echo from the data;
  - send, to a remote device and at a first rate, the first audio signal; and
  - send, to the remote device and at a second rate that is different than the first rate, the second audio signal, the second audio signal having been processed different than the first audio signal; and
- the remote device being configured to:
  - perform speech recognition on the first audio signal based at least in part on an algorithm, the speech recognition including determining one or more words associated with the first audio signal;
  - perform speech recognition on the second audio signal based at least in part on an updated algorithm that is based at least in part on the algorithm; and
  - determine whether a first accuracy of speech recognition utilizing the updated algorithm is greater than a second accuracy of speech recognition utilizing the algorithm.
Dependent claims: 2, 3, 4, 5
6. A method comprising:
- receiving, during a first time period, at a first rate, and from a device associated with a microphone, a first audio signal generated based at least in part on data received from the microphone, the first audio signal corresponding to sound that is associated with a sound source, the first audio signal having been at least partially processed by the device to reduce at least one of noise or echo from the data;
- receiving, from the device, during a second time period that is subsequent to the first time period, and at a second rate that is slower than the first rate, a second audio signal, the second audio signal corresponding to the sound detected by the microphone and having been processed, by the device, different than the first audio signal;
- based at least in part on receiving the second audio signal, generating an updated algorithm associated with an algorithm used to process subsequent audio signals;
- performing speech recognition on the first audio signal based at least in part on the algorithm, the speech recognition including determining one or more words associated with the first audio signal;
- performing speech recognition on the second audio signal based at least in part on the updated algorithm; and
- determining whether a first accuracy of speech recognition utilizing the updated algorithm is greater than a second accuracy of speech recognition utilizing the algorithm.
Dependent claims: 7, 8, 9, 10, 11, 12, 13, 24, 25
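The timing relationship recited in claim 6 (a first signal arriving at a first rate, then a second signal arriving in a later time period at a slower rate) can be illustrated with a small pacing sketch. The function names `schedule_chunks` and `transfer_end`, the byte counts, and the rates are hypothetical values chosen for the example; the claim does not specify chunk sizes, units, or any particular transport.

```python
def schedule_chunks(total_bytes, chunk_bytes, rate_bytes_per_s, start_s=0.0):
    """Return (send_time_s, offset, length) tuples that pace a payload at a fixed rate."""
    plan = []
    offset = 0
    send_time = start_s
    while offset < total_bytes:
        length = min(chunk_bytes, total_bytes - offset)
        plan.append((send_time, offset, length))
        # The next chunk may start once this one has been sent at the given rate.
        send_time += length / rate_bytes_per_s
        offset += length
    return plan


def transfer_end(plan, rate_bytes_per_s):
    """Time at which the last chunk in the plan finishes transmitting."""
    last_time, _, last_length = plan[-1]
    return last_time + last_length / rate_bytes_per_s


# Assumed example values: the processed signal goes first at the faster rate.
FIRST_RATE = 8000.0   # bytes per second (hypothetical)
SECOND_RATE = 2000.0  # bytes per second (hypothetical, slower)

first_plan = schedule_chunks(8000, 4000, FIRST_RATE)
first_done = transfer_end(first_plan, FIRST_RATE)

# The second signal starts only after the first time period has ended.
second_plan = schedule_chunks(8000, 4000, SECOND_RATE, start_s=first_done)
second_done = transfer_end(second_plan, SECOND_RATE)
```

With these assumed numbers, the second transfer occupies a later and longer window than the first, which matches the "subsequent" and "slower" conditions of the claim.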
14. A system comprising:
- memory;
- one or more processors; and
- one or more computer-executable instructions stored in the memory and executable by the one or more processors to perform operations comprising:
  - receiving, during a first time period and from a device associated with a microphone, a first audio signal generated based at least in part on data received from the microphone, the audio signal corresponding to sound that is associated with a sound source, the first audio signal having been at least partially processed by the device to reduce at least one of noise or echo from the data;
  - receiving, during a second time period and from the device, a second audio signal, the second audio signal corresponding to the sound detected by the microphone and having been processed, by the device, different than the first audio signal;
  - based at least in part on receiving at least one of the first audio signal or the second audio signal, generating an algorithm used to process subsequent audio signal;
  - performing speech recognition on the first audio signal based at least in part on the algorithm, the speech recognition including determining one or more words associated with the first audio signal;
  - performing speech recognition on the second audio signal based at least in part on an updated algorithm that is based at least in part on the algorithm; and
  - determining whether a first accuracy of speech recognition utilizing the updated algorithm is greater than a second accuracy of speech recognition utilizing the algorithm.
Dependent claims: 15, 16, 17, 18, 19
20. A method comprising:
- receiving, during a first time period and from a device associated with a microphone, a first audio signal generated based at least in part on data received from the microphone, the audio signal corresponding to sound that is associated with a sound source, the first audio signal having been at least partially processed by the device to reduce at least one of noise or echo from the data;
- receiving, during a second time period and from the device, a second audio signal, the second audio signal corresponding to the sound detected by the microphone and having been processed, by the device, different than the first audio signal;
- based at least in part on at least one of the first audio signal or the second audio signal, determining an algorithm used to process subsequent audio signals;
- generating an updated algorithm based at least in part on the algorithm;
- performing speech recognition on the first audio signal based at least in part on the algorithm, the speech recognition including determining one or more words associated with the first audio signal;
- performing speech recognition on the second audio signal based at least in part on the updated algorithm; and
- determining whether a first accuracy of the speech recognition utilizing the updated algorithm is greater than a second accuracy of the speech recognition utilizing the algorithm.
Dependent claims: 21, 22, 23
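Each independent claim above ends by determining whether recognition with the updated algorithm is more accurate than recognition with the original algorithm. The claims do not say how accuracy is measured, so the sketch below assumes one common choice: word-level accuracy (one minus word error rate) against a reference transcript. The reference transcript, the function names, and the example sentences are all assumptions made for illustration.

```python
def word_edit_distance(ref_words, hyp_words):
    """Levenshtein distance between two word lists (substitute, insert, delete)."""
    prev = list(range(len(hyp_words) + 1))
    for i, ref_word in enumerate(ref_words, 1):
        cur = [i]
        for j, hyp_word in enumerate(hyp_words, 1):
            cur.append(min(
                prev[j] + 1,                           # deletion
                cur[j - 1] + 1,                        # insertion
                prev[j - 1] + (ref_word != hyp_word),  # substitution or match
            ))
        prev = cur
    return prev[-1]


def word_accuracy(reference: str, hypothesis: str) -> float:
    """Assumed accuracy metric: 1 - word error rate, floored at 0."""
    ref_words = reference.split()
    hyp_words = hypothesis.split()
    if not ref_words:
        return 1.0 if not hyp_words else 0.0
    errors = word_edit_distance(ref_words, hyp_words)
    return max(0.0, 1.0 - errors / len(ref_words))


def updated_is_better(reference: str, baseline_hyp: str, updated_hyp: str) -> bool:
    """True when the updated algorithm's transcript scores strictly higher."""
    return word_accuracy(reference, updated_hyp) > word_accuracy(reference, baseline_hyp)


# Hypothetical transcripts: the baseline runs on the first (processed) signal,
# the updated algorithm runs on the second signal.
reference = "turn on the kitchen light"
baseline_output = "turn on the chicken light"
updated_output = "turn on the kitchen light"
keep_update = updated_is_better(reference, baseline_output, updated_output)
```

A strict comparison is used here so that an update which merely ties the original algorithm is not treated as an improvement; that policy is a design choice of the sketch, not something the claims require.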
Specification