Dual uplink pre-processing paths for machine and human listening
First Claim
1. A device for providing dual uplink processing paths, the device comprising:
- at least one semi-conductor processor;
a non-transitory computer-readable medium storing instructions, when executed by the at least one semi-conductor processor, are configured to implement;
parallel uplink processing paths including a first uplink processing path for processing and uploading audio signals adapted for human listening to at least one remote server and a second uplink processing path for processing and uploading audio signals adapted for machine listening to the at least one remote server, the first and second uplink processing paths being separate uplink communication links,the parallel uplink processing paths configured to receive an audio stream representing speech from a user and apply two different pre-processing algorithms separately and in parallel to generate a first audio signal adapted for human listening and a second audio signal adapted for machine listening, the two different pre-processing algorithms applying different noise reduction and compression techniques on the speech,the first uplink processing path configured to apply a first pre-processing algorithm to the audio stream to create the first audio signal adapted for human listening such that the first audio signal includes a non-linear gain, artifacts apart from the speech of the user, and multiple background sound levels, the second uplink processing path configured to apply a second pre-processing algorithm to the audio stream to create the second audio signal adapted for machine listening such that the second audio signal includes a linear gain and a substantially constant background sound level, the second audio signal being devoid of the artifacts of the first audio signal; and
a network interface unit configured to concurrently and separately transmit the first audio signal and the second audio signal to the at least one remote server such that the first audio signal is transmitted via the first uplink processing path and the second audio signal is transmitted via the second uplink processing path.
2 Assignments
0 Petitions
Accused Products
Abstract
In some implementations, a device for providing dual uplink processing paths may include a human listening (HL) input processing unit configured to receive an audio stream and pre-process the audio stream to create a first audio signal adapted for human listening via a first uplink processing path, a machine listening (ML) input processing unit configured to receive the audio stream and pre-process the audio stream to create a second audio signal adapted for machine listening via a second uplink processing path, and a network interface unit configured to transmit the first audio signal via the first uplink processing path and transmit the second audio signal via the second uplink processing path to a remote server.
39 Citations
19 Claims
-
1. A device for providing dual uplink processing paths, the device comprising:
-
at least one semi-conductor processor; a non-transitory computer-readable medium storing instructions, when executed by the at least one semi-conductor processor, are configured to implement; parallel uplink processing paths including a first uplink processing path for processing and uploading audio signals adapted for human listening to at least one remote server and a second uplink processing path for processing and uploading audio signals adapted for machine listening to the at least one remote server, the first and second uplink processing paths being separate uplink communication links, the parallel uplink processing paths configured to receive an audio stream representing speech from a user and apply two different pre-processing algorithms separately and in parallel to generate a first audio signal adapted for human listening and a second audio signal adapted for machine listening, the two different pre-processing algorithms applying different noise reduction and compression techniques on the speech, the first uplink processing path configured to apply a first pre-processing algorithm to the audio stream to create the first audio signal adapted for human listening such that the first audio signal includes a non-linear gain, artifacts apart from the speech of the user, and multiple background sound levels, the second uplink processing path configured to apply a second pre-processing algorithm to the audio stream to create the second audio signal adapted for machine listening such that the second audio signal includes a linear gain and a substantially constant background sound level, the second audio signal being devoid of the artifacts of the first audio signal; and a network interface unit configured to concurrently and separately transmit the first audio signal and the second audio signal to the at least one remote server such that the first audio signal is transmitted via the first uplink processing path and the second audio signal is transmitted via the second uplink processing path. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A method for processing an audio stream using dual pre-processing paths, the method being performed by at least one semi-conductor processor, the method including:
-
providing parallel uplink processing paths including a first uplink processing path for processing and uploading audio signals adapted for human listening to at least one remote server and a second uplink processing path for processing and uploading audio signals adapted for machine listening to the at least one remote server, the machine listening including speech-to-text conversion, the first and second uplink processing paths being separate uplink communication links; receiving an audio stream representing speech from a user; applying two different pre-processing algorithms separately and in parallel to generate a first audio signal adapted for human listening and a second audio signal adapted for machine listening, the two different pre-processing algorithms applying different noise reduction techniques on the speech when a bandwidth of the audio stream is decreased, the applying including, applying a first pre-processing algorithm to the audio stream in the first uplink processing path to create the first audio signal adapted for human listening such that the first audio signal includes a non-linear gain, artifacts apart from the speech of the user, and multiple background sound levels, applying a second pre-processing algorithm to the audio stream in the second uplink processing path to create the second audio signal adapted for machine listening such that the second audio signal includes a linear gain and a substantially constant background sound level, the second audio signal being devoid of the artifacts of the first audio signal; and concurrently and separately transmitting the first audio signal and the second audio signal to the at least one remote server such that the first audio signal is transmitted via a first uplink processing link and the second audio signal is transmitted via a second uplink processing link. - View Dependent Claims (8, 9, 10)
-
-
11. A non-transitory computer-readable medium storing executable instructions, when executed by at least one semi-conductor processor, are configured to:
-
provide parallel uplink processing paths including a first uplink processing path for processing and uploading server audio signals adapted for human listening to at least one remote server and a second uplink processing path for processing and uploading audio signals adapted for machine listening to the at least one remote server, the machine listening includes voice command recognition, the first and second uplink processing paths being separate uplink communication links; receive an audio stream representing speech from a user via at least one microphone; apply two different pre-processing algorithms separately and in parallel to generate a first audio signal adapted for human listening and a second audio signal adapted for machine listening, the two different pre-processing algorithms applying different noise reduction and suppression techniques on the speech, including, apply a first pre-processing algorithm to the audio stream in the first uplink processing path to create the first audio signal adapted for human listening such that the first audio signal includes a non-linear gain, artifacts apart from the speech of the user, and multiple background sound levels, apply a second pre-processing algorithm to the audio stream in the second uplink processing path to create the second audio signal adapted for machine listening such that the second audio signal includes a linear gain and a substantially constant background sound level, the second audio signal being devoid of the artifacts of the first audio signal; and concurrently and separately transmit the first audio signal and the second audio signal to the at least one remote server such that the first audio signal is transmitted via the first uplink processing path and the second audio signal is transmitted via the second uplink processing path. - View Dependent Claims (12, 13, 14)
-
-
15. A device comprising:
-
at least one semi-conductor processor; a non-transitory computer-readable medium storing instructions, when executed by the at least one semi-conductor processor, are configured to implement; parallel uplink processing paths including a first uplink processing path for processing and uploading audio signals adapted for human listening to at least one remote server and a second uplink processing path for processing and uploading audio signals adapted for machine listening to the at least one remote server, the machine listening being associated with a speech-to-text conversion application, the human listening being associated with a voice application, the first and second uplink processing paths being separate uplink communication links, the parallel uplink processing paths configured to receive an audio stream representing speech from a user and apply two different pre-processing algorithms in parallel to generate a first audio signal adapted for human listening and a second audio signal adapted for machine listening, the two different pre-processing algorithms applying different noise reduction techniques on the speech when a bandwidth of the audio stream is decreased, the first uplink processing path configured to apply a first pre-processing algorithm to the audio stream to create the first audio signal adapted for human listening such that the first audio signal includes a non-linear gain, artifacts apart from the speech of the user, and multiple background sound levels, the second uplink processing path configured to apply a second pre-processing algorithm to the audio stream to create the second audio signal adapted for machine listening such that the second audio signal includes a linear gain and a substantially constant background sound level, the second audio signal being devoid of the artifacts of the first audio signal; and a network interface unit configured to concurrently and separately transmit, over a network, the first audio signal and the second audio signal to the at least one remote server such that the first audio signal is transmitted via the first uplink processing path and the second audio signal is transmitted via the second uplink processing path, the network interface unit configured to receive, over the network, text information of the speech of the user corresponding to the second audio signal from the at least one remote server while the voice application is actively processing the first audio signal. - View Dependent Claims (16, 17, 18, 19)
-
Specification