System and method for transmitting voice input from a remote location over a wireless data channel
First Claim
Patent Images
1. A method for digital signal manipulation, comprising:
- receiving an acoustic analog signal at a user system;
converting the analog signal to a digital signal;
canceling noise from the digital signal to form a processed digital signal;
detecting user speech in the processed digital signal by evaluating change in amplitude sign of the processed digital signal; and
if user speech is detected in the processed digital signal,packaging the user speech into speech packets to form a packaged voice signal;
selecting a transmission format compatible with the packaged voice signal; and
transmitting the packaged voice signal to a server;
receiving non acoustic data at the user system; and
if user speech is not detected in the processed digital signal,packaging the non acoustic data into data packets to form a packaged data signal;
selecting a transmission format compatible with the packaged data signal; and
transmitting the packaged data signal to a server.
23 Assignments
0 Petitions
Accused Products
Abstract
A system and method for improving voice recognition processing at a server system that receives voice input from a remotely located user system. The user system includes a microphone, a processor that performs front-end voice recognition processing of the received user voice input, and a communication component configured to send the front-end processed user voice input to a destination wirelessly over a network. The server system includes a communication component configured to receive the sent front-end processed user voice input, and a processor configured to complete voice recognition processing of the sent front-end processed user voice input.
155 Citations
13 Claims
-
1. A method for digital signal manipulation, comprising:
-
receiving an acoustic analog signal at a user system; converting the analog signal to a digital signal; canceling noise from the digital signal to form a processed digital signal; detecting user speech in the processed digital signal by evaluating change in amplitude sign of the processed digital signal; and if user speech is detected in the processed digital signal, packaging the user speech into speech packets to form a packaged voice signal; selecting a transmission format compatible with the packaged voice signal; and transmitting the packaged voice signal to a server; receiving non acoustic data at the user system; and if user speech is not detected in the processed digital signal, packaging the non acoustic data into data packets to form a packaged data signal; selecting a transmission format compatible with the packaged data signal; and transmitting the packaged data signal to a server. - View Dependent Claims (2)
-
-
3. A method for digital signal manipulation, comprising:
-
receiving an acoustic analog signal at a user system; converting the analog signal to a digital signal; canceling noise from the digital signal to form a processed digital signal; detecting user speech in the processed digital signal by evaluating change in amplitude sign of the processed digital signal; and if user speech is detected in the processed digital signal, packaging the user speech into speech packets to form a packaged voice signal; selecting a transmission format compatible with the packaged voice signal; transmitting the packaged voice signal to a server; evaluating the processed digital signal to determine whether data exist that enhances speech detection and matching at the server; and if data from the processed digital signal exist to be transmitted to the server to enhance speech detection and matching at the server, packaging the data from the processed digital signal into data packets; and interspersing data packets with the voice packets.
-
-
4. A method for digital signal manipulation, comprising:
-
receiving an acoustic analog signal at a user system; converting the analog signal to a digital signal; canceling noise and echoes from the digital signal to form a processed digital signal; detecting user speech in the processed digital signal by evaluating change in amplitude sign of the processed digital signal; and if user speech is detected in the processed digital signal, packaging the user speech into speech packets to form a packaged voice signal; selecting a transmission format compatible with the packaged voice signal; and transmitting the packaged voice signal to a server; if user speech is not detected in the processed digital signal, receiving non acoustic data at the user system; packaging the processed digital signal into data packets to form a packaged data signal; selecting a transmission format compatible with the packaged data signal; and transmitting the packaged data signal to a server. - View Dependent Claims (5)
-
-
6. A method for digital signal manipulation, comprising:
-
receiving an acoustic analog signal at a user system; converting the analog signal to a digital signal; canceling noise and echoes from the digital signal to form a processed digital signal; detecting user speech in the processed digital signal by evaluating change in amplitude sign of the processed digital signal; and if user speech is detected in the processed digital signal, packaging the user speech into speech packets to form a packaged voice signal; selecting a transmission format compatible with the packaged voice signal; and transmitting the packaged voice signal to a server; evaluating the processed digital signal to determine whether data exist that enhances speech detection and matching at the server; and if data from the processed digital signal exists to be transmitted to the server to enhance speech detection and matching at the server, packaging the data from the processed digital signal into data packets; and interspersing data packets with the voice packets.
-
-
7. A system comprising:
-
a user system configured for receiving an acoustic analog signal and converting to a digital signal, where at the user system the system further comprises; a processor having; a first algorithm for canceling noise and echoes from the digital signal to form a processed digital signal; a second algorithm for detecting speech in the processed digital signal by examining for the change in amplitude sign and the rate of amplitude change in the processed digital signal; a third algorithm for packaging the processed digital signal with data or speech packets in accordance with the detected user speech to form a packaged voice signal; a fourth algorithm for selecting a transmission format in accord with the packaged voice signal; and means for transmitting the digital signal to a server, the server having a plurality of algorithms, wherein the plurality of algorithms match the speech content of the packaged voice signal with instructions stored in the server database to execute the instructions, wherein if user speech is detected, a determination is made whether data needs to be added to the packaged voice signal to enhance speech detection and matching at the server. - View Dependent Claims (8, 9)
-
-
10. A system comprising:
-
means for receiving at a user an acoustic analog signal and converting to a digital signal, where at the user system the system further comprises; means for canceling noise and echoes from the digital signal to form a processed digital signal; means for detecting user speech in the processed digital signal by examining for the change in amplitude sign and the rate of amplitude change in the processed digital signal; means for packaging the processed digital signal with data or speech packets in accordance with the user detected user speech to form a packaged voice signal; means for selecting a transmission format in accord with the packaged voice signal; means for transmitting the packaged voice signal to a server system, where the server system further includes; means for matching the speech content of the packaged voice signal with instructions stored in the server database; and means for executing the instructions; if user speech is not detected in the processed digital signal, further comprising; means for receiving non acoustic data at the user system; means for packaging the processed digital signal into data packets to form a packaged data signal; means for selecting a transmission format compatible with the packaged data signal; and means for transmitting the packaged data signal to a server.
-
-
11. A system comprising:
-
means for receiving at a user an acoustic analog signal and converting to a digital signal, where at the user system the system further comprises; means for canceling noise and echoes from the digital signal to form processed digital Signal; means for detecting user speech in the processed digital signal by examining for the change in amplitude sign and the rate of amplitude change in the processed digital signal; means for packaging the processed digital signal with data or speech packets in accordance with the user detected user speech to form a packaged voice signal; means for selecting a transmission format in accord with the packaged voice signal; means for transmitting the packaged voice signal to a server system, where the server system further includes; means for matching the speech content of the packaged voice signal with instructions stored in the server database; and means for executing the instructions; if user speech is detected in the processed digital signal, further comprising; means for evaluating the additional digital data to determine whether data exist that enhances speech detection and matching at the server; and if data from the processed digital signal exists to be transmitted to the server to enhance speech detection and matching at the server, means for packaging the data from the processed digital signal into data packets, and means for interspersing data packets with the voice packets.
-
-
12. A system comprising:
-
means for receiving at a user an acoustic analog signal and converting to a digital signal, where at the user system the system further comprises; means for canceling noise and echoes from the digital signal to form a processed digital signal; means for detecting user speech in the processed digital signal by examining for the change in amplitude sign and the rate of amplitude change in the processed digital signal; means for packaging the processed digital signal with data or speech packets in accordance with the user detected user speech to form a packaged voice signal; means for selecting a transmission format in accord with the packaged voice signal; means for transmitting the packaged voice signal to a server system, where the server system further includes; means for matching the speech content of the packaged voice signal with instructions stored in the server database; and means for executing the instructions; wherein if user speech is detected, a determination is made whether data needs to be added to the packaged voice signal to enhance speech detection and matching at the server. - View Dependent Claims (13)
-
Specification