Speech recognition over packet networks
First Claim
Patent Images
1. A speech recognition system comprising:
- user equipment connected to a first side of a packet network for accepting speech input; and
for performing partial speech recognition of said speech input, and;
a speech recognition application server connected to a second side of said packet network for performing speech recognition on speech data corresponding to said speech input wherein;
said speech data includes feature extraction data from said partial recognition which is provided across said packet network to said speech recognition application server according to a first protocol and includes decompressed speech transmitted to the speech recognition application server via the packet network according to a second protocol, said speech recognition application server performing said speech recognition based upon said feature extraction data and said decompressed speech.
1 Assignment
0 Petitions
Accused Products
Abstract
In a system in which user equipment is connected to a packet network and a speech recognition application server is also connected to the packet network for performing speech recognition on speech data from the user equipment, a speech recognition system selectively performs feature extraction at a user end before transmitting speech data to be recognized. The feature extraction is performed only for speech which is to be recognized.
-
Citations
12 Claims
-
1. A speech recognition system comprising:
-
user equipment connected to a first side of a packet network for accepting speech input; and
for performing partial speech recognition of said speech input, and;
a speech recognition application server connected to a second side of said packet network for performing speech recognition on speech data corresponding to said speech input wherein;
said speech data includes feature extraction data from said partial recognition which is provided across said packet network to said speech recognition application server according to a first protocol and includes decompressed speech transmitted to the speech recognition application server via the packet network according to a second protocol, said speech recognition application server performing said speech recognition based upon said feature extraction data and said decompressed speech. - View Dependent Claims (2, 3, 4, 5, 6)
means for generating a recognition signal if speech recognition is to be performed by said speech recognition application server; and
means for receiving said recognition signal and for inhibiting said feature extraction and said transmission of extracted features unless said recognition signal is received.
-
-
7. In a system in which user equipment is connected to a first side of a packet network and a speech recognition application server is connected to a second side of said packet network for performing speech recognition on speech data, a method for implementing speech recognition across a packet network, comprising:
-
inputting speech to said user equipment;
extracting features from said speech by selectively performing partial speech recognition on said speech at said user equipment side of said packet network;
transmitting said extracted features across said network to said speech recognition application server according to a first codec selected for transmission of extracted features, and transmitting said speech across said network to said speech recognition application server according to a second codec selected for transmission of speech, and providing said speech and said extracted features as speech data to said speech recognition application server for recognition. - View Dependent Claims (8, 9, 10, 11, 12)
compressing the extracted features prior to transmitting them over the packet network.
-
-
11. A method as in claim 10 further comprising:
compressing the features using at least one of linear quantization and vector quantization.
-
12. A method for implementation of speech recognition across a packet network according to claim 7, further comprising the steps of:
-
providing an indication that speech recognition is to be performed at said second side of said packet network by said speech recognition application server;
receiving said indication at said first side of said packet network;
whereinsaid feature extraction is performed only upon receipt of said indication.
-
Specification