Method for processing speech signal features for streaming transport
DCFirst Claim
Patent Images
1. A method of formatting speech data for a distributed speech recognition system comprising the steps of:
- (a) capturing speech data uttered by a speaker at a client computing device;
(b) extracting acoustic features from said speech data,(c) representing said extracted acoustic features by speech symbols including speech vector data;
(d) converting said speech vector data to a byte stream;
wherein one or more NULL characters are included in said byte stream to indicate a termination of speech data from said client computing device, each of said one or more NULL characters comprising a plurality of zero value data bits;
further wherein other NULL characters present in said byte stream are removed prior to transmitting said byte stream.
3 Assignments
Litigations
0 Petitions
Accused Products
Abstract
Speech signal information is formatted, processed and transported in accordance with a format adapted for TCP/IP protocols used on the Internet and other communications networks. NULL characters are used for indicating the end of a voice segment. The method is useful for distributed speech recognition systems such as a client-server system, typically implemented on an intranet or over the Internet based on user queries at his/her computer, a PDA, or a workstation using a speech input interface.
-
Citations
23 Claims
-
1. A method of formatting speech data for a distributed speech recognition system comprising the steps of:
-
(a) capturing speech data uttered by a speaker at a client computing device; (b) extracting acoustic features from said speech data, (c) representing said extracted acoustic features by speech symbols including speech vector data; (d) converting said speech vector data to a byte stream; wherein one or more NULL characters are included in said byte stream to indicate a termination of speech data from said client computing device, each of said one or more NULL characters comprising a plurality of zero value data bits; further wherein other NULL characters present in said byte stream are removed prior to transmitting said byte stream. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A method of formatting speech data for a speech recognition system comprising the steps of:
-
capturing speech data at a client computing device, said speech data including query words spoken by a speaker in a speech utterance; extracting acoustic features including mel frequency cepstral coefficient (MFCC) speech vector data from said speech data on a continuous basis until silence is detected; converting said speech vector data, while said speech utterance is being spoken by said speaker, to a byte stream suitable for transport across an Internet based network connection; removing any NULL characters in said byte stream before said speech vector data is transmitted through said Internet based network connection; adding a NULL character to an end of said byte stream to indicate a termination of speech data from said client computing device. - View Dependent Claims (20, 21, 22, 23)
-
Specification