Method and Apparatus for Transmitting Speech Data To a Remote Device In a Distributed Speech Recognition System
1 Assignment
0 Petitions
Accused Products
Abstract
A method of transmitting speech data to a remote device in a distributed speech recognition system, includes the steps of: dividing an input speech signal into frames; calculating, for each frame, a voice activity value representative of the presence of speech activity in the frame; grouping the frames into multiframes, each multiframe including a predetermined number of frames; calculating, for each multiframe, a voice activity marker representative of the number of frames in the multiframe representing speech activity; and selectively transmitting, on the basis of the voice activity marker associated with each multiframe, the multiframes to the remote device.
-
Citations
36 Claims
-
1-18. -18. (canceled)
-
19. A method of transmitting speech data to a remote device in a distributed speech recognition system, comprising:
-
dividing an input speech signal into frames; calculating, for each frame, a voice activity value representative of the presence of speech activity in said frame; and grouping said frames into multiframes, each multiframe comprising a predetermined number of frames;
comprising the steps of;calculating, for each multiframe, a voice activity marker representative of the number of frames in said multiframe having a voice activity value representing speech activity, said voice activity marker being indicative of speech activity in said multiframe; and selectively transmitting, on the basis of said voice activity marker associated with each multiframe, said multiframes to said remote device. - View Dependent Claims (20, 21, 22, 23, 24, 25, 26, 27, 28, 35, 36)
-
-
29. A user terminal comprising a front-end module of a speech recognition system distributed over a communications network, said front end module comprising:
-
a feature extraction block for dividing an input speech signal into frames, and for calculating, for each frame, a voice activity value representative of the presence of speech activity in said frame; a bitstream formatting block for grouping said frames into multiframes, each multiframe comprising a predetermined number of frames; said front-end module further comprising; a marker block for calculating, for each multiframe, a voice activity marker representative of the number of frames in said multiframe having a voice activity value representing speech activity, said voice activity marker being indicative of speech activity in said multiframe; and a decision block for selectively transmitting, on the basis of said voice activity marker associated with each multiframe, said multiframes over said communications network to a remote back-end module of said distributed speech recognition system. - View Dependent Claims (30, 31, 32, 33, 34)
-
Specification