Method and apparatus of audio switching
First Claim
1. A method, implemented by a speech-switching device connected to a communication network, for speech switching between multiple terminals, comprising steps of:
(1) extracting, by the speech-switching device, a mute flag from encoded speech data transmitted by each of the terminals respectively, and determining one or more non-mute terminals;
(2) non-fully decoding, by the speech-switching device, the encoded speech data of each non-mute terminal respectively;
(3) calculating, by the speech-switching device, a speech energy value of each non-mute terminal according to the non-fully decoded speech data of each non-mute terminal;
(4) comparing, by the speech-switching device, the speech energy values of the non-mute terminals with each other and selecting one or more terminals with a largest speech energy;
(5) fully decoding, by the speech-switching device, the encoded speech data transmitted by each remaining selected terminal; and
(6) performing, by the speech-switching device, a linear superposition of different combination modes to the decoded speech data of the selected terminals, encoding, by the speech-switching device, the decoded speech data obtained after the linear superposition of the different combination modes, and transmitting the encoded data to the corresponding terminals respectively;
wherein when the terminals adopt speech coding/decoding protocol G.728, the non-fully decoding comprises:
extracting corresponding code vectors from excitation codebooks according to codebook indexes contained in the encoded speech data transmitted by the terminals;
performing a logarithmic gain prediction to the extracted code vectors; and
performing a decoding synthesis filtering process to the code vectors after the logarithmic gain prediction in order to obtain the corresponding non-fully decoded speech data.
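The three non-full decoding operations named in the claim (code vector lookup, logarithmic gain prediction, synthesis filtering) and the energy calculation of step (3) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration: the 4-entry `CODEBOOK`, the fixed predictor weights, and the fixed filter coefficients are toy stand-ins, not the G.728 tables or its backward-adaptive procedures, and the function names are hypothetical.

```python
import math

# Hypothetical toy excitation codebook of 5-sample vectors (NOT the G.728 tables).
CODEBOOK = [
    [1.0, 0.0, 0.0, 0.0, 0.0],
    [0.5, 0.5, 0.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, -1.0, 0.0],
    [-0.5, 0.0, 0.5, 0.0, 0.25],
]


def predict_log_gain(history_db, weights):
    """Predict the next gain (in dB) as a weighted sum of past log-gains."""
    return sum(w * g for w, g in zip(weights, history_db))


def synthesis_filter(excitation, coeffs, state):
    """All-pole filter y[n] = x[n] - sum_k coeffs[k] * y[n-1-k].

    `state` holds past outputs, most recent first.
    """
    out = []
    for x in excitation:
        y = x - sum(a * s for a, s in zip(coeffs, state))
        state = [y] + state[:-1]
        out.append(y)
    return out, state


def partial_decode(indexes, gain_weights, lpc_coeffs):
    """Turn codebook indexes into speech samples without any post-processing."""
    history_db = [0.0] * len(gain_weights)
    state = [0.0] * len(lpc_coeffs)
    samples = []
    for idx in indexes:
        shape = CODEBOOK[idx]                        # code vector lookup
        gain_db = predict_log_gain(history_db, gain_weights)
        gain = 10 ** (gain_db / 20)                  # dB back to linear gain
        excitation = [gain * v for v in shape]
        # Feed the log RMS of this excitation back into the gain history.
        rms = math.sqrt(sum(e * e for e in excitation) / len(excitation))
        history_db = [20 * math.log10(max(rms, 1e-6))] + history_db[:-1]
        block, state = synthesis_filter(excitation, lpc_coeffs, state)
        samples.extend(block)
    return samples


def frame_energy(samples):
    """Mean squared amplitude, used here as the speech energy value."""
    return sum(s * s for s in samples) / len(samples)
```

A call such as `frame_energy(partial_decode(indexes, gain_weights, lpc_coeffs))` would give one energy value per terminal for the comparison in step (4). Which decoder stages the claimed non-full decoding leaves out is not stated in the claim text, so the sketch simply stops after the synthesis filter.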
Abstract
A method for speech switching, including: extracting mute flags from encoded speech data transmitted by each of the terminals respectively in order to determine one or more non-mute terminals; decoding the encoded speech data of each non-mute terminal respectively; calculating the speech energy of each non-mute terminal according to the decoded speech data of each non-mute terminal; comparing the speech energy of the non-mute terminals and selecting one or more terminals with relatively large speech energy; performing linear superposition with different combination methods on the decoded speech data of the selected terminals; encoding the decoded speech data obtained from the linear superposition of the different combination methods; and transmitting the encoded data to the corresponding terminals respectively. An apparatus for speech switching is also disclosed. With this invention, resource consumption during speech switching may be reduced and the quality of speech communication may be improved.
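The selection and mixing flow in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: frames are plain lists of 16-bit PCM samples, the same samples are used for both the energy calculation and the mixing, the final encoding step is omitted, and "different combination methods" is read as "each terminal receives the sum of the selected terminals other than itself". All function names are hypothetical.

```python
def energy(pcm):
    """Mean squared amplitude of one frame (0.0 for an empty frame)."""
    return sum(s * s for s in pcm) / len(pcm) if pcm else 0.0


def select_loudest(frames, muted, n=3):
    """Drop terminals whose mute flag is set, then keep the n loudest.

    frames: dict mapping terminal id -> list of PCM samples
    muted:  set of terminal ids whose mute flag is set
    """
    active = {t: f for t, f in frames.items() if t not in muted}
    ranked = sorted(active, key=lambda t: energy(active[t]), reverse=True)
    return ranked[:n]


def mix(frames, terminals):
    """Linear superposition: sample-wise sum, clipped to the 16-bit range."""
    if not terminals:
        return []
    length = min(len(frames[t]) for t in terminals)
    return [
        max(-32768, min(32767, sum(frames[t][i] for t in terminals)))
        for i in range(length)
    ]


def build_outputs(frames, muted, n=3):
    """Build one mixed frame per terminal, excluding each terminal's own speech."""
    selected = select_loudest(frames, muted, n)
    outputs = {}
    for t in frames:
        # A selected terminal hears the other selected terminals;
        # every other terminal hears all selected terminals.
        others = [s for s in selected if s != t]
        outputs[t] = mix(frames, others)
    return outputs
```

With N selected terminals this produces at most N + 1 distinct mixes (one per selected terminal plus one shared by everyone else), so the number of encoder passes depends on N rather than on the total number of terminals.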
50 Citations
12 Claims
1. A method, implemented by a speech-switching device connected to a communication network, for speech switching between multiple terminals, comprising steps of:
(1) extracting, by the speech-switching device, a mute flag from encoded speech data transmitted by each of the terminals respectively, and determining one or more non-mute terminals;
(2) non-fully decoding, by the speech-switching device, the encoded speech data of each non-mute terminal respectively;
(3) calculating, by the speech-switching device, a speech energy value of each non-mute terminal according to the non-fully decoded speech data of each non-mute terminal;
(4) comparing, by the speech-switching device, the speech energy values of the non-mute terminals with each other and selecting one or more terminals with a largest speech energy;
(5) fully decoding, by the speech-switching device, the encoded speech data transmitted by each remaining selected terminal; and
(6) performing, by the speech-switching device, a linear superposition of different combination modes to the decoded speech data of the selected terminals, encoding, by the speech-switching device, the decoded speech data obtained after the linear superposition of the different combination modes, and transmitting the encoded data to the corresponding terminals respectively;
wherein when the terminals adopt speech coding/decoding protocol G.728, the non-fully decoding comprises:
extracting corresponding code vectors from excitation codebooks according to codebook indexes contained in the encoded speech data transmitted by the terminals;
performing a logarithmic gain prediction to the extracted code vectors; and
performing a decoding synthesis filtering process to the code vectors after the logarithmic gain prediction in order to obtain the corresponding non-fully decoded speech data.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9.
10. An apparatus for speech switching between multiple terminals, comprising:
a first mute flag extracting unit, configured to extract a mute flag from encoded speech data transmitted by each terminal respectively, and determine one or more non-mute terminals;
a first decoding unit, connected with the first mute flag extracting unit, and configured to non-fully decode the encoded speech data of the non-mute terminals after the mute flags are extracted;
a speech energy calculating unit, connected with the first decoding unit, and configured to calculate the speech energy value of each non-mute terminal according to the decoded speech data of each non-mute terminal;
a speech energy comparing unit, connected with the speech energy calculating unit, and configured to compare the speech energy values of the non-mute terminals with each other and select one or more terminals with a largest speech energy;
a second decoding unit configured to fully decode the encoded speech data transmitted by the terminals with the largest speech energy, wherein the speech energy comparing unit is connected with a linear superposition unit through the second decoding unit;
a linear superposition unit, connected with the speech energy comparing unit, and configured to perform a linear superposition in different combination methods to the decoded speech data of the selected terminals; and
an encoding unit, connected with the linear superposition unit, and configured to encode the decoded speech data after linear superposition and transmit the data to the corresponding terminals;
when the terminals adopt speech encoding/decoding protocol G.728, the first decoding unit further comprises:
an excitation codebook searcher, configured to extract the corresponding code vectors from excitation codebook according to the codebook indexes contained in the encoded speech data transmitted by the terminals;
a logarithmic gain predictor, configured to perform a logarithmic gain prediction to the extracted code vectors; and
a decoding synthesis filter, configured to perform a decoding synthesis filtering process to the code vectors after the logarithmic gain prediction in order to obtain the corresponding non-fully decoded speech data.
Dependent claims: 11, 12.
Specification