Methods for generating comfort noise during discontinuous transmission
Abstract
An improved method for generating comfort noise (CN) in a mobile terminal operating in a discontinuous transmission (DTX) mode. In one embodiment the invention provides an improved method for comfort noise generation, in which a random excitation is modified by a spectral control filter so that the frequency content of comfort noise and background noise become similar. In another embodiment the transmitter identifies speech coding parameters that are not representative of the actual background noise, and replaces the identified parameters with parameters having a median value. In this manner the non-representative parameters do not skew the result of an averaging operation.
121 Claims
-
1. A method for producing comfort noise (CN) in a digital mobile terminal that uses a discontinuous transmission, comprising the steps of:
-
in response to a speech pause, calculating random excitation spectral control (RESC) parameters; and
transmitting the RESC parameters to a receiver together with predetermined ones of CN parameters.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
where b(i) represents filter coefficients, with i=1, . . . , R.
-
-
9. A method as in claim 7, and further comprising a step of determining an excitation gain from the spectrally flattened residual signal.
-
10. A method as in claim 1, wherein the step of shaping includes steps of:
-
forming an excitation by generating a white noise excitation sequence;
scaling the generated white noise sequence to produce a scaled noise sequence; and
processing the scaled noise sequence in a RESC filter to produce an excitation having a desired spectral content.
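The three shaping steps recited in claim 10 can be sketched in Python as follows. This is only an illustration under stated assumptions: the function names, the Gaussian noise source, the seed parameter, and the all-pole recursion y[n] = x[n] + sum of b(i)·y[n-i] are choices made for the example, since the filter expression is not reproduced in this listing.

```python
import random


def resc_filter(x, b):
    """All-pole shaping filter (assumed form): y[n] = x[n] + sum_i b[i-1] * y[n-i]."""
    y = []
    for n, xn in enumerate(x):
        acc = xn
        for i, bi in enumerate(b, start=1):
            if n - i >= 0:
                acc += bi * y[n - i]
        y.append(acc)
    return y


def comfort_noise_excitation(n_samples, gain, b, seed=0):
    """Form, scale, and spectrally shape a random excitation."""
    rng = random.Random(seed)  # hypothetical noise source for the sketch
    # Step 1: generate a white noise excitation sequence.
    white = [rng.gauss(0.0, 1.0) for _ in range(n_samples)]
    # Step 2: scale the white noise sequence.
    scaled = [gain * w for w in white]
    # Step 3: process the scaled sequence in the RESC filter.
    return resc_filter(scaled, b)
```

With an empty coefficient list the filter passes the scaled noise through unchanged, which corresponds to an unshaped (white) excitation.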
-
-
11. A method as in claim 1, wherein the step of calculating RESC parameters includes a step of:
-
applying an LPC residual signal from a speech coder inverse filter to a RESC inverse filter HRESC(z) to produce a spectrally controlled residual signal which generally has a flatter spectrum than the LPC residual signal, wherein the RESC inverse filter HRESC(z) has the form of an all-zero filter described by;
where b(i) represents filter coefficients, with i=1, . . . ,R; and
wherein the step of shaping includes steps of, forming an excitation by generating a white noise excitation sequence;
scaling the generated white noise sequence to produce a scaled noise sequence; and
processing the scaled noise sequence in a RESC filter to produce an excitation having a desired spectral content;
wherein the RESC filter performs an inverse operation to the RESC inverse filter and is of the form;
-
-
12. A method as in claim 1, wherein RESC parameters rmean(i), i=1, . . . ,R define the filter coefficients b(i), i=1, . . . , R, are transmitted as part of the predetermined ones of the CN parameters, and are used in the RESC filter to spectrally weight the excitation for the synthesis filter.
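The transmit-side operation of claims 9 and 11 (flattening the LPC residual with an all-zero filter, then deriving an excitation gain) might look like the Python sketch below. The sign convention e[n] = r[n] - sum of b(i)·r[n-i] and the use of an RMS level as the gain are assumptions for illustration; the claim text in this listing does not reproduce the filter expression or define how the gain is computed.

```python
import math


def resc_inverse_filter(residual, b):
    """All-zero (FIR) filter, assumed form: e[n] = r[n] - sum_i b[i-1] * r[n-i]."""
    out = []
    for n, rn in enumerate(residual):
        acc = rn
        for i, bi in enumerate(b, start=1):
            if n - i >= 0:
                acc -= bi * residual[n - i]
        out.append(acc)
    return out


def excitation_gain(flattened):
    """RMS level of the flattened residual (a hypothetical gain definition)."""
    return math.sqrt(sum(v * v for v in flattened) / len(flattened))
```

Under these assumptions, a residual shaped by a one-pole recursion with b(1)=0.5, such as [1.0, 0.5, 0.25], is flattened back to a single impulse, which is the sense in which the RESC filter and the RESC inverse filter undo each other.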
-
13. Apparatus for generating comfort noise (CN) in a system that uses a discontinuous transmission to a network, comprising:
-
means in said digital mobile terminal that is responsive to a speech pause for calculating random excitation spectral control (RESC) parameters and for transmitting the RESC parameters together with predetermined ones of CN parameters to a receiver in said network.
Dependent claims: 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24
where b(i) represents filter coefficients, with i=1, . . . , R.
-
-
21. Apparatus as in claim 19, and further comprising means for determining an excitation gain from the spectrally flattened residual signal.
-
22. Apparatus as in claim 13, wherein said shaping means is comprised of:
-
means for forming an excitation by generating a white noise excitation sequence;
means for scaling the generated white noise sequence to produce a scaled noise sequence; and
means for processing the scaled noise sequence in a RESC filter to produce an excitation having a desired spectral content.
-
-
23. Apparatus as in claim 13, wherein said calculating means is comprised of:
-
means for applying an LPC residual signal from a speech coder inverse filter to a RESC inverse filter HRESC(z) to produce a spectrally controlled residual signal which generally has a flatter spectrum than the LPC residual signal, wherein the RESC inverse filter HRESC(z) has the form of an all-zero filter described by;
where b(i) represents filter coefficients, with i=1, . . . ,R; and
wherein said shaping means is comprised of, means for forming an excitation by generating a white noise excitation sequence;
means for scaling the generated white noise sequence to produce a scaled noise sequence; and
means for processing the scaled noise sequence in a RESC filter to produce an excitation having a desired spectral content;
wherein the RESC filter performs an inverse operation to the RESC inverse filter and is of the form;
-
-
24. Apparatus as in claim 23, wherein RESC parameters rmean(i), i=1, . . . ,R define the filter coefficients b(i), i=1, . . . , R, are transmitted as part of the predetermined ones of the CN parameters, and are used in the RESC filter to spectrally weight the excitation for the synthesis filter.
-
25. A method for producing comfort noise (CN) in a digital mobile terminal receiver that uses a discontinuous transmission, comprising the steps of:
-
receiving random excitation spectral control (RESC) parameters; and
shaping the spectral content of an excitation using the received RESC parameters prior to applying the excitation to a synthesis filter.
Dependent claims: 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36
where b(i) represents filter coefficients, with i=1, . . . ,R.
-
-
33. A method as in claim 31, and further comprising a step of determining an excitation gain from the spectrally flattened residual signal.
-
34. A method as in claim 25, wherein the step of shaping includes steps of:
-
forming an excitation by generating a white noise excitation sequence;
scaling the generated white noise sequence to produce a scaled noise sequence; and
processing the scaled noise sequence in a RESC filter to produce an excitation having a desired spectral content.
-
-
35. A method as in claim 25, wherein the step of calculating RESC parameters includes a step of:
-
applying an LPC residual signal from a speech coder inverse filter to a RESC inverse filter HRESC(z) to produce a spectrally controlled residual signal which generally has a flatter spectrum than the LPC residual signal, wherein the RESC inverse filter HRESC(z) has the form of an all-zero filter described by;
where b(i) represents filter coefficients, with i=1, . . . ,R; and
wherein the step of shaping includes steps of, forming an excitation by generating a white noise excitation sequence;
scaling the generated white noise sequence to produce a scaled noise sequence; and
processing the scaled noise sequence in a RESC filter to produce an excitation having a desired spectral content;
wherein the RESC filter performs an inverse operation to the RESC inverse filter and is of the form;
-
-
36. A method as in claim 35, wherein RESC parameters rmean(i), i=1, . . . ,R define the filter coefficients b(i), i=1, . . . , R, are transmitted as part of the predetermined ones of the CN parameters, and are used in the RESC filter to spectrally weight the excitation for the synthesis filter.
-
37. Mobile terminal apparatus for generating comfort noise (CN) in a system that uses a discontinuous transmission to a network, comprising:
-
means in said mobile terminal for shaping the spectral content of an excitation using received excitation spectral control (RESC) parameters prior to applying the excitation to a synthesis filter.
Dependent claims: 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48
where b(i) represents filter coefficients, with i=1, . . . ,R.
-
-
45. Apparatus as in claim 43, and further comprising means for determining an excitation gain from the spectrally flattened residual signal.
-
46. Apparatus as in claim 37, wherein said shaping means is comprised of:
-
means for forming an excitation by generating a white noise excitation sequence;
means for scaling the generated white noise sequence to produce a scaled noise sequence; and
means for processing the scaled noise sequence in a RESC filter to produce an excitation having a desired spectral content.
-
-
47. Apparatus as in claim 37, wherein said calculating means is comprised of:
-
means for applying an LPC residual signal from a speech coder inverse filter to a RESC inverse filter HRESC(z) to produce a spectrally controlled residual signal which generally has a flatter spectrum than the LPC residual signal, wherein the RESC inverse filter HRESC(z) has the form of an all-zero filter described by;
where b(i) represents filter coefficients, with i=1, . . . ,R; and
wherein said shaping means is comprised of, means for forming an excitation by generating a white noise excitation sequence;
means for scaling the generated white noise sequence to produce a scaled noise sequence; and
means for processing the scaled noise sequence in a RESC filter to produce an excitation having a desired spectral content;
wherein the RESC filter performs an inverse operation to the RESC inverse filter and is of the form;
-
-
48. Apparatus as in claim 47, wherein RESC parameters rmean(i), i=1, . . . ,R define the filter coefficients b(i), i=1, . . . , R, are transmitted as part of the predetermined ones of the CN parameters, and are used in the RESC filter to spectrally weight the excitation for the synthesis filter.
-
49. A method for producing comfort noise (CN) in a network element that uses a discontinuous transmission, comprising the steps of:
-
receiving excitation spectral control (RESC) parameters; and
shaping the spectral content of an excitation using the received RESC parameters prior to applying the excitation to a synthesis filter.
Dependent claims: 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60
where b(i) represents filter coefficients, with i=1, . . . ,R.
-
-
57. A method as in claim 55, and further comprising a step of determining an excitation gain from the spectrally flattened residual signal.
-
58. A method as in claim 49, wherein the step of shaping includes steps of:
-
forming an excitation by generating a white noise excitation sequence;
scaling the generated white noise sequence to produce a scaled noise sequence; and
processing the scaled noise sequence in a RESC filter to produce an excitation having a desired spectral content.
-
-
59. A method as in claim 49, wherein the step of calculating RESC parameters includes a step of:
-
applying an LPC residual signal from a speech coder inverse filter to a RESC inverse filter HRESC(z) to produce a spectrally controlled residual signal which generally has a flatter spectrum than the LPC residual signal, wherein the RESC inverse filter HRESC(z) has the form of an all-zero filter described by;
where b(i) represents filter coefficients, with i=1, . . . ,R; and
wherein the step of shaping includes steps of, forming an excitation by generating a white noise excitation sequence;
scaling the generated white noise sequence to produce a scaled noise sequence; and
processing the scaled noise sequence in a RESC filter to produce an excitation having a desired spectral content;
wherein the RESC filter performs an inverse operation to the RESC inverse filter and is of the form;
-
-
60. A method as in claim 59, wherein RESC parameters rmean(i), i=1, . . . ,R define the filter coefficients b(i), i=1, . . . , R, are transmitted as part of the predetermined ones of the CN parameters, and are used in the RESC filter to spectrally weight the excitation for the synthesis filter.
-
61. Apparatus for generating comfort noise (CN) in a system having a digital mobile terminal that uses a discontinuous transmission to a network, comprising:
-
means in said network for shaping the spectral content of an excitation using received excitation spectral control (RESC) parameters prior to applying the excitation to a synthesis filter.
Dependent claims: 62, 64, 65, 66, 67, 68, 69, 70, 71, 72
where b(i) represents filter coefficients, with i=1, . . . ,R.
-
-
69. Apparatus as in claim 67, and further comprising means for determining an excitation gain from the spectrally flattened residual signal.
-
70. Apparatus as in claim 61, wherein said shaping means is comprised of:
-
means for forming an excitation by generating a white noise excitation sequence;
means for scaling the generated white noise sequence to produce a scaled noise sequence; and
means for processing the scaled noise sequence in a RESC filter to produce an excitation having a desired spectral content.
-
-
71. Apparatus as in claim 61, wherein said calculating means is comprised of:
-
means for applying an LPC residual signal from a speech coder inverse filter to a RESC inverse filter HRESC(z) to produce a spectrally controlled residual signal which generally has a flatter spectrum than the LPC residual signal, wherein the RESC inverse filter HRESC(z) has the form of an all-zero filter described by;
where b(i) represents filter coefficients, with i=1, . . . ,R; and
wherein said shaping means is comprised of, means for forming an excitation by generating a white noise excitation sequence;
means for scaling the generated white noise sequence to produce a scaled noise sequence; and
means for processing the scaled noise sequence in a RESC filter to produce an excitation having a desired spectral content;
wherein the RESC filter performs an inverse operation to the RESC inverse filter and is of the form;
-
-
72. Apparatus as in claim 71, wherein RESC parameters rmean(i), i=1, . . . ,R define the filter coefficients b(i), i=1, . . . , R, are transmitted as part of the predetermined ones of the CN parameters, and are used in the RESC filter to spectrally weight the excitation for the synthesis filter.
-
63. Apparatus as in claim 63, wherein the speech coder implements a LPC analysis technique, and wherein the analysis is of lower degree than the LPC analysis technique.
-
73. A method for producing comfort noise (CN) in a digital network element that uses a discontinuous transmission, comprising the steps of:
-
in response to a speech pause, calculating random excitation spectral control (RESC) parameters; and
transmitting the RESC parameters to a receiver together with predetermined ones of CN parameters.
Dependent claims: 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84
where b(i) represents filter coefficients, with i=1, . . . ,R.
-
-
81. A method as in claim 79, and further comprising a step of determining an excitation gain from the spectrally flattened residual signal.
-
82. A method as in claim 73, wherein the step of shaping includes steps of:
-
forming an excitation by generating a white noise excitation sequence;
scaling the generated white noise sequence to produce a scaled noise sequence; and
processing the scaled noise sequence in a RESC filter to produce an excitation having a desired spectral content.
-
-
83. A method as in claim 73, wherein the step of calculating RESC parameters includes a step of:
-
applying an LPC residual signal from a speech coder inverse filter to a RESC inverse filter HRESC(z) to produce a spectrally controlled residual signal which generally has a flatter spectrum than the LPC residual signal, wherein the RESC inverse filter HRESC(z) has the form of an all-zero filter described by;
where b(i) represents filter coefficients, with i=1, . . . ,R; and
wherein the step of shaping includes steps of, forming an excitation by generating a white noise excitation sequence;
scaling the generated white noise sequence to produce a scaled noise sequence; and
processing the scaled noise sequence in a RESC filter to produce an excitation having a desired spectral content;
wherein the RESC filter performs an inverse operation to the RESC inverse filter and is of the form;
-
-
84. A method as in claim 83, wherein RESC parameters rmean(i), i=1, . . . ,R define the filter coefficients b(i), i=1, . . . , R, are transmitted as part of the CN parameters, and are used in the RESC filter to spectrally weight the excitation for the synthesis filter.
-
85. Apparatus for generating comfort noise (CN) in a system having a network element that uses a discontinuous transmission, comprising:
-
means in said network element that is responsive to a speech pause for calculating random excitation spectral control (RESC) parameters and for transmitting the RESC parameters together with predetermined ones of CN parameters to a receiver in said network.
Dependent claims: 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96
where b(i) represents filter coefficients, with i=1, . . . ,R.
-
-
93. Apparatus as in claim 91, and further comprising means for determining an excitation gain from the spectrally flattened residual signal.
-
94. Apparatus as in claim 85, wherein said shaping means is comprised of:
-
means for forming an excitation by generating a white noise excitation sequence;
means for scaling the generated white noise sequence to produce a scaled noise sequence; and
means for processing the scaled noise sequence in a RESC filter to produce an excitation having a desired spectral content.
-
-
95. Apparatus as in claim 85, wherein said calculating means is comprised of:
-
means for applying an LPC residual signal from a speech coder inverse filter to a RESC inverse filter HRESC(z) to produce a spectrally controlled residual signal which generally has a flatter spectrum than the LPC residual signal, wherein the RESC inverse filter HRESC(z) has the form of an all-zero filter described by;
where b(i) represents filter coefficients, with i=1, . . . ,R; and
wherein said shaping means is comprised of, means for forming an excitation by generating a white noise excitation sequence;
means for scaling the generated white noise sequence to produce a scaled noise sequence; and
means for processing the scaled noise sequence in a RESC filter to produce an excitation having a desired spectral content;
wherein the RESC filter performs an inverse operation to the RESC inverse filter and is of the form;
-
-
96. Apparatus as in claim 95, wherein RESC parameters rmean(i), i=1, . . . ,R define the filter coefficients b(i), i=1, . . . , R, are transmitted as part of the predetermined ones of the CN parameters, and are used in the RESC filter to spectrally weight the excitation for the synthesis filter.
-
97. A method for generating comfort noise (CN) in an element of a mobile communications network that uses a discontinuous transmission, comprising the steps of:
-
in response to a speech pause, buffering a set of speech coding parameters;
within an averaging period, replacing speech coding parameters of the set that are not representative of background noise with speech coding parameters that are representative of the background noise; and
averaging the set of speech coding parameters.
Dependent claims: 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116
measuring distances of the speech coding parameters from one another between individual frames within the averaging period;
identifying those speech coding parameters which have the largest distances to the other parameters within the averaging period; and
if the distances exceed a predetermined threshold, replacing an identified speech coding parameter with a speech coding parameter which has a smallest measured distance to the other speech coding parameters within the averaging period.
-
-
99. A method as in claim 97, wherein the step of replacing includes the steps of:
-
measuring distances of the speech coding parameters from one another between individual frames within the averaging period;
identifying those speech coding parameters which have the largest distances to the other parameters within the averaging period; and
if the distances exceed a predetermined threshold, replacing an identified speech coding parameter with a speech coding parameter having a median value.
-
-
100. A method as in claim 97, wherein the step of averaging includes a step of computing an average excitation gain gmean and average short term spectral coefficients fmean(i).
-
101. A method as in claim 97, wherein the step of replacing includes steps of:
-
forming a set of buffered excitation gain values over the averaging period;
ordering the set of buffered excitation gain values; and
performing a median replacement operation in which those L excitation gain values differing the most from the median value, where the difference exceeds a predetermined threshold value, are replaced by the median value of the set.
-
-
102. A method as in claim 101, wherein a length N of the averaging period is an odd number, and wherein the median of the ordered set is the ((N+1)/2)th element of the set.
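The median replacement of excitation gains described in claims 101 and 102, followed by the averaging of claim 100, can be illustrated with a short Python sketch. The function names and the way candidates are ranked (by absolute difference from the median) are assumptions chosen for the example; the claims state only that the L values differing most from the median are replaced when the difference exceeds a threshold.

```python
def median_replace_gains(gains, max_replacements, threshold):
    """Replace up to `max_replacements` gains that differ most from the median,
    but only where the difference exceeds `threshold`."""
    n = len(gains)
    if n % 2 == 0:
        raise ValueError("the averaging period length N is assumed to be odd")
    ordered = sorted(gains)
    median = ordered[(n + 1) // 2 - 1]  # the ((N+1)/2)th element, counting from 1
    # Frame indices ranked by descending absolute difference from the median.
    ranked = sorted(range(n), key=lambda i: abs(gains[i] - median), reverse=True)
    out = list(gains)
    for i in ranked[:max_replacements]:
        if abs(gains[i] - median) > threshold:
            out[i] = median
    return out


def mean_gain(gains):
    """Average excitation gain over the averaging period."""
    return sum(gains) / len(gains)
```

For example, with gains [1.0, 1.2, 0.9, 5.0, 1.1, 1.0, 1.1], at most two replacements, and a threshold of 1.0, only the outlier 5.0 is replaced by the median 1.1, so it no longer skews the subsequent average.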
-
103. A method as in claim 97, and further comprising a step of:
-
forming a set of buffered Line Spectral Pair (LSP) coefficients f(k), k=1, . . . ,M over the averaging period; and
determining a spectral distance of the LSP coefficients fi(k) of the ith frame in the averaging period, to the LSP coefficients fj(k) of the jth frame in the averaging period.
-
-
104. A method as in claim 103, where the step of determining the spectral distance is accomplished in accordance with the expression
-
$\Delta R_{ij} = \sum_{k=1}^{M} \left( f_i(k) - f_j(k) \right)^2$, where M is the degree of the LPC model, and fi(k) is the kth LSP parameter of the ith frame in the averaging period.
-
-
105. A method as in claim 103, and further comprising a step of determining the spectral distance ΔSi of the LSP coefficients fi(k) of frame i to the LSP coefficients of all the other frames j=1, . . . ,N, i≠j, within the averaging period of length N.
-
106. A method as in claim 105, wherein the step of determining the spectral distance is accomplished by determining the sum of the spectral distances ΔRij in accordance with
$\Delta S_i = \sum_{j=1,\, j \neq i}^{N} \Delta R_{ij}$
for all i=1, . . . ,N.
-
107. A method as in claim 105, and further comprising steps of:
-
after the spectral distances ΔSi have been found for each of the LSP vectors fi within the averaging period, ordering the spectral distances according to their values;
considering a vector fi with the smallest distance ΔSi within the averaging period i=1, 2, . . . ,N to be a median vector fmed of the averaging period having a distance denoted as ΔSmed; and
performing a median replacement of P (0≦P≦N-1) LSP vectors fi with the median vector fmed.
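The LSP steps of claims 103 to 107 can be sketched in Python as below: pairwise squared distances between frames, a per-frame total distance to all other frames, selection of the frame with the smallest total as the median vector, and replacement of P vectors. The function names are hypothetical, and selecting the P vectors with the largest total distances is an assumption; claim 107 itself fixes only the count P.

```python
def spectral_distance(fi, fj):
    """Sum over k of (f_i(k) - f_j(k))^2 for two LSP vectors of degree M."""
    return sum((a - b) ** 2 for a, b in zip(fi, fj))


def total_distances(frames):
    """Distance of each frame i to all other frames j != i in the averaging period."""
    n = len(frames)
    return [
        sum(spectral_distance(frames[i], frames[j]) for j in range(n) if j != i)
        for i in range(n)
    ]


def median_replace_lsp(frames, p):
    """Replace the p frames with the largest total distance by the median vector."""
    n = len(frames)
    if not 0 <= p <= n - 1:
        raise ValueError("P must satisfy 0 <= P <= N-1")
    s = total_distances(frames)
    med = min(range(n), key=lambda i: s[i])  # smallest total distance
    worst = sorted(range(n), key=lambda i: s[i], reverse=True)[:p]
    out = [list(f) for f in frames]
    for i in worst:
        out[i] = list(frames[med])
    return out, med
```

With the toy frames [[1, 2], [1, 2], [9, 9], [1, 3]] the total distances are [114, 114, 326, 102], so the fourth frame is taken as the median vector and, for P=1, the third frame is replaced by it.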
-
-
108. A method as in claim 107, wherein the steps of identifying and replacing are performed independently for excitation gain values g and Line Spectral Pair (LSP) vectors fi.
-
109. A method as in claim 98, wherein the steps of identifying and replacing are combined together for excitation gain values g and Line Spectral Pair (LSP) vectors fi.
-
110. A method as in claim 109, comprising steps of:
in response to determining that the speech coding parameters in an individual frame are to be replaced by median values of the parameters, replacing both the excitation gain value g and the LSP vector fi of that frame by the respective parameters of the frame containing the median parameters.
-
111. A method as in claim 110, and comprising initial steps of:
-
determining a distance ΔTij between the parameters of the ith frame and the jth frame of the averaging period in accordance with the expression
where M is the degree of the LPC model, fi(k) is the kth LSP parameter of the ith frame of the averaging period, and gi is the excitation gain parameter of the ith frame.
-
-
112. A method as in claim 111, and further comprising a step of:
-
determining a distance ΔSi of the speech coding parameters of frame i, for all i=1, . . . ,N, to the speech coding parameters of all the other frames j=1, . . . ,N, i≠j within the averaging period of length N, in accordance with
for all i=1, . . . ,N.
-
-
113. A method as in claim 112, wherein after the distances ΔSi have been determined for each of the frames within the averaging period, further comprising steps of:
ordering the distances according to their values; and
considering a frame with the smallest distance ΔSi within the averaging period i=1,2, . . . ,N as a median frame, having distance ΔSmed, of the averaging period, the median frame having speech coder parameters gmed and fmed.
-
114. A method as in claim 113, and comprising a step of performing median replacement on the speech coding parameter frames within the averaging period i=1,2, . . . ,N wherein parameters gi and fi of L (0≦L≦N-1) frames are replaced by the parameters gmed and fmed of the median frame.
-
115. A method as in claim 113, wherein differences between each individual distance and the median distance are determined by dividing an individual distance by the median distance in accordance with ΔSi/ΔSmed.
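The combined replacement of claims 110 and 113 to 115, followed by the averaging of claim 100, might be sketched in Python as follows. The per-frame distances ΔSi are taken as an input because the expression that defines them is not reproduced in this listing. The function name, the ratio threshold parameter, and the choice to skip replacement when the median distance is zero are assumptions made for illustration.

```python
def replace_and_average(gains, lsp_frames, dist, max_replacements, ratio_threshold):
    """Replace gain and LSP vector of outlier frames by those of the median frame,
    then average. `dist[i]` is the precomputed distance of frame i to the others."""
    n = len(gains)
    med = min(range(n), key=lambda i: dist[i])  # median frame: smallest distance
    s_med = dist[med]
    # Candidate frames: the L frames with the largest distances.
    candidates = sorted(range(n), key=lambda i: dist[i], reverse=True)[:max_replacements]
    g = list(gains)
    f = [list(v) for v in lsp_frames]
    for i in candidates:
        # Difference measured as the ratio of the frame distance to the median distance.
        if s_med > 0 and dist[i] / s_med > ratio_threshold:
            g[i] = gains[med]
            f[i] = list(lsp_frames[med])
    g_mean = sum(g) / n
    m = len(f[0])
    f_mean = [sum(frame[k] for frame in f) / n for k in range(m)]
    return g_mean, f_mean
```

Replacing both parameters of a frame together keeps the gain and the spectral envelope taken from the same frame, which is the point of combining the two steps.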
-
116. A method as in claim 107, wherein differences between each individual distance and the median distance are determined by dividing an individual distance by the median distance in accordance with ΔSi/ΔSmed.
-
117. Apparatus for generating comfort noise (CN) in an element of a mobile communication network that uses a discontinuous transmission to a network, comprising:
data processing means in network element that is responsive to a speech pause for buffering a set of speech coding parameters and, within an averaging period, for replacing speech coding parameters of the set that are not representative of background noise with speech coding parameters that are representative of the background noise, said data processing means averaging the set of speech coding parameters and transmitting the averaged set of speech coding parameters to the mobile terminal.
Dependent claims: 118, 119, 120, 121
Specification