High-band signal modeling
First Claim
1. A method of reducing a transmission bandwidth of a bit stream, the method comprising:
- filtering, at a speech encoder, an audio signal into a group of low-frequency sub-bands within a low-band frequency range and a first group of high-frequency sub-bands within a high-band frequency range;
generating a first residual signal of a first high-frequency sub-band in the first group of high-frequency sub-bands;
generating a harmonically extended signal based on the group of low-frequency sub-bands and a non-linear processing function;
generating a second group of high-frequency sub-bands based, at least in part, on the harmonically extended signal, wherein the second group of high-frequency sub-bands corresponds to the first group of high-frequency sub-bands;
determining, at a dedicated parameter estimator, a first adjustment parameter based on a comparison of an energy level associated with the first residual signal to an energy level of a first high-frequency sub-band in the second group of high-frequency sub-bands;
determining a second adjustment parameter for a second high-frequency sub-band in the second group of high-frequency sub-bands based on a metric of a second high-frequency sub-band in the first group of high-frequency sub-bands; and
transmitting the first adjustment parameter and the second adjustment parameter to a speech decoder as part of the bit stream, the first adjustment parameter and the second adjustment parameter usable by the speech decoder to reconstruct the first group of high-frequency sub-bands, wherein the transmission bandwidth of the bit stream is reduced compared to transmission of an encoded version of the first group of high-frequency sub-bands.
1 Assignment
0 Petitions
Accused Products
Abstract
A method includes filtering, at a speech encoder, an audio signal into a first group of sub-bands within a first frequency range and a second group of sub-bands within a second frequency range. The method also includes generating a harmonically extended signal based on the first group of sub-bands. The method further includes generating a third group of sub-bands based, at least in part, on the harmonically extended signal. The third group of sub-bands corresponds to the second group of sub-bands. The method also includes determining a first adjustment parameter for a first sub-band in the third group of sub-bands or a second adjustment parameter for a second sub-band in the third group of sub-bands. The first adjustment parameter is based on a metric of a first sub-band in the second group of sub-bands, and the second adjustment parameter is based on a metric of a second sub-band in the second group of sub-bands.
39 Citations
35 Claims
-
1. A method of reducing a transmission bandwidth of a bit stream, the method comprising:
-
filtering, at a speech encoder, an audio signal into a group of low-frequency sub-bands within a low-band frequency range and a first group of high-frequency sub-bands within a high-band frequency range; generating a first residual signal of a first high-frequency sub-band in the first group of high-frequency sub-bands; generating a harmonically extended signal based on the group of low-frequency sub-bands and a non-linear processing function; generating a second group of high-frequency sub-bands based, at least in part, on the harmonically extended signal, wherein the second group of high-frequency sub-bands corresponds to the first group of high-frequency sub-bands; determining, at a dedicated parameter estimator, a first adjustment parameter based on a comparison of an energy level associated with the first residual signal to an energy level of a first high-frequency sub-band in the second group of high-frequency sub-bands; determining a second adjustment parameter for a second high-frequency sub-band in the second group of high-frequency sub-bands based on a metric of a second high-frequency sub-band in the first group of high-frequency sub-bands; and transmitting the first adjustment parameter and the second adjustment parameter to a speech decoder as part of the bit stream, the first adjustment parameter and the second adjustment parameter usable by the speech decoder to reconstruct the first group of high-frequency sub-bands, wherein the transmission bandwidth of the bit stream is reduced compared to transmission of an encoded version of the first group of high-frequency sub-bands. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. An apparatus for reducing a transmission bandwidth of a bit stream, the apparatus comprising:
-
a first filter configured to filter an audio signal into a group of low-frequency sub-bands within a low-band frequency range and a first group of high-frequency sub-bands within a high-band frequency range; a parameter estimator configured to generate a first residual signal of a first high-frequency sub-band in the first group of high-frequency sub-bands; a non-linear transformation generator configured to generate a harmonically extended signal based on the group of low-frequency sub-bands and a non-linear processing function; a second filter configured to generate a second group of high-frequency sub-bands based, at least in part, on the harmonically extended signal, wherein the second group of high-frequency sub-bands corresponds to the first group of high-frequency sub-bands; dedicated parameter estimators configured to; determine a first adjustment parameter based on a comparison of an energy level associated with the first residual signal to an energy level of a first high-frequency sub-band in the second group of high-frequency sub-bands; and determine a second adjustment parameter for a second high-frequency sub-band in the second group of high-frequency sub-bands based on a metric of a second high-frequency sub-band in the first group of high-frequency sub-bands; and a transmitter to transmit the first adjustment parameter and the second adjustment parameter to a speech decoder as part of the bit stream, the first adjustment parameter and the second adjustment parameter usable by the speech decoder to reconstruct the first group of high-frequency sub-bands, wherein the transmission bandwidth of the bit stream is reduced compared to transmission of an encoded version of the first group of high-frequency sub-bands. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A non-transitory computer-readable medium comprising instructions for reducing a transmission bandwidth of a bit stream, wherein the instructions, when executed by a processor at a speech encoder, cause the processor to:
-
filter an audio signal into a group of low-frequency sub-bands within a low-band frequency range and a first group of high-frequency sub-bands within a high-band frequency range; generate a first residual signal of a first sub-band in the first group of high-frequency sub-bands; generate a harmonically extended signal based on the group of low-frequency sub-bands and a non-linear processing function; generate a second group of high-frequency sub-bands based, at least in part, on the harmonically extended signal, wherein the second group of high-frequency sub-bands corresponds to the first group of high-frequency sub-bands; determine, at a dedicated parameter estimator, a first adjustment parameter based on a comparison of an energy level associated with the first residual signal to an energy level of a first high-frequency sub-band in the second group of high-frequency sub-bands; determine a second adjustment parameter for a second high-frequency sub-band in the second group of high-frequency sub-bands based on a metric of a second high-frequency sub-band in the first group of high-frequency sub-bands; and initiate transmission of the first adjustment parameter and the second adjustment parameter to a speech decoder as part of the bit stream, wherein the first adjustment parameter and the second adjustment parameter are usable by the speech decoder to reconstruct the first group of high-frequency sub-bands, and wherein the transmission bandwidth of the bit stream is reduced compared to transmission of an encoded version of the first group of high-frequency sub-bands. - View Dependent Claims (20, 21, 22, 23)
-
-
24. An apparatus for reducing a transmission bandwidth of a bit stream, the apparatus comprising:
-
means for filtering an audio signal into a group of low-frequency sub-bands within a low-band frequency range and a first group of high-frequency sub-bands within a high-band frequency range; means for generating a first residual signal of a first high-frequency sub-band in the first group of high-frequency sub-bands; means for generating a harmonically extended signal based on the group of low-frequency sub-bands and a non-linear processing function; means for generating a second group of high-frequency sub-bands based, at least in part, on the harmonically extended signal, wherein the second group of high-frequency sub-bands corresponds to the first group of high-frequency sub-bands; means for determining a first adjustment parameter based on a comparison of an energy level associated with the first residual signal to an energy level of a first high-frequency sub-band in the second group of high-frequency sub-bands; means for determining a second adjustment parameter for a second high-frequency sub-band in the second group of high-frequency sub-bands based on a metric of a second high-frequency sub-band in the first group of high-frequency sub-bands; and means for transmitting the first adjustment parameter and the second adjustment parameter to a speech decoder as part of the bit stream, the first adjustment parameter and the second adjustment parameter usable by the speech decoder to reconstruct the first group of high-frequency sub-bands, wherein the transmission bandwidth of the bit stream is reduced compared to transmission of an encoded version of the first group of high-frequency sub-bands. - View Dependent Claims (25, 26, 27, 28)
-
-
29. A method comprising:
-
generating, at a speech decoder, a harmonically extended signal based on a low-band excitation signal, wherein the low-band excitation signal is generated by a linear prediction based decoder based on parameters received from a speech encoder; generating a group of high-band excitation sub-bands based, at least in part, on the harmonically extended signal; adjusting, at a dedicated parameter adjuster, the group of high-band excitation sub-bands based on adjustment parameters received from the speech encoder, wherein a transmission bandwidth of a bit stream is reduced compared to transmission of an encoded version of high-frequency sub-bands of an encoder-side audio signal, and wherein the adjustment parameters comprise; a first adjustment parameter based on a comparison of an energy level of a first high-frequency sub-band in a group of high-frequency sub-bands to an energy level associated with a residual signal of a first high-frequency sub-band in a second group of high-frequency; and a second adjustment parameter for a second high-frequency sub-band in the group of high-frequency sub-bands; and reconstructing the high-frequency sub-bands of the encoder-side audio signal based on the adjusted group of high-band excitation sub-bands. - View Dependent Claims (30)
-
-
31. An apparatus comprising:
-
a non-linear transformation generator configured to generate a harmonically extended signal based on a low-band excitation signal, wherein the low-band excitation signal is generated by a linear prediction based decoder based on parameters received from a speech encoder; a second filter configured to generate a group of high-band excitation sub-bands based, at least in part, on the harmonically extended signal; dedicated parameter adjusters configured to adjust the group of high-band excitation sub-bands based on adjustment parameters received from the speech encoder, wherein a transmission bandwidth of a bit stream is reduced compared to transmission of an encoded version of high-frequency sub-bands of an encoder-side audio signal, and wherein the adjustment parameters comprise; a first adjustment parameter based on a comparison of an energy level of a first high-frequency sub-band in a group of high-frequency sub-bands to an energy level associated with a residual signal of a first high-frequency sub-band in a second group of high-frequency; and a second adjustment parameter for a second high-frequency sub-band in the group of high-frequency sub-bands; and a reconstruction unit configured to reconstruct the high-frequency sub-bands of the encoder-side audio signal based on the adjusted group of high-band excitation sub-bands. - View Dependent Claims (32)
-
-
33. An apparatus comprising:
-
means for generating a harmonically extended signal based on a low-band excitation signal, wherein the low-band excitation signal is generated by a linear prediction based decoder based on parameters received from a speech encoder; means for generating a group of high-band excitation sub-bands based, at least in part, on the harmonically extended signal; means for adjusting the group of high-band excitation sub-bands based on adjustment parameters received from the speech encoder, wherein a transmission bandwidth of a bit stream is reduced compared to transmission of an encoded version of high-frequency sub-bands of an encoder-side audio signal, and wherein the adjustment parameters comprise; a first adjustment parameter based on a comparison of an energy level of a first high-frequency sub-band in a group of high-frequency sub-bands to an energy level associated with a residual signal of a first high-frequency sub-band in a second group of high-frequency; and a second adjustment parameter for a second high-frequency sub-band in the group of high-frequency sub-bands; and means for reconstructing the high-frequency sub-bands of the encoder-side audio signal based on the adjusted group of high-band excitation sub-bands. - View Dependent Claims (34)
-
-
35. A non-transitory computer-readable medium comprising instructions that, when executed by a processor at a speech decoder, cause the processor to:
-
generate a harmonically extended signal based on a low-band excitation signal, wherein the low-band excitation signal is generated by a linear prediction based decoder based on parameters received from a speech encoder; generate a group of high-band excitation sub-bands based, at least in part, on the harmonically extended signal; and adjust, at a dedicated parameter adjuster, the group of high-band excitation sub-bands based on adjustment parameters received from the speech encoder, wherein a transmission bandwidth of a bit stream is reduced compared to transmission of an encoded version of high-frequency sub-bands of an encoder-side audio signal, and wherein the adjustment parameters comprise; a first adjustment parameter based on a comparison of an energy level of a first high-frequency sub-band in a group of high-frequency sub-bands to an energy level associated with a residual signal of a first high-frequency sub-band in a second group of high-frequency; and a second adjustment parameter for a second high-frequency sub-band in the group of high-frequency sub-bands; and reconstruct the high-frequency sub-bands of the encoder-side audio signal based on the adjusted group of high-band excitation sub-bands.
-
Specification