Speech/audio encoding apparatus and method thereof
First Claim
1. A speech/audio encoding device comprising:
- a receiver that receives a time-domain speech/audio input signal;
a memory; and
a processor thattransforms the speech/audio input signal into a frequency domain;
quantizes energy envelopes which represent an energy level for a frequency spectrum of the speech/audio input signal;
groups quantized energy envelopes into a plurality of groups based on similarity of frequencies, such that quantized energy envelopes having frequencies of significance are positioned adjacent to one another, and quantized energy envelopes having frequencies of non-significance are positioned adjacent to one another;
determines a perceptually significant group and a perceptually non-significant group, the perceptually significant group including one or more significant bands, each perceptually significant group including a local-peak frequency, and the perceptually non-significant group being a group other than the perceptually significant group;
allocates bits to a plurality of subbands corresponding to the grouped quantized energy envelopes; and
encodes a spectrum included in a subband using the bits allocated to the subbands in a subband-by-subband basis,wherein more bits are allocated to subbands corresponding to the perceptually significant group than the perceptually non-significant group.
0 Assignments
0 Petitions
Accused Products
Abstract
A speech/audio encoding device for selectively allocating bits for higher precision encoding. The speech/audio encoding device receives a time-domain speech/audio input signal, transforms the speech/audio input signal into a frequency domain, and quantizes an energy envelope corresponding to an energy level for a frequency spectrum of the speech/audio input signal. The speech/audio encoding device further groups quantized energy envelopes into a plurality of groups, determines a perceptual significant group including one or more significant bands and a local-peak frequency, and allocates bits to a plurality of subbands corresponding to the grouped quantized energy envelopes, in which each of the subbands is obtained by splitting the frequency spectrum of the speech/audio input signal. The speech/audio encoding device encodes the frequency spectrum using the bits allocated to the subbands.
54 Citations
6 Claims
-
1. A speech/audio encoding device comprising:
-
a receiver that receives a time-domain speech/audio input signal; a memory; and a processor that transforms the speech/audio input signal into a frequency domain; quantizes energy envelopes which represent an energy level for a frequency spectrum of the speech/audio input signal; groups quantized energy envelopes into a plurality of groups based on similarity of frequencies, such that quantized energy envelopes having frequencies of significance are positioned adjacent to one another, and quantized energy envelopes having frequencies of non-significance are positioned adjacent to one another; determines a perceptually significant group and a perceptually non-significant group, the perceptually significant group including one or more significant bands, each perceptually significant group including a local-peak frequency, and the perceptually non-significant group being a group other than the perceptually significant group; allocates bits to a plurality of subbands corresponding to the grouped quantized energy envelopes; and encodes a spectrum included in a subband using the bits allocated to the subbands in a subband-by-subband basis, wherein more bits are allocated to subbands corresponding to the perceptually significant group than the perceptually non-significant group. - View Dependent Claims (2, 3)
-
-
4. A speech/audio encoding method comprising:
-
receiving, by a receiver, a time-domain speech/audio input signal; transforming, by a processor, the speech/audio input signal into a frequency domain; quantizing, by the processor, energy envelopes which represent an energy level for a frequency spectrum of the speech/audio input signal; grouping, by the processor, quantized energy envelopes into a plurality of groups based on similarity of frequencies, such that quantized energy envelopes having frequencies of significance are positioned adjacent to one another, and quantized energy envelopes having frequencies of non-significance are positioned adjacent to one another; determining, by the processor, a perceptually significant group and a perceptually non-significant group, the perceptually significant group including one or more significant bands, each perceptually significant group including a local-peak frequency, and the perceptually non-significant group being a group other than the perceptually significant group; allocating, by the processor, bits to a plurality of subbands corresponding to the grouped quantized energy envelopes; and encoding, by the processor, a spectrum included in a subband using the bits allocated to the subbands in a subband-by-subband basis wherein more bits are allocated to subbands corresponding to the perceptually significant group than the perceptually non-significant group. - View Dependent Claims (5, 6)
-
Specification