Audio and speech processing with optimal bit-allocation for constant bit rate applications
First Claim
Patent Images
1. A method of audio or speech processing, comprising:
- generating, by an apparatus, a plurality of frames, each of the frames comprising a plurality of transform coefficients;
allocating bits, by the apparatus, to the transform coefficients in each of the frames such that at least two of the transform coefficients in the same frame have different bit allocations and the total number of the bits allocated to the transform coefficients in at least two of the frames is equal, wherein the allocation of the bits for each of the frames is based on a selected one of a plurality of predefined bit allocation vectors in a dictionary, the selection of the selected predefined bit allocation vector being based on respective amplitudes of the transform coefficients, wherein the allocation comprises selecting one of the bit allocation vectors from the dictionary for each of the frames, wherein each of the bit allocation vectors is identified by an index; and
transmitting each of the frames with the index for the bit allocation vector selected for that frame, wherein the index for each of the frames is transmitted within that frame.
1 Assignment
0 Petitions
Accused Products
Abstract
Methods and apparatus for audio and speech processing including generating a plurality of frames, each of the frames comprising a plurality of transform coefficients, and allocating bits to the transform coefficients in each of the frames such that at least two of the transform coefficients in the same frame have different bit allocations and the total number of the bits allocated to the transform coefficients in at least two of the frames is equal.
-
Citations
35 Claims
-
1. A method of audio or speech processing, comprising:
-
generating, by an apparatus, a plurality of frames, each of the frames comprising a plurality of transform coefficients; allocating bits, by the apparatus, to the transform coefficients in each of the frames such that at least two of the transform coefficients in the same frame have different bit allocations and the total number of the bits allocated to the transform coefficients in at least two of the frames is equal, wherein the allocation of the bits for each of the frames is based on a selected one of a plurality of predefined bit allocation vectors in a dictionary, the selection of the selected predefined bit allocation vector being based on respective amplitudes of the transform coefficients, wherein the allocation comprises selecting one of the bit allocation vectors from the dictionary for each of the frames, wherein each of the bit allocation vectors is identified by an index; and transmitting each of the frames with the index for the bit allocation vector selected for that frame, wherein the index for each of the frames is transmitted within that frame. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. An apparatus for audio or speech processing, comprising:
-
a processing system configured to; generate a plurality of frames, each of the frames comprising a plurality of transform coefficients; and allocate bits to the transform coefficients in each of the frames such that at least two of the transform coefficients in the same frame have different bit allocations and the total number of the bits allocated to the transform coefficients in at least two of the frames is equal, wherein the processing system further comprises a dictionary having a plurality of predefined bit allocation vectors, wherein the allocation of the bits for each of the frames is based on a selected one of the predefined bit allocation vectors, the selection of the selected predefined bit allocation vector being based on respective amplitudes of the transform coefficients; wherein the processing system is further configured to allocate bits by selecting one of the bit allocation vectors from the dictionary for each of the frames, and wherein each of the bit allocation vectors is identified by an index; and a transmitter configured to transmit each of the frames with the index for the bit allocation vector selected for that frame, wherein the transmitter is configured to transmit the index for each of the frames within that frame. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. An apparatus for audio or speech processing, comprising:
-
means for generating a plurality of frames, each of the frames comprising a plurality of transform coefficients; means for allocating bits to the transform coefficients in each of the frames such that at least two of the transform coefficients in the same frame have different bit allocations and the total number of the bits allocated to the transform coefficients in at least two of the frames is equal, wherein the allocation of the bits for each of the frames is based on a selected one of a plurality of predefined bit allocation vectors in a dictionary, the selection of the selected predefined bit allocation vector being based on respective amplitudes of the transform coefficients, wherein the means for allocating bits comprises means for selecting one of the bit allocation vectors from the dictionary for each of the frames, and wherein each of the bit allocation vectors is identified by an index; and means for transmitting each of the frames with the index for the bit allocation vector selected for that frame, wherein the means for transmitting comprises means for transmitting the index for each of the frames within that frame. - View Dependent Claims (21, 22, 23, 24, 25, 26, 27, 28)
-
-
29. A computer-program product for processing audio or speech, comprising:
computer-readable storage device encoded with codes executable by a processor to; generate a plurality of frames, each of the frames comprising a plurality of transform coefficients; and allocate bits to the transform coefficients in each of the frames such that at least two of the transform coefficients in the same frame have different bit allocations and the total number of the bits allocated to the transform coefficients in at least two of the frames is equal, wherein the allocation of the bits for each of the frames is based on a selected one of a plurality of predefined bit allocation vectors in a dictionary, the selection of the selected predefined bit allocation vector being based on respective amplitudes of the transform coefficients, wherein the allocation comprises selecting one of the bit allocation vectors from the dictionary for each of the frames, wherein each of the bit allocation vectors is identified by an index; and transmit each of the frames with the index for the bit allocation vector selected for that frame, wherein the index for each of the frames is transmitted within that frame.
-
30. A headset, comprising:
-
a transducer; a processing system configured to; generate a plurality of frames from audio or speech output from the transducer, each of the frames comprising a plurality of transform coefficients; and allocate bits to the transform coefficients in each of the frames such that at least two of the transform coefficients in the same frame have different bit allocations and the total number of the bits allocated to the transform coefficients in at least two of the frames is equal, wherein the allocation of the bits for each of the frames is based on a selected one of a plurality of predefined bit allocation vectors in a dictionary, the selection of the selected predefined bit allocation vector being based on respective amplitudes of the transform coefficients, wherein the processing system is further configured to allocate bits by selecting one of the bit allocation vectors from the dictionary for each of the frames, and wherein each of the bit allocation vectors is identified by an index; and a transmitter configured to transmit each of the frames with the index for the bit allocation vector selected for that frame, wherein the transmitter is configured to transmit the index for each of the frames within that frame.
-
-
31. A watch comprising:
-
a user interface; a processing system configured to; generate a plurality of frames from audio or speech output from the user interface, each of the frames comprising a plurality of transform coefficients; and allocate bits to the transform coefficients in each of the frames such that at least two of the transform coefficients in the same frame have different bit allocations and the total number of the bits allocated to the transform coefficients in at least two of the frames is equal, wherein the allocation of the bits for each of the frames is based on a selected one of a plurality of predefined bit allocation vectors in a dictionary, the selection of the selected predefined bit allocation vector being based on respective amplitudes of the transform coefficients, wherein the processing system is further configured to allocate bits by selecting one of the bit allocation vectors from the dictionary for each of the frames, and wherein each of the bit allocation vectors is identified by an index; and a transmitter configured to transmit each of the frames with the index for the bit allocation vector selected for that frame, wherein the transmitter is configured to transmit the index for each of the frames within that frame.
-
-
32. A sensing apparatus, comprising:
-
a sensor; a processing system configured to; generate a plurality of frames from audio or speech output from the sensor, each of the frames comprising a plurality of transform coefficients; and allocate bits to the transform coefficients in each of the frames such that at least two of the transform coefficients in the same frame have different bit allocations and the total number of the bits allocated to the transform coefficients in at least two of the frames is equal, wherein the allocation of the bits for each of the frames is based on a selected one of a plurality of predefined bit allocation vectors in a dictionary, the selection of the selected predefined bit allocation vector being based on respective amplitudes of the transform coefficients, wherein the processing system is further configured to allocate bits by selecting one of the bit allocation vectors from the dictionary for each of the frames, and wherein each of the bit allocation vectors is identified by an index; and a transmitter configured to transmit each of the frames with the index for the bit allocation vector selected for that frame, wherein the transmitter is configured to transmit the index for each of the frames within that frame.
-
-
33. A method of audio or speech processing, comprising:
-
generating, by an apparatus, a plurality of frames, each of the frames comprising a plurality of transform coefficients; allocating bits, by the apparatus, to the transform coefficients in each of the frames such that at least two of the transform coefficients in the same frame have different bit allocations and the total number of the bits allocated to the transform coefficients in at least two of the frames is equal, wherein the allocation of the bits for each of the frames is based on a selected one of a plurality of predefined bit allocation vectors in a dictionary, wherein the allocation comprises selecting one of the bit allocation vectors from the dictionary for each of the frames, wherein each of the bit allocation vectors is identified by an index; and transmitting each of the frames with the index for the bit allocation vector selected for that frame, and wherein the index for each of the frames is transmitted within that frame.
-
-
34. An apparatus for audio or speech processing, comprising:
-
a processing system configured to; generate a plurality of frames, each of the frames comprising a plurality of transform coefficients; and allocate bits to the transform coefficients in each of the frames such that at least two of the transform coefficients in the same frame have different bit allocations and the total number of the bits allocated to the transform coefficients in at least two of the frames is equal, wherein the processing system further comprises a dictionary having a plurality of predefined bit allocation vectors, and wherein the allocation of the bits for each of the frames is based on a selected one of the predefined bit allocation vectors, wherein the processing system is further configured to allocate bits by selecting one of the bit allocation vectors from the dictionary for each of the frames, and wherein each of the bit allocation vectors is identified by an index; and a transmitter configured to transmit each of the frames with the index for the bit allocation vector selected for that frame, wherein the transmitter is configured to transmit the index for each of the frames within that frame.
-
-
35. An apparatus for audio or speech processing, comprising:
-
means for generating a plurality of frames, each of the frames comprising a plurality of transform coefficients; means for allocating bits to the transform coefficients in each of the frames such that at least two of the transform coefficients in the same frame have different bit allocations and the total number of the bits allocated to the transform coefficients in at least two of the frames is equal, wherein the allocation of the bits for each of the frames is based on a selected one of a plurality of predefined bit allocation vectors in a dictionary, wherein the means for allocating bits comprises means for selecting one of the bit allocation vectors from the dictionary for each of the frames, and wherein each of the bit allocation vectors is identified by an index; and means for transmitting each of the frames with the index for the bit allocation vector selected for that frame, wherein the means for transmitting comprises means for transmitting the index for each of the frames within that frame.
-
Specification