Audio and speech processing with optimal bit-allocation for constant bit rate applications

US 8,781,822 B2
Filed: 02/02/2010
Issued: 07/15/2014
Est. Priority Date: 12/22/2009
Status: Active Grant

First Claim

Patent Images

1. A method of audio or speech processing, comprising:

generating, by an apparatus, a plurality of frames, each of the frames comprising a plurality of transform coefficients;

allocating bits, by the apparatus, to the transform coefficients in each of the frames such that at least two of the transform coefficients in the same frame have different bit allocations and the total number of the bits allocated to the transform coefficients in at least two of the frames is equal, wherein the allocation of the bits for each of the frames is based on a selected one of a plurality of predefined bit allocation vectors in a dictionary, the selection of the selected predefined bit allocation vector being based on respective amplitudes of the transform coefficients, wherein the allocation comprises selecting one of the bit allocation vectors from the dictionary for each of the frames, wherein each of the bit allocation vectors is identified by an index; and

transmitting each of the frames with the index for the bit allocation vector selected for that frame, wherein the index for each of the frames is transmitted within that frame.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

Methods and apparatus for audio and speech processing including generating a plurality of frames, each of the frames comprising a plurality of transform coefficients, and allocating bits to the transform coefficients in each of the frames such that at least two of the transform coefficients in the same frame have different bit allocations and the total number of the bits allocated to the transform coefficients in at least two of the frames is equal.

Citations

35 Claims

1. A method of audio or speech processing, comprising:
- generating, by an apparatus, a plurality of frames, each of the frames comprising a plurality of transform coefficients;
  
  allocating bits, by the apparatus, to the transform coefficients in each of the frames such that at least two of the transform coefficients in the same frame have different bit allocations and the total number of the bits allocated to the transform coefficients in at least two of the frames is equal, wherein the allocation of the bits for each of the frames is based on a selected one of a plurality of predefined bit allocation vectors in a dictionary, the selection of the selected predefined bit allocation vector being based on respective amplitudes of the transform coefficients, wherein the allocation comprises selecting one of the bit allocation vectors from the dictionary for each of the frames, wherein each of the bit allocation vectors is identified by an index; and
  
  transmitting each of the frames with the index for the bit allocation vector selected for that frame, wherein the index for each of the frames is transmitted within that frame.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
- - 2. The method of claim 1 wherein each of the bit allocation vectors comprises a plurality of elements, each of the elements representing a possible bit allocation for a corresponding one of the transform coefficients in any one of the frames, wherein the sum of the elements of all bit allocation vectors in the dictionary equals a fixed number.
  - 3. The method of claim 1 wherein the allocation comprises quantizing the transform coefficients for each of the frames based on the selected bit allocation vector for that frame.
  - 4. The method of claim 1 wherein the selection comprises computing a metric based on the respective amplitudes of the transform coefficients for that frame, and selecting the bit allocation vector based on the metric.
  - 5. The method of claim 1 wherein the index for each of the frames is transmitted independent of the transmission of that frame.
  - 6. The method of claim 1 wherein the allocation comprises selecting one of the bit allocation vectors from the dictionary for at least two of the frames.
  - 7. The method of claim 6 wherein the selection comprises computing a metric based on respective amplitudes of the transform coefficients for said at least two of the frames, and selecting the bit allocation vector based on the metric.
  - 8. The method of claim 6 wherein the allocation further comprises quantizing the transform coefficients for each of said at least two of the frames based on the selected bit allocation vector.
  - 9. The method of claim 6 further comprising transmitting said at least two of the frames with the index for the bit allocation vector.
  - 10. The method of claim 1, wherein the predefined bit allocation vectors each allocate the same number of bits.

11. An apparatus for audio or speech processing, comprising:
- a processing system configured to;
  
  generate a plurality of frames, each of the frames comprising a plurality of transform coefficients; and
  
  allocate bits to the transform coefficients in each of the frames such that at least two of the transform coefficients in the same frame have different bit allocations and the total number of the bits allocated to the transform coefficients in at least two of the frames is equal, wherein the processing system further comprises a dictionary having a plurality of predefined bit allocation vectors, wherein the allocation of the bits for each of the frames is based on a selected one of the predefined bit allocation vectors, the selection of the selected predefined bit allocation vector being based on respective amplitudes of the transform coefficients;
  
  wherein the processing system is further configured to allocate bits by selecting one of the bit allocation vectors from the dictionary for each of the frames, and wherein each of the bit allocation vectors is identified by an index; and
  
  a transmitter configured to transmit each of the frames with the index for the bit allocation vector selected for that frame, wherein the transmitter is configured to transmit the index for each of the frames within that frame.
- View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19)
- - 12. The apparatus of claim 11 wherein each of the bit allocation vectors comprises a plurality of elements, each of the elements representing a possible bit allocation for a corresponding one of the transform coefficients in any one of the frames, wherein the sum of the elements of all the bit allocation vectors in the dictionary equals a fixed number.
  - 13. The apparatus of claim 11 wherein the processing system is further configured to allocate bits by quantizing the transform coefficients for each of the frames based on the selected bit allocation vector for that frame.
  - 14. The apparatus of claim 11 wherein the processing system is further configured to select one of the bit allocation vectors by computing a metric based on the respective amplitudes of the transform coefficients for that frame, and selecting the bit allocation vector based on the metric.
  - 15. The apparatus of claim 11 wherein the transmitter is configured to transmit the index for each of the frames independent of the transmission of that frame.
  - 16. The apparatus of claim 11 wherein the processing system is further configured to allocate bits by selecting one of the bit allocation vectors from the dictionary for at least two of the frames.
  - 17. The apparatus of claim 16 wherein the processing system is further configured to select the bit allocation vector by computing a metric based on respective amplitudes of the transform coefficients for said at least two of the frames, and selecting the bit allocation vector based on the metric.
  - 18. The apparatus of claim 16 wherein the processing system is further configured to allocate bits by quantizing the transform coefficients for each of said at least two of the frames based on the selected bit allocation vector.
  - 19. The apparatus of claim 16 wherein the transmitter is configured to transmit said at least two of the frames with the index for the bit allocation vector selected for said at least two of the frames.

20. An apparatus for audio or speech processing, comprising:
- means for generating a plurality of frames, each of the frames comprising a plurality of transform coefficients;
  
  means for allocating bits to the transform coefficients in each of the frames such that at least two of the transform coefficients in the same frame have different bit allocations and the total number of the bits allocated to the transform coefficients in at least two of the frames is equal, wherein the allocation of the bits for each of the frames is based on a selected one of a plurality of predefined bit allocation vectors in a dictionary, the selection of the selected predefined bit allocation vector being based on respective amplitudes of the transform coefficients, wherein the means for allocating bits comprises means for selecting one of the bit allocation vectors from the dictionary for each of the frames, and wherein each of the bit allocation vectors is identified by an index; and
  
  means for transmitting each of the frames with the index for the bit allocation vector selected for that frame, wherein the means for transmitting comprises means for transmitting the index for each of the frames within that frame.
- View Dependent Claims (21, 22, 23, 24, 25, 26, 27, 28)
- - 21. The apparatus of claim 20 wherein each of the bit allocation vectors comprises a plurality of elements, each of the elements representing a possible bit allocation for a corresponding one of the transform coefficients in any one of the frames, wherein the sum of the elements of all the bit allocation vectors in the dictionary equals a fixed number.
  - 22. The apparatus of claim 20 wherein the means for allocation comprises means for quantizing the transform coefficients for each of the frames based on the selected bit allocation vector for that frame.
  - 23. The apparatus of claim 20 wherein the means for selecting comprises means for computing a metric based on the respective amplitudes of the transform coefficients for that frame, and means for selecting the bit allocation vector based on the metric.
  - 24. The apparatus of claim 20 wherein the means for transmitting comprises means for transmitting the index for each of the frames independent of the transmission of that frame.
  - 25. The apparatus of claim 20, wherein the means for allocating bits further comprises means for selecting one of the bit allocation vectors from the dictionary for at least two of the frames.
  - 26. The apparatus of claim 25 wherein the means for selecting one of the bit allocation vectors comprises means for computing a metric based on respective amplitudes of the transform coefficients for said at least two of the frames, and means for selecting the bit allocation vector based on the metric.
  - 27. The apparatus of claim 25 wherein the means for allocating bits further comprises means for quantizing the transform coefficients for each of said at least two of the frames based on the selected bit allocation vector.
  - 28. The apparatus of claim 25 further comprising means for transmitting said at least two of the frames with the index for the bit allocation vector selected for said at least two of the frames.

29. A computer-program product for processing audio or speech, comprising:
- computer-readable storage device encoded with codes executable by a processor to;
  
  generate a plurality of frames, each of the frames comprising a plurality of transform coefficients; and
  
  allocate bits to the transform coefficients in each of the frames such that at least two of the transform coefficients in the same frame have different bit allocations and the total number of the bits allocated to the transform coefficients in at least two of the frames is equal, wherein the allocation of the bits for each of the frames is based on a selected one of a plurality of predefined bit allocation vectors in a dictionary, the selection of the selected predefined bit allocation vector being based on respective amplitudes of the transform coefficients, wherein the allocation comprises selecting one of the bit allocation vectors from the dictionary for each of the frames, wherein each of the bit allocation vectors is identified by an index; and
  
  transmit each of the frames with the index for the bit allocation vector selected for that frame, wherein the index for each of the frames is transmitted within that frame.

30. A headset, comprising:
- a transducer;
  
  a processing system configured to;
  
  generate a plurality of frames from audio or speech output from the transducer, each of the frames comprising a plurality of transform coefficients; and
  
  allocate bits to the transform coefficients in each of the frames such that at least two of the transform coefficients in the same frame have different bit allocations and the total number of the bits allocated to the transform coefficients in at least two of the frames is equal, wherein the allocation of the bits for each of the frames is based on a selected one of a plurality of predefined bit allocation vectors in a dictionary, the selection of the selected predefined bit allocation vector being based on respective amplitudes of the transform coefficients, wherein the processing system is further configured to allocate bits by selecting one of the bit allocation vectors from the dictionary for each of the frames, and wherein each of the bit allocation vectors is identified by an index; and
  
  a transmitter configured to transmit each of the frames with the index for the bit allocation vector selected for that frame, wherein the transmitter is configured to transmit the index for each of the frames within that frame.

31. A watch comprising:
- a user interface;
  
  a processing system configured to;
  
  generate a plurality of frames from audio or speech output from the user interface, each of the frames comprising a plurality of transform coefficients; and
  
  allocate bits to the transform coefficients in each of the frames such that at least two of the transform coefficients in the same frame have different bit allocations and the total number of the bits allocated to the transform coefficients in at least two of the frames is equal, wherein the allocation of the bits for each of the frames is based on a selected one of a plurality of predefined bit allocation vectors in a dictionary, the selection of the selected predefined bit allocation vector being based on respective amplitudes of the transform coefficients, wherein the processing system is further configured to allocate bits by selecting one of the bit allocation vectors from the dictionary for each of the frames, and wherein each of the bit allocation vectors is identified by an index; and
  
  a transmitter configured to transmit each of the frames with the index for the bit allocation vector selected for that frame, wherein the transmitter is configured to transmit the index for each of the frames within that frame.

32. A sensing apparatus, comprising:
- a sensor;
  
  a processing system configured to;
  
  generate a plurality of frames from audio or speech output from the sensor, each of the frames comprising a plurality of transform coefficients; and
  
  allocate bits to the transform coefficients in each of the frames such that at least two of the transform coefficients in the same frame have different bit allocations and the total number of the bits allocated to the transform coefficients in at least two of the frames is equal, wherein the allocation of the bits for each of the frames is based on a selected one of a plurality of predefined bit allocation vectors in a dictionary, the selection of the selected predefined bit allocation vector being based on respective amplitudes of the transform coefficients, wherein the processing system is further configured to allocate bits by selecting one of the bit allocation vectors from the dictionary for each of the frames, and wherein each of the bit allocation vectors is identified by an index; and
  
  a transmitter configured to transmit each of the frames with the index for the bit allocation vector selected for that frame, wherein the transmitter is configured to transmit the index for each of the frames within that frame.

33. A method of audio or speech processing, comprising:
- generating, by an apparatus, a plurality of frames, each of the frames comprising a plurality of transform coefficients;
  
  allocating bits, by the apparatus, to the transform coefficients in each of the frames such that at least two of the transform coefficients in the same frame have different bit allocations and the total number of the bits allocated to the transform coefficients in at least two of the frames is equal, wherein the allocation of the bits for each of the frames is based on a selected one of a plurality of predefined bit allocation vectors in a dictionary, wherein the allocation comprises selecting one of the bit allocation vectors from the dictionary for each of the frames, wherein each of the bit allocation vectors is identified by an index; and
  
  transmitting each of the frames with the index for the bit allocation vector selected for that frame, and wherein the index for each of the frames is transmitted within that frame.

34. An apparatus for audio or speech processing, comprising:
- a processing system configured to;
  
  generate a plurality of frames, each of the frames comprising a plurality of transform coefficients; and
  
  allocate bits to the transform coefficients in each of the frames such that at least two of the transform coefficients in the same frame have different bit allocations and the total number of the bits allocated to the transform coefficients in at least two of the frames is equal, wherein the processing system further comprises a dictionary having a plurality of predefined bit allocation vectors, and wherein the allocation of the bits for each of the frames is based on a selected one of the predefined bit allocation vectors, wherein the processing system is further configured to allocate bits by selecting one of the bit allocation vectors from the dictionary for each of the frames, and wherein each of the bit allocation vectors is identified by an index; and
  
  a transmitter configured to transmit each of the frames with the index for the bit allocation vector selected for that frame, wherein the transmitter is configured to transmit the index for each of the frames within that frame.

35. An apparatus for audio or speech processing, comprising:
- means for generating a plurality of frames, each of the frames comprising a plurality of transform coefficients;
  
  means for allocating bits to the transform coefficients in each of the frames such that at least two of the transform coefficients in the same frame have different bit allocations and the total number of the bits allocated to the transform coefficients in at least two of the frames is equal, wherein the allocation of the bits for each of the frames is based on a selected one of a plurality of predefined bit allocation vectors in a dictionary, wherein the means for allocating bits comprises means for selecting one of the bit allocation vectors from the dictionary for each of the frames, and wherein each of the bit allocation vectors is identified by an index; and
  
  means for transmitting each of the frames with the index for the bit allocation vector selected for that frame, wherein the means for transmitting comprises means for transmitting the index for each of the frames within that frame.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Qualcomm, Inc.
Original Assignee
Qualcomm, Inc.
Inventors
Majumdar, Somdeb, Fazeldehkordi, Amin, Garudadri, Harinath
Primary Examiner(s)
PULLIAS, JESSE SCOTT

Application Number

US12/698,534
Publication Number

US 20110153315A1
Time in Patent Office

1,624 Days
Field of Search

704201-230
US Class Current

704/222
CPC Class Codes

G10L 19/002   Dynamic bit allocation for ...

G10L 19/0212   using orthogonal transforma...

G10L 19/035   Scalar quantisation

Audio and speech processing with optimal bit-allocation for constant bit rate applications

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

Citations

35 Claims

Specification

Solutions

Use Cases

Quick Links

Audio and speech processing with optimal bit-allocation for constant bit rate applications

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

35 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links