Sub-sampled excitation waveform codebooks
Abstract
Methods and apparatus are presented for reducing the number of bits needed to represent an excitation waveform. An acoustic signal in an analysis frame is analyzed to determine whether it is a band-limited signal. A sub-sampled sparse codebook is used to generate the excitation waveform if the acoustic signal is a band-limited signal. The sub-sampled sparse codebook is generated by decimating permissible pulse locations from the codebook track in accordance with the frequency characteristic of the acoustic signal.
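The bit saving described in the abstract can be illustrated with simple arithmetic. The following is a minimal sketch, not the patent's own procedure: it assumes a track whose positions alternate between even and odd, so that keeping one parity halves the number of permissible positions, and it counts only the bits that index pulse positions (sign and gain bits are ignored). The function names and the 8-position, 4-pulse example are hypothetical.

```python
def position_bits(num_positions: int) -> int:
    """Bits needed to index one pulse among num_positions permissible positions."""
    if num_positions < 1:
        raise ValueError("a track needs at least one position")
    # Integer form of ceil(log2(num_positions)); avoids floating-point rounding.
    return (num_positions - 1).bit_length()


def bits_saved(positions_per_track: int, pulses: int) -> int:
    """Position bits saved when each pulse is indexed in a half-size track.

    Assumption: the sub-sampled track keeps exactly half of the positions.
    """
    full = position_bits(positions_per_track)
    sub = position_bits(positions_per_track // 2)
    return pulses * (full - sub)


# Hypothetical layout: 8 positions per track, 4 pulses per frame.
saving = bits_saved(positions_per_track=8, pulses=4)
```

Under these assumptions each pulse index shrinks from 3 bits to 2, so the four pulses together need 4 fewer bits.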
36 Claims
1. A method for forming an excitation waveform in a speech coder, the method comprising:
- determining whether an acoustic signal in an analysis frame is a band-limited signal;
- if the acoustic signal is a band-limited signal, then using a sub-sampled sparse codebook to generate the excitation waveform, wherein the sub-sampled sparse codebook comprises either only even track positions or only odd track positions from the sparse codebook; and
- if the acoustic signal is not a band-limited signal, then using a sparse codebook to generate the excitation waveform, wherein the sparse codebook comprises a set of predetermined possible positions and the sub-sampled sparse codebook comprises a subset of the predetermined positions, such that the excitation waveform is generated through placement of pulses within the predetermined positions or the subset; and
- wherein using the sub-sampled sparse codebook to generate the excitation waveform comprises generating an initial excitation waveform, determining whether the initial excitation waveform comprises mostly odd track positions or mostly even track positions, and decimating the initial excitation waveform to generate the excitation waveform.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11
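The flow recited in claim 1 can be sketched in code. This is one possible reading, offered as an illustration only: the excitation is represented as a hypothetical mapping from pulse position to signed amplitude, "decimating" is taken to mean dropping the pulses that fall on the minority parity, and a tie between odd and even counts falls back to even. None of these choices is stated in the claim.

```python
from typing import Dict

# Hypothetical representation: pulse position -> signed pulse amplitude.
Excitation = Dict[int, float]


def majority_parity(excitation: Excitation) -> int:
    """Return 1 if most pulses sit on odd positions, otherwise 0 (even)."""
    odd = sum(1 for pos in excitation if pos % 2 == 1)
    even = len(excitation) - odd
    return 1 if odd > even else 0  # ties fall back to even (arbitrary choice)


def decimate(excitation: Excitation, parity: int) -> Excitation:
    """Keep only the pulses whose position has the requested parity."""
    return {pos: amp for pos, amp in excitation.items() if pos % 2 == parity}


def form_excitation(initial: Excitation, band_limited: bool) -> Excitation:
    """Use the full sparse codebook result unless the frame is band-limited."""
    if not band_limited:
        return dict(initial)
    return decimate(initial, majority_parity(initial))


# Three of the four pulses are on odd positions, so the odd positions are kept.
initial = {1: 1.0, 7: -1.0, 12: 1.0, 23: 1.0}
narrow = form_excitation(initial, band_limited=True)
wide = form_excitation(initial, band_limited=False)
```

In this sketch the band-limited frame keeps the pulses at positions 1, 7 and 23, while the frame that is not band-limited keeps all four.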
12. Apparatus for forming an excitation waveform, comprising:
- a memory element; and
- a processing element configured to execute a set of instructions stored on the memory element, the set of instructions for:
  - determining whether an acoustic signal in an analysis frame is a band-limited signal;
  - using a sub-sampled sparse codebook to generate the excitation waveform if the acoustic signal is a band-limited signal, wherein the sub-sampled sparse codebook comprises either only even track positions or only odd track positions from a sparse codebook; and
  - using the sparse codebook to generate the excitation waveform if the acoustic signal is not a band-limited signal, wherein the sparse codebook comprises a set of predetermined possible positions and the sub-sampled sparse codebook comprises a subset of the predetermined positions, such that the excitation waveform is generated through placement of pulses within the predetermined positions or the subset; and
  - wherein using the sub-sampled sparse codebook to generate the excitation waveform comprises generating an initial excitation waveform, determining whether the initial excitation waveform comprises mostly odd track positions or mostly even track positions, and decimating the initial excitation waveform to generate the excitation waveform.
Dependent claims: 13, 14, 15, 16
17. An apparatus for forming an excitation waveform, comprising:
- means for determining whether an acoustic signal in an analysis frame is a band-limited signal;
- means for using a sub-sampled sparse codebook to generate the excitation waveform if the acoustic signal is a band-limited signal, wherein the sub-sampled sparse codebook comprises either only even track positions or only odd track positions from a sparse codebook; and
- means for using the sparse codebook to generate the excitation waveform if the acoustic signal is not a band-limited signal, wherein the sparse codebook comprises a set of predetermined possible positions and the sub-sampled sparse codebook comprises a subset of the predetermined positions, such that the excitation waveform is generated through placement of pulses within the predetermined positions or the subset;
- wherein using the sub-sampled sparse codebook to generate the excitation waveform comprises generating an initial excitation waveform, determining whether the initial excitation waveform comprises mostly odd track positions or mostly even track positions, and decimating the initial excitation waveform to generate the excitation waveform.
Dependent claims: 18
19. A method for a signal coder to reduce the number of bits used to represent an excitation waveform, comprising:
- determining a frequency characteristic of an acoustic signal;
- generating a sub-sampled sparse codebook waveform from a sparse codebook if the frequency characteristic indicates that sub-sampling does not impair the perceptual quality of the acoustic signal, wherein the sparse codebook comprises a set of predetermined possible positions and the sub-sampled sparse codebook comprises a subset of the predetermined positions, such that the excitation waveform is generated through placement of pulses within the predetermined positions or the subset, wherein the sub-sampled sparse codebook comprises either only even track positions or only odd track positions from the sparse codebook; and
- using the sub-sampled sparse codebook waveform to represent the excitation waveform rather than a waveform from the sparse codebook;
- wherein using the sub-sampled sparse codebook to represent the excitation waveform comprises generating an initial excitation waveform, determining whether the initial excitation waveform comprises mostly odd track positions or mostly even track positions, and decimating the initial excitation waveform.
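The claims do not say how the frequency characteristic is determined. As a hedged illustration only, the sketch below treats a frame as band-limited when almost all of its spectral energy lies below a cutoff frequency. The 8 kHz sampling rate, 2 kHz cutoff, 5% energy threshold, and both function names are assumptions made for this example, not values taken from the patent.

```python
import cmath
import math
from typing import Sequence


def high_band_energy_ratio(frame: Sequence[float], sample_rate: float,
                           cutoff_hz: float) -> float:
    """Fraction of the frame's spectral energy that lies above cutoff_hz."""
    n = len(frame)
    total = 0.0
    high = 0.0
    # Direct DFT over the non-negative frequency bins (fine for short frames).
    for k in range(n // 2 + 1):
        coeff = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                    for i, x in enumerate(frame))
        energy = abs(coeff) ** 2
        total += energy
        if k * sample_rate / n > cutoff_hz:
            high += energy
    # A silent frame has no high-band energy, so it counts as band-limited.
    return high / total if total > 0.0 else 0.0


def is_band_limited(frame: Sequence[float], sample_rate: float = 8000.0,
                    cutoff_hz: float = 2000.0, max_ratio: float = 0.05) -> bool:
    """Hypothetical decision rule: little energy above the cutoff."""
    return high_band_energy_ratio(frame, sample_rate, cutoff_hz) <= max_ratio


# Two 160-sample test frames at 8 kHz: a 500 Hz tone and a 3500 Hz tone.
low_tone = [math.sin(2 * math.pi * 500 * i / 8000) for i in range(160)]
high_tone = [math.sin(2 * math.pi * 3500 * i / 8000) for i in range(160)]
low_is_limited = is_band_limited(low_tone)
high_is_limited = is_band_limited(high_tone)
```

With these assumed settings the 500 Hz tone is classified as band-limited and the 3500 Hz tone is not. A real coder would more likely reuse spectral information it already computes instead of running a separate transform.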
20. Apparatus for reducing the number of bits used to represent an excitation waveform, comprising:
- a memory element; and
- a processing element configured to execute a set of instructions stored on the memory element, the set of instructions for:
  - determining a frequency characteristic of an acoustic signal;
  - generating a sub-sampled sparse codebook waveform from a sparse codebook if the frequency characteristic indicates that sub-sampling does not impair the perceptual quality of the acoustic signal, wherein the sparse codebook comprises a set of predetermined possible positions and the sub-sampled sparse codebook comprises a subset of the predetermined positions, such that the excitation waveform is generated through placement of pulses within the predetermined positions or the subset, wherein the sub-sampled sparse codebook comprises either only even track positions or only odd track positions from the sparse codebook; and
  - using the sub-sampled sparse codebook waveform to represent the excitation waveform rather than a waveform from the sparse codebook;
  - wherein using the sub-sampled sparse codebook to represent the excitation waveform comprises generating an initial excitation waveform, determining whether the initial excitation waveform comprises mostly odd track positions or mostly even track positions, and decimating the initial excitation waveform.
21. An apparatus for reducing the number of bits used to represent an excitation waveform, comprising:
- means for determining a frequency characteristic of an acoustic signal;
- means for generating a sub-sampled sparse codebook waveform from a sparse codebook if the frequency characteristic indicates that sub-sampling does not impair the perceptual quality of the acoustic signal, wherein the sparse codebook comprises a set of predetermined possible positions and the sub-sampled sparse codebook comprises a subset of the predetermined positions, such that the excitation waveform is generated through placement of pulses within the predetermined positions or the subset, wherein the sub-sampled sparse codebook comprises either only even track positions or only odd track positions from the sparse codebook; and
- means for using the sub-sampled sparse codebook waveform to represent the excitation waveform rather than a waveform from the sparse codebook;
- wherein using the sub-sampled sparse codebook waveform to represent the excitation waveform comprises generating an initial excitation waveform, determining whether the initial excitation waveform comprises mostly odd track positions or mostly even track positions, and decimating the initial excitation waveform.
Dependent claims: 22, 23, 24, 25
26. A method for execution by a suitably programmed processor to generate a sub-sampled sparse codebook from a sparse codebook, wherein the sparse codebook comprises pulses at a set of permissible pulse locations, the method comprising:
- analyzing a frequency characteristic of an acoustic signal;
- determining whether an initial excitation waveform corresponding to the acoustic signal comprises mostly odd track positions or mostly even track positions; and
- decimating a subset of permissible pulse locations from the set of permissible pulse locations of the sparse codebook in accordance with the frequency characteristic of the acoustic signal to generate the sub-sampled sparse codebook, wherein the sub-sampled sparse codebook comprises either only even track positions or only odd track positions from the sparse codebook.
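Generating the sub-sampled codebook, as opposed to a single waveform, can be sketched as a filter over each track's permissible pulse locations. The track table below is hypothetical, "even" and "odd" are read as the parity of the position value (an assumption, since the claim does not define the terms), and the parity argument stands in for the result of the mostly-odd or mostly-even determination.

```python
from typing import Dict, List

# Hypothetical representation: track name -> permissible pulse locations.
Codebook = Dict[str, List[int]]


def subsample_codebook(codebook: Codebook, band_limited: bool,
                       parity: int) -> Codebook:
    """Return the codebook to search for this frame.

    parity is 0 to keep even locations, 1 to keep odd locations. The full
    codebook is returned unchanged when the frame is not band-limited.
    """
    if parity not in (0, 1):
        raise ValueError("parity must be 0 (even) or 1 (odd)")
    if not band_limited:
        return {name: list(track) for name, track in codebook.items()}
    return {name: [pos for pos in track if pos % 2 == parity]
            for name, track in codebook.items()}


# Illustrative two-track table, not taken from the patent.
full = {
    "T0": [0, 5, 10, 15, 20, 25, 30, 35],
    "T1": [1, 6, 11, 16, 21, 26, 31, 36],
}
odd_only = subsample_codebook(full, band_limited=True, parity=1)
unchanged = subsample_codebook(full, band_limited=False, parity=1)
```

Each track in this example drops from eight permissible locations to four, which is where the reduction in position bits comes from.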
27. Apparatus for generating a sub-sampled sparse codebook from a sparse codebook, wherein the sparse codebook comprises pulses at a set of permissible pulse locations, the apparatus comprising:
- a memory element; and
- a processing element configured to execute a set of instructions stored on the memory element, the set of instructions for:
  - analyzing a frequency characteristic of an acoustic signal;
  - determining whether an initial excitation waveform corresponding to the acoustic signal comprises mostly odd track positions or mostly even track positions; and
  - decimating a subset of permissible pulse locations from the set of permissible pulse locations of the sparse codebook in accordance with the frequency characteristic of the acoustic signal to generate the sub-sampled sparse codebook, wherein the sub-sampled sparse codebook comprises either only even track positions or only odd track positions from the sparse codebook.
28. Apparatus for generating a sub-sampled sparse codebook from a sparse codebook, wherein the sparse codebook comprises pulses at a set of permissible pulse locations, the apparatus comprising:
- means for analyzing a frequency characteristic of an acoustic signal;
- means for determining whether an initial excitation waveform corresponding to the acoustic signal comprises mostly odd track positions or mostly even track positions; and
- means for decimating a subset of permissible pulse locations from the set of permissible pulse locations of the sparse codebook in accordance with the frequency characteristic of the acoustic signal to generate the sub-sampled sparse codebook, wherein the sub-sampled sparse codebook comprises either only even track positions or only odd track positions from the sparse codebook.
Dependent claims: 29, 30, 31, 32
33. A speech coder, comprising:
- a linear predictive coding (LPC) unit configured to determine LPC coefficients of an acoustic signal;
- a frequency analysis unit configured to determine whether the acoustic signal is band-limited;
- a quantizer unit configured to receive the LPC coefficients and to quantize the LPC coefficients; and
- an excitation parameter generator configured to receive a determination from the frequency analysis unit regarding whether the acoustic signal is band-limited and to implement a sub-sampled sparse codebook, the sparse codebook comprising a set of predetermined possible positions and the sub-sampled sparse codebook comprising a subset of the predetermined positions, wherein the sub-sampled sparse codebook comprises either only even track positions or only odd track positions from the sparse codebook, and wherein implementing the sub-sampled sparse codebook comprises determining whether an initial excitation waveform comprises mostly odd track positions or mostly even track positions.
Dependent claims: 34, 35
36. A computer-program product comprising a computer-readable medium having instructions thereon, the instructions comprising:
- code for determining whether an acoustic signal in an analysis frame is a band-limited signal;
- code for using a sub-sampled sparse codebook to generate an excitation waveform if the acoustic signal is a band-limited signal, wherein the sub-sampled sparse codebook comprises either only even track positions or only odd track positions from a sparse codebook; and
- code for using the sparse codebook to generate the excitation waveform if the acoustic signal is not a band-limited signal, wherein the sparse codebook comprises a set of predetermined possible positions and the sub-sampled sparse codebook comprises a subset of the predetermined positions, such that the excitation waveform is generated through placement of pulses within the predetermined positions or the subset;
- wherein using the sub-sampled sparse codebook to generate the excitation waveform comprises generating an initial excitation waveform, determining whether the initial excitation waveform comprises mostly odd track positions or mostly even track positions, and decimating the initial excitation waveform.
Specification