Method and apparatus for efficient multiband celp wideband speech and music coding and decoding

US 5,778,335 A
Filed: 02/26/1996
Issued: 07/07/1998
Est. Priority Date: 02/26/1996
Status: Expired due to Term

First Claim

Patent Images

1. A method for encoding and decoding sound, comprising the steps of:

analyzing an input waveform and computing the linear prediction coefficients for a portion of the input waveform;

classifying the input waveform as one of a group comprising speech and music;

generating a first plurality of codebooks, each having an output, where each codebook is associated with a frequency band;

generating at least one first adaptive codebook having an output;

coupling the output of the first plurality of codebooks and the output of the at least one first adaptive codebook together to create a composite waveform;

synthesis filtering the composite waveform;

perceptually weighting the input waveform;

perceptually weighting the synthesis filtered composite waveform;

differencing the perceptually weighted synthesis filtered composite waveform from the perceptually weighted input waveform to form an output waveform;

searching through the first plurality of codebooks and the adaptive codebook to minimize the errors in the output waveform; and

decoding the output waveform using a second plurality of codebooks and at least one second adaptive codebook.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A method of digitally compressing speech and music by use of multiple band ("multiband") fixed excitations stored in codebooks. The use of multiband fixed excitations, along with a coupling method for interconnecting the excitation codebooks and adaptive codebooks and for generating the composite excitation signal, improve the long-term and short-term prediction, and the use of voice-music classification allows the coding structure to be adapted to the statistical character of the audio signal.

Citations

7 Claims

1. A method for encoding and decoding sound, comprising the steps of:
- analyzing an input waveform and computing the linear prediction coefficients for a portion of the input waveform;
  
  classifying the input waveform as one of a group comprising speech and music;
  
  generating a first plurality of codebooks, each having an output, where each codebook is associated with a frequency band;
  
  generating at least one first adaptive codebook having an output;
  
  coupling the output of the first plurality of codebooks and the output of the at least one first adaptive codebook together to create a composite waveform;
  
  synthesis filtering the composite waveform;
  
  perceptually weighting the input waveform;
  
  perceptually weighting the synthesis filtered composite waveform;
  
  differencing the perceptually weighted synthesis filtered composite waveform from the perceptually weighted input waveform to form an output waveform;
  
  searching through the first plurality of codebooks and the adaptive codebook to minimize the errors in the output waveform; and
  
  decoding the output waveform using a second plurality of codebooks and at least one second adaptive codebook.
- View Dependent Claims (2, 3)
- - 2. The method of claim 1, further comprising the step of masking an output quantization noise from the output of the first plurality of codebooks.
  - 3. The method of claim 1, further comprising the step of post-filtering the decoded output waveform.

4. A system to encode and decode sound, comprising:
- an analyzer to compute linear prediction coefficients for a portion of an input waveform;
  
  a classifier for classifying the input waveform as one of a group comprising speech, speech and music, and music;
  
  a first plurality of codebooks, each having an output, where each codebook is associated with a frequency band;
  
  at least one first adaptive codebook having an output;
  
  a first coupler to couple the output of the first plurality of codebooks and the output of the at least one first adaptive codebook together to create a composite waveform;
  
  a synthesis filter for filtering the composite waveform;
  
  a first perceptual weighting filter for filtering the input waveform;
  
  a second perceptual weighting filter for filtering the synthesis filtered composite waveform;
  
  a signal combiner for differencing the perceptually weighted synthesis filtered composite waveform from the perceptually weighted input waveform to form an output waveform;
  
  selector means for searching through the first plurality of codebooks and the adaptive codebook to minimize the errors in the output waveform; and
  
  decoder means for decoding the output waveform, the decoder comprising a second plurality of codebooks and at least one second adaptive codebook.
- View Dependent Claims (5, 6)
- - 5. The system of claim 4, wherein the system further comprises masking means for masking a quantization noise from the output of the first plurality of codebooks.
  - 6. The system of claim 4, further comprising of post-filtering means for filtering the decoded output waveform.

7. A method for encoding an audio signal, comprising the steps of:
- generating a multiple band excitation codebook bank and at least one adaptive codebook;
  
  coupling the multiple band fixed excitation codebook bank and the at least one adaptive codebook for generating a composite excitation signal,providing a long-term and a short-term prediction signal;
  
  classifying as voice or music the composite excitation signal based on the long-term prediction signal and the short-term prediction signal; and
  
  adapting the classified composite excitation signal to a statistical character of the audio signal.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Regents of the University of California (University of California)
Original Assignee
Regents of the University of California (University of California)
Inventors
Ubale, Anil Wamanrao, Gersho, Allen
Primary Examiner(s)
Hudspeth, David R.
Assistant Examiner(s)
Smits, Talivaldis Ivars

Application Number

US08/605,509
Time in Patent Office

862 Days
Field of Search

395/2.28, 395/2.32, 395/2.71, 395/2.73, 395/2.91, 704/219, 704/223, 704/262, 704/264, 704/500
US Class Current

704/219
CPC Class Codes

G10H 1/125   using a digital filter

G10H 2210/046   for differentiation between...

G10H 2240/251   Mobile telephone transmissi...

G10H 2250/581   Codebook-based waveform com...

G10H 2250/585   CELP [code excited linear p...

G10H 7/00   Instruments in which the to...

G10L 19/002   Dynamic bit allocation for ...

G10L 19/0204   using subband decomposition

G10L 19/10   the excitation function bei...

G10L 19/12   the excitation function bei...

G10L 19/18   Vocoders using multiple modes

G10L 2019/0005   Multi-stage vector quantisa...

G10L 2025/783   based on threshold decision

Method and apparatus for efficient multiband celp wideband speech and music coding and decoding

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

Citations

7 Claims

Specification

Solutions

Use Cases

Quick Links

Method and apparatus for efficient multiband celp wideband speech and music coding and decoding

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

7 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links