METHOD AND APPARATUS TO ENCODE AND DECODE AN AUDIO/SPEECH SIGNAL

US 20100010807A1
Filed: 07/14/2009
Published: 01/14/2010
Est. Priority Date: 07/14/2008
Status: Active Grant

First Claim

Patent Images

1. An apparatus to encode an audio/speech signal, the apparatus comprising:

a signal transforming unit to transform an inputted audio signal or speech signal into at least one of a high frequency resolution signal and a high temporal resolution signal;

a psychoacoustic modeling unit to control the signal transforming unit;

a time domain encoding unit to encode the signal, transformed by the signal transforming unit, based on a speech modeling; and

a quantizing unit to quantize the signal outputted from at least one of the signal transforming unit and the time domain encoding unit.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A method and apparatus to encode and decode an audio/speech signal is provided. An inputted audio signal or speech signal may be transformed into at least one of a high frequency resolution signal and a high temporal resolution signal. The signal may be encoded by determining an appropriate resolution, the encoded signal may be decoded, and thus the audio signal, the speech signal, and a mixed signal of the audio signal and the speech signal may be processed.

Citations

21 Claims

1. An apparatus to encode an audio/speech signal, the apparatus comprising:
- a signal transforming unit to transform an inputted audio signal or speech signal into at least one of a high frequency resolution signal and a high temporal resolution signal;
  
  a psychoacoustic modeling unit to control the signal transforming unit;
  
  a time domain encoding unit to encode the signal, transformed by the signal transforming unit, based on a speech modeling; and
  
  a quantizing unit to quantize the signal outputted from at least one of the signal transforming unit and the time domain encoding unit.
- View Dependent Claims (2)
- - 2. The apparatus of claim 1, wherein the quantizing unit includes a Code Excitation Linear Prediction (CELP) to model a signal where correlation information is removed.

3. An apparatus to encode an audio/speech signal, the apparatus comprising:
- a parametric stereo processing unit to process stereo information of an inputted audio signal or speech signal;
  
  a high frequency signal processing unit to process a high frequency signal of the inputted audio signal or speech signal;
  
  a signal transforming unit to transform the inputted audio signal or speech signal into at least one of a high frequency resolution signal and a high temporal resolution signal;
  
  a psychoacoustic modeling unit to control the signal transforming unit;
  
  a time domain encoding unit to encode the signal, transformed by the signal transforming unit, based on a speech modeling; and
  
  a quantizing unit to quantize the signal outputted from at least one of the signal transforming unit and the time domain encoding unit.
- View Dependent Claims (4, 5, 6, 7, 8, 9)
- - 4. The apparatus of claim 3, wherein the time domain encoding unit includes a CELP to model a signal where correlation information is removed.
  - 5. The apparatus of claim 3, wherein the quantizing unit is a spectrum quantizing unit, and further comprises:
    - a switching unit to select any one of the outputted signals from the spectrum quantizing unit and the time domain encoding unit depending on whether the transformed audio signal or speech signal is the high frequency resolution signal or the high temporal resolution signal.
  - 6. The apparatus of claim 3, further comprising:
    - a downsampling unit to downsample the audio signal or speech signal.
  - 7. The apparatus of claim 3, wherein the signal transforming unit includes at least one of a Frequency Varying Modulated Lapped Transform (FV-MLT) and a Modified Discrete Cosine Transform (MDCT).
  - 8. The apparatus of claim 3, wherein the psychoacoustic modeling unit provides the quantizing unit with information about a noise during quantization.
  - 9. The apparatus of claim 3, wherein the time domain encoding unit further comprises:
    - a predicting unit to apply the speech modeling to the signal transformed by the signal transforming unit, and to remove correlation information.

10. An apparatus to decode audio/speech signal, the apparatus comprising:
- a resolution decision unit to determine whether a current frame signal is a high frequency resolution signal or a high temporal resolution signal, based on information about time domain encoding or frequency domain encoding, the information being included in a bitstream;
  
  a dequantizing unit to dequantize the bitstream when the resolution decision unit determines the signal is the high frequency resolution signal;
  
  a time domain decoding unit to decode additional information for inverse linear prediction from the bitstream, and to restore the high temporal resolution signal using the additional information; and
  
  an inverse signal transforming unit to inverse-transform at least one of an output signal from the time domain decoding unit and an output signal from the dequantizing unit into an audio signal or speech signal of a time domain.
- View Dependent Claims (11)
- - 11. The apparatus of claim 10, wherein the apparatus further comprises at least one of:
    - a high frequency signal decoding unit to process a high frequency signal of the inverse-transformed signal, anda parametric stereo processing unit to process stereo information of the inverse-transformed signal.

12. An apparatus to encoding an audio/speech signal, the apparatus comprising:
- a signal transforming unit to transform an inputted audio signal or speech signal into at least one of a high frequency resolution signal and a high temporal resolution signal;
  
  a psychoacoustic modeling unit to control the signal transforming unit;
  
  a temporal noise shaping unit to shape at least one of the transformed high frequency resolution signal and the transformed high temporal resolution signal;
  
  a high rate stereo unit to encode stereo information of the transformed signal; and
  
  a quantizing unit to quantize the signal outputted from at least one of the temporal noise shaping unit and the high rate stereo unit.
- View Dependent Claims (13)
- - 13. The apparatus of claim 12, further comprising:
    - a high frequency signal processing unit to process a high frequency signal of the audio signal or the speech signal.

14. An apparatus of decoding an audio/speech signal, the apparatus comprising:
- a dequantizing unit to dequantize a bitstream;
  
  a high rate stereo/decoder to decode the dequantized signal;
  
  a temporal noise shaper/decoder to process the signal decoded by the high rate stereo/decoder; and
  
  an inverse signal transforming unit to inverse-transform the processed signal into an audio signal or speech signal of a time domain,wherein the bitstream is generated by a transformation of the inputted audio signal or speech signal into at least one of a high frequency resolution signal and a high temporal resolution signal.
- View Dependent Claims (15)
- - 15. The apparatus of claim 14, further comprising:
    - a high frequency signal processing unit to process a high frequency signal of the inverse-transformed signal.

16. An apparatus to encode an audio/speech signal, the apparatus comprising:
- a signal transforming unit to transform an inputted audio signal or speech signal into at least one of a high frequency resolution signal and a high temporal resolution signal;
  
  a psychoacoustic modeling unit to control the signal transforming unit;
  
  a low rate determination unit to determine whether the transformed signal has a low rate;
  
  a time domain encoding unit to encode the transformed signal based on a speech modeling when the transformed signal has the low rate;
  
  a temporal noise shaping unit to shape the transformed signal;
  
  a high rate stereo unit to encode stereo information of the shaped signal; and
  
  a quantizing unit to quantize at least one of an output signal from the high rate stereo unit and an output signal from the time domain encoding unit.
- View Dependent Claims (17)
- - 17. The apparatus of claim 16, further comprising:
    - a parametric stereo processing determination unit to determine whether to operate a parametric stereo processing unit based on predetermined information;
      
      the parametric stereo processing unit to process stereo information of an inputted high frequency signal when it is determined that the parametric stereo processing unit is to be operated;
      
      a high frequency signal processing determination unit to determine whether to operate a high frequency signal processing unit based on other predetermined information; and
      
      the high frequency signal processing unit to process an inputted high frequency signal when it is determined that the high frequency signal processing unit is to be operated.

18. A method of encoding an audio/speech signal, the method comprising:
- transforming an inputted audio signal or speech signal into at least one of a high frequency resolution signal and a high temporal resolution signal, and controlling the transformed signal based on a psychoacoustic modeling;
  
  time-encoding the transformed signal based at least in part on a speech modeling; and
  
  quantizing at least one of the transformed signal and the time-encoded signal.

19. A method of decoding an audio/speech signal, the method comprising:
- determining whether a current frame signal is a high frequency resolution signal or a high temporal resolution signal, based at least in part on information included in the bitstream about time domain encoding or frequency domain encoding;
  
  dequantizing the bitstream when the signal is determined as the high frequency resolution signal;
  
  decoding additional information for inverse linear prediction from the bitstream, and restoring the high temporal resolution signal using the additional information; and
  
  inverse-transforming at least one of the restored signal and the dequantized signal into an audio signal or speech signal of a time domain.

20. A method of encoding audio and speech signals, the method comprising:
- receiving at least one audio signal and at least one speech signal;
  
  transforming the at least one of the received audio signal and the received speech signal into at least one of a frequency resolution signal and a temporal resolution signal;
  
  encoding the transformed signal; and
  
  quantizing at least one of the transformed signal and the encoded signal.

21. A method of decoding audio and speech signals, the method comprising:
- determining whether a current frame signal is a frequency resolution signal or a temporal resolution signal with information in the bitstream of a received signal about time domain encoding or frequency domain encoding;
  
  dequantizing the bitstream when the received signal is the frequency resolution signal;
  
  inverse linear predicting from the information in the bitstream and restoring the temporal resolution signal using the information; and
  
  inverse-transforming at least one of the dequantized signal and the restored temporal resolution signal into an audio signal or speech signal of a time domain.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Samsung Electronics Co. Ltd.
Original Assignee
Samsung Electronics Co. Ltd.
Inventors
Oh, Eun Mi, Kim, Mi Young, Kim, Jung Hoe, Sung, Ho Sang, Choo, Ki Hyun

Granted Patent

US 8,532,982 B2
Time in Patent Office

Days
Field of Search
US Class Current

704/200.100
CPC Class Codes

G10L 19/008   Multichannel audio signal c...

G10L 19/0204   using subband decomposition

G10L 19/0212   using orthogonal transforma...

G10L 19/03   Spectral prediction for pre...

G10L 19/04   using predictive techniques

G10L 19/12   the excitation function bei...

G10L 19/167   Audio streaming, i.e. forma...

G10L 19/20   using sound class specific ...

METHOD AND APPARATUS TO ENCODE AND DECODE AN AUDIO/SPEECH SIGNAL

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

Citations

21 Claims

Specification

Solutions

Use Cases

Quick Links

METHOD AND APPARATUS TO ENCODE AND DECODE AN AUDIO/SPEECH SIGNAL

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

21 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links