Coding mode determination method and apparatus, audio encoding method and apparatus, and audio decoding method and apparatus

US 10,468,046 B2
Filed: 07/18/2018
Issued: 11/05/2019
Est. Priority Date: 11/13/2012
Status: Active Grant

First Claim

Patent Images

1. A method of encoding an audio signal, the method comprising:

receiving the audio signal;

obtaining, performed by at least one processor, first parameters of a current frame of the audio signal;

selecting, performed by the at least one processor, a class of the current frame in the audio signal from among a plurality of classes including a music class and a speech class, based on first parameters of the current frame by using a Gaussian mixture model (GMM);

obtaining second parameters including first tonality, second tonality and third tonality;

generating a plurality of conditions, where each of the plurality of conditions is generated based on a combination of the obtained second parameters;

determining, performed by the at least one processor, whether an error occurs in the selected class of the current frame based on whether at least one of the plurality of conditions is met;

when the error occurs in the selected class of the current frame, correcting, performed by the at least one processor, the selected class of the current frame;

encoding, performed by the at least one processor, the current frame, based on either the corrected class or the selected class of the current frame; and

generating a bitstream based on the encoded current frame,wherein the first tonality is obtained from a subband of 0 to 1 kHz, the second tonality is obtained from a subband of 1 to 2 kHz and the third tonality is obtained from a subband of 2 to 4 kHz, andwherein the correcting comprises;

when the error occurs in the selected class of the current frame and the selected class of the current frame is the speech class, correcting the selected class of the current frame from the speech class to the music class; and

when the error occurs in the selected class of the current frame and the selected class of the current frame is the music class, correcting the selected class of the current frame from the music class to the speech class.

View all claims

0 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

Provided are a method and an apparatus for determining an encoding mode for improving the quality of a reconstructed audio signal. A method of determining an encoding mode includes determining one from among a plurality of encoding modes including a first encoding mode and a second encoding mode as an initial encoding mode in correspondence to characteristics of an audio signal, and if there is an error in the determination of the initial encoding mode, generating a modified encoding mode by modifying the initial encoding mode to a third encoding mode.

Citations

6 Claims

1. A method of encoding an audio signal, the method comprising:
- receiving the audio signal;
  
  obtaining, performed by at least one processor, first parameters of a current frame of the audio signal;
  
  selecting, performed by the at least one processor, a class of the current frame in the audio signal from among a plurality of classes including a music class and a speech class, based on first parameters of the current frame by using a Gaussian mixture model (GMM);
  
  obtaining second parameters including first tonality, second tonality and third tonality;
  
  generating a plurality of conditions, where each of the plurality of conditions is generated based on a combination of the obtained second parameters;
  
  determining, performed by the at least one processor, whether an error occurs in the selected class of the current frame based on whether at least one of the plurality of conditions is met;
  
  when the error occurs in the selected class of the current frame, correcting, performed by the at least one processor, the selected class of the current frame;
  
  encoding, performed by the at least one processor, the current frame, based on either the corrected class or the selected class of the current frame; and
  
  generating a bitstream based on the encoded current frame,wherein the first tonality is obtained from a subband of 0 to 1 kHz, the second tonality is obtained from a subband of 1 to 2 kHz and the third tonality is obtained from a subband of 2 to 4 kHz, andwherein the correcting comprises;
  
  when the error occurs in the selected class of the current frame and the selected class of the current frame is the speech class, correcting the selected class of the current frame from the speech class to the music class; and
  
  when the error occurs in the selected class of the current frame and the selected class of the current frame is the music class, correcting the selected class of the current frame from the music class to the speech class.
- View Dependent Claims (2, 3, 4, 5, 6)
- - 2. The method of claim 1, wherein the correcting is performed based on at least two independent states.
  - 3. The method of claim 1, wherein the second parameters further comprise a difference between a voicing parameter and a correlation parameter.
  - 4. The method of claim 1, wherein the determining of whether the error occurs in the selected class of the current frame occurs comprises:
    - determining whether the current frame has speech characteristics when the current frame is classified as the music class; and
      
      determining whether the current frame has music characteristics when the current frame is classified as the speech class.
  - 5. The method of claim 1, wherein the correcting comprises:
    - correcting a classification of the current frame, when the current frame is classified as the music class and has speech characteristics; and
      
      correcting the classification of the current frame, when the current frame is classified as the speech class and has music characteristics.
  - 6. The method of claim 1, wherein the determining is performed further based on a hangover parameter which is used to prevent frequent switching between coding modes.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Samsung Electronics Co. Ltd.
Original Assignee
Samsung Electronics Co. Ltd.
Inventors
Choo, Ki-hyun, Porov, Anton Victorovich, Osipov, Konstantin Sergeevich, Lee, Nam-suk
Primary Examiner(s)
Tzeng, Feng-Tzer

Application Number

US16/039,110
Publication Number

US 20180322887A1
Time in Patent Office

475 Days
Field of Search
US Class Current
CPC Class Codes

G10L 19/00   Speech or audio signals ana...

G10L 19/12   the excitation function bei...

G10L 19/20   using sound class specific ...

G10L 19/22   Mode decision, i.e. based o...

Coding mode determination method and apparatus, audio encoding method and apparatus, and audio decoding method and apparatus

First Claim

0 Assignments

0 Petitions

Accused Products

Abstract

Citations

6 Claims

Specification

Solutions

Use Cases

Quick Links

Coding mode determination method and apparatus, audio encoding method and apparatus, and audio decoding method and apparatus

First Claim

0 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

6 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links