Decoding method, apparatus and recording medium

US 10,643,631 B2
Filed: 10/15/2019
Issued: 05/05/2020
Est. Priority Date: 04/24/2014
Status: Active Grant

Abstract

The present invention reduces encoding distortion in frequency domain encoding compared to conventional techniques. It also obtains, from coefficients equivalent to the linear prediction coefficients resulting from frequency domain encoding, LSP parameters that correspond to the quantized LSP parameters for the preceding frame and are to be used in time domain encoding. Let p be an integer equal to or greater than 1, let a[1], a[2], . . . , a[p] be a linear prediction coefficient sequence obtained by linear prediction analysis of audio signals in a predetermined time segment, and let ω[1], ω[2], . . . , ω[p] be a frequency domain parameter sequence derived from the linear prediction coefficient sequence a[1], a[2], . . . , a[p]. Using the frequency domain parameter sequence ω[1], ω[2], . . . , ω[p] as input, an LSP linear transformation unit (300) determines the value of each converted frequency domain parameter ˜ω[i] (i=1, 2, . . . , p) in a converted frequency domain parameter sequence ˜ω[1], ˜ω[2], . . . , ˜ω[p] through a linear transformation based on the relationship of values between ω[i] and one or more frequency domain parameters adjacent to ω[i].
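To make the idea of a neighbor-based linear transformation concrete, here is a minimal sketch. It is not the formula used in the patent, which this abstract does not give. The function name `convert_sequence`, the coefficient `c`, and the boundary values 0 and π placed below ω[1] and above ω[p] are all assumptions chosen for illustration. Each output value is a fixed linear combination of ω[i] and its two neighbors.

```python
import math

def convert_sequence(omega, c=0.1):
    """Illustrative neighbor-based linear transformation (hypothetical, not the patent's formula)."""
    p = len(omega)
    # Assumed boundary values: 0 below omega[1] and pi above omega[p].
    padded = [0.0] + list(omega) + [math.pi]
    converted = []
    for i in range(1, p + 1):
        # Second difference: proportional to the offset of omega[i]
        # from the midpoint of its two neighbors.
        curvature = 2.0 * padded[i] - padded[i - 1] - padded[i + 1]
        # Linear in omega[i-1], omega[i], omega[i+1]; c is an assumed tuning constant.
        converted.append(padded[i] + c * curvature)
    return converted

# Example input: an ascending sequence of p = 4 values in (0, pi).
omega = [0.3, 0.9, 1.4, 2.5]
print(convert_sequence(omega))
```

With `c = 0` the sequence passes through unchanged, and an evenly spaced input is left (numerically) unchanged for any `c`, because every second difference is zero. A positive `c` pushes each value away from the midpoint of its neighbors and a negative `c` pulls it toward that midpoint.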

38 Citations

5 Claims

1. A decoding method, implemented by a decoding apparatus having processing circuitry, comprising:
- where p is an integer equal to or greater than 1, decoding, by the processing circuitry, input adjusted LSP codes to obtain a decoded adjusted LSP parameter sequence ^θ_γ[1], ^θ_γ[2], . . . , ^θ_γ[p];
- with a frequency domain parameter sequence ω[1], ω[2], . . . , ω[p] being the decoded adjusted LSP parameter sequence ^θ_γ[1], ^θ_γ[2], . . . , ^θ_γ[p], executing, by the processing circuitry, a parameter sequence conversion step of determining a converted frequency domain parameter sequence ˜ω[1], ˜ω[2], . . . , ˜ω[p] using the frequency domain parameter sequence ω[1], ω[2], . . . , ω[p] as input to thereby generate the converted frequency domain parameter sequence ˜ω[1], ˜ω[2], . . . , ˜ω[p] as a decoded approximate LSP parameter sequence ^θ_app[1], ^θ_app[2], . . . , ^θ_app[p];
- generating, by the processing circuitry, a decoded adjusted linear prediction coefficient sequence ^a_γ[1], ^a_γ[2], . . . , ^a_γ[p] by converting the decoded adjusted LSP parameter sequence ^θ_γ[1], ^θ_γ[2], . . . , ^θ_γ[p] into linear prediction coefficients;
- calculating, by the processing circuitry, a decoded smoothed power spectral envelope series ^W_γ[1], ^W_γ[2], . . . , ^W_γ[N] which is a series in frequency domain corresponding to the decoded adjusted linear prediction coefficient sequence ^a_γ[1], ^a_γ[2], . . . , ^a_γ[p];
- generating, by the processing circuitry, decoded sound signals using a frequency domain signal sequence resulting from decoding of input frequency domain signal codes and the decoded smoothed power spectral envelope series ^W_γ[1], ^W_γ[2], . . . , ^W_γ[N];
- decoding, by the processing circuitry, input LSP codes to obtain a decoded LSP parameter sequence ^θ[1], ^θ[2], . . . , ^θ[p]; and
- decoding, by the processing circuitry, input time domain signal codes, and generating decoded sound signals by synthesizing the time domain signal codes using either the decoded LSP parameter sequence for a preceding time segment or the decoded approximate LSP parameter sequence for the preceding time segment, and the decoded LSP parameter sequence for the predetermined time segment,
- wherein the processing circuitry determines a value of each converted frequency domain parameter ˜ω[i] (i=1, 2, . . . , p) in the converted frequency domain parameter sequence ˜ω[1], ˜ω[2], . . . , ˜ω[p] through linear transformation which is based on a relationship of values between ω[i] and one or more frequency domain parameters adjacent to ω[i].
- Dependent claim:
- - 5. A non-transitory computer-readable recording medium having a program recorded thereon for causing a computer to carry out the steps of the decoding method according to claim 1 or 2.

2. A decoding method, implemented by a decoding apparatus having processing circuitry, comprising:
- where p is an integer equal to or greater than 1, decoding, by the processing circuitry, input adjusted LSP codes to obtain a decoded adjusted LSP parameter sequence ^θ_γ[1], ^θ_γ[2], . . . , ^θ_γ[p];
- with a frequency domain parameter sequence ω[1], ω[2], . . . , ω[p] being the decoded adjusted LSP parameter sequence ^θ_γ[1], ^θ_γ[2], . . . , ^θ_γ[p], executing, by the processing circuitry, a parameter sequence conversion step of determining a converted frequency domain parameter sequence ˜ω[1], ˜ω[2], . . . , ˜ω[p] using the frequency domain parameter sequence ω[1], ω[2], . . . , ω[p] as input to thereby generate the converted frequency domain parameter sequence ˜ω[1], ˜ω[2], . . . , ˜ω[p] as a decoded approximate LSP parameter sequence ^θ_app[1], ^θ_app[2], . . . , ^θ_app[p];
- calculating, by the processing circuitry, a decoded smoothed power spectral envelope series ^W_γ[1], ^W_γ[2], . . . , ^W_γ[N] based on the decoded adjusted LSP parameter sequence ^θ_γ[1], ^θ_γ[2], . . . , ^θ_γ[p];
- generating, by the processing circuitry, decoded sound signals using a frequency domain signal sequence resulting from decoding of input frequency domain signal codes and the decoded smoothed power spectral envelope series ^W_γ[1], ^W_γ[2], . . . , ^W_γ[N];
- decoding, by the processing circuitry, input LSP codes to obtain a decoded LSP parameter sequence ^θ[1], ^θ[2], . . . , ^θ[p]; and
- decoding, by the processing circuitry, input time domain signal codes, and generating decoded sound signals by synthesizing the time domain signal codes using either the decoded LSP parameter sequence for a preceding time segment or the decoded approximate LSP parameter sequence for the preceding time segment, and the decoded LSP parameter sequence for the predetermined time segment,
- wherein the processing circuitry determines a value of each converted frequency domain parameter ˜ω[i] (i=1, 2, . . . , p) in the converted frequency domain parameter sequence ˜ω[1], ˜ω[2], . . . , ˜ω[p] through linear transformation which is based on a relationship of values between ω[i] and one or more frequency domain parameters adjacent to ω[i].

3. A decoding apparatus comprising:
- where p is an integer equal to or greater than 1, an adjusted LSP code decoding unit that decodes input adjusted LSP codes to obtain a decoded adjusted LSP parameter sequence ^θ_γ[1], ^θ_γ[2], . . . , ^θ_γ[p];
- a decoded LSP linear transformation unit that, with a frequency domain parameter sequence ω[1], ω[2], . . . , ω[p] being the decoded adjusted LSP parameter sequence ^θ_γ[1], ^θ_γ[2], . . . , ^θ_γ[p], executes a parameter sequence converting unit of determining a converted frequency domain parameter sequence ˜ω[1], ˜ω[2], . . . , ˜ω[p] using the frequency domain parameter sequence ω[1], ω[2], . . . , ω[p] as input to thereby generate the converted frequency domain parameter sequence ˜ω[1], ˜ω[2], . . . , ˜ω[p] as a decoded approximate LSP parameter sequence ^θ_app[1], ^θ_app[2], . . . , ^θ_app[p];
- a decoded linear prediction coefficient sequence generating unit that generates a decoded adjusted linear prediction coefficient sequence ^a_γ[1], ^a_γ[2], . . . , ^a_γ[p] by converting the decoded adjusted LSP parameter sequence ^θ_γ[1], ^θ_γ[2], . . . , ^θ_γ[p] into linear prediction coefficients;
- a decoded smoothed power spectral envelope series calculating unit that calculates a decoded smoothed power spectral envelope series ^W_γ[1], ^W_γ[2], . . . , ^W_γ[N] which is a series in frequency domain corresponding to the decoded adjusted linear prediction coefficient sequence ^a_γ[1], ^a_γ[2], . . . , ^a_γ[p];
- a frequency domain decoding unit that generates decoded sound signals using a frequency domain signal sequence resulting from decoding of input frequency domain signal codes and the decoded smoothed power spectral envelope series ^W_γ[1], ^W_γ[2], . . . , ^W_γ[N];
- an LSP code decoding unit that decodes input LSP codes to obtain a decoded LSP parameter sequence ^θ[1], ^θ[2], . . . , ^θ[p]; and
- a time domain decoding unit that decodes input time domain signal codes, and generates decoded sound signals by synthesizing the time domain signal codes using either the decoded LSP parameter sequence obtained by the LSP code decoding unit for a preceding time segment or the decoded approximate LSP parameter sequence obtained in the decoded LSP linear transformation unit for the preceding time segment, and the decoded LSP parameter sequence for the predetermined time segment,
- wherein the parameter sequence conversion unit determines a value of each converted frequency domain parameter ˜ω[i] (i=1, 2, . . . , p) in the converted frequency domain parameter sequence ˜ω[1], ˜ω[2], . . . , ˜ω[p] through linear transformation which is based on a relationship of values between ω[i] and one or more frequency domain parameters adjacent to ω[i].

4. A decoding apparatus comprising:
- where p is an integer equal to or greater than 1, an adjusted LSP code decoding unit that decodes input adjusted LSP codes to obtain a decoded adjusted LSP parameter sequence ^θ_γ[1], ^θ_γ[2], . . . , ^θ_γ[p];
- a decoded LSP linear transformation unit that, with a frequency domain parameter sequence ω[1], ω[2], . . . , ω[p] being the decoded adjusted LSP parameter sequence ^θ_γ[1], ^θ_γ[2], . . . , ^θ_γ[p], executes a parameter sequence converting unit of determining a converted frequency domain parameter sequence ˜ω[1], ˜ω[2], . . . , ˜ω[p] using the frequency domain parameter sequence ω[1], ω[2], . . . , ω[p] as input to thereby generate the converted frequency domain parameter sequence ˜ω[1], ˜ω[2], . . . , ˜ω[p] as a decoded approximate LSP parameter sequence ^θ_app[1], ^θ_app[2], . . . , ^θ_app[p];
- a decoded smoothed power spectral envelope series calculating unit that calculates a decoded smoothed power spectral envelope series ^W_γ[1], ^W_γ[2], . . . , ^W_γ[N] based on the decoded adjusted LSP parameter sequence ^θ_γ[1], ^θ_γ[2], . . . , ^θ_γ[p];
- a frequency domain decoding unit that generates decoded sound signals using a frequency domain signal sequence resulting from decoding of input frequency domain signal codes and the decoded smoothed power spectral envelope series ^W_γ[1], ^W_γ[2], . . . , ^W_γ[N];
- an LSP code decoding unit that decodes input LSP codes to obtain a decoded LSP parameter sequence ^θ[1], ^θ[2], . . . , ^θ[p]; and
- a time domain decoding unit that decodes input time domain signal codes, and generates decoded sound signals by synthesizing the time domain signal codes using either the decoded LSP parameter sequence obtained in the LSP code decoding unit for a preceding time segment or the decoded approximate LSP parameter sequence obtained in the decoded LSP linear transformation unit for the preceding time segment, and the decoded LSP parameter sequence for the predetermined time segment,
- wherein the parameter sequence conversion unit determines a value of each converted frequency domain parameter ˜ω[i] (i=1, 2, . . . , p) in the converted frequency domain parameter sequence ˜ω[1], ˜ω[2], . . . , ˜ω[p] through linear transformation which is based on a relationship of values between ω[i] and one or more frequency domain parameters adjacent to ω[i].
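
The claims above recite converting a decoded adjusted LSP parameter sequence into linear prediction coefficients and calculating a power spectral envelope series from those coefficients, without giving formulas. The following is a rough sketch of how those two steps are commonly done, under several assumptions that do not come from the claims: p is even, the LSP values are ascending angles in radians, the predictor polynomial is A(z) = 1 + a[1]z^-1 + ... + a[p]z^-p, the envelope is 1/(2π|A(e^{jw})|^2), and the N frequency points are w_n = πn/N. The names `poly_mul`, `lsp_to_lpc`, and `power_envelope` are hypothetical.

```python
import cmath
import math

def poly_mul(a, b):
    """Multiply two polynomials in z^-1 given as coefficient lists."""
    out = [0.0] * (len(a) + len(b) - 1)
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            out[i + j] += x * y
    return out

def lsp_to_lpc(theta):
    """Convert p LSP angles (ascending, p even) to coefficients a[1..p]."""
    p = len(theta)
    if p % 2:
        raise ValueError("this sketch only handles even p")
    sym = [1.0, 1.0]    # symmetric polynomial P(z) starts as (1 + z^-1)
    anti = [1.0, -1.0]  # antisymmetric polynomial Q(z) starts as (1 - z^-1)
    for k, t in enumerate(theta):
        factor = [1.0, -2.0 * math.cos(t), 1.0]
        if k % 2 == 0:
            sym = poly_mul(sym, factor)    # 1st, 3rd, ... angles belong to P(z)
        else:
            anti = poly_mul(anti, factor)  # 2nd, 4th, ... angles belong to Q(z)
    # A(z) = (P(z) + Q(z)) / 2; the z^-(p+1) terms cancel, so keep terms 1..p.
    return [(s + q) / 2.0 for s, q in zip(sym[1:p + 1], anti[1:p + 1])]

def power_envelope(a, n_bins):
    """Evaluate 1 / (2*pi*|A(e^{jw})|^2) at w = pi*n/n_bins, n = 0..n_bins-1."""
    env = []
    for n in range(n_bins):
        w = math.pi * n / n_bins
        acc = 1.0 + sum(c * cmath.exp(-1j * w * (i + 1)) for i, c in enumerate(a))
        env.append(1.0 / (2.0 * math.pi * abs(acc) ** 2))
    return env

# Example: p = 2 LSP angles chosen so that A(z) = 1 - 0.9 z^-1 + 0.2 z^-2.
theta = [math.acos(0.85), math.acos(0.05)]
coeffs = lsp_to_lpc(theta)
print(coeffs)
print(power_envelope(coeffs, 8))
```

Claims 2 and 4 calculate the envelope "based on" the LSP sequence without reciting the intermediate coefficient sequence. This sketch follows the two-step route of claims 1 and 3 only because it is the simpler one to write down.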

Current Assignee: Nippon Telegraph and Telephone Corporation
Original Assignee: Nippon Telegraph and Telephone Corporation
Inventors: Moriya, Takehiro; Kamamoto, Yutaka; Harada, Noboru; Kameoka, Hirokazu; Sugiura, Ryosuke
Primary Examiner(s): Riley, Marcus T
Application Number: US16/601,740
Publication Number: US 20200043506A1
Time in Patent Office: 203 Days
Field of Search: None
CPC Class Codes:
G10L 19/02   using spectral analysis, e....
G10L 19/07   Line spectrum pair [LSP] vo...
G10L 19/12   the excitation function bei...
G10L 25/06   the extracted parameters be...
G10L 25/12   the extracted parameters be...
