ENCODING DEVICE, DECODING DEVICE, AND METHOD THEREOF FOR SPECIFYING A BAND OF A GREAT ERROR

US 20130332150A1
Filed: 08/14/2013
Published: 12/12/2013
Est. Priority Date: 03/02/2007
Status: Active Grant

First Claim

Patent Images

1. A speech encoding apparatus, comprising:

a first layer encoding section that performs encoding processing with respect to an input speech signal to generate first layer encoded data;

a first layer decoding section that performs decoding processing using the first layer encoded data to generate a first layer decoded signal;

a first layer error transform coefficient calculation section that transforms a first layer error signal which is an error between the input speech signal and the first layer decoded signal into a frequency domain to calculate first layer error transform coefficients; and

a second layer encoding section that performs encoding processing with respect to the first layer error transform coefficients to generate second layer encoded data,wherein the second layer encoding section comprises;

a setting section that sets a low-frequency band and a high-frequency band for the first layer error transform coefficients, sets a fixed band in the high-frequency band and sets a plurality of band candidates in the low-frequency band;

a selection section that calculates perceptual weighted energy of the first layer error transform coefficients in each of the plurality of band candidates and selects one band from among the plurality of band candidates in the low-frequency band based on the perceptual weighted energy;

a concatenated band configuring section that concatenates the one band selected in the low-frequency band and the fixed band in the high-frequency band to configure a concatenated band; and

an encoded data generation section that encodes the first layer error transform coefficients included in the concatenated band to generate the second layer encoded data.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

Disclosed is an encoding device which can accurately specify a band having a large error among all the bands by using a small calculation amount. A first position identifier uses a first layer error conversion coefficient indicating an error of a decoding signal for an input signal so as to search for a band having a large error in a relatively wide bandwidth in all the bands of the input signal and generates first position information indicating the identified band. A second position identifier searches for a target frequency band having a large error in a relatively narrow bandwidth in the band identified by the first position identifier and generates second position information indicating the identified target frequency band. An encoder encodes a first layer decoding error conversion coefficient contained in the target frequency band.

9 Citations

View as Search Results

8 Claims

1. A speech encoding apparatus, comprising:
- a first layer encoding section that performs encoding processing with respect to an input speech signal to generate first layer encoded data;
  
  a first layer decoding section that performs decoding processing using the first layer encoded data to generate a first layer decoded signal;
  
  a first layer error transform coefficient calculation section that transforms a first layer error signal which is an error between the input speech signal and the first layer decoded signal into a frequency domain to calculate first layer error transform coefficients; and
  
  a second layer encoding section that performs encoding processing with respect to the first layer error transform coefficients to generate second layer encoded data,wherein the second layer encoding section comprises;
  
  a setting section that sets a low-frequency band and a high-frequency band for the first layer error transform coefficients, sets a fixed band in the high-frequency band and sets a plurality of band candidates in the low-frequency band;
  
  a selection section that calculates perceptual weighted energy of the first layer error transform coefficients in each of the plurality of band candidates and selects one band from among the plurality of band candidates in the low-frequency band based on the perceptual weighted energy;
  
  a concatenated band configuring section that concatenates the one band selected in the low-frequency band and the fixed band in the high-frequency band to configure a concatenated band; and
  
  an encoded data generation section that encodes the first layer error transform coefficients included in the concatenated band to generate the second layer encoded data.
- View Dependent Claims (2, 3)
- - 2. The speech encoding apparatus according to claim 1,wherein the encoded data generation section comprises a pulse position specifying section that specifies positions of a plurality of pulses from among pulse candidate positions set in the concatenated band based on the first layer error transform coefficients, and generates pulse position information showing the specified positions of the plurality of pulses, andthe encoded data generation section generates the second layer encoded data using selection information showing the one band selected in the low-frequency band and the pulse position information.
  - 3. The speech encoding apparatus according to claim 1, wherein a bandwidth of a band candidate is different from a bandwidth of the fixed band.

4. A speech decoding apparatus, comprising:
- a receiving section that receives;
  
  first layer encoded data acquired in a speech encoding apparatus by performing encoding processing with respect to an input speech signal; and
  
  second layer encoded data acquired in the speech encoding apparatus by transforming a first layer error signal which is an error between a first layer decoded signal obtained by decoding the first layer encoded data and the input speech signal into a frequency domain to calculate first layer error transform coefficients and by performing encoding processing with respect to the first layer error transform coefficients;
  
  a first layer decoding section that decodes the first layer encoded data to generate the first layer decoded signal;
  
  a second layer decoding section that decodes the second layer encoded data to generate first layer decoded error transform coefficients;
  
  a time domain transforming section that transforms the first layer decoded error transform coefficients into a time domain to generate a first layer decoded error signal; and
  
  an addition section that adds the first layer decoded signal and the first layer decoded error signal to generate a decoded signal,wherein the second layer decoding section comprises;
  
  a setting section that sets a low-frequency band and a high-frequency band for the first layer error transform coefficients, sets a fixed band in the high-frequency band and sets a plurality of band candidates in the low-frequency band; and
  
  a decoded error transform coefficient generation section that decodes the second layer encoded data to generate selection information showing a position of a specific band from among the plurality of band candidates and pulse position information showing positions of pulses in a concatenated band of the specific band and the fixed band, specifies positions of pulses in the low-frequency band using the pulse position information corresponding to the specific band and the selection information and specifies positions of pulses in the high-frequency band using the pulse position information corresponding to the fixed band, to generate the first layer decoded error transform coefficients.
- View Dependent Claims (5, 6)
- - 5. The speech decoding apparatus according to claim 4, wherein the second layer encoded data comprises the selection information and encoded information, andthe encoded information comprises position information of a plurality of pulses and gain information of the plurality of pulses.
  - 6. The speech decoding apparatus according to claim 4, wherein a bandwidth of a band candidate is different from a bandwidth of the fixed band.

7. A speech encoding method, comprising:
- performing encoding processing with respect to an input speech signal to generate first layer encoded data;
  
  performing decoding processing using the first layer encoded data to generate a first layer decoded signal;
  
  transforming a first layer error signal which is an error between the input speech signal and the first layer decoded signal into a frequency domain to calculate first layer error transform coefficients; and
  
  performing encoding processing with respect to the first layer error transform coefficients to generate second layer encoded data,wherein the encoding processing with respect to the first layer error transform coefficients comprises;
  
  setting a low-frequency band and a high-frequency band for the first layer error transform coefficients, setting a fixed band in the high-frequency band and setting a plurality of band candidates in the low-frequency band;
  
  calculating perceptual weighted energy of the first layer error transform coefficients in each of the plurality of band candidates and selecting one band from among the plurality of band candidates in the low-frequency band based on the perceptual weighted energy;
  
  concatenating the one band selected in the low-frequency band and the fixed band in the high-frequency band to configure a concatenated band; and
  
  encoding the first layer error transform coefficients included in the concatenated band to generate the second layer encoded data.

8. A speech decoding method, comprising:
- receiving;
  
  first layer encoded data acquired using a speech encoding method by performing encoding processing with respect to an input speech signal; and
  
  second layer encoded data acquired using the speech encoding method by transforming a first layer error signal which is an error between a first layer decoded signal obtained by decoding the first layer encoded data and the input speech signal into a frequency domain to calculate first layer error transform coefficients and by performing encoding processing with respect to the first layer error transform coefficients;
  
  decoding the first layer encoded data to generate the first layer decoded signal;
  
  decoding the second layer encoded data to generate first layer decoded error transform coefficients;
  
  transforming the first layer decoded error transform coefficients into a time domain to generate a first layer decoded error signal; and
  
  adding the first layer decoded signal and the first layer decoded error signal to generate a decoded signal, whereinin the decoding of the second layer encoded data;
  
  a low-frequency band and a high-frequency band for the first layer error transform coefficients are set, a fixed band in the high-frequency band is set and a plurality of band candidates in the low-frequency band is set;
  
  the second layer encoded data is decoded to generate selection information showing a position of a specific band from among the plurality of band candidates and pulse position information showing positions of pulses in a concatenated band of the specific band and the fixed band; and
  
  positions of first pulses in the low-frequency band and positions of second pulses in the high-frequency band are specified to generate the first layer decoded error transform coefficients, the first pulses being specified using the pulse position information corresponding to the specific band and the selection information and the second pulses being specified using the pulse position information corresponding to the fixed band.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Panasonic Intellectual Property Corporation of America (Panasonic Holdings Corporation)
Original Assignee
Panasonic Corporation (Panasonic Holdings Corporation)
Inventors
OSHIKIRI, Masahiro, YAMANASHI, Tomofumi, MORII, Toshiyuki

Granted Patent

US 8,935,162 B2
Time in Patent Office

Days
Field of Search
US Class Current

704/205
CPC Class Codes

G10L 19/00   Speech or audio signals ana...

G10L 19/005   Correction of errors induce...

G10L 19/0204   using subband decomposition

G10L 19/0208   Subband vocoders

G10L 19/0212   using orthogonal transforma...

G10L 19/24   Variable rate codecs, e.g. ...

ENCODING DEVICE, DECODING DEVICE, AND METHOD THEREOF FOR SPECIFYING A BAND OF A GREAT ERROR

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

9 Citations

8 Claims

Specification

Solutions

Use Cases

Quick Links

ENCODING DEVICE, DECODING DEVICE, AND METHOD THEREOF FOR SPECIFYING A BAND OF A GREAT ERROR

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

9 Citations

8 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links