Voice encoding method and voice decoding method

US 5,473,727 A
Filed: 11/01/1993
Issued: 12/05/1995
Est. Priority Date: 10/31/1992
Status: Expired due to Term

First Claim

Patent Images

1. A method for encoding a compressed digital signal to provide a transmission signal resistant to transmission channel errors, the compressed digital signal being derived from a digital speech signal by dividing the digital speech signal in time to provide a signal block, orthogonally transforming the signal block to provide spectral data on the frequency axis, and using multi-band excitation to determine from the spectral data whether each of plural bands obtained by a pitch-dependent division of the spectral data in frequency represents one of a voiced (V) and an unvoiced (UV) sound, and to derive from the spectral data a spectral amplitude for each of a predetermined number of bands obtained by a fixed division of the spectral data by frequency, each spectral amplitude being a component of the compressed signal, the method comprising the steps of:

performing hierarchical vector quantizing to quantize the spectral amplitude of each of the predetermined number of bands to provide an upper-layer index, and to provide lower-layer indices fewer in number than the predetermined number of bands;

applying convolution coding to the upper-layer index to encode the upper-layer index for error correction, and to provide an error correction-coded upper-layer index; and

including the error correction-coded upper-level index and the lower-level indices in the transmission signal.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A compressed digital speech signal is encoded to provide a transmission error-resistant transmission signal. The compressed speech signal is derived from a digital speech signal by performing a pitch search on a block obtained by dividing the speech signal in time to provide pitch information for the block. The block of the speech signal is orthogonally transformed to provide spectral data, which is divided by frequency into plural bands in response to the pitch information. A voiced/unvoiced sound discrimination generates voiced/-unvoiced (V/UV) information indicating whether the spectral data in each of the plural bands represents a voiced or an unvoiced sound. The spectral data in the plural bands are interpolated to provide spectral amplitudes for a predetermined number of bands, independent of the pitch. Hierarchical vector quantizing is applied to the spectral amplitudes to generate upper-layer indices, representing an overview of the spectral amplitudes, and lower-layer indices, representing details of the spectral amplitudes. CRC error detection coding is applied to the upper-layer indices, the pitch information, and the V/UV information to generate CRC codes. Convolution coding for error correction is applied to the upper-layer indices, the higher-order bits of the lower-layer indices, the pitch information, the V/UV information, and the CRC codes. The convolution-coded quantities from two blocks of the speech signal are then interleaved in a frame of the transmission signal, together with the lower-order bits of the respective lower-layer indices.

127 Citations

7 Claims

1. A method for encoding a compressed digital signal to provide a transmission signal resistant to transmission channel errors, the compressed digital signal being derived from a digital speech signal by dividing the digital speech signal in time to provide a signal block, orthogonally transforming the signal block to provide spectral data on the frequency axis, and using multi-band excitation to determine from the spectral data whether each of plural bands obtained by a pitch-dependent division of the spectral data in frequency represents one of a voiced (V) and an unvoiced (UV) sound, and to derive from the spectral data a spectral amplitude for each of a predetermined number of bands obtained by a fixed division of the spectral data by frequency, each spectral amplitude being a component of the compressed signal, the method comprising the steps of:
- performing hierarchical vector quantizing to quantize the spectral amplitude of each of the predetermined number of bands to provide an upper-layer index, and to provide lower-layer indices fewer in number than the predetermined number of bands;
  
  applying convolution coding to the upper-layer index to encode the upper-layer index for error correction, and to provide an error correction-coded upper-layer index; and
  
  including the error correction-coded upper-level index and the lower-level indices in the transmission signal.
- View Dependent Claims (2, 3, 4, 5)
- - 2. The method of claim 1, wherein:
    - the step of performing hierarchical vector quantizing generates lower-level indices including higher-order bits and lower-order bits; and
      
      in the step of applying convolution coding, convolution coding is additionally applied to the higher-order bits of the lower-layer indices, and is not applied to the lower-order bits of the lower-layer indices.
  - 3. The method of claim 2, wherein the multi-band excitation is additionally used to determine pitch information for the signal block, the pitch information being additionally a component of the compressed signal, and determining whether each of the plural bands represents one of a voiced (V) and an unvoiced (UV) sound generates V/UV information for each of the plural bands, the V/UV information for each of the plural bands being additionally a component of the compressed signal, and wherein:
    - in the step of applying convolution coding, convolution coding is additionally applied to the pitch information and to the V/UV information for each of the plural bands.
  - 4. The method of claim 3, wherein:
    - the method additionally comprises the step of coding the pitch information, the V/UV information for each of the plural bands, and the upper-layer index for error detection using cyclic redundancy check (CRC) error detection coding to provide CRC-processed pitch information, V/UV information for each of the plural bands, and upper-layer index; and
      
      the step of applying convolution coding applies convolution coding to the CRC-processed pitch information, V/UV information for each of the plural bands, and upper-layer index, together with the higher-order bits of the lower-layer indices.
  - 5. The method of claim 4, wherein the digital speech signal is divided in time additionally to provide an additional signal block following the signal block at an interval of a frame, the frame being shorter than the signal block, and CRC-processed additional pitch information, additional V/UV information for each of plural bands, and additional upper-level index are derived from the additional signal block;
    - andin the step of applying convolution coding, the convolution coding is applied to a unit composed of the CRC-processed pitch information, the V/UV information for each of the plural bands, the upper-level index, and the CRC-processed additional pitch information, additional V/UV information for each of plural bands, and additional upper-level index.

6. A method for decoding a transmission signal that has been coded to provide resistance to transmission errors, the transmission signal including frames composed of pitch information, voiced/unvoiced (V/UV) information for each of plural bands, an upper-layer index and lower-layer indices generated by hierarchical vector quantizing, the lower-layer indices including upper-order bits and lower-order bits, the pitch information, the V/UV information, and the upper-layer index being coded to generate codes for cyclic redundancy check (CRC) error detection, the pitch information, the V/UV information, the upper-layer index, the upper-order bits of the lower-layer indices, and the CRC codes being convolution-coded, the method comprising the steps of:
- performing cyclic redundancy check (CRC) error detection on the pitch information, the V/UV information for each of plural bands, and the upper-layer index of each of the frames of the transmission signal;
  
  performing interpolation processing on frames of the transmission signal detected by the step of performing CRC error detection as including an error; and
  
  applying hierarchical vector dequantizing to the upper-layer index and the lower-layer indices of each frame following convolution decoding to generate spectral amplitudes for a predetermined number of bands.
- View Dependent Claims (7)
- - 7. The decoding method of claim 6, additionally comprising steps of:
    - expanding the pitch information, the V/UV information, the upper-level index, and the lower-layer indices of consecutive frames to produce spectral envelopes for consecutive ones of the frames using an expansion method; and
      
      controlling the expansion method in response to a dimensional relationship between the spectral envelopes produced from the consecutive ones of the frames, the expansion method being controlled for a predetermined number of frames beginning with a first one of the consecutive ones of the frames in which no uncorrected errors are detected by the step of performing CRC error detection.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Sony Corporation (Sony Group Corp.)
Original Assignee
Sony Corporation (Sony Group Corp.)
Inventors
Nishiguchi, Masayuki, Matsumoto, Jun, Ono, Shinobu, Wakatsuki, Ryoji
Primary Examiner(s)
MacDonald, Allen R.
Assistant Examiner(s)
Onka, Thomas

Application Number

US08/146,580
Time in Patent Office

764 Days
Field of Search

381/38, 381/47, 395/2.1, 395/2.12, 395/2.14, 395/2.16, 395/2.17, 395/2.31, 395/2.32, 395/2.35, 395/2.42, 395/2.77
US Class Current

704/222
CPC Class Codes

G10L 19/005   Correction of errors induce...

G10L 19/10   the excitation function bei...

G10L 25/93   Discriminating between voic...

Voice encoding method and voice decoding method

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

127 Citations

7 Claims

Specification

Solutions

Use Cases

Quick Links

Voice encoding method and voice decoding method

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

127 Citations

7 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links