Speech compression system and method
First Claim
Patent Images
1. A method of encoding an input speech signal to generate an encoded speech, the method comprising:
- selecting one of a first framing structure and a second framing structure for encoding parameters of a frame of the input speech signal;
determining a pitch characteristic for each of a plurality of subframes of the frame of the input speech signal, when the second framing structure is selected, wherein the determining the pitch characteristic is performed as a function of prediction coefficients associated with each of the plurality of subframes;
searching a 3-pulse codebook including pulses having relative positions with respect to each other, when the second framing structure is selected, to determine a fixed codebook excitation vector with three pulses having the relative positions with respect to each other;
enhancing the fixed codebook excitation vector using the pitch characteristic to generate an enhanced fixed codebook excitation vector;
determining a fixed codebook gain for each of the plurality of subframes, when the second framing structure is selected, based on the enhanced the fixed codebook excitation vector; and
generating the encoded speech using the fixed codebook gain.
9 Assignments
0 Petitions
Accused Products
Abstract
The invention improves the encoding and decoding of speech by focusing the encoding on the perceptually important characteristics of speech. The system analyzes selected features of an input speech signal, and first performing a common frame based speech coding of an input speech signal. The system then performs a speech coding based on either a first speech coding mode or a second speech coding mode. The selection of a mode is based on characteristics of the input speech signal. The first speech coding mode uses a first framing structure and the second speech coding mode uses a second framing structure.
29 Citations
8 Claims
-
1. A method of encoding an input speech signal to generate an encoded speech, the method comprising:
-
selecting one of a first framing structure and a second framing structure for encoding parameters of a frame of the input speech signal; determining a pitch characteristic for each of a plurality of subframes of the frame of the input speech signal, when the second framing structure is selected, wherein the determining the pitch characteristic is performed as a function of prediction coefficients associated with each of the plurality of subframes; searching a 3-pulse codebook including pulses having relative positions with respect to each other, when the second framing structure is selected, to determine a fixed codebook excitation vector with three pulses having the relative positions with respect to each other; enhancing the fixed codebook excitation vector using the pitch characteristic to generate an enhanced fixed codebook excitation vector; determining a fixed codebook gain for each of the plurality of subframes, when the second framing structure is selected, based on the enhanced the fixed codebook excitation vector; and generating the encoded speech using the fixed codebook gain. - View Dependent Claims (2, 3, 4)
-
-
5. A speech compression system for encoding an input speech signal to generate an encoded speech, the speech compression system comprising:
-
a mode selection module configured to selected one of a first framing structure and a second framing structure for encoding parameters of a frame of the input speech signal; and wherein the speech compression system is configured to; determine a pitch characteristic for each of a plurality of subframes of the frame of the input speech signal, when the second framing structure is selected, wherein the pitch characteristic is determined as a function of prediction coefficients associated with each of the plurality of subframes; search a 3-pulse codebook including pulses having relative positions with respect to each other, when the second framing structure is selected, to determine a fixed codebook excitation vector with three pulses having the relative positions with respect to each other; enhance the fixed codebook excitation vector using the pitch characteristic to generate an enhanced fixed codebook excitation vector; determine a fixed codebook gain for each of the plurality of subframes, when the second framing structure is selected, based on the enhanced the fixed codebook excitation vector; generate the encoded speech using the fixed codebook gain. - View Dependent Claims (6, 7, 8)
-
Specification