System of encoding and decoding speech signals
First Claim
1. A speech compression system for processing a speech signal, the speech compression system comprising:
- an encoding system operable to select a rate for encoding the speech signal as a function of characterization of the speech signal, the encoding system operable to select a type classification, comprising one of a first type and a second type, the encoding system operable to encode the speech signal as a function of the rate and the type classification when a first rate is selected, the encoding system operable to encode the speech signal as a function of the rate, regardless of the type classification, when a second rate is selected; and
the encoding system operable to encode an adaptive codebook gain and a fixed codebook gain when the first rate is selected and the type classification is the second type, where the adaptive codebook gain is encoded exclusive of the fixed codebook gain.
12 Assignments
0 Petitions
Accused Products
Abstract
A speech compression system capable of encoding a speech signal into a bitstream for subsequent decoding to generate synthesized speech is disclosed. The speech compression system optimizes the bandwidth consumed by the bitstream by balancing the desired average bit rate with the perceptual quality of the reconstructed speech. The speech compression system comprises a full-rate codec, a half-rate codec, a quarter-rate codec and an eighth-rate codec. The codecs are selectively activated based on a rate selection. In addition, the full and half-rate codecs are selectively activated based on a type classification. Each codec is selectively activated to encode and decode the speech signals at different bit rates emphasizing different aspects of the speech signal to enhance overall quality of the synthesized speech.
-
Citations
66 Claims
-
1. A speech compression system for processing a speech signal, the speech compression system comprising:
-
an encoding system operable to select a rate for encoding the speech signal as a function of characterization of the speech signal, the encoding system operable to select a type classification, comprising one of a first type and a second type, the encoding system operable to encode the speech signal as a function of the rate and the type classification when a first rate is selected, the encoding system operable to encode the speech signal as a function of the rate, regardless of the type classification, when a second rate is selected; and
the encoding system operable to encode an adaptive codebook gain and a fixed codebook gain when the first rate is selected and the type classification is the second type, where the adaptive codebook gain is encoded exclusive of the fixed codebook gain. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A variable rate speech compression system for processing a speech signal, the variable rate speech compression system comprising:
-
a first codec operable to encode and decode a first frame of the speech signal, the first codec activated as a function of a selected bit rate, the first codec operable to encode and decode as a function of a type classification, the type classification comprising one of a first type and a second type; and
a second codec operable to encode and decode a second frame of the speech signal, the second codec activated as a function of a selected bit rate to encode and decode the second frame regardless of the type classification;
where the first codec is operable to encode an adaptive codebook gain and a fixed codebook gain when the first rate is selected and the type classification is the second type, and where the adaptive codebook gain is encoded exclusive of the fixed codebook gain. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A variable rate speech compression system for processing a speech signal, the variable rate speech compression system comprising:
-
an encoding system operable to extract and encode parameters of a frame of the speech signal, the encoding system comprising;
a frame processing module operable to encode parameters representative of the frame as a function of a rate selection comprising one of a first rate and a second rate and a type classification comprising one of a first type and a second type, the frame processing module comprising a pitch pre-processing module operable to modify the speech signal to improve encoding; and
a subframe processing module operable to encode parameters representative of a subframe of the frame as a function of the rate selection and the type classification;
where the pitch pre-processing module is operable to modify the speech signal as a function of a grade characterization of the speech signal. - View Dependent Claims (22, 23, 24, 25, 26, 27, 28, 29, 34, 35, 36)
-
-
30. A variable rate speech compression system for processing a speech signal, the variable rate speech compression system comprising:
-
an encoding system operable to extract and encode parameters of a frame of the speech signal, the encoding system comprising;
a frame processing module operable to encode parameters representative of the frame as a function of a rate selection comprising one of a first rate and a second rate and a type classification comprising one of a first type and a second type, the frame processing module comprising a pitch pre-processing module operable to modify the speech signal to improve encoding; and
a subframe processing module operable to encode parameters representative of a subframe of the frame as a function of the rate selection and the type classification;
where the subframe processing module is operable to search a fixed codebook to determine a fixed codebook contribution for the subframe, when the rate selection is the first rate, and the type classification is the first type;
where the fixed codebook comprises three 5-pulse codebooks.
-
-
31. A variable rate speech compression system for processing a speech signal, the variable rate speech compression system comprising:
-
an encoding system operable to extract and encode parameters of a frame of the speech signal, the encoding system comprising;
a frame processing module operable to encode parameters representative of the frame as a function of a rate selection comprising one of a first rate and a second rate and a type classification comprising one of a first type and a second type, the frame processing module comprising a pitch pre-processing module operable to modify the speech signal to improve encoding; and
a subframe processing module operable to encode parameters representative of a subframe of the frame as a function of the rate selection and the type classification;
where the subframe processing module is operable to search a fixed codebook to determine a fixed codebook contribution for the subframe, when the rate selection is the first rate, and the type classification is the second type;
where the fixed codebook comprises an 8-pulse codebook.
-
-
32. A variable rate speech compression system for processing a speech signal, the variable rate speech compression system comprising:
-
an encoding system operable to extract and encode parameters of a frame of the speech signal, the encoding system comprising;
a frame processing module operable to encode parameters representative of the frame as a function of a rate selection comprising one of a first rate and a second rate and a type classification comprising one of a first type and a second type, the frame processing module comprising a pitch pre-processing module operable to modify the speech signal to improve encoding; and
a subframe processing module operable to encode parameters representative of a subframe of the frame as a function of the rate selection and the type classification;
where the subframe processing module is operable to search a fixed codebook to determine a fixed codebook contribution for the subframe, when the rate selection is the second rate, and the type classification is the first type;
the fixed codebook comprising a 2-pulse codebook, a 3-pulse codebook and a Gaussian codebook.
-
-
33. A variable rate speech compression system for processing a speech signal, the variable rate speech compression system comprising:
-
an encoding system operable to extract and encode parameters of a frame of the speech signal, the encoding system comprising;
a frame processing module operable to encode parameters representative of the frame as a function of a rate selection comprising one of a first rate and a second rate and a type classification comprising one of a first type and a second type, the frame processing module comprising a pitch pre-processing module operable to modify the speech signal to improve encoding; and
a subframe processing module operable to encode parameters representative of a subframe of the frame as a function of the rate selection and the type classification;
where the subframe processing module is operable to search a fixed codebook to determine a fixed codebook contribution for the subframe, when the rate selection is the second rate, and the type classification is the second type;
where the fixed codebook comprises a 2-pulse codebook and a 3-pulse codebook.
-
-
37. A system for processing a frame of a speech signal, the system comprising:
-
a first codec selectively activated as a function of a rate selected for the frame, the first codec comprising;
a plurality of frame processing modules operable to encode parameters representative of the frame as a function of a type classification of the frame to form a first portion of an encoded speech signal; and
a plurality of subframe processing modules operable to encode parameters representative of each of a plurality of subframes of the frame as a function of the type classification to form a second portion of the encoded speech signal; and
a second codec selectively activated as a function of the rate selected for the frame, the second codec comprising;
an initial frame-processing module operable to encode parameters representative of the frame regardless of the type classification; and
a rate module operable to encode parameters of the frame regardless of the type classification;
where the first codec is operable to encode an adaptive codebook gain and a fixed codebook gain as a function of the rate and the type classification, and where the adaptive codebook gain is encoded exclusive of the fixed codebook gain. - View Dependent Claims (38, 39, 40, 41, 42, 43, 44, 47, 48)
h) post-processing the frame as a function of the first rate selection when the first rate selection was previously selected; and
i) post-processing the frame as a function of the second rate selection when the second rate selection was previously selected.
-
-
48. The method of claim 47, where h) comprises performing a fine tuning search for a pitch period as a function of pitch correlation and gain controlled harmonic filtering.
-
45. A method of processing a speech signal with a speech compression system, the method comprising:
-
a) extracting parameters from a frame of the speech signal;
b) selecting from one of a first rate selection and a second rate selection as a function of the parameters extracted from the frame;
c) selecting a type classification as a function of the parameters extracted from the frame when the first rate selection is selected;
d) encoding the frame as a function of the first rate selection and the type classification when the first rate selection is selected; and
e) encoding the frame as a function of the second rate selection, regardless of the type classification, when the second rate selection is selected;
where b) comprises characterizing the frame as a function of the parameters., and where the characterizing the frame comprises;
performing an initial characterization, and performing a final characterization.- View Dependent Claims (46, 49, 50, 51, 52, 53, 54)
f) decoding the frame as a function of the first rate selection and the type classification when the first rate selection was previously selected; and
g) decoding the frame as a function of the second rate selection regardless of the type classification when the second rate selection was previously selected.
-
-
49. The method of claim 45, where d) comprises encoding with a full-rate codec and a half-rate codec.
-
50. The method of claim 45, where e) comprises encoding with one of a quarter-rate codec and an eighth-rate codec.
-
51. The method of claim 45, where d) comprises:
-
generating a first portion of a bitstream by encoding parameters representative of the frame; and
generating a second portion of the bitstream by encoding parameters representative of each of a plurality of subframes of the frame.
-
-
52. The method of claim 51, where d) further comprises determining as a function of the type classification the parameters representative of the frame and the parameters representative of one of the subframes.
-
53. The method of claims 45, where c) comprises:
-
characterizing the frame as at least one of silence/background noise, stationary noise-like unvoiced speech, non-stationary unvoiced speech, onset speech and non-stationary voiced speech; and
selecting type zero.
-
-
54. The method of claim 45, where c) comprises:
-
characterizing the frame as stationary voiced speech; and
selecting type one.
-
-
55. A method of processing a speech signal, the method comprising:
-
a) extracting parameters from a frame of the speech signal;
b) selecting a bit rate as a function of the extracted parameters;
c) selecting a type classification as a function of the extracted parameters;
d) encoding extracted parameters representative of the frame as a function of the bit rate and the type classification; and
e) encoding extracted parameters representative of each of a plurality of subframes of the frame as a function of the bit rate and the type classification;
where e) comprises encoding a fixed codebook contribution when the type classification is type one, and where encoding the fixed codebook contribution comprises searching an 8-pulse codebook when the bit rate selected is a full rate. - View Dependent Claims (56, 57, 58, 59, 60, 61, 63, 64, 65, 66)
injecting high frequency noise into a plurality of codebooks;
incorporating additional pulses into the codebooks;
filtering the codebooks; and
searching the codebooks to determine the fixed codebook contribution.
-
-
66. The method of claim 64, where encoding the fixed codebook gain and the adaptive codebook gain comprises:
-
jointly quantizing the fixed codebook gain and the adaptive codebook gain for each of the subframes; and
determining a quantized gain vector representing the joint quantization.
-
-
62. A method of processing a speech signal, the method comprising:
-
a) extracting parameters from a frame of the speech signal;
b) selecting a bit rate as a function of the extracted parameters;
c) selecting a type classification as a function of the extracted parameters;
d) encoding extracted parameters representative of the frame as a function of the bit rate and the type classification; and
e) encoding extracted parameters representative of each of a plurality of subframes of the frame as a function of the bit rate and the type classification;
where e) comprises encoding a fixed codebook contribution when the type classification is type one, and where encoding the fixed codebook contribution comprises searching a 2-pulse codebook and a 3-pulse codebook when the bit rate selected is a half rate.
-
Specification