Synthesis of speech signals in the absence of coded parameters
First Claim
1. A method of generating coefficient signals representing frequency components of a signal representing speech information based on an estimate of the signal spectrum and a noise masking measure associated with the speech signal, the method comprising:
- generating a first signal relating the estimate of the signal spectrum to the noise masking measure at each of one or more frequencies;
for one or more of said frequencies, forming a coefficient signal magnitude based on said first signal at a corresponding frequency, andselecting a coefficient phase at random, wherein the step of forming a coefficient signal magnitude comprises forming the magnitude as a function of a quantized gain signal associated with the frequency corresponding to said coefficient, wherein the first signal relating the estimate of the signal spectrum to the noise masking measure at each of one or more frequencies comprises a ratio of the estimate of the signal spectrum to a perceptual threshold signal, and wherein when said ratio is greater than a predetermined threshold, the magnitude of said coefficients is at least a predetermined amount above said gain signal evaluated at said frequency.
4 Assignments
0 Petitions
Accused Products
Abstract
A speech compression system called "Transform Predictive Coding", or TPC, provides for encoding 7 kHz wideband speech (16 kHz sampling) at a target bit-rate range of 16 to 32 kb/s (1 to 2 bits/sample). The system uses short-term and long-term prediction to remove the redundancy in speech. A prediction residual is transformed and coded in the frequency domain to take advantage of knowledge in human auditory perception. The TPC coder uses only open-loop quantization and therefore has a fairly low complexity. The speech quality of TPC is essentially transparent at 32 kb/s, very good at 24 kb/s, and acceptable at 16 kb/s.
77 Citations
6 Claims
-
1. A method of generating coefficient signals representing frequency components of a signal representing speech information based on an estimate of the signal spectrum and a noise masking measure associated with the speech signal, the method comprising:
-
generating a first signal relating the estimate of the signal spectrum to the noise masking measure at each of one or more frequencies; for one or more of said frequencies, forming a coefficient signal magnitude based on said first signal at a corresponding frequency, and selecting a coefficient phase at random, wherein the step of forming a coefficient signal magnitude comprises forming the magnitude as a function of a quantized gain signal associated with the frequency corresponding to said coefficient, wherein the first signal relating the estimate of the signal spectrum to the noise masking measure at each of one or more frequencies comprises a ratio of the estimate of the signal spectrum to a perceptual threshold signal, and wherein when said ratio is greater than a predetermined threshold, the magnitude of said coefficients is at least a predetermined amount above said gain signal evaluated at said frequency. - View Dependent Claims (2, 3)
-
-
4. A method of generating coefficient signals representing frequency components of a signal representing speech information based on an estimate of the signal spectrum and a noise masking measure associated with the speech signal, the method comprising:
-
generating a first signal relating the estimate of the signal spectrum to the noise masking measure at each of one or more frequencies; for one or more of said frequencies, forming a coefficient signal magnitude based on said first signal at a corresponding frequency, and selecting a coefficient phase at random, wherein the step of forming a coefficient signal magnitude comprises forming the magnitude as a function of a quantized gain signal associated with the frequency corresponding to said coefficient, wherein the first signal relating the estimate of the signal spectrum to the noise masking measure at each of one or more frequencies comprises a ratio of the estimate of the signal spectrum to a perceptual threshold signal, and wherein when said ratio is less than or equal to a predetermined threshold, the magnitude of said coefficients is at least a predetermined amount below said gain signal evaluated at said frequency. - View Dependent Claims (5, 6)
-
Specification