Speech coding system and method using silence enhancement
First Claim
1. A method, comprising:
receiving, at a computing device, an input audio signal including a zero level, wherein the input audio signal comprises a plurality of audio frames;
determining, by the computing device, that at least one frame of the plurality of audio frames has a volume level within a selected range of the zero level;
modifying, by the computing device, the volume level of the at least one frame to correspond to the zero level of the input audio signal to generate an enhanced audio signal;
filtering the enhanced audio signal by a high-pass filter to generate a filtered audio signal;
attenuating noise included in the filtered audio signal to generate a pre-processed speech signal;
encoding, by the computing device, the pre-processed speech signal to generate an encoded plurality of audio frames, wherein the encoded plurality of audio frames are decodable by a speech decoding system to generate a reproduced version of the input audio signal; and
transmitting, to the speech decoding system via a communication link, a signal that includes the encoded plurality of audio frames.
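The first three pre-processing steps of this claim (detecting near-zero frames, setting them to the zero level, and high-pass filtering) can be sketched as follows. This is a minimal illustration, not the patented implementation: the claim does not say how the "volume level" is measured, so the sketch assumes it is the peak deviation from the zero level within a frame. The names `enhance_silence`, `high_pass`, `frame_len`, `tolerance`, and `alpha` are hypothetical, and the first-order DC-blocking filter merely stands in for whatever high-pass filter a real coder would use. Noise attenuation, encoding, and transmission are omitted.

```python
from typing import List


def enhance_silence(samples: List[int], frame_len: int = 160,
                    zero_level: int = 0, tolerance: int = 8) -> List[int]:
    """Set every frame that stays within `tolerance` of `zero_level` to `zero_level`.

    Assumption: a frame's "volume level" is its peak deviation from the zero level.
    """
    out = list(samples)
    for start in range(0, len(out), frame_len):
        frame = out[start:start + frame_len]
        # A frame counts as silence only if all of its samples are near the zero level.
        if all(abs(s - zero_level) <= tolerance for s in frame):
            out[start:start + len(frame)] = [zero_level] * len(frame)
    return out


def high_pass(samples: List[float], alpha: float = 0.95) -> List[float]:
    """First-order DC-blocking filter: y[n] = x[n] - x[n-1] + alpha * y[n-1]."""
    out: List[float] = []
    prev_x = 0.0
    prev_y = 0.0
    for x in samples:
        y = x - prev_x + alpha * prev_y
        out.append(y)
        prev_x, prev_y = x, y
    return out


if __name__ == "__main__":
    # Two 4-sample frames: low-level noise followed by louder content.
    signal = [3, -2, 5, 0, 100, -120, 90, 4]
    enhanced = enhance_silence(signal, frame_len=4, tolerance=8)
    filtered = high_pass(enhanced)
    print(enhanced)
    print(filtered)
```

Setting whole frames to the zero level before filtering means the later stages see an exactly constant input during silence, rather than low-level toggling around zero.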
Abstract
Various techniques for speech coding and decoding are disclosed. For example, speech data generated from a speech signal may be decoded by receiving the speech data in a format that has at least one main pulse in a subframe of the speech data, and generating a first predicted pulse that has a lower gain than the main pulse. A second predicted pulse may also be generated as a mirror image of the first predicted pulse on a reverse time scale, on the other side of the main pulse in the subframe of the speech data. The speech signal may be reconstructed using the first predicted pulse and the second predicted pulse.
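As a rough illustration of the mirrored predicted pulses described in the abstract, the sketch below places one main pulse in a subframe, adds a lower-gain predicted pulse on one side of it, and mirrors that pulse onto the other side. The abstract does not specify how far the predicted pulses sit from the main pulse or how much lower their gain is, so `offset` and `attenuation` are assumed parameters, and the function name `add_predicted_pulses` is hypothetical.

```python
from typing import List


def add_predicted_pulses(subframe_len: int, main_pos: int, main_gain: float,
                         offset: int, attenuation: float = 0.5) -> List[float]:
    """Build a subframe excitation with a main pulse and two mirrored predicted pulses.

    Assumptions (not taken from the source): the predicted pulses sit `offset`
    samples from the main pulse, and their gain is `main_gain * attenuation`.
    """
    if not 0 <= main_pos < subframe_len:
        raise ValueError("main pulse must lie inside the subframe")
    if offset <= 0:
        raise ValueError("offset must be positive")
    if not 0.0 < attenuation < 1.0:
        raise ValueError("predicted pulses must have a lower gain than the main pulse")

    excitation = [0.0] * subframe_len
    excitation[main_pos] = main_gain
    predicted_gain = main_gain * attenuation

    # First predicted pulse on one side of the main pulse.
    first = main_pos + offset
    if first < subframe_len:
        excitation[first] += predicted_gain

    # Second predicted pulse: the mirror image on the reverse time scale.
    second = main_pos - offset
    if second >= 0:
        excitation[second] += predicted_gain

    return excitation


if __name__ == "__main__":
    print(add_predicted_pulses(subframe_len=10, main_pos=5, main_gain=1.0, offset=3))
```

Predicted pulses that would fall outside the subframe are simply dropped here; a real decoder would need its own rule for subframe boundaries.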
16 Claims
1. A method, comprising:
receiving, at a computing device, an input audio signal including a zero level, wherein the input audio signal comprises a plurality of audio frames;
determining, by the computing device, that at least one frame of the plurality of audio frames has a volume level within a selected range of the zero level;
modifying, by the computing device, the volume level of the at least one frame to correspond to the zero level of the input audio signal to generate an enhanced audio signal;
filtering the enhanced audio signal by a high-pass filter to generate a filtered audio signal;
attenuating noise included in the filtered audio signal to generate a pre-processed speech signal;
encoding, by the computing device, the pre-processed speech signal to generate an encoded plurality of audio frames, wherein the encoded plurality of audio frames are decodable by a speech decoding system to generate a reproduced version of the input audio signal; and
transmitting, to the speech decoding system via a communication link, a signal that includes the encoded plurality of audio frames.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. A method, comprising:
receiving, at a computing device, a speech signal including a zero level and one or more samples, wherein the one or more samples have corresponding magnitude levels;
determining, by the computing device, that at least one sample of the one or more samples corresponds to silence noise;
in response to the at least one sample corresponding to silence noise, the computing device adjusting the corresponding magnitude levels of the at least one sample to the zero level to generate an enhanced audio signal;
filtering the enhanced audio signal by a high-pass filter to generate a filtered audio signal;
attenuating noise included in the filtered audio signal to generate a pre-processed speech signal;
encoding, by the computing device, the pre-processed speech signal to generate encoded one or more samples, wherein the encoded one or more samples are decodable by a speech decoding system to generate a reproduced version of the speech signal; and
transmitting, to the speech decoding system via a communication link, a signal that includes the encoded one or more samples.
Dependent claims: 11, 12, 13, 14, 15, 16
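Claim 10 works on individual samples rather than frames and does not state how a sample is judged to be silence noise. One possible reading, sketched below purely as an assumption, treats a sample as silence noise when it belongs to a run of at least `min_run` consecutive samples that all lie within `tolerance` of the zero level. The function name and all three parameters are hypothetical.

```python
from typing import List


def zero_silence_samples(samples: List[int], zero_level: int = 0,
                         tolerance: int = 2, min_run: int = 3) -> List[int]:
    """Set samples in sufficiently long near-zero runs to `zero_level`.

    Assumption: "silence noise" means a run of at least `min_run` consecutive
    samples whose magnitudes stay within `tolerance` of the zero level.
    """
    out = list(samples)
    run_start = None
    # Iterate one step past the end so a run reaching the last sample is closed.
    for i in range(len(samples) + 1):
        quiet = i < len(samples) and abs(samples[i] - zero_level) <= tolerance
        if quiet:
            if run_start is None:
                run_start = i
        else:
            if run_start is not None and i - run_start >= min_run:
                out[run_start:i] = [zero_level] * (i - run_start)
            run_start = None
    return out


if __name__ == "__main__":
    speech = [1, -1, 2, 50, 1, -60, 0, 1, -2, 1]
    print(zero_silence_samples(speech))
```

Requiring a minimum run length keeps isolated low-magnitude samples inside active speech, such as zero crossings, from being flattened.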
Specification