Scalable and embedded codec for speech and audio signals
First Claim
1. A system for processing audio signals comprising:
- a scalable embedded audio-speech encoder comprising:
(a) an audio-speech frame extractor for dividing a low bit-rate input audio signal into a plurality of signal frames corresponding to successive time intervals;
(b) an audio-speech frame mode classifier for determining if the low bit-rate signal in a frame is in a steady-state mode or a transition state mode;
(c) an audio-speech processor for extracting parameters of the low bit-rate signal in a frame, received from said frame mode classifier, wherein said extracted parameters include supplemental phase information for transition state mode frames; and
(d) a multi-mode audio-speech coder for processing extracted parameters of frames of the low bit-rate signal in at least two distinct paths, a first path processing a first set of extracted parameters using a first bit allocation when a signal in a frame is determined to be in said steady-state mode, and a second path processing a second set of extracted parameters including supplemental phase information using a second bit allocation when a signal in a frame is determined to be in said transition state mode.
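The structure recited in elements (a) through (d) can be pictured with a minimal Python sketch. It is an illustration only, not the patented implementation: the frame length, the energy-ratio classifier, the use of a single DFT bin as the "supplemental phase", and every number in the bit-allocation table are assumptions introduced here, since the claim does not specify them.

```python
import math

FRAME_LEN = 160  # hypothetical frame length: 20 ms at an assumed 8 kHz sampling rate

# Hypothetical per-frame bit allocations. The claim only requires that the two
# modes use different allocations and that the transition path carries phase.
BIT_ALLOCATION = {
    "steady": {"spectrum": 30, "pitch": 8, "gain": 6},
    "transition": {"spectrum": 24, "pitch": 8, "gain": 6, "phase": 12},
}


def extract_frames(signal, frame_len=FRAME_LEN):
    """(a) Split the signal into successive non-overlapping frames.

    A trailing partial frame is dropped.
    """
    count = len(signal) // frame_len
    return [signal[i * frame_len:(i + 1) * frame_len] for i in range(count)]


def frame_energy(frame):
    """Mean squared amplitude of one frame."""
    return sum(x * x for x in frame) / len(frame)


def classify(energy, prev_energy, ratio_threshold=4.0):
    """(b) Stand-in classifier: a large energy jump or drop marks a transition."""
    eps = 1e-12  # avoids division by zero on silent frames
    ratio = (energy + eps) / (prev_energy + eps)
    if ratio > ratio_threshold or ratio < 1.0 / ratio_threshold:
        return "transition"
    return "steady"


def bin_phase(frame, k):
    """Phase in radians of DFT bin k, used here as the supplemental phase."""
    n = len(frame)
    re = sum(x * math.cos(2 * math.pi * k * i / n) for i, x in enumerate(frame))
    im = -sum(x * math.sin(2 * math.pi * k * i / n) for i, x in enumerate(frame))
    return math.atan2(im, re)


def encode(signal):
    """(c) and (d): extract parameters per frame and pick the path by mode."""
    encoded = []
    prev_energy = None
    for frame in extract_frames(signal):
        energy = frame_energy(frame)
        # The first frame has no predecessor, so it is treated as steady-state.
        mode = "steady" if prev_energy is None else classify(energy, prev_energy)
        params = {"gain": math.sqrt(energy)}
        if mode == "transition":
            params["phase"] = bin_phase(frame, k=4)  # k=4 is an arbitrary bin
        encoded.append({"mode": mode, "params": params, "bits": BIT_ALLOCATION[mode]})
        prev_energy = energy
    return encoded
```

Calling `encode` on a list of samples returns one record per frame. Only frames classified as transition frames carry a `phase` entry, and they are paired with the second bit allocation.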
Abstract
A system and method for processing audio and speech signals are disclosed that provide compatibility over a range of communication devices operating at different sampling frequencies and/or bit rates. The analyzer of the system divides the input signal into different portions, at least one of which carries information sufficient to provide intelligible reconstruction of the input signal. The analyzer also encodes separate information about other portions of the signal in an embedded manner, so that a smooth transition can be achieved from low bit-rate to high bit-rate applications. Accordingly, communication devices operating at different sampling rates and/or bit rates can extract corresponding information from the output bit stream of the analyzer. In the present invention, embedded information generally relates to separate parameters of the input signal, or to additional resolution in the transmission of original signal parameters. Non-linear techniques for enhancing the overall performance of the system are also disclosed. Also disclosed is a novel method of improving the quantization of signal parameters. In a specific embodiment, the input signal is processed in two or more modes dependent on the state of the signal in a frame. When the signal is determined to be in a transition state, the encoder provides phase information about N sinusoids, which the decoder uses to improve the quality of the output signal at low bit rates.
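The idea of embedding "additional resolution in the transmission of original signal parameters" can be illustrated with a small Python sketch of an embedded scalar quantizer. This is not the quantization method the patent discloses. The function names, the 4+4 bit split, and the assumption that the parameter is normalized to the range [0, 1) are all hypothetical choices made for illustration. The core bits alone give a coarse reconstruction for a low bit-rate decoder, and a higher bit-rate decoder that also reads the enhancement bits refines the same value.

```python
def encode_embedded(value, core_bits=4, enh_bits=4):
    """Quantize a value in [0, 1) and split the index into two layers.

    Returns (core, enh): the most significant bits form the core layer,
    the remaining bits form the enhancement layer.
    """
    total_bits = core_bits + enh_bits
    levels = 1 << total_bits
    index = min(int(value * levels), levels - 1)
    core = index >> enh_bits
    enh = index & ((1 << enh_bits) - 1)
    return core, enh


def decode_embedded(core, enh=None, core_bits=4, enh_bits=4):
    """Reconstruct from the core layer alone, or from core plus enhancement.

    Each case returns the midpoint of the quantization cell it can resolve.
    """
    if enh is None:
        # Low bit-rate decoder: only the core bits were extracted.
        return (core + 0.5) / (1 << core_bits)
    index = (core << enh_bits) | enh
    return (index + 0.5) / (1 << (core_bits + enh_bits))
```

For example, with `core, enh = encode_embedded(0.3)`, the call `decode_embedded(core)` yields a coarse estimate, while `decode_embedded(core, enh)` yields a finer estimate of the same parameter from the same bit stream.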
5 Claims
1. Independent claim, reproduced in full under First Claim above. Dependent claims: 2, 3, 4, 5.
Specification