Scalable and embedded codec for speech and audio signals

US 9,047,865 B2
Filed: 08/10/2007
Issued: 06/02/2015
Est. Priority Date: 09/23/1998
Status: Expired due to Fees

First Claim

Patent Images

1. A system for processing audio signals comprising:

a scalable embedded audio-speech encoder comprising;

(a) a audio-speech frame extractor for dividing a low bit rate, input audio signal into a plurality of signal frames corresponding to successive time intervals;

(b) a audio-speech frame mode classifier for determining if the low bit-rate signal in a frame is in a steady-state mode or a transition state mode;

(c) a audio-speech processor for extracting parameters of the low-bit rate signal in a frame, received from said frame mode classifier, wherein said extracted parameters include supplemental phase information for transition state mode frames; and

(d) a multi-mode audio-speech coder for processing extracted parameters of frames of the low bit-rate signal in at least two distinct paths, a first path processing a first set of extracted parameters using a first bit allocation when a signal in a frame is determined to be in said steady-state mode, and a second path processing a second set of extracted parameters including supplemental phase information using a second bit allocation when a signal in a frame is determined to be in said transition state mode.

View all claims

2 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A system and method for processing of audio and speech signals is disclosed, which provide compatibility over a range of communication devices operating at different sampling frequencies and/or bit rates. The analyzer of the system divides the input signal in different portions, at least one of which carries information sufficient to provide intelligible reconstruction of the input signal. The analyzer also encodes separate information about other portions of the signal in an embedded manner, so that a smooth transition can be achieved from low bit-rate to high bit-rate applications. Accordingly, communication devices operating at different sampling rates and/or bit-rates can extract corresponding information from the output bit stream of the analyzer. In the present invention embedded information generally relates to separate parameters of the input signal, or to additional resolution in the transmission of original signal parameters. Non-linear techniques for enhancing the overall performance of the system are also disclosed. Also disclosed is a novel method of improving the quantization of signal parameters. In a specific embodiment the input signal is processed in two or more modes dependent on the state of the signal in a frame. When the signal is determined to be in a transition state, the encoder provides phase information about N sinusoids, which the decoder end uses to improve the quality of the output signal at low bit rates.

Citations

5 Claims

1. A system for processing audio signals comprising:
- a scalable embedded audio-speech encoder comprising;
  
  (a) a audio-speech frame extractor for dividing a low bit rate, input audio signal into a plurality of signal frames corresponding to successive time intervals;
  
  (b) a audio-speech frame mode classifier for determining if the low bit-rate signal in a frame is in a steady-state mode or a transition state mode;
  
  (c) a audio-speech processor for extracting parameters of the low-bit rate signal in a frame, received from said frame mode classifier, wherein said extracted parameters include supplemental phase information for transition state mode frames; and
  
  (d) a multi-mode audio-speech coder for processing extracted parameters of frames of the low bit-rate signal in at least two distinct paths, a first path processing a first set of extracted parameters using a first bit allocation when a signal in a frame is determined to be in said steady-state mode, and a second path processing a second set of extracted parameters including supplemental phase information using a second bit allocation when a signal in a frame is determined to be in said transition state mode.
- View Dependent Claims (2, 3, 4, 5)
- - 2. The system of claim 1 wherein said first set of extracted parameters comprise gain, pitch, voicing and Line Spectrum Pair (LSP) parameters.
  - 3. The system of claim 1 wherein said frame mode classifier receives input from said processor for extracting parameters and outputs at least one state flag upon a determination of a frame being in said transition state mode.
  - 4. The system of claim 3 wherein the multi-mode coder selects one of said at least two distinct processing paths on the basis of said at least one state flag.
  - 5. The system of claim 1 further comprising a decoder for decoding signals in at least two distinct processing paths.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Alcatel-Lucent SA (Nokia Corporation)
Original Assignee
Alcatel-Lucent SA (Nokia Corporation)
Inventors
Aguilar, Joseph Gerard, Campana, David A., Dunn, Robert B., McAulay, Robert J., Sun, Xiaoquin, Wang, Wei, Watkins, Craig, Zopf, Robert W., Chen, Juin-Hwey
Primary Examiner(s)
Opsasnick, Michael N

Application Number

US11/889,332
Publication Number

US 20080052068A1
Time in Patent Office

2,853 Days
Field of Search

704/219
US Class Current

1/1
CPC Class Codes

G10L 19/002   Dynamic bit allocation for ...

G10L 19/093   using sinusoidal excitation...

G10L 19/24   Variable rate codecs, e.g. ...

Scalable and embedded codec for speech and audio signals

First Claim

2 Assignments

0 Petitions

Accused Products

Abstract

Citations

5 Claims

Specification

Solutions

Use Cases

Quick Links

Scalable and embedded codec for speech and audio signals

First Claim

2 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

5 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links