Robust real-time speech codec
First Claim
1. In an audio processing tool, a method comprising:
- processing plural frames for an audio signal, wherein the plural frames include a mix of one or more intra frames and one or more predicted frames, wherein at least one of the one or more predicted frames uses long-term prediction from outside of the predicted frame, and wherein each of the one or more intra frames uses no long-term prediction from outside of the intra frame; and
outputting a result.
2 Assignments
0 Petitions
Accused Products
Abstract
Various strategies for rate/quality control and loss resiliency in an audio codec are described. The various strategies can be used in combination or independently. For example, a real-time speech codec uses intra frame coding/decoding, adaptive multi-mode forward error correction [“FEC”], and rate/quality control techniques. Intra frames help a decoder recover quickly from packet losses, while compression efficiency is still emphasized with predicted frames. Various strategies for inserting intra frames and signaling intra/predicted frames are described. With the adaptive multi-mode FEC, an encoder adaptively selects between multiple modes to efficiently and quickly provide a level of FEC that takes into account the bandwidth currently available for FEC. The FEC information itself may be predictively encoded and decoded relative to primary encoded information. Various rate/quality and FEC control strategies allow additional adaptation to available bandwidth and network conditions.
169 Citations
70 Claims
-
1. In an audio processing tool, a method comprising:
-
processing plural frames for an audio signal, wherein the plural frames include a mix of one or more intra frames and one or more predicted frames, wherein at least one of the one or more predicted frames uses long-term prediction from outside of the predicted frame, and wherein each of the one or more intra frames uses no long-term prediction from outside of the intra frame; and
outputting a result. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. In an audio encoder, a method of encoding plural frames for an audio signal, the method comprising:
-
encoding plural predicted frames of the plural frames; and
encoding plural intra frames of the plural frames, wherein the encoder sets intra frame usage and inserts the plural intra frames among the plural predicted frames according to the intra frame usage. - View Dependent Claims (18, 19, 20, 21, 22)
-
-
23. In an audio decoder, a method comprising:
-
decoding plural frames for an audio signal, wherein the plural frames include one or more intra frames and one or more predicted frames, and wherein frame-level type signaling information differentiates the one or more intra frames from the one or more predicted frames in a bitstream; and
outputting decoded information. - View Dependent Claims (24, 25, 26, 27)
-
-
28. In a speech processing tool, a method comprising:
-
processing a frame for a speech signal, including processing primary encoded information for the frame and one or more versions of forward error correction information for the frame, wherein the primary encoded information comprises plural parameter values, and wherein each of the one or more versions of forward error correction information comprises a subset of the plural parameter values selected based at least in part on an estimate of extra available bits; and
outputting a result. - View Dependent Claims (29, 30, 31, 32, 33, 34)
-
-
35. In a speech processing tool, a method comprising:
-
processing a frame for a speech signal, including processing primary encoded information for the frame and plural versions of forward error correction information for the frame, wherein the primary encoded information comprises plural parameter values, and wherein each of the plural versions of forward error correction information comprises a different subset of the plural parameter values for the frame; and
outputting a result. - View Dependent Claims (36, 37, 38, 39)
-
-
40. In an audio processing tool, a method comprising:
-
processing encoded information for an audio signal, wherein the encoded information includes forward error correction information for a first frame and primary encoded information for a second frame, and wherein at least some of the forward error correction information for the first frame is predictively encoded relative to the primary encoded information for the second frame; and
outputting a result. - View Dependent Claims (41, 42, 43, 44, 45, 46, 47, 48)
-
-
49. In a real-time speech encoder that uses linear prediction, a method comprising:
-
encoding a speech signal as plural linear prediction parameters, including adjusting bitrate and quality for a current frame of the speech signal based at least in part on (a) complexity of the current frame, (b) complexity and/or rate of at least some surrounding segments of the speech signal, (c) desired operating rate, (d) currently available network bandwidth, and (e) current network congestion or noise conditions or decoder feedback; and
outputting encoded speech. - View Dependent Claims (50, 51)
-
-
52. In an encoder-side audio processing tool, a method of encoding one or more frames of an audio signal, the method comprising:
-
estimating a number of extra available bits for a segment of the audio signal after basic encoding; and
using at least some of the extra available bits for adaptive forward error correction. - View Dependent Claims (53, 54, 55, 56, 57, 58, 59)
-
-
60. In a real-time speech encoder that uses linear prediction, a method comprising:
-
encoding a speech signal as plural linear prediction parameters, including adjusting bitrate and quality for a current segment of the speech signal based at least in part on a quality smoothness criteria for a transition between a previous segment and the current segment; and
outputting encoded speech. - View Dependent Claims (61, 62, 63)
-
-
64. In an audio processing tool, a method comprising:
-
processing a frame for an audio signal, including processing first information that represents the frame as a predicted frame or intra frame, and further including processing second information that represents the frame as an intra frame; and
outputting a result. - View Dependent Claims (65, 66, 67, 68, 69, 70)
-
Specification