Method for interoperation between adaptive multi-rate wideband (AMR-WB) and multi-mode variable bit-rate wideband (VMR-WB) codecs
First Claim
1. An interworking apparatus, comprising a unit operable with a source-controlled Variable bit-rate Multi-mode WideBand (VMR-WB) codec providing a mode of operation that is interoperable with an Adaptive Multi-Rate wideband (AMR-WB) codec, where in a VMR-WB encoding/AMR-WB decoding case, speech frames are encoded in an AMR-WB interoperable mode of a VMR-WB encoder using one of bit rates corresponding to Interoperable-Full Rate (I-FR) for active speech frames, Interoperable-Half Rate (I-FIR) at least for dim-and-burst signaling, Quarter Rate-Comfort Noise Generator (CNG-QR) to encode at least relevant background noise frames and Eighth Rate-Comfort Noise Generator (CNG-ER) frames for background noise frames not encoded as CNG-QR frames, said interworking apparatus operable such that,invalid frames are transmitted to an AMR-WB decoder as erased frames;
- I-FR frames are transmitted to the AMR-WB decoder as 12.65, 8.85 or 6.60 kbps AMR-WB frames depending on the I-FR type;
CNG-QR frames are transmitted to the AMR-WB decoder as Silence Descriptor Update (SID_UPDATE) frames;
CNG-ER frames are transmitted to the AMR-WB decoder as NO_DATA frames; and
I-HR frames are translated to 12.65, 8.85, or 6.60 kbps frames, depending on the frame type, by generating missing algebraic codebook indices, where bits indicating the I-HR type are discarded.
2 Assignments
0 Petitions
Accused Products
Abstract
A source-controlled Variable bit-rate Multi-mode WideBand (VMR-WB) codec, having a mode of operation that is interoperable with the Adaptive Multi-Rate wideband (AMR-WB) codec, the codec comprising: at least one Interoperable full-rate (I-FR) mode, having a first bit allocation structure based on one of a AMR-WB codec coding types; and at least one comfort noise generator (CNG) coding type for encoding inactive speech frame having a second bit allocation structure based on AMR-WB SID_UPDATE coding type. Methods for i) digitally encoding a sound using a source-controlled Variable bit rate multi-mode wideband (VMR-WB) codec for interoperation with an adaptative multi-rate wideband (AMR-WB) codec, ii) translating a Variable bit rate multi-mode wideband (VMR-WB) codecsignal frame into an Adaptive Multi-Rate wideband (AMR-WB) signal frame, iii) translating an Adaptive Multi-Rate wideband (AMR-WB) signal frame into a Variable bit rate multi-mode wideband (VMR-WB) signal frame, and iv) translating an Adaptive Multi-Rate wideband (AMR-WB) signal frame into a Variable bit rate multi-mode wideband (VMR-WB) signal frame are also provided.
-
Citations
63 Claims
-
1. An interworking apparatus, comprising a unit operable with a source-controlled Variable bit-rate Multi-mode WideBand (VMR-WB) codec providing a mode of operation that is interoperable with an Adaptive Multi-Rate wideband (AMR-WB) codec, where in a VMR-WB encoding/AMR-WB decoding case, speech frames are encoded in an AMR-WB interoperable mode of a VMR-WB encoder using one of bit rates corresponding to Interoperable-Full Rate (I-FR) for active speech frames, Interoperable-Half Rate (I-FIR) at least for dim-and-burst signaling, Quarter Rate-Comfort Noise Generator (CNG-QR) to encode at least relevant background noise frames and Eighth Rate-Comfort Noise Generator (CNG-ER) frames for background noise frames not encoded as CNG-QR frames, said interworking apparatus operable such that,
invalid frames are transmitted to an AMR-WB decoder as erased frames; -
I-FR frames are transmitted to the AMR-WB decoder as 12.65, 8.85 or 6.60 kbps AMR-WB frames depending on the I-FR type; CNG-QR frames are transmitted to the AMR-WB decoder as Silence Descriptor Update (SID_UPDATE) frames; CNG-ER frames are transmitted to the AMR-WB decoder as NO_DATA frames; and I-HR frames are translated to 12.65, 8.85, or 6.60 kbps frames, depending on the frame type, by generating missing algebraic codebook indices, where bits indicating the I-HR type are discarded.
-
-
2. A method for encoding a speech signal according to a first speech coding scheme so that it can be decoded according to a second speech coding scheme, the speech signal comprising active speech periods during which there is active speech and inactive speech periods during which there is no active speech, the first speech coding scheme having a first set of available coding modes, each of said first set of coding modes having an associated encoding bit-rate, the second speech coding scheme having a second set of available coding modes including a discontinuous transmission coding mode in which silence descriptor frames are generated during inactive speech periods, the method comprising:
-
receiving an input speech signal for encoding according to the first speech coding scheme; applying a speech frame derived from the input speech signal to a voice activity detection function to determine whether the speech frame is an active speech frame containing active speech or an inactive speech frame that does not contain active speech; when it is determined that the input speech frame is an inactive speech frame, performing a determination operation according to a predetermined rule to specify whether, according to the second speech coding scheme, the inactive speech frame is to be encoded as a silence descriptor frame; and when it is determined that the input speech frame is to be encoded as a silence descriptor frame, encoding the input speech frame using a first predetermined encoding mode selected from said first set of available encoding modes that has an encoding bit-rate sufficiently high to allow encoding of the input speech frame with a number of bits compatible with a silence descriptor frame according to the second speech coding scheme; when it is determined that the input speech frame is not to be encoded as a silence descriptor frame, encoding the input speech frame using a second predetermined encoding mode selected from said first set of encoding modes. - View Dependent Claims (3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21)
-
-
22. An apparatus for encoding a speech signal according to a first speech coding scheme so that it can be decoded according to a second speech coding scheme, the speech signal comprising active speech periods during which there is active speech and inactive speech periods during which there is no active speech, the first speech coding scheme having a first set of available coding modes, each of said first set of coding modes having an associated encoding bit-rate, the second speech coding scheme having a second set of available coding modes including a discontinuous transmission coding mode in which silence descriptor frames are generated during inactive speech periods, the apparatus comprising:
-
an input for receiving a speech signal for encoding according to the first speech coding scheme; a voice activity detector for determining whether a speech frame derived from said speech signal can be classified as an active speech frame containing active speech or an inactive speech frame that does not contain active speech; an inactive speech frame processing unit operable to perform a determination operation on a speech frame classified as inactive according to a predetermined rule to specify whether, according to the second speech coding scheme, the inactive speech frame is to be encoded as a silence descriptor frame; and an encoding unit responsive to the determination operation performed by said inactive frame processing unit, operable to encode the input speech frame using a first predetermined encoding mode selected from said first set of available encoding modes when it is determined that the input speech frame is to be encoded as a silence descriptor frame, said first predetermined encoding mode having an encoding bit-rate sufficiently high to allow encoding of the input speech frame with a number of bits compatible with a silence descriptor frame according to the second speech coding scheme and operable to encode the input speech frame using a second predetermined encoding mode selected from said first set of encoding modes when it is determined that the input speech frame is not to be encoded as a silence descriptor frame. - View Dependent Claims (23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40)
-
-
41. A circuit comprising:
-
an input for receiving a speech signal for encoding according to a first speech coding scheme for decoding according to a second speech coding scheme, the speech signal comprising active speech periods during which there is active speech and inactive speech periods during which there is no active speech, the first speech coding scheme having a first set of available coding modes, each of said first set of coding modes having an associated encoding bit-rate, the second speech coding scheme having a second set of available coding modes including a discontinuous transmission coding mode in which silence descriptor frames are generated during inactive speech periods; a voice activity detector for determining whether a speech frame derived from said speech signal can be classified as an active speech frame containing active speech or an inactive speech frame that does not contain active speech; an inactive speech frame processing unit operable to perform a determination operation on a speech frame classified as inactive according to a predetermined rule to specify whether, according to the second speech coding scheme, the inactive speech frame is to be encoded as a silence descriptor frame; and an encoding unit responsive to the determination operation performed by said inactive frame processing unit, operable to encode the input speech frame using a first predetermined encoding mode selected from said first set of available encoding modes when it is determined that the input speech frame is to be encoded as a silence descriptor frame, said first predetermined encoding mode having an encoding bit-rate sufficiently high to allow encoding of the input speech frame with a number of bits compatible with a silence descriptor frame according to the second speech coding scheme and operable to encode the input speech frame using a second predetermined encoding mode selected from said first set of encoding modes when it is determined that the input speech frame is not to be encoded as a silence descriptor frame. - View Dependent Claims (42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59)
-
-
60. An apparatus comprising:
-
means for inputting a speech signal to encode the speech signal according to a first speech coding scheme for decoding according to a second speech coding scheme, wherein the speech signal comprises active speech periods during which there is active speech and inactive speech periods during which there is no active speech, the first speech coding scheme having a first set of available coding modes, each of said first set of coding modes having an associated encoding bit-rate, the second speech coding scheme having a second set of available coding modes including a discontinuous transmission coding mode in which silence descriptor frames are generated during inactive speech periods; means for detecting voice activity in a speech frame derived from the input speech signal to determine whether the speech frame is an active speech frame containing active speech or an inactive speech frame that does not contain active speech; means to perform a determination operation according to a predetermined rule when it is determined that the input speech is an inactive speech frame, to determine if, according to the second speech coding scheme, the inactive speech frame is to be encoded as a silence descriptor frame; and means for encoding the input speech frame, when it is determined that the input speech frame is to be encoded as a silence descriptor frame, using a first predetermined encoding mode selected from said first set of available encoding modes that has an encoding bit-rate sufficiently high to allow encoding of the input speech frame with a number of bits compatible with a silence descriptor frame according to the second speech coding scheme; and means for encoding the input speech frame, when it is determined that the input speech frame is not to be encoded as a silence descriptor frame, using a second predetermined encoding mode selected from said first set of encoding modes. - View Dependent Claims (61, 62, 63)
-
Specification