Method for interoperation between adaptive multi-rate wideband (AMR-WB) and multi-mode variable bit-rate wideband (VMR-WB) codecs

US 7,203,638 B2
Filed: 01/19/2005
Issued: 04/10/2007
Est. Priority Date: 10/11/2002
Status: Active Grant

Abstract

A source-controlled Variable bit-rate Multi-mode WideBand (VMR-WB) codec, having a mode of operation that is interoperable with the Adaptive Multi-Rate wideband (AMR-WB) codec, the codec comprising: at least one Interoperable full-rate (I-FR) mode, having a first bit allocation structure based on one of the AMR-WB codec coding types; and at least one comfort noise generator (CNG) coding type for encoding inactive speech frames, having a second bit allocation structure based on the AMR-WB SID_UPDATE coding type. Methods for i) digitally encoding a sound using a source-controlled Variable bit rate multi-mode wideband (VMR-WB) codec for interoperation with an adaptive multi-rate wideband (AMR-WB) codec, ii) translating a Variable bit rate multi-mode wideband (VMR-WB) codec signal frame into an Adaptive Multi-Rate wideband (AMR-WB) signal frame, iii) translating an Adaptive Multi-Rate wideband (AMR-WB) signal frame into a Variable bit rate multi-mode wideband (VMR-WB) signal frame, and iv) translating an Adaptive Multi-Rate wideband (AMR-WB) signal frame into a Variable bit rate multi-mode wideband (VMR-WB) signal frame are also provided.

63 Claims

1. An interworking apparatus, comprising a unit operable with a source-controlled Variable bit-rate Multi-mode WideBand (VMR-WB) codec providing a mode of operation that is interoperable with an Adaptive Multi-Rate wideband (AMR-WB) codec, where in a VMR-WB encoding/AMR-WB decoding case, speech frames are encoded in an AMR-WB interoperable mode of a VMR-WB encoder using one of bit rates corresponding to Interoperable-Full Rate (I-FR) for active speech frames, Interoperable-Half Rate (I-HR) at least for dim-and-burst signaling, Quarter Rate-Comfort Noise Generator (CNG-QR) to encode at least relevant background noise frames and Eighth Rate-Comfort Noise Generator (CNG-ER) frames for background noise frames not encoded as CNG-QR frames, said interworking apparatus operable such that:
- invalid frames are transmitted to an AMR-WB decoder as erased frames;
  
  I-FR frames are transmitted to the AMR-WB decoder as 12.65, 8.85 or 6.60 kbps AMR-WB frames depending on the I-FR type;
  
  CNG-QR frames are transmitted to the AMR-WB decoder as Silence Descriptor Update (SID_UPDATE) frames;
  
  CNG-ER frames are transmitted to the AMR-WB decoder as NO_DATA frames; and
  
  I-HR frames are translated to 12.65, 8.85, or 6.60 kbps frames, depending on the frame type, by generating missing algebraic codebook indices, where bits indicating the I-HR type are discarded.
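
The frame routing recited in claim 1 can be pictured as a small lookup. The following is a minimal sketch only, not the patented implementation: the enum `VmrFrame`, the function `to_amr_wb_frame_type`, and the returned string labels are hypothetical names introduced for illustration, and the step of generating the missing algebraic codebook indices for I-HR frames is left out.

```python
from enum import Enum
from typing import Optional


class VmrFrame(Enum):
    """Hypothetical labels for the VMR-WB interoperable-mode frame kinds in claim 1."""
    INVALID = "invalid"
    I_FR = "I-FR"
    I_HR = "I-HR"
    CNG_QR = "CNG-QR"
    CNG_ER = "CNG-ER"


# AMR-WB speech rates named in claim 1, in kbps.
ALLOWED_RATES_KBPS = (12.65, 8.85, 6.60)


def to_amr_wb_frame_type(frame: VmrFrame, rate_kbps: Optional[float] = None) -> str:
    """Return an illustrative label for the frame handed to the AMR-WB decoder."""
    if frame is VmrFrame.INVALID:
        return "ERASED"
    if frame is VmrFrame.CNG_QR:
        return "SID_UPDATE"
    if frame is VmrFrame.CNG_ER:
        return "NO_DATA"
    # I-FR and I-HR both end up as AMR-WB speech frames. For I-HR the gateway
    # would also have to generate the missing algebraic codebook indices and
    # drop the I-HR type bits; that step is not modelled in this sketch.
    if rate_kbps not in ALLOWED_RATES_KBPS:
        raise ValueError("rate must be 12.65, 8.85 or 6.60 kbps")
    return f"SPEECH_{rate_kbps:.2f}"
```

For example, `to_amr_wb_frame_type(VmrFrame.CNG_ER)` yields the `NO_DATA` label, while an I-FR frame of the 12.65 kbps type yields `SPEECH_12.65`.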

2. A method for encoding a speech signal according to a first speech coding scheme so that it can be decoded according to a second speech coding scheme, the speech signal comprising active speech periods during which there is active speech and inactive speech periods during which there is no active speech, the first speech coding scheme having a first set of available coding modes, each of said first set of coding modes having an associated encoding bit-rate, the second speech coding scheme having a second set of available coding modes including a discontinuous transmission coding mode in which silence descriptor frames are generated during inactive speech periods, the method comprising:
- receiving an input speech signal for encoding according to the first speech coding scheme;
  
  applying a speech frame derived from the input speech signal to a voice activity detection function to determine whether the speech frame is an active speech frame containing active speech or an inactive speech frame that does not contain active speech;
  
  when it is determined that the input speech frame is an inactive speech frame, performing a determination operation according to a predetermined rule to specify whether, according to the second speech coding scheme, the inactive speech frame is to be encoded as a silence descriptor frame; and
  
  when it is determined that the input speech frame is to be encoded as a silence descriptor frame, encoding the input speech frame using a first predetermined encoding mode selected from said first set of available encoding modes that has an encoding bit-rate sufficiently high to allow encoding of the input speech frame with a number of bits compatible with a silence descriptor frame according to the second speech coding scheme;
  
  when it is determined that the input speech frame is not to be encoded as a silence descriptor frame, encoding the input speech frame using a second predetermined encoding mode selected from said first set of encoding modes.
- View Dependent Claims (3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21)
- - 3. A method according to claim 2, wherein said second predetermined encoding mode is used to encode inactive speech frames according to the first speech coding scheme.
  - 4. A method according to claim 2, wherein the first speech coding scheme comprises at least a quarter-rate encoding mode and an eighth-rate encoding mode, the quarter-rate encoding mode arranged to produce quarter-rate encoded speech frames having a certain first predetermined number of bits greater than the number of bits used to represent a silence descriptor frame in said second speech encoding scheme, the eighth-rate encoding mode arranged to produce eighth-rate encoded speech frames having a certain second predetermined number of bits less than the number of bits used to represent a silence descriptor frame in said second speech coding scheme, and when it is determined that the input speech frame is to be encoded as a silence descriptor frame, the input speech frame is encoded with a number of bits compatible with a silence descriptor frame according to the second speech coding scheme and is transmitted as a quarter-rate encoded speech frame.
  - 5. A method according to claim 2, wherein the first speech coding scheme comprises a full-rate encoding mode arranged to produce full-rate encoded speech frames comprising a first number of bits, a half-rate encoding mode arranged to produce half-rate encoded speech frames having a second number of bits less than said first number of bits, a quarter-rate encoding mode arranged to produce quarter-rate encoded speech frames with a third number of bits less than said second number of bits and an eighth-rate encoding mode arranged to produce eighth-rate encoded speech frames with a fourth number of bits less than said third number of bits, the third number of bits being greater than the number of bits used to represent a silence descriptor frame in said second speech encoding scheme, the fourth number of bits being less than the number of bits used to represent a silence descriptor frame according to said second speech coding scheme, and when it is determined that the input speech frame is to be encoded as a silence descriptor frame, the input speech frame is encoded with a number of bits compatible with a silence descriptor frame of the second speech coding scheme and is transmitted as a quarter-rate encoded speech frame.
  - 6. A method according to claim 3, wherein when it is determined that the inactive speech frame is not to be encoded as a silence descriptor frame, the input speech frame is encoded using said eighth-rate encoding mode.
  - 7. A method according to claim 2, wherein the first speech coding scheme is conformed to CDMA rate set 2.
  - 8. A method according to claim 2, wherein the first speech coding scheme is conformed to CDMA rate set 1.
  - 9. A method according to claim 2, wherein the first speech coding scheme is defined according to a VMR-WB speech coding standard and the second speech coding scheme is defined according to an AMR-WB speech coding standard.
  - 10. A method according to claim 4, wherein said first predetermined number of bits is 54 and said second predetermined number of bits is 20.
  - 11. A method according to claim 5, wherein said first number of bits is 266, said second number of bits is 124, said third number of bits is 54 and said fourth number of bits is 20.
  - 12. A method according to claim 10, wherein said first predetermined number of bits corresponds to a bit-rate of 2.7 kbits/s and said second predetermined number of bits corresponds to a bit-rate of 1.0 kbits/s.
  - 13. A method according to claim 5, wherein said first number of bits corresponds to a bit-rate of 13.3 kbits/s, said second number of bits corresponds to a bit-rate of 6.2 kbits/s, said third number of bits corresponds to a bit-rate of 2.7 kbits/s and said fourth number of bits corresponds to a bit-rate of 1.0 kbits/s.
  - 14. A method according to claim 10, wherein when it is determined that the input speech frame is to be encoded as a silence descriptor frame, the input speech frame is encoded with 35 bits, leaving 19 bits of said quarter-rate encoded speech frame unused.
  - 15. A method according to claim 4, wherein the number of bits used to represent a silence descriptor frame according to the second speech coding scheme corresponds to 1.75 kbits/s.
  - 16. A method according to claim 2, wherein when consecutive input speech frames following an active speech period are determined to be inactive speech frames, thereby forming a sequence of inactive speech frames, said predetermined rule specifies that the first inactive speech frame of said sequence, the fourth inactive speech frame and thereafter every eighth inactive speech frame of said sequence is to be encoded as a silence descriptor frame.
  - 17. A method according to claim 2, wherein when consecutive input speech frames following an active speech period are determined to be inactive speech frames, thereby forming a sequence of inactive speech frames, said predetermined rule specifies that a) the first inactive speech frame of said sequence is to be encoded as a silence descriptor frame, b) the next two inactive speech frames of said sequence are to be encoded using said second predetermined encoding mode, c) the fourth inactive speech frame of said sequence is to be encoded as a silence descriptor frame, d) the next seven inactive speech frames are to be encoded using said second predetermined encoding mode and the following inactive speech frame is to be encoded as a silence descriptor frame and step d) is to be repeated until an active speech frame is detected.
  - 18. A method according to claim 2, wherein when consecutive input speech frames following an active speech period are determined to be inactive speech frames, thereby forming a sequence of inactive speech frames, said predetermined rule specifies that the first inactive speech frame of said sequence is to be encoded as a silence descriptor frame and thereafter every eighth inactive speech frame of said sequence is to be encoded as a silence descriptor frame.
  - 19. A method according to claim 2, wherein when consecutive input speech frames are determined to be inactive speech frames, thereby forming a sequence of inactive speech frames, said predetermined rule specifies that a) the first inactive speech frame of said sequence is to be encoded as a silence descriptor frame, b) the next k inactive speech frames of said sequence are to be encoded using said second predetermined encoding mode and the following inactive speech frame is to be encoded as a silence descriptor frame and step b) is to be repeated until an active speech frame is detected.
  - 20. A method according to claim 19, wherein k is equal to 7.
  - 21. A method according to claim 2, wherein when consecutive input speech frames following an active speech period are determined to be inactive speech frames, thereby forming a sequence of inactive speech frames, said predetermined rule specifies that an inactive speech frame is encoded as a silence descriptor frame when noise characteristics change.
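
The two fixed schedules for silence descriptor frames in the dependent claims above can be written as predicates on the position of a frame within a run of inactive frames. This is a hedged sketch: the function names and the 1-based index `n` are assumptions made for illustration, and the noise-change trigger of claim 21 is not covered.

```python
def is_sid_first_fourth_then_eighth(n: int) -> bool:
    """Schedule of claims 16 and 17: frames 1, 4, 12, 20, ... carry a SID.

    n is the 1-based position of the frame within the current run of
    inactive frames (an indexing convention assumed for this sketch).
    """
    if n < 1:
        raise ValueError("n is 1-based")
    # Frame 1, then frame 4, then one SID after every seven non-SID frames.
    return n == 1 or (n >= 4 and (n - 4) % 8 == 0)


def is_sid_periodic(n: int, k: int = 7) -> bool:
    """Schedule of claims 18 to 20: frame 1, then k non-SID frames, then a SID, repeated."""
    if n < 1:
        raise ValueError("n is 1-based")
    return (n - 1) % (k + 1) == 0


# SID positions within the first 29 inactive frames under each rule.
first_rule = [n for n in range(1, 30) if is_sid_first_fourth_then_eighth(n)]
second_rule = [n for n in range(1, 30) if is_sid_periodic(n)]
```

Under these assumptions `first_rule` is `[1, 4, 12, 20, 28]` and `second_rule` is `[1, 9, 17, 25]`, which matches the step-by-step wording of claims 17 and 19 with k equal to 7.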

22. An apparatus for encoding a speech signal according to a first speech coding scheme so that it can be decoded according to a second speech coding scheme, the speech signal comprising active speech periods during which there is active speech and inactive speech periods during which there is no active speech, the first speech coding scheme having a first set of available coding modes, each of said first set of coding modes having an associated encoding bit-rate, the second speech coding scheme having a second set of available coding modes including a discontinuous transmission coding mode in which silence descriptor frames are generated during inactive speech periods, the apparatus comprising:
- an input for receiving a speech signal for encoding according to the first speech coding scheme;
  
  a voice activity detector for determining whether a speech frame derived from said speech signal can be classified as an active speech frame containing active speech or an inactive speech frame that does not contain active speech;
  
  an inactive speech frame processing unit operable to perform a determination operation on a speech frame classified as inactive according to a predetermined rule to specify whether, according to the second speech coding scheme, the inactive speech frame is to be encoded as a silence descriptor frame; and
  
  an encoding unit responsive to the determination operation performed by said inactive frame processing unit, operable to encode the input speech frame using a first predetermined encoding mode selected from said first set of available encoding modes when it is determined that the input speech frame is to be encoded as a silence descriptor frame, said first predetermined encoding mode having an encoding bit-rate sufficiently high to allow encoding of the input speech frame with a number of bits compatible with a silence descriptor frame according to the second speech coding scheme and operable to encode the input speech frame using a second predetermined encoding mode selected from said first set of encoding modes when it is determined that the input speech frame is not to be encoded as a silence descriptor frame.
- View Dependent Claims (23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40)
- - 23. An apparatus according to claim 22, wherein the first speech coding scheme comprises at least a quarter-rate encoding mode and an eighth-rate encoding mode, the quarter-rate encoding mode arranged to produce quarter-rate encoded speech frames having a certain first predetermined number of bits greater than the number of bits used to represent a silence descriptor frame in said second speech encoding scheme, the eighth-rate encoding mode arranged to produce eighth-rate encoded speech frames having a certain second predetermined number of bits less than the number of bits used to represent a silence descriptor frame in said second speech coding scheme, and the encoding unit is arranged to encode the input speech frame with a number of bits compatible with a silence descriptor frame according to the second speech coding scheme within a quarter-rate encoded speech frame when the inactive speech frame processing unit determines that the input speech frame is to be encoded as a silence descriptor frame.
  - 24. An apparatus according to claim 22, wherein the first speech coding scheme comprises a full-rate encoding mode arranged to produce full-rate encoded speech frames comprising a first number of bits, a half-rate encoding mode arranged to produce half-rate encoded speech frames having a second number of bits less than said first number of bits, a quarter-rate encoding mode arranged to produce quarter-rate encoded speech frames with a third number of bits less than said second number of bits and an eighth-rate encoding mode arranged to produce eighth-rate encoded speech frames with a fourth number of bits less than said third number of bits, the third number of bits being greater than the number of bits used to represent a silence descriptor frame in said second speech encoding scheme, the fourth number of bits being less than the number of bits used to represent a silence descriptor frame according to said second speech coding scheme, and the encoding unit is arranged to encode the input speech frame with a number of bits compatible with a silence descriptor frame according to the second speech coding scheme within a quarter-rate encoded speech frame when the inactive speech frame processing unit determines that the input speech frame is to be encoded as a silence descriptor frame.
  - 25. An apparatus according to claim 23, wherein the encoding unit is arranged to encode the input speech frame using said eighth-rate encoding mode when the inactive speech frame processing unit determines that the input speech frame is not to be encoded as a silence descriptor frame.
  - 26. An apparatus according to claim 22, arranged to operate according to CDMA rate set 2.
  - 27. An apparatus according to claim 22, arranged to operate according to CDMA rate set 1.
  - 28. An apparatus according to claim 22, wherein the first speech coding scheme is defined according to a VMR-WB speech coding standard and the apparatus is arranged to enable interoperation with a second speech coding scheme defined according to the AMR-WB speech coding standard.
  - 29. An apparatus according to claim 23, wherein said first predetermined number of bits is 54 and said second predetermined number of bits is 20.
  - 30. An apparatus according to claim 24, wherein said first number of bits is 266, said second number of bits is 124, said third number of bits is 54 and said fourth number of bits is 20.
  - 31. An apparatus according to claim 23, wherein said first predetermined number of bits corresponds to a bit-rate of 2.7 kbits/s and said second predetermined number of bits corresponds to a bit-rate of 1.0 kbits/s.
  - 32. An apparatus according to claim 24, wherein said first number of bits corresponds to a bit-rate of 13.3 kbits/s, said second number of bits corresponds to a bit-rate of 6.2 kbits/s, said third number of bits corresponds to a bit-rate of 2.7 kbits/s and said fourth number of bits corresponds to a bit-rate of 1.0 kbits/s.
  - 33. An apparatus according to claim 29, wherein when the inactive speech frame processing unit determines that the input speech frame is to be encoded as a silence descriptor frame the encoding unit is arranged to encode the input speech frame with 35 bits, leaving 19 bits of said quarter-rate encoded speech frame unused.
  - 34. An apparatus according to claim 23, wherein the number of bits used to represent a silence descriptor frame according to the second speech coding scheme corresponds to 1.75 kbits/s.
  - 35. An apparatus according to claim 22, wherein when the voice activity detector determines consecutive input speech frames following an active speech period to be inactive speech frames, thereby forming a sequence of inactive speech frames, the inactive speech frame processing unit determines according to said predetermined rule that the first inactive speech frame of said sequence, the fourth inactive speech frame and thereafter every eighth inactive speech frame of said sequence is to be encoded as a silence descriptor frame.
  - 36. An apparatus according to claim 22, wherein when the voice activity detector determines consecutive input speech frames following an active speech period to be inactive speech frames, thereby forming a sequence of inactive speech frames, the inactive speech frame processing unit specifies according to said predetermined rule that a) the first inactive speech frame of said sequence is to be encoded as a silence descriptor frame, b) the next two inactive speech frames of said sequence are to be encoded using said second predetermined encoding mode, c) the fourth inactive speech frame of said sequence is to be encoded as a silence descriptor frame, d) the next seven inactive speech frames are to be encoded using said second predetermined encoding mode and the following inactive speech frame is to be encoded as a silence descriptor frame and step d) is to be repeated until an active speech frame is detected.
  - 37. An apparatus according to claim 22, wherein when the voice activity detector determines consecutive input speech frames following an active speech period to be inactive speech frames, thereby forming a sequence of inactive speech frames, the inactive speech frame processing unit determines according to said predetermined rule that the first inactive speech frame of said sequence is to be encoded as a silence descriptor frame and thereafter every eighth inactive speech frame of said sequence is to be encoded as a silence descriptor frame.
  - 38. An apparatus according to claim 22, wherein when the voice activity detector determines consecutive input speech frames to be inactive speech frames, thereby forming a sequence of inactive speech frames, the inactive speech frame processing unit specifies according to said predetermined rule that a) the first inactive speech frame of said sequence is to be encoded as a silence descriptor frame, b) the next k inactive speech frames of said sequence are to be encoded using said second predetermined encoding mode and the following inactive speech frame is to be encoded as a silence descriptor frame and step b) is to be repeated until an active speech frame is detected.
  - 39. An apparatus according to claim 38, arranged to set k equal to 7.
  - 40. An apparatus according to claim 22, wherein when the voice activity detector determines consecutive input speech frames following an active speech period to be inactive speech frames, thereby forming a sequence of inactive speech frames, the inactive speech frame processing unit determines according to said predetermined rule that an inactive speech frame is encoded as a silence descriptor frame when noise characteristics change.
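
The bit counts and bit rates quoted in the dependent claims above are consistent with one another if each frame covers 20 ms. The frame duration is not stated in the claims and is taken here as an assumption. The helper name `kbps` and the dictionary layout are likewise illustrative.

```python
FRAME_MS = 20  # assumed frame duration in milliseconds (not stated in the claims)


def kbps(bits_per_frame: int, frame_ms: int = FRAME_MS) -> float:
    """Bits per millisecond is numerically equal to kbit/s."""
    return bits_per_frame / frame_ms


# Bits per frame for each rate, as recited in the claims.
BITS = {"full": 266, "half": 124, "quarter": 54, "eighth": 20, "sid": 35}

rates = {name: kbps(bits) for name, bits in BITS.items()}

# A 35-bit silence descriptor carried in a 54-bit quarter-rate frame
# leaves this many bits unused.
unused_bits = BITS["quarter"] - BITS["sid"]
```

With 20 ms frames this gives 13.3, 6.2, 2.7, 1.0 and 1.75 kbits/s respectively, and 19 unused bits in the quarter-rate frame, in line with the figures in the claims.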

41. A circuit comprising:
- an input for receiving a speech signal for encoding according to a first speech coding scheme for decoding according to a second speech coding scheme, the speech signal comprising active speech periods during which there is active speech and inactive speech periods during which there is no active speech, the first speech coding scheme having a first set of available coding modes, each of said first set of coding modes having an associated encoding bit-rate, the second speech coding scheme having a second set of available coding modes including a discontinuous transmission coding mode in which silence descriptor frames are generated during inactive speech periods;
  
  a voice activity detector for determining whether a speech frame derived from said speech signal can be classified as an active speech frame containing active speech or an inactive speech frame that does not contain active speech;
  
  an inactive speech frame processing unit operable to perform a determination operation on a speech frame classified as inactive according to a predetermined rule to specify whether, according to the second speech coding scheme, the inactive speech frame is to be encoded as a silence descriptor frame; and
  
  an encoding unit responsive to the determination operation performed by said inactive frame processing unit, operable to encode the input speech frame using a first predetermined encoding mode selected from said first set of available encoding modes when it is determined that the input speech frame is to be encoded as a silence descriptor frame, said first predetermined encoding mode having an encoding bit-rate sufficiently high to allow encoding of the input speech frame with a number of bits compatible with a silence descriptor frame according to the second speech coding scheme and operable to encode the input speech frame using a second predetermined encoding mode selected from said first set of encoding modes when it is determined that the input speech frame is not to be encoded as a silence descriptor frame.
- View Dependent Claims (42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59)
- - 42. A circuit according to claim 41, wherein the first speech coding scheme comprises a quarter-rate encoding mode and an eighth-rate encoding mode, wherein the quarter-rate encoding mode is arranged to produce quarter-rate encoded speech frames having a certain first predetermined number of bits greater than the number of bits used to represent a silence descriptor frame in said second speech encoding scheme;
    - wherein the eighth-rate encoding mode is arranged to produce eighth-rate encoded speech frames having a certain second predetermined number of bits less than the number of bits used to represent a silence descriptor frame in said second speech coding scheme; and
      
      wherein the encoding unit is arranged to encode the input speech frame with a number of bits compatible with a silence descriptor frame according to the second speech coding scheme within a quarter-rate encoded speech frame when the inactive speech frame processing unit determines that the input speech frame is to be encoded as a silence descriptor frame.
  - 43. A circuit according to claim 41, wherein the first speech coding scheme comprises a full-rate encoding mode, a half-rate encoding mode, a quarter-rate encoding mode, and an eighth-rate encoding mode, wherein the full-rate encoding mode is arranged to produce full-rate encoded speech frames comprising a first number of bits;
    - wherein the half-rate encoding mode is arranged to produce half-rate encoded speech frames having a second number of bits less than said first number of bits;
      
      wherein the quarter-rate encoding mode is arranged to produce quarter-rate encoded speech frames with a third number of bits less than said second number of bits;
      
      wherein the eighth-rate encoding mode is arranged to produce eighth-rate encoded speech frames with a fourth number of bits less than said third number of bits, the third number of bits being greater than the number of bits used to represent a silence descriptor frame in said second speech encoding scheme, the fourth number of bits being less than the number of bits used to represent a silence descriptor frame according to said second speech coding scheme; and
      
      wherein the encoding unit is arranged to encode the input speech frame with a number of bits compatible with a silence descriptor frame according to the second speech coding scheme within a quarter-rate encoded speech frame when the inactive speech frame processing unit determines that the input speech frame is to be encoded as a silence descriptor frame.
  - 44. A circuit according to claim 42, wherein the encoding unit is arranged to encode the input speech frame using said eighth-rate encoding mode when the inactive speech frame processing unit determines that the input speech frame is not to be encoded as a silence descriptor frame.
  - 45. A circuit according to claim 41, arranged to operate according to CDMA rate set 2.
  - 46. A circuit according to claim 41, arranged to operate according to CDMA rate set 1.
  - 47. A circuit according to claim 41, wherein the first speech coding scheme is defined according to a VMR-WB speech coding standard, and where the second speech coding scheme is defined according to an AMR-WB speech coding standard.
  - 48. A circuit according to claim 42, wherein said first predetermined number of bits is 54 and said second predetermined number of bits is 20.
  - 49. A circuit according to claim 43, wherein said first number of bits is 266, said second number of bits is 124, said third number of bits is 54 and said fourth number of bits is 20.
  - 50. A circuit according to claim 42, wherein said first predetermined number of bits corresponds to a bit-rate of 2.7 kbits/s and said second predetermined number of bits corresponds to a bit-rate of 1.0 kbits/s.
  - 51. A circuit according to claim 43, wherein said first number of bits corresponds to a bit-rate of 13.3 kbits/s, said second number of bits corresponds to a bit-rate of 6.2 kbits/s, said third number of bits corresponds to a bit-rate of 2.7 kbits/s and said fourth number of bits corresponds to a bit-rate of 1.0 kbits/s.
  - 52. A circuit according to claim 48, wherein when the inactive speech frame processing unit determines that the input speech frame is to be encoded as a silence descriptor frame the encoding unit is arranged to encode the input speech frame with 35 bits, leaving 19 bits of said quarter-rate encoded speech frame unused.
  - 53. A circuit according to claim 42, wherein the number of bits used to represent a silence descriptor frame according to the second speech coding scheme corresponds to 1.75 kbits/s.
  - 54. A circuit according to claim 41, wherein when the voice activity detector determines consecutive input speech frames following an active speech period to be inactive speech frames, thereby forming a sequence of inactive speech frames, the inactive speech frame processing unit determines according to said predetermined rule that the first inactive speech frame of said sequence, the fourth inactive speech frame and thereafter every eighth inactive speech frame of said sequence is to be encoded as a silence descriptor frame.
  - 55. A circuit according to claim 41, wherein when the voice activity detector determines consecutive input speech frames following an active speech period to be inactive speech frames, thereby forming a sequence of inactive speech frames, the inactive speech frame processing unit specifies according to said predetermined rule that a) the first inactive speech frame of said sequence is to be encoded as a silence descriptor frame, b) the next two inactive speech frames of said sequence are to be encoded using said second predetermined encoding mode, c) the fourth inactive speech frame of said sequence is to be encoded as a silence descriptor frame, d) the next seven inactive speech frames are to be encoded using said second predetermined encoding mode and the following inactive speech frame is to be encoded as a silence descriptor frame and step d) is to be repeated until an active speech frame is detected.
  - 56. A circuit according to claim 41, wherein when the voice activity detector determines consecutive input speech frames following an active speech period to be inactive speech frames, thereby forming a sequence of inactive speech frames, the inactive speech frame processing unit determines according to said predetermined rule that the first inactive speech frame of said sequence is to be encoded as a silence descriptor frame and thereafter every eighth inactive speech frame of said sequence is to be encoded as a silence descriptor frame.
  - 57. A circuit according to claim 41, wherein when the voice activity detector determines consecutive input speech frames to be inactive speech frames, thereby forming a sequence of inactive speech frames, the inactive speech frame processing unit specifies according to said predetermined rule that a) the first inactive speech frame of said sequence is to be encoded as a silence descriptor frame, b) the next k inactive speech frames of said sequence are to be encoded using said second predetermined encoding mode and the following inactive speech frame is to be encoded as a silence descriptor frame and step b) is to be repeated until an active speech frame is detected.
  - 58. A circuit according to claim 57, arranged to set k equal to 7.
  - 59. A circuit according to claim 41, wherein when the voice activity detector determines consecutive input speech frames following an active speech period to be inactive speech frames, thereby forming a sequence of inactive speech frames, the inactive speech frame processing unit determines according to said predetermined rule that an inactive speech frame is encoded as a silence descriptor frame when noise characteristics change.
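
The scheduling rules recited in claims 54 to 58 can be illustrated with a short sketch. It is not taken from the patent: the function names, the `early_update` flag, and the `"ACTIVE"`, `"SID"` and `"NON_SID"` labels are hypothetical, and frame positions are counted from 1 at the first inactive frame after an active speech period. With `early_update=True` the sketch follows claims 54 and 55 (first frame, fourth frame, then every eighth frame); with `early_update=False` it follows claims 56 to 58 (first frame, then every k+1 frames, k = 7).

```python
from typing import Iterable, List


def is_sid_frame(n: int, k: int = 7, early_update: bool = True) -> bool:
    """Return True if the n-th inactive frame (1-based) is a silence descriptor.

    Hypothetical helper: k is the number of non-SID frames between updates.
    """
    if n < 1:
        raise ValueError("n is a 1-based position in the inactive run")
    if n == 1:
        # The first inactive frame is always a silence descriptor frame.
        return True
    if early_update:
        # Claims 54/55: second descriptor at frame 4, then one every k+1 frames.
        return n >= 4 and (n - 4) % (k + 1) == 0
    # Claims 56 to 58: one descriptor every k+1 frames after the first.
    return (n - 1) % (k + 1) == 0


def schedule(vad_flags: Iterable[bool]) -> List[str]:
    """Label each frame given per-frame voice activity decisions."""
    labels = []
    run = 0  # length of the current run of inactive frames
    for active in vad_flags:
        if active:
            run = 0  # an active frame ends the inactive sequence
            labels.append("ACTIVE")
        else:
            run += 1
            labels.append("SID" if is_sid_frame(run) else "NON_SID")
    return labels
```

For example, `schedule([True, False, False, False, False])` labels the four inactive frames as `SID`, `NON_SID`, `NON_SID`, `SID`. The noise-change trigger of claim 59 is not modelled here.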

60. An apparatus comprising:
- means for inputting a speech signal to encode the speech signal according to a first speech coding scheme for decoding according to a second speech coding scheme, wherein the speech signal comprises active speech periods during which there is active speech and inactive speech periods during which there is no active speech, the first speech coding scheme having a first set of available coding modes, each of said first set of coding modes having an associated encoding bit-rate, the second speech coding scheme having a second set of available coding modes including a discontinuous transmission coding mode in which silence descriptor frames are generated during inactive speech periods;
  
  means for detecting voice activity in a speech frame derived from the input speech signal to determine whether the speech frame is an active speech frame containing active speech or an inactive speech frame that does not contain active speech;
  
  means to perform a determination operation according to a predetermined rule when it is determined that the input speech is an inactive speech frame, to determine if, according to the second speech coding scheme, the inactive speech frame is to be encoded as a silence descriptor frame; and
  
  means for encoding the input speech frame, when it is determined that the input speech frame is to be encoded as a silence descriptor frame, using a first predetermined encoding mode selected from said first set of available encoding modes that has an encoding bit-rate sufficiently high to allow encoding of the input speech frame with a number of bits compatible with a silence descriptor frame according to the second speech coding scheme; and
  
  means for encoding the input speech frame, when it is determined that the input speech frame is not to be encoded as a silence descriptor frame, using a second predetermined encoding mode selected from said first set of encoding modes.
- View Dependent Claims (61, 62, 63)
- - 61. An apparatus according to claim 60, wherein said second predetermined encoding mode is used to encode inactive speech frames according to the first speech coding scheme.
  - 62. An apparatus according to claim 60, wherein the first speech coding scheme comprises a quarter-rate encoding mode and an eighth-rate encoding mode, further comprising:
    - means for producing quarter-rate encoded speech frames having a certain first predetermined number of bits greater than the number of bits used to represent a silence descriptor frame in said second speech encoding scheme;
      
      means for producing eighth-rate encoded speech frames having a certain second predetermined number of bits less than the number of bits used to represent a silence descriptor frame in said second speech coding scheme; and
      
      where said encoding means operates, when it is determined that the input speech frame is to be encoded as a silence descriptor frame, for encoding the speech frame with a number of bits compatible with a silence descriptor frame according to the second speech coding scheme for transmitting as a quarter-rate encoded speech frame.
  - 63. An apparatus according to claim 60, wherein the first speech coding scheme further comprises:
    - a full-rate encoding mode arranged to produce full-rate encoded speech frames comprising a first number of bits;
      
      a half-rate encoding mode arranged to produce half-rate encoded speech frames having a second number of bits less than said first number of bits;
      
      a quarter-rate encoding mode arranged to produce quarter-rate encoded speech frames with a third number of bits less than said second number of bits, the third number of bits being greater than the number of bits used to represent a silence descriptor frame in said second speech encoding scheme;
      
      an eighth-rate encoding mode arranged to produce eighth-rate encoded speech frames with a fourth number of bits less than said third number of bits, and the fourth number of bits being less than the number of bits used to represent a silence descriptor frame according to a second speech coding scheme; and
      
      where said encoding means operates, when it is determined that the input speech frame is to be encoded as a silence descriptor frame, for encoding the speech frame with a number of bits compatible with a silence descriptor frame of the second speech coding scheme within a quarter-rate encoded speech frame for transmitting as a quarter-rate encoded speech frame.
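
The bit-count relationships in claims 62 and 63 can be sketched as a simple mode selection. The numeric payload sizes below are illustrative assumptions and do not come from the claims, which only fix the ordering (full > half > quarter > silence descriptor > eighth). The names `RATE_BITS`, `SID_BITS`, `carrier_for_sid` and `mode_for_inactive_frame` are hypothetical, and treating the eighth-rate mode as the "second predetermined encoding mode" is likewise an assumption made for this example.

```python
from typing import Dict

# Assumed payload sizes in bits per frame, for illustration only.
RATE_BITS: Dict[str, int] = {"FR": 266, "HR": 124, "QR": 54, "ER": 20}
SID_BITS = 35  # assumed size of a silence descriptor frame in the second scheme


def carrier_for_sid(rate_bits: Dict[str, int], sid_bits: int) -> str:
    """Pick the lowest-rate mode whose payload can hold a silence descriptor."""
    fitting = [(bits, name) for name, bits in rate_bits.items() if bits >= sid_bits]
    if not fitting:
        raise ValueError("no encoding mode is large enough for a silence descriptor")
    return min(fitting)[1]


def mode_for_inactive_frame(is_sid: bool) -> str:
    """Choose the encoding mode for an inactive frame."""
    if is_sid:
        # First predetermined mode: large enough to carry the descriptor bits.
        return carrier_for_sid(RATE_BITS, SID_BITS)
    # Second predetermined mode (assumed here to be the lowest available rate).
    return min(RATE_BITS, key=RATE_BITS.get)
```

Under these assumed sizes the descriptor fits in a quarter-rate frame but not in an eighth-rate frame, which matches the ordering recited in claim 63.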

Current Assignee
Nokia Technologies Oy (Nokia Corporation)
Original Assignee
Nokia Corporation
Inventors
Salami, Redwan; Jelinek, Milan
Primary Examiner(s)
HARPER, V PAUL

Application Number

US11/039,540
Publication Number

US 20050267746A1
Time in Patent Office

811 Days
Field of Search

None
US Class Current

704/201
CPC Class Codes

G10L 19/012   Comfort noise or silence co...

G10L 19/173   Transcoding, i.e. convertin...

G10L 19/24   Variable rate codecs, e.g. ...
