Audio coding systems and methods using spectral component coupling and spectral component regeneration

US 20040225505A1
Filed: 05/08/2003
Published: 11/11/2004
Est. Priority Date: 05/08/2003
Status: Active Grant

First Claim

Patent Images

1. A method for encoding one or more input audio signals, wherein the method comprises:

receiving the one or more input audio signals and obtaining therefrom one or more baseband signals and one or more residual signals, wherein spectral components of a baseband signal represent spectral components of a respective input audio signal in a first set of frequency subbands and spectral components in an associated residual signal represent spectral components of the respective input audio signal in a second set of frequency subbands that are not represented by the baseband signal;

obtaining energy measures of at least some spectral components of one or more synthesized signals to be generated during decoding, wherein the one or more synthesized signals have spectral components within the second set of frequency subbands;

obtaining energy measures of at least some spectral components of each residual signal;

calculating scale factors by obtaining square roots of ratios of the energy measures of spectral components in the residual signals to the energy measures of spectral components in the one or more synthesized signals, square roots of ratios of the energy measures of spectral components in the one or more synthesized signals to the energy measures of spectral components in the residual signals, ratios of square roots of the energy measures of spectral components in the residual signals to square roots of the energy measures of spectral components in the one or more synthesized signals, or ratios of square roots of the energy measures of spectral components in the one or more synthesized signals to square roots of the energy measures of spectral components in the residual signals; and

assembling signal information and scaling information into an encoded signal, wherein the signal information represents the spectral components in the one or more baseband signals and the scaling information represents the scale factors.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

An audio encoder discards spectral components of an input signal and uses channel coupling to reduce the information capacity requirements of an encoded signal. Channel coupling represents selected spectral components of multiple channels of signals in a composite form. An audio decoder synthesizes spectral components to replace the discarded spectral components and generates spectral components for individual channel signals from the coupled-channel signal. The encoder provides scale factors in the encoded signal that improve the efficiency of the decoder to generate output signals that substantially preserve the spectral energy of the original input signals.

Citations

71 Claims

1. A method for encoding one or more input audio signals, wherein the method comprises:
- receiving the one or more input audio signals and obtaining therefrom one or more baseband signals and one or more residual signals, wherein spectral components of a baseband signal represent spectral components of a respective input audio signal in a first set of frequency subbands and spectral components in an associated residual signal represent spectral components of the respective input audio signal in a second set of frequency subbands that are not represented by the baseband signal;
  
  obtaining energy measures of at least some spectral components of one or more synthesized signals to be generated during decoding, wherein the one or more synthesized signals have spectral components within the second set of frequency subbands;
  
  obtaining energy measures of at least some spectral components of each residual signal;
  
  calculating scale factors by obtaining square roots of ratios of the energy measures of spectral components in the residual signals to the energy measures of spectral components in the one or more synthesized signals, square roots of ratios of the energy measures of spectral components in the one or more synthesized signals to the energy measures of spectral components in the residual signals, ratios of square roots of the energy measures of spectral components in the residual signals to square roots of the energy measures of spectral components in the one or more synthesized signals, or ratios of square roots of the energy measures of spectral components in the one or more synthesized signals to square roots of the energy measures of spectral components in the residual signals; and
  
  assembling signal information and scaling information into an encoded signal, wherein the signal information represents the spectral components in the one or more baseband signals and the scaling information represents the scale factors.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
- - 2. The method according to claim 1 wherein the one or more synthesized signals are to be generated at least in part by frequency translation of at least some of the spectral components in the one or more baseband signals.
  - 3. The method according to claim 2 wherein the spectral components of synthesized signals are to be generated by frequency translation that maintains phase coherence.
  - 4. The method according to claim 1 wherein the one or more synthesized signals are to be generated at least in part by a combination of a frequency translation of at least some of the spectral components in the one or more baseband signals and a generation of one or more noise-like signals having spectral levels adapted according to spectral levels in the one or more baseband signals, and wherein the energy measures of spectral components in the one or more synthesized signals is obtained without regard to spectral levels in the noise-like signals.
  - 5. The method according to claim 1 wherein the one or more synthesized signals are to be generated at least in part by generation of one or more noise-like signals.
  - 6. The method according to claim 1 wherein the energy measures of spectral components of the residual signals are obtained from values representing magnitudes of the spectral components.
  - 7. The method according to claim 6 that comprises:
    - applying a first analysis filterbank to the one or more input audio signals to obtain the one or more baseband signals and the one or more residual signals; and
      
      applying a second analysis filterbank to the one or more input audio signals to obtain additional spectral components;
      
      wherein the energy measures of spectral components in the residual signals are calculated from the spectral components of the residual signals and one or more of the additional spectral components.
  - 8. The method according to claim 1 wherein the scaling information represents the scale factors normalized with respect to one or more normalizing values, and wherein the scaling information includes a representation of the one or more normalizing values.
  - 9. The method according to claim 8 wherein the one or more normalizing values are selected from a set of values.
  - 10. The method according to claim 8 wherein the one or more normalizing values comprise a maximum allowable value for scale factors.
  - 11. The method according to claim 1 that calculates a scale factor for one or more of the frequency subbands for the respective residual signals.
  - 12. The method according to claim 11 wherein frequency extents of one or more of the sets of frequency subbands are adapted, and wherein the method assembles into the encoded signal an indication of the adapted frequency extents.
  - 13. The method according to claim 12 wherein the frequency extents are adapted by selecting from a set of extents.
  - 14. The method according to claim 1 for a plurality of the input audio signals, wherein the method comprises:
    - obtaining from the plurality of input audio signals a coupled-channel signal having spectral components representing a composite of spectral components of two or more of the input audio signals in a third set of frequency subbands;
      
      obtaining energy measures of at least some spectral components of the coupled-channel signal;
      
      obtaining energy measures of at least some of the spectral components of the two or more input audio signals represented by the coupled-channel signal in the third set of frequency subbands; and
      
      calculating coupling scale factors by obtaining square roots of ratios of the energy measures of spectral components in the two or more input audio signals to the energy measures of spectral energy in the coupled-channel signal, square roots of ratios of the energy measures of spectral energy in the coupled-channel signal to the energy measures of spectral components in the two or more input audio signals, ratios of square roots of the energy measures of spectral components in the two or more input audio signals to square roots of the energy measures of spectral energy in the coupled-channel signal, or ratios of square roots of the energy measures of spectral energy in the coupled-channel signal to square roots of the energy measures of spectral components in the two or more input audio signals;
      
      wherein the scaling information also represents the coupling scale factors and the signal information also represents the spectral components in the coupled-channel signal.
  - 15. The method according to claim 14 wherein the one or more synthesized signals are to be generated at least in part by frequency translation of at least some of the spectral components of the input audio signals in the third set of frequency subbands.
  - 16. The method according to claim 14 that comprises:
    - detecting one or more characteristics of the plurality of input audio signals;
      
      adapting frequency extents of the first set of frequency subbands, the second set of frequency subbands, or the third set of frequency subbands in response to the detected characteristics; and
      
      assembling into the encoded signal an indication of the adapted frequency extents.
  - 17. The method according to claim 1 that comprises:
    - detecting one or more characteristics of the one or more input audio signals;
      
      adapting frequency extents of the first set of frequency subbands or the second set of frequency subbands in response to the detected characteristics; and
      
      assembling into the encoded signal an indication of the adapted frequency extents.

18. A method for decoding an encoded signal representing one or more input audio signals, wherein the method comprises:
- obtaining scaling information and signal information from the encoded signal, wherein the scaling information represents scale factors calculated from square roots of ratios of energy measures of spectral components or ratios of square roots of energy measures of spectral components, and the signal information represents spectral components for one or more baseband signals, wherein the spectral components in each baseband signal represent spectral components of a respective input audio signal in a first set of frequency subbands;
  
  generating for each respective baseband signal an associated synthesized signal having spectral components in a second set of frequency subbands that are not represented by the respective baseband signal, wherein the spectral components in the associated synthesized signal are scaled by multiplication or division according to one or more of the scale factors; and
  
  generating one or more output audio signals, wherein each output audio signal represents a respective input audio signal and is generated from the spectral components in a respective baseband signal and its associated synthesized signal.
- View Dependent Claims (19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31)
- - 19. The method according to claim 18 wherein the associated synthesized signal is generated at least in part by frequency translation of at least some of the spectral components in the respective baseband signal.
  - 20. The method according to claim 19 wherein the frequency translation maintains phase coherence.
  - 21. The method according to claim 18 wherein the associated synthesized signal is generated at least in part by generation of a noise-like signal having spectral levels adapted according to one or more of the scale factors.
  - 22. The method according to claim 18 that obtains from the encoded signal one or more normalizing values and reverses normalization of the scale factors with respect to the one or more normalizing values.
  - 23. The method according to claim 22 wherein the one or more normalizing values are conveyed in the encoded signal by scaling information that represents selected values in a set of values.
  - 24. The method according to claim 22 wherein the one or more normalizing values comprise a maximum allowable value for scale factors.
  - 25. The method according to claim 18 wherein frequency subbands of the associated synthesized signal are associated with a respective scale factor.
  - 26. The method according to claim 25 that adapts the generation of the associated synthesized signal in response to subband information conveyed in the encoded signal that specifies frequency extents of the frequency subbands.
  - 27. The method according to claim 26 wherein the subband information represents selected frequency extents in a set of extents.
  - 28. The method according to claim 18 for decoding a signal representing a plurality of input audio signals, wherein the method comprises:
    - obtaining from the encoded signal a coupled-channel signal having spectral components representing a composite of two or more of the plurality of input audio signals in a third set of frequency subbands, wherein the scaling information also represents coupling scale factors calculated from square roots of ratios of energy measures of spectral components of the two or more input audio signals in the third set of frequency subbands to the energy measures of spectral energy in the coupled-channel signal, square roots of ratios of the energy measures of spectral energy in the coupled-channel signal to the energy measures of spectral components of the two or more input audio signals in the third set of frequency subbands, ratios of square roots of the energy measures of spectral components of the two or more input audio signals in the third set of frequency subbands to square roots of the energy measures of spectral energy in the coupled-channel signal, or ratios of square roots of the energy measures of spectral energy in the coupled-channel signal to square roots of the energy measures of spectral components of the two or more input audio signals in the third set of frequency subbands; and
      
      generating from the coupled-channel signal a respective decoupled signal for each of the two or more input audio signals represented by the coupled-channel signal, wherein the decoupled signals have spectral components in the third set of frequency subbands that are scaled by multiplication or division according to one or more of the coupling scale factors;
      
      wherein output audio signals representing the two or more input audio signals are also generated from the spectral components in respective decoupled signals.
  - 29. The method according to claim 28 wherein the associated synthesized signal is generated at least in part by frequency translation of at least some of the spectral components in the third set of frequency subbands.
  - 30. The method according to claim 28 that comprises:
    - obtaining from the encoded signal an indication of frequency extents of the first, second or third sets of frequency subbands; and
      
      adapting the generation of synthesized signals and decoupled signals in response to the indication.
  - 31. The method according to claim 18 that comprises:
    - obtaining from the encoded signal an indication of frequency extents of the first or second sets of frequency subbands; and
      
      adapting the generation of synthesized signals and decoupled signals in response to the indication.

32. A method for encoding a plurality of input audio signals, wherein the method comprises:
- receiving the plurality of input audio signals and obtaining therefrom a plurality of baseband signals, a plurality of residual signals and a coupled-channel signal, wherein spectral components of a baseband signal represent spectral components of a respective input audio signal in a first set of frequency subbands and spectral components of an associated residual signal represent spectral components of the respective input audio signal in a second set of frequency subbands that are not represented by the baseband signal, and wherein spectral components of the coupled-channel signal represent a composite of spectral components of two or more of the input audio signals in a third set of frequency subbands;
  
  obtaining energy measures of at least some spectral components of each residual signal and the two or more input audio signals represented by the coupled-channel signal; and
  
  assembling control information and signal information into an encoded signal, wherein the control information is derived from the energy measures and wherein the signal information represents the spectral components in the plurality of baseband signals and the coupled-channel signal.
- View Dependent Claims (33, 34, 35)
- - 33. The method according to claim 32 that comprises:
    - obtaining energy measures of at least some spectral components of one or more synthesized signals to be generated during decoding, wherein the one or more synthesized signals have spectral components within the second set of frequency subbands; and
      
      deriving at least some of the control information by calculating square roots of ratios of the energy measures or ratios of square roots of the energy measures.
  - 34. The method of claim 33 wherein at least some of the spectral components of the one or more synthesized signals are to be synthesized from spectral components in the third set of frequency subbands.
  - 35. The method according to claim 32 wherein frequency extents of the sets of frequency subbands are adapted, and wherein the method assembles into the encoded signal an indication of the adapted frequency extents.

36. A method for decoding an encoded signal representing a plurality of input audio signals, wherein the method comprises:
- obtaining control information and signal information from the encoded signal, wherein the control information is derived from energy measures of spectral components and the signal information represents spectral components of a plurality of baseband signals and a coupled-channel signal, wherein the spectral components in each baseband signal represent spectral components of a respective input audio signal in a first set of frequency subbands and the spectral components of the coupled-channel signal represent a composite of spectral components in a third set of frequency subbands of two or more of the plurality of input audio signals;
  
  generating for each respective baseband signal an associated synthesized signal having spectral components in a second set of frequency subbands that are not represented by the respective baseband signal, wherein the spectral components in the associated synthesized signal are scaled according to the control information;
  
  generating from the coupled-channel signal a respective decoupled signal for each of the two or more input audio signals represented by the coupled-channel signal, wherein the decoupled signals have spectral components in the third set of frequency subbands that are scaled according to the control information; and
  
  generating a plurality of output audio signals, wherein each output audio signal represents a respective input audio signal and is generated from the spectral components in a respective baseband signal and its associated synthesized signal, and wherein output audio signals representing the two or more audio signals are also generated from the spectral components in the respective decoupled signals.
- View Dependent Claims (37, 38, 39)
- - 37. The method according to claim 36 wherein the control information conveys a representation of scale factors calculated from square roots of ratios of energy measures or ratios of square roots of the energy measures, and wherein some of the energy measures in the ratios represent energy of at least some spectral components of the synthesized signals.
  - 38. The method of claim 37 wherein at least some of the spectral components of the one or more synthesized signals are synthesized from spectral components in the third set of frequency subbands.
  - 39. The method according to claim 36 wherein frequency extents of one or more of the sets of frequency subbands are adapted in response to the control information.

40. An encoder for encoding one or more input audio signals, wherein the encoder has processing circuitry that performs a signal processing method that comprises:
- receiving the one or more input audio signals and obtaining therefrom one or more baseband signals and one or more residual signals, wherein spectral components of a baseband signal represent spectral components of a respective input audio signal in a first set of frequency subbands and spectral components in an associated residual signal represent spectral components of the respective input audio signal in a second set of frequency subbands that are not represented by the baseband signal;
  
  obtaining energy measures of at least some spectral components of one or more synthesized signals to be generated during decoding, wherein the one or more synthesized signals have spectral components within the second set of frequency subbands;
  
  obtaining energy measures of at least some spectral components of each residual signal;
  
  calculating scale factors by obtaining square roots of ratios of the energy measures of spectral components in the residual signals to the energy measures of spectral components in the one or more synthesized signals, square roots of ratios of the energy measures of spectral components in the one or more synthesized signals to the energy measures of spectral components in the residual signals, ratios of square roots of the energy measures of spectral components in the residual signals to square roots of the energy measures of spectral components in the one or more synthesized signals, or ratios of square roots of the energy measures of spectral components in the one or more synthesized signals to square roots of the energy measures of spectral components in the residual signals; and
  
  assembling signal information and scaling information into an encoded signal, wherein the signal information represents the spectral components in the one or more baseband signals and the scaling information represents the scale factors.

41. A decoder for decoding an encoded signal representing one or more input audio signals, wherein the decoder has processing circuitry that performs a signal processing method that comprises:
- obtaining scaling information and signal information from the encoded signal, wherein the scaling information represents scale factors calculated from square roots of ratios of energy measures of spectral components or ratios of square roots of energy measures of spectral components, and the signal information represents spectral components for one or more baseband signals, wherein the spectral components in each baseband signal represent spectral components of a respective input audio signal in a first set of frequency subbands;
  
  generating for each respective baseband signal an associated synthesized signal having spectral components in a second set of frequency subbands that are not represented by the respective baseband signal, wherein the spectral components in the associated synthesized signal are scaled by multiplication or division according to one or more of the scale factors; and
  
  generating one or more output audio signals, wherein each output audio signal represents a respective input audio signal and is generated from the spectral components in a respective baseband signal and its associated synthesized signal.

42. An encoder for encoding a plurality of input audio signals, wherein the encoder has processing circuitry that performs a signal processing method that comprises:
- receiving the plurality of input audio signals and obtaining therefrom a plurality of baseband signals, a plurality of residual signals and a coupled-channel signal, wherein spectral components of a baseband signal represent spectral components of a respective input audio signal in a first set of frequency subbands and spectral components of an associated residual signal represent spectral components of the respective input audio signal in a second set of frequency subbands that are not represented by the baseband signal, and wherein spectral components of the coupled-channel signal represent a composite of spectral components of two or more of the input audio signals in a third set of frequency subbands;
  
  obtaining energy measures of at least some spectral components of each residual signal and the two or more input audio signals represented by the coupled-channel signal; and
  
  assembling control information and signal information into an encoded signal, wherein the control information is derived from the energy measures and wherein the signal information represents the spectral components in the plurality of baseband signals and the coupled-channel signal.

43. A decoder for decoding an encoded signal representing a plurality of input audio signals, wherein the decoder has processing circuitry that performs a signal processing method that comprises:
- obtaining control information and signal information from the encoded signal, wherein the control information is derived from energy measures of spectral components and the signal information represents spectral components of a plurality of baseband signals and a coupled-channel signal, wherein the spectral components in each baseband signal represent spectral components of a respective input audio signal in a first set of frequency subbands and the spectral components of the coupled-channel signal represent a composite of spectral components in a third set of frequency subbands of two or more of the plurality of input audio signals;
  
  generating for each respective baseband signal an associated synthesized signal having spectral components in a second set of frequency subbands that are not represented by the respective baseband signal, wherein the spectral components in the associated synthesized signal are scaled according to the control information;
  
  generating from the coupled-channel signal a respective decoupled signal for each of the two or more input audio signals represented by the coupled-channel signal, wherein the decoupled signals have spectral components in the third set of frequency subbands that are scaled according to the control information; and
  
  generating a plurality of output audio signals, wherein each output audio signal represents a respective input audio signal and is generated from the spectral components in a respective baseband signal and its associated synthesized signal, and wherein output audio signals representing the two or more audio signals are also generated from the spectral components in the respective decoupled signals.

44. A medium conveying a program of instructions executable by a device, wherein execution of the program of instructions causes the device to perform a method for encoding one or more input audio signals, wherein the method comprises:
- receiving the one or more input audio signals and obtaining therefrom one or more baseband signals and one or more residual signals, wherein spectral components of a baseband signal represent spectral components of a respective input audio signal in a first set of frequency subbands and spectral components in an associated residual signal represent spectral components of the respective input audio signal in a second set of frequency subbands that are not represented by the baseband signal;
  
  obtaining energy measures of at least some spectral components of one or more synthesized signals to be generated during decoding, wherein the one or more synthesized signals have spectral components within the second set of frequency subbands;
  
  obtaining energy measures of at least some spectral components of each residual signal;
  
  calculating scale factors by obtaining square roots of ratios of the energy measures of spectral components in the residual signals to the energy measures of spectral components in the one or more synthesized signals, square roots of ratios of the energy measures of spectral components in the one or more synthesized signals to the energy measures of spectral components in the residual signals, ratios of square roots of the energy measures of spectral components in the residual signals to square roots of the energy measures of spectral components in the one or more synthesized signals, or ratios of square roots of the energy measures of spectral components in the one or more synthesized signals to square roots of the energy measures of spectral components in the residual signals; and
  
  assembling signal information and scaling information into an encoded signal, wherein the signal information represents the spectral components in the one or more baseband signals and the scaling information represents the scale factors.

45. A medium conveying a program of instructions executable by a device, wherein execution of the program of instructions causes the device to perform a method for decoding an encoded signal representing one or more input audio signals, wherein the method comprises:
- obtaining scaling information and signal information from the encoded signal, wherein the scaling information represents scale factors calculated from square roots of ratios of energy measures of spectral components or ratios of square roots of energy measures of spectral components, and the signal information represents spectral components for one or more baseband signals, wherein the spectral components in each baseband signal represent spectral components of a respective input audio signal in a first set of frequency subbands;
  
  generating for each respective baseband signal an associated synthesized signal having spectral components in a second set of frequency subbands that are not represented by the respective baseband signal, wherein the spectral components in the associated synthesized signal are scaled by multiplication or division according to one or more of the scale factors; and
  
  generating one or more output audio signals, wherein each output audio signal represents a respective input audio signal and is generated from the spectral components in a respective baseband signal and its associated synthesized signal.
- View Dependent Claims (46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58)
- - 46. The medium according to claim 45 wherein the associated synthesized signal is generated at least in part by frequency translation of at least some of the spectral components in the respective baseband signal.
  - 47. The medium according to claim 46 wherein the frequency translation maintains phase coherence.
  - 48. The medium according to claim 45 wherein the associated synthesized signal is generated at least in part by generation of a noise-like signal having spectral levels adapted according to one or more of the scale factors.
  - 49. The medium according to claim 45 wherein the method obtains from the encoded signal one or more normalizing values and reverses normalization of the scale factors with respect to the one or more normalizing values.
  - 50. The medium according to claim 49 wherein the one or more normalizing values are conveyed in the encoded signal by scaling information that represents selected values in a set of values.
  - 51. The medium according to claim 49 wherein the one or more normalizing values comprise a maximum allowable value for scale factors.
  - 52. The medium according to claim 45 wherein frequency subbands of the associated synthesized signal are associated with a respective scale factor.
  - 53. The medium according to claim 52 wherein the method adapts the generation of the associated synthesized signal in response to subband information conveyed in the encoded signal that specifies frequency extents of the frequency subbands.
  - 54. The medium according to claim 53 wherein the subband information represents selected frequency extents in a set of extents.
  - 55. The medium according to claim 45 for decoding a signal representing a plurality of input audio signals, wherein the method comprises:
    - obtaining from the encoded signal a coupled-channel signal having spectral components representing a composite of two or more of the plurality of input audio signals in a third set of frequency subbands, wherein the scaling information also represents coupling scale factors calculated from square roots of ratios of energy measures of spectral components of the two or more input audio signals in the third set of frequency subbands to the energy measures of spectral energy in the coupled-channel signal, square roots of ratios of the energy measures of spectral energy in the coupled-channel signal to the energy measures of spectral components of the two or more input audio signals in the third set of frequency subbands, ratios of square roots of the energy measures of spectral components of the two or more input audio signals in the third set of frequency subbands to square roots of the energy measures of spectral energy in the coupled-channel signal, or ratios of square roots of the energy measures of spectral energy in the coupled-channel signal to square roots of the energy measures of spectral components of the two or more input audio signals in the third set of frequency subbands; and
      
      generating from the coupled-channel signal a respective decoupled signal for each of the two or more input audio signals represented by the coupled-channel signal, wherein the decoupled signals have spectral components in the third set of frequency subbands that are scaled by multiplication or division according to one or more of the coupling scale factors;
      
      wherein output audio signals representing the two or more input audio signals are also generated from the spectral components in respective decoupled signals.
  - 56. The medium according to claim 55 wherein the associated synthesized signal is generated at least in part by frequency translation of at least some of the spectral components in the third set of frequency subbands.
  - 57. The medium according to claim 55 wherein the method comprises:
    - obtaining from the encoded signal an indication of frequency extents of the first, second or third sets of frequency subbands; and
      
      adapting the generation of synthesized signals and decoupled signals in response to the indication.
  - 58. The medium according to claim 45 wherein the method comprises:
    - obtaining from the encoded signal an indication of frequency extents of the first or second sets of frequency subbands; and
      
      adapting the generation of synthesized signals and decoupled signals in response to the indication.

59. A medium conveying a program of instructions executable by a device, wherein execution of the program of instructions causes the device to perform a method for encoding a plurality of input audio signals, wherein the method comprises:
- receiving the plurality of input audio signals and obtaining therefrom a plurality of baseband signals, a plurality of residual signals and a coupled-channel signal, wherein spectral components of a baseband signal represent spectral components of a respective input audio signal in a first set of frequency subbands and spectral components of an associated residual signal represent spectral components of the respective input audio signal in a second set of frequency subbands that are not represented by the baseband signal, and wherein spectral components of the coupled-channel signal represent a composite of spectral components of two or more of the input audio signals in a third set of frequency subbands;
  
  obtaining energy measures of at least some spectral components of each residual signal and the two or more input audio signals represented by the coupled-channel signal; and
  
  assembling control information and signal information into an encoded signal, wherein the control information is derived from the energy measures and wherein the signal information represents the spectral components in the plurality of baseband signals and the coupled-channel signal.

60. A medium conveying a program of instructions executable by a device, wherein execution of the program of instructions causes the device to perform a method for decoding an encoded signal representing a plurality of input audio signals, wherein the method comprises:
- obtaining control information and signal information from the encoded signal, wherein the control information is derived from energy measures of spectral components and the signal information represents spectral components of a plurality of baseband signals and a coupled-channel signal, wherein the spectral components in each baseband signal represent spectral components of a respective input audio signal in a first set of frequency subbands and the spectral components of the coupled-channel signal represent a composite of spectral components in a third set of frequency subbands of two or more of the plurality of input audio signals;
  
  generating for each respective baseband signal an associated synthesized signal having spectral components in a second set of frequency subbands that are not represented by the respective baseband signal, wherein the spectral components in the associated synthesized signal are scaled according to the control information;
  
  generating from the coupled-channel signal a respective decoupled signal for each of the two or more input audio signals represented by the coupled-channel signal, wherein the decoupled signals have spectral components in the third set of frequency subbands that are scaled according to the control information; and
  
  generating a plurality of output audio signals, wherein each output audio signal represents a respective input audio signal and is generated from the spectral components in a respective baseband signal and its associated synthesized signal, and wherein output audio signals representing the two or more audio signals are also generated from the spectral components in the respective decoupled signals.
- View Dependent Claims (61, 62, 63)
- - 61. The medium according to claim 60 wherein the control information conveys a representation of scale factors calculated from square roots of ratios of energy measures or ratios of square roots of the energy measures, and wherein some of the energy measures in the ratios represent energy of at least some spectral components of the synthesized signals.
  - 62. The medium according to claim 61 wherein at least some of the spectral components of the one or more synthesized signals are synthesized from spectral components in the third set of frequency subbands.
  - 63. The medium according to claim 60 wherein frequency extents of one or more of the sets of frequency subbands are adapted in response to the control information.

64. A medium conveying encoded information representing one or more input audio signals, wherein the encoded information was generated by a method that comprises:
- receiving the one or more input audio signals and obtaining therefrom one or more baseband signals and one or more residual signals, wherein spectral components of a baseband signal represent spectral components of a respective input audio signal in a first set of frequency subbands and spectral components in an associated residual signal represent spectral components of the respective input audio signal in a second set of frequency subbands that are not represented by the baseband signal;
  
  obtaining energy measures of at least some spectral components of one or more synthesized signals to be generated during decoding, wherein the one or more synthesized signals have spectral components within the second set of frequency subbands;
  
  obtaining energy measures of at least some spectral components of each residual signal;
  
  calculating scale factors by obtaining square roots of ratios of the energy measures of spectral components in the residual signals to the energy measures of spectral components in the one or more synthesized signals, square roots of ratios of the energy measures of spectral components in the one or more synthesized signals to the energy measures of spectral components in the residual signals, ratios of square roots of the energy measures of spectral components in the residual signals to square roots of the energy measures of spectral components in the one or more synthesized signals, or ratios of square roots of the energy measures of spectral components in the one or more synthesized signals to square roots of the energy measures of spectral components in the residual signals; and
  
  assembling signal information and scaling information into an encoded signal, wherein the signal information represents the spectral components in the one or more baseband signals and the scaling information represents the scale factors.
- View Dependent Claims (65, 66, 67, 68)
- - 65. The medium according to claim 64 for a plurality of the input audio signals, wherein the method comprises:
    - obtaining from the plurality of input audio signals a coupled-channel signal having spectral components representing a composite of spectral components of two or more of the input audio signals in a third set of frequency subbands;
      
      obtaining energy measures of at least some spectral components of the coupled-channel signal;
      
      obtaining energy measures of at least some of the spectral components of the two or more input audio signals represented by the coupled-channel signal in the third set of frequency subbands; and
      
      calculating coupling scale factors by obtaining square roots of ratios of the energy measures of spectral components in the two or more input audio signals to the energy measures of spectral energy in the coupled-channel signal, square roots of ratios of the energy measures of spectral energy in the coupled-channel signal to the energy measures of spectral components in the two or more input audio signals, ratios of square roots of the energy measures of spectral components in the two or more input audio signals to square roots of the energy measures of spectral energy in the coupled-channel signal, or ratios of square roots of the energy measures of spectral energy in the coupled-channel signal to square roots of the energy measures of spectral components in the two or more input audio signals;
      
      wherein the scaling information also represents the coupling scale factors and the signal information also represents the spectral components in the coupled-channel signal.
  - 66. The medium according to claim 65 wherein the one or more synthesized signals are to be generated at least in part by frequency translation of at least some of the spectral components of the input audio signals in the third set of frequency subbands.
  - 67. The medium according to claim 65 wherein the method comprises:
    - detecting one or more characteristics of the plurality of input audio signals;
      
      adapting frequency extents of the first set of frequency subbands, the second set of frequency subbands, or the third set of frequency subbands in response to the detected characteristics; and
      
      assembling into the encoded signal an indication of the adapted frequency extents.
  - 68. The medium according to claim 64 wherein the method comprises:
    - detecting one or more characteristics of the one or more input audio signals;
      
      adapting frequency extents of the first set of frequency subbands or the second set of frequency subbands in response to the detected characteristics; and
      
      assembling into the encoded signal an indication of the adapted frequency extents.

69. A medium conveying encoded information representing a plurality of input audio signals, wherein the encoded information was generated by a method that comprises:
- receiving the plurality of input audio signals and obtaining therefrom a plurality of baseband signals, a plurality of residual signals and a coupled-channel signal, wherein spectral components of a baseband signal represent spectral components of a respective input audio signal in a first set of frequency subbands and spectral components of an associated residual signal represent spectral components of the respective input audio signal in a second set of frequency subbands that are not represented by the baseband signal, and wherein spectral components of the coupled-channel signal represent a composite of spectral components of two or more of the input audio signals in a third set of frequency subbands;
  
  obtaining energy measures of at least some spectral components of each residual signal and the two or more input audio signals represented by the coupled-channel signal; and
  
  assembling control information and signal information into an encoded signal, wherein the control information is derived from the energy measures and wherein the signal information represents the spectral components in the plurality of baseband signals and the coupled-channel signal.
- View Dependent Claims (70, 71)
- - 70. The medium according to claim 69 wherein the method comprises:
    - obtaining energy measures of at least some spectral components of one or more synthesized signals to be generated during decoding, wherein the one or more synthesized signals have spectral components within the second set of frequency subbands and at least some of the spectral components of the one or more synthesized signals are to be synthesized from spectral components in the third set of frequency subbands; and
      
      deriving at least some of the control information by calculating square roots of ratios of the energy measures or ratios of square roots of the energy measures.
  - 71. The medium according to claim 69 wherein frequency extents of the sets of frequency subbands are adapted, and wherein the method assembles into the encoded signal an indication of the adapted frequency extents.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Dolby Laboratories Licensing Corporation (Dolby Laboratories Incorporated)
Original Assignee
Dolby Laboratories Licensing Corporation (Dolby Laboratories Incorporated)
Inventors
Truman, Michael Mead, Williams, Philip Anthony, Vernon, Stephen Decker, Andersen, Robert Loring

Granted Patent

US 7,318,035 B2
Time in Patent Office

Days
Field of Search
US Class Current

704/500
CPC Class Codes

G10L 19/02 using spectral analysis, e....

G10L 21/038 using band spreading techni...

Audio coding systems and methods using spectral component coupling and spectral component regeneration

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

Citations

71 Claims

Specification

Solutions

Use Cases

Quick Links

Audio coding systems and methods using spectral component coupling and spectral component regeneration

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

71 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links