Classification of audio signals
First Claim
1. An apparatus comprising:
- a processor;
a memory including machine executable instructions, the memory and the machine executable instructions being configured to, in association with the processor, cause the apparatus to;
receive frames of an audio signal in a frequency band;
perform a first excitation for a speech like audio signal which is mostly speech signal; and
perform a second excitation for a music like audio signal;
wherein the apparatus is further caused to;
divide the frequency band into at least a first and a second group of sub band audio signals, wherein each sub band audio signal has a narrower bandwidth than said frequency band, and said second group containing sub bands of higher frequencies than said first group;
produce information indicative of normalised signal energies of a current frame of the audio signal at least at one sub band;
select one excitation among said at least first excitation and said second excitation, the selection based on a defined relation between normalised signal energy of said first group of sub bands and normalised signal energy of said second group of sub bands for the frames of the audio signal and to use said relation in the selection of the excitation; and
perform the selected excitation for a frame of the audio signal.
2 Assignments
0 Petitions
Accused Products
Abstract
An encoder comprising an input for inputting frames of an audio signal in a frequency band, at least a first excitation block for performing a first excitation for a speech like audio signal, and a second excitation block for performing a second excitation for a non-speech like audio signal. The encoder further comprises a filter for dividing the frequency band into a plurality of sub bands each having a narrower bandwidth than the frequency band. The encoder also comprises an excitation selection block for selecting one excitation block among the at least first excitation block and the second excitation block for performing the excitation for a frame of the audio signal on the basis of the properties of the audio signal at least at one of the sub bands. The invention also relates to a device, a system, a method and a storage medium for a computer program.
10 Citations
33 Claims
-
1. An apparatus comprising:
-
a processor; a memory including machine executable instructions, the memory and the machine executable instructions being configured to, in association with the processor, cause the apparatus to; receive frames of an audio signal in a frequency band; perform a first excitation for a speech like audio signal which is mostly speech signal; and perform a second excitation for a music like audio signal; wherein the apparatus is further caused to; divide the frequency band into at least a first and a second group of sub band audio signals, wherein each sub band audio signal has a narrower bandwidth than said frequency band, and said second group containing sub bands of higher frequencies than said first group; produce information indicative of normalised signal energies of a current frame of the audio signal at least at one sub band; select one excitation among said at least first excitation and said second excitation, the selection based on a defined relation between normalised signal energy of said first group of sub bands and normalised signal energy of said second group of sub bands for the frames of the audio signal and to use said relation in the selection of the excitation; and perform the selected excitation for a frame of the audio signal. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A device comprising an encoder comprising an input configured to input frames of an audio signal in a frequency band, a first excitation block configured to perform a first excitation for a speech like audio signal which is mostly speech signal, and a second excitation block configured to perform a second excitation for a music like audio signal, wherein said encoder further comprises a filter configured to divide the frequency band into at least a first and a second group of sub band audio signals, wherein each sub band audio signal has a narrower bandwidth than said frequency band and said second group containing sub bands of higher frequencies than said first group wherein said filter further comprises a filter block configured to produce information indicative of normalised signal energies of a current frame of the audio signal at least at one sub band;
- and the device also comprising an excitation selection block configured to select one excitation block among said at least first excitation block and said second excitation block, the selection based on a defined relation between normalised signal energy of said first group of sub bands and normalised signal energy of said second group of sub bands for the frames of the audio signal and to use said relation in the selection of the excitation block so that the selected excitation block performs the excitation for a frame of the audio signal.
- View Dependent Claims (9, 10, 11, 12, 13, 14, 15)
-
16. A mobile communication device comprising an encoder comprising an input configured to input frames of an audio signal in a frequency band, a first excitation block configured to perform a first excitation for a speech like audio signal which is mostly speech signal, and a second excitation block configured to perform a second excitation for a music like audio signal, wherein said encoder further comprises a filter configured to divide the frequency band into at least a first and a second group of sub band audio signals, wherein each sub band audio signal has a narrower bandwidth than said frequency band and said second group containing sub bands of higher frequencies than said first group wherein said filter further comprises a filter block configured to produce information indicative of normalised signal energies of a current frame of the audio signal at least at one sub band;
- and the device also comprising an excitation selection block configured to select one excitation block among said at least first excitation block and a second excitation block, the selection based on a defined relation between normalised signal energy of said first group of sub bands and normalised signal energy of said second group of sub bands for the frames of the audio signal and to use said relation in the selection of the excitation block so that the selected excitation block performs the excitation for a frame of the audio signal.
-
17. A system comprising an encoder comprising:
-
a processor; a memory including machine executable instructions, the memory and the machine executable instructions being configured to, in association with the processor, cause the encoder to; receive frames of an audio signal in a frequency band; perform a first excitation for a speech like audio signal which is mostly speech signal; and perform a second excitation for a music like audio signal; wherein said encoder is further caused to; divide the frequency band into at least a first and a second group of sub band audio signals, wherein each sub band audio signal has a narrower bandwidth than said frequency band and said second group containing sub bands of higher frequencies than said first group; produce information indicative of normalised signal energies of a current frame of the audio signal at least at one sub band; select one excitation among said at least first excitation and said second excitation, the selection based on a defined relation between normalised signal energy of said first group of sub bands and normalised signal energy of said second group of sub bands for the frames of the audio signal and to use said relation in the selection of the excitation; and perform the selected excitation for a frame of the audio signal. - View Dependent Claims (18, 19, 20, 21, 22, 23, 24, 25)
-
-
26. A method comprising:
-
receiving input frames of an audio signal in a frequency band at a device; using a first excitation for a speech like audio signal which is mostly speech signal; using a second excitation for a music like audio signal; dividing the frequency band into at least a first and a second group of sub band audio signals, wherein each sub band audio signal has a narrower bandwidth than said frequency band and said second group containing sub bands of higher frequencies than said first group; producing information indicative of normalised signal energies of a current frame of the audio signal at least at one sub band by using a filter block; selecting one excitation among said at least first excitation and said second excitation by defining a relation between normalised signal energy of said first group of sub bands and normalised signal energy of said second group of sub bands for the frames of the audio signal and using said relation in the selection of the excitation; and using the selected excitation to perform the excitation for a frame of the audio signal. - View Dependent Claims (27, 28, 29, 30)
-
-
31. A non-transitory computer readable medium stored with instructions, which when executed by a processor, perform:
-
compressing audio signals in a frequency band, in which a first excitation is used for a speech like audio signal which is mostly speech signal, and a second excitation is used for a music like audio signal; dividing the frequency band into at least a first and a second group of sub band audio signals, wherein each sub band audio signal has a narrower bandwidth than said frequency band and said second group containing sub bands of higher frequencies than said first group; producing information indicative of normalised signal energies of a current frame of the audio signal at least at one sub band by using a filter block; selecting one excitation among said at least first excitation and said second excitation by defining a relation between normalised signal energy of said first group of sub bands and normalised signal energy of said second group of sub bands for the frames of the audio signal and using said relation in the selection of the excitation; and using the selected excitation to perform the excitation for a frame of the audio signal. - View Dependent Claims (32, 33)
-
Specification