Audio mixing method, apparatus and system
Abstract
An audio mixing method, apparatus, and system are provided, which can ensure sound quality after audio mixing and reduce consumption of computing resources. The method includes: receiving an audio stream of each site, and analyzing the audio stream of each site to obtain a sound characteristic value of a sound source object; selecting, according to a descending sequence of sound characteristic values of sound source objects, a predetermined number of sound source objects from the sound source objects to serve as main sound source objects; determining, according to a relationship between a target site and the sites where the main sound source objects are located, audio streams that require audio mixing for the target site; and performing audio mixing on the audio streams that require audio mixing for the target site and sending the audio streams after the audio mixing to the target site.
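The selection and routing steps summarized in the abstract can be illustrated with a minimal Python sketch. It is not taken from the patent: the `SoundSource` type, the function names, the site labels, and the numeric characteristic values are all hypothetical, and the characteristic value is treated as a single number per sound source object.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SoundSource:
    site: str     # site whose audio stream carries this object (hypothetical label)
    name: str     # hypothetical object identifier
    value: float  # sound characteristic value, assumed to be one number


def select_main_sources(sources, count):
    """Return the `count` objects with the highest characteristic values."""
    return sorted(sources, key=lambda s: s.value, reverse=True)[:count]


def streams_to_mix(main_sources, target_site):
    """Sites whose streams need mixing for `target_site`.

    These are the sites that contributed a main sound source object,
    excluding the target site itself.
    """
    sites = []
    for source in main_sources:
        if source.site != target_site and source.site not in sites:
            sites.append(source.site)
    return sites


# Three sites, two sound source objects each (illustrative values only).
sources = [
    SoundSource("A", "a1", 0.9), SoundSource("A", "a2", 0.2),
    SoundSource("B", "b1", 0.7), SoundSource("B", "b2", 0.1),
    SoundSource("C", "c1", 0.5), SoundSource("C", "c2", 0.3),
]

main = select_main_sources(sources, 3)
routing = {site: streams_to_mix(main, site) for site in ("A", "B", "C")}
```

With these values, one object per site is selected, so each target site is assigned the streams of the other two sites. A site that contributes no main sound source object would simply never appear in any routing list.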
14 Claims
1. An audio mixing method, comprising:
receiving, from a first site, a first audio stream that comprises a first plurality of sound source objects;
receiving, from a second site, a second audio stream that comprises a second plurality of sound source objects;
receiving, from a third site, a third audio stream that comprises a third plurality of sound source objects;
analyzing the first audio stream to obtain a sound characteristic value of each of the plurality of sound source objects;
analyzing the second audio stream to obtain a sound characteristic value of each of the second plurality of sound source objects;
analyzing the third audio stream to obtain a sound characteristic value of each of the third plurality of sound source objects;
selecting, according to a descending sequence of sound characteristic values of the first plurality of sound source objects, the second plurality of sound source objects, and the third plurality of sound source objects, a predetermined number of sound source objects from the first plurality of sound source objects, the second plurality of sound source objects, and the third plurality of sound source objects to serve as main sound source objects,
wherein a portion, but not all, of each of the first plurality of sound source objects, the second plurality of sound source objects, and the third plurality of sound source objects are selected as main sound source objects for audio mixing, and
wherein a sound source object that is not selected as a main sound source object in an audio stream that contains a main sound source object is muted without muting the main sound source object or the audio stream;
determining each site selected from among the first site, the second site, and the third site from which each of the main sound source objects were received in an audio stream;
determining a target site selected from among the first site, the second site, and the third site;
determining audio streams that require audio mixing as audio streams received from each site selected from among the first site, the second site, and the third site, but not the target site, from which each of the main sound source objects were received in the audio stream; and
either performing audio mixing on the determined audio streams that require audio mixing for the target site, and sending the audio streams after the audio mixing to the target site; or
sending to the target site the determined audio streams that require audio mixing for the target site to perform audio mixing in the target site.
Dependent claims: 2, 3, 4, 5, 6, 7.

8. An audio mixing apparatus comprising:
a computer processor configured to:
receive an audio stream from a first site, a second site, and a third site, wherein each audio stream comprises a plurality of sound source objects;
analyze the audio stream of the first site, the second site, and the third site to obtain a sound characteristic value of each sound source object;
select, according to a descending sequence of sound characteristic values of the sound source objects, a predetermined number of the sound source objects to serve as main sound source objects, wherein a portion, but not all, of the plurality of sound source objects are selected as main sound source objects for audio mixing, and wherein a sound source object that is not selected as a main sound source object in an audio stream that contains a main sound source object is muted without muting the main sound source object or the audio stream;
determine each site selected from among the first site, the second site, and the third site from which each of the main sound source objects were received in an audio stream;
determine a target site selected from among the first site, the second site, and the third site; and
determine audio streams that require audio mixing as audio streams received from each site selected from among the first site, the second site, and the third site, but not the target site, from which each of the main sound source objects were received in the audio stream; and
a transmitter coupled to the computer processor and configured to either:
perform audio mixing on the determined audio streams that require audio mixing for the target site, and send the audio stream after the audio mixing to the target site; or
send, to the target site, the determined audio streams that require audio mixing for the target site to perform audio mixing in the target site.
Dependent claims: 9, 10, 11, 12, 13.

14. An audio mixing system, comprising:
a site terminal configured to:
collect an audio signal from at least one sound source object;
perform spatial audio object coding on the collected audio signal to form a down-mixed audio stream;
send the down-mixed audio stream and a spatial side information to an audio mixing apparatus, wherein the spatial side information comprises:
a maximum energy value among energy values of each sound source object in the down-mixed audio stream; and
a ratio of an energy value of each sound source object in the down-mixed audio stream to the maximum energy value among energy values of each sound source object in the down-mixed audio stream,
wherein the audio mixing apparatus comprises a computer processor configured to:
receive a down-mixed audio stream and spatial side information from a plurality of site terminals, wherein at least some of the plurality of site terminals comprise a plurality of sound source objects; and
analyze the down-mixed audio stream and spatial side information of each site terminal to obtain a sound characteristic value of each sound source object corresponding to each site terminal;
select, according to a descending sequence of sound characteristic values of the first plurality of sound source objects, the second plurality of sound source objects, and the third plurality of sound source objects, a predetermined number of sound source objects from the first plurality of sound source objects, the second plurality of sound source objects, and the third plurality of sound source objects to serve as main sound source objects, wherein a portion, but not all, of each of the first plurality of sound source objects, the second plurality of sound source objects, and the third plurality of sound source objects are selected as main sound source objects for audio mixing;
select, according to a descending sequence of sound characteristic values of sound source objects, a predetermined number of sound source objects from the sound source objects to serve as main sound source objects;
determine the site terminals where the main sound source objects are located; and
determine, according to a relationship between a target site terminal and the site terminals where the main sound source objects are located, the down-mixed audio streams that require audio mixing for the target site terminal, and
wherein the computer processor is further configured to either:
perform audio mixing on the determined down-mixed audio streams that require audio mixing for the target site terminal to form a mixed audio stream, and send the mixed audio stream after the audio mixing to the target site terminal; or
send, to the target site terminal via a transmitter, the determined down-mixed audio streams that require audio mixing for the target site terminal to perform audio mixing.
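The spatial side information recited in claim 14 (a maximum energy value plus, for each sound source object, the ratio of its energy to that maximum) can be illustrated with a short Python sketch. This is only an assumption-laden illustration, not the claimed implementation: the function names, object labels, and energy figures are hypothetical, and the recovered energy is used directly as the sound characteristic value, which the claim does not require.

```python
def make_side_info(energies):
    """Terminal side: pack per-object energies as (max energy, ratios to max)."""
    max_energy = max(energies.values())
    if max_energy == 0:
        # Silent stream: every ratio is defined as 0 in this sketch.
        return 0.0, {name: 0.0 for name in energies}
    return max_energy, {name: e / max_energy for name, e in energies.items()}


def object_energies(max_energy, ratios):
    """Mixer side: recover each object's energy as ratio * max energy."""
    return {name: max_energy * ratio for name, ratio in ratios.items()}


# Hypothetical per-object energies measured at two site terminals.
side_info = {
    "A": make_side_info({"a1": 8.0, "a2": 2.0}),
    "B": make_side_info({"b1": 4.0, "b2": 2.0}),
}

# The mixing apparatus rebuilds energies and ranks objects across all sites.
energies = {}
for site, (max_energy, ratios) in side_info.items():
    for name, energy in object_energies(max_energy, ratios).items():
        energies[(site, name)] = energy

ranked = sorted(energies, key=energies.get, reverse=True)
```

Because the ratios are relative to each terminal's own maximum, the maximum energy value is what makes objects from different sites comparable on a common scale.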
Specification