Spatialization arrangement for conference call
First Claim
1. A method for distinguishing speakers in a conference call of a plurality of participants, the method comprising:
- receiving speech frames of the conference call, said speech frames including encoded speech parameters;
examining at least one speech parameter of the received speech frames; and
classifying the speech frames to belong to one of the participants, the classification being carried out according to differences in the examined at least one speech parameter.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for distinguishing speakers in a conference call of a plurality of participants, in which method speech frames of the conference call are received in a receiving unit, which speech frames include encoded speech parameters. At least one parameter of the received speech frames is examined in an audio codec of the receiving unit, and the speech frames are classified to belong to one of the participants, the classification being carried out according to differences in the examined at least one speech parameter. These functions may be carried out in a speaker identification block, which is applicable in various positions of a teleconferencing processing chain. Finally, a spatialization effect is created in a terminal reproducing the audio signal according to notified differences by placing the participants at distinct positions in an acoustical space of the audio signal.
72 Citations
32 Claims
-
1. A method for distinguishing speakers in a conference call of a plurality of participants, the method comprising:
-
receiving speech frames of the conference call, said speech frames including encoded speech parameters;
examining at least one speech parameter of the received speech frames; and
classifying the speech frames to belong to one of the participants, the classification being carried out according to differences in the examined at least one speech parameter. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system for distinguishing speakers in a conference call with a plurality of participants, the system comprising:
-
means for receiving speech frames of the conference call, said speech frames including encoded speech parameters;
means for examining at least one parameter of the received speech frames; and
means for classifying the speech frames to belong to one of the participants, the classification being based on differences in the examined at least one speech parameter. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A terminal device for a three-dimensional spatialization of an audio signal of a conference call with a plurality of participants, the device comprising:
-
means for receiving speech frames of the conference call, said speech frames including encoded speech parameters;
means for examining at least one parameter of the received speech frames;
means for classifying the speech frames to belong to one of the participants, the classification being based on differences in the examined at least one speech parameter; and
a spatialization means for creating a spatialization effect to the audio signal to be reproduced by placing the participants at distinct positions in an acoustical space of the audio signal. - View Dependent Claims (16, 17, 20)
-
-
18. A computer program product, stored on a computer readable medium and executable in a data processing device, for a three-dimensional spatialization of an audio signal of a conference call with a plurality of participants, the computer program product comprising:
-
a computer program code section for receiving speech frames of the conference call, said speech frames including encoded speech parameters;
a computer program code section for examining at least one parameter of the received speech frames;
a computer program code section for classifying the speech frames to belong to one of the participants, the classification being based on differences in the examined at least one speech parameter; and
a computer program code section for creating a spatialization effect to the audio signal to be reproduced by placing the participants at distinct positions in an acoustical space of the audio signal. - View Dependent Claims (19)
-
-
21. A conference bridge for a teleconferencing system, the bridge comprising:
-
means for receiving speech frames of the conference call with a plurality of participants, said speech frames including encoded speech parameters;
means for examining at least one parameter of the received speech frames;
means for classifying the speech frames to belong to one of the participants, the classification being based on differences in the examined at least one speech parameter; and
means for including information based on the speech frame classification of the participants in an audio signal for a further spatialization processing of the audio signal. - View Dependent Claims (22, 23, 24, 25, 26)
-
-
27. A computer program product, stored on a computer readable medium and executable in a data processing device, for distinguishing speakers in a conference call with a plurality of participants, the computer program product comprising:
-
a computer program code section for receiving speech frames of the conference call, said speech frames including encoded speech parameters;
a computer program code section for examining at least one parameter of the received speech frames;
a computer program code section for classifying the speech frames to belong to one of the participants, the classification being based on differences in the examined at least one speech parameter; and
a computer program code section for including information based on the speech frame classification of the participants in an audio signal for a further spatialization processing of the audio signal.
-
-
28. A terminal device for operating as a master terminal connecting a plurality of slave terminals to a conference bridge, the terminal device comprising:
-
means for receiving speech frames of the conference call with a plurality of participants, said speech frames including encoded speech parameters;
an audio codec for examining at least one parameter of the received speech frames;
means for classifying the speech frames to belong to one of the participants, the classification being based on differences in the examined at least one speech parameter; and
means for including information based on the speech frame classification of the participants in an audio signal for a further spatialization processing of the audio signal. - View Dependent Claims (29, 30, 31, 32)
-
Specification