Spatialization arrangement for conference call
First Claim
1. A method comprising:
receiving speech frames of a conference call, said speech frames including encoded speech parameters;
examining at least one speech parameter of the received speech frames;
classifying the speech frames to belong to one of a plurality of participants in said conference call, the classification being carried out according to differences in the examined at least one speech parameter;
determining a control word for each participant according to differences in the examined at least one speech parameter; and
attaching control words to speech frames, the control word of each speech frame being characteristic to the participant speaking in the particular speech frame.
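The steps of the claim above can be illustrated with a minimal sketch. It is not the claimed implementation: the examined parameter (`pitch_lag`), the nearest-centroid rule, the `threshold` value, and all class and function names are hypothetical choices made only to show how frames might be classified by parameter differences and tagged with a per-participant control word.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class SpeechFrame:
    # Hypothetical examined speech parameter; the claim only requires
    # "at least one speech parameter" of the encoded frame.
    pitch_lag: float
    control_word: Optional[int] = None


@dataclass
class SpeakerClassifier:
    # Assumed maximum parameter distance to an already known participant.
    threshold: float = 10.0
    centroids: List[float] = field(default_factory=list)  # running mean per participant
    counts: List[int] = field(default_factory=list)       # frames seen per participant

    def classify(self, frame: SpeechFrame) -> int:
        """Assign the frame to the nearest known participant, or open a new one."""
        best, best_dist = None, None
        for idx, centroid in enumerate(self.centroids):
            dist = abs(frame.pitch_lag - centroid)
            if best_dist is None or dist < best_dist:
                best, best_dist = idx, dist

        if best is None or best_dist > self.threshold:
            # The parameter differs too much from every known participant.
            self.centroids.append(frame.pitch_lag)
            self.counts.append(1)
            best = len(self.centroids) - 1
        else:
            # Update the running mean of the matched participant.
            self.counts[best] += 1
            self.centroids[best] += (frame.pitch_lag - self.centroids[best]) / self.counts[best]

        # The participant index doubles as the control word in this sketch.
        frame.control_word = best
        return best


def tag_frames(frames: List[SpeechFrame], classifier: SpeakerClassifier) -> List[SpeechFrame]:
    """Attach a control word to every frame, in arrival order."""
    for frame in frames:
        classifier.classify(frame)
    return frames
```

In this sketch, frames whose parameter values cluster together receive the same control word, so a later processing stage can treat each control word as one participant.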
Abstract
A method for distinguishing speakers in a conference call of a plurality of participants, in which method speech frames of the conference call are received in a receiving unit, which speech frames include encoded speech parameters. At least one parameter of the received speech frames is examined in an audio codec of the receiving unit, and the speech frames are classified to belong to one of the participants, the classification being carried out according to differences in the examined at least one speech parameter. These functions may be carried out in a speaker identification block, which is applicable in various positions of a teleconferencing processing chain. Finally, a spatialization effect is created in a terminal reproducing the audio signal according to notified differences by placing the participants at distinct positions in an acoustical space of the audio signal.
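The final step of the abstract, placing participants at distinct positions, can be sketched as follows. This is a deliberately simplified stand-in: it uses constant-power stereo panning rather than a true three-dimensional spatialization, and the function names, the 120-degree spread, and the use of the control word as a participant index are all assumptions made for illustration.

```python
import math
from typing import List, Tuple


def azimuth_for(control_word: int, num_participants: int, spread_deg: float = 120.0) -> float:
    """Spread participants evenly across an assumed frontal arc, in degrees."""
    if num_participants < 2:
        return 0.0
    step = spread_deg / (num_participants - 1)
    return -spread_deg / 2 + control_word * step


def pan_gains(azimuth_deg: float, spread_deg: float = 120.0) -> Tuple[float, float]:
    """Constant-power (left, right) gains for an azimuth inside the arc."""
    x = (azimuth_deg + spread_deg / 2) / spread_deg  # 0.0 = far left, 1.0 = far right
    theta = x * math.pi / 2
    return math.cos(theta), math.sin(theta)


def spatialize(samples: List[float], control_word: int,
               num_participants: int) -> List[Tuple[float, float]]:
    """Render decoded mono samples as stereo pairs at the participant's position."""
    left, right = pan_gains(azimuth_for(control_word, num_participants))
    return [(s * left, s * right) for s in samples]
```

With three participants, control words 0, 1, and 2 map to the left edge, the center, and the right edge of the arc, so each voice is heard from a different direction.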
27 Claims
1. A method comprising:
receiving speech frames of a conference call, said speech frames including encoded speech parameters;
examining at least one speech parameter of the received speech frames;
classifying the speech frames to belong to one of a plurality of participants in said conference call, the classification being carried out according to differences in the examined at least one speech parameter;
determining a control word for each participant according to differences in the examined at least one speech parameter; and
attaching control words to speech frames, the control word of each speech frame being characteristic to the participant speaking in the particular speech frame.
Dependent claims: 2, 3, 4, 5, 6
7. A system comprising:
a receiving unit configured to receive speech frames of a conference call, said speech frames including encoded speech parameters;
a decoder configured to examine at least one parameter of the received speech frames;
a recognition block configured to classify the speech frames to belong to one of a plurality of participants in said conference call, the classification being based on differences in the examined at least one speech parameter and to determine a control word for each participant according to differences in the examined at least one speech parameter; and
a spatialization processing module configured to attach control words to speech frames, the control word of each speech frame being characteristic to the participant speaking in the particular speech frame.
Dependent claims: 8, 9, 10, 11, 12
13. A terminal device comprising:
a receiving unit configured to receive speech frames of a conference call, said speech frames including encoded speech parameters;
a decoder configured to examine at least one parameter of the received speech frames;
a recognition block configured to classify the speech frames to belong to one of a plurality of participants in said conference call, the classification being based on differences in the examined at least one speech parameter and to determine a control word for each participant according to differences in the examined at least one speech parameter; and
a spatialization processing module configured to attach control words to speech frames, the control word of each speech frame being characteristic to the participant speaking in the particular speech frame, and configured to create a three-dimensional spatialization effect to an audio signal to be reproduced by placing the participants at distinct positions in an acoustical space of the audio signal.
Dependent claims: 14, 15
16. A computer readable medium stored with instructions, which when executed by a data processing device, performs:
receiving speech frames of a conference call, said speech frames including encoded speech parameters;
examining at least one parameter of the received speech frames;
classifying the speech frames to belong to one of a plurality of participants in said conference call, the classification being based on differences in the examined at least one speech parameter;
determining a control word for each participant according to differences in the examined at least one speech parameter;
attaching control words to speech frames, the control word of each speech frame being characteristic to the participant speaking in the particular speech frame, and
creating a three-dimensional spatialization effect to the audio signal to be reproduced by placing the participants at distinct positions in an acoustical space of the audio signal.
Dependent claims: 17
18. A conference bridge for a teleconferencing system, the bridge comprising:
a receiving unit configured to receive speech frames of a conference call with a plurality of participants, said speech frames including encoded speech parameters;
a decoder configured to examine at least one parameter of the received speech frames; and
a recognition block configured to classify the speech frames to belong to one of the participants, the classification being based on differences in the examined at least one speech parameter and to determine a control word for each participant according to differences in the examined at least one speech parameter; and to include information based on the speech frame classification of the participants in an audio signal for a spatialization processing of the audio signal.
Dependent claims: 19, 20, 21, 22
23. A computer readable medium stored with instructions, which when executed by a data processing device, performs:
receiving speech frames of a conference call, said speech frames including encoded speech parameters;
examining at least one parameter of the received speech frames;
classifying the speech frames to belong to one of a plurality of participants in said conference call, the classification being based on differences in the examined at least one speech parameter;
determining a control word for each participant according to differences in the examined at least one speech parameter; and
including information based on the speech frame classification of the participants in an audio signal for a further spatialization processing of the audio signal.
24. A terminal device comprising:
a receiving unit configured to receive speech frames of a conference call with a plurality of participants, said speech frames including encoded speech parameters;
an audio decoder configured to examine at least one parameter of the received speech frames; and
a recognition block configured to classify the speech frames to belong to one of the participants, the classification being based on differences in the examined at least one speech parameter; to include information based on the speech frame classification of the participants is configured to determine a control word for each participant according to differences in the examined at least one speech parameter; and to include information based on the speech frame classification of the participants in an audio signal for a further spatialization processing of the audio signal,
wherein said terminal device operates as a master terminal connecting a plurality of slave terminals to a conference bridge.
Dependent claims: 25, 26, 27
Specification