System And Method For Determining The Active Talkers In A Video Conference
Abstract
The present invention describes a method of determining the active talker for display on a video conferencing system, including the steps of: for each participant, capturing audio data using an audio capture sensor and video data using a video capture sensor; determining the probability of active speech (pA, pB . . . pN), where the probability of active speech is a function of the probability of soft voice detection captured by the audio capture sensor and the probability of lip motion detection captured by the video capture sensor; and automatically displaying at least the participant that has the highest probability of active speech.
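As an illustration only, the determining and displaying steps can be sketched in Python. The abstract requires only that the probability of active speech be a function of the two detection probabilities; the product rule used here, and the names `active_speech_probability` and `select_active_talker`, are assumptions made for this sketch and are not taken from the patent.

```python
from typing import Dict, Tuple


def active_speech_probability(p_voice: float, p_lip: float) -> float:
    """Combine the audio cue and the video cue for one participant.

    Assumption: a plain product, so the score is high only when both
    voice detection and lip motion detection are likely. Any other
    function of the two probabilities could be substituted.
    """
    for p in (p_voice, p_lip):
        if not 0.0 <= p <= 1.0:
            raise ValueError(f"probability out of range: {p}")
    return p_voice * p_lip


def select_active_talker(cues: Dict[str, Tuple[float, float]]) -> str:
    """Return the participant with the highest probability of active speech.

    `cues` maps a participant label to (p_voice, p_lip), i.e. the
    per-participant outputs of the audio and video detectors.
    """
    if not cues:
        raise ValueError("at least one participant is required")
    scores = {
        name: active_speech_probability(p_voice, p_lip)
        for name, (p_voice, p_lip) in cues.items()
    }
    # The participant returned here is the one that would be displayed.
    return max(scores, key=scores.get)


if __name__ == "__main__":
    # Hypothetical detector outputs for participants A, B and C.
    cues = {"A": (0.9, 0.2), "B": (0.6, 0.8), "C": (0.1, 0.1)}
    print(select_active_talker(cues))
```

With these hypothetical inputs, participant A has the strongest voice cue but little lip motion, so under the product rule participant B is selected for display.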
19 Claims
1. A method of determining the active talker for display on a video conferencing system, comprising the steps of:
for each participant, capturing audio data using an audio capture sensor and video data using a video capture sensor;
determining the probability of active speech (pA, pB . . . pN), where the probability of active speech is a function of the probability of soft voice detection captured by the audio capture sensor and the probability of lip motion detection captured by the video capture sensor; and
automatically displaying at least the participant that has the highest probability of active speech.
Dependent claims: 2 to 17.
18. A computer readable storage medium having computer-readable program instructions stored thereon for causing a computer system to perform a method of providing feedback to a participant in a video conference, the method comprising the steps of:
determining the probability of active speech (pA, pB . . . pN), where the probability of active speech is a function of the probability of soft voice detection captured by the audio capture sensor and the probability of lip motion detection captured by the video capture sensor; and
automatically displaying at least the participant that has the highest probability of active speech.
19. An apparatus for providing feedback to a participant in a video conference, the apparatus comprising:
a computer readable storage medium having computer-readable program instructions stored thereon for causing a computer system to perform a method of providing feedback to a participant in a video conference, the method comprising the steps of:
determining the probability of active speech (pA, pB . . . pN), where the probability of active speech is a function of the probability of soft voice detection captured by the audio capture sensor and the probability of lip motion detection captured by the video capture sensor; and
automatically displaying at least the participant that has the highest probability of active speech.
Specification