Video and audio tagging for active speaker detection
First Claim
1. A transmitter system for a videoconferencing system, comprising:
- a tag generator to generate an audio tag;
a combiner to combine an audio signal with the audio tag to produce a tagged audio signal; and
a transmitter to transmit the tagged audio signal and a corresponding video signal; and
a control system operative to;
determine whether the audio signal is above a threshold level;
if the audio signal has been determined to be above the threshold level, then determine whether the audio signal has an audio tag embedded therein; and
if the audio signal has been determined not to have an audio tag embedded therein, then either direct a camera toward a source of the audio signal or select a camera pointing toward a source of the audio signal, wherein the camera produces the corresponding video signal.
3 Assignments
0 Petitions
Accused Products
Abstract
A videoconferencing system is described that is configured to select an active speaker while avoiding erroneously selecting a microphone or camera that is picking up audio or video from a connected remote signal. A determination is made whether an audio signal is above a threshold level. If so, then a determination is made as to whether a tag is present in that audio signal. If so, that signal is ignored. If not, a camera is directed toward the sound source identified by the audio signal. A determination is made whether a tag is present in the video signal from that camera. If so, the camera is redirected. If not, local tag(s) are inserted into the audio signal and/or the video signal. The tagged signal(s) are transmitted. Thus, system will ignore sound or video that has an embedded tag from another videoconferencing system.
-
Citations
17 Claims
-
1. A transmitter system for a videoconferencing system, comprising:
-
a tag generator to generate an audio tag; a combiner to combine an audio signal with the audio tag to produce a tagged audio signal; and a transmitter to transmit the tagged audio signal and a corresponding video signal; and a control system operative to; determine whether the audio signal is above a threshold level; if the audio signal has been determined to be above the threshold level, then determine whether the audio signal has an audio tag embedded therein; and if the audio signal has been determined not to have an audio tag embedded therein, then either direct a camera toward a source of the audio signal or select a camera pointing toward a source of the audio signal, wherein the camera produces the corresponding video signal. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A method for operating a videoconferencing system, the method comprising:
-
receiving an audio signal; receiving a corresponding video signal; generating an audio tag; determining whether the audio signal is above a threshold level; if the audio signal has been determined to be above the threshold level, then determining whether the audio signal has an audio tag embedded therein; and if the audio signal has been determined not to have an audio lag embedded therein, then either directing a camera toward a source of the audio signal or selecting a camera pointing toward a source of the audio signal, wherein the camera produces the corresponding video signal; combining the audio signal with the audio tag to produce a tagged audio signal; and transmitting the tagged audio signal and the corresponding video signal. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A computer storage medium having computer executable instructions stored thereon which, when executed by a computer, cause the computer to:
-
determine whether a received audio signal is above a threshold level; if the received audio signal has been determined to be above the threshold level, the determine whether the received audio signal has an audio tag embedded therein; if the received audio signal has been determined not to have an audio lag embedded therein, then either direct a camera toward a source of the received audio signal or select a camera pointing toward a source of the received audio signal, wherein the camera produces a corresponding video signal; generate an audio tag; combine the received audio signal with the audio tag to produce a tagged audio signal; and transmit the tagged audio signal and the corresponding video signal. - View Dependent Claims (14, 15, 16, 17)
-
Specification