Videoconferencing endpoint having multiple voice-tracking cameras
Abstract
A videoconferencing apparatus automatically tracks speakers in a room and dynamically switches between a controlled, people-view camera and a fixed, room-view camera. When no one is speaking, the apparatus shows the room view to the far end. When there is a dominant speaker in the room, the apparatus directs the people-view camera at the dominant speaker and switches from the room-view camera to the people-view camera. When there is a new speaker in the room, the apparatus switches to the room-view camera first, directs the people-view camera at the new speaker, and then switches to the people-view camera directed at the new speaker. When there are two near-end speakers engaged in a conversation, the apparatus tracks the speakers and zooms in the people-view camera so that both speakers are in view.
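The switching behavior summarized in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function name `plan_switch`, the action strings, and the representation of speakers as location labels are all hypothetical, chosen only to show the ordering of steps (room view first, then re-aim, then people view).

```python
from typing import List, Optional, Tuple

Target = Tuple[str, ...]  # hypothetical: labels of the location(s) the people-view camera frames


def plan_switch(current_target: Optional[Target],
                speakers: Tuple[str, ...]) -> Tuple[List[str], Optional[Target]]:
    """Return (ordered camera actions, new people-view target).

    `speakers` holds zero, one (dominant speaker), or two (conversation)
    active-speaker location labels. All names here are illustrative assumptions.
    """
    if not speakers:
        # No one is speaking: show the fixed room view.
        return ["show_room"], None

    target: Target = tuple(sorted(speakers))
    if target == current_target:
        # People-view camera already frames the speaker(s): keep showing it.
        return ["show_people"], target

    # New speaker or new pair: cover the camera move with the room view,
    # re-aim the people-view camera, then cut to it.
    aim = "aim_people_at:" + "+".join(target)
    return ["show_room", aim, "show_people"], target


if __name__ == "__main__":
    target = None
    for active in [(), ("A",), ("A",), ("B",), ("B", "A")]:
        actions, target = plan_switch(target, active)
        print(active, "->", actions)
```

Emitting `show_room` before every re-aim is a simplification: when the room view is already on screen that step has no visible effect, which matches the dominant-speaker case in the abstract.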
23 Claims
1. An automated videoconferencing method for an environment having a plurality of participants, the method comprising:
capturing video for a videoconference from one vantage point of the environment with at least one camera;
capturing audio in the environment with microphones co-located with the at least one camera;
determining at least one wide view of the participants in the captured video from the one vantage point of the environment;
determining one or more narrow views of one or more of the participants in the captured video from the one vantage point of the environment, the one or more narrow views contained in the at least one wide view; and
selectively outputting the wide or narrow views of the participants in the captured video for the videoconference based in part on the captured audio of the participants during the videoconference,
wherein to determine and selectively output the wide or narrow views, the method comprises:
  determining from the captured audio that at least two locations of the one or more of the participants have audio indicative of speech and are engaged in one type of audio exchange in the environment relative to the one vantage point based on a designation of an active speaker in the environment having alternated between the same at least two locations within a time frame,
  determining a selected view from the one or more narrow views of the captured video framing the at least two determined locations from the one vantage point of the environment, and
  outputting the selected narrow view of the captured video framing the at least two determined locations from the one vantage point for the videoconference.
View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
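The final three steps of claim 1 (detecting that the active speaker has alternated between the same two locations within a time frame, then framing both locations in a narrow view contained in the wide view) can be sketched as follows. This is an illustrative sketch under stated assumptions, not the claimed method itself: the names `conversing_pair` and `frame_pair`, the `min_switches` threshold, the use of pan angles in degrees to represent locations, and the `margin` parameter are all hypothetical.

```python
from typing import List, Optional, Tuple

Event = Tuple[float, str]  # (timestamp in seconds, active-speaker location label), oldest first


def conversing_pair(events: List[Event], window_s: float,
                    min_switches: int = 2) -> Optional[Tuple[str, str]]:
    """Return the two locations if the active speaker alternated between
    the same two locations within the last `window_s` seconds, else None."""
    if not events:
        return None
    latest = events[-1][0]
    recent = [loc for t, loc in events if latest - t <= window_s]
    # Collapse consecutive repeats so that only speaker changes remain.
    turns = [loc for i, loc in enumerate(recent) if i == 0 or loc != recent[i - 1]]
    locations = sorted(set(turns))
    if len(locations) == 2 and len(turns) - 1 >= min_switches:
        return locations[0], locations[1]
    return None


def frame_pair(pan_a: float, pan_b: float, wide: Tuple[float, float],
               margin: float = 5.0) -> Tuple[float, float]:
    """Return a (left, right) pan range in degrees that frames both
    locations plus a margin, clamped so it stays inside the wide view."""
    left = max(wide[0], min(pan_a, pan_b) - margin)
    right = min(wide[1], max(pan_a, pan_b) + margin)
    return left, right


if __name__ == "__main__":
    log = [(0.0, "A"), (3.0, "B"), (6.0, "A")]
    print(conversing_pair(log, window_s=10.0))    # ('A', 'B')
    print(frame_pair(-20.0, 15.0, (-35.0, 35.0)))  # (-25.0, 20.0)
```

Clamping the result to the wide view reflects the claim language that the narrow views are contained in the at least one wide view; how a real endpoint maps audio source locations to camera coordinates is outside this sketch.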
17. A videoconferencing apparatus for an environment having a plurality of participants, the apparatus comprising:
at least one camera capturing video for a videoconference from one vantage point of the environment;
a plurality of microphones co-located with the at least one camera and capturing audio;
a network interface communicatively coupling to a network; and
a processing unit operatively coupled to the network interface, the at least one camera, and the microphones, the processing unit programmed to:
  determine at least one wide view of the participants in the captured video from the one vantage point of the environment,
  determine one or more narrow views of one or more of the participants in the captured video from the one vantage point of the environment, the one or more narrow views contained in the at least one wide view, and
  selectively output the wide or narrow views of the participants in the captured video for the videoconference based in part on the captured audio during the videoconference,
wherein to determine and selectively output the wide or narrow views, the processing unit is programmed to:
  determine from the captured audio that at least two locations of the one or more of the participants have audio indicative of speech and are engaged in one type of audio exchange in the environment relative to the one vantage point based on a designation of an active speaker in the environment having alternated between the same at least two locations within a time frame,
  determine a selected view from the one or more narrow views of the captured video framing the at least two determined locations from the one vantage point of the environment, and
  output the selected narrow view of the captured video framing the at least two determined locations from the one vantage point for the videoconference.
View Dependent Claims (18, 19, 20, 21, 22, 23)
Specification