Smart video conferencing system
First Claim
1. A method of generating a video stream for use in a video conference, comprising:
- receiving, by a first video conferencing endpoint device, first video data captured from a first field of view of a physical environment, the first video data comprising a plurality of frames;
determining activity data from portions of the first video data using information provided in the plurality of frames;
generating, by the first video conferencing endpoint device, second video data from a second field of view of the physical environment, based on the determined activity data;
generating a video stream that comprises the generated second video data and audio data captured within the physical environment; and
transmitting the video stream to a video conferencing application executing on a user device, wherein the video conferencing application is configured to process the video stream as an input video stream to facilitate the transmission of the video stream to a remote video conferencing device for display.
1 Assignment
0 Petitions
Accused Products
Abstract
Embodiments provide techniques for facilitating transmission of a video stream from a first video conferencing device to a remote video conferencing device. Embodiments receive, by the first video conferencing endpoint device, first video data captured from a first field of view of a physical environment. The video data includes a plurality of frames. Activity data is determined for portions of the first video data across the plurality of frames. Embodiments generate, by a first video conferencing endpoint device, second video data from a second field of view of the physical environment, based on the determined activity data. Additionally, embodiments facilitate the transmission of the video stream to the remote video conferencing device for display, the video stream comprising the generated second video data and audio data captured within the physical environment.
82 Citations
29 Claims
-
1. A method of generating a video stream for use in a video conference, comprising:
-
receiving, by a first video conferencing endpoint device, first video data captured from a first field of view of a physical environment, the first video data comprising a plurality of frames; determining activity data from portions of the first video data using information provided in the plurality of frames; generating, by the first video conferencing endpoint device, second video data from a second field of view of the physical environment, based on the determined activity data; generating a video stream that comprises the generated second video data and audio data captured within the physical environment; and transmitting the video stream to a video conferencing application executing on a user device, wherein the video conferencing application is configured to process the video stream as an input video stream to facilitate the transmission of the video stream to a remote video conferencing device for display. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 22)
-
-
9. A system for generating a video stream for use in a video conference, comprising:
-
a first camera sensor configured to capture first video data comprising a plurality of frames from a first field of view of a physical environment; a second camera sensor; a mounting structure capable of adjusting an orientation of the second camera sensor along one or more degrees of freedom; control logic configured to; determine activity data for portions of the first video data across the plurality of frames; and control movement of the mounting structure to adjust the orientation of the second camera along the one or more degrees of freedom, based on the determined activity data; and video processing logic configured to; capture second video data from a second field of view of the physical environment using the second camera sensor; encode the captured second video data; generate a video stream comprising the captured second video data and audio data captured within the physical environment; and transmit the video stream to a video conferencing application executing on a user device, wherein the video conferencing application is configured to process the video stream as an input video stream to facilitate the transmission of the video stream to a remote video conferencing device for display. - View Dependent Claims (10, 11, 12, 13, 14, 15)
-
-
16. A system for generating a video stream for use in a video conference, comprising:
-
a camera sensor configured to capture first video data comprising a plurality of frames from a first field of view of a physical environment at a first resolution; control logic configured to; determine activity data for portions of the first video data across the plurality of frames; define a portion of the captured first video data to extract, based on the determined activity data; and extract the portion of the captured video data to create second video data, the second video data having less than all of a plurality of pixels of the captured video data; and video processing logic configured to; generate a video stream that comprises the second video data and audio data captured within the physical environment; and transmit the video stream to a video conferencing application executing on a user device, wherein the video conferencing application is configured to process the video stream as an input video stream to facilitate the transmission of the video stream to a remote video conferencing device for display. - View Dependent Claims (17, 18, 19, 20, 21)
-
-
23. A non-transitory computer-readable medium containing computer program code that, when executed by operation of one or more computer processors, performs an operation for generating a video stream for use in a video conference, the operation comprising:
-
receiving, by a first video conferencing endpoint device, first video data captured from a first field of view of a physical environment, the first video data comprising a plurality of frames; determining activity data from portions of the first video data using information provided in the plurality of frames; generating, by the first video conferencing endpoint device, second video data from a second field of view of the physical environment, based on the determined activity data; generating a video stream that comprises the generated second video data and audio data captured within the physical environment; and transmitting the video stream to a video conferencing application executing on a user device, wherein the video conferencing application is configured to process the video stream as an input video stream to facilitate the transmission of the video stream to a remote video conferencing device for display. - View Dependent Claims (24, 25, 26, 27, 28, 29)
-
Specification