SPEAKER ANTICIPATION
First Claim
1. A method of anticipating a video switch to accommodate a new speaker in a video conference comprising:
- analyzing a real time video stream captured by a camera local to a first videoconference endpoint according to at least one speaker anticipation model;
predicting, by the at least one speaker anticipation model, that a new speaker is about to speak, the at least one speaker anticipation model trained by a guided learning dataset from historical video feeds derived from a series of labeled video frames from the historical video feeds;
sending video of the anticipated new speaker to the conferencing server in response to a request for the video of the anticipated new speaker from the conferencing server; and
distributing the video of the anticipated new speaker to a second videoconference endpoint.
1 Assignment
0 Petitions
Accused Products
Abstract
Systems and methods are disclosed for anticipating a video switch to accommodate a new speaker in a video conference comprising a real time video stream captured by a camera local to a first videoconference endpoint is analyzed according to at least one speaker anticipation model. The speaker anticipation model predicts that a new speaker is about to speak. Video of the anticipated new speaker is sent to the conferencing server in response to a request for the video on the anticipated new speaker from the conferencing server. Video of the anticipated new speaker is distributed to at least a second videoconference endpoint.
29 Citations
20 Claims
-
1. A method of anticipating a video switch to accommodate a new speaker in a video conference comprising:
-
analyzing a real time video stream captured by a camera local to a first videoconference endpoint according to at least one speaker anticipation model; predicting, by the at least one speaker anticipation model, that a new speaker is about to speak, the at least one speaker anticipation model trained by a guided learning dataset from historical video feeds derived from a series of labeled video frames from the historical video feeds; sending video of the anticipated new speaker to the conferencing server in response to a request for the video of the anticipated new speaker from the conferencing server; and distributing the video of the anticipated new speaker to a second videoconference endpoint. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. At least one non-transitory computer readable medium comprising instructions that when executed cause at least one computing device to:
-
receive, from a first video conference endpoint participating in a video conference, a prediction that a new speaker is about to speak at the first video conference endpoint; determine an allocation of media bandwidth distributed to the first video conference endpoint and at least a second video conference endpoint participating in the video conference, wherein the allocation of media bandwidth of the first video conference endpoint is increased based on the strength of the prediction; and request, from the first video conference endpoint, upgraded video of the new speaker according to the allocation. - View Dependent Claims (9, 10, 11, 12, 13)
-
-
14. A videoconference system for anticipating a video switch to accommodate a new speaker in a video conference, the system comprising:
-
a first videoconference endpoint participating in a multi-endpoint meeting hosted by a videoconference server, the first videoconference endpoint configured to; analyze a real time video stream captured by a camera local to the first videoconference endpoint according to at least one speaker anticipation model; predict, by the at least one speaker anticipation model, that a new speaker is about to speak, the at least one speaker anticipation model trained by a guided learning dataset from historical video feeds derived from a series of labeled video frames from the historical video feeds; send video of the anticipated new speaker to the videoconference server in response to a request for the video of the anticipated new speaker from the videoconference server; and distribute the video of the anticipated new speaker to a second videoconference endpoint. - View Dependent Claims (15, 16, 17, 18, 19, 20)
-
Specification