Speaker anticipation

  • US 10,477,148 B2
  • Filed: 07/11/2017
  • Issued: 11/12/2019
  • Est. Priority Date: 06/23/2017
  • Status: Active Grant
First Claim
Patent Images

1. A method of anticipating a video switch to accommodate a new speaker in a video conference comprising:

  • analyzing, by at least one speaker anticipation model, a real time video stream captured by a camera local to a first videoconference endpoint;

    predicting, by the at least one speaker anticipation model, that a new speaker is about to speak, the at least one speaker anticipation model trained by a guided learning dataset from historical video feeds derived from a series of labeled video frames from the historical video feeds of the new speaker, each of the labeled video frames including one of two different labels based on audio;

    sending video of the new speaker to a conferencing server in response to a request for the video of the new speaker from the conferencing server; and

    distributing, via the conferencing server, the video of the new speaker to a second videoconference endpoint,wherein,each of the labeled video frames is default-labeled as a pre-speaking frame except for any of the labeled video frames including audio from a same endpoint, andeach of the labeled video frames including the audio from the same endpoint is labeled as speaking frames.

View all claims
    ×
    ×

    Thank you for your feedback

    ×
    ×