Using speaker clustering to switch between different camera views in a video conference system

  • US 9,633,270 B1
  • Filed: 04/05/2016
  • Issued: 04/25/2017
  • Est. Priority Date: 04/05/2016
  • Status: Active Grant
First Claim

1. A method comprising:

  • at a video conference endpoint including a microphone and a camera:

    detecting a talker position of a talker based on audio detected by the microphone;

    detecting one or more faces and face positions in video captured by the camera;

    determining whether a detected talker position matches any detected face position;

    if there is no match, performing speaker clustering across a speech segment in the detected audio and speech segments in previously detected audio;

    if the speaker clustering indicates the talker is known, determining whether the detected talker position matches a previous closeup position of the talker, wherein the previous closeup position is a previously detected talker position that matches a previously detected face; and

    based on results of the determining whether the detected talker position matches the previous closeup position of the talker, framing either a closeup camera view on the previous closeup position or a non-closeup camera view based on the detected talker position.
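
The decision flow recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. Several things here are assumptions made for illustration: positions are modeled as a single azimuth angle in degrees, `MATCH_TOLERANCE_DEG` is an invented matching tolerance, `cluster_speaker` is a hypothetical stand-in for the speaker clustering step (returning a speaker label, or `None` when the talker is not known), and `previous_closeups` is a hypothetical map from speaker label to that talker's previous closeup position. Claim 1 says only that the framing is "based on results" of the position comparison, so the mapping of a match to a closeup and a mismatch to a non-closeup view is also an assumption, as is the handling of the two branches the claim does not recite (talker matches a face, and talker unknown).

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional

# Hypothetical tolerance (degrees of azimuth) for treating two positions as a match.
MATCH_TOLERANCE_DEG = 10.0


@dataclass
class View:
    kind: str        # "closeup" or "non_closeup"
    position: float  # azimuth (degrees) that the view is framed on


def positions_match(a: float, b: float, tol: float = MATCH_TOLERANCE_DEG) -> bool:
    """Assumed matching rule: two azimuths match if they differ by at most tol."""
    return abs(a - b) <= tol


def choose_view(
    talker_pos: float,
    face_positions: List[float],
    previous_closeups: Dict[str, float],
    cluster_speaker: Callable[[], Optional[str]],
) -> View:
    # Does the audio-derived talker position match any detected face position?
    for face_pos in face_positions:
        if positions_match(talker_pos, face_pos):
            # Assumption: claim 1 does not recite this branch; a closeup on the
            # matching face is used here as a plausible default.
            return View("closeup", face_pos)

    # No match: run speaker clustering over the current and earlier speech segments.
    speaker_id = cluster_speaker()
    if speaker_id is None or speaker_id not in previous_closeups:
        # Assumption: the unknown-talker case is also outside claim 1.
        return View("non_closeup", talker_pos)

    # Known talker: compare the detected position with the previous closeup position.
    previous_pos = previous_closeups[speaker_id]
    if positions_match(talker_pos, previous_pos):
        # Assumed mapping: a match reuses the closeup on the previous position.
        return View("closeup", previous_pos)
    # Assumed mapping: a mismatch frames a non-closeup view on the detected position.
    return View("non_closeup", talker_pos)
```

For example, under these assumptions, a known talker "A" whose face is not currently detected but who speaks from near a previous closeup position at 32 degrees would get `choose_view(30.0, [90.0], {"A": 32.0}, lambda: "A")`, which frames a closeup on 32 degrees. The same talker heard from 120 degrees would instead get a non-closeup view framed on 120 degrees.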
