×

Voice tracking camera with speaker identification

  • US 9,723,260 B2
  • Filed: 05/18/2010
  • Issued: 08/01/2017
  • Est. Priority Date: 05/18/2010
  • Status: Active Grant
First Claim
Patent Images

1. An automated videoconferencing method for a near-end endpoint located in a near-end environment at a near-end site to conduct a videoconference with one or more far-end endpoints located in one or more far-end environments at one or more far-end sites, the method comprising:

  • detecting one or more first audio indicative of speech during the videoconference with the near-end endpoint in the near-end environment at the near-end site;

    determining one or more first speech frequency characteristics of the one or more first audio, the one or more first speech frequency characteristics characterizing frequency information indicative of the speech detected in the one or more first audio;

    determining one or more first source locations of the one or more first audio with the near-end endpoint in the near-end environment at the near-end site;

    directing at least one camera of the near-end endpoint at at least one of the one or more first source locations in the near-end environment;

    storing the one or more first speech frequency characteristics with information of at least one camera directed at the at least one of the one or more first source locations;

    detecting second audio indicative of speech with the near-end endpoint in the near-end environment at the near-end site;

    determining a second speech frequency characteristic of the second audio, the second speech frequency characteristic characterizing frequency information indicative of the speech detected in the second audio; and

    determining that the second speech frequency characteristic does not match the stored one or more first speech frequency characteristics;

    determining a second source location of the second audio with the near-end endpoint in the near-end environment at the near-end site;

    directing the at least one camera of the near-end endpoint at the second source location in the near-end environment; and

    storing the second speech frequency characteristic with information of the at least one camera directed at the second source location.

View all claims
  • 10 Assignments
Timeline View
Assignment View
    ×
    ×