Acoustic echo cancellation based sub band domain active speaker detection for audio and video conferencing applications

  • US 10,771,621 B2
  • Filed: 04/02/2018
  • Issued: 09/08/2020
  • Est. Priority Date: 10/31/2017
  • Status: Active Grant
First Claim
Patent Images

1. A method for detecting an active speaker in at least a two-way conference comprising:

  • analyzing real time audio in one or more sub band domains according to an echo canceller model, wherein the echo canceller model includes at least in part processing the real time audio using an acoustic echo cancellation linear filter;

    determining, based on the analyzed real time audio, one or more audio metrics;

    weighting, via a trained machine learning model, the one or more audio metrics based on importance of the one or more audio metrics for active speaker determination in the one or more sub band domains;

    summing the one or more weighted audio metrics;

    comparing the one or more summed weighted audio metrics and a hysteresis threshold;

    in response to the one or more summed weighted audio metrics being greater than the hysteresis threshold, determining a speaker status as active; and

    in response to the speaker status being active, removing one or more of residual echo or noise from the real time audio based on the weighted one or more audio metrics.

View all claims

    Thank you for your feedback