Audio selection based on user engagement
First Claim
1. A method comprising:
- receiving, during an audio-video communication session, audio input data from a microphone array comprising at least two microphones, wherein the audio input data is generated by a first sound source at a first location within an environment and a second sound source at a second location within the environment;
determining a first classification for the first sound source and a second classification for the second sound source;
predicting a first engagement metric for the first sound source and a second engagement metric for the second sound source, wherein;
the first engagement metric is based on the first classification and the second engagement metric is based on the second classification;
the first engagement metric approximates an interest level of a receiving user for the first sound source; and
the second engagement metric approximates an interest level from the receiving user for the second sound source;
determining that the first engagement metric is greater than the second engagement metric;
processing the audio input data to generate an audio output signal, wherein the audio output signal amplifies sound generated by the first sound source and attenuates sound generated by the second sound source; and
sending the audio output signal to a computing device associated with the receiving user.
2 Assignments
0 Petitions
Accused Products
Abstract
In one embodiment, a method includes receiving audio input data from a microphone array of at least two microphones. The audio input data is generated by a first sound source at a first location and a second sound source at a second location. The method also includes calculating a first engagement metric for the first sound source and a second engagement metric for the second sound source. The first engagement metric approximates an interest level of a receiving user for the first sound source, and the second engagement metric approximates an interest level from the receiving user for the second sound source. The method also includes determining that the first engagement metric is greater than the second engagement metric, and processing the audio input data to generate an audio output signal. The audio output signal may amplify sound generated by the first sound source relative to the second sound source.
-
Citations
17 Claims
-
1. A method comprising:
-
receiving, during an audio-video communication session, audio input data from a microphone array comprising at least two microphones, wherein the audio input data is generated by a first sound source at a first location within an environment and a second sound source at a second location within the environment; determining a first classification for the first sound source and a second classification for the second sound source; predicting a first engagement metric for the first sound source and a second engagement metric for the second sound source, wherein; the first engagement metric is based on the first classification and the second engagement metric is based on the second classification; the first engagement metric approximates an interest level of a receiving user for the first sound source; and the second engagement metric approximates an interest level from the receiving user for the second sound source; determining that the first engagement metric is greater than the second engagement metric; processing the audio input data to generate an audio output signal, wherein the audio output signal amplifies sound generated by the first sound source and attenuates sound generated by the second sound source; and sending the audio output signal to a computing device associated with the receiving user. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer-readable non-transitory storage medium embodying software that is operable when executed to:
-
receive, during an audio-video communication session, audio input data from a microphone array comprising at least two microphones, wherein the audio input data is generated by a first sound source at a first location within an environment and a second sound source at a second location within the environment; determine a first classification for the first sound source and a second classification for the second sound source; predict a first engagement metric for the first sound source and a second engagement metric for the second sound source, wherein; the first engagement metric is based on the first classification and the second engagement metric is based on the second classification; the first engagement metric approximates an interest level of a receiving user for the first sound source; and the second engagement metric approximates an interest level from the receiving user for the second sound source; determine that the first engagement metric is greater than the second engagement metric; process the audio input data to generate an audio output signal, wherein the audio output signal amplifies sound generated by the first sound source and attenuates sound generated by the second sound source; and send the audio output signal to a computing device associated with the receiving user. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A system comprising:
-
one or more processors; and a computer-readable non-transitory storage medium coupled to one or more of the processors and comprising instructions operable when executed by one or more of the processors to cause the system to; receive, during an audio-video communication session, audio input data from a microphone array comprising at least two microphones, wherein the audio input data is generated by a first sound source at a first location within an environment and a second sound source at a second location within the environment; determine a first classification for the first sound source and a second classification for the second sound source; predict a first engagement metric for the first sound source and a second engagement metric for the second sound source, wherein; the first engagement metric is based on the first classification and the second engagement metric is based on the second classification; the first engagement metric approximates an interest level of a receiving user for the first sound source; and the second engagement metric approximates an interest level from the receiving user for the second sound source; determine that the first engagement metric is greater than the second engagement metric; process the audio input data to generate an audio output signal, wherein the audio output signal amplifies sound generated by the first sound source and attenuates sound generated by the second sound source; and send the audio output signal to a computing device associated with the receiving user. - View Dependent Claims (16, 17)
-
Specification