Active speaker indicator for conference participants
First Claim
Patent Images
1. A system, comprising:
- a processor; and
a non-transitory computer-readable storage medium embodying software that is operable when executed by the processor to;
receive requests to join a conference from a plurality of user devices proximate a first endpoint, the requests comprising a username;
receive an audio signal for the conference from the first endpoint, the first endpoint operable to capture audio proximate the first endpoint;
transmit the audio signal to a second endpoint, remote from the first endpoint;
receive a plurality of audio energy values from the plurality of user devices proximate the first endpoint, the audio energy values associated with the audio signal;
identify an active speaker proximate the first endpoint based on the audio energy values received from the plurality of user devices; and
transmit an identity of the identified active speaker to the second endpoint while continuing to transmit audio signals received from the first endpoint to the second endpoint wherein;
the active speaker is a first active speaker;
identify a second active speaker proximate the first endpoint based on the audio energy values received from the plurality of user devices; and
transmit an identity for both the first active speaker and the second active speaker to the second endpoint.
1 Assignment
0 Petitions
Accused Products
Abstract
In one embodiment, a method includes receiving requests to join a conference from a plurality of user devices proximate a first endpoint. The requests include a username. The method also includes receiving an audio signal for the conference from the first endpoint. The first endpoint is operable to capture audio proximate the first endpoint. The method also includes transmitting the audio signal to a second endpoint, remote from the first endpoint. The method also includes identifying, by a processor, an active speaker proximate the first endpoint based on information received from the plurality of user devices.
17 Citations
21 Claims
-
1. A system, comprising:
-
a processor; and a non-transitory computer-readable storage medium embodying software that is operable when executed by the processor to; receive requests to join a conference from a plurality of user devices proximate a first endpoint, the requests comprising a username; receive an audio signal for the conference from the first endpoint, the first endpoint operable to capture audio proximate the first endpoint; transmit the audio signal to a second endpoint, remote from the first endpoint; receive a plurality of audio energy values from the plurality of user devices proximate the first endpoint, the audio energy values associated with the audio signal; identify an active speaker proximate the first endpoint based on the audio energy values received from the plurality of user devices; and transmit an identity of the identified active speaker to the second endpoint while continuing to transmit audio signals received from the first endpoint to the second endpoint wherein;
the active speaker is a first active speaker;
identify a second active speaker proximate the first endpoint based on the audio energy values received from the plurality of user devices; and
transmit an identity for both the first active speaker and the second active speaker to the second endpoint. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A method, comprising:
-
receiving requests to join a conference from a plurality of user devices proximate a first endpoint, the requests comprising a username; receiving an audio signal for the conference from the first endpoint, the first endpoint operable to capture audio proximate the first endpoint; transmitting the audio signal to a second endpoint, remote from the first endpoint; receiving a plurality of audio energy values from the plurality of user devices, the audio energy values associated with the audio signal; identifying, by a processor, an active speaker proximate the first endpoint based on the audio energy values received from the plurality of user devices; and transmit an identity of the identified active speaker to the second endpoint while continuing to transmit audio signals received from the first endpoint to the second endpoint wherein;
the active speaker is a first active speaker;
identifying a second active speaker proximate the first endpoint based on the audio energy values received from the plurality of user devices; and
transmitting an identity for both the first active speaker and the second active speaker to the second endpoint. - View Dependent Claims (7, 8, 9, 10)
-
-
11. One or more non-transitory computer-readable storage media embodying software that is operable when executed by a processor to:
-
receive requests to join a conference from a plurality of user devices proximate a first endpoint, the requests comprising a username; receive an audio signal for the conference from the first endpoint, the first endpoint operable to capture audio proximate the first endpoint; transmit the audio signal to a second endpoint, remote from the first endpoint; receive a plurality of audio energy values from the plurality of user devices, the audio energy values associated with the audio signal; identify an active speaker proximate the first endpoint based on the audio energy values received from the plurality of user devices; and transmit an identity of the identified active speaker to the second endpoint while continuing to transmit audio signals received from the first endpoint to the second endpoint wherein;
the active speaker is a first active speaker;
identify a second active speaker proximate the first endpoint based on the audio energy values received from the plurality of user devices; and
transmit an identity for both the first active speaker and the second active speaker to the second endpoint. - View Dependent Claims (12, 13, 14, 15)
-
-
16. A system, comprising:
-
a processor; and a non-transitory computer-readable storage medium embodying software that is operable when executed by the processor to; receive requests to join a conference from a plurality of user devices proximate a first endpoint, the requests comprising a username; receive registration audio signals associated with a plurality of users; generate voice identification information for the plurality of users based on the received registration audio signals; store the voice identification information in a database; receive an audio signal for the conference from the first endpoint, the first endpoint operable to capture audio proximate the first endpoint; transmit the audio signal to a second endpoint, remote from the first endpoint; identify an active speaker proximate the first endpoint based on the audio signal and the voice identification information; and transmit an identity of the identified active speaker to the second endpoint while continuing to transmit audio signals received from the first endpoint to the second endpoint wherein;
the active speaker is a first active speaker;
identify a second active speaker proximate the first endpoint based on the audio signals received from the first endpoint; and
transmit an identity for both the first active speaker and the second active speaker to the second endpoint. - View Dependent Claims (17, 18, 19, 20, 21)
-
Specification