Active speaker indicator for conference participants

US 9,210,269 B2
Filed: 10/31/2012
Issued: 12/08/2015
Est. Priority Date: 10/31/2012
Status: Active Grant

First Claim

Patent Images

1. A system, comprising:

a processor; and

a non-transitory computer-readable storage medium embodying software that is operable when executed by the processor to;

receive requests to join a conference from a plurality of user devices proximate a first endpoint, the requests comprising a username;

receive an audio signal for the conference from the first endpoint, the first endpoint operable to capture audio proximate the first endpoint;

transmit the audio signal to a second endpoint, remote from the first endpoint;

receive a plurality of audio energy values from the plurality of user devices proximate the first endpoint, the audio energy values associated with the audio signal;

identify an active speaker proximate the first endpoint based on the audio energy values received from the plurality of user devices; and

transmit an identity of the identified active speaker to the second endpoint while continuing to transmit audio signals received from the first endpoint to the second endpoint wherein;

the active speaker is a first active speaker;

identify a second active speaker proximate the first endpoint based on the audio energy values received from the plurality of user devices; and

transmit an identity for both the first active speaker and the second active speaker to the second endpoint.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

In one embodiment, a method includes receiving requests to join a conference from a plurality of user devices proximate a first endpoint. The requests include a username. The method also includes receiving an audio signal for the conference from the first endpoint. The first endpoint is operable to capture audio proximate the first endpoint. The method also includes transmitting the audio signal to a second endpoint, remote from the first endpoint. The method also includes identifying, by a processor, an active speaker proximate the first endpoint based on information received from the plurality of user devices.

17 Citations

View as Search Results

21 Claims

1. A system, comprising:
- a processor; and
  
  a non-transitory computer-readable storage medium embodying software that is operable when executed by the processor to;
  
  receive requests to join a conference from a plurality of user devices proximate a first endpoint, the requests comprising a username;
  
  receive an audio signal for the conference from the first endpoint, the first endpoint operable to capture audio proximate the first endpoint;
  
  transmit the audio signal to a second endpoint, remote from the first endpoint;
  
  receive a plurality of audio energy values from the plurality of user devices proximate the first endpoint, the audio energy values associated with the audio signal;
  
  identify an active speaker proximate the first endpoint based on the audio energy values received from the plurality of user devices; and
  
  transmit an identity of the identified active speaker to the second endpoint while continuing to transmit audio signals received from the first endpoint to the second endpoint wherein;
  
  the active speaker is a first active speaker;
  
  identify a second active speaker proximate the first endpoint based on the audio energy values received from the plurality of user devices; and
  
  transmit an identity for both the first active speaker and the second active speaker to the second endpoint.
- View Dependent Claims (2, 3, 4, 5)
- - 2. The system of claim 1, wherein:
    - the software is further operable when executed to identify the active speaker proximate the first endpoint by comparing the plurality of audio energy values.
  - 3. The system of claim 2, wherein the software is further operable when executed to identify the active speaker proximate the first endpoint by calibrating the plurality of user devices.
  - 4. The system of claim 2, wherein the software is further operable when executed to identify the active speaker proximate the first endpoint by:
    - identifying a greatest audio energy value user device of the plurality of user devices; and
      
      identifying the username transmitted by the greatest audio energy value user device as the active speaker.
  - 5. The system of claim 1, wherein the software is further operable when executed to identify the active speaker proximate the first endpoint by:
    - receiving a first audio energy value from a first user device of the plurality of user devices;
      
      receiving a second audio energy value from a second user device of the plurality of user devices; and
      
      comparing the first audio energy value with the second audio energy value.

6. A method, comprising:
- receiving requests to join a conference from a plurality of user devices proximate a first endpoint, the requests comprising a username;
  
  receiving an audio signal for the conference from the first endpoint, the first endpoint operable to capture audio proximate the first endpoint;
  
  transmitting the audio signal to a second endpoint, remote from the first endpoint;
  
  receiving a plurality of audio energy values from the plurality of user devices, the audio energy values associated with the audio signal;
  
  identifying, by a processor, an active speaker proximate the first endpoint based on the audio energy values received from the plurality of user devices; and
  
  transmit an identity of the identified active speaker to the second endpoint while continuing to transmit audio signals received from the first endpoint to the second endpoint wherein;
  
  the active speaker is a first active speaker;
  
  identifying a second active speaker proximate the first endpoint based on the audio energy values received from the plurality of user devices; and
  
  transmitting an identity for both the first active speaker and the second active speaker to the second endpoint.
- View Dependent Claims (7, 8, 9, 10)
- - 7. The method of claim 6, wherein:
    - identifying the active speaker proximate the first endpoint based on the audio energy values received from the plurality of user devices comprises comparing the plurality of audio energy values.
  - 8. The method of claim 7, wherein identifying the active speaker proximate the first endpoint based on information received from the plurality of user devices further comprises calibrating the plurality of user devices.
  - 9. The method of claim 7, wherein identifying the active speaker proximate the first endpoint based on information received from the plurality of user devices further comprises:
    - identifying a greatest audio energy value user device of the plurality of user devices; and
      
      identifying the username transmitted by the greatest audio energy value user device as the active speaker.
  - 10. The method of claim 6, wherein identifying the active speaker proximate the first endpoint based on the audio energy values received from the plurality of user devices comprises:
    - receiving a first audio energy value from a first user device of the plurality of user devices;
      
      receiving a second audio energy value from a second user device of the plurality of user devices; and
      
      comparing the first audio energy value with the second audio energy value.

11. One or more non-transitory computer-readable storage media embodying software that is operable when executed by a processor to:
- receive requests to join a conference from a plurality of user devices proximate a first endpoint, the requests comprising a username;
  
  receive an audio signal for the conference from the first endpoint, the first endpoint operable to capture audio proximate the first endpoint;
  
  transmit the audio signal to a second endpoint, remote from the first endpoint;
  
  receive a plurality of audio energy values from the plurality of user devices, the audio energy values associated with the audio signal;
  
  identify an active speaker proximate the first endpoint based on the audio energy values received from the plurality of user devices; and
  
  transmit an identity of the identified active speaker to the second endpoint while continuing to transmit audio signals received from the first endpoint to the second endpoint wherein;
  
  the active speaker is a first active speaker;
  
  identify a second active speaker proximate the first endpoint based on the audio energy values received from the plurality of user devices; and
  
  transmit an identity for both the first active speaker and the second active speaker to the second endpoint.
- View Dependent Claims (12, 13, 14, 15)
- - 12. The media of claim 11, wherein:
    - the software is further operable when executed to identify the active speaker proximate the first endpoint by comparing the plurality of audio energy values.
  - 13. The media of claim 12, wherein the software is further operable when executed to identify the active speaker proximate the first endpoint by calibrating the plurality of user devices.
  - 14. The media of claim 12, wherein the software is further operable when executed to identify the active speaker proximate the first endpoint by:
    - identifying a greatest audio energy value user device of the plurality of user devices; and
      
      identifying the username transmitted by the greatest audio energy value user device as the active speaker.
  - 15. The media of claim 11, wherein the software is further operable when executed to identify the active speaker proximate the first endpoint by:
    - receiving a first audio energy value from a first user device of the plurality of user devices;
      
      receiving a second audio energy value from a second user device of the plurality of user devices; and
      
      comparing the first audio energy value with the second audio energy value.

16. A system, comprising:
- a processor; and
  
  a non-transitory computer-readable storage medium embodying software that is operable when executed by the processor to;
  
  receive requests to join a conference from a plurality of user devices proximate a first endpoint, the requests comprising a username;
  
  receive registration audio signals associated with a plurality of users;
  
  generate voice identification information for the plurality of users based on the received registration audio signals;
  
  store the voice identification information in a database;
  
  receive an audio signal for the conference from the first endpoint, the first endpoint operable to capture audio proximate the first endpoint;
  
  transmit the audio signal to a second endpoint, remote from the first endpoint;
  
  identify an active speaker proximate the first endpoint based on the audio signal and the voice identification information; and
  
  transmit an identity of the identified active speaker to the second endpoint while continuing to transmit audio signals received from the first endpoint to the second endpoint wherein;
  
  the active speaker is a first active speaker;
  
  identify a second active speaker proximate the first endpoint based on the audio signals received from the first endpoint; and
  
  transmit an identity for both the first active speaker and the second active speaker to the second endpoint.
- View Dependent Claims (17, 18, 19, 20, 21)
- - 17. The system of claim 16, wherein the software is further operable when executed to:
    - select a subset of the plurality of users; and
      
      identify the active speaker proximate the first endpoint based on the audio signal and the voice identification information for the subset of the plurality of users.
  - 18. The system of claim 17, wherein the software is further operable when executed to:
    - select the subset of the plurality of users based on the received requests to join the conference.
  - 19. The system of claim 17, wherein:
    - the database further stores location information associated with the plurality of users; and
      
      the software is further operable when executed to;
      
      determine a location of the first endpoint; and
      
      select the subset of the plurality of users based on the location information associated with the plurality of users.
  - 20. The system of claim 16, wherein the software is further operable when executed to:
    - receive active speaker detection feedback, the feedback indicating the accuracy of the active speaker identification; and
      
      update the voice identification information based on the active speaker detection feedback.
  - 21. The system of claim 16, wherein the software is further operable when executed to:
    - receive a video signal for the conference from the first endpoint;
      
      process the video signal to produce a processed video signal that includes active speaker identification information; and
      
      transmit the processed video signal to the second endpoint.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Cisco Technology, Inc. (Cisco Systems, Inc.)
Original Assignee
Cisco Technology, Inc. (Cisco Systems, Inc.)
Inventors
Liu, Yanghua, Chen, Weidong, Gandhi, Biren, Bhat, Raghurama, Khouri, Joseph Fouad, Houston, John Joseph, Toombs, Brian Thomas
Primary Examiner(s)
Patel, Hemant

Application Number

US13/664,640
Publication Number

US 20140118472A1
Time in Patent Office

1,133 Days
Field of Search

348 1401- 1416, 370259-271, 370351-357, 709201-207, 709217-248, 379/201.01, 37920201-20701, 704270-278
US Class Current

1/1
CPC Class Codes

G10L 17/00   Speaker identification or v...

H04M 3/563   User guidance or feature se...

H04N 7/15   Conference systems

Active speaker indicator for conference participants

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

17 Citations

21 Claims

Specification

Solutions

Use Cases

Quick Links

Active speaker indicator for conference participants

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

17 Citations

21 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links