System and method for performing distributed video conferencing
First Claim
1. An apparatus for executing a video conference, comprising:
- an audio mixer residing in a network and capable of determining the active speaker associated with one end point of one or more end points, the audio mixer being able to create a mixed audio stream that may be communicated to the one or more end points such that the one or more end points do not have to perform audio mixing, the mixed audio stream is a mix of input audio streams from the one or more end points, and the audio mixer being able to determine an input-to-output mapping for each input audio stream, the input-to-output mapping indicating the delay of the input audio stream to the communicated mixed audio stream; and
one or more media switches operable to receive information associated with the active speaker, the input-to-output mapping for the audio stream associated with the active speaker, and one or more video streams associated with the video conference, wherein at least one media switch of the one or more media switches is operable to suppress the one or more video streams except for a selected video stream associated with the active speaker and delay the selected video according to the input-to-output mapping such that the selected video stream corresponds to the audio stream of the active speaker, the selected video stream propagating to the one or more media switches during the video conference without audio, and wherein a selected media switch of the one or more media switches is operable to replicate the selected video stream such that it may be communicated to one or more end points associated with the selected media switch.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for executing a video conference is provided that includes receiving one or more audio streams associated with a video conference from one or more end points and determining an active speaker associated with one of the end points. Audio information associated with the active speaker may be received at one or more media switches. One or more video streams may be suppressed except for a selected video stream associated with the active speaker, the selected video stream propagating to one or more of the media switches during the video conference. The selected video stream may be replicated such that it may be communicated to one or more of the end points associated with a selected one of the media switches.
26 Citations
34 Claims
-
1. An apparatus for executing a video conference, comprising:
-
an audio mixer residing in a network and capable of determining the active speaker associated with one end point of one or more end points, the audio mixer being able to create a mixed audio stream that may be communicated to the one or more end points such that the one or more end points do not have to perform audio mixing, the mixed audio stream is a mix of input audio streams from the one or more end points, and the audio mixer being able to determine an input-to-output mapping for each input audio stream, the input-to-output mapping indicating the delay of the input audio stream to the communicated mixed audio stream; and one or more media switches operable to receive information associated with the active speaker, the input-to-output mapping for the audio stream associated with the active speaker, and one or more video streams associated with the video conference, wherein at least one media switch of the one or more media switches is operable to suppress the one or more video streams except for a selected video stream associated with the active speaker and delay the selected video according to the input-to-output mapping such that the selected video stream corresponds to the audio stream of the active speaker, the selected video stream propagating to the one or more media switches during the video conference without audio, and wherein a selected media switch of the one or more media switches is operable to replicate the selected video stream such that it may be communicated to one or more end points associated with the selected media switch. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A method for executing a video conference, comprising:
-
receiving, from an audio mixer, information associated with an active speaker, the audio mixer residing in a network and capable of determining the active speaker associated with one end point of one or more end points, and the audio mixer being able to create a mixed audio stream that may be communicated to the one or more end points such that the one or more end points do not have to perform audio mixing, the mixed audio stream is a mix of input audio streams from the one or more end points, and the audio mixer being able to determine an input-to-output mapping for each input audio stream, the input-to-output mapping indicating the delay of the input audio stream to the communicated mixed audio stream; receiving one or more video streams associated with the video conference and the input-to-output mapping for the audio stream associated with the active speaker from the one or more end points; suppressing the one or more video streams except for a selected video stream associated with the active speaker; delaying the selected video according to the input-to-putout mapping such that the selected video stream corresponds to the audio stream of the active speaker; propagating the selected video stream to one or more media switches during the video conference without audio; and replicating the selected video stream such that it may be communicated to one or more end points associated with a selected media switch of the one or more media switches. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18)
-
-
19. A system for executing a video conference, comprising:
-
means for receiving information associated with an active speaker, wherein the system includes an audio mixer that resides in a network and is capable of determining the active speaker associated with one end point of one or more end points, and the audio mixer being able to create a mixed audio stream that may be communicated to the one or more end points such that the one or more end points do not have to perform audio mixing, the mixed audio stream is a mix of input audio streams from the one or more end points, and the audio mixer being able to determine an input-to-output mapping for each input audio stream, the input-to-output mapping indicating the delay of the input audio stream to the communicated mixed audio stream; means for receiving one or more video streams associated with the video conference and the input-to-output mapping for the audio stream associated with the active speaker from the one or more end points; means for suppressing the one or more video streams except for a selected video stream associated with the active speaker; means for delaying the selected video according the input-to-output mapping such that the selected video stream corresponds to the selected audio stream of the active speaker; means for propagating the selected video stream to one or more media switches during the video conference without audio; and means for replicating the selected video stream such that it may be communicated to one or more end points associated with a selected media switch of the one or more media switches. - View Dependent Claims (20, 21, 22, 23, 24, 25, 26)
-
-
27. Software for executing a video conference, the software being embodied in a non-transitory computer readable medium and comprising computer code such that when executed by one or more processors is operable to:
-
receive, from an audio mixer, information associated with an active speaker, the audio mixer residing in a network and capable of determining the active speaker with one end point of one or more end points, and the audio mixer being able to create a mixed audio stream that may he communicated to the one or more end points such that the one or more end points do not have to perform audio mixing, the mixed audio stream is a mix of input audio streams from the one or more end points, and the audio mixer being able to determine an input-to-output mapping for each input audio stream, the input-to-output mapping indicating the delay of the input audio stream to the communicated mixed audio stream; receive one or more video streams associated with the video conference and the input-to-output mapping for the audio stream associated with the active speaker from the one or more end points; suppress the one or more video streams except for a selected video stream associated with the active speaker; delaying the selected video according to the input-to-putout mapping such that the selected video stream corresponds to the audio stream of the active speaker; propagate the selected video stream to one or more media switches during the video conference without audio; and replicate the selected video stream such that it may he communicated to one or more end points associated with a selected media switch of the one or more media switches. - View Dependent Claims (28, 29, 30, 31, 32, 33, 34)
-
Specification