Systems and methods for scalable composition of media streams for real-time multimedia communication
First Claim
1. A system, comprising:
- one or more media processing nodes, each configured to accept from a plurality of participants to a video conference a plurality of audio and video streams from a plurality of video conference endpoints each associated with one of the plurality of participants, wherein the plurality of video conference endpoints are of different types, and wherein each of the media processing nodes builds up a composite metadata field associated with the composite video stream in order to apply operations outlined in the metadata field to each of the audio and video streams of the composite video stream and each of the media processing nodes utilizes information from the composite metadata field for efficient transcoding and processing of the associated composite audio and video streams;
for each of the plurality of participants to the video conference, convert and compose the plurality of audio and video streams into one or more composite video and audio streams compatible with the video conference endpoint associated with the participant in real time;
render the composite video and audio stream compatible with the video conference endpoint associated with each of the plurality of participants to the video conference.
5 Assignments
0 Petitions
Accused Products
Abstract
A new approach is proposed that contemplates systems and methods to support the operation of a Virtual Media Room or Virtual Meeting Room (VMR), wherein each VMR can accept from a plurality of participants at different geographic locations a variety of video conferencing feeds from video conference endpoints that can be either proprietary or standards-based and enable a multi-party video conferencing session among the plurality of participants by composing one composite audio and video stream for each of the participants. Each single VMR can be implemented across an infrastructure of globally distributed set of servers/media processing nodes co-located in Points of Presence (POPs) for Internet access. Each VMR also gives its users a rich set of conferencing and collaboration interaction hitherto not experienced by video conferencing users.
51 Citations
27 Claims
-
1. A system, comprising:
-
one or more media processing nodes, each configured to accept from a plurality of participants to a video conference a plurality of audio and video streams from a plurality of video conference endpoints each associated with one of the plurality of participants, wherein the plurality of video conference endpoints are of different types, and wherein each of the media processing nodes builds up a composite metadata field associated with the composite video stream in order to apply operations outlined in the metadata field to each of the audio and video streams of the composite video stream and each of the media processing nodes utilizes information from the composite metadata field for efficient transcoding and processing of the associated composite audio and video streams; for each of the plurality of participants to the video conference, convert and compose the plurality of audio and video streams into one or more composite video and audio streams compatible with the video conference endpoint associated with the participant in real time; render the composite video and audio stream compatible with the video conference endpoint associated with each of the plurality of participants to the video conference. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
-
-
16. A method, comprising:
-
accepting from a plurality of participants to a video conference a plurality of audio and video streams from a plurality of video conference endpoints each associated with one of the plurality of participants, wherein the plurality of video conference endpoints are of different types; for each of the plurality of participants to the video conference, converting and composing the plurality of audio and video streams into one composite video and audio stream compatible with the video conference endpoint associated with the participant in real time; rendering the composite video and audio stream compatible with the video conference endpoint associated with each of the plurality of participants to the video conference; building up a composite metadata field associated with the composite video stream in order to apply operations outlined in the metadata field to each of the audio and video streams of the composite video stream; and utilizing information from the composite metadata field for efficient transcodinq and processing of the associated composite audio and video stream. - View Dependent Claims (17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27)
-
Specification