Spatially correlated audio in multipoint videoconferencing
Abstract
The disclosed method provides audio location perception to an endpoint in a multipoint videoconference by providing a plurality of audio streams to the endpoint, wherein each of the audio streams corresponds to one of the loudspeakers at the endpoint. The audio streams are differentiated so as to emphasize broadcasting of the audio streams through one or more loudspeakers closest to a position of a speaking endpoint in a videoconference layout that is displayed at the endpoint. For example, the audio broadcast at a loudspeaker that is at a far-side of the screen might be attenuated or time delayed compared to audio broadcast at a loudspeaker that is located at a near-side of the display. The disclosure also provides a multipoint control unit (MCU) that processes audio signals from two or more endpoints according to the positions in a layout of the endpoints and then transmits processed audio streams to the endpoints in a way that allows endpoints to broadcast spatially correlated audio.
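The attenuation and time-delay example in the abstract can be illustrated with a minimal sketch for a two-loudspeaker endpoint. Everything named here is an assumption made for illustration and not taken from the disclosure: the function `spatialize_stereo`, the normalized horizontal position `x_pos` (0.0 at the left edge of the layout, 1.0 at the right edge), the linear attenuation law, and the default values of `max_attenuation` and `max_delay_samples`.

```python
from typing import List, Tuple


def spatialize_stereo(
    samples: List[float],
    x_pos: float,
    max_attenuation: float = 0.5,
    max_delay_samples: int = 24,
) -> Tuple[List[float], List[float]]:
    """Return (left, right) streams for one mono frame.

    x_pos is the assumed horizontal centre of the speaking endpoint's
    image in the layout: 0.0 = left edge, 1.0 = right edge.
    """
    if not 0.0 <= x_pos <= 1.0:
        raise ValueError("x_pos must be in [0, 1]")

    def render(distance: float) -> List[float]:
        # distance: 0.0 means the image is beside this loudspeaker,
        # 1.0 means it is on the opposite side of the screen.
        gain = 1.0 - max_attenuation * distance      # attenuate the far side
        delay = round(max_delay_samples * distance)  # and delay it
        delayed = [0.0] * delay + samples
        # Truncate so the output frame has the same length as the input.
        return [gain * s for s in delayed[: len(samples)]]

    left = render(x_pos)
    right = render(1.0 - x_pos)
    return left, right
```

With `x_pos = 0.0` the left stream passes through unchanged while the right stream is both attenuated and delayed; with `x_pos = 0.5` the two streams are identical. A real system would likely smooth gain and delay changes between frames to avoid audible clicks, which this sketch does not attempt.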
17 Claims
1. A method for controlling a first endpoint in a multipoint videoconference, the first endpoint comprising a plurality of loudspeakers spatially arranged with respect to a screen, comprising:
receiving audio and video image signals from a plurality of endpoints;
assessing from the audio signals which of the plurality of endpoints comprises a speaking endpoint;
generating a video layout for the first endpoint, the layout positioning video images from one or more of the plurality of endpoints at different positions in the layout; and
generating a plurality of audio streams for the first endpoint, each one of the plurality of audio streams corresponding to one of the plurality of loudspeakers, wherein the audio streams are differentiated so as to generate a perception that the audio stream emanates from a position in the layout corresponding to a video image from the speaking endpoint.
Dependent claims: 2, 3, 4, 5, 6, 7
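The assessing and layout steps of claim 1 might be sketched as follows. This is only an illustration under stated assumptions: the claim does not say how the speaking endpoint is assessed or how the layout is arranged, so the RMS energy threshold, the row-major grid, and the helper names `rms`, `find_speaking_endpoint`, and `grid_layout` are all hypothetical.

```python
import math
from typing import Dict, List, Optional, Tuple


def rms(frame: List[float]) -> float:
    """Root-mean-square level of one audio frame (0.0 for an empty frame)."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def find_speaking_endpoint(
    frames: Dict[str, List[float]], threshold: float = 0.01
) -> Optional[str]:
    """Return the id of the loudest endpoint, or None if all are near silent.

    Assumption: loudest-by-RMS stands in for whatever assessment a real
    system would use.
    """
    if not frames:
        return None
    loudest = max(frames, key=lambda ep: rms(frames[ep]))
    return loudest if rms(frames[loudest]) >= threshold else None


def grid_layout(
    endpoint_ids: List[str], columns: int
) -> Dict[str, Tuple[float, float]]:
    """Place endpoints row by row in a grid.

    Returns the normalized (x, y) centre of each endpoint's cell, with
    (0, 0) at the top-left of the layout and (1, 1) at the bottom-right.
    """
    rows = math.ceil(len(endpoint_ids) / columns)
    layout = {}
    for i, ep in enumerate(endpoint_ids):
        row, col = divmod(i, columns)
        layout[ep] = ((col + 0.5) / columns, (row + 0.5) / rows)
    return layout


frames = {"A": [0.0, 0.0], "B": [0.5, -0.5], "C": [0.1, 0.1]}
speaker = find_speaking_endpoint(frames)
layout = grid_layout(["A", "B", "C", "D"], columns=2)
speaker_position = layout[speaker] if speaker is not None else None
```

The resulting `speaker_position` is the layout position that the final step of the claim would use when differentiating the per-loudspeaker audio streams.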
8. A method for providing audio location perception to a first endpoint in a multipoint video conference, the endpoint comprising a plurality of loudspeakers, the method comprising:
providing a plurality of audio streams to the first endpoint, each one of the plurality of audio streams corresponding to one of the plurality of loudspeakers, wherein the audio streams are differentiated so as to emphasize broadcasting of the audio stream through one or more loudspeakers closest to a position of a speaking endpoint in a videoconference layout.
Dependent claims: 9, 10, 11, 12, 13, 14, 15, 16, 17
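One way to picture "emphasizing" the loudspeakers closest to the speaking endpoint's position, for any number of loudspeakers, is a per-loudspeaker gain that falls off with distance. The sketch below is an assumption for illustration only: the claim does not prescribe a gain law, and the function `loudspeaker_gains`, the normalized horizontal coordinates, and the `floor` parameter are hypothetical.

```python
from typing import List


def loudspeaker_gains(
    speaker_x: List[float], image_x: float, floor: float = 0.2
) -> List[float]:
    """Return one gain per loudspeaker.

    speaker_x holds each loudspeaker's horizontal position and image_x the
    position of the speaking endpoint's image, both normalized to [0, 1].
    The closest loudspeaker gets gain 1.0; the gain falls linearly with
    distance down to `floor` for the farthest loudspeaker.
    """
    if not speaker_x:
        return []
    distances = [abs(x - image_x) for x in speaker_x]
    nearest, farthest = min(distances), max(distances)
    if farthest == nearest:
        # All loudspeakers are equally close, so none is emphasized.
        return [1.0] * len(speaker_x)
    span = farthest - nearest
    return [1.0 - (1.0 - floor) * (d - nearest) / span for d in distances]


# Three loudspeakers across the screen, image at the left edge.
gains = loudspeaker_gains([0.0, 0.5, 1.0], image_x=0.0)
```

Each audio stream would then be the speaking endpoint's signal scaled by the matching gain, so the loudspeaker nearest the image dominates.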
Specification