Spatially correlated audio in multipoint videoconferencing
First Claim
1. A method for controlling a first endpoint having a plurality of loudspeakers spatially arranged with respect to a screen in a multipoint videoconference between the first endpoint and a plurality of endpoints, comprising:
- receiving at a multipoint control unit compressed audio and video image signals from the first endpoint and the plurality of endpoints;
decoding the audio and video image signals;
assessing from the audio signals which of the plurality of endpoints comprises a speaking endpoint;
generating a video layout for the first endpoint, the layout positioning video images from two or more of the plurality of endpoints at different positions in the layout;
processing one or more of the plurality of decoded audio signals in one or more channels, each channel corresponding to a speaker at the first endpoint,to generate a perception that the audio stream emanates from a position in the layout corresponding to a video image from the speaking endpointencoding the video layout and the plurality of processed audio signals; and
transmitting the encoded signals to the first endpoint.
10 Assignments
0 Petitions
Accused Products
Abstract
The disclosed method provides audio location perception to an endpoint in a multipoint videoconference by providing a plurality of audio streams to the endpoint, wherein each of the audio streams corresponds to one of the loudspeakers at the endpoint. The audio streams are differentiated so as to emphasize broadcasting of the audio streams through one or more loudspeakers closest to a position of a speaking endpoint in a videoconference layout that is displayed at the endpoint. For example, the audio broadcast at a loudspeaker that is at a far-side of the screen might be attenuated or time delayed compared to audio broadcast at a loudspeaker that is located at a near-side of the display. The disclosure also provides a multipoint control unit (MCU) that processes audio signals from two or more endpoints according to the positions in a layout of the endpoints and then transmits processed audio streams to the endpoints in a way that allows endpoints to broadcast spatially correlated audio.
-
Citations
8 Claims
-
1. A method for controlling a first endpoint having a plurality of loudspeakers spatially arranged with respect to a screen in a multipoint videoconference between the first endpoint and a plurality of endpoints, comprising:
-
receiving at a multipoint control unit compressed audio and video image signals from the first endpoint and the plurality of endpoints; decoding the audio and video image signals; assessing from the audio signals which of the plurality of endpoints comprises a speaking endpoint; generating a video layout for the first endpoint, the layout positioning video images from two or more of the plurality of endpoints at different positions in the layout; processing one or more of the plurality of decoded audio signals in one or more channels, each channel corresponding to a speaker at the first endpoint, to generate a perception that the audio stream emanates from a position in the layout corresponding to a video image from the speaking endpoint encoding the video layout and the plurality of processed audio signals; and transmitting the encoded signals to the first endpoint. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
Specification