NUI video conference controls
First Claim
1. A method of providing a video conference between a local location and a remote location, the method comprising:
- receiving at the local location, video and audio signal obtained from a remote capture device disposed at the remote location, the obtained signals including those that are representative of real sights and real sound captured at the remote location by the remote capture device and are thus representative of real aspects of the remote location, the remote capture device being part of a remote system that is operatively coupled via a network to a local system having a local capture device disposed at the local location, the remote capture device including a respective remote depth camera;
presenting at the local location, a respective local representation of the remote location including that derived from the obtained video and audio signal that are respectively representative of real sights and real sound as captured by the remote capture device, the local representation including a displayed representation presented on a local display proximate to the local capture device;
tracking movements of a local user in a space proximate to the local capture device; and
responsive to a user'"'"'s local gesture detected by the local capture device, controlling the remote system to alter how it captures and processes reality representing signals at the remote location including altering processing of audio or visual signals by the remote system such that the respective local representation of real aspects of the remote location is correspondingly altered, said altering including altering of processing of depth based visual signals having a depth attribute provided by the remote depth camera.
3 Assignments
0 Petitions
Accused Products
Abstract
A system and method providing gesture controlled video conferencing includes a local capture device detecting movements of a user in a local environment and an audio/visual display. A processor is coupled to the capture device and a remote capture device and a remote processor at a remote environment via a network. The local processor includes instructions to render a representation of the remote environment on the display responsive to the remote processor and remote capture device. The processor also tracks movements of a local user in a space proximate to the local capture device. Responsive to a user gesture detected at the local capture device, the audio or visual signals provided by the remote capture device are altered to change the representation of the remote location is altered locally.
-
Citations
20 Claims
-
1. A method of providing a video conference between a local location and a remote location, the method comprising:
-
receiving at the local location, video and audio signal obtained from a remote capture device disposed at the remote location, the obtained signals including those that are representative of real sights and real sound captured at the remote location by the remote capture device and are thus representative of real aspects of the remote location, the remote capture device being part of a remote system that is operatively coupled via a network to a local system having a local capture device disposed at the local location, the remote capture device including a respective remote depth camera; presenting at the local location, a respective local representation of the remote location including that derived from the obtained video and audio signal that are respectively representative of real sights and real sound as captured by the remote capture device, the local representation including a displayed representation presented on a local display proximate to the local capture device; tracking movements of a local user in a space proximate to the local capture device; and responsive to a user'"'"'s local gesture detected by the local capture device, controlling the remote system to alter how it captures and processes reality representing signals at the remote location including altering processing of audio or visual signals by the remote system such that the respective local representation of real aspects of the remote location is correspondingly altered, said altering including altering of processing of depth based visual signals having a depth attribute provided by the remote depth camera. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A gesture controlled video conferencing apparatus configured for providing a video conference between a local environment and a remote environment, the apparatus comprising:
-
a local capture device configured to detect movements of a local user in the local environment; an audio/visual display disposed in the local environment; a processor coupled to the local capture device, the processor being further coupled via a network to a combination of a remote capture device and a remote processor disposed at the remote environment, the remote capture device including a respective remote depth camera, the processor including code instructing the processor to; render a representation of the remote environment on the audio/visual display, the rendered representation including at least one of reality representing audio and video portions corresponding to real sights and real sounds captured and processed at the remote environment; track movements of the local user in a space proximate to the local capture device; and responsive to a predetermined user gesture detected at the local capture device, alter a corresponding visual portion of the rendered representation of real sights and real sounds captured and processed at the remote environment, the altering at the remote environment including altering of processing of depth based visual signals having a depth attribute provided by the remote depth camera, whereby due to the altering at the remote environment, the reality representing representation of the real sights of the remote environment is altered locally on the audio/visual display. - View Dependent Claims (11, 12, 13, 14, 15)
-
-
16. A gesture controlled video conferencing apparatus configured for providing a video conference between a local first physical environment and a remote second physical environment, the apparatus comprising:
-
a depth and image capture device configured to detect movements of a user in the first physical environment; an audio/visual display disposed in the first physical environment; a first processor coupled to the capture device and the audio/visual display, the processor being further coupled via a network to a second depth and image capture device and a second processor disposed in the remote second physical environment, the second depth and image capture device being configured to capture real aspects of the second physical environment and to provide corresponding audio visual signals, the first processor including code instructing the first processor to; render a representation of real aspects of the second physical environment on the display responsive to the corresponding audio visual signals provided by the second depth and image capture device; present a user interface on the audio/visual display, the user interface responsive to gestures from the user; track movements of the user in a space proximate to the first capture device and responsive to a predetermined user gesture by the user, presenting a user interface allowing the user a set of one or more controls for altering how the second depth and image capture device is used for capturing and processing reality representing signals at the second physical environment to thereby alter the representation of the real aspects of the remote second physical environment as rendered on the audio/visual display. - View Dependent Claims (17, 18, 19, 20)
-
Specification