Image manipulating teleconferencing system
First Claim
1. A teleconferencing system which enables a conferee at a local terminal to enjoy the appearance of eye contact during a teleconference with a second conferee at a connected remote terminal, the system comprising:
- a terminal which comprises;
image display means for displaying an image;
image pickup means for producing a video signal representative of the image of the first conferee, the image pickup means being placed beyond the outside perimeter of the image on the image display means so as not to interfere with the viewing of the image display;
audio pickup means to produce an audio signal representative of speech and other sounds produced by the first conferee; and
audio reproduction means for audibly reproducing an audio signal from the remote terminal, the audio signal representative of speech and other sounds produced by the second conferee such that the first conferee and the second conferee can carry out a conversation;
digitizing means for creating a digital representation of the video signal from the image pickup means;
signal transmission means for processing audio and video signals from the local terminal and transmitting them to the remote terminal, and receiving audio and video signals from the remote terminal and processing them for use by the local terminal so that the local terminal is connected to the remote terminal;
orientation means for deriving data representing a spatial orientation of elements of the image of one of the first conferee, the second conferee, and both conferees from a digital representation of the video signal from their respective terminals so as to determine elements of one of the first conferee'"'"'s gaze, the second conferee'"'"'s gaze and both conferees'"'"' gazes, respectively;
image manipulation means for using the spatial orientation data to manipulate the digital image representation to create the appearance of eye contact between the first and second conferees to facilitate natural conversation.
4 Assignments
0 Petitions
Accused Products
Abstract
A teleconferencing system that allows for natural eye contact between conferees is provided. The system comprises two or more terminals which are connected to allow a teleconference to occur. Each terminal comprises a screen to display an image of the remote conferee and a video camera to transmit an image of the local conferee to the remote screen. The video camera is conveniently located above the screen, beyond the perimeter of the image. Microphones and speakers are also provided to allow the conferees to hear as well as speak to one another. Eye contact is provided, despite the mounting of the camera above eye level, by image manipulating the image of a conferee to remove any distortion caused by camera placement and to redirect the apparent direction of the conferee'"'"'s gaze. Image manipulation can also simulate zooming, tilting and panning of the camera by expanding that portion of the camera field that frames the conferee to fill the entire screen and by keeping the conferee'"'"'s image so centered in spite of movement by the conferee.
-
Citations
29 Claims
-
1. A teleconferencing system which enables a conferee at a local terminal to enjoy the appearance of eye contact during a teleconference with a second conferee at a connected remote terminal, the system comprising:
-
a terminal which comprises; image display means for displaying an image; image pickup means for producing a video signal representative of the image of the first conferee, the image pickup means being placed beyond the outside perimeter of the image on the image display means so as not to interfere with the viewing of the image display; audio pickup means to produce an audio signal representative of speech and other sounds produced by the first conferee; and audio reproduction means for audibly reproducing an audio signal from the remote terminal, the audio signal representative of speech and other sounds produced by the second conferee such that the first conferee and the second conferee can carry out a conversation; digitizing means for creating a digital representation of the video signal from the image pickup means; signal transmission means for processing audio and video signals from the local terminal and transmitting them to the remote terminal, and receiving audio and video signals from the remote terminal and processing them for use by the local terminal so that the local terminal is connected to the remote terminal; orientation means for deriving data representing a spatial orientation of elements of the image of one of the first conferee, the second conferee, and both conferees from a digital representation of the video signal from their respective terminals so as to determine elements of one of the first conferee'"'"'s gaze, the second conferee'"'"'s gaze and both conferees'"'"' gazes, respectively; image manipulation means for using the spatial orientation data to manipulate the digital image representation to create the appearance of eye contact between the first and second conferees to facilitate natural conversation. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A teleconferencing system which enables a first conferee at a local terminal to enjoy a teleconference with a second conferee at a connected remote terminal, the system comprising:
-
a terminal comprising; image display means for displaying an image; image pickup means for producing a video signal representative of the image of the first conferee, the image pickup means being equipped with a lens that produces a field of view substantially larger than that necessary to accommodate the first conferee; audio pickup means to produce an audio signal representative of speech and other sounds produced by the first conferee; and audio reproduction means for audibly reproducing an audio signal from the remote terminal, the signal representative of speech and other sounds produced by the second conferee such that the first and second conferees can carry out a conversation; signal transmission means for processing signals from a local terminal and transmitting them to a remote terminal, and receiving signals from the remote terminal and processing them for use by the local terminal so that the local terminal is connected to the remote terminal; digitizing means for creating a digital representation of the video signal from the image pickup means; orientation means for deriving data representing a spatial orientation of elements of the image of one of the first conferee, the second conferee, and both conferees from a digital representation of the video signal from their respective terminals; and image manipulation means for using the spatial orientation data to manipulate the digital image representation to extract a portion of the image immediately surrounding one of the first conferee, the second conferee, and both conferees and to cause the portion to fill the image display means of one of the remote terminal, the local terminal, and both terminals, respectively thereby affording the illusion of tilting, panning, and zooming of the image pickup device without physically moving the device. - View Dependent Claims (15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 29)
-
-
26. A teleconferencing system which enables a first conferee at a local terminal to enjoy an appearance of eye contact during a teleconference with a second conferee at a connected remote terminal, the system comprising:
-
a terminal which comprises; a monitor for displaying an image; a video camera for producing a video signal representative of the image of the first conferee, the video camera being placed beyond the outside perimeter of the image on the monitor so as not to interfere with the viewing of the image display; a microphone and associated electronics to produce an audio signal representative of speech and other sounds produced by the first conferee; and a speaker and associated electronics for audibly reproducing an audio signal from the remote terminal, the signal representative of speech and other sounds produced by the second conferee such that the first and the second conferees can carry out a conversation; a video digitizer for creating a digital representation of the video signal from the video camera; a system controller for controlling the system and processing the audio and video signals comprising; a central processing unit with memory for executing program instructions; pattern recognition means for deriving data representing a spatial orientation of elements of an image of one of the first conferee, the second conferee and both conferees from the digital representation of the video signal so as to determine elements of one of the first conferee, the second conferee, and both conferees, respectively characteristic of one of the first conferee'"'"'s gaze, the seconds conferee'"'"'s gaze, and both conferee'"'"'s gazes, respectively; and image manipulation means for using the spatial orientation data to manipulate the digital image representation to create a natural conversational appearance of eye contact between conferees at the connected local and remote terminals; and signal transmission means for processing signals from the system controller and transmitting them to the remote terminal, and receiving signals from the remote terminal and processing them for use by the system controller so that the local terminal is connected to the remote terminal. - View Dependent Claims (27, 28)
-
Specification