Image displaying apparatus
Abstract
The present invention aims to display a realistic image of a conversation scene in which viewers can visually identify the speaker regardless of the content of the image. To this end, a two-dimensional or three-dimensional face model is deformed to create animations A1i through A3i, which express a state in which a person is speaking, and these animations are displayed as auxiliary images.
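As an illustration of deforming a two-dimensional face model in step with voice information, the following is a minimal sketch, not the patented method. It assumes a face model made of named control points and assumes that mouth opening scales with the loudness (RMS level) of an audio frame; the abstract does not state how the deformation is driven. The names `FaceModel2D`, `rms`, `deform_mouth`, and `max_open_px` are hypothetical.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class FaceModel2D:
    """Hypothetical 2D face model: named control points in pixel coordinates."""
    points: dict  # name -> (x, y)


def rms(samples):
    """Root-mean-square level of one audio frame (samples in [-1.0, 1.0])."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def deform_mouth(model, samples, max_open_px=12.0):
    """Return a new model whose lips are moved apart in proportion to loudness.

    Assumption: loudness alone drives the mouth; a real system could use
    phoneme or viseme information instead.
    """
    opening = min(1.0, rms(samples)) * max_open_px
    points = dict(model.points)  # copy so the neutral model is not modified
    ux, uy = points["upper_lip"]
    lx, ly = points["lower_lip"]
    points["upper_lip"] = (ux, uy - opening / 2)  # image y grows downward
    points["lower_lip"] = (lx, ly + opening / 2)
    return FaceModel2D(points)


neutral = FaceModel2D({"upper_lip": (50.0, 70.0), "lower_lip": (50.0, 72.0)})
silent = deform_mouth(neutral, [0.0, 0.0, 0.0, 0.0])
loud = deform_mouth(neutral, [0.5, -0.5, 0.5, -0.5])
```

Calling `deform_mouth` once per audio frame and rendering each returned model in sequence would yield an animation of the kind the abstract describes.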
20 Claims
1. An actually shot image displaying apparatus comprising:
a first displaying portion at which an actually shot image is displayed;
a voice information obtainer for obtaining voice information;
an image creator for creating an animated face image which expresses appearances of speech by a person by means of deforming a two-dimensional face model or a three-dimensional face model in accordance with the obtained voice information; and
a second displaying portion at which a plurality of animated face images are displayed and laid out as an auxiliary image including the animated face image which expresses appearances of speech by a person, the plurality of animated face images correspond to persons in the actually shot image, wherein
the first displaying portion and second displaying portion are arranged on a same display screen, and the animated face image, which expresses appearances of speech by a person and corresponds to a person who is speaking, is displayed distinctively from the other animated face images; and
when a person without an associated face model Mp registered is included in those who are to be displayed, a texture extracted from a main image may be pasted to a standard model to thereby create a face model of this person.
Dependent claims: 2, 3, 4, 5.
6. An actually shot image displaying apparatus comprising:
a first displaying portion at which an actually shot image is displayed;
a voice information obtainer for obtaining voice information;
an image creator for creating an animated face image which expresses appearances of speech by a person by means of deforming a two-dimensional face model or a three-dimensional face model in accordance with the obtained voice information;
a second displaying portion at which a plurality of animated face images are displayed and laid out as an auxiliary image including the animated face image which expresses appearances of speech by a person, the plurality of animated face images correspond to persons in the actually shot image; and
a determiner for determining whether a direction of a person represented in the actually shot image is a full-face pose or not, wherein
the first displaying portion and second displaying portion are arranged on a same display screen, and the animated face image, which expresses appearances of speech by a person and corresponds to a person who is speaking, is displayed distinctively from the other animated face images, and
the second displaying portion displays an animated face image of a person whose direction is determined as a not full-face pose by the determiner and does not display an animated face image of a person whose direction is determined as a full-face pose, as the actually shot image will sufficiently show facial expressions when the person is speaking.
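The selection rule in claim 6 can be sketched as a small filter. This is only an illustration under stated assumptions: the claim does not say how the determiner decides the pose, so the yaw-angle test and its 15-degree threshold below are hypothetical, as are the names `Person`, `is_full_face`, and `auxiliary_entries`.

```python
from dataclasses import dataclass


@dataclass
class Person:
    name: str
    yaw_deg: float   # assumed head rotation away from the camera, in degrees
    speaking: bool


def is_full_face(person, threshold_deg=15.0):
    """Hypothetical determiner: treat small yaw angles as a full-face pose."""
    return abs(person.yaw_deg) <= threshold_deg


def auxiliary_entries(persons):
    """Return (name, highlighted) pairs for the auxiliary display area.

    Persons already shown full-face in the actually shot image are skipped;
    the entry of the person who is speaking is flagged for distinctive display.
    """
    return [(p.name, p.speaking) for p in persons if not is_full_face(p)]


scene = [
    Person("A", yaw_deg=5.0, speaking=False),    # full-face, not duplicated
    Person("B", yaw_deg=60.0, speaking=True),    # turned away and speaking
    Person("C", yaw_deg=-40.0, speaking=False),  # turned away, silent
]
entries = auxiliary_entries(scene)
```

Here person A is omitted from the auxiliary area because the main image already shows the face, while B and C receive animated face images and B's is marked for highlighting.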
7. An image creating system comprising:
an image acquiring portion for acquiring image signals representing an actually shot image;
a voice acquiring portion for acquiring voice signals for representing voice accompanying the actually shot image;
a model storing memory for storing a plurality of models;
a specifying processor for specifying a model from the plurality of models stored in the model storing memory on the basis of the acquired voice signals;
a memory controller for reading out the specified model from the model storing memory;
an image creating processor for creating an animated face image by deforming the read out model; and
a display including a first displaying portion for displaying the image signals representing the actually shot image, and a second displaying portion for displaying, as an auxiliary image, a plurality of animated face images including the animated face image created by deforming the read out model, the plurality of animated face images correspond to persons in the actually shot image, wherein
the first displaying portion and second displaying portion are arranged together on the display, and the animated face image created by deforming the read out model, which corresponds to a person who is speaking, is displayed distinctively from the other animated face images; and
when a person without an associated face model Mp registered is included in those who are to be displayed, a texture extracted from a main image may be pasted to a standard model to thereby create a face model of this person.
Dependent claims: 8, 9, 10, 11, 12, 16, 17.
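The model read-out and the standard-model fallback recited in claim 7 might be sketched as a lookup with a default. This is a hedged illustration only: the in-memory dictionary standing in for the model storing memory, the string placeholders for mesh and texture data, and the names `FaceModel`, `STANDARD_MODEL`, `get_face_model`, and `extract_texture` are all assumptions. How a person is identified from the voice signals is outside this sketch and is represented by a plain `person_id`.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class FaceModel:
    owner: str
    mesh: str      # placeholder for geometry data
    texture: str   # placeholder for image data


# Assumed generic model used when no personal model is registered.
STANDARD_MODEL = FaceModel(owner="standard", mesh="standard-mesh", texture="generic")


def get_face_model(store, person_id, extract_texture):
    """Read out the registered model, or build one from the standard model.

    `store` stands in for the model storing memory; `extract_texture` is a
    caller-supplied function that crops the person's face from the main image.
    """
    model = store.get(person_id)
    if model is None:
        # Paste the extracted texture onto the standard geometry.
        model = replace(STANDARD_MODEL, owner=person_id,
                        texture=extract_texture(person_id))
        store[person_id] = model  # register it for later frames
    return model


store = {"alice": FaceModel("alice", "alice-mesh", "alice-tex")}
registered = get_face_model(store, "alice", lambda pid: "crop:" + pid)
created = get_face_model(store, "bob", lambda pid: "crop:" + pid)
```

Registering the newly created model in the store is a design choice of this sketch so that the texture is extracted only once per person; the claim itself does not require it.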
13. An actually shot image displaying system for displaying a plurality of persons, the system comprising:
a main image displaying portion for displaying an actually shot image which represents positional relationship of a plurality of persons;
a speaker for producing sounds which represent voices of the plurality of persons;
an auxiliary image displaying portion which has a plurality of frames each of which displays an animated full-face pose image of a person, the animated full-face pose image in each of the plurality of frames corresponds to a person in the actually shot image;
a determining processor for determining from which person of the plurality of the persons a voice is produced, on the basis of the voice to be represented by the speaker;
a creating processor for creating an animated full-face pose image of a person determined by the determining processor as the person from which the voice is produced; and
a display controller for controlling the auxiliary image displaying portion to display the created animated full-face pose image within one frame of the plurality of frames, wherein
the main image displaying portion and the auxiliary image displaying portion are arranged on a same display screen and the created animated full-face pose image of the person determined as the person from which the voice is produced is displayed distinctively from the other animated full-face pose images; and
when a person without an associated face model Mp registered is included in those who are to be displayed, a texture extracted from a main image may be pasted to a standard model to thereby create a face model of this person.
Dependent claims: 14, 15.
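A rough sketch of the determining processor and display controller of claim 13 follows. It assumes that a per-person voice level is already available for the current moment (for example from separate audio channels), which the claim does not specify; the silence threshold and the names `determine_speaker` and `layout_frames` are likewise hypothetical.

```python
def determine_speaker(levels, silence_threshold=0.1):
    """Return the person with the highest voice level, or None if all are quiet.

    `levels` maps person name -> assumed voice level in [0.0, 1.0].
    """
    if not levels:
        return None
    name = max(levels, key=levels.get)
    return name if levels[name] >= silence_threshold else None


def layout_frames(persons, speaker):
    """Assign one auxiliary frame per person and flag the speaker's frame."""
    return [
        {"frame": index, "person": name, "highlight": name == speaker}
        for index, name in enumerate(persons)
    ]


persons = ["A", "B", "C"]
speaker = determine_speaker({"A": 0.02, "B": 0.65, "C": 0.10})
frames = layout_frames(persons, speaker)
quiet = determine_speaker({"A": 0.01, "B": 0.03})
```

The `highlight` flag is where a renderer could apply whatever distinctive treatment is chosen, such as a colored border or enlarged frame.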
18. An actually shot image displaying apparatus, comprising:
a visual information input section which obtains visual information by actually shooting an image with a camera;
an audio information input section which obtains audio information;
an image generator which selects a face model stored in a memory, and deforms the face model selected based on the audio information to create an animated face image; and
a controller which controls to display on a display device an image of the actually shot image together with an auxiliary image having a plurality of animated face images including the animated face image created by the image generator, the plurality of animated face images correspond to persons in the actually shot image, wherein
the animated face image created by the image generator, which corresponds to a person who is speaking, is displayed distinctively from the other animated face images; and
when a person without an associated face model Mp registered is included in those who are to be displayed, a texture extracted from a main image may be pasted to a standard model to thereby create a face model of this person.
Dependent claims: 19, 20.
Specification