Image processing device, animation display method and computer readable medium
First Claim
1. An image processing device comprising:
- a processor configured to:
detect words or phrases within text of a sentence or a clause, wherein the text corresponds to audio to be reproduced;
determine, for at least one of the words or phrases detected within the text of the sentence or the clause, a corresponding one of a plurality of word/phrase-expressions;
determine that at least one of the words or phrases within the text of the sentence or the clause is a context-dependent word/phrase-expression;
assign a most frequent one of the word/phrase-expression determined for the at least one of the words or phrases detected, while ignoring the context-dependent word/phrase-expression determined, as one of a plurality of sentence/clause-expressions to the text of the sentence or the clause; and
generate frames of animation of a face of increased expressiveness to be displayed in sync with a reproduction of the audio, by at least performing:
generate a mouth shape of a mouth of the face for each of the frames based on the words or phrases detected within the text; and
generate an emotional expression of the face for each of the frames based on the one of the plurality of sentence/clause-expressions assigned to the text of the sentence or the clause.
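The expression-assignment steps of claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the lexicon, the expression labels, the `neutral` fallback, the whitespace tokenizer, and the tie-breaking rule are all assumptions introduced here for illustration. The sketch only shows the idea of voting over word-level expressions while skipping context-dependent words.

```python
from collections import Counter

# Hypothetical lexicon (an assumption, not from the patent):
# word -> (word/phrase-expression, is_context_dependent)
LEXICON = {
    "happy": ("joy", False),
    "great": ("joy", False),
    "sad": ("sorrow", False),
    "fine": ("joy", True),  # treated as context-dependent in this sketch
}


def assign_sentence_expression(text, lexicon=LEXICON, default="neutral"):
    """Return the most frequent word-level expression in `text`,
    ignoring words flagged as context-dependent."""
    # Naive tokenization: split on whitespace and strip punctuation.
    words = [w.strip(".,!?;:").lower() for w in text.split()]

    votes = Counter()
    for word in words:
        entry = lexicon.get(word)
        if entry is None:
            continue  # word carries no known expression
        expression, context_dependent = entry
        if context_dependent:
            continue  # ignored when picking the sentence/clause-expression
        votes[expression] += 1

    if not votes:
        return default  # assumed fallback; the claim does not specify one
    # Ties go to the expression seen first (an assumption of this sketch).
    return votes.most_common(1)[0][0]


print(assign_sentence_expression(
    "I am happy, the weather is great, but the ending was sad."))
```

Here two words vote for `joy` and one for `sorrow`, so the sentence as a whole is assigned `joy`. A sentence containing only context-dependent words falls back to the assumed default.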
Abstract
An image processing device includes a controller and a display. When audio is output, the controller adds an expression to a displayed face image in accordance with that audio. The controller also generates an animation in which the mouth of the face image, with the expression applied, moves in sync with the audio. The display displays the generated animation.
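As a rough illustration of the animation described in the abstract, the following Python sketch builds a list of frames in which the mouth shape follows the spoken words while one expression is held for the whole sentence. Everything concrete here is an assumption made for the example: the `Frame` type, the vowel-to-mouth-shape table, the word timings, and the frame rate are hypothetical and are not taken from the patent.

```python
from dataclasses import dataclass


@dataclass
class Frame:
    time_s: float    # timestamp of the frame relative to audio start
    mouth: str       # mouth shape label for this frame
    expression: str  # sentence/clause-expression applied to the face


# Hypothetical mapping from a word's first vowel to a mouth shape.
MOUTH_BY_VOWEL = {"a": "open", "e": "wide", "i": "wide",
                  "o": "round", "u": "round"}


def mouth_for_word(word):
    """Pick a mouth shape from the first vowel in the word."""
    for ch in word.lower():
        if ch in MOUTH_BY_VOWEL:
            return MOUTH_BY_VOWEL[ch]
    return "closed"


def generate_frames(timed_words, expression, duration_s, fps=10):
    """timed_words: list of (word, start_s, end_s) aligned to the audio.

    Every frame gets the same expression; the mouth is 'closed'
    whenever no word is being spoken at that frame's timestamp.
    """
    frames = []
    for i in range(round(duration_s * fps)):
        t = i / fps
        mouth = "closed"
        for word, start, end in timed_words:
            if start <= t < end:
                mouth = mouth_for_word(word)
                break
        frames.append(Frame(t, mouth, expression))
    return frames


timed = [("hello", 0.0, 0.3), ("you", 0.5, 0.8)]
for frame in generate_frames(timed, "joy", duration_s=1.0):
    print(frame)
```

Keeping the mouth shape and the expression as separate fields mirrors the split in the claims, where the mouth is driven by the detected words and the expression by the label assigned to the sentence or clause.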
6 Claims
3. A method comprising:
detecting words or phrases within text of a sentence or a clause, wherein the text corresponds to audio to be reproduced;
determining, for at least one of the words or phrases detected within the text of the sentence or the clause, a corresponding one of a plurality of word/phrase-expressions;
determining that at least one of the words or phrases within the text of the sentence or the clause is a context-dependent word/phrase-expression;
assigning a most frequent one of the word/phrase-expression determined for the at least one of the words or phrases detected, while ignoring the context-dependent word/phrase-expression determined, as one of a plurality of sentence/clause-expressions to the text of the sentence or the clause; and
generating frames of animation of a face of increased expressiveness to be displayed in sync with a reproduction of the audio, by at least:
generating a mouth shape of a mouth of the face for each of the frames based on the words or phrases detected within the text; and
generating an emotional expression of the face for each of the frames based on the one of the plurality of sentence/clause-expressions assigned to the text of the sentence or the clause.
5. A non-transitory computer readable storage medium storing a program to cause a computer to at least perform:
detecting words or phrases within text of a sentence or a clause, wherein the text corresponds to audio to be reproduced;
determining, for at least one of the words or phrases detected within the text of the sentence or the clause, a corresponding one of a plurality of word/phrase-expressions;
determining that at least one of the words or phrases within the text of the sentence or the clause is a context-dependent word/phrase-expression;
assigning a most frequent one of the word/phrase-expression determined for the at least one of the words or phrases detected, while ignoring the context-dependent word/phrase-expression determined, as one of a plurality of sentence/clause-expressions to the text of the sentence or the clause; and
generating frames of animation of a face of increased expressiveness to be displayed in sync with a reproduction of the audio, by at least:
generating a mouth shape of a mouth of the face for each of the frames based on the words or phrases detected within the text; and
generating an emotional expression of the face for each of the frames based on the one of the plurality of sentence/clause-expressions assigned to the text of the sentence or the clause.
Specification