Conversational interface agent
First Claim
Patent Images
1. A computer readable storage medium having instructions, which when executed on a computer provide a user interface, the instructions comprising:
- a speech synthesizer receiving input for synthesis and providing an audio output signal; and
a video rendering module receiving information related to the audio output signal, the video rendering module rendering a representation of a head having a talking mouth portion with movements in accordance with the audio output signal, and wherein the video rendering module renders, as part of the representation a sequence of video frames having the head, and wherein the video rendering module selectively adds, to each frame, a mouth position for the mouth portion based in part on tracked movements of the head, wherein the video rendering module tracks movements of the head in the sequence of video frames, and wherein the tracked movements include translations and rotations of the head.
1 Assignment
0 Petitions
Accused Products
Abstract
A video rewrite technique for rendering a talking head or agent completely simulates a conversation by including a waiting or listening state. Smooth transitions are provided to and from a talking state.
35 Citations
7 Claims
-
1. A computer readable storage medium having instructions, which when executed on a computer provide a user interface, the instructions comprising:
-
a speech synthesizer receiving input for synthesis and providing an audio output signal; and a video rendering module receiving information related to the audio output signal, the video rendering module rendering a representation of a head having a talking mouth portion with movements in accordance with the audio output signal, and wherein the video rendering module renders, as part of the representation a sequence of video frames having the head, and wherein the video rendering module selectively adds, to each frame, a mouth position for the mouth portion based in part on tracked movements of the head, wherein the video rendering module tracks movements of the head in the sequence of video frames, and wherein the tracked movements include translations and rotations of the head. - View Dependent Claims (2, 3)
-
-
4. A computer readable storage medium having instructions, which when executed on a computer provide a user interface, the instructions comprising:
-
a speech synthesizer receiving input for synthesis and providing an audio output signal; and a video rendering module receiving information related to the audio output signal, the video rendering module rendering a representation of a head having a talking mouth portion with movements in accordance with the audio output signal, the video rendering module accessing a store having a sequence of frames of the head and rendering at least a portion of each of the frames in the sequence while selectively adding a corresponding mouth position based at least in part on tracked movements of the head, wherein the tracked movements of the head include translations and rotations. - View Dependent Claims (5, 6)
-
-
7. A computer-implemented method for generating a talking head on a computer display to simulate a conversation, the method comprising:
-
rendering a sequence of video frames of a head; tracking movements of the head throughout the sequence; selectively adding a corresponding mouth position to frames in the sequence as a function of the tracked movements of the head; wherein the tracked movements of the head include translations and rotations; wherein tracking comprises calculating interpolated physical movements of the head based on frames of the sequence; and wherein calculating interpolated physical movements includes calculating interpolated physical movements as a function of a corresponding preceding frame and a corresponding succeeding frame for each of a plurality of frames in the sequence.
-
Specification