Animated Digital Assistant
First Claim
Patent Images
1. A method for interacting with a user comprising:
- receiving an input from a device;
determining a text-based response based on the input using a logic engine;
generating an audio stream of a voice-synthesized response based on the text-based response, the voice-synthesized response having a plurality of phonemes;
rendering a video stream based on the plurality of phonemes, the video stream comprising an animated head speaking the voice-synthesized response;
synchronizing the video and the audio;
transmitting the video stream and the audio stream over the network; and
presenting the video stream and the audio stream on the device.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for interacting with a user comprising: receiving an input on a device, determining a text-based response based on the input using a logic engine, generating an audio stream of a voice-synthesized response based on the text-based response, rendering a video stream using a morphing of predetermined shapes based on phonemes in the voice-synthesized response, the video stream comprising an animated head speaking the voice-synthesized response, synchronizing the video stream and the audio stream, transmitting the video stream and the audio stream over the network; and presenting the video stream and the audio stream on the device.
-
Citations
20 Claims
-
1. A method for interacting with a user comprising:
-
receiving an input from a device; determining a text-based response based on the input using a logic engine; generating an audio stream of a voice-synthesized response based on the text-based response, the voice-synthesized response having a plurality of phonemes; rendering a video stream based on the plurality of phonemes, the video stream comprising an animated head speaking the voice-synthesized response; synchronizing the video and the audio; transmitting the video stream and the audio stream over the network; and presenting the video stream and the audio stream on the device. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A machine-readable medium that provides instructions for a processor, which when executed by the processor cause the processor to perform a method for interacting with a user comprising:
-
receiving an input from a device; determining a text-based response based on the input using a logic engine; generating an audio stream of a voice-synthesized response based on the text-based response, the voice-synthesized response having a plurality of phonemes; rendering a video stream based on the plurality of phonemes, the video stream comprising an animated head speaking the voice-synthesized response; synchronizing the video stream and the audio stream; transmitting the video stream and the audio stream over the network; and presenting the video stream and the audio stream on the device. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A system for interacting with a user comprising:
-
a device configured to receive an input and present a video stream and an audio stream; a server coupled to the device, the server being configured to receive the input and transmit the video stream and the audio stream to the device; a logic process coupled to receive the input, the logic process generating a text-based response based on the input; a text-to-speech process configured to receive the text-based response and generate an audio stream of a voice-synthesized response based on the text-based response, the voice-synthesized response having a plurality of phonemes; a video rendering process for generating a video stream based on the plurality of phonemes, the video stream comprising an animated head speaking the voice-synthesized response; and a synchronization process for synchronizing the audio stream and the video stream. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20)
-
Specification