ADAPTIVE TEXT-TO-SPEECH OUTPUTS
Abstract
In some implementations, a language proficiency of a user of a client device is determined by one or more computers. The one or more computers then determine a text segment for output by a text-to-speech module based on the determined language proficiency of the user. After determining the text segment for output, the one or more computers generate audio data including a synthesized utterance of the text segment. The audio data including the synthesized utterance of the text segment is then provided to the client device for output.
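As an illustration only, the proficiency-based selection described in the abstract might be sketched as follows. The proficiency levels, the `Candidate` type, and the example sentences are hypothetical; the abstract does not specify how proficiency is represented or how candidate texts are ranked.

```python
from dataclasses import dataclass

# Hypothetical proficiency scale; the abstract does not enumerate levels.
LEVELS = ("beginner", "intermediate", "advanced")


@dataclass(frozen=True)
class Candidate:
    text: str
    min_level: str  # lowest proficiency level this wording is suited to


def choose_text_segment(candidates, proficiency):
    """Return the most demanding wording that the user's level still covers."""
    rank = LEVELS.index(proficiency)
    suitable = [c for c in candidates if LEVELS.index(c.min_level) <= rank]
    if not suitable:
        raise ValueError("no candidate suits this proficiency level")
    return max(suitable, key=lambda c: LEVELS.index(c.min_level)).text


# Two hypothetical wordings of the same message.
CANDIDATES = [
    Candidate("Rain today. Take an umbrella.", "beginner"),
    Candidate("Expect intermittent showers through the afternoon.", "advanced"),
]
```

Under these assumptions, a beginner or intermediate user would receive the simpler sentence and an advanced user the more complex one; the chosen text would then be passed to the text-to-speech module.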
20 Claims
1. A method performed by one or more computers, the method comprising:
receiving, by the one or more computers, context data from a client device of a user;
selecting, by the one or more computers, a user context corresponding to the context data from the client device, the user context being selected from among a plurality of user contexts;
determining, by the one or more computers, a text segment for text-to-speech synthesis by a text-to-speech module based on the selected user context;
generating, by the one or more computers, audio data comprising a synthesized utterance of the text segment using the text-to-speech module; and
providing, by the one or more computers and to the client device, the audio data comprising the synthesized utterance of the text segment.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11
12. A system comprising:
one or more computers; and
a non-transitory computer-readable medium coupled to the one or more computers having instructions stored thereon, which, when executed by the one or more computers, cause the one or more computers to perform operations comprising:
receiving, by the one or more computers, context data from a client device of a user;
selecting, by the one or more computers, a user context corresponding to the context data from the client device, the user context being selected from among a plurality of user contexts;
determining, by the one or more computers, a text segment for text-to-speech synthesis by a text-to-speech module based on the selected user context;
generating, by the one or more computers, audio data comprising a synthesized utterance of the text segment using the text-to-speech module; and
providing, by the one or more computers and to the client device, the audio data comprising the synthesized utterance of the text segment.
Dependent claims: 13, 14, 15
16. A non-transitory computer-readable storage device encoded with computer program instructions that, when executed by one or more computers, cause the one or more computers to perform operations comprising:
receiving, by the one or more computers, context data from a client device of a user;
selecting, by the one or more computers, a user context corresponding to the context data from the client device, the user context being selected from among a plurality of user contexts;
determining, by the one or more computers, a text segment for text-to-speech synthesis by a text-to-speech module based on the selected user context;
generating, by the one or more computers, audio data comprising a synthesized utterance of the text segment using the text-to-speech module; and
providing, by the one or more computers and to the client device, the audio data comprising the synthesized utterance of the text segment.
Dependent claims: 17, 18, 19, 20
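For readers who find code easier to follow than claim language, the five operations shared by independent claims 1, 12, and 16 can be sketched roughly as below. This is a minimal sketch under stated assumptions, not the patented implementation: the context names, the `speed_kmh` field, the speed threshold, and the function names are all hypothetical, and `synthesize` is a placeholder that emits silent WAV audio instead of calling a real text-to-speech module.

```python
import io
import wave

# Hypothetical plurality of user contexts, each mapped to a text segment.
USER_CONTEXTS = {
    "driving": "Turn left in 200 meters.",
    "walking": "Turn left at the next corner, by the bakery.",
}
DEFAULT_CONTEXT = "walking"


def select_user_context(context_data):
    """Select one context from the predefined set using client-reported data."""
    speed = context_data.get("speed_kmh", 0)  # assumed field name
    return "driving" if speed > 20 else DEFAULT_CONTEXT


def determine_text_segment(user_context):
    """Determine the text segment based on the selected user context."""
    return USER_CONTEXTS[user_context]


def synthesize(text, sample_rate=16000):
    """Stand-in for a text-to-speech module: returns silent 16-bit mono WAV
    bytes whose length scales with the text. No real speech is produced."""
    n_frames = sample_rate * len(text) // 20  # about 50 ms per character
    buf = io.BytesIO()
    with wave.open(buf, "wb") as w:
        w.setnchannels(1)
        w.setsampwidth(2)
        w.setframerate(sample_rate)
        w.writeframes(b"\x00\x00" * n_frames)
    return buf.getvalue()


def handle_request(context_data):
    """Receive context data, then select, determine, generate, and provide."""
    context = select_user_context(context_data)
    text = determine_text_segment(context)
    audio = synthesize(text)
    return {"context": context, "text": text, "audio": audio}
```

In this sketch, the returned dictionary stands in for the response sent back to the client device; a real system would replace `synthesize` with an actual speech synthesizer and choose contexts from richer signals.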
Specification