ADAPTIVE TEXT-TO-SPEECH OUTPUTS
First Claim
Patent Images
1. A method comprising:
- determining, by data processing hardware, a user context of a user of a client device, the user context indicating a level of complexity of speech that the user is likely able to comprehend;
determining, by the data processing hardware, a particular text segment for text-to-speech output to the user, the particular text segment having a complexity score indicating a corresponding level of complexity associated with the particular text segment;
modifying, by the data processing hardware, the particular text segment for the text-to-speech output to the user based on the complexity score of the particular text segment and the selected user context;
generating, by the data processing hardware, audio data comprising a synthesized utterance of the modified particular text segment; and
providing, by the data processing hardware, the audio data comprising the synthesized utterance of the modified particular text segment to the client device.
0 Assignments
0 Petitions
Accused Products
Abstract
In some implementations, a language proficiency of a user of a client device is determined by one or more computers. The one or more computers then determines a text segment for output by a text-to-speech module based on the determined language proficiency of the user. After determining the text segment for output, the one or more computers generates audio data including a synthesized utterance of the text segment. The audio data including the synthesized utterance of the text segment is then provided to the client device for output.
0 Citations
22 Claims
-
1. A method comprising:
-
determining, by data processing hardware, a user context of a user of a client device, the user context indicating a level of complexity of speech that the user is likely able to comprehend; determining, by the data processing hardware, a particular text segment for text-to-speech output to the user, the particular text segment having a complexity score indicating a corresponding level of complexity associated with the particular text segment; modifying, by the data processing hardware, the particular text segment for the text-to-speech output to the user based on the complexity score of the particular text segment and the selected user context; generating, by the data processing hardware, audio data comprising a synthesized utterance of the modified particular text segment; and providing, by the data processing hardware, the audio data comprising the synthesized utterance of the modified particular text segment to the client device. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A system comprising:
-
data processing hardware; and memory hardware in communication with the data processing hardware and storing instructions, that when executed by the data processing hardware, cause the data processing hardware to perform operations comprising; determining a user context of a user of a client device, the user context indicating a level of complexity of speech that the user is likely able to comprehend; determining a particular text segment for text-to-speech output to the user, the particular text segment having a complexity score indicating a corresponding level of complexity associated with the particular text segment; modifying the particular text segment for the text-to-speech output to the user based on the complexity score of the particular text segment and the selected user context; generating audio data comprising a synthesized utterance of the modified particular text segment; and providing the audio data comprising the synthesized utterance of the modified particular text segment to the client device. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
-
Specification