Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment
First Claim
Patent Images
1. A communication system comprising:
- a text-to-speech engine configured to provide an audible output to a user, the text-to-speech engine including an adjustable operational parameter; and
a processing circuitry configured to;
monitor an ambient noise level and, in response to an occurrence of a predefined condition associated with the ambient noise level, modify the adjustable operational parameter of the text-to-speech engine, and monitor an environmental condition related to intelligibility of the audible output of the text-to-speech engine;
modify the adjustable operational parameter of the text-to-speech engine based on the monitored environmental condition, wherein the monitored environmental condition comprises at least one of;
a type of a message being converted by the text-to-speech engine;
a type of a command received from the user;
a location of the user;
a proximity of the user to another user;
an ambient temperature of the user'"'"'s environment;
a time of day;
an experience level of the user with the text-to-speech engine;
an experience level of the user with an area of a task application;
an amount of time logged by the user with the task application;
a language of the message being converted by the text-to-speech engine;
a length of the message being converted by the text-to-speech engine; and
a frequency that the message being converted by the text-to-speech engine is used by the task application;
receive a user input indicating that the audible output of the text-to-speech engine is understood by the user after the adjustable operational parameter is modified; and
in response to the user input, restore the modified adjustable operational parameter of the text-to-speech engine to a previous setting after a predefined amount of time has elapsed.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and apparatus that dynamically adjust operational parameters of a text-to-speech engine in a speech-based system are disclosed. A voice engine or other application of a device provides a mechanism to alter the adjustable operational parameters of the text-to-speech engine. In response to one or more environmental conditions, the adjustable operational parameters of the text-to-speech engine are modified to increase the intelligibility of synthesized speech.
196 Citations
19 Claims
-
1. A communication system comprising:
-
a text-to-speech engine configured to provide an audible output to a user, the text-to-speech engine including an adjustable operational parameter; and a processing circuitry configured to; monitor an ambient noise level and, in response to an occurrence of a predefined condition associated with the ambient noise level, modify the adjustable operational parameter of the text-to-speech engine, and monitor an environmental condition related to intelligibility of the audible output of the text-to-speech engine; modify the adjustable operational parameter of the text-to-speech engine based on the monitored environmental condition, wherein the monitored environmental condition comprises at least one of;
a type of a message being converted by the text-to-speech engine;
a type of a command received from the user;
a location of the user;
a proximity of the user to another user;
an ambient temperature of the user'"'"'s environment;
a time of day;
an experience level of the user with the text-to-speech engine;
an experience level of the user with an area of a task application;
an amount of time logged by the user with the task application;
a language of the message being converted by the text-to-speech engine;
a length of the message being converted by the text-to-speech engine; and
a frequency that the message being converted by the text-to-speech engine is used by the task application;receive a user input indicating that the audible output of the text-to-speech engine is understood by the user after the adjustable operational parameter is modified; and in response to the user input, restore the modified adjustable operational parameter of the text-to-speech engine to a previous setting after a predefined amount of time has elapsed. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A communication system comprising:
-
a text-to-speech engine configured to provide an audible output to a user, the text-to-speech engine including an adjustable operational parameter; and a processing circuitry configured to; monitor an environmental condition related to intelligibility of the audible output of the text-to-speech engine; modify the adjustable operational parameter based on the monitored environmental condition, wherein the monitored environmental condition comprises at least one of;
a language of a message being converted by the text-to-speech engine and one of speed, pitch, and/or volume of the audible output of the text-to-speech engine;receive a user input indicating that the audible output of the text-to-speech engine is understood by the user after the adjustable operational parameter is modified; and in response to the user input, restore the modified adjustable operational parameter of the text-to-speech engine to a previous setting after a predefined amount of time has elapsed. - View Dependent Claims (8, 9, 10, 11, 12, 13, 14)
-
-
15. A method comprising:
-
monitoring an environmental condition related to intelligibility of an audible output of a text-to-speech engine (TTS) and an ambient noise level, wherein the TTS includes an adjustable operational parameter associated to the TTS and provides the audible output to a user; modifying the adjustable operational parameter of the text-to-speech engine based on the monitored environmental condition and the ambient noise level, wherein the monitored environmental condition comprises at least one of;
a type of a message being converted by the text-to-speech engine;
a type of a command received from the user;
a location of the user;
a proximity of the user to another user;
an ambient temperature of the user'"'"'s environment;
a time of day;
an experience level of the user with the text-to-speech engine;
an experience level of the user with an area of a task application;
an amount of time logged by the user with the task application;
a language of the message being converted by the text-to-speech engine;
a length of the message being converted by the text-to-speech engine;
the ambient noise level corresponding to the environment; and
a frequency that the message being converted by the text-to-speech engine is used by the task application;receiving a user input indicating that the audible output of the text-to-speech engine is understood by the user after the adjustable operational parameter is modified; and in response to the user input, restoring the modified adjustable operational parameter of the text-to-speech engine to a previous setting after a predefined amount of time has elapsed. - View Dependent Claims (16, 17, 18, 19)
-
Specification