Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment
First Claim
Patent Images
1. A communication system for a speech-based environment, the communication system comprising:
- a text-to-speech engine configured to provide an audible output to a user, the text-to-speech engine including one or more adjustable operational parameters; and
processing circuitry configured to;
monitor an ambient noise level and, in response to the monitored ambient noise level, modify the adjustable operational parameter of the text-to-speech engine, andmonitor environmental conditions related to intelligibility of the audible output of the text-to-speech engine and, in response to the monitored environmental conditions, modify one or more of the adjustable operational parameters of the text-to-speech engine,the monitored environmental conditions comprising a type of message being converted by the text-to-speech engine, a type of command received from the user, an experience level of the user with the text-to-speech engine, an experience level of the user with an area of a task application, an amount of time logged by the user with a task application, a language of a message being converted by the text-to-speech engine, a length of a message being converted by the text-to-speech engine, a frequency that a message being converted by the text-to-speech engine is used by a task application, or any combination thereof;
wherein the adjustable operational parameter is a speed of the text-to-speech engine, which is temporarily reduced in response to the monitored environmental conditions to increase the intelligibility of the audible output to the user.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and apparatus that dynamically adjust operational parameters of a text-to-speech engine in a speech-based system are disclosed. A voice engine or other application of a device provides a mechanism to alter the adjustable operational parameters of the text-to-speech engine. In response to one or more environmental conditions, the adjustable operational parameters of the text-to-speech engine are modified to increase the intelligibility of synthesized speech.
192 Citations
19 Claims
-
1. A communication system for a speech-based environment, the communication system comprising:
-
a text-to-speech engine configured to provide an audible output to a user, the text-to-speech engine including one or more adjustable operational parameters; and processing circuitry configured to; monitor an ambient noise level and, in response to the monitored ambient noise level, modify the adjustable operational parameter of the text-to-speech engine, and monitor environmental conditions related to intelligibility of the audible output of the text-to-speech engine and, in response to the monitored environmental conditions, modify one or more of the adjustable operational parameters of the text-to-speech engine, the monitored environmental conditions comprising a type of message being converted by the text-to-speech engine, a type of command received from the user, an experience level of the user with the text-to-speech engine, an experience level of the user with an area of a task application, an amount of time logged by the user with a task application, a language of a message being converted by the text-to-speech engine, a length of a message being converted by the text-to-speech engine, a frequency that a message being converted by the text-to-speech engine is used by a task application, or any combination thereof; wherein the adjustable operational parameter is a speed of the text-to-speech engine, which is temporarily reduced in response to the monitored environmental conditions to increase the intelligibility of the audible output to the user. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A communication system for a speech-based environment, the communication system comprising:
-
a text-to-speech engine configured to provide an audible output to a user, the text-to-speech engine including an adjustable operational parameter; and processing circuitry configured to monitor environmental conditions related to intelligibility of the audible output of the text-to-speech engine and, in response to the monitored environmental conditions, modify the adjustable operational parameter; wherein the monitored environmental conditions comprise an experience level of the user with the text-to-speech engine, an experience level of the user with an area of a task application, an amount of time logged by the user with a task application, a language of a message being converted by the text-to-speech engine, a length of a message being converted by the text-to-speech engine, and/or a frequency that a message being converted by the text-to-speech engine is used by a task application; wherein the adjustable operational parameter is a speed of the text-to-speech engine, which is temporarily reduced in response to the monitored environmental conditions to increase the intelligibility of the audible output to the user. - View Dependent Claims (8, 9, 10, 11, 12, 13, 14)
-
-
15. A communication system for a speech-based environment, the communication system comprising:
-
a text-to-speech engine configured to provide an audible output to a user, the text-to-speech engine including an adjustable operational parameter; and processing circuitry configured to monitor environmental conditions related to intelligibility of the audible output of the text-to-speech engine and, in response to the monitored environmental conditions, modify the adjustable operational parameter; wherein the monitored environmental conditions comprise a type of command received from the user, an experience level of the user with the text-to-speech engine, an experience level of the user with an area of a task application, an amount of time logged by the user with a task application, a language of a message being converted by the text-to-speech engine, a length of a message being converted by the text-to-speech engine, a frequency that a message being converted by the text-to-speech engine is used by a task application, or any combination thereof; wherein the adjustable operational parameter is a speed of the text-to-speech engine, which is temporarily reduced in response to the monitored environmental conditions to increase the intelligibility of the audible output to the user. - View Dependent Claims (16, 17, 18, 19)
-
Specification