Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment

US 10,685,643 B2
Filed: 06/28/2017
Issued: 06/16/2020
Est. Priority Date: 05/20/2011
Status: Active Grant

1 Assignment

0 Petitions

Abstract

A method and apparatus that dynamically adjust operational parameters of a text-to-speech engine in a speech-based system are disclosed. A voice engine or other application of a device provides a mechanism to alter the adjustable operational parameters of the text-to-speech engine. In response to one or more environmental conditions, the adjustable operational parameters of the text-to-speech engine are modified to increase the intelligibility of synthesized speech.

196 Citations

19 Claims

1. A communication system comprising:
- a text-to-speech engine configured to provide an audible output to a user, the text-to-speech engine including an adjustable operational parameter; and
  
  a processing circuitry configured to:
  
  monitor an ambient noise level and, in response to an occurrence of a predefined condition associated with the ambient noise level, modify the adjustable operational parameter of the text-to-speech engine, and monitor an environmental condition related to intelligibility of the audible output of the text-to-speech engine;
  
  modify the adjustable operational parameter of the text-to-speech engine based on the monitored environmental condition, wherein the monitored environmental condition comprises at least one of:
  
  a type of a message being converted by the text-to-speech engine;
  
  a type of a command received from the user;
  
  a location of the user;
  
  a proximity of the user to another user;
  
  an ambient temperature of the user's environment;
  
  a time of day;
  
  an experience level of the user with the text-to-speech engine;
  
  an experience level of the user with an area of a task application;
  
  an amount of time logged by the user with the task application;
  
  a language of the message being converted by the text-to-speech engine;
  
  a length of the message being converted by the text-to-speech engine; and
  
  a frequency that the message being converted by the text-to-speech engine is used by the task application;
  
  receive a user input indicating that the audible output of the text-to-speech engine is understood by the user after the adjustable operational parameter is modified; and
  
  in response to the user input, restore the modified adjustable operational parameter of the text-to-speech engine to a previous setting after a predefined amount of time has elapsed.
- Dependent claims (2, 3, 4, 5, 6)
- - 2. The communication system of claim 1, wherein the processing circuitry is further configured to restore the modified adjustable operational parameter of the text-to-speech engine to the previous setting in response to the ambient noise level indicating a return to a previous state.
  - 3. The communication system of claim 2, wherein the adjustable operational parameter of the text-to-speech engine that is modified comprises speed, pitch, and/or volume.
  - 4. The communication system of claim 1, wherein the processing circuitry is further configured to vary a modification amount of the adjustable operational parameter incrementally.
  - 5. The communication system of claim 1, wherein the processing circuitry is further configured to monitor a task performed by the user.
  - 6. The communication system of claim 1, wherein:
    - the text-to-speech engine is further configured to convert a message including a flag indicating the type of the message being converted;
      
      the text-to-speech engine includes multiple adjustable operational parameters; and
      
      the processing circuitry is further configured to monitor the type of the message being converted and, in response to the monitored type, modify one or more of the multiple adjustable operational parameters.
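
The sequence recited in claim 1 (modify a parameter when a noise condition occurs, accept a user confirmation, then restore the previous setting after a delay) can be illustrated with a minimal Python sketch. This is not the patented implementation. The class name `IntelligibilityController`, the 80 dB threshold, the 0.25 volume boost, and the 30 second delay are all assumptions chosen for illustration, and timestamps are passed in explicitly so the behavior is easy to follow.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class IntelligibilityController:
    """Hypothetical controller; every name and default value is an assumption."""
    volume: float = 0.5               # the adjustable operational parameter (0.0 to 1.0)
    noise_threshold_db: float = 80.0  # assumed "predefined condition" on ambient noise
    volume_boost: float = 0.25        # assumed modification amount
    restore_delay_s: float = 30.0     # assumed "predefined amount of time"
    _previous: Optional[float] = None
    _restore_at: Optional[float] = None

    def on_noise_sample(self, noise_db: float) -> None:
        # Modify the parameter once when the noise condition occurs,
        # remembering the previous setting so it can be restored later.
        if noise_db >= self.noise_threshold_db and self._previous is None:
            self._previous = self.volume
            self.volume = min(1.0, self.volume + self.volume_boost)

    def on_user_understood(self, now: float) -> None:
        # The user confirms the output is understood: start the restore timer.
        if self._previous is not None and self._restore_at is None:
            self._restore_at = now + self.restore_delay_s

    def tick(self, now: float) -> None:
        # Restore the previous setting only after the delay has elapsed.
        if self._restore_at is not None and now >= self._restore_at:
            self.volume = self._previous
            self._previous = None
            self._restore_at = None


controller = IntelligibilityController()
controller.on_noise_sample(85.0)          # noisy environment: volume is raised
controller.on_user_understood(now=100.0)  # user confirms understanding
controller.tick(now=110.0)                # too early: volume is still raised
raised = controller.volume
controller.tick(now=130.0)                # delay elapsed: previous volume restored
```

Passing `now` as an argument rather than reading a clock inside the class is a design choice made for this sketch only; a real system would presumably use its own timer source.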

7. A communication system comprising:
- a text-to-speech engine configured to provide an audible output to a user, the text-to-speech engine including an adjustable operational parameter; and
  
  a processing circuitry configured to:
  
  monitor an environmental condition related to intelligibility of the audible output of the text-to-speech engine;
  
  modify the adjustable operational parameter based on the monitored environmental condition, wherein the monitored environmental condition comprises at least one of:
  
  a language of a message being converted by the text-to-speech engine and one of speed, pitch, and/or volume of the audible output of the text-to-speech engine;
  
  receive a user input indicating that the audible output of the text-to-speech engine is understood by the user after the adjustable operational parameter is modified; and
  
  in response to the user input, restore the modified adjustable operational parameter of the text-to-speech engine to a previous setting after a predefined amount of time has elapsed.
- Dependent claims (8, 9, 10, 11, 12, 13, 14)
- - 8. The communication system of claim 7, wherein the processing circuitry is further configured to restore the modified adjustable operational parameter of the text-to-speech engine to the previous setting in response to the monitored environmental condition indicating a return to a previous state.
  - 9. The communication system of claim 7, wherein the adjustable operational parameter of the text-to-speech engine that is modified comprises the speed, the pitch, and/or the volume.
  - 10. The communication system of claim 7, wherein the processing circuitry is further configured to vary a modification amount of the adjustable operational parameter incrementally.
  - 11. The communication system of claim 7, wherein:
    - the text-to-speech engine includes multiple adjustable operational parameters;
      
      the processing circuitry is further configured to monitor the environmental condition related to intelligibility of the audible output of the text-to-speech engine and, in response to the monitored environmental condition, modify one or more of the multiple adjustable operational parameters, wherein the monitored environmental condition comprises a type of the message being converted by the text-to-speech engine, a type of a command received from the user, a location of the user, a proximity of the user to the other user, an ambient temperature of the user's environment, and/or a time of day.
  - 12. The communication system of claim 7, wherein:
    - the text-to-speech engine is further configured to convert a message including a flag indicating the type of the message being converted;
      
      the text-to-speech engine includes multiple adjustable operational parameters; and
      
      the processing circuitry is further configured to monitor the type of the message being converted and, in response to the monitored type, modify one or more of the multiple adjustable operational parameters.
  - 13. The communication system of claim 7, further comprising a detector operable for monitoring temperature and/or an ambient noise level.
  - 14. The communication system of claim 7, wherein the processing circuitry is further configured to detect a spoken command indicating that the user is experiencing difficulties understanding the audible output of the text-to-speech engine.
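
Claims 10 and 14 can be read together as: detect a spoken command that signals difficulty, then vary the parameter in small increments. The following Python sketch shows one way that might look. The phrase list, the function names, the 0.125 step, and the 0.5 floor are hypothetical, since the claims do not name specific commands or amounts, and the utterances are assumed to arrive as text from a speech recognizer.

```python
from typing import Iterable

# Hypothetical phrases; the claims do not list specific spoken commands.
DIFFICULTY_PHRASES = {"say again", "repeat"}


def indicates_difficulty(utterance: str) -> bool:
    """Return True if the recognized utterance signals the user did not understand."""
    return utterance.strip().lower() in DIFFICULTY_PHRASES


def step_down(speed: float, step: float = 0.125, floor: float = 0.5) -> float:
    """Reduce the speaking rate by one increment, never going below the floor."""
    return max(floor, speed - step)


def speed_after(utterances: Iterable[str], speed: float = 1.0) -> float:
    """Apply one incremental slowdown for each difficulty command heard."""
    for utterance in utterances:
        if indicates_difficulty(utterance):
            speed = step_down(speed)
    return speed


# Two difficulty commands out of three utterances: two increments are applied.
slowed = speed_after(["Say again", "ready", "repeat"])
```

The floor keeps repeated commands from slowing the output indefinitely; whether the actual system bounds the adjustment this way is not stated in the claims.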

15. A method comprising:
- monitoring an environmental condition related to intelligibility of an audible output of a text-to-speech engine (TTS) and an ambient noise level, wherein the TTS includes an adjustable operational parameter associated to the TTS and provides the audible output to a user;
  
  modifying the adjustable operational parameter of the text-to-speech engine based on the monitored environmental condition and the ambient noise level, wherein the monitored environmental condition comprises at least one of:
  
  a type of a message being converted by the text-to-speech engine;
  
  a type of a command received from the user;
  
  a location of the user;
  
  a proximity of the user to another user;
  
  an ambient temperature of the user's environment;
  
  a time of day;
  
  an experience level of the user with the text-to-speech engine;
  
  an experience level of the user with an area of a task application;
  
  an amount of time logged by the user with the task application;
  
  a language of the message being converted by the text-to-speech engine;
  
  a length of the message being converted by the text-to-speech engine;
  
  the ambient noise level corresponding to the environment; and
  
  a frequency that the message being converted by the text-to-speech engine is used by the task application;
  
  receiving a user input indicating that the audible output of the text-to-speech engine is understood by the user after the adjustable operational parameter is modified; and
  
  in response to the user input, restoring the modified adjustable operational parameter of the text-to-speech engine to a previous setting after a predefined amount of time has elapsed.
- Dependent claims (16, 17, 18, 19)
- - 16. The method of claim 15, wherein the environmental condition further includes one of a system message and a high priority message.
  - 17. The method of claim 15, wherein the adjustable operational parameter of the text-to-speech engine that is modified comprises speed, pitch, and/or volume.
  - 18. The method of claim 15, wherein the modifying comprises varying a modification amount of the adjustable operational parameter incrementally.
  - 19. The method of claim 15, wherein monitoring the proximity of the user to the other user comprises detecting a presence of a wireless signal transmitted by a device of the other user.
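
Two of the conditions in the dependent claims lend themselves to a short illustration: the message type (claim 16 mentions system and high priority messages) and the proximity of another user detected from a wireless signal (claim 19). The Python sketch below is only an assumption about how such checks could be expressed. The flag names, the `PROFILES` mapping, the device identifiers, and the signal-strength cutoff are all hypothetical; claim 19 itself only requires detecting the presence of a signal.

```python
from typing import Dict, Iterable, Tuple

# Assumed mapping from a message-type flag to parameter overrides.
PROFILES: Dict[str, Dict[str, float]] = {
    "system": {"speed": 0.75},
    "high_priority": {"speed": 0.75, "volume": 1.0},
}


def params_for_message(base: Dict[str, float], flag: str) -> Dict[str, float]:
    """Return a copy of the base parameters with any overrides for this flag."""
    return {**base, **PROFILES.get(flag, {})}


def other_user_nearby(
    scan: Iterable[Tuple[str, float]],
    own_device_id: str,
    min_rssi_dbm: float = -70.0,  # assumed cutoff, not part of the claim
) -> bool:
    """Treat any sufficiently strong signal from another device as proximity."""
    return any(
        device_id != own_device_id and rssi_dbm >= min_rssi_dbm
        for device_id, rssi_dbm in scan
    )


base = {"speed": 1.0, "pitch": 1.0, "volume": 0.5}
urgent = params_for_message(base, "high_priority")
nearby = other_user_nearby([("hs-1", -40.0), ("hs-2", -60.0)], own_device_id="hs-1")
```

Unknown flags fall through to the base parameters, so a message without a recognized type is spoken with the current settings.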

Current Assignee: Vocollect, Inc. (Honeywell International Inc.)
Original Assignee: Vocollect, Inc. (Honeywell International Inc.)
Inventors: Hendrickson, James; Stiffey, Debra Drylie; Littleton, Duane; Pecorari, John; Slusarczyk, Arkadiusz
Primary Examiner(s): Guerra-Erazo, Edgar X
Application Number: US15/635,326
Publication Number: US 20180018955A1
Time in Patent Office: 1,084 Days
CPC Class Codes:
G10L 13/02 Methods for producing synth...
G10L 13/033 Voice editing, e.g. manipul...
