Easy generation and automatic training of spoken dialog systems using text-to-speech
First Claim
1. A dialog system training environment comprising:
- a user simulator that provides a text to speech output associated with an utterance; and
, a dialog system that comprises;
a speech model having a plurality of modifiable parameters, the speech model receives the speech input from the utterance and produces output features; and
, a dialog action model having a plurality of modifiable parameters, the dialog model receives the speech output features from the speech model and produces output actions. parameters of the speech model and/or the dialog action model based, at least in part, upon the utterance identified by the speech model and/or the action taken by the dialog action model.
2 Assignments
0 Petitions
Accused Products
Abstract
A dialog system training environment and method using text-to-speech (TTS) are provided. The only knowledge a designer requires is a simple specification of when the dialog system has failed or succeeded, and for any state of the dialog, a list of the possible actions the system can take. The training environment simulates a user using TTS varied at adjustable levels, a dialog action model of a dialog system responds to the produced utterance by trying out all possible actions until it has failed or succeeded. From the data accumulated in the training environment it is possible for the dialog action model to learn which states to go to when it observes the appropriate speech and dialog features so as to increase the likelihood of success. The data can also be used to improve the speech model.
125 Citations
20 Claims
-
1. A dialog system training environment comprising:
-
a user simulator that provides a text to speech output associated with an utterance; and
,a dialog system that comprises;
a speech model having a plurality of modifiable parameters, the speech model receives the speech input from the utterance and produces output features; and
,a dialog action model having a plurality of modifiable parameters, the dialog model receives the speech output features from the speech model and produces output actions. parameters of the speech model and/or the dialog action model based, at least in part, upon the utterance identified by the speech model and/or the action taken by the dialog action model. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A method of training a learning system either offline or online comprising:
-
generating an utterance using text to speech;
identifying the utterance using a speech model;
performing an action; and
,updating a dialog action model or speech model or both. - View Dependent Claims (14, 15, 16, 17, 18)
-
-
19. A dialog system training environment comprising:
-
means for simulating an utterance;
means for identifying the utterance;
means for modeling speech using a plurality of modifiable parameters, the means for modeling speech receiving the utterance and producing output features;
means for modeling dialog actions using a plurality of modifiable parameters, the means for modeling dialog actions receiving the speech output features and producing output actions; and
,means for modifying parameters of the means for modeling speech and/or the means for modeling dialog model actions based, at least in part, upon the utterance identified by the means for modeling speech and/or the action taken by the means for modeling dialog actions. - View Dependent Claims (20)
-
Specification