Turn-taking confidence
First Claim
Patent Images
1. A method for managing interactive dialog between a machine and a user comprising:
- verbalizing at least one desired sequence of one or more spoken phrases;
enabling a user to hear the at least one desired sequence of one or more spoken phrases;
receiving audio input from the user or an environment of the user;
determining a timing position of a possible speech onset from the audio input;
managing an interaction between the at least one desired sequence of one or more spoken phrases and the audio input, by determining at least one likelihood value dependent upon the possible speech onset.
5 Assignments
0 Petitions
Accused Products
Abstract
A method for managing interactive dialog between a machine and a user is claimed. In one embodiment, an interaction between the machine and the user is managed by determining at least one likelihood value which is dependent upon a possible speech onset of the user. In another embodiment, the likelihood value can be dependent a model of a desire of the user for specific items, a model of an attention of the user to specific items, or a model of turn-taking cues. Further, the likelihood value can be utilized in a voice activity system.
53 Citations
20 Claims
-
1. A method for managing interactive dialog between a machine and a user comprising:
-
verbalizing at least one desired sequence of one or more spoken phrases;
enabling a user to hear the at least one desired sequence of one or more spoken phrases;
receiving audio input from the user or an environment of the user;
determining a timing position of a possible speech onset from the audio input;
managing an interaction between the at least one desired sequence of one or more spoken phrases and the audio input, by determining at least one likelihood value dependent upon the possible speech onset. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A system for managing interactive dialog between a machine and a user comprising:
-
means for verbalizing at least one desired sequence of one or more spoken phrases;
means for enabling a user to hear the at least one desired sequence of one or more spoken phrases;
means for receiving audio input from the user or an environment of the user;
means for determining a timing position of a possible speech onset from the audio input;
means for managing an interaction between the at least one desired sequence of one or more spoken phrases and the audio input, by determining at least one likelihood value dependent upon the possible speech onset. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20)
-
Specification