User authentication for voice-input devices
First Claim
Patent Images
1. A method comprising;
- determining, based at least in part on a first audio signal, a request from a user;
causing, based at least in part on the request, output of a first question that is associated with a first predefined response;
determining, based at least in part on a second audio signal, that a first utterance of the user corresponds to the first predefined response based at least in part on a contextual representation associated with the first utterance;
causing, based at least in part on the first utterance, output of a second question that is associated with a second predefined response;
determining, based at least in part on a third audio signal, that a second utterance of the user corresponds to the second predefined response; and
causing, based at least in part on the second utterance, audible output of data.
2 Assignments
0 Petitions
Accused Products
Abstract
Techniques for authenticating users at devices that interact with the users via voice input. For instance, the described techniques may allow a voice-input device to safely verify the identity of a user by engaging in a back-and-forth conversation. The device or another device coupled thereto may then verify the accuracy of the responses from the user during the conversation, as well as compare an audio signature associated with the user'"'"'s responses to a pre-stored audio signature associated with the user. By utilizing multiple checks, the described techniques are able to accurately and safely authenticate the user based solely on an audible conversation between the user and the voice-input device.
17 Citations
20 Claims
-
1. A method comprising;
-
determining, based at least in part on a first audio signal, a request from a user; causing, based at least in part on the request, output of a first question that is associated with a first predefined response; determining, based at least in part on a second audio signal, that a first utterance of the user corresponds to the first predefined response based at least in part on a contextual representation associated with the first utterance; causing, based at least in part on the first utterance, output of a second question that is associated with a second predefined response; determining, based at least in part on a third audio signal, that a second utterance of the user corresponds to the second predefined response; and causing, based at least in part on the second utterance, audible output of data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A system comprising:
-
one or more microphones; one or more speakers; one or more processors; and memory storing computer-executable instructions that, when executed by the one or more processors, cause the one or more processors to perform operations comprising; determining, based at least in part on a first audio signal generated by the one or more microphones, a request; outputting, by the one or more speakers and based at least in part on the request, a first question that is associated with a first predefined response; determining, based at least in part on a second audio signal generated by the one or more microphones, that a first utterance corresponds to the first predefined response; outputting, by the one or more speakers and based at least in part on the first utterance, a second question that is associated with a second predefined response; determining, based at least in part on a third audio signal generated by the one or more microphones, that a second utterance corresponds to the second predefined response based at least in part on a contextual representation associated with the second utterance; and outputting, by the one or more speakers and based at least in part on the second utterance, audio data associated with the request. - View Dependent Claims (11, 12, 13, 14, 15, 16)
-
-
17. A system comprising:
-
one or more microphones; one or more speakers; one or more processors; and memory storing computer-executable instructions that, when executed by the one or more processors, cause the one or more processors to perform operations comprising; determining, based at least in part on a first audio signal generated by the one or more microphones, a request; outputting, by the one or more speakers and based at least in part on the request, a first question that is associated with a first predefined response and a second predefined response; determining, based at least in part on a second audio signal generated by the one or more microphones, that a first utterance corresponds to at least one of the first predefined response or the second predefined response, wherein determining that the first utterance corresponds to the at least one of the first predefined response or the second predefined response comprises determining that the first utterance does not correspond to at least one of the first predefined response or the second predefined response; outputting, by the one or more speakers and based at least in part on the first utterance, a second question that is associated with a third predefined response and a fourth predefined response; determining, based at least in part on a third audio signal generated by the one or more microphones, that a second utterance corresponds to at least one of the third predefined response or the fourth predefined response; and outputting, by the one or more speakers and based at least in part on the second utterance, audio data associated with the request. - View Dependent Claims (18, 19, 20)
-
Specification