Multi-user authentication on a device
First Claim
1. A computer-implemented method comprising:
- determining, by one or more computers, that an utterance includes a particular user speaking a hotword based at least on a first hotword detection model generated from a first set of samples of the particular user speaking the hotword;
in response to determining that an utterance includes a particular user speaking a hotword based at least on the first hotword detection model generated from the first set of samples of the particular user speaking the hotword, storing at least a portion of the utterance as a new sample;
obtaining a second set of samples of the particular user speaking the utterance, where the second set of samples includes the new sample and less than all the samples in the first set of samples;
determining, by the one or more computers, that a second utterance includes the particular user speaking the hotword based at least on a second hotword detection model generated from the second set of samples of the user speaking the hotword; and
in response to determining, by the one or more computers, that a second utterance includes the particular user speaking the hotword, recognizing the second utterance as having been spoken by the particular user.
1 Assignment
0 Petitions
Accused Products
Abstract
In some implementations, an utterance is determined to include a particular user speaking a hotword based at least on a first set of samples of the particular user speaking the hotword. In response to determining that an utterance includes a particular user speaking a hotword based at least on a first set of samples of the particular user speaking the hotword, at least a portion of the utterance is stored as a new sample. A second set of samples of the particular user speaking the utterance is obtained, where the second set of samples includes the new sample and less than all the samples in the first set of samples. A second utterance is determined to include the particular user speaking the hotword based at least on the second set of samples of the user speaking the hotword.
130 Citations
20 Claims
-
1. A computer-implemented method comprising:
-
determining, by one or more computers, that an utterance includes a particular user speaking a hotword based at least on a first hotword detection model generated from a first set of samples of the particular user speaking the hotword; in response to determining that an utterance includes a particular user speaking a hotword based at least on the first hotword detection model generated from the first set of samples of the particular user speaking the hotword, storing at least a portion of the utterance as a new sample; obtaining a second set of samples of the particular user speaking the utterance, where the second set of samples includes the new sample and less than all the samples in the first set of samples; determining, by the one or more computers, that a second utterance includes the particular user speaking the hotword based at least on a second hotword detection model generated from the second set of samples of the user speaking the hotword; and in response to determining, by the one or more computers, that a second utterance includes the particular user speaking the hotword, recognizing the second utterance as having been spoken by the particular user. - View Dependent Claims (2, 3, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
4. The method of 3, wherein the reference samples comprise samples from a registration process for the particular user and the most recent stored samples comprise samples from queries spoken by the particular user.
-
13. A system comprising:
-
one or more computers; and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising; determining, by the one or more computers, that an utterance includes a particular user speaking a hotword based at least on a first hotword detection model generated from a first set of samples of the particular user speaking the hotword; in response to determining that an utterance includes a particular user speaking a hotword based at least on the first hotword detection model generated from a-the first set of samples of the particular user speaking the hotword, storing at least a portion of the utterance as a new sample; obtaining a second set of samples of the particular user speaking the utterance, where the second set of samples includes the new sample and less than all the samples in the first set of samples; determining, by the one or more computers, that a second utterance includes the particular user speaking the hotword based at least on a second hotword detection model generated from the second set of samples of the user speaking the hotword; and in response to determining, by the one or more computers, that a second utterance includes the particular user speaking the hotword, recognizing the second utterance as having been spoken by the particular user. - View Dependent Claims (14, 15, 16, 17, 18, 19)
-
-
20. A non-transitory computer-readable medium storing instructions executable by one or more computers which, upon such execution, cause the one or more computers to perform operations comprising:
-
determining, by one or more computers, that an utterance includes a particular user speaking a hotword based at least on a first hotword detection model generated from a first set of samples of the particular user speaking the hotword; in response to determining that an utterance includes a particular user speaking a hotword based at least on the first hotword detection model generated from the first set of samples of the particular user speaking the hotword, storing at least a portion of the utterance as a new sample; obtaining a second set of samples of the particular user speaking the utterance, where the second set of samples includes the new sample and less than all the samples in the first set of samples; determining, by the one or more computers, that a second utterance includes the particular user speaking the hotword based at least on a second hotword detection model generated from the second set of samples of the user speaking the hotword; and in response to determining, by the one or more computers, that a second utterance includes the particular user speaking the hotword, recognizing the second utterance as having been spoken by the particular user.
-
Specification