Speaker verification using co-location information
First Claim
1. A computer-implemented method comprising:
- obtaining, by a first computing device that is configured to respond to voice commands while in a locked state upon receipt of a particular, predefined hotword, a value for a setting that indicates that the first computing device is permitted to provide speaker verification data to other computing devices;
receiving, by the first computing device, audio data that corresponds to an utterance of a voice command that is preceded by the particular, predefined hotword, the audio data being received while the first computing device is in a locked state and is co-located with a second computing device that is also configured to respond to voice commands that are preceded by the particular, predefined hotword;
while the first computing device is in the locked state, and based on the obtained value for the setting that indicates that the first computing device is permitted to share speaker verification data with other computing devices, transmitting, by the first computing device, a message to the second computing device that (i) is co-located with the first computing device and (ii) is configured to respond to voice commands that are preceded by the particular, predefined hotword; and
determining, by the first computing device, to remain in the locked state and not respond to the voice command despite receiving the audio data that corresponds to the utterance of the voice command that is preceded by the particular, predefined hotword.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for identifying a user in a multi-user environment. One of the methods includes receiving, by a first user device, an audio signal encoding an utterance, obtaining, by the first user device, a first speaker model for a first user of the first user device, obtaining, by the first user device for a second user of a second user device that is co-located with the first user device, a second speaker model for the second user or a second score that indicates a respective likelihood that the utterance was spoken by the second user, and determining, by the first user device, that the utterance was spoken by the first user using (i) the first speaker model and the second speaker model or (ii) the first speaker model and the second score.
-
Citations
39 Claims
-
1. A computer-implemented method comprising:
-
obtaining, by a first computing device that is configured to respond to voice commands while in a locked state upon receipt of a particular, predefined hotword, a value for a setting that indicates that the first computing device is permitted to provide speaker verification data to other computing devices; receiving, by the first computing device, audio data that corresponds to an utterance of a voice command that is preceded by the particular, predefined hotword, the audio data being received while the first computing device is in a locked state and is co-located with a second computing device that is also configured to respond to voice commands that are preceded by the particular, predefined hotword; while the first computing device is in the locked state, and based on the obtained value for the setting that indicates that the first computing device is permitted to share speaker verification data with other computing devices, transmitting, by the first computing device, a message to the second computing device that (i) is co-located with the first computing device and (ii) is configured to respond to voice commands that are preceded by the particular, predefined hotword; and determining, by the first computing device, to remain in the locked state and not respond to the voice command despite receiving the audio data that corresponds to the utterance of the voice command that is preceded by the particular, predefined hotword. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A system comprising:
a first computing device that is configured to respond to voice commands while in a locked state upon receipt of a particular, predefined hotword and one or more storage devices storing instructions that are operable, when executed by the first computing device, to cause the first computing device to perform operations comprising; obtaining a value for a setting that indicates that the first computing device is permitted to provide speaker verification data to other computing devices; receiving audio data that corresponds to an utterance of a voice command that is preceded by the particular, predefined hotword, the audio data being received while the first computing device is in a locked state and is co-located with a second computing device that is also configured to respond to voice commands that are preceded by the particular, predefined hotword; while the first computing device is in the locked state, and based on the obtained value for the setting that indicates that the first computing device is permitted to share speaker verification data with other computing devices, transmitting a message to the second computing device that (i) is co-located with the first computing device and (ii) is configured to respond to voice commands that are preceded by the particular, predefined hotword; and determining to remain in the locked state and not respond to the voice command despite receiving the audio data that corresponds to the utterance of the voice command that is preceded by the particular, predefined hotword. - View Dependent Claims (15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26)
-
27. A non-transitory computer-readable medium storing software comprising instructions executable by one or more computers which, upon such execution, cause the one or more computers to perform operations comprising:
-
obtaining, by a first computing device that is configured to respond to voice commands while in a locked state upon receipt of a particular, predefined hotword, a value for a setting that indicates that the first computing device is permitted to provide speaker verification data to other computing devices; receiving, by the first computing device, audio data that corresponds to an utterance of a voice command that is preceded by the particular, predefined hotword, the audio data being received while the first computing device is in a locked state and is co-located with a second computing device that is also configured to respond to voice commands that are preceded by the particular, predefined hotword; while the first computing device is in the locked state, and based on the obtained value for the setting that indicates that the first computing device is permitted to share speaker verification data with other computing devices, transmitting, by the first computing device, a message to the second computing device that (i) is co-located with the first computing device and (ii) is configured to respond to voice commands that are preceded by the particular, predefined hotword; and determining, by the first computing device, to remain in the locked state and not respond to the voice command despite receiving the audio data that corresponds to the utterance of the voice command that is preceded by the particular, predefined hotword. - View Dependent Claims (28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39)
-
Specification