Speaker verification using co-location information
First Claim
1. A method comprising:
- receiving, at data processing hardware, audio data corresponding to an utterance of a voice command captured by a user device, the user device having a plurality of different users, each user of the plurality of different users having different corresponding user permissions to access a plurality of applications on the user device;
for each user of the plurality of different users of the user device;
obtaining, by the data processing hardware, corresponding speaker verification data from memory hardware in communication with the data processing hardware; and
generating, by the data processing hardware, a corresponding speaker verification score by comparing the corresponding speaker verification data and the audio data, the corresponding speaker verification score indicating a likelihood that the utterance of the voice command was spoken by the corresponding user of the plurality of different users of the user device;
identifying, by the data processing hardware, a speaker of the utterance of the voice command as the user of the plurality of different users of the user device associated with a highest corresponding speaker verification score; and
processing, by the data processing hardware, the voice command using a speech recognition module to identify a particular action for the user device to execute, the particular action, when executed by the user device, launching a particular application of the plurality of applications on the user device based on the corresponding user permissions associated with the identified speaker to access the application.
0 Assignments
0 Petitions
Accused Products
Abstract
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for identifying a user in a multi-user environment. One of the methods includes receiving, by a first user device, an audio signal encoding an utterance, obtaining, by the first user device, a first speaker model for a first user of the first user device, obtaining, by the first user device for a second user of a second user device that is co-located with the first user device, a second speaker model for the second user or a second score that indicates a respective likelihood that the utterance was spoken by the second user, and determining, by the first user device, that the utterance was spoken by the first user using (i) the first speaker model and the second speaker model or (ii) the first speaker model and the second score.
-
Citations
22 Claims
-
1. A method comprising:
-
receiving, at data processing hardware, audio data corresponding to an utterance of a voice command captured by a user device, the user device having a plurality of different users, each user of the plurality of different users having different corresponding user permissions to access a plurality of applications on the user device; for each user of the plurality of different users of the user device; obtaining, by the data processing hardware, corresponding speaker verification data from memory hardware in communication with the data processing hardware; and generating, by the data processing hardware, a corresponding speaker verification score by comparing the corresponding speaker verification data and the audio data, the corresponding speaker verification score indicating a likelihood that the utterance of the voice command was spoken by the corresponding user of the plurality of different users of the user device; identifying, by the data processing hardware, a speaker of the utterance of the voice command as the user of the plurality of different users of the user device associated with a highest corresponding speaker verification score; and processing, by the data processing hardware, the voice command using a speech recognition module to identify a particular action for the user device to execute, the particular action, when executed by the user device, launching a particular application of the plurality of applications on the user device based on the corresponding user permissions associated with the identified speaker to access the application. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A system comprising:
-
data processing hardware; and memory hardware in communication with the data processing hardware and storing instructions, that when executed by the data processing hardware, cause the data processing hardware to perform operations comprising; receiving audio data corresponding to an utterance of a voice command captured by a user device, the user device having a plurality of different users, each user of the plurality of different users having different corresponding user permissions to access a plurality of applications on the user device; for each user of the plurality of different users of the user device; obtaining corresponding speaker verification data from the memory hardware; and generating a corresponding speaker verification score by comparing the corresponding speaker verification data and the audio data, the corresponding speaker verification score indicating a likelihood that the utterance of the voice command was spoken by the corresponding user of the plurality of different users of the user device; identifying a speaker of the utterance of the voice command as the user of the plurality of different users of the user device associated with a highest corresponding speaker verification score; and processing the voice command using a speech recognition module to identify a particular action for the user device to execute, the particular action, when executed by the user device, launching a particular application of the plurality of applications on the corresponding user device based on user permissions associated with the identified speaker to access the application. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
-
Specification