Training speaker recognition models for digital assistants
First Claim
1. A non-transitory computer-readable storage medium storing one or more programs configured to be executed by one or more processors of an electronic device, the one or more programs including instructions for:
- obtaining, at a first time, user authentication information;
receiving, at a second time, a user utterance representing a request to perform one or more tasks;
generating a voice print from the user utterance;
determining whether a plurality of conditions are satisfied, the plurality of conditions including;
a first condition that the user authentication information corresponds to one or more authentication credentials stored in the memory, wherein the one or more authentication credentials are assigned to a registered user of the electronic device; and
a second condition that the first time and the second time are not separated by more than a predefined time period;
in accordance with determining that the plurality of conditions are satisfied, updating, based on the voice print, a speaker profile assigned to the registered user,after updating the speaker profile, determining whether a total number of voice prints associated with the speaker profile is equal to a predefined number; and
in accordance with determining that the total number of voice prints associated with the speaker profile is equal to a predefined number, providing an output prompting a user to activate a voice invocation feature of the digital assistant.
1 Assignment
0 Petitions
Accused Products
Abstract
Techniques for training a speaker recognition model used for interacting with a digital assistant are provided. In some examples, user authentication information is obtained at a first time. At a second time, a user utterance representing a user request is received. A voice print is generated from the user utterance. A determination is made as to whether a plurality of conditions are satisfied. The plurality of conditions includes a first condition that the user authentication information corresponds to one or more authentication credentials assigned to a registered user of an electronic device. The plurality of conditions further includes a second condition that the first time and the second time are not separated by more than a predefined time period. In accordance with a determination that the plurality of conditions are satisfied, a speaker profile assigned to the registered user is updated based on the voice print.
2542 Citations
54 Claims
-
1. A non-transitory computer-readable storage medium storing one or more programs configured to be executed by one or more processors of an electronic device, the one or more programs including instructions for:
-
obtaining, at a first time, user authentication information; receiving, at a second time, a user utterance representing a request to perform one or more tasks; generating a voice print from the user utterance; determining whether a plurality of conditions are satisfied, the plurality of conditions including; a first condition that the user authentication information corresponds to one or more authentication credentials stored in the memory, wherein the one or more authentication credentials are assigned to a registered user of the electronic device; and a second condition that the first time and the second time are not separated by more than a predefined time period; in accordance with determining that the plurality of conditions are satisfied, updating, based on the voice print, a speaker profile assigned to the registered user, after updating the speaker profile, determining whether a total number of voice prints associated with the speaker profile is equal to a predefined number; and in accordance with determining that the total number of voice prints associated with the speaker profile is equal to a predefined number, providing an output prompting a user to activate a voice invocation feature of the digital assistant. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. An electronic device, comprising:
-
one or more processors; a memory; and one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by the one or more processors, the one or more programs including instructions for; obtaining, at a first time, user authentication information; receiving, at a second time, a user utterance representing a request to perform one or more tasks; generating a voice print from the user utterance; determining whether a plurality of conditions are satisfied, the plurality of conditions including; a first condition that the user authentication information corresponds to one or more authentication credentials stored in the memory, wherein the one or more authentication credentials are assigned to a registered user of the electronic device; and a second condition that the first time and the second time are not separated by more than a predefined time period; in accordance with determining that the plurality of conditions are satisfied, updating, based on the voice print, a speaker profile assigned to the registered user; after updating the speaker profile, determining whether a total number of voice prints associated with the speaker profile is equal to a predefined number; and in accordance with determining that the total number of voice prints associated with the speaker profile is equal to a predefined number, providing an output prompting a user to activate a voice invocation feature of the digital assistant. - View Dependent Claims (21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37)
-
-
20. A method for generating speaker recognition models in a digital assistant, the method comprising:
at an electronic device having a processor and memory; obtaining, at a first time, user authentication information; receiving, at a second time, a user utterance representing a request to perform one or more tasks; generating a voice print from the user utterance; determining whether a plurality of conditions are satisfied, the plurality of conditions including; a first condition that the user authentication information corresponds to one or more authentication credentials stored in the memory, wherein the one or more authentication credentials are assigned to a registered user of the electronic device; and a second condition that the first time and the second time are not separated by more than a predefined time period; in accordance with determining that the plurality of conditions are satisfied, updating, based on the voice print, a speaker profile assigned to the registered user; after updating the speaker profile, determining whether a total number of voice prints associated with the speaker profile is equal to a predefined number; and in accordance with determining that the total number of voice prints associated with the speaker profile is equal to a predefined number, providing an output prompting a user to activate a voice invocation feature of the digital assistant. - View Dependent Claims (38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54)
Specification