SPEAKER IDENTIFICATION AND UNSUPERVISED SPEAKER ADAPTATION TECHNIQUES
First Claim
1. A method for operating a virtual assistant, the method comprising:
- at an electronic device;
receiving, at the electronic device, an audio input comprising user speech;
determining whether a speaker of the user speech is a predetermined user based at least in part on a speaker profile for the predetermined user; and
in accordance with a determination that the speaker of the user speech is the predetermined user, adding the audio input comprising user speech to the speaker profile for the predetermined user.
1 Assignment
0 Petitions
Accused Products
Abstract
Systems and processes for generating a speaker profile for use in performing speaker identification for a virtual assistant are provided. One example process can include receiving an audio input including user speech and determining whether a speaker of the user speech is a predetermined user based on a speaker profile for the predetermined user. In response to determining that the speaker of the user speech is the predetermined user, the user speech can be added to the speaker profile and operation of the virtual assistant can be triggered. In response to determining that the speaker of the user speech is not the predetermined user, the user speech can be added to an alternate speaker profile and operation of the virtual assistant may not be triggered. In some examples, contextual information can be used to verify results produced by the speaker identification process.
431 Citations
27 Claims
-
1. A method for operating a virtual assistant, the method comprising:
at an electronic device; receiving, at the electronic device, an audio input comprising user speech; determining whether a speaker of the user speech is a predetermined user based at least in part on a speaker profile for the predetermined user; and in accordance with a determination that the speaker of the user speech is the predetermined user, adding the audio input comprising user speech to the speaker profile for the predetermined user. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
14. A non-transitory computer-readable storage medium comprising instructions for:
-
receiving an audio input comprising user speech; determining whether a speaker of the user speech is a predetermined user based at least in part on a speaker profile for the predetermined user; and in accordance with a determination that the speaker of the user speech is the predetermined user, adding the audio input comprising user speech to the speaker profile for the predetermined user.
-
-
15. A system comprising:
-
one or more processors; memory storing one or more programs configured to be executed by the one or more processors, the one or more programs including instructions for; receiving an audio input comprising user speech; determining whether a speaker of the user speech is a predetermined user based at least in part on a speaker profile for the predetermined user; and in accordance with a determination that the speaker of the user speech is the predetermined user, adding the audio input comprising user speech to the speaker profile for the predetermined user. - View Dependent Claims (16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27)
-
Specification