Voice trigger for a digital assistant
First Claim
1. A method for operating a voice trigger, performed at an electronic device including one or more processors and memory storing instructions for execution by the one or more processors, the method comprising:
- determining, based on comparing an amount of light detected on at least a front surface of the electronic device to a threshold amount of light, whether to operate the voice trigger in a standby mode or in a listening mode;
in accordance with a determination to operate the voice trigger in the listening mode;
receiving a sound input;
generating an input representation of the sound input, wherein the input representation represents audio signatures of the sound input;
determining whether at least a portion of the sound input corresponds to a predetermined type of sound;
upon a determination that at least a portion of the sound input corresponds to the predetermined type, determining whether the sound input includes predetermined content based on comparing of the input representation of the sound input to one or more reference representations;
upon a determination that the sound input includes the predetermined content, generating a control signal comprising instructions to initiate a speech-based service; and
initiating the speech-based service based on the control signal; and
in accordance with a determination to operate the voice trigger in the standby mode, forgoing initiating the speech-based service based on received sound input.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for operating a voice trigger is provided. In some implementations, the method is performed at an electronic device including one or more processors and memory storing instructions for execution by the one or more processors. The method includes receiving a sound input. The sound input may correspond to a spoken word or phrase, or a portion thereof. The method includes determining whether at least a portion of the sound input corresponds to a predetermined type of sound, such as a human voice. The method includes, upon a determination that at least a portion of the sound input corresponds to the predetermined type, determining whether the sound input includes predetermined content, such as a predetermined trigger word or phrase. The method also includes, upon a determination that the sound input includes the predetermined content, initiating a speech-based service, such as a voice-based digital assistant.
2949 Citations
48 Claims
-
1. A method for operating a voice trigger, performed at an electronic device including one or more processors and memory storing instructions for execution by the one or more processors, the method comprising:
-
determining, based on comparing an amount of light detected on at least a front surface of the electronic device to a threshold amount of light, whether to operate the voice trigger in a standby mode or in a listening mode; in accordance with a determination to operate the voice trigger in the listening mode; receiving a sound input; generating an input representation of the sound input, wherein the input representation represents audio signatures of the sound input; determining whether at least a portion of the sound input corresponds to a predetermined type of sound; upon a determination that at least a portion of the sound input corresponds to the predetermined type, determining whether the sound input includes predetermined content based on comparing of the input representation of the sound input to one or more reference representations; upon a determination that the sound input includes the predetermined content, generating a control signal comprising instructions to initiate a speech-based service; and initiating the speech-based service based on the control signal; and in accordance with a determination to operate the voice trigger in the standby mode, forgoing initiating the speech-based service based on received sound input. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. A non-transitory computer-readable storage medium, storing one or more programs for execution by one or more processors of an electronic device, the one or more programs including instructions for:
-
determining, based on comparing an amount of light detected on at least a front surface of the electronic device to a threshold amount of light, whether to operate a voice trigger in a standby mode or in a listening mode; in accordance with a determination to operate the voice trigger in the listening mode; receiving a sound input; generating an input representation of the sound input, wherein the input representation represents audio signatures of the sound input; determining whether at least a portion of the sound input corresponds to a predetermined type of sound; upon a determination that at least a portion of the sound input corresponds to the predetermined type, determining whether the sound input includes predetermined content based on comparing of the input representation of the sound input to one or more reference representations; upon a determination that the sound input includes the predetermined content, generating a control signal comprising instructions to initiate a speech based service; and initiating the speech-based service based on the control signal; and in accordance with a determination to operate the voice trigger in the standby mode, forgoing initiating the speech-based service based on received sound input. - View Dependent Claims (20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34)
-
-
18. An electronic device, comprising:
-
a sound receiving unit configured to receive sound input; and a processing unit coupled to the sound receiving unit, the processing unit configured to; determine, based on comparing an amount of light detected on at least a front surface of the electronic device to a threshold amount of light, whether to operate a voice trigger in a standby mode or in a listening mode; in accordance with a determination to operate the voice trigger in the listening mode; generate an input representation of the sound input, wherein the input representation represents audio signatures of the sound input; determine whether at least a portion of the sound input corresponds to a predetermined type of sound; upon a determination that at least a portion of the sound input corresponds to the predetermined type, determine whether the sound input includes predetermined content based on comparing of the input representation of the sound input to one or more reference representations; upon a determination that the sound input includes the predetermined content, generate a control signal comprising instructions to initiate a speech-based service; and initiate the speech-based service based on the control signal; and in accordance with a determination to operate the voice trigger in the standby mode, forgo initiating the speech-based service based on received sound input. - View Dependent Claims (19, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48)
-
Specification