Joint Speaker Authentication and Key Phrase Identification
First Claim
1. A spoken command analyzer module comprising instructions embodied in one or more non-transitory machine accessible storage media, the spoken command analyzer module configured to cause a computing system comprising one or more computing devices to perform operations comprising:
- receive data representative of a current speech sample captured by a sound capture device;
with a model that models both user-specific acoustic properties of one or more prior speech samples and command-specific acoustic properties of the one or or more prior speech samples;
analyze the data to determine substantive content of the speech and whether the substantive content includes a command to effect an action by an associated device, andanalyze the data to determine identity of a human speaker of the speech and whether the identity matches an identity of a user who is authorized to issue the command; and
in response to determining that the identity matches the authorized user'"'"'s identity and determining that the data includes the command and determining that the identified user is authorized to issue the command, issue an instruction to effect performance of the action by the associated device.
1 Assignment
0 Petitions
Accused Products
Abstract
A spoken command analyzer computing system includes technologies configured to analyze information extracted from a speech sample and, using a joint speaker and phonetic content model, both determine whether the analyzed speech includes certain content (e.g., a command) and to identify the identity of the human speaker of the speech. In response to determining that the identity matches the authorized user'"'"'s identity and determining that the analyzed speech includes the modeled content (e.g., command), an action corresponding to the verified content (e.g., command) is performed by an associated device.
64 Citations
33 Claims
-
1. A spoken command analyzer module comprising instructions embodied in one or more non-transitory machine accessible storage media, the spoken command analyzer module configured to cause a computing system comprising one or more computing devices to perform operations comprising:
-
receive data representative of a current speech sample captured by a sound capture device; with a model that models both user-specific acoustic properties of one or more prior speech samples and command-specific acoustic properties of the one or or more prior speech samples; analyze the data to determine substantive content of the speech and whether the substantive content includes a command to effect an action by an associated device, and analyze the data to determine identity of a human speaker of the speech and whether the identity matches an identity of a user who is authorized to issue the command; and in response to determining that the identity matches the authorized user'"'"'s identity and determining that the data includes the command and determining that the identified user is authorized to issue the command, issue an instruction to effect performance of the action by the associated device. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A method of effecting an action by a device using human speech, the method comprising:
-
receiving data representative of human speech captured by a sound capture device associated with the device; with at least one computing device; using a model of both user-specific acoustic properties and command-specific acoustic properties of a user'"'"'s speech; analyzing the data to determine substantive content of the speech and whether the substantive content includes a command to effect an action by the device, and analyzing the data to determine identity of a human speaker of the speech and whether the identity matches an authorized user'"'"'s identity; and in response to determining that the identity matches the authorized user'"'"'s identity and determining that the data includes the command, issuing an instruction to effect performance of the action by the associated device. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
-
-
23. An apparatus for performing computing device effected actions, the apparatus comprising:
-
a sound capture device configured to output data representative of human speech captured by the sound capture device; at least one computing device configured to effect performance of an action response to receipt of a command; wherein the at least one computing device is configured to; jointly; analyze the data to determine substantive content of the speech and whether the substantive content includes the command to effect the action; analyze the data to determine identity of a human speaker of the speech and whether the identity matches an authorized user'"'"'s identity; and in response to determining that the identity matches the authorized user'"'"'s identity and determining that the data includes the command, issue an instruction to effect performance of the action. - View Dependent Claims (24, 25, 26, 27, 28, 29, 30, 31, 32, 33)
-
Specification