User authentication
First Claim
1. A device comprising:
- a processor configured to;
extract a set of parameters from an audio signal;
perform liveness verification by determining, based on a first plurality of parameters and a liveness data model, whether the audio signal corresponds to a first audio type indicating spoken speech or a second audio type indicating playback of recorded speech;
perform user verification by determining, based on a second plurality of parameters and a user speech model, whether the audio signal corresponds to speech of a particular user associated with the user speech model and that the audio signal corresponds to the first audio type, and refrain from performing the user verification based on determining that the audio signal corresponds to the second audio type;
perform keyword verification by determining, based on a third plurality of parameters and a keyword data model, whether the audio signal corresponds to a particular keyword, wherein the set of parameters includes the first plurality of parameters, the second plurality of parameters, and the third plurality of parameters; and
generate an output indicating that user authentication is successful in response to determining that the audio signal corresponds to the speech of the particular user, to the particular keyword, and to the first audio type.
1 Assignment
0 Petitions
Accused Products
Abstract
A device includes a processor configured to extract parameters from an audio signal. The processor is configured to perform liveness verification by determining, based on first parameters and a liveness data model, whether the audio signal corresponds to a first audio type indicating spoken speech, to perform user verification by determining, based on second parameters and a user speech model, whether the audio signal corresponds to speech of a particular user, and to perform keyword verification by determining, based on third parameters and a keyword data model, whether the audio signal corresponds to a particular keyword. The processor is configured to generate an output indicating that user authentication is successful in response to determining that the audio signal corresponds to speech of the particular user, to the particular keyword, and to the first audio type.
44 Citations
29 Claims
-
1. A device comprising:
a processor configured to; extract a set of parameters from an audio signal; perform liveness verification by determining, based on a first plurality of parameters and a liveness data model, whether the audio signal corresponds to a first audio type indicating spoken speech or a second audio type indicating playback of recorded speech; perform user verification by determining, based on a second plurality of parameters and a user speech model, whether the audio signal corresponds to speech of a particular user associated with the user speech model and that the audio signal corresponds to the first audio type, and refrain from performing the user verification based on determining that the audio signal corresponds to the second audio type; perform keyword verification by determining, based on a third plurality of parameters and a keyword data model, whether the audio signal corresponds to a particular keyword, wherein the set of parameters includes the first plurality of parameters, the second plurality of parameters, and the third plurality of parameters; and generate an output indicating that user authentication is successful in response to determining that the audio signal corresponds to the speech of the particular user, to the particular keyword, and to the first audio type. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
14. A method comprising:
-
receiving an audio signal at a device; extracting, at the device, a set of parameters from the audio signal; performing, at the device, liveness verification by determining, based on a first plurality of parameters and a liveness data model, whether the audio signal corresponds to a first audio type indicating spoken speech or a second audio type indicating playback of recorded speech; performing, at the device, user verification by determining, based on a second plurality of parameters and a user speech model, whether the audio signal corresponds to speech of a particular user associated with the user speech model, wherein the user speech model is distinct from the liveness data model; performing, at the device, keyword verification by determining, based on a third plurality of parameters and a keyword data model, whether the audio signal corresponds to a particular keyword, wherein the set of parameters includes the first plurality of parameters, the second plurality of parameters, and the third plurality of parameters; and generating, at the device, an output indicating that user authentication is successful based on determining that the audio signal corresponds to the speech of the particular user, to the particular keyword, and to the first audio type. - View Dependent Claims (15, 16, 17, 18, 19)
-
-
20. A computer-readable storage device storing instructions that, when executed by a processor, cause the processor to perform operations comprising:
-
extracting a set of parameters from an audio signal; performing liveness verification by determining, based on a first plurality of parameters and a liveness data model, whether the audio signal corresponds to a first audio type indicating spoken speech or a second audio type indicating playback of recorded speech; performing user verification by determining, based on a second plurality of parameters and a user speech model, whether the audio signal corresponds to speech of a particular user associated with the user speech model; performing keyword verification by determining, based on a third plurality of parameters and a keyword data model, whether the audio signal corresponds to a particular keyword, wherein the set of parameters includes the first plurality of parameters, the second plurality of parameters, and the third plurality of parameters; generating an output indicating that user authentication is successful in response to determining that the audio signal corresponds to the speech of the particular user, to the particular keyword, and to the first audio type; and updating the user speech model based on the set of parameters in response to determining that the user authentication is successful. - View Dependent Claims (21, 22, 23, 24, 25)
-
-
26. An apparatus comprising:
-
means for generating an output signal responsive to receiving an audio signal; means for extracting a set of parameters from the output signal; means for performing liveness verification by determining, based on a first plurality of parameters and a liveness data model, whether the audio signal corresponds to a first audio type indicating spoken speech or a second audio type indicating playback of recorded speech; means for performing user verification by determining, based on a second plurality of parameters and a user speech model, whether the audio signal corresponds to speech of a particular user associated with the user speech model, wherein the user speech model is distinct from the liveness data model; means for performing keyword verification by determining, based on a third plurality of parameters and a keyword data model, whether the audio signal corresponds to a particular keyword, wherein the set of parameters includes the first plurality of parameters, the second plurality of parameters, and the third plurality of parameters; and means for generating an output indicating that user authentication is successful in response to determining that the audio signal corresponds to the speech of the particular user, to the particular keyword, and to the first audio type. - View Dependent Claims (27)
-
-
28. A device comprising:
a processor configured to; extract a set of parameters from an audio signal; perform liveness verification by determining, based on a first plurality of parameters and a liveness data model, whether the audio signal corresponds to a first audio type indicating spoken speech or a second audio type indicating playback of recorded speech; perform user verification by determining, based on a second plurality of parameters and a user speech model, whether the audio signal corresponds to speech of a particular user associated with the user speech; perform keyword verification by determining, based on a third plurality of parameters and a keyword data model, whether the audio signal corresponds to a particular keyword, wherein the set of parameters includes the first plurality of parameters, the second plurality of parameters, and the third plurality of parameters; generate an output indicating that user authentication is successful in response to determining that the audio signal corresponds to the speech of the particular user, to the particular keyword, and to the first audio type; and generate a second output indicating that the user authentication failed in response to determining that the audio signal corresponds to the second audio type, wherein the second output is generated independently of performing the keyword verification and the user verification.
-
29. A device comprising:
a processor configured to; extract a set of parameters from an audio signal; perform liveness verification by determining, based on a first plurality of parameters and a liveness data model, whether the audio signal corresponds to a first audio type indicating spoken speech or a second audio type indicating playback of recorded speech, wherein the liveness data model is user-independent; perform user verification by determining, based on a second plurality of parameters and a user speech model, whether the audio signal corresponds to speech of a particular user associated with the user speech model; perform keyword verification by determining, based on a third plurality of parameters and a keyword data model, whether the audio signal corresponds to a particular keyword, wherein the set of parameters includes the first plurality of parameters, the second plurality of parameters, and the third plurality of parameters; and generate an output indicating that user authentication is successful in response to determining that the audio signal corresponds to the speech of the particular user, to the particular keyword, and to the first audio type.
Specification