SYSTEM AND METHOD FOR DYNAMIC FACIAL FEATURES FOR SPEAKER RECOGNITION
First Claim
1. A method of performing speaker verification, the method comprising:
receiving a request to verify a speaker;
generating a text challenge that is unique to the request;
in response to the request, prompting the speaker to utter the text challenge;
recording a dynamic image feature of the speaker as the speaker utters the text challenge; and
performing speaker verification based on the dynamic image feature and the text challenge.
Abstract
Disclosed herein are systems, methods, and non-transitory computer-readable storage media for performing speaker verification. A system configured to practice the method receives a request to verify a speaker, generates a text challenge that is unique to the request, and, in response to the request, prompts the speaker to utter the text challenge. Then the system records a dynamic image feature of the speaker as the speaker utters the text challenge, and performs speaker verification based on the dynamic image feature and the text challenge. Recording the dynamic image feature of the speaker can include recording video of the speaker while speaking the text challenge. The dynamic feature can include a movement pattern of head, lips, mouth, eyes, and/or eyebrows of the speaker. The dynamic image feature can relate to phonetic content of the speaker speaking the challenge, speech prosody, and the speaker's facial expression responding to content of the challenge.
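The abstract does not say how a challenge is made unique to a request. As a minimal sketch, assuming a hypothetical word list and an in-memory record of previously issued phrases (neither is taken from the patent), a per-request challenge could be drawn like this:

```python
import secrets

# Hypothetical vocabulary; the source does not specify how challenges are composed.
WORDS = ["amber", "river", "candle", "orbit", "meadow", "signal", "harbor", "violet"]


def generate_text_challenge(issued: set[str], n_words: int = 4) -> str:
    """Return a phrase that has not been issued before and record it in `issued`."""
    # Guard against looping forever once the phrase space is used up.
    if len(issued) >= len(WORDS) ** n_words:
        raise RuntimeError("no unused challenge phrases remain")
    while True:
        # secrets.choice draws from a cryptographically strong random source,
        # so a phrase is hard to predict and pre-record.
        phrase = " ".join(secrets.choice(WORDS) for _ in range(n_words))
        if phrase not in issued:
            issued.add(phrase)
            return phrase


if __name__ == "__main__":
    issued_phrases: set[str] = set()
    print(generate_text_challenge(issued_phrases))
```

Drawing an unpredictable phrase for every request is one plausible way to make a replayed recording of an earlier session fail, since the replay would contain the wrong words and the wrong lip movements.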
20 Claims
1. A method of performing speaker verification, the method comprising:
receiving a request to verify a speaker;
generating a text challenge that is unique to the request;
in response to the request, prompting the speaker to utter the text challenge;
recording a dynamic image feature of the speaker as the speaker utters the text challenge; and
performing speaker verification based on the dynamic image feature and the text challenge.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
9. A system for identifying a user, the system comprising:
a processor;
a first module configured to control the processor to prompt the user to utter a unique text challenge;
a second module configured to control the processor to record audio and video of the user while the user utters the unique text challenge;
a third module configured to control the processor to perform a comparison of the audio and the video to a database of observable behavior based on the unique text challenge;
a fourth module configured to control the processor to identify the user based on the comparison.
Dependent claims: 10, 11, 12, 13, 14
15. A non-transitory computer-readable storage medium storing instructions for serving requests for speaker verification which, when executed by a computing device, cause the computing device to perform steps comprising:
receiving, from a user verification device, a request to confirm a user identity;
retrieving a user profile associated with the user identity;
generating a unique text challenge based on the user profile;
instructing the user verification device to prompt the user to utter the unique text challenge;
receiving from the user verification device an audio recording and a video recording of the user uttering the unique text challenge;
performing an analysis of the audio recording and the video recording based on the user profile; and
sending a confirmation to the user verification device if the analysis meets a verification threshold.
Dependent claims: 16, 17, 18, 19, 20
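The final step of claim 15 sends a confirmation only if the analysis meets a verification threshold. A minimal sketch of that decision follows, assuming the audio and video analyses each produce a match score in the range 0 to 1 and are combined by a weighted average. The `UserProfile` fields, the weights, and the threshold value are all hypothetical; the claim does not specify them.

```python
from dataclasses import dataclass


@dataclass
class UserProfile:
    """Hypothetical per-user settings; field names are not from the source."""
    user_id: str
    threshold: float = 0.8      # assumed verification threshold
    audio_weight: float = 0.5   # assumed share of the audio score in the fused score


def fuse_scores(profile: UserProfile, audio_score: float, video_score: float) -> float:
    """Weighted average of the audio and video match scores (each assumed in [0, 1])."""
    w = profile.audio_weight
    return w * audio_score + (1.0 - w) * video_score


def meets_verification_threshold(
    profile: UserProfile, audio_score: float, video_score: float
) -> bool:
    """True when the fused score reaches the profile's threshold."""
    return fuse_scores(profile, audio_score, video_score) >= profile.threshold


if __name__ == "__main__":
    profile = UserProfile(user_id="user-123")
    if meets_verification_threshold(profile, audio_score=0.9, video_score=0.9):
        print("send confirmation to the user verification device")
```

Keeping the threshold in the profile reflects the claim's wording that the analysis is performed "based on the user profile", although other designs, such as a single system-wide threshold, would fit the claim equally well.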
Specification