System and method for speaker recognition on mobile devices
Abstract
A speaker recognition system for authenticating a mobile device user includes an enrollment and learning software module, a voice biometric authentication software module, and a secure software application. Upon request by a user of the mobile device, the enrollment and learning software module displays text prompts to the user, receives speech utterances from the user, and produces a voice biometric print. The enrollment and learning software module determines whether a voice biometric print has met at least a quality threshold before storing it on the mobile device. The secure software application prompts a user requiring authentication to repeat an utterance based at least on an attribute of a selected voice biometric print, receives a corresponding utterance, requests the voice biometric authentication software module to verify the identity of that user using the utterance, and, if the user is authenticated, imports the voice biometric print.
9 Claims
1. A method of operation of a speaker recognition system, comprising:
receiving, at an input of the speaker recognition system, at least one new voice sample from a user for speaker authentication;
determining a first offset value from scores assigned to previously received voice samples from said user, wherein the first offset value comprises an average of respective offsets of said scores from a mean score value;
determining a second offset value from at least one score assigned to the at least one new voice sample from said user;
determining whether a difference between said second offset value and said first offset value exceeds a proximity threshold; and
if the difference between said second offset value and said first offset value exceeds the proximity threshold, determining that further system training is required, and indicating to the user that said further system training is required.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A speaker recognition system, comprising:
an input for receiving at least one new voice sample from a user for speaker authentication, wherein the speaker recognition system is configured for performing a method comprising:
receiving at least one new voice sample from the user for speaker authentication;
determining a first offset value from scores assigned to previously received voice samples from said user, wherein the first offset value comprises an average of respective offsets of said scores from a mean score value;
determining a second offset value from at least one score assigned to the at least one new voice sample from said user;
determining whether a difference between said second offset value and said first offset value exceeds a proximity threshold; and
if the difference between said second offset value and said first offset value exceeds the proximity threshold, determining that further system training is required, and indicating to the user that said further system training is required.
9. A computer program product, comprising a non-transitory computer-readable medium, having stored thereon instructions for causing a processor to perform a method comprising:
receiving at least one new voice sample from a user for speaker authentication;
determining a first offset value from scores assigned to previously received voice samples from said user, wherein the first offset value comprises an average of respective offsets of said scores from a mean score value;
determining a second offset value from at least one score assigned to the at least one new voice sample from said user;
determining whether a difference between said second offset value and said first offset value exceeds a proximity threshold; and
if the difference between said second offset value and said first offset value exceeds the proximity threshold, determining that further system training is required, and indicating to the user that said further system training is required.
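The offset comparison recited in the independent claims can be illustrated with a minimal Python sketch. It is one possible reading, not the patented implementation. The claims do not say how the mean score value is obtained, whether offsets are signed, or how the difference is measured, so the sketch assumes the mean is taken over the previously received scores, offsets are absolute deviations from that mean, and the difference is an absolute difference. The function names, the score scale, and the threshold value are hypothetical.

```python
from statistics import fmean


def first_offset(previous_scores, mean_score):
    """Average absolute offset of earlier scores from the mean score value (assumed reading)."""
    return fmean([abs(s - mean_score) for s in previous_scores])


def second_offset(new_scores, mean_score):
    """Average absolute offset of the new sample score(s) from the same mean (assumed reading)."""
    return fmean([abs(s - mean_score) for s in new_scores])


def needs_further_training(previous_scores, new_scores, proximity_threshold):
    """Return True when the two offsets differ by more than the proximity threshold."""
    # Assumption: the "mean score value" is the mean of the previously received scores.
    mean_score = fmean(previous_scores)
    difference = abs(
        second_offset(new_scores, mean_score)
        - first_offset(previous_scores, mean_score)
    )
    return difference > proximity_threshold
```

A caller would pass the stored scores and the score of the new sample, for example needs_further_training([0.80, 0.82, 0.78, 0.80], [0.50], 0.05), and, when the result is True, show the user a message that further system training is required.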
Specification