Methods and system for reducing false positive voice print matching
First Claim
1. A method of reducing false positive matches in voice prints which comprises:
- receiving an audio communication from an unknown caller, separating a first portion of the audio communication into silent and non-silent segments, and evaluating the non-silent segments to determine which portions thereof are speech or non-speech;
generating a plurality of parameters that determine what is speech and non-speech in the non-silent segments;
using the generated parameters to determine what is speech and non-speech for at least the remainder of the audio communication;
comparing the speech to selected audio elements of a universal background model (UBM) that characterizes the speech of the unknown caller relative to a plurality of other audio elements of the UBM;
comparing the selected audio elements of the speech to matching audio elements of a recorded voice print from a plurality of fraudulent speakers to determine whether the speech belongs to a fraudulent speaker;
generating a first score based on the number of selected audio elements matching audio elements of the fraudulent speaker from the plurality of fraudulent speakers;
determining if the first score exceeds a predetermined threshold score for the fraudulent speaker; and
comparing the selected audio elements for the unknown caller where the score exceeds the predetermined threshold score to the selected audio elements of a voice print associated with a customer account and generating a second score based on the number of selected audio elements matching audio elements associated with the customer account;
wherein if the value of the first score exceeds the value of the second score, then generating an alert for display on an agent workstation.
1 Assignment
0 Petitions
Accused Products
Abstract
The methods, apparatus, and systems described herein are designed to reduce false positive voice print matching with fraudulent callers. A voice print of a call is created and compared to known voice prints to determine if it matches one or more of the known voice prints, and to transaction data associated with a database of voice prints. The methods include a pre-processing step to separate speech from non-speech, selecting a number of elements that affect the voice print the most, and/or generating a first score based on the number of selected audio elements matching audio elements of a voice print from the plurality of fraudulent speakers, determining if the first score exceeds a predetermined threshold score for the fraudulent speaker, and comparing the selected audio elements for the unknown caller, where the score exceeds the predetermined threshold score, to the voice prints associated with the customer account.
-
Citations
18 Claims
-
1. A method of reducing false positive matches in voice prints which comprises:
-
receiving an audio communication from an unknown caller, separating a first portion of the audio communication into silent and non-silent segments, and evaluating the non-silent segments to determine which portions thereof are speech or non-speech; generating a plurality of parameters that determine what is speech and non-speech in the non-silent segments; using the generated parameters to determine what is speech and non-speech for at least the remainder of the audio communication; comparing the speech to selected audio elements of a universal background model (UBM) that characterizes the speech of the unknown caller relative to a plurality of other audio elements of the UBM; comparing the selected audio elements of the speech to matching audio elements of a recorded voice print from a plurality of fraudulent speakers to determine whether the speech belongs to a fraudulent speaker; generating a first score based on the number of selected audio elements matching audio elements of the fraudulent speaker from the plurality of fraudulent speakers; determining if the first score exceeds a predetermined threshold score for the fraudulent speaker; and comparing the selected audio elements for the unknown caller where the score exceeds the predetermined threshold score to the selected audio elements of a voice print associated with a customer account and generating a second score based on the number of selected audio elements matching audio elements associated with the customer account; wherein if the value of the first score exceeds the value of the second score, then generating an alert for display on an agent workstation. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A non-transitory computer readable medium comprising a plurality of instructions stored therein that are adapted to reduce false positive matches when analyzing voice prints, the plurality of instructions comprising:
-
instructions that, when executed, receive an audio communication from an unknown caller, separating a first portion of the audio communication into silent and non-silent segments, and evaluating the non-silent segments to determine which portions thereof are speech or non-speech; instructions that, when executed, generate a plurality of parameters that determine what is speech and non-speech in the non-silent segments; instructions that, when executed, use the generated parameters to determine what is speech and non-speech for at least the remainder of the audio communication; instructions that, when executed, compare the speech to selected audio elements of a universal background model (UBM) that characterizes the speech of the unknown caller relative to a plurality of other audio elements of the UBM; instructions that, when executed, compare the selected audio elements of the speech to matching audio elements of a recorded voice print from a plurality of fraudulent speakers to determine whether the speech belongs to a fraudulent speaker; instructions that, when executed, generate a first score based on the number of selected audio elements matching audio elements of the fraudulent speaker from the plurality of fraudulent speakers; instructions that, when executed, determine if the first score exceeds a predetermined threshold score for the fraudulent speaker; and instructions that, when executed, compare the selected audio elements for the unknown caller where the score exceeds the predetermined threshold score to the selected audio elements of a voice print associated with a customer account and generating a second score based on the number of selected audio elements matching audio elements associated with the customer account; wherein if the value of the first score exceeds the value of the second score, then instructions that, when executed, generate an alert for display on an agent workstation. - View Dependent Claims (7, 8, 9, 10)
-
-
11. An audible fraud detection system adapted to minimize false positive voice print matches, which comprises:
-
a node comprising a processor and a non-transitory computer readable medium operably coupled thereto, the non-transitory computer readable medium comprising a plurality of instructions stored therein and being accessible to, and executable by, the processor, where the plurality of instructions comprises; instructions that, when executed, receive an audio communication from an unknown caller, separating a first portion of the audio communication into silent and non-silent segments, and evaluating the non-silent segments to determine which portions thereof are speech or non-speech; instructions that, when executed, generate a plurality of parameters that determine what is speech and non-speech in the non-silent segments; instructions that, when executed, use the generated parameters to determine what is speech and non-speech for at least the remainder of the audio communication; instructions that, when executed, compare the speech to selected audio elements of a universal background model (UBM) that characterizes the speech of the unknown caller relative to a plurality of other audio elements of the UBM; instructions that, when executed, compare the selected audio elements of the speech to matching audio elements of a recorded voice print from a plurality of fraudulent speakers to determine whether the speech belongs to a fraudulent speaker; instructions that, when executed, generate a first score based on the number of selected audio elements matching audio elements of the fraudulent speaker from the plurality of fraudulent speakers; instructions that, when executed, determine if the first score exceeds a predetermined threshold score for the fraudulent speaker; and instructions that, when executed, compare the selected audio elements for the unknown caller where the score exceeds the predetermined threshold score to the selected audio elements of a voice print associated with a customer account and generating a second score based on the number of selected audio elements matching audio elements associated with the customer account; wherein if the value of the first score exceeds the value of the second score, then instructions that, when executed, generate an alert for display on an agent workstation. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18)
-
Specification