Systems and methods for cluster-based voice verification
Abstract
Systems for caller identification and authentication may include an authentication server. The authentication server may be configured to receive audio data including speech of a plurality of telephone calls, use audio data for at least a subset of the plurality of telephone calls to populate a plurality of word clusters each associated with a specific demographic, and/or use audio data for at least one of the plurality of telephone calls to identify the telephone caller making the telephone call based on determining a most similar word cluster of the plurality of word clusters to the audio data of the caller.
20 Claims
1. A method of authenticating a telephone caller, the method comprising:
receiving, by a processor of an authentication server, audio data including speech of the telephone caller;
analyzing, by the processor, the audio data to identify a plurality of words from the speech of the telephone caller and to identify an occurrence frequency for each of the plurality of words;
comparing, by the processor, the plurality of words and the occurrence frequencies to a plurality of word clusters, each word cluster comprising a plurality of associated words and an occurrence frequency for each of the plurality of associated words, and each word cluster being associated with one of a plurality of demographics;
determining, by the processor, a most similar word cluster of the plurality of word clusters to the audio data based on a similarity of the plurality of words and the plurality of associated words of the most similar cluster and a similarity of the occurrence frequencies of the plurality of words and the occurrence frequencies of the plurality of associated words of the most similar cluster;
receiving, by the processor, a purported identity of the telephone caller, the purported identity including caller demographic data;
comparing, by the processor, the caller demographic data to the demographic associated with the most similar word cluster; and
identifying, by the processor, the telephone caller as at least one of:
    likely having the purported identity in response to determining the caller demographic data matches the demographic associated with the most similar word cluster, and
    unlikely to have the purported identity in response to determining the caller demographic data matches a demographic associated with a word cluster different from the most similar word cluster.
Dependent claims: 2, 3, 4, 5, 6
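The steps of claim 1 can be illustrated with a minimal sketch. It assumes the speech has already been transcribed to text (speech recognition is out of scope here), and it uses cosine similarity over word-frequency vectors as one plausible way to combine word overlap and frequency similarity; the claim itself does not name a similarity measure. All function names, the demographic labels `region_a` and `region_b`, and the `"undetermined"` outcome for a claimed demographic that has no cluster are hypothetical choices made for illustration.

```python
import math
import re
from collections import Counter


def word_frequencies(transcript: str) -> dict[str, float]:
    """Return the relative occurrence frequency of each word in a transcript."""
    words = re.findall(r"[a-z']+", transcript.lower())
    if not words:
        return {}
    counts = Counter(words)
    return {word: count / len(words) for word, count in counts.items()}


def cosine_similarity(a: dict[str, float], b: dict[str, float]) -> float:
    """Cosine similarity of two sparse word-frequency vectors (an assumed measure)."""
    dot = sum(freq * b.get(word, 0.0) for word, freq in a.items())
    norm_a = math.sqrt(sum(f * f for f in a.values()))
    norm_b = math.sqrt(sum(f * f for f in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def most_similar_cluster(freqs: dict[str, float],
                         clusters: dict[str, dict[str, float]]) -> str:
    """Return the demographic label of the cluster most similar to the caller."""
    return max(clusters, key=lambda demo: cosine_similarity(freqs, clusters[demo]))


def authenticate(transcript: str, claimed_demographic: str,
                 clusters: dict[str, dict[str, float]]) -> str:
    """Compare the claimed demographic with that of the most similar cluster."""
    best = most_similar_cluster(word_frequencies(transcript), clusters)
    if claimed_demographic == best:
        return "likely"
    if claimed_demographic in clusters:
        return "unlikely"  # claimed demographic belongs to a different cluster
    return "undetermined"  # hypothetical outcome; not recited in the claim


# Hypothetical clusters keyed by demographic label: word -> occurrence frequency.
EXAMPLE_CLUSTERS = {
    "region_a": {"hello": 0.5, "account": 0.3, "please": 0.2},
    "region_b": {"hey": 0.5, "gonna": 0.3, "yeah": 0.2},
}
```

With these example clusters, a call transcribed as "hello, please check my account please" shares words only with `region_a`, so a caller claiming `region_a` would be flagged as likely genuine and a caller claiming `region_b` as unlikely.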
7. A method of identifying a telephone caller, the method comprising:
receiving, by a processor of an authentication server, audio data including speech of a plurality of telephone calls;
for at least a subset of the plurality of telephone calls, determining, by the processor, demographic data for a telephone caller making the telephone call;
for at least the subset of the plurality of telephone calls, analyzing, by the processor, the audio data to identify a plurality of words from the speech of the telephone caller;
receiving, by the processor, a plurality of word clusters, each word cluster associated with a specific demographic;
populating, by the processor, at least one word cluster with at least a subset of the plurality of words from the speech of each telephone caller associated with the specific demographic based on the demographic data for the telephone caller;
for each word cluster, determining, by the processor, a plurality of associated words from among at least the subset of the plurality of words and an occurrence frequency for each of the plurality of associated words; and
for at least one of the plurality of telephone calls:
    analyzing, by the processor, the audio data to identify a plurality of words from the speech of the telephone caller and to identify an occurrence frequency for each of the plurality of words,
    comparing, by the processor, the plurality of words from the speech of the telephone caller and the occurrence frequency for each of the plurality of words from the speech of the telephone caller to the plurality of word clusters,
    based on the comparing, identifying, by the processor, a most similar word cluster of the plurality of word clusters to the audio data based on a similarity of the plurality of words from the speech of the telephone caller and the plurality of associated words of the most similar cluster and a similarity of the occurrence frequencies of the plurality of words from the speech of the telephone caller and the occurrence frequencies of the plurality of associated words of the most similar cluster, and
    determining, by the processor, a caller demographic of the telephone caller, the caller demographic being the same as the demographic of the most similar word cluster.
Dependent claims: 8, 9, 10, 11, 12, 13, 14, 15
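The cluster-populating steps of claim 7 can be sketched as follows. This is only an illustration under stated assumptions: calls arrive as already-transcribed text paired with a known demographic label, and the "plurality of associated words" for each cluster is taken to be the `top_n` most frequent words, a selection rule the claim does not specify. The names `populate_clusters`, `tokenize`, and `top_n` are hypothetical.

```python
import re
from collections import Counter, defaultdict
from typing import Iterable


def tokenize(transcript: str) -> list[str]:
    """Split a transcript into lowercase word tokens."""
    return re.findall(r"[a-z']+", transcript.lower())


def populate_clusters(labeled_calls: Iterable[tuple[str, str]],
                      top_n: int = 50) -> dict[str, dict[str, float]]:
    """Build one word cluster per demographic from (demographic, transcript) pairs.

    Each cluster maps an associated word to its occurrence frequency among
    the words retained for that cluster.
    """
    counts: dict[str, Counter] = defaultdict(Counter)
    for demographic, transcript in labeled_calls:
        # Add this caller's words to the cluster for the caller's demographic.
        counts[demographic].update(tokenize(transcript))

    clusters: dict[str, dict[str, float]] = {}
    for demographic, counter in counts.items():
        # Assumed rule: keep only the top_n most frequent words as associated words.
        top_words = counter.most_common(top_n)
        total = sum(count for _, count in top_words)
        if total:
            clusters[demographic] = {word: count / total for word, count in top_words}
    return clusters
```

The resulting mapping from demographic to word frequencies is the kind of structure the comparing step of the claim would consume when a new call is analyzed; the demographic of whichever cluster scores as most similar would then be reported as the caller demographic.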
16. A system for caller identification and authentication, the system comprising:
a telephony recorder configured to record audio data for calls placed to at least one phone number;
an authentication server comprising a processor and a non-transitory memory, the memory storing instructions that, when executed by the processor, cause the processor to perform processing comprising:
    receiving audio data including speech of a plurality of telephone calls;
    using audio data for at least a subset of the plurality of telephone calls to populate a plurality of word clusters, each word cluster being associated with a specific demographic, the populating of the plurality of word clusters comprising:
        for each of the subset of the plurality of telephone calls, determining demographic data for a telephone caller making the telephone call, and analyzing the audio data to identify a plurality of words from the speech of the telephone caller, and
        populating at least one word cluster with at least a subset of the plurality of words from the speech of each telephone caller associated with the specific demographic based on the demographic data for the telephone caller; and
    using audio data for at least one of the plurality of telephone calls to identify the telephone caller making the telephone call, the identifying comprising:
        analyzing the audio data to identify a plurality of words from the speech of the telephone caller and to identify an occurrence frequency for each of the plurality of words,
        comparing the plurality of words and the occurrence frequencies to the plurality of word clusters,
        determining a most similar word cluster of the plurality of word clusters to the audio data based on a similarity of the plurality of words and the plurality of associated words of the most similar cluster and a similarity of the occurrence frequencies of the plurality of words and occurrence frequencies of the plurality of associated words of the most similar cluster,
        receiving a purported identity of the telephone caller, the purported identity including caller demographic data,
        determining whether the caller demographic data matches the demographic associated with the most similar word cluster, and
        identifying the telephone caller as:
            likely having the purported identity in response to determining that the caller demographic data matches the demographic associated with the most similar word cluster, or
            unlikely to have the purported identity in response to determining that the caller demographic data does not match the demographic associated with the most similar word cluster.
Dependent claims: 17, 18, 19, 20
Specification