
Automated sorting of voice messages through speaker spotting

  • US 5,271,088 A
  • Filed: 04/07/1993
  • Issued: 12/14/1993
  • Est. Priority Date: 05/13/1991
  • Status: Expired due to Term
First Claim

1. In a method of automatically recognizing a speaker on a communication channel, including the steps of digitizing input speech signals into a series of frames of digital data representing the input speech, analyzing the speech frames by a speaker recognition module which compares the incoming speech to a reference set of speech features of a given group of different speakers obtained during prior training sessions and generates respective match score therefrom, and determining which speaker the input speech is identified with based upon the match scores with each speaker associated with at least one stored reference frame, in combination therewith, the improvement wherein:

  • said analysis of speech frames by said speaker recognition module is implemented through the use of a set of speech feature vectors to characterize a given speaker's speech patterns, said speech feature vectors being non-parametric in nature and said comparison of incoming speech to reference speech features by said speaker recognition module includes generating a match score which is a sum of a ScoreA set equal to the average of the minimum Euclidean squared distance between the unknown speech frame and all reference frames of a given speaker over all frames of the unknown input, and ScoreB set equal to the average of the minimum Euclidean squared distance between each frame of the reference set to all frames of the unknown input, over all frames of the reference set of speech features, wherein the "distance" from uj to the reference message R is;

    ##EQU15## and the "distance" from ri to the unknown message U is;

    ##EQU16## wherein uj is the j-th frame of unknown message U and ri is the i-th frame of reference message R, and ##EQU17## and wherein said comparison of incoming speech to reference speech features includes a step of normalizing said match score with respect to said stored reference frame for all speakers to provide a normalized score and comparing all normalized scores for all speakers to select the speaker having a highest acceptable normalized score.
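
The match score described in words in the claim (ScoreA plus ScoreB) can be illustrated with a minimal sketch. It follows only the verbal description, since the equation images are not reproduced on this page. The function names `sq_dist`, `min_dist` and `match_score`, the list-of-lists frame layout, and the sample vectors are all assumptions made for illustration; they do not come from the patent. The normalization step is left out because the claim text here does not give its formula.

```python
def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def min_dist(frame, frames):
    """Smallest squared distance from one frame to any frame in `frames`."""
    return min(sq_dist(frame, other) for other in frames)


def match_score(unknown, reference):
    """Sum of ScoreA and ScoreB for one unknown message and one reference set.

    ScoreA: for each unknown frame, the minimum squared distance to any
            reference frame, averaged over all unknown frames.
    ScoreB: for each reference frame, the minimum squared distance to any
            unknown frame, averaged over all reference frames.
    """
    score_a = sum(min_dist(u, reference) for u in unknown) / len(unknown)
    score_b = sum(min_dist(r, unknown) for r in reference) / len(reference)
    return score_a + score_b


# Hypothetical two-dimensional feature frames, for illustration only.
unknown = [[0.0, 0.0], [1.0, 1.0]]
reference = [[0.0, 0.0], [3.0, 1.0], [1.0, 2.0]]
print(match_score(unknown, reference))
```

Because both averages are built from distances, the raw sum in this sketch is zero when the two frame sets coincide and grows as they diverge. How that raw value is turned into the normalized score that the claim compares across speakers is not specified in the text above.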
