×

Blind diarization of recorded calls with arbitrary number of speakers

  • US 10,109,280 B2
  • Filed: 12/12/2017
  • Issued: 10/23/2018
  • Est. Priority Date: 07/17/2013
  • Status: Active Grant
First Claim
Patent Images

1. A method for obtaining a speaker-identified transcription from audio data of multiple speakers, the method comprising:

  • obtaining the audio data and an unlabeled transcription of the audio data;

    separating the audio data into a sequence of utterances, wherein each utterance has acoustic features;

    clustering utterances having similar acoustic features;

    generating a hidden Markov model (HMM) from the clustered utterances;

    decoding the sequence of utterances using the HMM to associate each utterance with one of the multiple speakers;

    determining the identity of one or more of the multiple speakers by comparing the utterances associated with each of the multiple speakers to acoustic voiceprint models of known speakers; and

    labeling portions of the transcription corresponding to utterances of identified speakers with the speaker'"'"'s identity to obtain the speaker-identified transcription.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×