Please download the dossier by clicking on the dossier button x
×

Automated identification of verbal records using boosted classifiers to improve a textual transcript

  • US 10,224,036 B2
  • Filed: 06/15/2017
  • Issued: 03/05/2019
  • Est. Priority Date: 10/05/2010
  • Status: Active Grant
First Claim
Patent Images

1. A computerized method for automated identification of audio records to improve a textual transcript, the method comprising the steps of:

  • selecting a plurality of audio records from a database, the each of the plurality of audio records comprising supporting information;

    obtaining a first speech recognition output of the each of the plurality of audio records using a pooled language model;

    obtaining a second speech recognition output of the each of the plurality of audio records using a balanced language model;

    processing the each of the audio records into a feature vector comprising the first speech recognition output, the second speech recognition output, and speech recognition output comprising a plurality of audio record feature vectors, selected from a group consisting of a number of silences, a noise threshold, a number of silences per second, a total duration of silence per second, an average amplitude in the audio record, a standard deviation of the amplitude in the audio record, a number of long silences per second, and a total duration of long silences per second;

    creating a plurality of basic classifiers based on the feature vector and selected from a group consisting of language classifiers, decision tree classifiers, and k-nearest classifiers;

    evaluating the each of the plurality of basic classifiers;

    creating a plurality of boosted classifiers, the each of the plurality of boosted classifiers being a combination of the each of the plurality of basic classifiers;

    testing the performance of the each of the boosted classifiers on a test set of training vectors, and determining which of the each of the boosted classifiers performed the best;

    adding one of the plurality of basic classifiers to the boosted classifier, based on a first vector weight;

    testing the performance of the boosted classifiers;

    adjusting the first vector weight and testing the performance of the each of the boosted classifiers on the test set of training vectors, and determining which of the each of the boosted classifiers performs the best;

    selecting a best boosted classifier; and

    saving the best boosted classifier and supporting structures.

View all claims
  • 0 Assignments
Timeline View
Assignment View
    ×
    ×