×

Systems and methods to develop training set of data based on resume corpus

  • US 10,748,118 B2
  • Filed: 04/05/2016
  • Issued: 08/18/2020
  • Est. Priority Date: 04/05/2016
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method comprising:

  • acquiring, by a computing system, a resume corpus;

    processing, by the computing system, the resume corpus to generate resume tokens from the resume corpus, wherein the processing comprises;

    determining a ratio based on co-occurrence of a first word and a second word of the resume corpus versus individual occurrence of the first word and the second word; and

    determining, based on the ratio, the existence of a bigram including the first word and the second word to be used as training data;

    training, by the computing system, a machine learning model to recommend a job classification based at least in part on the bigram; and

    applying, by the computing system, the machine learning model to recommend a job classification based on evaluation data.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×