×

Machine training for native language and fluency identification

  • US 10,431,203 B2
  • Filed: 09/05/2017
  • Issued: 10/01/2019
  • Est. Priority Date: 09/05/2017
  • Status: Active Grant
First Claim
Patent Images

1. A computer program product comprising a computer readable storage medium having program instructions embodied therewith, the program instructions executable by a device to cause a device to:

  • train a machine by a machine learning technique for recognizing speech utterance to determine language fluency level of a user,the training comprising at least;

    receiving native speaker recorded data from a database of recorded speech of at least one native speaker,receiving a language specific dictionary of heteronyms,parsing the native speaker recorded data and isolating the heteronyms from the native speaker recorded data,extracting linguistic features from the native speaker recorded data including at least linguistic features associated with the heteronyms, the linguistic features associated with the heteronyms including at least phonetics, andgenerating a language dependent machine learning model based at least on the linguistic features, wherein the language dependent machine learning model is trained to output a score indicating language fluency;

    generate a test corpus of sentences, wherein each sentence in the test corpus includes at least one pair of heteronyms, wherein heteronyms are words spelled identically but having different pronunciations and meanings from one another;

    cause presenting of a sentence from the test corpus to the user on a user interface display;

    receive a test speech utterance of the user uttering the presented sentence;

    execute the language dependent machine learning model operating on the test speech utterance to obtain user pronunciation of the presented sentence including the at least two heteronyms;

    evaluate a language fluency level of the user based on the obtained user pronunciation; and

    output a score representing the language fluency level of the user.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×