×

Systems and methods for language detection

  • US 9,535,896 B2
  • Filed: 05/23/2016
  • Issued: 01/03/2017
  • Est. Priority Date: 10/17/2014
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method of identifying a language of a message, the method comprising:

  • performing a plurality of language detection tests on text, each language detection test determining a respective set of scores, each score in the set of scores representing a likelihood that the message is in a respective language of a plurality of different languages;

    providing one or more combinations of the score sets as input to one or more distinct classifiers including a first classifier and a second classifier, wherein the first classifier was trained using outputs from a first combination of the language detection tests and the second classifier was trained using outputs from a different second combination of the language detection tests;

    obtaining as output from each of the one or more classifiers a respective indication that the message is in one of the plurality of different languages, the indication comprising a confidence score; and

    identifying the language of the message based on one of the confidence scores.

View all claims
  • 6 Assignments
Timeline View
Assignment View
    ×
    ×