×

Named entity recognition on chat data

  • US 10,765,956 B2
  • Filed: 01/07/2016
  • Issued: 09/08/2020
  • Est. Priority Date: 01/07/2016
  • Status: Active Grant
First Claim
Patent Images

1. A method comprisingperforming by one or more computers:

  • training a statistical classifier to identify named entities using training data comprising a plurality of features, wherein one of the features is a word shape feature that comprises a respective token for each letter of a respective word, the respective token indicating that each letter of the respective word is one of an upper case letter, a lower case letter, and a digit;

    receiving a plurality of word strings in a first language, each received word string comprising a plurality of words;

    identifying at least one named entity in each received word string using the trained statistical classifier; and

    translating the received word strings from the first language to a second language, wherein translating comprises preserving the identified at least one named entity in the first language.

View all claims
  • 6 Assignments
Timeline View
Assignment View
    ×
    ×