Efficient language identification

  • US 20060184357A1
  • Filed: 02/11/2005
  • Published: 08/17/2006
  • Est. Priority Date: 02/11/2005
  • Status: Active Grant
First Claim

1. A method of identifying the natural language of text comprising the steps of:

  • receiving text documents written in a known natural language;

  • counting occurrences of unique features in the text documents to generate expected feature counts; and

  • using a probability distribution and the expected feature counts to generate probability values as a function of actual feature occurrence.
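
The three steps of claim 1 can be illustrated with a minimal sketch. It rests on assumptions that the claim itself does not state: the features are single letters, the probability distribution is binomial, and every function name is hypothetical. Under those assumptions, the expected count of a feature in a sample of N features is its training rate multiplied by N, and the table gives the probability of each possible actual count from 0 to N.

```python
from collections import Counter
from math import comb


def count_features(documents):
    """Count letter features across documents written in one known language.

    Assumption: a "feature" is a single lowercase letter; the claim does
    not fix the feature type.
    """
    counts = Counter()
    for doc in documents:
        counts.update(ch for ch in doc.lower() if ch.isalpha())
    return counts


def expected_rates(counts):
    """Turn raw counts into per-feature rates (expected count per feature seen)."""
    total = sum(counts.values())
    return {feature: n / total for feature, n in counts.items()}


def binomial_probability(rate, sample_size, actual):
    """Probability of seeing a feature exactly `actual` times in `sample_size` draws.

    Assumption: a binomial distribution is used as the probability distribution.
    """
    return comb(sample_size, actual) * rate**actual * (1 - rate) ** (sample_size - actual)


def probability_table(rates, sample_size):
    """For each feature, list the probability of every actual count 0..sample_size."""
    return {
        feature: [binomial_probability(rate, sample_size, k) for k in range(sample_size + 1)]
        for feature, rate in rates.items()
    }
```

A table built this way could be computed once per known language and then looked up by actual feature count when scoring unseen text, which avoids recomputing the distribution for every document.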
