Determining a natural language shift in a computer document

  • US 5,913,185 A
  • Filed: 12/20/1996
  • Issued: 06/15/1999
  • Est. Priority Date: 08/19/1996
  • Status: Expired due to Fees
First Claim

1. A method for detecting language shift points in a computer document written in a plurality of natural languages, comprising the steps of:

  • moving an interval through a text document in a computer memory, the interval containing a plurality of words in the document;

    for each position of the interval, determining a probability that text in the interval is written in each of a plurality of candidate languages according to a respective number of matches of words in the interval with words in each of a plurality of word lists of a few common words selected from each respective candidate language;

    for a first position of the interval, classifying a first candidate language having the highest probability as the current language within the interval;

    finding a language shift point in the document where the probability that a second candidate language is higher than the current language for a new position of the interval; and

    classifying the second candidate language as the current language in the document after the language shift point.
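
The steps of the claim can be illustrated with a minimal sketch. It is not the patented implementation: the word lists, the window size, and the names `WORD_LISTS`, `score_window`, and `find_shift_points` are all hypothetical, and the raw count of word-list matches stands in for the "probability" that the claim describes.

```python
# Hypothetical word lists of a few common words per candidate language.
WORD_LISTS = {
    "english": {"the", "of", "and", "to", "a", "in", "is", "that", "it", "was"},
    "french": {"le", "la", "de", "et", "les", "des", "un", "une", "est", "que"},
}


def score_window(words, word_lists):
    """Count, per language, how many words in the interval match its word list."""
    return {
        lang: sum(1 for w in words if w in common)
        for lang, common in word_lists.items()
    }


def find_shift_points(text, word_lists=WORD_LISTS, window=6):
    """Slide an interval of `window` words through `text`.

    Returns (initial_language, shifts), where `shifts` is a list of
    (word_index, language) pairs. `word_index` is the start of the first
    interval position in which another language scored strictly higher
    than the current one (an assumption about where to place the shift).
    """
    words = [w.strip(".,;:!?\"'()") for w in text.lower().split()]
    if not words:
        return None, []

    initial = None
    current = None
    shifts = []
    last_start = max(0, len(words) - window)
    for start in range(last_start + 1):
        scores = score_window(words[start:start + window], word_lists)
        best = max(scores, key=scores.get)
        if current is None:
            # First position: the highest-scoring language is the current one.
            initial = current = best
        elif best != current and scores[best] > scores[current]:
            # A second language now outscores the current language.
            shifts.append((start, best))
            current = best
    return initial, shifts


if __name__ == "__main__":
    sample = (
        "the cat is in the house and it was a nice day "
        "le chat est dans la maison et les enfants de la ville"
    )
    print(find_shift_points(sample))
```

Requiring a strictly higher score before switching is a design choice of this sketch: it keeps ties from flipping the current language back and forth as the interval moves.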
