Method of automatic language identification for multi-lingual text recognition

  • US 20040006467A1
  • Filed: 11/29/2002
  • Published: 01/08/2004
  • Est. Priority Date: 10/18/2002
  • Status: Abandoned Application
First Claim

1. A method for automatically determining one or more languages associated with text in a bit-mapped image, comprising the steps of:

  • segmenting the image into a plurality of word-token images;
  • recognizing separate characters in said word-token images;
  • joining the separate characters into groups, each presumably comprising a word;
  • forming at least one hypothesis about the correspondence of a character group, presumably comprising a word, to a certain language; and
  • accepting the hypothesis about the correspondence of the character group, presumably comprising a word, to a certain language;

    wherein said step of forming a hypothesis about the correspondence of the character group to a certain language further comprises at least the following steps:

      • defining a set of selected language models; and
      • estimating the correspondence of the word with lingual and non-lingual models.
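
The hypothesis-forming and hypothesis-accepting steps of the claim can be illustrated with a minimal Python sketch. It is not the applicant's implementation: the claim does not say how a language model is built or scored. The `LanguageModel` class, its alphabet-plus-lexicon scoring, the number pattern used as a non-lingual model, and the acceptance `threshold` are all assumptions made for illustration. Segmentation and character recognition are assumed to have already produced the character groups as strings.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class LanguageModel:
    """Hypothetical lingual model: an alphabet plus a tiny lexicon."""
    name: str
    alphabet: frozenset
    lexicon: frozenset

    def score(self, word: str) -> float:
        letters = word.lower()
        # A character outside the alphabet rules this language out.
        if not letters or any(ch not in self.alphabet for ch in letters):
            return 0.0
        # A dictionary hit scores higher than a mere alphabet match.
        return 1.0 if letters in self.lexicon else 0.5


# Hypothetical non-lingual model: numbers such as "2002" or "3.14".
NUMBER_PATTERN = re.compile(r"\d+([.,]\d+)*")


def non_lingual_score(word: str) -> float:
    return 1.0 if NUMBER_PATTERN.fullmatch(word) else 0.0


def identify_language(word, models, threshold=0.4):
    """Form one hypothesis per model and accept the best-scoring one.

    Returns the accepted model name, or None if no hypothesis
    reaches the (assumed) acceptance threshold.
    """
    hypotheses = [(m.score(word), m.name) for m in models]
    hypotheses.append((non_lingual_score(word), "non-lingual"))
    best_score, best_name = max(hypotheses, key=lambda h: h[0])
    return best_name if best_score >= threshold else None


# The "selected language models set" for this illustration.
ENGLISH = LanguageModel(
    "english",
    frozenset("abcdefghijklmnopqrstuvwxyz"),
    frozenset({"the", "word", "image"}),
)
RUSSIAN = LanguageModel(
    "russian",
    frozenset("абвгдеёжзийклмнопрстуфхцчшщъыьэюя"),
    frozenset({"слово", "текст"}),
)
MODELS = [ENGLISH, RUSSIAN]

for token in ["word", "слово", "2002", "@#"]:
    print(token, "->", identify_language(token, MODELS))
```

Scoring each character group against every selected model, plus a non-lingual model, lets tokens such as numbers be labeled without forcing them into a language. A practical system would presumably replace the toy lexicons with full dictionaries or statistical models.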
