×

SYSTEM AND METHODS FOR ARABIC TEXT RECOGNITION BASED ON EFFECTIVE ARABIC TEXT FEATURE EXTRACTION

  • US 20100272361A1
  • Filed: 04/27/2009
  • Published: 10/28/2010
  • Est. Priority Date: 04/27/2009
  • Status: Active Grant
First Claim
Patent Images

1. A method for automatically recognizing Arabic text, comprising:

  • acquiring a text image containing a line of Arabic characters;

    digitizing the line of the Arabic characters to form a two-dimensional array of pixels each associated with a pixel value, wherein the pixel value is expressed in a binary number;

    dividing the line of the Arabic characters into a plurality of line images;

    defining a plurality of cells in one of the plurality of line images, wherein each of the plurality of cells comprises a group of adjacent pixels;

    serializing pixel values of pixels in each of the plurality of cells in one of the plurality of line images to form a binary cell number;

    forming a text feature vector according to binary cell numbers obtained from the plurality of cells in one of the plurality of line images; and

    feeding the text feature vector into a Hidden Markov Model to recognize the line of Arabic characters.

View all claims
  • 0 Assignments
Timeline View
Assignment View
    ×
    ×