×

Method and apparatus for character recognition using stop words

  • US 6,252,988 B1
  • Filed: 07/09/1998
  • Issued: 06/26/2001
  • Est. Priority Date: 07/09/1998
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method for training an image classifier, the method comprising:

  • identifying a plurality of stop words, each stop word being from a same language and having an associated definition in such language, the plurality of stop words being identified as a function of a linguistic model and the plurality of stop words having an expected recognition coverage level associated therewith, wherein the plurality of stop words is limited to the following stop words;

    a, about, after, all, also, an, and, any, are, as, at, back, be, because, been, before, being, between, both, but, by, can, could, day, did, do, down, each, even, first, for, from, get, good, had, has, have, he, her, here, him, his, how, I, if, in, into, is, it, its, just, know, life, like, little, long, made, make, man, many, may, me, men, more, most, Mr., much, must, my, never, new, no, not, now, of, old, on, one, only, or, other, our, out, over, own, people, said, same, see, she, should, so, some, state, still, such, than, that, the, their, them, then, there, these, they, this, those, three, through, time, to, too, two, under, up, very, was, way, we, well, were, what, when, where, which, who, will, with, work, world, would, year, years, you, and your;

    comparing the plurality of stop words to a plurality of individual words in an input image, each stop word and each individual word being treated as a separate symbol during the comparing;

    identifying matches between particular ones of the stop words and particular ones of the individual words of the input image, wherein each particular stop word matches a same particular individual word throughout the input image, to form a plurality of recognized words;

    segmenting the plurality of recognized words to form a plurality of character prototypes; and

    training the image classifier using the plurality of character prototypes to recognize at least one character from the input image.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×