×

Document segmentation system

  • US 5,956,468 A
  • Filed: 01/10/1997
  • Issued: 09/21/1999
  • Est. Priority Date: 07/12/1996
  • Status: Expired due to Term
First Claim
Patent Images

1. A method of processing a mixed document defining a plurality of pixels, the method comprising the steps of:

  • (a) detecting a first set of the pixels corresponding to image pixels;

    (b) detecting a second set of the pixels corresponding to large text pixels;

    (c) computing, in a first color space, a first value corresponding to a white point of a media on which the document is printed;

    (d) computing, in the first color space, a second value corresponding to a black point of the media;

    (e) generating, via the first value, a table of values that are compensated for the white point of the media;

    (f) labeling, via the table of values, each of the pixels in the document with one of;

    (1) a color label;

    (2) a black label; and

    (3) a white label; and

    (g) applying a plurality of syntactic rules to pixel sequences having predetermined labels and predetermined lengths.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×