Identification of a blank page in an image processing system
First Claim
1. A method for eliminating streaks from consideration in the analysis of whether an electronic representation of a page generated from a scanning system is blank, wherein the method executes on a computer system, wherein the computer system includes a processor coupled to a memory, wherein the memory includes the electronic representation of the page, the method comprising the steps of:
- predetermining a streak width value and storing the streak width value in the computer system;
- using the processor to determine the number and connectivity of pixels in the electronic representation and to designate pixels meeting a predetermined connectivity criterion as connected components;
- using the processor to eliminate the number of connected components smaller in both dimensions than predetermined dimension values;
- if any of the remaining connected components have a width dimension larger than the predetermined streak width value then using the processor to indicate that the page is not blank else if the number of remaining connected components is not larger than a predetermined allowed number then using the processor to indicate that the page is blank.
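The steps of the claim can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names (`connected_components`, `is_blank`), the parameter names, and the breadth-first labeling are assumptions made for illustration. The claim does not say what happens when more components remain than the allowed number; the sketch assumes the page is then reported as not blank.

```python
from collections import deque

def connected_components(image, connectivity=8):
    """Return (width, height) bounding boxes of ON-pixel components.

    `image` is a list of rows of 0/1 values (hypothetical representation).
    """
    rows = len(image)
    cols = len(image[0]) if rows else 0
    if connectivity == 8:
        offsets = [(dy, dx) for dy in (-1, 0, 1) for dx in (-1, 0, 1)
                   if (dy, dx) != (0, 0)]
    else:  # 4-connected
        offsets = [(-1, 0), (1, 0), (0, -1), (0, 1)]
    seen = [[False] * cols for _ in range(rows)]
    boxes = []
    for r in range(rows):
        for c in range(cols):
            if not image[r][c] or seen[r][c]:
                continue
            # Breadth-first flood fill from an unvisited ON pixel.
            seen[r][c] = True
            queue = deque([(r, c)])
            top = bottom = r
            left = right = c
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for dy, dx in offsets:
                    ny, nx = y + dy, x + dx
                    if (0 <= ny < rows and 0 <= nx < cols
                            and image[ny][nx] and not seen[ny][nx]):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            boxes.append((right - left + 1, bottom - top + 1))
    return boxes

def is_blank(image, streak_width, min_width, min_height, allowed_count,
             connectivity=8):
    """Apply the claimed decision rule to a binary image."""
    boxes = connected_components(image, connectivity)
    # Drop components smaller in BOTH dimensions than the size thresholds.
    remaining = [(w, h) for w, h in boxes
                 if not (w < min_width and h < min_height)]
    # Anything wider than a streak counts as real content.
    if any(w > streak_width for w, _ in remaining):
        return False
    # Assumption: exceeding the allowed count means "not blank".
    return len(remaining) <= allowed_count
```

In this sketch a tall, narrow component survives the size filter but stays within the streak width, so it is tolerated as long as the count of such components does not exceed `allowed_count`.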
Abstract
The present invention provides a robust technique for quickly determining whether a binary input image originated as a blank page. The technique provides reliable sensing in the presence of various kinds of image and scanner noise in the input image. In broad terms, the invention contemplates reducing the input image with a low threshold, labeling (by size) connected components (8-connected or 4-connected), and performing a threshold analysis. The threshold analysis typically entails size and numerical thresholds, taking into account the characteristic dimensions of expected types of noise. In specific embodiments, the reduction is performed as a textured reduction wherein the image is divided into tiles, and a single row of pixels in each tile is checked to see whether there are any ON pixels. If there are, the corresponding pixel in the reduced image is ON; otherwise it is OFF. Optional morphological operations are performed to remove expected sources of noise (e.g., pepper noise and thin horizontal lines). The invention further recognizes that a faxed page may contain vertical streaks that are not part of the original paper document. Thus, the threshold analysis typically allows a certain number of such streaks to be present without concluding that the page is not blank. A further understanding of the nature and advantages of the present invention may be realized by reference to the remaining portions of the specification and the drawings.
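The textured reduction described in the abstract can be sketched as follows. This is only an illustration under stated assumptions: the function name `textured_reduce`, the tile-size parameters, and the `probe_row` parameter (which row within each tile is inspected) are hypothetical, since the abstract does not say which row is checked.

```python
def textured_reduce(image, tile_h, tile_w, probe_row=0):
    """Reduce a binary image by sampling one row per tile.

    `image` is a list of rows of 0/1 values. Each tile of size
    tile_h x tile_w maps to one output pixel, which is ON (1) if the
    probed row of that tile contains any ON pixel, otherwise OFF (0).
    """
    rows = len(image)
    cols = len(image[0]) if rows else 0
    reduced = []
    for top in range(0, rows, tile_h):
        # Clamp so a partial tile at the bottom edge is still sampled.
        y = min(top + probe_row, rows - 1)
        reduced.append([
            1 if any(image[y][left:left + tile_w]) else 0
            for left in range(0, cols, tile_w)
        ])
    return reduced
```

Checking a single row per tile reads only a fraction of the input, which is consistent with the abstract's emphasis on speed; the cost is that ON pixels lying only in unprobed rows are not seen.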
29 Citations
4 Claims
Specification