Document image segmentation system
First Claim
Patent Images
1. A system for document image segmentation, said system comprising:
- input means adapted to input a document image;
image pre-processing means adapted to pre-process said document image by maintaining the aspect ratio, said pre-processing means including a colour quantization means to give a pre-processed quantized image;
colour space transformation means adapted to receive said pre-processed quantized image and apply a Hue, Saturation and Value colour space transformation on said quantized image to derive a transformed image containing only saturation component of said quantized image;
first image energy calculation means adapted to receive said transformed image and calculate both horizontal and vertical energies of said transformed image to provide a first energy image by cumulating both of said calculated energies of said transformed image;
grayscale image conversion means adapted to receive said pre-processed quantized image and perform a grayscale conversion operation on said quantized image to provide a gray scale image;
second image energy calculation means adapted to receive said gray scale image and calculate both horizontal and vertical energies of said gray scale image to provide a second energy image by cumulating both of said calculated energies and said gray scale image;
computational means adapted to receive said first energy image and second energy image to compute a maximum of both the energies and provide a maximum energy image;
binarization means adapted to receive said maximum energy image and provide a binarized image;
dilation means adapted to receive said binarized image and perform a dilation operation to provide a dilated image;
clustering means adapted to receive said dilated image and formulate different clusters based on the density of the dilated areas and provide a clustered image; and
box creation means adapted to create bounding boxes enclosing each cluster in the clustered image to form an image of the document having image segments.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and a method for document image segmentation have been disclosed. Image segments are obtained by forming different clusters in a document image. The document image may include images of company logos, product marks or trademarks. The invention can perform image segmentation on any kind of complex colored image and can recognize logos, product-marks or trademarks which comprise text or graphics, wherein the text can be either of uniform font style or uneven font style such as fancy font styles, calligraphic styles or having different orientation.
-
Citations
10 Claims
-
1. A system for document image segmentation, said system comprising:
-
input means adapted to input a document image; image pre-processing means adapted to pre-process said document image by maintaining the aspect ratio, said pre-processing means including a colour quantization means to give a pre-processed quantized image; colour space transformation means adapted to receive said pre-processed quantized image and apply a Hue, Saturation and Value colour space transformation on said quantized image to derive a transformed image containing only saturation component of said quantized image; first image energy calculation means adapted to receive said transformed image and calculate both horizontal and vertical energies of said transformed image to provide a first energy image by cumulating both of said calculated energies of said transformed image; grayscale image conversion means adapted to receive said pre-processed quantized image and perform a grayscale conversion operation on said quantized image to provide a gray scale image; second image energy calculation means adapted to receive said gray scale image and calculate both horizontal and vertical energies of said gray scale image to provide a second energy image by cumulating both of said calculated energies and said gray scale image; computational means adapted to receive said first energy image and second energy image to compute a maximum of both the energies and provide a maximum energy image; binarization means adapted to receive said maximum energy image and provide a binarized image; dilation means adapted to receive said binarized image and perform a dilation operation to provide a dilated image; clustering means adapted to receive said dilated image and formulate different clusters based on the density of the dilated areas and provide a clustered image; and box creation means adapted to create bounding boxes enclosing each cluster in the clustered image to form an image of the document having image segments. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A computer-implemented method for document image segmentation, said method comprising:
-
inputting a document image; pre-processing input image by maintaining the aspect ratio and performing colour quantization to give a quantized image; applying Hue, Saturation and Value colour space transformation on said quantized image; deriving a transformed image containing only saturation component of the quantized image; calculating both horizontal and vertical energies of said transformed image and cumulating both of the calculated energies; providing a first energy image; converting a quantized image into a grayscale image; calculating both horizontal and vertical energies of said gray scale image and cumulating both of the calculated energies; providing a second energy image; computing a maximum of both the energy images; providing a maximum energy image; binarizing said maximum energy image; dilating said binarized image; formulating different clusters based on the density of the dilated areas; creating bounding boxes enclosing each cluster in the clustered image; and forming an image of the document having image segments. - View Dependent Claims (8, 9, 10)
-
Specification