System of using neural network to distinguish text and picture in images and method thereof
First Claim
1. A system of using a neural network to distinguish text and pictures in an image, a set of training data being used to train the neural network in advance to generate text recognition knowledge, the system comprising:
- an image block division module, which extracts gray-level image data of the image and divides the gray-level image data into a plurality of image blocks, each of which contains a plurality of block columns each of which is made of a plurality of continuous pixels;
a neural network module, which uses the text recognition knowledge to process the continuous pixels of the block column, generating a text faith value for each of the pixels and obtaining a greatest text faith value; and
a text determination module, which compares a text threshold with the greatest text faith value to determine the status of the image block, wherein the training data include photo-to-text data, white-to-text data, text-to-photo/white data, text-to-text data, no text data, data of text with more than one edge, and data of text with halftoning noise.
2 Assignments
0 Petitions
Accused Products
Abstract
This specification discloses a system of using a neural network to distinguish text and pictures in an image and the method thereof. Using the knowledge of text recognition learned by the neural network in advance, images data of color brightness and gray levels in an image block are processed to generate a greatest text faith value. The system determines the text status of the image block by comparing a text threshold with the greatest text faith value. If the greatest text faith value is larger than the text threshold, then the image block is determined to contain text pixels; otherwise, the image block contains purely picture pixels. This achieves the goal of separating text and pictures in an image.
15 Citations
6 Claims
-
1. A system of using a neural network to distinguish text and pictures in an image, a set of training data being used to train the neural network in advance to generate text recognition knowledge, the system comprising:
-
an image block division module, which extracts gray-level image data of the image and divides the gray-level image data into a plurality of image blocks, each of which contains a plurality of block columns each of which is made of a plurality of continuous pixels; a neural network module, which uses the text recognition knowledge to process the continuous pixels of the block column, generating a text faith value for each of the pixels and obtaining a greatest text faith value; and a text determination module, which compares a text threshold with the greatest text faith value to determine the status of the image block, wherein the training data include photo-to-text data, white-to-text data, text-to-photo/white data, text-to-text data, no text data, data of text with more than one edge, and data of text with halftoning noise. - View Dependent Claims (2, 3)
-
-
4. A method of using a neural network to distinguish text and pictures in an image, a set of training data being used to train the neural network in advance to generate text recognition knowledge, the system comprising:
-
extracting gray-level image data of an image and dividing the gray-level image data into a plurality of image blocks, each of which contains a plurality of block columns each of which is made of a plurality of continuous pixels; inputting the gray levels of the continuous pixels of the block column, using the text recognition knowledge to process the continuous pixels to generate a text faith value of each of the block column and to obtain a greatest text faith value; and determining the text status of the image block according to the result of comparing a text threshold and the greatest text faith value, wherein the training data include photo-to-text data, white-to-text data, text-to-photo/white data, text-to-text data, no text data, data of text with more than one edge, and data of text with halftoning noise. - View Dependent Claims (5, 6)
-
Specification