Apparatus, method, and computer program for analyzing document layout
First Claim
1. A computer-readable medium storing a program for analyzing layout of text on a document image to extract text blocks for character recognition purposes, the program causing a computer to function as:
- an extraction condition memory storing a plurality of extraction conditions for use in extracting text blocks from a given document image;
a text block extractor to extract a first set of non-overlapping text blocks from the given document image in accordance with one of the extraction conditions stored in said extraction condition memory, the text block extractor to also extract a second set of non-overlapping text blocks from the same document image in a different way from the first set, in accordance with another of the extraction conditions; and
a text block consolidator to produce a consolidated set of text blocks by performing character recognition on each text block extracted by said text block extractor, evaluating validity of each text block based on a result of the character recognition, creating a consolidation source set by finding a text block of the first set which overlaps with a text block of the second set, adding both of those text blocks to the consolidation source set, and repeating operations of finding a text block of the first and second sets which overlaps with any of the text blocks belonging to the consolidated set and adding the found text block to the consolidation source set, and selecting a most valid combination of non-overlapping text blocks from among the text blocks belonging to the consolidation source set, based on the validity of each text block that has been evaluated.
1 Assignment
0 Petitions
Accused Products
Abstract
A document layout analysis program capable of extracting an appropriate set of text blocks from a given document image even in the case where the document layout is so complicated that conventional extraction methods with a single extraction condition would not work well. A plurality of different extraction conditions are stored in an extraction condition memory for use in extracting text blocks from a given document image. In accordance with those extraction conditions, a text block extractor extracts a plurality of sets of text blocks from the document image. A text block consolidator produces a consolidated set of text blocks by performing character recognition on each extracted text block, evaluating validity of each text block based on a result of the character recognition, and selecting most valid text blocks from among the plurality of sets of text blocks.
-
Citations
17 Claims
-
1. A computer-readable medium storing a program for analyzing layout of text on a document image to extract text blocks for character recognition purposes, the program causing a computer to function as:
-
an extraction condition memory storing a plurality of extraction conditions for use in extracting text blocks from a given document image; a text block extractor to extract a first set of non-overlapping text blocks from the given document image in accordance with one of the extraction conditions stored in said extraction condition memory, the text block extractor to also extract a second set of non-overlapping text blocks from the same document image in a different way from the first set, in accordance with another of the extraction conditions; and a text block consolidator to produce a consolidated set of text blocks by performing character recognition on each text block extracted by said text block extractor, evaluating validity of each text block based on a result of the character recognition, creating a consolidation source set by finding a text block of the first set which overlaps with a text block of the second set, adding both of those text blocks to the consolidation source set, and repeating operations of finding a text block of the first and second sets which overlaps with any of the text blocks belonging to the consolidated set and adding the found text block to the consolidation source set, and selecting a most valid combination of non-overlapping text blocks from among the text blocks belonging to the consolidation source set, based on the validity of each text block that has been evaluated. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A document layout analyzing apparatus for analyzing layout of text on a document image to extract text blocks for character recognition purposes, the apparatus comprising:
-
an extraction condition memory to store a plurality of extraction conditions for use in extracting text blocks from a given document image; a text block extractor to extract a first set of non-overlapping text blocks from the given document image in accordance with one of the extraction conditions stored in said extraction condition memory, as well as extracting a second set of non-overlapping text blocks from the same document image in a different way from the first set, in accordance with another of the extraction conditions; and a text block consolidator to produce a consolidated set of text blocks by performing character recognition on each text block extracted by said text block extractor, evaluating validity of each text block based on a result of the character recognition, creating a consolidation source set by finding a text block of the first set which overlaps with a text block of the second set, adding both of those text blocks to the consolidation source set, and repeating operations of finding a text block of the first and second sets which overlaps with any of the text blocks belonging to the consolidated set and adding the found text block to the consolidation source set, and selecting a most valid combination of non-overlapping text blocks from among the text blocks belonging to the consolidation source set, based on the validity of each text block that has been evaluated. - View Dependent Claims (9, 10, 11, 12)
-
-
13. A document layout analyzing method for analyzing layout of text on a document image to extract text blocks for character recognition purposes, comprising:
-
storing a plurality of extraction conditions; extracting a first set of non-overlapping text blocks from the document image in accordance with one of the stored extraction conditions; extracting a second set of non-overlapping text blocks from the same document image in a different way from the first set, in accordance with another of the extraction conditions; performing character recognition on each extracted text block of the first and second sets; evaluating validity of each text block of the first and second sets, based on a result of the character recognition; creating a consolidation source set by finding a text block of the first set which overlaps with a text block of the second set, adding both of those text blocks to the consolidation source set, and repeating operations of finding a text block of the first and second sets which overlaps with any of the text blocks belonging to the consolidated set and adding the found text block to the consolidation source set; and producing a consolidated set of text blocks by selecting a most valid combination of non-overlapping text blocks from among the text blocks belonging to the consolidation source set, based on the validity of each text block that has been evaluated. - View Dependent Claims (14, 15, 16, 17)
-
Specification