Method and apparatus for producing a hybrid data structure for displaying a raster image
First Claim
1. A system for producing a raster image derived from a hybrid data structure including coded and non-coded portions from an input bitmap, the system comprising:
- (a) a data processing apparatus;
(b) means for performing recognition on an input bitmap that has been entered into said data processing apparatus to detect identifiable objects within said input bitmap, said means for performing recognition including;
(i) means for comparing each of said identifiable objects with a portion of said input bitmap corresponding to said identifiable object; and
(ii) means for adjusting the size of said identifiable object if said identifiable object is within a threshold size of said corresponding input bitmap portion;
(c) means for creating a hybrid data structure including coded portions corresponding to said identifiable objects and non-coded portions derived from portions of said input bitmap which do not correspond to said identifiable objects; and
(d) an output device for developing a visually perceptible raster image from said hybrid data structure that includes coded images of said identifiable objects and non-coded images of said non-identifiable portions of said input bitmap.
1 Assignment
0 Petitions
Accused Products
Abstract
A system for producing a raster image derived from coded and non-coded portions of a hybrid data structure from an input bitmap including (1) a data processing apparatus, (2) a recognizer which performs recognition on an input bitmap to the data processing apparatus to detect identifiable objects within the input bitmap, (3) a mechanism for producing a hybrid data structure including coded data corresponding to the identifiable objects and non-coded data derived from portions of the input bitmap which do not correspond to the identifiable objects, and (4) an output device capable of developing a visually perceptible raster image derived from the hybrid data structure. The raster image includes raster images of the identifiable objects and raster images derived from portions of the input bitmap that do not correspond to the identifiable objects. The invention includes a method for producing a hybrid data structure for a bitmap of an image having the steps of: (1) inputting a signal comprising a bitmap into a digital processing apparatus, (2) partitioning the bitmap into a hierarchy of lexical units, (3) assigning labels to a label list for each lexical unit of a predetermined hierarchical level, where labels in the label list have an associated confidence level, and (4) storing each lexical unit in a hybrid data structure as either an identifiable object or a non-identifiable object.
128 Citations
5 Claims
-
1. A system for producing a raster image derived from a hybrid data structure including coded and non-coded portions from an input bitmap, the system comprising:
-
(a) a data processing apparatus; (b) means for performing recognition on an input bitmap that has been entered into said data processing apparatus to detect identifiable objects within said input bitmap, said means for performing recognition including; (i) means for comparing each of said identifiable objects with a portion of said input bitmap corresponding to said identifiable object; and (ii) means for adjusting the size of said identifiable object if said identifiable object is within a threshold size of said corresponding input bitmap portion; (c) means for creating a hybrid data structure including coded portions corresponding to said identifiable objects and non-coded portions derived from portions of said input bitmap which do not correspond to said identifiable objects; and (d) an output device for developing a visually perceptible raster image from said hybrid data structure that includes coded images of said identifiable objects and non-coded images of said non-identifiable portions of said input bitmap. - View Dependent Claims (2)
-
-
3. A method for producing a hybrid data structure from a bitmap of an image including identifiable objects and non-identifiable objects comprising the steps of:
-
inputting a signal comprising a bitmap of an image into a digital processing apparatus; determining on said digital processing apparatus a geometric correction of said bitmap, said determining a geometric correction step including creating a distortion map of said bitmap and creating a layout correction transform from said distortion map and said bitmap; partitioning on said digital processing apparatus said bitmap into lexical units; assigning on said digital processing apparatus at least one label and an associated confidence level to each lexical unit; and storing on said digital processing apparatus each lexical unit in a hybrid data structure as an identifiable object if a label for said lexical unit has a confidence level greater than a threshold confidence level, and as a non-identifiable object if no label for said lexical unit has a confidence level greater than said threshold confidence level.
-
-
4. A method for producing a hybrid image comprising:
-
inputting a bitmap representing an image into a digital processing apparatus; segmenting said bitmap into lexical units; assigning a typeface to each of said lexical units of said bitmap; recognizing on said digital processing apparatus identifiable objects in said bitmap; performing a size adjustment to said identifiable objects; and creating with an output device coupled to said digital processing apparatus a visually perceptible image comprising rendered images of said size-adjusted identifiable objects and bitmap images of objects that were not recognized.
-
-
5. A method for producing a hybrid data structure from a bitmap of an image including identifiable objects and non-identifiable objects comprising the steps of:
-
inputting a signal comprising a bitmap of an image into a digital processing apparatus; partitioning on said digital processing apparatus said bitmap into lexical units, wherein said lexical units are organized into levels of a hierarchy which include a blob level, a character level, a word level, a text line level, a text block level, a page level, and a document level; assigning on said digital processing apparatus at least one label and an associated confidence level to each lexical unit; and storing on said digital processing apparatus each lexical unit in a hybrid data structure as an identifiable object if a label for said lexical unit has a confidence level greater than a threshold confidence level, and as a non-identifiable object if no label for said lexical unit has a confidence level greater than said threshold confidence level.
-
Specification