IMAGE PROCESSING APPARATUS, IMAGE PROCESSING METHOD, AND COMPUTER PROGRAM
0 Assignments
0 Petitions
Accused Products
Abstract
This invention generates a digital document by applying character recognition to character images in a document image, and rendering the character recognition result on the document image in a transparent color. This digital document allows to specify a part corresponding to a search keyword on the document image upon conducting a search. When this digital document is generated, it includes a description required to use glyph data (font data) of a simple character shape commonly to a plurality of character types as font data used upon rendering the character recognition result. Therefore, even when the digital document needs to save font data, an increase in file size can be minimized. Also, by rendering using a simple character shape, the data size of the font data itself can be reduced.
-
Citations
35 Claims
-
1-19. -19. (canceled)
-
20. An image processing apparatus comprising:
-
a character recognition unit configured to execute character recognition processing for a plurality of character images in a document image to obtain character codes corresponding to the respective character images; and a generation unit configured to generate a digital document, wherein the digital document includes the document image, a plurality of character codes obtained by said character recognition unit, glyph data, and a description required to render the glyph data corresponding to the plurality of character codes with a size smaller than sizes of the character images at positions corresponding to the character images in the document image, wherein the glyph data is used commonly to the plurality of character codes when rendering characters corresponding to the plurality of character codes. - View Dependent Claims (21, 22, 23, 24, 25, 26, 27, 28, 29, 30)
-
-
31. An image processing apparatus comprising:
-
a character recognition unit configured to execute character recognition processing for a plurality of character images in a document image to obtain character codes corresponding to the respective character images; and a generation unit configured to generate a digital document, wherein the digital document includes the document image, a plurality of character codes obtained by said character recognition unit, glyph data of an identical shape to be used upon rendering characters corresponding to the plurality of character codes, and a description required to render the glyph data corresponding to the plurality of character codes with a size smaller than sizes of the character images at positions corresponding to the character images in the document image.
-
-
32. An image processing method comprising:
-
controlling a character recognition unit to execute character recognition processing for a plurality of character images in a document image to obtain character codes corresponding to the respective character images; and controlling a generation unit to generate a digital document, wherein the digital document includes the document image, a plurality of character codes obtained in the step of controlling a character recognition unit, glyph data, and a description required to render the glyph data corresponding to the plurality of character codes with a size smaller than sizes of the character images at positions corresponding to the character images in the document image, wherein the glyph data is used commonly to the plurality of character codes when rendering characters corresponding to the plurality of character codes.
-
-
33. An image processing method comprising:
-
controlling a character recognition unit to execute character recognition processing for a plurality of character images in a document image to obtain character codes corresponding to the respective character images; and controlling a generation unit to generate a digital document, wherein the digital document includes the document image, a plurality of character codes obtained in the step of controlling a character recognition unit, glyph data of an identical shape to be used upon rendering characters corresponding to the plurality of character codes, and a description required to render the glyph data corresponding to the plurality of character codes with a size smaller than sizes of the character images at positions corresponding to the character images in the document image.
-
-
34. A non-transitory computer-readable storage medium retrievably storing a computer program for making a computer execute the steps of:
-
executing character recognition processing for a plurality of character images in a document image to obtain character codes corresponding to the respective character images; and generating a digital document, wherein the digital document includes the document image, a plurality of character codes obtained in the step of executing character recognition processing, glyph data, and a description required to render the glyph data corresponding to the plurality of character codes with a size smaller than sizes of the character images at positions corresponding to the character images in the document image, wherein the glyph data is used commonly to the plurality of character codes when rendering characters corresponding to the plurality of character codes.
-
-
35. A non-transitory computer-readable storage medium retrievably storing a computer program for making a computer execute the steps of:
-
executing character recognition processing for a plurality of character images in a document image to obtain character codes corresponding to the respective character images; and generating a digital document, wherein the digital document includes the document image, a plurality of character codes obtained in the step of executing character recognition processing, and glyph data of an identical shape to be used upon rendering characters corresponding to the plurality of character codes, and a description required to render the glyph data corresponding to the plurality of character codes with a size smaller than sizes of the character images at positions corresponding to the character images in the document image.
-
Specification