Glyph selection and electronic document generation
First Claim
1. An image processing apparatus comprising:
- a character recognition unit configured to perform character recognition on a plurality of character images in a document image to acquire a character code corresponding to each character image;
a selection unit configured to select a glyph to be used for each character image from a plurality of glyphs, based on both of a character width and character row height of each character image in the document image; and
a generation unit configured to generate an electronic document, wherein the electronic document includes the document image, a plurality of character codes acquired by the character recognition unit, the plurality of glyphs, and description data which indicates the glyphs to be used for the character codes,wherein each of the plurality of glyphs is shared and used by different character codes based on the description data when rendering characters that correspond to the plurality of character codes acquired by the character recognition unit, andwherein the description data is described based on the glyph selected by the selection unit for each character image in the document image.
1 Assignment
0 Petitions
Accused Products
Abstract
An image processing apparatus includes a character recognition unit configured to perform character recognition on a plurality of character images in a document image to acquire a character code corresponding to each character image, and a generation unit configured to generate an electronic document, wherein the electronic document includes the document image, a plurality of character codes acquired by the character recognition unit, a plurality of glyphs, and data which indicates the glyphs to be used to render each of the character codes, wherein each of the plurality of glyphs is shared and used by different character codes based on the data when rendering characters that correspond to the plurality of character codes acquired by the recognition unit.
31 Citations
29 Claims
-
1. An image processing apparatus comprising:
-
a character recognition unit configured to perform character recognition on a plurality of character images in a document image to acquire a character code corresponding to each character image; a selection unit configured to select a glyph to be used for each character image from a plurality of glyphs, based on both of a character width and character row height of each character image in the document image; and a generation unit configured to generate an electronic document, wherein the electronic document includes the document image, a plurality of character codes acquired by the character recognition unit, the plurality of glyphs, and description data which indicates the glyphs to be used for the character codes, wherein each of the plurality of glyphs is shared and used by different character codes based on the description data when rendering characters that correspond to the plurality of character codes acquired by the character recognition unit, and wherein the description data is described based on the glyph selected by the selection unit for each character image in the document image. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. An image processing apparatus comprising:
-
a character recognition unit configured to perform character recognition on a plurality of character images in a document image to acquire a character code corresponding to each character image; a selection unit configured to select a glyph to be used for each character image from a plurality of glyphs, based on a character spacing of each character image in the document image; and a generation unit configured to generate an electronic document, wherein the electronic document includes the document image, a plurality of character codes acquired by the character recognition unit, the plurality of glyphs, and description data which indicates the glyphs to be used for the character codes, wherein each of the plurality of glyphs is shared and used by different character codes based on the description data when rendering characters that correspond to the plurality of character codes acquired by the character recognition unit, and wherein the description data is described based on the glyph selected by the selection unit for each character image in the document image. - View Dependent Claims (15)
-
-
16. An image processing apparatus comprising:
-
a character recognition unit configured to perform character recognition on a plurality of character images in a document image to acquire a character code corresponding to each character image; a generation unit configured to generate an electronic document, wherein the electronic document includes the document image, a plurality of character codes acquired by the character recognition unit, a plurality of glyphs, and description data which indicates the glyphs to be used for the character codes, wherein each of the plurality of glyphs is shared and used by different character codes based on the description data when rendering characters that correspond to the plurality of character codes acquired by the character recognition unit, and wherein the glyphs are rendered in the lower part of a character row.
-
-
17. A method comprising:
-
acquiring a character code corresponding to each character image by performing a character recognition on a plurality of character images in a document image; selecting a glyph to be used for each character image from a plurality of glyphs, based on both of a character width and character row height of each character image in the document image; and generating an electronic document, wherein the electronic document includes the document image, a plurality of character codes acquired by performing the character recognition, the plurality of glyphs, and description data which indicates the glyphs to be used for the character codes, wherein each of the plurality of glyphs is shared and used by different character codes based on the description data when rendering characters that correspond to the plurality of character codes acquired by performing the character recognition, and wherein the description data is described based on the glyph selected by the selection unit for each character image in the document image. - View Dependent Claims (18, 19, 20, 21, 22, 23)
-
-
24. A method comprising:
-
acquiring a character code corresponding to each character image by performing a character recognition on a plurality of character images in a document image; selecting a glyph to be used for each character image from a plurality of glyphs, based on a character spacing of each character image in the document image; and generating an electronic document, wherein the electronic document includes the document image, a plurality of character codes acquired by performing the character recognition, the plurality of glyphs, and description data which indicates the glyphs to be used for the character codes, wherein each of the plurality of glyphs is shared and used by different character codes based on the description data when rendering characters that correspond to the plurality of character codes acquired by performing the character recognition, and wherein the description data is described based on the glyph selected by the selection unit for each character image in the document image. - View Dependent Claims (25)
-
-
26. A method comprising
acquiring a character code corresponding to each character image by performing a character recognition on a plurality of character images in a document image; - and
generating an electronic document, wherein the electronic document includes the document image, a plurality of character codes acquired by performing the character recognition, a plurality of glyphs, and description data which indicates the glyphs to be used for the character codes, wherein each of the plurality of glyphs is shared and used by different character codes based on the description data when rendering characters that correspond to the plurality of character codes acquired by performing the character recognition, and wherein the glyphs are rendered in the lower part of a character row.
- and
-
27. A computer program stored in a non-transitory computer-readable storage medium, the computer program causing a computer to execute steps of:
-
acquiring a character code corresponding to each character image by performing a character recognition on a plurality of character images in a document image; selecting a glyph to be used for each character image from a plurality of glyphs, based on both of a character width and character row height of each character image in the document image; and generating an electronic document, wherein the electronic document includes the document image, a plurality of character codes acquired by performing the character recognition, the plurality of glyphs, and description data which indicates the glyphs to be used for the character codes, wherein each of the plurality of glyphs is shared and used by different character codes based on the description data when rendering characters that correspond to the plurality of character codes acquired by performing the character recognition, and wherein the description data is described based on the glyph selected by the selection unit for each character image in the document image.
-
-
28. A computer program stored in a non-transitory computer-readable storage medium, the computer program causing a computer to execute steps of:
-
acquiring a character code corresponding to each character image by performing a character recognition on a plurality of character images in a document image; selecting a glyph to be used for each character image from a plurality of glyphs, based on a character spacing of each character image in the document image; and generating an electronic document, wherein the electronic document includes the document image, a plurality of character codes acquired by performing the character recognition, the plurality of glyphs, and description data which indicates the glyphs to be used for the character codes, wherein each of the plurality of glyphs is shared and used by different character codes based on the description data when rendering characters that correspond to the plurality of character codes acquired by performing the character recognition, and wherein the description data is described based on the glyph selected by the selection unit for each character image in the document image.
-
-
29. A computer program stored in a non-transitory computer-readable storage medium, the computer program causing a computer to execute steps of:
-
acquiring a character code corresponding to each character image by performing a character recognition on a plurality of character images in a document image; and generating an electronic document, wherein the electronic document includes the document image, a plurality of character codes acquired by performing the character recognition, a plurality of glyphs, and description data which indicates the glyphs to be used for the character codes, wherein each of the plurality of glyphs is shared and used by different character codes based on the description data when rendering characters that correspond to the plurality of character codes acquired by performing the character recognition, and wherein the glyphs are rendered in the lower part of a character row.
-
Specification