Apparatus and method for extracting circumscribed rectangles of characters in transplantable electronic document
First Claim
1. An apparatus for extracting circumscribed rectangles of one or more characters in a transplantable electronic document, comprising:
- a command and resource extraction device configured to extract one or more text-segment-related commands and one or more original font resources corresponding to text segments in one or more pages of the transplantable electronic document;
a division device configured to divide the original font resources into one or more fonts that need to be replaced, and one or more fonts that do not need to be replaced, in which the fonts that need to be replaced serve as fonts prepared to be replaced;
a font replacement device configured to seek fonts most similar to the fonts prepared to be replaced based on the aspect of character shape measurement in an outer replacement font table as candidate fonts for replacing the fonts prepared to be replaced, and then let the candidate fonts and the fonts that do not need to be replaced make up font resources after font replacement;
a measurement information extraction device configured to extract character shape measurement information of the characters in the text segments based on the font resources after font replacement; and
a calculation device configured to calculate the circumscribed rectangles of the characters based on the text-segment-related commands and the character shape measurement information of the characters,wherein the font replacement device utilizes a matching approach to determine similarities between fonts in the outer replacement font table, prepared to be selected and the fonts prepared to be replaced based on the aspect of character shape measurement, and then lets the fonts prepared to be selected having highest similarities serve as the candidate fonts, whereinthe font replacement device determines a similarity between a font prepared to be selected and a font prepared to be replaced based on a distance dis, calculated in equation (1) as follows, between the font prepared to be selected and the font prepared to be replaced;
1 Assignment
0 Petitions
Accused Products
Abstract
Disclosed are an apparatus and a method for extracting circumscribed rectangles of one or more characters in a transplantable electronic document. The apparatus comprises a command and resource extraction device for extracting text-segment-related commands and original font resources; a division device for dividing the original font resources into fonts; a font replacement device for seeking fonts, and obtaining font resources after font replacement; a measurement information extraction device for extracting character shape measurement information of the characters; and a calculation device for calculating the circumscribed rectangles of the characters.
-
Citations
7 Claims
-
1. An apparatus for extracting circumscribed rectangles of one or more characters in a transplantable electronic document, comprising:
-
a command and resource extraction device configured to extract one or more text-segment-related commands and one or more original font resources corresponding to text segments in one or more pages of the transplantable electronic document; a division device configured to divide the original font resources into one or more fonts that need to be replaced, and one or more fonts that do not need to be replaced, in which the fonts that need to be replaced serve as fonts prepared to be replaced; a font replacement device configured to seek fonts most similar to the fonts prepared to be replaced based on the aspect of character shape measurement in an outer replacement font table as candidate fonts for replacing the fonts prepared to be replaced, and then let the candidate fonts and the fonts that do not need to be replaced make up font resources after font replacement; a measurement information extraction device configured to extract character shape measurement information of the characters in the text segments based on the font resources after font replacement; and a calculation device configured to calculate the circumscribed rectangles of the characters based on the text-segment-related commands and the character shape measurement information of the characters, wherein the font replacement device utilizes a matching approach to determine similarities between fonts in the outer replacement font table, prepared to be selected and the fonts prepared to be replaced based on the aspect of character shape measurement, and then lets the fonts prepared to be selected having highest similarities serve as the candidate fonts, wherein the font replacement device determines a similarity between a font prepared to be selected and a font prepared to be replaced based on a distance dis, calculated in equation (1) as follows, between the font prepared to be selected and the font prepared to be replaced; - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A method of extracting circumscribed rectangles of one or more characters in a transplantable electronic document, comprising:
-
a command and resource extraction step of extracting one or more text-segment-related commands and one or more original font resources corresponding to text segments in one or more pages of the transplantable electronic document; a division step of dividing the original font resources into one or more fonts that need to be replaced, and one or more fonts that do not need to be replaced, in which the fonts that need to be replaced serve as fonts prepared to be replaced; a font replacement step of seeking fonts most similar to the fonts prepared to be replaced based on the aspect of character shape measurement in an outer replacement font table as candidate fonts for replacing the fonts prepared to be replaced, and then letting the candidate fonts and the fonts that do not need to be replaced make up font resources after font replacement; a measurement information extraction step of extracting character shape measurement information of the characters in the text segments based on the font resources after font replacement; and a calculation step of calculating the circumscribed rectangles of the characters based on the text-segment-related commands and the character shape measurement information of the characters, wherein a matching approach is utilized in the font replacement step to determine similarities between fonts in the outer replacement font table, prepared to be selected and the fonts prepared to be replaced based on the aspect of character shape measurement, and then lets the fonts prepared to be selected having highest similarities serve as the candidate fonts, wherein a similarity between a font prepared to be selected and a font prepared to be replaced is determined in the font replacement step based on a distance dis, calculated in equation (1) as follows, between the font prepared to be selected and the font prepared to be replaced;
-
Specification