Style sheet generation
First Claim
Patent Images
1. A method comprising:
- receiving a formatted document, the formatted document having formatted text comprising a plurality of words, each word comprising one or more characters, each character having a character appearance defined by one or more font properties and each word having a word appearance defined by the font properties of its characters, the formatted document being formatted on one or more pages, and each word having a fixed position relative to one of the one or more pages;
partitioning the formatted text into a plurality of groups of words based on the fixed positions of the words relative to their respective pages, the font properties of the words, or both;
assigning an element from a predefined set of markup language elements to each of two or more groups in the plurality of groups of words, the assigning being based on the fixed positions of the words relative to their respective pages, the font properties of the words, or both;
after the element is assigned to each of two or more groups of words, deriving an element style for the assigned element, the element style comprising a character style, a layout style or both, the character style being derived from the font properties of the characters of the words in the two or more groups of words to which the element is assigned, and the layout style being derived from the text properties of the two or more groups of words to which the element is assigned; and
creating an electronic document comprising a style sheet defining the element style.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods and apparatus, including computer program apparatus, implementing techniques for processing a formatted computer-readable source document to create a style sheet. The techniques include partitioning the formatted text into multiple groups of words; deriving an element style for a predefined element that is assigned to two or more groups of words; and creating an electronic document including a style sheet defining the element style.
-
Citations
38 Claims
-
1. A method comprising:
-
receiving a formatted document, the formatted document having formatted text comprising a plurality of words, each word comprising one or more characters, each character having a character appearance defined by one or more font properties and each word having a word appearance defined by the font properties of its characters, the formatted document being formatted on one or more pages, and each word having a fixed position relative to one of the one or more pages; partitioning the formatted text into a plurality of groups of words based on the fixed positions of the words relative to their respective pages, the font properties of the words, or both; assigning an element from a predefined set of markup language elements to each of two or more groups in the plurality of groups of words, the assigning being based on the fixed positions of the words relative to their respective pages, the font properties of the words, or both; after the element is assigned to each of two or more groups of words, deriving an element style for the assigned element, the element style comprising a character style, a layout style or both, the character style being derived from the font properties of the characters of the words in the two or more groups of words to which the element is assigned, and the layout style being derived from the text properties of the two or more groups of words to which the element is assigned; and creating an electronic document comprising a style sheet defining the element style. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A computer program product, tangibly stored on a computer-readable medium, the product comprising instructions operable to cause a programmable system to:
-
receive a formatted document, the formatted document having formatted text comprising a plurality of words, each word comprising one or more characters, each character having a character appearance defined by one or more font properties and each word having a word appearance defined by the font properties of its characters, the formatted document being formatted on one or more pages, and each word having a fixed position relative to one of the one or more pages; partition the formatted text into a plurality of groups of words based on the fixed positions of the words relative to their respective pages, the font properties of the words, or both; assign an element from a predefined set of markup language elements to each of two or more groups in the plurality of groups of words, the assigning being based on the fixed positions of the words relative to their respective pages, the font properties of the words, or both; derive an element style for the assigned element after the element is assigned to each of two or more groups of words, the element style comprising a character style, a layout style or both, the character style being derived from the font properties of the characters of the words in the two or more groups of words to which the element is assigned, and the layout style being derived from the text properties of the two or more groups of words to which the element is assigned; and create an electronic document comprising a style sheet defining the element style. - View Dependent Claims (20, 21, 22, 23, 24, 25, 26, 27, 28)
-
-
29. A system comprising at least one programmable processor and a machine-readable storage device storing instructions for execution by the at least one processor, the instructions comprising instructions to:
-
receive a formatted document, the formatted document having formatted text comprising a plurality of words, each word comprising one or more characters, each character having a character appearance defined by one or more font properties and each word having a word appearance defined by the font properties of its characters, the formatted document being formatted on one or more pages, and each word having a fixed position relative to one of the one or more pages; partition the formatted text into a plurality of groups of words based on the fixed positions of the words relative to their respective pages, the font properties of the words, or both; assign an element from a predefined set of markup language elements to each of two or more groups in the plurality of groups of words, the assigning being based on the fixed positions of the words relative to their respective pages, the font properties of the words, or both; derive an element style for the assigned element after the element is assigned to each of two or more groups of words, the element style comprising a character style, a layout style or both, the character style being derived from the font properties of the characters of the words in the two or more groups of words to which the element is assigned, and the layout style being derived from the text properties of the two or more groups of words to which the element is assigned; and create an electronic document comprising a style sheet defining the element style. - View Dependent Claims (30, 31, 32, 33, 34, 35, 36, 37, 38)
-
Specification