Data processing system and method for generating a representation for and random access rendering of electronic documents
First Claim
1. A computer program product for constructing a representation of a digital document, the digital document having descriptive markup defining a plurality of hierarchical elements, wherein each element except a root element has an ancestor element and wherein each element has a type name and may have a child element, a left sibling element and a right sibling element, and wherein at least one element contains text content, the computer program product comprising:
- a computer readable medium having computer program logic stored thereon, wherein the computer program logic defines;
a parser for providing a sequence of element events in response to detection of elements and text events in response to text content;
means, responsive to an element event, for assigning a unique element identifier to the element event;
means, responsive to an element event, for constructing the type name for the element event from the descriptive markup defining the element event; and
means, responsive to an element event, for constructing an element descriptor, addressable by the element identifier assigned to the element event, wherein the element descriptor stores an indication of the type name and any parent element, first child element, last child element, left sibling element, right sibling element, and text content contained within the element.
1 Assignment
0 Petitions
Accused Products
Abstract
A data processing system and method for generating a representation of an electronic document, for indexing the electronic document, for navigating the electronic document using its representation and for displaying the electronic document on an output device. The system and method are used with electronic documents having descriptive markup which describes the content or meaning of the document rather than its appearance. Such documents may be represented by a tree. Each markup element defines a node or element in a tree. The tree is represented by providing a unique identifier for each element and for accessing a descriptor of the element. An element descriptor preferably includes indications of the parent, first child, last child, left sibling, right sibling, type name and text location for the element. The document representation is used to facilitate navigation of the text for constructing navigational aids such as table of contents and full text indexing. A document is also provided with a style sheet for specifying desired formatting characteristics for each type of element in the document. To display the document, a suitable starting point is found on the basis of a selected starting point. The document is displayed beginning with the suitable starting point and the format characteristics for each element displayed are retrieved from the style sheet and applied to the text of the displayed element.
35 Citations
30 Claims
-
1. A computer program product for constructing a representation of a digital document, the digital document having descriptive markup defining a plurality of hierarchical elements, wherein each element except a root element has an ancestor element and wherein each element has a type name and may have a child element, a left sibling element and a right sibling element, and wherein at least one element contains text content, the computer program product comprising:
a computer readable medium having computer program logic stored thereon, wherein the computer program logic defines; a parser for providing a sequence of element events in response to detection of elements and text events in response to text content; means, responsive to an element event, for assigning a unique element identifier to the element event; means, responsive to an element event, for constructing the type name for the element event from the descriptive markup defining the element event; and means, responsive to an element event, for constructing an element descriptor, addressable by the element identifier assigned to the element event, wherein the element descriptor stores an indication of the type name and any parent element, first child element, last child element, left sibling element, right sibling element, and text content contained within the element. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
11. A process for generating a digital information product comprising computer data signals defining a representation of a digital document, the digital document having descriptive markup defining a plurality of hierarchical elements, wherein each element except a root element has an ancestor element and wherein each element has type name and may have a child element, a left sibling element, a right sibling element, and wherein at least one element contains text content, the process comprising:
-
parsing the document to provide a sequence of element events in response to detection of elements and text events in response to detection of text content; assigning, in response to an element event, a unique element identifier to each element event; constructing the type name for each element from the descriptive markup defining the element; and generating an element descriptor for each element, addressable by the unique element identifier assigned to the element, wherein the element descriptor stores an indication of any parent element, any first child element, and any left sibling element for the element, and encoding each element descriptor in a computer data signal. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A process for constructing a digital information product comprising computer data signals defining a representation of a digital document, the digital document having descriptive markup defining a plurality of hierarchical elements, wherein each element except a root element has an ancestor element and wherein each element has a type name and may have a child element, a left sibling element and a right sibling element, and wherein at least one element contains text content, the process comprising:
-
parsing the digital document to provide a sequence of element events in response to detection of elements and text events in response to detection of text content; assigning a unique element identifier to each element; constructing the type name for each element from the descriptive markup defining the element; and constructing, for each element, an element descriptor addressable by the element identifier assigned to the element, wherein the element descriptor stores an indication of the type name and any parent element, first child element, last child element, left sibling element, right sibling element, and text content contained within the element, and encoding each element descriptor in a computer data signal. - View Dependent Claims (22, 23, 24, 25, 26, 27, 28, 29, 30)
-
Specification