Data processing system and method for generating a representation for and for representing electronically published structured documents
First Claim
1. A data processing system for storing a representation of an electronic document, including first means for storing the electronic document, the electronic document having descriptive markup defining a plurality of hierarchical elements, wherein each element except a root element has an ancestor element and wherein each element has a type name and wherein an element may have a child element, a left sibling element, a right sibling element, and wherein at least one element contains text content, the data processing system comprising:
- second means for storing a value indicative of a parent element for each element having a parent element;
third means for storing a value indicative of a first child element for each element having a child element; and
fourth means for storing a value indicative of a left sibling element, for each element having a left sibling element.
2 Assignments
0 Petitions
Accused Products
Abstract
A data processing system and method for generating a representation of an electronic document, for indexing the electronic document, for navigating the electronic document using its representation and for displaying the electronic document on an output device. The system and method are used with electronic documents having descriptive markup which describes the content or meaning of the document rather than its appearance. Such documents may be represented by a tree. Each markup element defines a node or element in a tree. The tree is represented by providing a unique identifier for each element and for accessing a descriptor of the element. An element descriptor preferably includes indications of the parent, first child, last child, left sibling, right sibling, type name and text location for the element. The document representation is used to facilitate navigation of the text for constructing navigational aids such as table of contents and full text indexing. A document is also provided with a style sheet for specifying desired formatting characteristics for each type of element in the document. To display the document, a suitable starting point is found on the basis of a selected starting point. The document is displayed beginning with the suitable starting point and the format characteristics for each element displayed are retrieved from the style sheet and applied to the text of the displayed element.
187 Citations
54 Claims
-
1. A data processing system for storing a representation of an electronic document, including first means for storing the electronic document, the electronic document having descriptive markup defining a plurality of hierarchical elements, wherein each element except a root element has an ancestor element and wherein each element has a type name and wherein an element may have a child element, a left sibling element, a right sibling element, and wherein at least one element contains text content, the data processing system comprising:
-
second means for storing a value indicative of a parent element for each element having a parent element; third means for storing a value indicative of a first child element for each element having a child element; and fourth means for storing a value indicative of a left sibling element, for each element having a left sibling element. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A method for generating a representation of an electronic document, the electronic document having descriptive markup defining a plurality of hierarchical elements, wherein each element except a root element has an ancestor element and wherein each element has a type name and may have a child element, a left sibling element, a right sibling element, and wherein at least one element contains text content, the method comprising the steps of:
-
parsing the document to provide a sequence of element events in response to detection of elements and text events in response to detection of text content; assigning, in response to an element event, a unique element identifier to each element event; constructing the type name for each element from the descriptive markup defining the element; and generating an element descriptor for each element, addressable by the unique element identifier assigned to the element, wherein the element descriptor stores an indication of any parent element, any first child element, and any left sibling element for the element. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22)
-
-
23. A data processing system for constructing a representation of an electronic document, the electronic document having descriptive markup defining a plurality of hierarchical elements, wherein each element except a root element has an ancestor element and wherein each element has a type name and may have a child element, a left sibling element and a right sibling element and wherein at least one element contains text content, the data processing system comprising:
-
a parser for providing a sequence of element events in response to detection of elements and text events in response to text content; means, responsive to an element event, for assigning a unique element identifier to the element event; means, responsive to an element event, for constructing the type name for the element event from the descriptive markup defining the element event; and means, responsive to an element event, for constructing an element descriptor, addressable by the element identifier assigned to the element event, wherein the element descriptor stores an indication of the type name and any parent element, first child element, last child element, left sibling element, right sibling element, and text content contained within the element. - View Dependent Claims (24, 25, 26, 27, 28, 29, 30, 31, 32)
-
-
33. A method for constructing a representation of an electronic document, the electronic document having descriptive markup defining a plurality of hierarchical elements, wherein each element except a root element has an ancestor element and wherein each element has a type name and may have a child element, a left sibling element and a right sibling element, and wherein at least one element contains text content, the method comprising the steps of:
-
parsing the electronic document to provide a sequence of element events in response to detection of elements and text events in response to detection of text content; assigning a unique element identifier to each element; constructing the type name for each element from the descriptive markup defining the element; and constructing, for each element, an element descriptor addressable by the element identifier assigned to the element, wherein the element descriptor stores an indication of the type name and any parent element, first child element, last child element, left sibling element, right sibling element, and text content contained within the element. - View Dependent Claims (34, 35, 36, 37, 38, 39, 40, 41, 42)
-
- 43. A digital information product containing an electronic document, wherein the electronic document has descriptive markup defining a plurality of hierarchical elements, wherein each element except a root element has an ancestor element and wherein each element has a type name and wherein an element may have a child element, a left sibling element, a right sibling element, and wherein at least one element contains text content, digital information product comprising a computer-readable medium on which computer-readable signals are stored, wherein the computer-readable signals define an element directory containing an element descriptor for each element wherein each element descriptor is accessible using an identifier of the element descriptor for an element and includes a value indicative of a parent element for each element having a parent element, a value indicative of a first child element for each element having a child element, and a value indicative of a left sibling element for each element having a left sibling element.
-
54. A data processing system for providing access to an electronic document, wherein the electronic document has descriptive markup defining a plurality of hierarchical elements, wherein each element except a root element has an ancestor element and wherein each element has a type name and wherein an element may have a child element, a left sibling element, a right sibling element, and wherein at least one element contains text content, the data processing system comprising:
-
a computer-readable medium on which computer-readable signals are stored, wherein the computer-readable signals define an element directory containing an element descriptor for each element wherein each element descriptor is accessible using an identifier of the element descriptor for an element includes a value indicative of a parent element for each element having a parent element, a value indicative of a first child element for each element having a child element, and a value indicative of a left sibling element, for each element having a left sibling element; and means for accessing the computer-readable medium using an identifier of an element to retrieve the element descriptor of the element.
-
Specification