Document processing apparatus having an authoring capability for describing a document structure
1 Assignment
0 Petitions
Accused Products
Abstract
An apparatus and method are disclosed for easily generating document data (tag file) in a form that makes it possible to perform various processes upon the document data. An original document (plain text) is divided into morphological elements, and morphological information is added thereto. Information representing the hierarchical document structures is also added. Furthermore information indicating referential relations between portions in the original document is also added.
-
Citations
33 Claims
-
1-32. -32. (canceled)
-
33. A document processing apparatus comprising:
-
automatic analysis means for automatically analyzing an electronic document and attaching hierarchical structure information representing a document structure to said electronic document in accordance with the result of said automatic analysis, said automatic analysis means automatically analyzes the document structure of said electronic document in the order from a lowest hierarchical level to a highest hierarchical level;
information presenting means for presenting information about the electronic document including said structure information at each hierarchical level so that a user may correct internal information associated with said electronic document on the basis of said information displayed on a display; and
correction means for correcting said internal information associated with said electronic document in response to an operation performed by the user in accordance with the internal information displayed on the display, said correction means corrects the internal structure of said electronic document by adding, removing, or modifying internal information in the order from the lowest hierarchical level to the highest hierarchical level, wherein said automatic analysis means comprises morpheme dividing means for dividing said electronic document into morphemes and morphological information attaching means for attaching morphological information to each said morpheme, and wherein morphological information includes marks.
-
Specification