APPARATUS, METHOD AND COMPUTER PROGRAM PRODUCT FOR PROCESSING DOCUMENTS
First Claim
1. A document processing apparatus comprising:
- an extracting unit that extracts text document information from a document data;
an analyzing unit that analyzes a modification relation of a character string included in the text document information;
an attribute unit that assigns an attribute indicating details of the modification relation to the character string, and embeds the attribute in the text document information;
a document specifying unit that specifies a document-specifying character string that specifies other text document information, using the text document information in which the attribute is embedded by the attribute unit;
a document-identification unit that assigns document identification information to the document-specifying character string, and embeds the document identification information in the text document information;
a receiving unit that receives a character string;
a determining unit that determines whether the text document information includes a document-specifying character string having the modification relation with the character string received by the receiving unit, based on the attribute and the document identification information embedded in the text document information; and
an identifying unit that identifies other text document information indicated by the document-specifying character string, when it is determined that the text document information includes the document-specifying character string.
4 Assignments
0 Petitions
Accused Products
Abstract
A document processing apparatus includes an extracting unit that extracts text document information from a document data; an analyzing unit that analyzes a modification relation of a character string included in the text document information; an attribute unit that assigns an attribute indicating details of the modification relation to the character string, and embeds the attribute in the text document information; a document specifying unit that specifies a document-specifying character string that specifies other text document information, using the text document information in which the attribute is embedded by the attribute unit; and a document-identification unit that assigns document identification information to the document-specifying character string, and embeds the document identification information in the text document information.
18 Citations
20 Claims
-
1. A document processing apparatus comprising:
-
an extracting unit that extracts text document information from a document data; an analyzing unit that analyzes a modification relation of a character string included in the text document information; an attribute unit that assigns an attribute indicating details of the modification relation to the character string, and embeds the attribute in the text document information; a document specifying unit that specifies a document-specifying character string that specifies other text document information, using the text document information in which the attribute is embedded by the attribute unit; a document-identification unit that assigns document identification information to the document-specifying character string, and embeds the document identification information in the text document information; a receiving unit that receives a character string; a determining unit that determines whether the text document information includes a document-specifying character string having the modification relation with the character string received by the receiving unit, based on the attribute and the document identification information embedded in the text document information; and an identifying unit that identifies other text document information indicated by the document-specifying character string, when it is determined that the text document information includes the document-specifying character string. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A document processing method comprising:
-
extracting text document information from a document data; analyzing a modification relation of a character string included in the text document information; assigning an attribute indicating details of the modification relation to the character string indicated by the modification relation, and embedding the attribute in the text document information; specifying a document-specifying character string indicating a character string that specifies other text document information, using the text document information in which the attribute is embedded in the embedding; assigning document identification information to the document-specifying character string, and embedding the document identification information in the text document information; receiving a character string; determining whether the text document information includes a document-specifying character string having the modification relation with the character string, based on the attribute and the document identification information embedded in the text document information; and identifying other text document information indicated by the document-specifying character string, when it is determined that the text document information includes the document-specifying character string. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A computer program product having a computer readable medium including programmed instructions for processing text information, wherein the instructions, when executed by a computer, cause the computer to perform:
-
extracting text document information from a document data; analyzing a modification relation of a character string included in the text document information; assigning an attribute indicating details of the modification relation to the character string indicated by the modification relation, and embedding the attribute in the text document information; specifying a document-specifying character string indicating a character string that specifies other text document information, using the text document information in which the attribute is embedded in the embedding; assigning document identification information to the document-specifying character string, and embedding the document identification information in the text document information; receiving a character string; determining whether there is a character string including a document-specifying character string having a modification relation with the character string, based on the attribute and the document identification information embedded in the text document information; and identifying other text document information indicated by the document-specifying character string, when it is determined that there is a character string including the document-specifying character string. - View Dependent Claims (18, 19, 20)
-
Specification