Semantics-bases indexing in a distributed data processing system
First Claim
1. A method for indexing information in a distributed data processing system, the method comprising:
- providing document structure templates comprising model document structures and semantics for the model document structures;
identifying the structure of a document;
selecting a document structure template in dependence upon the structure of the document and the model document structures in the document structure templates; and
storing search keywords from the document in records in a semantics-based search index according to the semantics from the selected document structure template.
1 Assignment
0 Petitions
Accused Products
Abstract
Indexing information in a distributed data processing system, including providing document structure templates comprising model document structures and semantics for the model document structures; identifying the structure of a document; selecting a document structure template in dependence upon the structure of the document and the model document structures in the document structure templates; and storing search keywords from the document in records in a semantics-based search index according to the semantics from the selected document structure template. Selecting a document structure template in dependence upon the structure of the document and the model document structures in the document structure templates typically further comprises comparing the structure of the document and the model document structures in the templates; and selecting a template whose model document structure matches the structure of the document.
-
Citations
30 Claims
-
1. A method for indexing information in a distributed data processing system, the method comprising:
-
providing document structure templates comprising model document structures and semantics for the model document structures;
identifying the structure of a document;
selecting a document structure template in dependence upon the structure of the document and the model document structures in the document structure templates; and
storing search keywords from the document in records in a semantics-based search index according to the semantics from the selected document structure template. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system for indexing information in a distributed data processing system, the system comprising:
-
means for providing document structure templates comprising model document structures and semantics for the model document structures;
means for identifying the structure of a document;
means for selecting a document structure template in dependence upon the structure of the document and the model document structures in the document structure templates; and
means for storing search keywords from the document in records in a semantics-based search index according to the semantics from the selected document structure template. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A computer program product for indexing information in a distributed data processing system, the computer program product comprising:
-
a recording medium;
means, recorded on the recording medium, for providing document structure templates comprising model document structures and semantics for the model document structures;
means, recorded on the recording medium, for identifying the structure of a document;
means, recorded on the recording medium, for selecting a document structure template in dependence upon the structure of the document and the model document structures in the document structure templates; and
means, recorded on the recording medium, for storing search keywords from the document in records in a semantics-based search index according to the semantics from the selected document structure template. - View Dependent Claims (22, 23, 24, 25, 26, 27, 28, 29, 30)
-
Specification