Indexing structured documents
First Claim
Patent Images
1. A method for indexing structured documents comprising:
- identifying a structured document in a file system for indexing, the structured document having an identifier and at least one indexing-property;
extracting at least one index-value from the structured document in accordance with a pre-defined extraction rule-set; and
storing the at least one index-value with the identifier in an index-value data structure.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods and apparatus, including computer program products, for indexing structured documents. A method includes identifying a structured document in a file system for indexing, the structured document having an identifier and at least one indexing-property, extracting at least one index-value from the structured document in accordance with a pre-defined extraction rule-set and storing the at least one index-value with the identifier in an index-value data structure.
57 Citations
17 Claims
-
1. A method for indexing structured documents comprising:
-
identifying a structured document in a file system for indexing, the structured document having an identifier and at least one indexing-property;
extracting at least one index-value from the structured document in accordance with a pre-defined extraction rule-set; and
storing the at least one index-value with the identifier in an index-value data structure. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A method comprising:
-
identifying a plurality of structured documents in a file system for indexing, each of the structured documents having an identifier and at least one indexing-property;
extracting at least one index-value from each of the structured documents in accordance with a pre-defined extraction rule-set; and
storing the at least one index-value with the identifier in an index-value data structure for each of the plurality of structured documents. - View Dependent Claims (13, 14, 15)
-
-
16. An article comprising:
a storage medium having stored thereon instructions that when executed by a machine result in the following;
identify a structured document in a file system for indexing, the structured document having an identifier and at least one indexing-property;
extract at least one index-value from the structured document in accordance with a pre-defined extraction rule-set; and
store the at least one index-value with the identifier in an index-value data structure.
-
17. A computer program product, tangibly stored on a machine readable medium, for indexing structured documents, comprising instructions operable to cause a programmable processor to:
-
identify a structured document in a file system for indexing, the structured document having an identifier and at least one indexing-property;
extract at least one index-value from the structured document in accordance with a pre-defined extraction rule-set; and
store the at least one index-value with the identifier in an index-value data structure.
-
Specification