Real time structured summary search engine
First Claim
1. A method of processing electronic documents for subsequent retrieval, comprising the steps of:
- storing in memory a summary structure database describing the structure of summary records associated with each document, each structured summary record having at least one descriptor field with predefined allowed field entries identifying A characteristic of the document;
for the or each said descriptor field storing in memory respective groups of predefined criteria keywords associated with each of said allowed field entries;
analyzing each document to build a text index listing the occurrence of unique significant words in the document; and
matching said text index with said predefined criteria keywords to determine the appropriate corresponding field entry for the associated descriptor field in accordance with predetermined criteria.
2 Assignments
0 Petitions
Accused Products
Abstract
A method of organizing electronic documents for storage and subsequent retrieval, involves storing a summary structure describing the structure of summary records associated with each document. Each structured summary record has at least one descriptor field representative of a characteristic of the document. Predefined field entries identify a characteristic of the document. Predefined keyword criteria associated with the field entries are stored. Each document is analyzed to build a text index listing the occurrence of unique significant words in the document. The text index is compared with the keyword criteria to determine the appropriate field entry for the document. For example, one descriptor field might related to topic, which could have the field entries of “financial” or “sports”. The preponderance of certain keyword criteria, such as “money” or “shares” would identify the document with the financial topic.
76 Citations
15 Claims
-
1. A method of processing electronic documents for subsequent retrieval, comprising the steps of:
-
storing in memory a summary structure database describing the structure of summary records associated with each document, each structured summary record having at least one descriptor field with predefined allowed field entries identifying A characteristic of the document;
for the or each said descriptor field storing in memory respective groups of predefined criteria keywords associated with each of said allowed field entries;
analyzing each document to build a text index listing the occurrence of unique significant words in the document; and
matching said text index with said predefined criteria keywords to determine the appropriate corresponding field entry for the associated descriptor field in accordance with predetermined criteria. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A system for processing electronic documents for subsequent retrieval, comprising:
-
a memory storing a summary structure describing the structure of summary records associated with each document, each structured summary record having at least one descriptor field with predefined allowed field entries identifying a characteristic of the document;
a memory storing, for the or each said descriptor field, groups of predetermined criteria keywords associated with each of said respective allowed field entries;
means for analyzing each document to build a text index listing the occurrence of unique significant words in the document; and
means for matching said text index with said criteria keywords to determine the appropriate field entry for the associated descriptor field in accordance with predetermined criteria.
-
Specification