Office correspondence storage and retrieval system
First Claim
1. A method for abstracting and archiving a document in machine readable form comprising the steps of:
(a) storing a dictionary of language terms commonly used in document preparation;
(b) appending codes to the language terms in said dictionary of language terms to identify selected parts of speech;
(c) comparing the language terms in an input document with the stored dictionary of language terms;
(d) selecting language terms from said input document which do not compare to the stored dictionary of language terms;
(e) selecting language terms from said input document which compare with language terms in said stored dictionary of language terms identified as selected parts of speech;
(f) coding the selected language terms with the identity of the input document; and
(g) storing the selected language terms for later recall.
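The steps of claim 1 can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the dictionary contents, the part-of-speech codes, the choice of nouns as the "selected parts of speech", the tokenizer, and the names `DICTIONARY`, `SELECTED_POS`, and `abstract_document` are all hypothetical.

```python
import re

# Hypothetical dictionary (steps a and b): common term -> part-of-speech code.
DICTIONARY = {
    "the": "ART", "a": "ART", "of": "PREP", "to": "PREP", "is": "VERB",
    "was": "VERB", "sent": "VERB", "contract": "NOUN", "meeting": "NOUN",
    "budget": "NOUN", "new": "ADJ",
}
# Assumption: nouns are the "selected parts of speech".
SELECTED_POS = {"NOUN"}


def abstract_document(doc_id, text):
    """Return (term, doc_id) records for the keywords selected from text."""
    records = []
    seen = set()
    for term in re.findall(r"[a-z0-9]+", text.lower()):
        if term in seen:
            continue
        pos = DICTIONARY.get(term)           # step (c): compare with dictionary
        not_in_dictionary = pos is None      # step (d): e.g. names, jargon
        is_selected_pos = pos in SELECTED_POS  # step (e): selected part of speech
        if not_in_dictionary or is_selected_pos:
            seen.add(term)
            records.append((term, doc_id))   # step (f): code with document id
    return records                           # step (g): caller stores these


ARCHIVE = abstract_document("DOC-001", "The new contract was sent to Acme.")
```

In this sketch "contract" is kept because it is a dictionary noun, and "acme" is kept because it is absent from the dictionary; the remaining words are common terms with unselected parts of speech and are dropped.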
Abstract
A system intelligently abstracts and archives a document for storage and interprets a free-form user retrieval query to recall the document from the storage file. The system includes a method for automatically selecting keywords from the document using a parts-of-speech directory. A method is given for weighing the importance or centrality of each keyword with respect to the document of its origin. Using the same logic paths, the system accepts a free-form query that describes the document in the same manner that it would have to be described to a secretary to "find" it in a filing cabinet, automatically determines the key matching terms, and finds the archived document(s) with the greatest affinity.
153 Citations
11 Claims
1. A method for abstracting and archiving a document in machine readable form comprising the steps of:
(a) storing a dictionary of language terms commonly used in document preparation;
(b) appending codes to the language terms in said dictionary of language terms to identify selected parts of speech;
(c) comparing the language terms in an input document with the stored dictionary of language terms;
(d) selecting language terms from said input document which do not compare to the stored dictionary of language terms;
(e) selecting language terms from said input document which compare with language terms in said stored dictionary of language terms identified as selected parts of speech;
(f) coding the selected language terms with the identity of the input document; and
(g) storing the selected language terms for later recall.
Dependent claims: 2, 3
4. A method for retrieving a document from storage in response to input language terms descriptive of the content of the document comprising the steps of:
(a) comparing each of the input language terms to stored document abstract files of language terms, each document abstract language term having associated with it a code identifying its part of speech, a count indicating its frequency of occurrence in the document, a count of the number of pages in the document, and an indicator of the position of occurrence of the term in the document;
(b) accumulating a retrieval record for each document abstract file composed of the language terms that compare equal;
(c) calculating a document retrieval value for each retrieval record using the part of speech code, frequency count, number of pages in the document, and position indicator for each language term in the retrieval record;
(d) increasing the document retrieval value for each retrieval record that includes a month and/or year; and
(e) selecting the document corresponding to the highest calculated retrieval value for output.
Dependent claims: 5
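The retrieval steps of claim 4 can be illustrated with a small sketch. The claim names the inputs to the retrieval value (part of speech code, frequency count, page count, position indicator) but this section does not give the formula, so the weights, the frequency-per-page density, the first-page boost, and the size of the date bonus below are all invented for illustration. The names `AbstractTerm`, `POS_WEIGHT`, `DATE_BONUS`, and `retrieve` are likewise hypothetical, and the position indicator is assumed to be the page of first occurrence.

```python
from dataclasses import dataclass


@dataclass
class AbstractTerm:
    term: str
    pos: str        # part-of-speech code
    frequency: int  # occurrences of the term in the document
    pages: int      # number of pages in the document
    position: int   # assumed encoding: page of first occurrence (1-based)


# Hypothetical weights; the claim does not specify any values.
POS_WEIGHT = {"NOUN": 2.0, "UNKNOWN": 3.0}
DATE_BONUS = 2.0
MONTHS = {"january", "february", "march", "april", "may", "june", "july",
          "august", "september", "october", "november", "december"}


def is_month_or_year(term):
    return term in MONTHS or (term.isdigit() and len(term) == 4)


def retrieval_value(matched):
    """Step (c), plus the step (d) bonus, for one retrieval record."""
    value = 0.0
    for t in matched:
        density = t.frequency / t.pages              # frequency per page
        position_factor = 2.0 if t.position == 1 else 1.0
        value += POS_WEIGHT.get(t.pos, 1.0) * density * position_factor
    if any(is_month_or_year(t.term) for t in matched):
        value += DATE_BONUS                          # step (d)
    return value


def retrieve(query_terms, abstracts):
    """Return (doc_id, value) for the best match, or None if nothing matches."""
    query = {q.lower() for q in query_terms}
    best = None
    for doc_id, terms in abstracts.items():
        matched = [t for t in terms if t.term in query]  # steps (a) and (b)
        if not matched:
            continue
        value = retrieval_value(matched)
        if best is None or value > best[1]:              # step (e)
            best = (doc_id, value)
    return best


ABSTRACTS = {
    "DOC-001": [AbstractTerm("contract", "NOUN", 4, 2, 1),
                AbstractTerm("acme", "UNKNOWN", 2, 2, 2)],
    "DOC-002": [AbstractTerm("contract", "NOUN", 1, 1, 1),
                AbstractTerm("march", "NOUN", 1, 1, 1)],
}
```

With these made-up weights, the query "contract Acme March" favors DOC-001, while a query containing only "March" returns DOC-002 with the date bonus applied.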
6. A system for abstracting a document in machine readable form comprising:
means for storing a dictionary of language terms commonly used in document preparation, said language terms including a code identifying certain ones of said language terms as selected parts of speech;
means for receiving an input document of language terms in machine readable form, said input document including an identification code;
a memory;
control means connected to said means for storing, said means for receiving and said memory, including,
means for comparing the language terms of said input document to said dictionary of language terms,
first selecting means responsive to said means for comparing for selecting the language terms from said input document that compare unequal,
second selecting means responsive to said means for comparing for selecting the language terms from said input document that compare equal and are coded as selected parts of speech;
first counting means responsive to said first and second selecting means for counting the frequency of occurrence of each selected language term in the input document;
second counting means responsive to said means for receiving for counting the number of pages in the document;
means responsive to said first and second selecting means for calculating the position of occurrence of the selected language terms in the input document; and
means responsive to said first and second selecting means, said first and second counting means, and said means for calculating for storing in said memory a record of each selected language term including the document identification code, the language term, the selected part of speech code, the frequency of occurrence count, the count of pages in the document, and the position of occurrence code.
Dependent claims: 7
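The record that claim 6 stores for each selected term can be sketched as a small data structure. The field layout follows the claim, but everything else is an assumption made for the example: pages are taken to be separated by form-feed characters, the position of occurrence code is taken to be the page of first occurrence, terms not found in the dictionary are given the code "UNK", and the names `TermRecord` and `build_records` are hypothetical.

```python
import re
from collections import Counter
from typing import NamedTuple


class TermRecord(NamedTuple):
    doc_id: str     # document identification code
    term: str       # the language term
    pos: str        # selected part-of-speech code ("UNK" if not in dictionary)
    frequency: int  # frequency of occurrence count
    pages: int      # count of pages in the document
    position: int   # assumed encoding: page of first occurrence (1-based)


def build_records(doc_id, text, dictionary, selected_pos):
    pages = text.split("\f")  # assumption: form feed separates pages
    freq = Counter()
    first_page = {}
    for page_no, page in enumerate(pages, start=1):
        for term in re.findall(r"[a-z0-9]+", page.lower()):
            pos = dictionary.get(term)
            # Keep terms that compare unequal (not in the dictionary) or that
            # compare equal and carry a selected part-of-speech code.
            if pos is None or pos in selected_pos:
                freq[term] += 1
                first_page.setdefault(term, page_no)
    return [
        TermRecord(doc_id, term, dictionary.get(term, "UNK"),
                   count, len(pages), first_page[term])
        for term, count in freq.items()
    ]


RECORDS = build_records(
    "DOC-7",
    "Budget meeting notes.\fThe budget for Acme.",
    {"the": "ART", "for": "PREP", "budget": "NOUN", "meeting": "NOUN"},
    {"NOUN"},
)
```

Here "budget" yields a record with a frequency of 2 across 2 pages, first seen on page 1, while "acme" is kept as an out-of-dictionary term first seen on page 2.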
8. A system for retrieving a document from storage in response to an input query of language terms descriptive of the content of the document comprising:
a memory having stored therein language term records including the language term, identification codes of documents containing the language term, a selected parts of speech code, a frequency of occurrence count for the language term, a count of pages in each document, and a position of occurrence code for each document identification code in each language term record;
means for comparing the language terms of the input query to language term records stored in said memory;
means for accumulating a retrieval record for each document identification code of each language term that compares equal;
means responsive to said means for accumulating for calculating a document retrieval value for each retrieval record using the selected part of speech code, frequency of occurrence count, count of pages and position of occurrence code; and
means responsive to said means for calculating for outputting from memory the document whose identification code corresponds to the identification code for the highest calculated retrieval value.
Dependent claims: 9, 10, 11
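The memory described in claim 8 is keyed by language term rather than by document, which resembles what is now commonly called an inverted index. The sketch below shows one possible layout and the accumulation of per-document retrieval records. The dictionary-of-dictionaries layout, the sample data, and the names `INDEX`, `accumulate`, and `best_document` are assumptions, and the scoring is a deliberately simple placeholder (frequency per page) rather than the claimed calculation.

```python
from collections import defaultdict

# Hypothetical memory layout:
# term -> {doc_id: (pos_code, frequency, pages, position)}
INDEX = {
    "contract": {"DOC-001": ("NOUN", 4, 2, 1), "DOC-002": ("NOUN", 1, 1, 1)},
    "acme": {"DOC-001": ("UNK", 2, 2, 2)},
    "budget": {"DOC-002": ("NOUN", 3, 1, 1)},
}


def accumulate(query_terms, index):
    """Build one retrieval record per document id for every matching term."""
    records = defaultdict(list)
    for raw in query_terms:
        term = raw.lower()
        for doc_id, posting in index.get(term, {}).items():
            records[doc_id].append((term, posting))
    return dict(records)


def best_document(query_terms, index):
    """Return the doc id with the highest placeholder retrieval value."""
    records = accumulate(query_terms, index)
    if not records:
        return None

    def value(doc_id):
        # Placeholder score: sum of frequency per page over matched terms.
        return sum(freq / pages for _, (_, freq, pages, _) in records[doc_id])

    return max(records, key=value)
```

For example, `best_document(["Contract", "Acme"], INDEX)` selects "DOC-001", since both query terms match it and only one matches "DOC-002". Keying the memory by term means a query only touches the records of terms it actually contains.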
Specification