Analyzing externally generated documents in document management system
Abstract
A computer implemented method for analyzing an externally generated document for use in a document management system having a Native Template database including a list of templates for one or more types of documents having common characteristics and a Conversion Database including a list of one or more data points associated with each listed document type, one or more descriptive text entries associated with each listed data point, and proximity range information relating to the location of the data point within the descriptive text. The externally generated document is introduced into the system. The locations of words, sentences, paragraphs, and sections within the document are recorded. A document type is selected from the Native Template database that has characteristics in common with the externally generated document. A data point is selected from the template. The introduced document is searched for Possible Data Points based on the Data Type of the selected data point in the Conversion Database. Proximity range information is obtained from the Conversion Database for the Descriptive Text entries associated with the selected data point. A determination is made as to whether Possible Data Point values for the selected data point are located within the Proximity range for each Descriptive Text entry. A cumulative Evaluation Score is calculated for each Possible Data Point value based on its proximity to each Descriptive Text entry. The Possible Data Point with the highest score that has been accepted by the user is recorded. Upon user acceptance of a Possible Data Point, additional Descriptive Text entries are stored to apply to other externally generated documents. These steps are repeated until each data point has been selected. The user reviews the recorded data which is approved, modified or rejected.
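As an illustration only, the two databases named in the abstract might be modeled as follows. This is a minimal sketch, not the patented implementation: the class names, the field names, the "lease" document type, the "effective_date" data point, and the choice of words as the proximity unit are all assumptions introduced here.

```python
from dataclasses import dataclass, field


@dataclass
class DescriptiveText:
    text: str             # label expected near the value, e.g. "Effective Date"
    proximity_range: int  # assumed unit: number of words from the label


@dataclass
class DataPoint:
    name: str
    data_type: str        # e.g. "date"; the type vocabulary is an assumption
    descriptive_texts: list[DescriptiveText] = field(default_factory=list)


# Native Template database (hypothetical in-memory stand-in):
# document type -> names of the data points listed in its template.
native_template_db: dict[str, list[str]] = {
    "lease": ["effective_date"],
}

# Conversion Database (hypothetical in-memory stand-in):
# document type -> data point name -> data point details.
conversion_db: dict[str, dict[str, DataPoint]] = {
    "lease": {
        "effective_date": DataPoint(
            name="effective_date",
            data_type="date",
            descriptive_texts=[DescriptiveText("Effective Date", 3)],
        ),
    },
}


def data_points_for(document_type: str) -> list[DataPoint]:
    """Resolve each data point named by the template against the Conversion Database."""
    names = native_template_db[document_type]
    return [conversion_db[document_type][name] for name in names]
```

Keeping the template list separate from the per-data-point details mirrors the split the abstract draws between the Native Template database and the Conversion Database; a real system would presumably back both with persistent storage.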
29 Claims
1. A computer implemented method for analyzing an externally generated document for use in a document management system of the type having a Native Template database including a list of templates for one or more types of documents having common characteristics and a Conversion Database including a list of one or more data points associated with each listed document type, one or more Descriptive Text entries associated with each listed data point, and Proximity information which describes the location of the data point in relation to the Descriptive Text, the method comprising the steps of:
(a) introducing the externally generated document into the system;
(b) indexing the externally generated document by recording the locations of words, sentences, paragraphs, and sections within the document;
(c) selecting a document type from the Native Template database that has characteristics in common with the externally generated document;
(d) selecting a data point from the template;
(e) searching the introduced document for Possible Data Points based on the Data Type of the selected data point in the Conversion Database;
(f) obtaining Proximity range information from the Conversion Database for the Descriptive Text entries associated with the selected data point;
(g) determining whether Possible Data Point values for the selected data point are located within the Proximity range for each Descriptive Text entry, using the index information created in (b);
(h) calculating a cumulative Evaluation Score for each Possible Data Point value based on its proximity to each Descriptive Text entry;
(i) recording the Possible Data Point with the highest score that has been accepted by the user;
(j) upon user acceptance of a Possible Data Point, storing additional Descriptive Text entries to apply to other externally generated documents; and
(k) repeating steps (d)-(j) until each data point has been selected.
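Steps (b) and (e) through (h) can be sketched roughly as below. This is a simplified illustration under stated assumptions, not the claimed method itself: the index records word positions only, the Data Type "date" is matched with an assumed regular expression, the Proximity range is measured in words, and the Evaluation Score simply adds one point for each Descriptive Text entry found within range. The function names and the sample text are hypothetical.

```python
import re

# Assumed pattern for the Data Type "date" (MM/DD/YYYY); the claims do not define one.
DATE = re.compile(r"\d{1,2}/\d{1,2}/\d{4}")
PUNCT = ".,:;"


def index_words(text):
    """Step (b), simplified: a word's list position serves as its recorded location."""
    return text.split()


def find_phrase(words, phrase):
    """Return the word positions at which the Descriptive Text phrase starts."""
    target = phrase.lower().split()
    n = len(target)
    return [
        i
        for i in range(len(words) - n + 1)
        if [w.lower().strip(PUNCT) for w in words[i:i + n]] == target
    ]


def score_candidates(text, pattern, descriptive_texts):
    """Steps (e)-(h): find Possible Data Points and score them cumulatively.

    descriptive_texts is a list of (phrase, proximity_range_in_words) pairs.
    """
    words = index_words(text)
    scores = {}
    for pos, word in enumerate(words):
        token = word.strip(PUNCT)
        if not pattern.fullmatch(token):
            continue  # not a Possible Data Point of this Data Type
        total = 0
        for phrase, proximity in descriptive_texts:
            hits = find_phrase(words, phrase)
            # Assumed scoring rule: one point per entry found within range.
            if any(abs(pos - hit) <= proximity for hit in hits):
                total += 1
        scores[(pos, token)] = total
    return scores


sample = ("Effective Date: 01/15/2020. This lease was signed "
          "by the tenant on 03/02/2019 in Boston.")
scores = score_candidates(sample, DATE, [("Effective Date", 3), ("Date", 2)])
best = max(scores, key=scores.get)
```

In this sketch `best` is the candidate that would be presented to the user first; steps (i) and (j), which depend on user acceptance, are left out.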
17. A computer implemented method for analyzing an externally generated document for use in a document management system of the type having a Native Template database including a list of templates for one or more types of documents having common characteristics and a Conversion Database including a list of one or more data points associated with each listed document type, one or more Descriptive Text entries associated with each listed data point, and Proximity information which describes the location of the data point in relation to the Descriptive Text, the method comprising the steps of:
(gg) setting up the system to determine how a specific variation of a document relates to a template;
(hh) introducing the externally generated document into the system;
(ii) analyzing the introduced document by selecting a template from the Native Template database relating to a document type that has characteristics in common with the introduced document, selecting a data point from the template, searching the introduced document for text associated with the selected data point and recording the data point and the location of the associated text in the Conversion Database;
(jj) presenting the recorded data to the user for review; and
(kk) approving, modifying or rejecting the presented data.
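The review in steps (jj) and (kk) could look something like the following. This is a hedged sketch only: the `Decision` enum, the `RecordedValue` record, and the `review` function are hypothetical names, and storing the location as a word offset is an assumption, since the claim does not say how locations are represented.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class Decision(Enum):
    APPROVE = "approve"
    MODIFY = "modify"
    REJECT = "reject"


@dataclass(frozen=True)
class RecordedValue:
    data_point: str
    value: str
    location: int  # assumed: word offset of the associated text


def review(record: RecordedValue, decision: Decision,
           corrected: Optional[str] = None) -> Optional[RecordedValue]:
    """Apply the user's decision from step (kk) to one recorded value.

    Returns the value to keep, or None when the user rejects it.
    """
    if decision is Decision.APPROVE:
        return record
    if decision is Decision.MODIFY:
        if corrected is None:
            raise ValueError("a modified value requires the corrected text")
        return RecordedValue(record.data_point, corrected, record.location)
    return None  # Decision.REJECT


recorded = RecordedValue("effective_date", "01/15/2020", 2)
approved = review(recorded, Decision.APPROVE)
modified = review(recorded, Decision.MODIFY, corrected="01/16/2020")
rejected = review(recorded, Decision.REJECT)
```

A modified value keeps the original location here, so the correction can still be traced back to the text the system found.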
Specification