SYSTEM AND METHOD FOR ASSOCIATING STRUCTURED AND MANUALLY SELECTED ANNOTATIONS WITH ELECTRONIC DOCUMENT CONTENTS
First Claim
1. A computer-controlled method of managing manual annotation of electronic documents, whereby unknown electronic documents may be more accurately identified via automatic comparisons to document patterns derived from manually annotated electronic documents, comprising a first storage means for storing at least one of a plurality of documents;
- a second storage means for storing at least one of a plurality of document annotation definitions;
a third storage means for storing at least one of a plurality of selected document annotation values and at least one of a plurality of document index values identifying one of a plurality of said documents or portions thereof to which said selected document annotation values relate; and
a document annotation value capture means;
1 Assignment
0 Petitions
Accused Products
Abstract
A system and method is provided for assisting a human document annotator in recording semantic judgments about the contents of sample electronic documents. A system administrator first configures and stores a document annotation definition at a server computer, providing a precise and consistent structure for annotating documents and portions of documents. Documents intended to serve as sample documents for pattern matching against unknown documents are collected and stored at the server computer. A human annotator located at a client computer connected by a net-work to the server computer requests a display of a sample document to be annotated. A document is transmitted in an annotatable form from the server computer to the client computer. The human document annotator reviews the annotatable document, records semantic judgments about the document using interactive controls displayed with the document, and transmits a set of selected annotation values to the server computer. The server computer then stores the values and associates them with the document. The set of annotated documents, enhanced by the addition of structured semantic judgment information, then may be queried by other document management systems, improving the accuracy with which other systems perform automated document retrieval, comparison or filtering actions.
-
Citations
24 Claims
-
1. A computer-controlled method of managing manual annotation of electronic documents, whereby unknown electronic documents may be more accurately identified via automatic comparisons to document patterns derived from manually annotated electronic documents, comprising a first storage means for storing at least one of a plurality of documents;
- a second storage means for storing at least one of a plurality of document annotation definitions;
a third storage means for storing at least one of a plurality of selected document annotation values and at least one of a plurality of document index values identifying one of a plurality of said documents or portions thereof to which said selected document annotation values relate; and
a document annotation value capture means;
- View Dependent Claims (2, 3, 4, 5, 6)
- a second storage means for storing at least one of a plurality of document annotation definitions;
-
7. A computer-controlled method of managing manual annotation of electronic documents, whereby unknown electronic documents may be more accurately identified via automatic comparisons to document patterns derived from manually annotated electronic documents, comprising the steps of:
-
(a) Providing a computer network means of data communications between at least one of a plurality of client computers each serving as a document annotation workstation and at least one of a plurality of server computers;
(b) Providing on said server computer;
i. a first storage means for storing at least one of a plurality of documents;
ii. a second storage means for storing at least one of a plurality of document annotation definitions; and
iii. a third storage means for storing at least one of a plurality of selected document annotation values and at least one of a plurality of document index values identifying one of a plurality of said documents or portions thereof to which said selected document annotation values relate;
(c) Providing for each of said document annotation workstations;
i. a display means for a simultaneous user interface screen display of at least one of a plurality of documents and at least one of a set of selectable document annotation value interactive input controls; and
ii. an input means enabling a human annotator to perform interactive entry of information and commands into said document annotation workstation;
(d) Providing on said server computer a document information distribution means configured to transmit to at least one of said plurality of document annotation workstations on demand a copy of an annotatable document including said full text of said document and including at least two of said plurality of selectable document annotation values and labels associated with said selectable document annotation values and including at least one of said plurality of document index values associated with said selectable document annotation values;
(e) Providing on said server computer an annotation reception means configured to receive and store at least one of a plurality of selected document annotation values and at least one of said plurality of document index values transmitted from at least one of said plurality of document annotation workstations;
(f) Responsive to an electronic document retrieval request, said request originated by one of a plurality of human annotators located at one of a plurality of said document annotation workstations, automatically selecting at least one of a plurality of documents stored at said server computer and transmitting to said document annotation workstation an annotatable document;
(g) Receiving and simultaneously displaying at said document annotation workstation a user interface screen display of said copy of said annotatable document and at least one of a plurality of interactive controls configured to accept input commands responsive to said human annotator'"'"'s selection of at least one of said selectable document annotation values;
(h) Providing at said document annotation workstation a screen display of an interactive control and an automated means causing, responsive to input of said human annotator, transmission to said server computer at least one of a plurality of said selected document annotation values selected by said human annotator;
(i) Responsive to receipt of said selected document annotation values from said document annotation workstation, automatically receiving and storing at said server computer said selected document annotation values and said document index values, whereby said selectable document annotation values are bound via said document index values to said document or to one of a plurality of said document text substrings. - View Dependent Claims (8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24)
-
Specification