UTILIZING CLASSIFICATION AND TEXT ANALYTICS FOR ANNOTATING DOCUMENTS TO ALLOW QUICK SCANNING
1 Assignment
0 Petitions
Accused Products
Abstract
Classification, text analytics, and natural language processing are used to evaluate passages, extract text, identify concepts, and provide visual cues and notations to assist readers in scanning and evaluating large amounts of information in a document.
18 Citations
25 Claims
-
1-9. -9. (canceled)
-
10. A system for annotating a document comprising:
-
(a) a classifier with domain and document-type taxonomies, wherein the classifier is configured to; (i) determine a type of the document; and (ii) determine a subject domain of the document; (b) an annotation model with information to determine and drive an annotation strategy based on various document types; (c) a text analytics system with multiple domain models, wherein the subject domain determines which domain model to load into the text analytics system, and wherein the text analytics system is configured to; (i) provide annotations of each paragraph of the document based on the domain model and annotation model; and (d) a custom viewer/renderer application configured to annotate the document with the annotations and render the document including the annotations. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A computer program product for annotating a document, the computer program product comprising:
a computer readable storage medium having computer readable program code embodied therewith, the computer readable program code comprising; computer readable program code configured to obtain the document; computer readable program code configured to determine a type of the document; computer readable program code configured to determine a subject domain of the document; computer readable program code configured to determine an annotation strategy based on the type of document; computer readable program code configured to determine a domain model to load based on the subject domain; computer readable program code configured to segment the document into paragraphs and sections based on a document structure; computer readable program code configured to provide annotations for each paragraph of the document based on the domain model and annotation strategy; computer readable program code configured to annotate text in the document by applying the annotations to original text of the document; and computer readable program code configured to render the document including the annotations. - View Dependent Claims (20, 21, 22, 23, 24, 25)
Specification