Enhanced document input parsing
First Claim
1. A method, in an information handling system comprising a processor and a memory, of analyzing documents, the method comprising:
receiving, at a question answer system, an electronic document that includes a content and a revision metadata generated by at least one identified revision author;
retrieving, from an endorser at the question answer system, endorsement data that endorses the identified revision author, wherein the endorser is different than the identified revision author;
identifying, by the question answer system, a topic area associated with the electronic document;
collecting, from one or more network sources by the question answer system, expertise-related data corresponding to the revision author relative to the identified topic area;
determining, by the question answer system, a revision author expertise level of the identified revision author based upon the endorsement data and the expertise-related data;
identifying, by the question answer system, a confidence level based on the revision author expertise level;
assigning, by the question answer system, the confidence level to the electronic document content; and
in response to determining that the confidence level of the electronic document is higher than a different confidence level of a different document:
selecting, by the question answer system, the electronic document over the different electronic document; and
generating, by the question answer system, one or more answers to a question based, at least in part, on the selected electronic document.
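The steps recited in claim 1 can be illustrated with a minimal sketch. Everything below is hypothetical and is not taken from the patent: the `Document` fields, the function names, the 2:1 weighting of endorsements against topic evidence, and the squashing constant are all assumptions, and the confidence level is simply set equal to the expertise level. The one detail taken directly from the claim is that an endorsement counts only when the endorser differs from the revision author.

```python
from dataclasses import dataclass


@dataclass
class Document:
    doc_id: str
    content: str
    revision_author: str  # identified from the revision metadata
    topic: str            # topic area associated with the document
    confidence: float = 0.0


def expertise_level(author, topic, endorsements, topic_evidence):
    """Combine endorsement data and expertise-related data into a score in [0, 1)."""
    # Count only endorsements made by someone other than the author.
    endorsement_count = sum(
        1 for endorser, endorsed in endorsements
        if endorsed == author and endorser != author
    )
    # Expertise-related data for this author in this topic area (0 if none found).
    evidence_count = topic_evidence.get((author, topic), 0)
    raw = 2.0 * endorsement_count + evidence_count  # weights are assumptions
    return raw / (raw + 10.0)  # squashing constant is an assumption


def assign_confidence(doc, endorsements, topic_evidence):
    """Assign a confidence level to the document based on its author's expertise."""
    doc.confidence = expertise_level(
        doc.revision_author, doc.topic, endorsements, topic_evidence
    )
    return doc


def select_document(doc, other):
    """Select doc only if its confidence is strictly higher; otherwise keep other."""
    return doc if doc.confidence > other.confidence else other


# Example data: (endorser, endorsed author) pairs; bob's self-endorsement is ignored.
endorsements = [("carol", "alice"), ("dave", "alice"), ("bob", "bob")]
topic_evidence = {("alice", "cardiology"): 6, ("bob", "cardiology"): 1}

doc_a = assign_confidence(
    Document("a", "Revised dosage guidance.", "alice", "cardiology"),
    endorsements, topic_evidence,
)
doc_b = assign_confidence(
    Document("b", "Older dosage guidance.", "bob", "cardiology"),
    endorsements, topic_evidence,
)
selected = select_document(doc_a, doc_b)
```

In this sketch a question answer system would then generate its answers from `selected.content`. How the confidence level is actually derived from the expertise level is not specified in the claim, so the identity mapping used here is only a placeholder.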
Abstract
An approach is provided for an information handling system that includes a processor and a memory to analyze documents. In the approach, an electronic document is received with the document including content, such as text, and revision metadata that is associated with the content. The revision metadata is analyzed and the approach identifies a confidence level based on the analysis. The confidence level is associated with the electronic document content. The confidence level can then be utilized by a Question and Answer (QA) system.
6 Claims
Specification