Pattern identification and correction of document misinterpretations in a natural language processing system
First Claim
Patent Images
1. An information handling system comprising:
- one or more processors;
a memory coupled to at least one of the processors; and
a set of computer program instructions stored in the memory and executed by at least one of the processors in order to perform actions of;
generating a plurality of interpretations of a plurality of document phrases using a natural language processing model, wherein the plurality of interpretations are different than the plurality of document phrases;
receiving a plurality of corrections corresponding to a set of words included in the plurality of interpretations;
determining one or more misinterpretation patterns in response to evaluating the plurality of corrections against the plurality of interpretations and the plurality of document phrases;
generating one or more interpretation rules based upon the one or more misinterpretation patterns;
generating a synonym interpretation rule of a selected one of the one or more interpretation rules, wherein the synonym interpretation rule includes a synonym word of a selected one of the set of words included in the selected interpretation rule;
applying the one or more interpretation rules and the synonym interpretation rule to the natural language processing model; and
evaluating a new document phrase by the natural language processing model, wherein the evaluating further comprises;
generating a probabilistic score of the new document phrase based upon a sentence similarity between the new document phrase and the one or more interpretation rules or the synonym interpretation rule; and
ranking, based on the probabilistic score, a new interpretation of the new document phrase.
2 Assignments
0 Petitions
Accused Products
Abstract
An approach is provided in which a knowledge manager analyzes multiple document phrases using a natural language processing model and generates multiple interpretations based upon the analysis. The knowledge manager identifies misinterpretation patterns by comparing the multiple interpretations with multiple corrections that include corrections to the multiple interpretations. In turn, the knowledge manager generates interpretation rules based upon the identified patterns and applies the interpretation rules to the natural language processing model.
32 Citations
8 Claims
-
1. An information handling system comprising:
-
one or more processors; a memory coupled to at least one of the processors; and a set of computer program instructions stored in the memory and executed by at least one of the processors in order to perform actions of; generating a plurality of interpretations of a plurality of document phrases using a natural language processing model, wherein the plurality of interpretations are different than the plurality of document phrases; receiving a plurality of corrections corresponding to a set of words included in the plurality of interpretations; determining one or more misinterpretation patterns in response to evaluating the plurality of corrections against the plurality of interpretations and the plurality of document phrases; generating one or more interpretation rules based upon the one or more misinterpretation patterns; generating a synonym interpretation rule of a selected one of the one or more interpretation rules, wherein the synonym interpretation rule includes a synonym word of a selected one of the set of words included in the selected interpretation rule; applying the one or more interpretation rules and the synonym interpretation rule to the natural language processing model; and evaluating a new document phrase by the natural language processing model, wherein the evaluating further comprises; generating a probabilistic score of the new document phrase based upon a sentence similarity between the new document phrase and the one or more interpretation rules or the synonym interpretation rule; and ranking, based on the probabilistic score, a new interpretation of the new document phrase. - View Dependent Claims (2, 3, 4)
-
-
5. A computer program product stored in a computer readable storage medium, comprising computer program code that, when executed by an information handling system, causes the information handling system to perform actions comprising:
-
generating a plurality of interpretations of a plurality of document phrases using a natural language processing model, wherein the plurality of interpretations are different than the plurality of document phrases; receiving a plurality of corrections corresponding to a set of words included in the plurality of interpretations; determining one or more misinterpretation patterns in response to evaluating the plurality of corrections against the plurality of interpretations and the plurality of document phrases; generating one or more interpretation rules based upon the one or more misinterpretation patterns; generating a synonym interpretation rule of a selected one of the one or more interpretation rules, wherein the synonym interpretation rule includes a synonym word of a selected one of the set of words included in the selected interpretation rule; applying the one or more interpretation rules and the synonym interpretation rule to the natural language processing model; and evaluating a new document phrase by the natural language processing model, wherein the evaluating further comprises; generating a probabilistic score of the new document phrase based upon a sentence similarity between the new document phrase and the one or more interpretation rules or the synonym interpretation rule; and ranking, based on the probabilistic score, a new interpretation of the new document phrase. - View Dependent Claims (6, 7, 8)
-
Specification