Automatic, unsupervised paraphrase detection
First Claim
1. A system comprising:
a processor;
a data bus coupled to the processor; and
a non-transitory, computer-readable storage medium embodying computer program code, the non-transitory, computer-readable storage medium being coupled to the data bus, the computer program code interacting with a plurality of computer operations and comprising instructions executable by the processor and configured for:
receiving a first phrase and a second phrase by a system;
analyzing the first phrase and the second phrase to provide a semantic and structural hierarchical comparison assessment, the semantic and structural hierarchical comparison assessment having an associated semantic and structural hierarchical comparison assessment value; and
determining whether the semantic and structural hierarchical comparison assessment value exceeds a predetermined paraphrase equivalency criteria;
responsive to determining the semantic and structural hierarchical comparison assessment value exceeds the predetermined paraphrase equivalency criteria, classifying the second phrase as being a rewording of the first phrase.
Abstract
A system, method, and computer-readable medium are disclosed for identifying paraphrases in a natural language processing (NLP) system. A first phrase and a second phrase are received by a system and analyzed to provide a semantic and structural hierarchical comparison assessment, which has an associated semantic and structural hierarchical comparison assessment value. The system then determines whether that value exceeds a predetermined paraphrase equivalency criteria and, responsive to determining that it does, classifies the second phrase as being a rewording of the first phrase.
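The flow described in the abstract (receive two phrases, compute one assessment value, compare it to a predetermined criteria, classify) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration: the source does not say how the semantic and structural hierarchical comparison is computed, so `semantic_score` (token-set overlap) and `structural_score` (token-order similarity) are simple placeholders rather than the claimed method, and the weight and `PARAPHRASE_THRESHOLD` are hypothetical values.

```python
from difflib import SequenceMatcher

# Hypothetical stand-in for the "predetermined paraphrase equivalency
# criteria"; the source does not give a value.
PARAPHRASE_THRESHOLD = 0.6

_PUNCTUATION = ".,;:!?\"'()"


def tokenize(phrase: str) -> list[str]:
    """Lowercase, split on whitespace, and strip surrounding punctuation."""
    tokens = (raw.strip(_PUNCTUATION) for raw in phrase.lower().split())
    return [tok for tok in tokens if tok]


def semantic_score(a: list[str], b: list[str]) -> float:
    """Placeholder semantic proxy: Jaccard overlap of the two token sets."""
    set_a, set_b = set(a), set(b)
    if not set_a and not set_b:
        return 1.0
    return len(set_a & set_b) / len(set_a | set_b)


def structural_score(a: list[str], b: list[str]) -> float:
    """Placeholder structural proxy: similarity of the token sequences."""
    return SequenceMatcher(None, a, b).ratio()


def assessment_value(first: str, second: str, semantic_weight: float = 0.5) -> float:
    """Blend the two placeholder scores into one value in [0, 1]."""
    a, b = tokenize(first), tokenize(second)
    return (semantic_weight * semantic_score(a, b)
            + (1.0 - semantic_weight) * structural_score(a, b))


def is_paraphrase(first: str, second: str,
                  threshold: float = PARAPHRASE_THRESHOLD) -> bool:
    """Classify `second` as a rewording of `first` if the value exceeds the threshold."""
    return assessment_value(first, second) > threshold


if __name__ == "__main__":
    print(is_paraphrase("The cat sat on the mat.", "the cat sat on the mat"))
    print(is_paraphrase("the cat sat", "stocks fell sharply"))
```

Only the final thresholding step mirrors the abstract directly. A real system would presumably replace the two placeholder scores with a comparison over parsed, hierarchical representations of the phrases.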
14 Claims
1. A system comprising:
a processor;
a data bus coupled to the processor; and
a non-transitory, computer-readable storage medium embodying computer program code, the non-transitory, computer-readable storage medium being coupled to the data bus, the computer program code interacting with a plurality of computer operations and comprising instructions executable by the processor and configured for:
receiving a first phrase and a second phrase by a system;
analyzing the first phrase and the second phrase to provide a semantic and structural hierarchical comparison assessment, the semantic and structural hierarchical comparison assessment having an associated semantic and structural hierarchical comparison assessment value; and
determining whether the semantic and structural hierarchical comparison assessment value exceeds a predetermined paraphrase equivalency criteria;
responsive to determining the semantic and structural hierarchical comparison assessment value exceeds the predetermined paraphrase equivalency criteria, classifying the second phrase as being a rewording of the first phrase.
Dependent claims: 2, 3, 4, 5, 6
7. A non-transitory, computer-readable storage medium embodying computer program code, the computer program code comprising computer executable instructions configured for:
receiving a first phrase and a second phrase by a system;
analyzing the first phrase and the second phrase to provide a semantic and structural hierarchical comparison assessment, the semantic and structural hierarchical comparison assessment having an associated semantic and structural hierarchical comparison assessment value; and
determining whether the semantic and structural hierarchical comparison assessment value exceeds a predetermined paraphrase equivalency criteria;
responsive to determining the semantic and structural hierarchical comparison assessment value exceeds the predetermined paraphrase equivalency criteria, classifying the second phrase as being a rewording of the first phrase.
Dependent claims: 8, 9, 10, 11, 12, 13, 14
Specification