System and method for comparative analysis of textual documents
First Claim
1. A method of comparing the semantic content of two or more documents, comprising:
- accessing two or more documents;
performing a linguistic analysis on each document;
outputting a quantified representation of the semantic content of each document; and
comparing the quantified representations using a defined algorithm.
7 Assignments
0 Petitions
Accused Products
Abstract
A system and method are presented for the comparative analysis of textual documents. In an exemplary embodiment of the present invention the method includes accessing two or more documents, performing a linguistic analysis on each document, outputting a quantified representation of a semantic content of each document, and comparing the quantified representations using a defined metric. In exemplary embodiments of the present invention such a metric can measure relative semantic closeness or distance of two documents. In exemplary embodiments of the present invention the semantic content of a document can be expressed as a semantic vector. The format of a semantic vector is flexible, and in exemplary embodiments of the present invention it and any metric used to operate on it can be adapted and optimized to the type and/or domain of documents being analyzed and the goals of the comparison.
-
Citations
32 Claims
-
1. A method of comparing the semantic content of two or more documents, comprising:
-
accessing two or more documents;
performing a linguistic analysis on each document;
outputting a quantified representation of the semantic content of each document; and
comparing the quantified representations using a defined algorithm. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A method of comparing two or more documents, comprising:
-
linguistically analyzing two or more documents;
generating a semantic vector associated with each document; and
comparing the semantic vectors using a defined metric. - View Dependent Claims (15, 16, 17, 18, 19, 20, 21, 22)
-
-
23. A system for comparing two or more documents, comprising:
-
a document inputter, arranged to access two or more documents;
a semantic analyzer, arranged to perform a linguistic analysis on each document;
a semantic quantifier, arranged to output a quantified representation of a semantic content of each document; and
a comparator, arranged to compare the quantified representations using a defined algorithm.
-
-
24. A system for comparing two or more documents, comprising:
-
a document inputter, arranged to access two or more documents;
a semantic analyzer, arranged to perform a linguistic analysis on each document;
a semantic vector generator, arranged to output a semantic vector associated with each document; and
a comparator, arranged to compare the semantic vectors using a defined metric. - View Dependent Claims (25)
-
-
26. A computer program product comprising a computer usable medium having computer readable program code means embodied therein, the computer readable program code means in said computer program product comprising means for causing a computer to:
-
access two or more documents;
perform a linguistic analysis on each document;
output a quantified representation of a semantic content of each document; and
compare the quantified representations using a defined algorithm.
-
-
27. A computer program product comprising a computer usable medium having computer readable program code means embodied therein, the computer readable program code means in said computer program product comprising means for causing a computer to:
-
linguistically analyzing two or more documents;
generating a semantic vector associated with each document; and
comparing the semantic vectors using a defined metric. - View Dependent Claims (28, 29, 30, 31, 32)
-
Specification