Process and system for mapping the relationship of the content of a collection of documents
Abstract
A process is provided for mapping the relationship of the content of a collection of documents (14). The process includes providing a collection (12) of documents (14) where each document (14) includes text. Relevance measures are determined that represent a relevance between each pair of documents (14) based upon the text of the documents (14). A graph (22) is then generated that has nodes (30) and edges (32) with each edge (32) connecting two nodes (30). The graph (22) has a node (30) associated with each document (14) and has an edge (32) connecting nodes (30) for which the relevance measure between associated documents (14) is greater than a specified threshold. In this manner, the graph (22) maps the relationship of the content of the collection (12) of documents (14).
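The thresholding step described in the abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: the function name `build_content_graph`, the adjacency-map representation, and the sample relevance values are all assumptions made for the example.

```python
from itertools import combinations


def build_content_graph(doc_ids, relevance, threshold):
    """Return an adjacency map with one node per document and an edge
    for every pair whose relevance measure exceeds the threshold.

    `relevance` is a hypothetical dict keyed by frozenset({a, b}).
    """
    graph = {doc_id: set() for doc_id in doc_ids}
    for a, b in combinations(doc_ids, 2):
        # Pairs with no stored measure are treated as unrelated.
        score = relevance.get(frozenset((a, b)), float("-inf"))
        if score > threshold:
            graph[a].add(b)
            graph[b].add(a)
    return graph


# Made-up relevance measures for three documents.
relevance = {
    frozenset(("d1", "d2")): 4.2,
    frozenset(("d1", "d3")): 0.3,
    frozenset(("d2", "d3")): 2.5,
}
graph = build_content_graph(["d1", "d2", "d3"], relevance, threshold=2.0)
```

Every document gets a node even if it ends up with no edges, which matches the requirement that the graph have a node associated with each document.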
20 Claims
1. An automated process, using a computer, for mapping a relationship of content of a collection of documents, comprising:
providing a collection of documents, each document including content;
determining relevance measures, each relevance measure representing a relevance between a pair of documents, each relevance measure comprising a log likelihood ratio relevance measure, and each relevance measure based upon a comparison of the content of the pair of documents; and
generating a content graph having nodes and edges, each edge connecting two nodes, the content graph having a node associated with each document and having an edge connecting nodes for which the relevance measure between associated documents is greater than a specified threshold;
such that the content graph maps the relationship of the content of the collection of documents.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
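The claim names a log likelihood ratio relevance measure but this section does not give its formula. As one possible reading only, the sketch below scores a pair of documents by how much more likely one document's words are under a unigram model of the other document than under a model of the whole collection, averaged over both directions. The unigram model, add-one smoothing, symmetrization, function names, and sample texts are all assumptions for illustration, not the patent's definition.

```python
import math
from collections import Counter


def word_counts(text):
    """Lowercase, whitespace-tokenized word counts (assumed tokenization)."""
    return Counter(text.lower().split())


def directed_llr(model, target, background, vocab_size):
    """Sum over the words of `target` of
    log( P(word | model) / P(word | background) ), with add-one smoothing."""
    model_total = sum(model.values())
    background_total = sum(background.values())
    score = 0.0
    for word, n in target.items():
        p_model = (model[word] + 1) / (model_total + vocab_size)
        p_background = (background[word] + 1) / (background_total + vocab_size)
        score += n * math.log(p_model / p_background)
    return score


def llr_relevance(counts_a, counts_b, background):
    """Symmetric relevance: the mean of the two directed scores."""
    vocab_size = len(background)
    return 0.5 * (
        directed_llr(counts_a, counts_b, background, vocab_size)
        + directed_llr(counts_b, counts_a, background, vocab_size)
    )


# Made-up sample collection.
docs = {
    "d1": "graph nodes edges graph layout",
    "d2": "graph edges connect nodes",
    "d3": "stock market prices fall",
}
counts = {doc_id: word_counts(text) for doc_id, text in docs.items()}
background = sum(counts.values(), Counter())  # collection-wide counts

related = llr_relevance(counts["d1"], counts["d2"], background)
unrelated = llr_relevance(counts["d1"], counts["d3"], background)
```

Under these assumptions the pair that shares vocabulary (`d1`, `d2`) scores higher than the pair that shares none (`d1`, `d3`), so a threshold placed between the two scores would connect only the first pair.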
9. An automated system for mapping a relationship of content of a collection of documents, comprising:
a data storage device storing relevance measures, each relevance measure representing a relevance between a pair of documents in a collection of documents, each relevance measure comprising a log likelihood ratio relevance measure, and each relevance measure based upon a comparison of the content of the pair of documents;
a memory device operable to store a software program; and
a processor coupled to the data storage device and the memory device, the processor operable to execute the software program to:
generate a content graph having nodes and edges, each edge connecting two nodes, the content graph having a node associated with each document and having an edge connecting nodes for which the relevance measure between associated documents is greater than a specified threshold;
such that the content graph maps the relationship of the content of the collection of documents.
Dependent claims: 10, 11, 12, 13, 14, 15, 16
17. An automated system for mapping a relationship of content of a collection of documents, comprising:
a collection of documents, each document including content;
a relevance generator connected to access the collection of documents, the relevance generator operable to generate relevance measures, each relevance measure representing a relevance between a pair of documents, each relevance measure comprising a log likelihood ratio relevance measure, and each relevance measure based upon a comparison of the content of the pair of documents;
a graph generator connected to access the relevance measures, the graph generator operable to generate a content graph having nodes and edges where each edge connects two nodes, the content graph having a node associated with each document and having an edge connecting nodes for which the relevance measure between associated documents is greater than a specified threshold; and
a layout generator connected to access the content graph, the layout generator operable to generate a layout based upon the content graph to provide a visual display of the relationship between the documents.
Dependent claims: 18, 19, 20
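Claim 17 requires only that the layout generator produce a layout based upon the content graph; it does not name a layout algorithm in this section. As an illustration, the sketch below uses a simple force-directed scheme in which all nodes repel each other and connected nodes attract. The choice of algorithm, the function name `spring_layout`, and its parameters are assumptions made for the example.

```python
import math
import random


def spring_layout(graph, iterations=200, seed=0):
    """Assign 2-D coordinates to the nodes of an adjacency-map graph
    using a basic force-directed scheme (an assumed layout method)."""
    rng = random.Random(seed)
    nodes = list(graph)
    index = {n: i for i, n in enumerate(nodes)}
    pos = {n: [rng.uniform(-1, 1), rng.uniform(-1, 1)] for n in nodes}
    k = 1.0 / math.sqrt(max(len(nodes), 1))  # preferred edge length

    for step in range(iterations):
        disp = {n: [0.0, 0.0] for n in nodes}
        # Repulsion between every pair of nodes.
        for i, u in enumerate(nodes):
            for v in nodes[i + 1:]:
                dx = pos[u][0] - pos[v][0]
                dy = pos[u][1] - pos[v][1]
                dist = math.hypot(dx, dy) or 1e-9
                force = k * k / dist
                disp[u][0] += dx / dist * force
                disp[u][1] += dy / dist * force
                disp[v][0] -= dx / dist * force
                disp[v][1] -= dy / dist * force
        # Attraction along each undirected edge, visited once.
        for u in nodes:
            for v in graph[u]:
                if index[u] < index[v]:
                    dx = pos[u][0] - pos[v][0]
                    dy = pos[u][1] - pos[v][1]
                    dist = math.hypot(dx, dy) or 1e-9
                    force = dist * dist / k
                    disp[u][0] -= dx / dist * force
                    disp[u][1] -= dy / dist * force
                    disp[v][0] += dx / dist * force
                    disp[v][1] += dy / dist * force
        # Move each node, capping the step by a decreasing temperature.
        temp = 0.1 * (1 - step / iterations)
        for n in nodes:
            dx, dy = disp[n]
            length = math.hypot(dx, dy)
            if length > 0:
                scale = min(length, temp) / length
                pos[n][0] += dx * scale
                pos[n][1] += dy * scale
    return {n: (p[0], p[1]) for n, p in pos.items()}


# Made-up content graph: d1 and d2 are related, d3 is isolated.
content_graph = {"d1": {"d2"}, "d2": {"d1"}, "d3": set()}
layout = spring_layout(content_graph)
```

The returned coordinates could then be passed to any plotting tool to draw the nodes and edges; related documents tend to be drawn near each other because only connected nodes attract.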
Specification