Agent-based method for distributed clustering of textual information
Abstract
A computer method and system for storing, retrieving and displaying information has a multiplexing agent (20) that calculates a new document vector (25) for a new document (21) to be added to the system and transmits the new document vector (25) to master cluster agents (22) and cluster agents (23) for evaluation. These agents (22, 23) perform the evaluation and return values upstream to the multiplexing agent (20) based on the similarity of the document to documents stored under their control. The multiplexing agent (20) then sends the document (21) and the document vector (25) to the master cluster agent (22), which then forwards them to a cluster agent (23) or creates a new cluster agent (23) to manage the document (21). The system also searches for stored documents according to a search query having at least one term, identifies the documents found in the search, and displays the documents in a clustering display (80) of similarity so as to indicate similarity of the documents to each other.
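The flow described in the abstract can be illustrated with a minimal, single-process Python sketch. It is not the patented implementation: the term-frequency vector, the cosine similarity measure, the summed composite vector, the `threshold` parameter, and all class and method names are assumptions introduced here for illustration, and plain method calls stand in for messages between computers.

```python
import math
from collections import Counter


def document_vector(text):
    """Term-frequency vector for a document (an assumed representation)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity of two sparse vectors; 0.0 if either is empty."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


class ClusterAgent:
    """Manages one cluster of documents and its composite vector."""

    def __init__(self):
        self.documents = []
        self.composite = Counter()  # assumption: sum of member vectors

    def evaluate(self, vector):
        return cosine(vector, self.composite)

    def store(self, document, vector):
        self.documents.append(document)
        self.composite.update(vector)


class MasterClusterAgent:
    """Forwards a document to a cluster agent or creates a new one."""

    def __init__(self, threshold=0.3):
        self.threshold = threshold  # hypothetical cut-off for a new cluster
        self.cluster_agents = []

    def evaluate(self, vector):
        # Best similarity among the clusters under this agent's control.
        return max((c.evaluate(vector) for c in self.cluster_agents), default=0.0)

    def store(self, document, vector):
        best = max(self.cluster_agents, key=lambda c: c.evaluate(vector), default=None)
        if best is None or best.evaluate(vector) < self.threshold:
            best = ClusterAgent()
            self.cluster_agents.append(best)
        best.store(document, vector)
        return best


class MultiplexingAgent:
    """Computes the document vector and routes the document."""

    def __init__(self, masters):
        self.masters = masters

    def add(self, document):
        vector = document_vector(document)
        # Only the vector is sent out for evaluation; the document follows
        # once the best-matching master cluster agent is known.
        target = max(self.masters, key=lambda m: m.evaluate(vector))
        return target.store(document, vector)
```

With one master cluster agent and the default threshold, adding "neural network training data" and then "neural network training loss" places both in the same cluster, while "tomato soup recipe" causes a new cluster agent to be created.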
23 Claims
1. A computer method for storing information in a computer system having at least first and second computers for retrieval and display based on similarity of information, the method comprising:
- a first-tier program module operating on a first computer for determining a new document vector to characterize a new document for comparison of a similarity of the new document to other documents stored in the computer system;
- the first-tier program module transmitting the new document vector to a second-tier program module operating on a second computer in the computer system;
- the second-tier program module transmitting a similarity value to said first-tier program module which represents a comparison of the new document vector to at least one composite vector characterizing a similarity of a plurality of documents stored in the computer system; and
- based on said similarity value received from said second-tier program module, said first-tier program module determining whether said new document should be transmitted to said second-tier program module for storage in the computer system.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
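The exchange recited in claim 1 sends the vector first and the document only afterwards, if at all. The sketch below shows that ordering under stated assumptions: a fixed `vocabulary`, dense count vectors, cosine similarity, and a `threshold` decision rule are all hypothetical choices, since the claim does not specify them, and direct method calls stand in for transmission between the first and second computers.

```python
import math


def cosine(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


class SecondTier:
    """Holds a composite vector for its stored documents (hypothetical class)."""

    def __init__(self, composite):
        self.composite = list(composite)
        self.stored = []

    def similarity(self, vector):
        # Returns the similarity value sent back to the first tier.
        return cosine(vector, self.composite)

    def store(self, document):
        self.stored.append(document)


class FirstTier:
    """Computes the document vector and decides whether to send the document."""

    def __init__(self, vocabulary, second_tier, threshold=0.5):
        self.vocabulary = vocabulary
        self.second_tier = second_tier
        self.threshold = threshold  # assumed decision rule

    def vectorize(self, document):
        words = document.lower().split()
        return [words.count(term) for term in self.vocabulary]

    def submit(self, document):
        vector = self.vectorize(document)             # determine the vector
        score = self.second_tier.similarity(vector)   # send vector, receive value
        if score >= self.threshold:                   # decide on transmission
            self.second_tier.store(document)
            return True
        return False
```

Sending only the vector for evaluation keeps the full document off the network until the first tier has decided that the second tier is a suitable place to store it.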
13. A computer system for storing, retrieving and displaying information, the computer system being operable on at least one computer having a software operating system, the computer system comprising:
- a multiplexing program module running on a first computer for receiving a new document and for calculating a new document vector for the new document, and for transmitting said new document vector to at least one next-tier program module; and
- a next-tier program module running on a second computer for receiving said document vector from said multiplexing program module and for comparing said document vector to at least one composite vector representing the documents being stored for access through a third-tier program running on the second computer to determine if said document should be added to the documents being stored for access through the third-tier program module.
Dependent claims: 14, 15, 16, 17, 18, 19, 20, 21, 22, 23
Specification