Computer network information management system and method using intelligent software agents
First Claim
1. A method for managing information on a computer network having a server, at least one client node and at least one information provider node, and a database at the server containing pre-stored information from the at least one information provider node, comprising the steps of:
- gathering change data from the information provider node indicative of event changes at the information provider node relative to the pre-stored information in the database by information collection agents;
extracting information from the information provider node based on the change data;
transmitting the extracted information to the server;
storing the transmitted information in the database;
cataloging the stored information into hierarchical categories;
retrieving with a delivery agent based upon the hierarchical categories selected information from the stored information; and
transmitting the selected information to the client node, wherein the cataloging step is comprising the steps of;
generating a common word list (CWL) consisting of words that occur most often in a sample set of the pre-stored information;
generating a relevant word list (RWL) consisting of all words in the stored information not in the CWL;
calculating a relevance factor (RF) indicative of the relevance of the stored information to a current hierarchical category, based upon the RWL;
comparing the RF to a relevance threshold (RT); and
if the RF exceeds the RT, adding the extracted information to current category.
1 Assignment
0 Petitions
Accused Products
Abstract
The present invention provides a system for managing information on a computer network having a server by gathering summary data from the information provider node indicative of event changes at the information provider node by information collection agents extracting information from the information provider node based on the summary data; transmitting the extracted information to the server; storing the transmitted information in an event database; cataloging the stored information into hierarchical categories; retrieving with a delivery agent based upon the hierarchical categories selected information from the stored information; and transmitting the selected information to the client node.
-
Citations
15 Claims
-
1. A method for managing information on a computer network having a server, at least one client node and at least one information provider node, and a database at the server containing pre-stored information from the at least one information provider node, comprising the steps of:
-
gathering change data from the information provider node indicative of event changes at the information provider node relative to the pre-stored information in the database by information collection agents;
extracting information from the information provider node based on the change data;
transmitting the extracted information to the server;
storing the transmitted information in the database;
cataloging the stored information into hierarchical categories;
retrieving with a delivery agent based upon the hierarchical categories selected information from the stored information; and
transmitting the selected information to the client node, wherein the cataloging step is comprising the steps of;
generating a common word list (CWL) consisting of words that occur most often in a sample set of the pre-stored information;
generating a relevant word list (RWL) consisting of all words in the stored information not in the CWL;
calculating a relevance factor (RF) indicative of the relevance of the stored information to a current hierarchical category, based upon the RWL;
comparing the RF to a relevance threshold (RT); and
if the RF exceeds the RT, adding the extracted information to current category. - View Dependent Claims (2, 3, 4, 5)
generating a training word list (TWL) for the current category based upon the RWL for the pre-stored information already in the current category; and
calculating the relevance factor (RF) based upon the RWL and the TWL.
-
-
3. The method recited in claim 2, wherein the RF is calculated according to the following algorithm:
-
4. The method recited in claim 1, further comprising the steps of:
if the stored information is added to the current category, adding the words from the RWL to the TWL for the current category.
-
5. The method recited in claim 1, wherein if the stored information is added to the current category, further comprising the steps of:
-
creating a most frequent word list (MFWL) consisting of the words in the RWL ranked by order of occurrence;
identifying sentences in the stored information containing one or more of the highest ranked words in the MFWL; and
creating a summary of the stored information based upon the identified sentences.
-
-
6. A system for managing information in a computer network comprising:
-
an interconnection network;
a plurality of client nodes coupled to the interconnection network;
a plurality of information provider nodes coupled to the interconnection network;
a system server coupled to the interconnection network;
a database coupled to the system server containing pre-stored information from the information provider nodes;
means for autonomously gathering change data from the information provider node indicative of event changes at the information provider node relative to the pre-stored information the database;
extracting information from the information provider node based on the change data;
means for autonomously extracting information from the information provider nodes based upon the change data;
means for autonomously coordinating the extracting of information by the extracting means and for autonomously transmitting the extracted information to the system server via the interconnection network;
means located at the system server for cataloging the extracted information transmitted from the coordinating means, wherein means for cataloging is comprising;
means for generating a common word list (CWL) consisting of words that occur most often in a sample set of the pre-stored information;
means for generating a relevant word list (RWL) consisting of all words in the extracted information not in the CWL;
means for calculating a relevance factor (RF) indicative of the relevance of the extracted information to a current hierarchical category, based upon the RWL;
means for comparing the RF to a relevance threshold (RT); and
if the RF exceeds the RT, adding the extracted information to the current category. - View Dependent Claims (7, 8, 9, 10)
means for generating a training word list (TWL) for the current category based upon the RWL for the pre-stored information already in the current category; and
means for calculating the relevance factor (RF) based upon the RWL and the TWL.
-
-
8. The system recited in claim 7, wherein the RF is calculated according to the following algorithm:
-
9. The system recited in claim 6, further comprising:
means for adding the words from the RWL to the TWL for the current category, if the extracted information is added to the current category.
-
10. The system recited in claim 6, wherein if the extracted information is added to the current category, further comprising the steps of:
-
means for creating a most frequent word list (MFWL) consisting of the words in the RWL ranked by order of occurrence;
means for identifying sentences in the extracted information containing one or more of the highest ranked words in the MFWL; and
means for creating a summary of the extracted information based upon the identified sentences.
-
-
11. A system for managing information in a computer network gathered from a plurality of information provider nodes for transmission to a plurality of client nodes via an interconnection network comprising:
-
a system server coupled to the interconnection network;
a database coupled to the system server containing pre-stored information from the information provider nodes;
means for autonomously gathering change data from the information provider node indicative of event changes at the information provider node relative to the pre-stored information in the database;
extracting information from the information provider node based on the change data;
means for autonomously extracting information from the information provider nodes based upon the change data;
means for autonomously coordinating the extracting of information by the extracting means and for autonomously transmitting the extracted information to the system server via the interconnection network;
means located at the system server for cataloging the extracted information transmitted from the coordinating means, wherein means for cataloging is comprising;
means for generating a common word list (CWL) consisting of words that occur most often in a sample set of the pre-stored information, means for generating a relevant word list (RWL) consisting of all words in the extracted information not in the CWL;
means for calculating a relevance factor (RF) indicative of the relevance of the extracted information to a current hierarchical category, based upon the RWL;
means for comparing the RF;
to a relevance threshold (RT); and
if the RF exceeds the RT, adding the extracted information to the current category. - View Dependent Claims (12, 13, 14, 15)
means for generating a training word list (TWL) for the current category based upon the RWL for the pre-stored information already in the current category; and
means for calculating the relevance factor (RF) based upon the RWL and the TWL.
-
-
13. The system recited in claim 12, wherein the RF is calculated according to the following algorithm:
-
14. The system recited in claim 11, further comprising:
means for adding the words from the RWL to the TWL for the current category, if the extracted information is added to the current category.
-
15. The system recited in claim 11, wherein if the extracted information is added to the current category, further comprising the steps of:
-
means for creating a most frequent word list (MFWL) consisting of the words in the RWL ranked by order of occurrence;
means for identifying sentences in the extracted information containing one or more of the highest ranked words in the MFWL; and
means for creating a summary of the extracted information based upon the identified sentences.
-
Specification