Method for gathering and summarizing internet information
Abstract
A computer method of gathering and summarizing information available through the Internet comprises collecting information from a plurality of Internet sites (14, 51) according to respective maps (52) of the Internet sites (14), converting the collected information from HTML-language web pages to XML-language documents (26, 53) and storing the XML-language documents in a storage medium, searching for documents (55) according to a search query (13) having at least one term and identifying the documents (26) found in the search, and displaying the documents as nodes (33) of a tree structure (32) having links (34) and nodes (33) so as to indicate similarity of the documents to each other.
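The conversion and search steps described in the abstract can be illustrated with a minimal sketch. It uses only the Python standard library, and everything specific in it is an assumption made for illustration rather than something the patent prescribes: the XML schema (`document`, `title`, `body`), the helper names `html_to_xml` and `search`, and the rule that a document matches when it contains every query term as a substring.

```python
from html.parser import HTMLParser
import xml.etree.ElementTree as ET


class _TextExtractor(HTMLParser):
    """Collects the <title> text and the remaining visible text of an HTML page."""

    def __init__(self):
        super().__init__()
        self.title_parts = []
        self.text_parts = []
        self._in_title = False
        self._skip_depth = 0  # greater than zero while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False
        elif tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth:
            return  # ignore script and stylesheet content
        text = data.strip()
        if not text:
            return
        if self._in_title:
            self.title_parts.append(text)
        else:
            self.text_parts.append(text)


def html_to_xml(url, html):
    """Convert one HTML page to a small XML document (hypothetical schema)."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    doc = ET.Element("document", {"source": url})
    ET.SubElement(doc, "title").text = " ".join(parser.title_parts)
    ET.SubElement(doc, "body").text = " ".join(parser.text_parts)
    return doc


def search(docs, query):
    """Return the documents whose title or body contains every query term."""
    terms = query.lower().split()
    hits = []
    for doc in docs:
        haystack = " ".join(
            (doc.findtext("title") or "", doc.findtext("body") or "")
        ).lower()
        if all(term in haystack for term in terms):
            hits.append(doc)
    return hits
```

A converted document can be serialized for storage with `ET.tostring(doc, encoding="unicode")`; how and where the XML-language documents are actually stored is left open here.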
18 Claims
1. A computer method of gathering and summarizing information available through a network, the method comprising:
collecting information from a plurality of network sites according to respective maps of the network sites;
converting the collected information from HTML-language web pages to XML-language documents and storing the XML-language documents in a storage medium;
searching for documents according to a search query having at least one term and identifying the documents found in the search; and
displaying the documents so as to indicate similarity of the documents to each other.
13. A computer system for gathering and summarizing information available through a network, the computer system being operable on at least one computer having a software operating system, the computer system comprising:
an agent hosting program for running under said software operating system;
a plurality of agent programs operating with said agent hosting program, said plurality of agent programs including programs for collecting documents from respective network sites;
wherein said agent program operates according to a stored search ontology providing a map of each respective network site and a time interval between search updates for the network site.
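The arrangement recited in claim 13, in which collection agents run under a hosting program and follow a stored search ontology that gives each site's map and update interval, might be sketched as follows. This is an illustrative assumption only: the names `SiteOntology`, `AgentHost`, `register` and `run_due`, the representation of a site map as a list of paths, and the injected `fetch` callable are all hypothetical and do not come from the claims.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class SiteOntology:
    """Hypothetical search-ontology entry for one network site."""
    site: str                          # base address of the site
    site_map: List[str]                # paths to visit on that site
    update_interval: float             # seconds between search updates
    last_run: Optional[float] = None   # time of the previous collection


class AgentHost:
    """Minimal stand-in for an agent hosting program."""

    def __init__(self, fetch: Callable[[str], str]):
        self.fetch = fetch             # assumed callable that retrieves one page
        self.entries: List[SiteOntology] = []

    def register(self, entry: SiteOntology) -> None:
        self.entries.append(entry)

    def run_due(self, now: float) -> Dict[str, List[str]]:
        """Collect from every site whose update interval has elapsed."""
        collected: Dict[str, List[str]] = {}
        for entry in self.entries:
            due = (
                entry.last_run is None
                or now - entry.last_run >= entry.update_interval
            )
            if not due:
                continue
            # Follow the site map: fetch each listed path in order.
            collected[entry.site] = [
                self.fetch(entry.site + path) for path in entry.site_map
            ]
            entry.last_run = now
        return collected
```

Passing the clock value in as `now` and the page fetcher in as `fetch` keeps the sketch free of network access, so the scheduling logic can be exercised on its own.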
17. A computer system for gathering and summarizing information available through a network, the computer system being operable on at least one computer having a software operating system, the computer system comprising:
an agent hosting program for running under said software operating system;
a plurality of agent programs operating with said agent hosting program, said plurality of agent programs including programs for collecting documents from respective network sites;
wherein said plurality of agent programs operate according to a stored search ontology providing a map of each respective network site and a time interval between search updates for the network site; and
further comprising an agent for applying a similarity algorithm to documents found in the search of the network sites; and
a user interface agent for providing a display of the results of the search and the results of applying the similarity algorithm; and
an agent program for interfacing said user interface agent, said clustering agent and said plurality of agent programs for collecting documents from respective network sites.
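Claim 17 refers to a similarity algorithm without naming one here. As one possible illustration, the sketch below uses cosine similarity over raw term counts and links each document to its most similar neighbour, which is the kind of link a tree display could draw. The choice of cosine similarity, the whitespace tokenization, and the function names `term_vector`, `cosine_similarity` and `most_similar_links` are assumptions, not the patented algorithm.

```python
import math
from collections import Counter


def term_vector(text):
    """Count whitespace-separated, lower-cased terms in a document."""
    return Counter(text.lower().split())


def cosine_similarity(a, b):
    """Cosine of the angle between two term-count vectors (0.0 if either is empty)."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def most_similar_links(texts):
    """Map each document index to (index of its most similar other document, score)."""
    vectors = [term_vector(t) for t in texts]
    links = {}
    for i, vi in enumerate(vectors):
        best, best_score = None, -1.0
        for j, vj in enumerate(vectors):
            if i == j:
                continue
            score = cosine_similarity(vi, vj)
            if score > best_score:
                best, best_score = j, score
        links[i] = (best, best_score)
    return links
```

Each entry of the returned mapping can be read as one candidate link between two document nodes, with the score indicating how strongly the pair is related.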
Specification