Information collection system and method
First Claim
1. An information collection system comprising:
- means for acquiring a plurality of data files via a network;
means for analyzing the plurality of data files acquired using a prescribed extraction rule and an ontology of relational description of terms; and
means for extracting necessary information from said plurality of data files based on results from said means for analyzing.
1 Assignment
0 Petitions
Accused Products
Abstract
A user registers interests in order to obtain data relative to those interests. A plurality of web sites are searched based on the user'"'"'s interests. An extraction rule mechanism is applied to HTML documents acquired. A vocabulary information processing mechanism reads an ontology based on information received to obtain vocabulary information. An inference processing mechanism executes an inference operation based on axiom rules. An extracting position information identifying section extracts data objects relying on HTML document tags with respect to the acquired HTML documents, based on the extraction rules, the vocabulary information processing mechanism, and the inference processing mechanism.
58 Citations
22 Claims
-
1. An information collection system comprising:
-
means for acquiring a plurality of data files via a network;
means for analyzing the plurality of data files acquired using a prescribed extraction rule and an ontology of relational description of terms; and
means for extracting necessary information from said plurality of data files based on results from said means for analyzing. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. An application server comprising:
-
a user request receiving section for receiving information about a user'"'"'s interest;
an HTML acquiring section for acquiring HTML documents from a plurality of sites based on said information received from said user request receiving section;
a vocabulary information processing mechanism for reading an ontology based on said information received from said user request receiving section to acquire vocabulary information; and
an extracting position information identifying section for obtaining extraction data objects with respect to said HTML documents acquired from said HTML acquiring section, relying on tags of said HTML documents and based on said vocabulary information offered from said vocabulary information processing mechanism. - View Dependent Claims (8, 9, 10)
-
-
11. In a computer connected to a network, an information collection method comprising the steps of:
-
acquiring a plurality of data files via the network;
analyzing said plurality of acquired data files using a prescribed extraction rule and an ontology relational description of terms;
extracting useful information from said plurality of analyzed data files; and
reconstructing said extracted useful information in a manner useful to a user. - View Dependent Claims (12, 13)
-
-
14. In a computer connected to the Internet, an information collection method comprising the steps of:
-
receiving information about a user'"'"'s interest;
acquiring a plurality of documents via the Internet based on said user'"'"'s interest;
selecting a specific ontology based on said user'"'"'s interest from a plurality of stored ontologies; and
analyzing contents transversely with respect to said plurality of acquired documents using said selected specific ontology to extract useful information. - View Dependent Claims (15)
-
-
16. In a computer connected to a network, an information collection method comprising:
-
acquiring a plurality of Web pages including information expressed by different vocabularies with respect to associated contents, respectively;
extracting information from said plurality of acquired Web pages based on table tags or list tags;
analyzing said extracted information transversely with respect to the different vocabularies of said plurality of Web pages based on an ontology representing relationships between vocabularies;
summing up the analyzed information; and
transmitting a summing result to a user terminal. - View Dependent Claims (17)
-
-
18. A program product for causing a computer to have:
-
a function of acquiring a plurality of data files via a network;
a function of analyzing said plurality of acquired data files using a prescribed extraction rule and an ontology being relational description of terms;
a function of extracting useful information from said plurality of analyzed data files; and
a function of reconstructing said extracted useful information in a manner useful to a user. - View Dependent Claims (19, 20)
-
-
21. A program product for causing a computer to have:
-
a function of acquiring a plurality of documents via the Internet based on information about a user'"'"'s interest;
a function of selecting a specific ontology based on said user'"'"'s interest from a plurality of stored ontologies; and
a function of analyzing contents transversely with respect to said plurality of acquired documents using said selected specific ontology.
-
-
22. A program product for causing a computer to have:
-
a function of acquiring a plurality of Web pages including information expressed by different vocabularies with respect to associated contents, respectively;
a function of extracting information from said plurality of acquired Web pages based on table tags or list tags;
a function of analyzing said extracted information transversely with respect to the different vocabularies of said plurality of Web pages based on an ontology representing relationships between vocabularies; and
a function of summing up the analyzed information.
-
Specification