Method of and apparatus for gathering information, system for gathering information, and computer program
First Claim
1. An information-gathering apparatus comprising:
- a storing unit that acquires information on a first address of a first web-page from a server having a web archive that stores the first address and the first web-page corresponding to the first address, and stores the first address in a gathered-address table;
an access-log acquiring unit that accesses the first web-page based on the first address stored in the gathered-address table, accesses a second web-page linked to the first web-page, and acquires a second address of the second web-page as an access log;
a determining unit that determines whether the second address is stored in the gathered-address table; and
an information gathering unit that gathers the second web-page via the network based on the second address when the second address is not stored in the gathered-address table.
1 Assignment
0 Petitions
Accused Products
Abstract
An information-gathering system includes a web archive that stores a first web-page with a first address and first generation information corresponding to the first web-page, a determining unit that determines whether a linked web-page specified in a second web-page that is being referred is stored in the web archive based on a second address of the linked web-page and second generation information corresponding to the second web-page, and an information gathering unit that gathers the linked web-page via the network based on the second address when the determining unit determines that the linked web-page is not stored in the web archive.
35 Citations
9 Claims
-
1. An information-gathering apparatus comprising:
-
a storing unit that acquires information on a first address of a first web-page from a server having a web archive that stores the first address and the first web-page corresponding to the first address, and stores the first address in a gathered-address table;
an access-log acquiring unit that accesses the first web-page based on the first address stored in the gathered-address table, accesses a second web-page linked to the first web-page, and acquires a second address of the second web-page as an access log;
a determining unit that determines whether the second address is stored in the gathered-address table; and
an information gathering unit that gathers the second web-page via the network based on the second address when the second address is not stored in the gathered-address table.
-
-
2. An information-gathering system comprising:
-
a web archive that stores a first web-page with a first address and first generation information corresponding to the first web-page;
a determining unit that determines whether a linked web-page specified in a second web-page that is being referred is stored in the web archive based on a second address of the linked web-page and second generation information corresponding to the second web-page; and
an information gathering unit that gathers the linked web-page via the network based on the second address when the determining unit determines that the linked web-page is not stored in the web archive.
-
-
3. An information-gathering method comprising:
-
gathering a first web-page on a network;
storing the first web-page and a first address corresponding to the first web-page in a web archive;
acquiring information on a second address of a linked web-page specified in a second web-page that is being referred from a terminal;
determining whether the linked web-page is stored in the web archive; and
gathering the linked web-page via the network based on the second address when the linked web-page is not stored in the web archive.
-
-
4. An information-gathering method comprising:
-
acquiring information on a first address of a first web-page from a server having a web archive that stores the first address and the first web-page corresponding to the first address;
storing the first address in a gathered-address table;
accessing the first web-page based on the first address stored in the gathered-address table;
accessing a second web-page linked to the first web-page;
acquiring a second address of the second web-page as an access log;
determining whether the second address is stored in the gathered-address table; and
gathering the second web-page via the network based on the second address when the second address is not stored in the gathered-address table.
-
-
5. An information-gathering program making a computer execute steps comprising:
-
gathering a first web-page on a network;
storing the first web-page and a first address corresponding to the first web-page in a web archive;
acquiring information on a second address of a linked web-page specified in a second web-page that is being referred from a terminal;
determining whether the linked web-page is stored in the web archive; and
gathering the linked web-page via the network based on the second address when the linked web-page is not stored in the web archive. - View Dependent Claims (6)
-
-
7. An information-gathering program making a computer execute steps comprising:
-
acquiring information on a first address of a first web-page from a server having a web archive that stores the first address and the first web-page corresponding to the first address;
storing the first address in a gathered-address table;
accessing the first web-page based on the first address stored in the gathered-address table;
accessing a second web-page linked to the first web-page;
acquiring a second address of the second web-page as an access log;
determining whether the second address is stored in the gathered-address table; and
gathering the second web-page via the network based on the second address when the second address is not stored in the gathered-address table. - View Dependent Claims (8, 9)
-
Specification