Managing URLs
First Claim
Patent Images
1. A method for crawling and indexing items on an intranet using a search system, the method comprising:
- maintaining a table including item locators and an importance rank for each of the item locators;
crawling items on an intranet using a search system;
adding entries in an index for each of the crawled items;
during crawling by the search system, discovering new item locators and adding the new item locators to the table;
crawling new items on the intranet associated with the new item locators using the search system; and
adding new entries in the index for the new items until a configurable number of entries in the index is reached.
2 Assignments
0 Petitions
Accused Products
Abstract
Crawling pages is disclosed. Pages are crawled up to a target number of pages. Additional pages, that have an importance that is equal to or greater than an importance threshold, are crawled beyond the target number of pages. In some embodiments, pages having an importance less than an importance threshold are deleted.
46 Citations
27 Claims
-
1. A method for crawling and indexing items on an intranet using a search system, the method comprising:
-
maintaining a table including item locators and an importance rank for each of the item locators; crawling items on an intranet using a search system; adding entries in an index for each of the crawled items; during crawling by the search system, discovering new item locators and adding the new item locators to the table; crawling new items on the intranet associated with the new item locators using the search system; and adding new entries in the index for the new items until a configurable number of entries in the index is reached. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A system for crawling and indexing items on an intranet comprising:
one or more processors of a search system, the one or more processors being configured to; maintain a table including item locators and an importance rank for each of the item locators; crawl items on an intranet; add entries in an index for each of the crawled items; during crawling, discover new item locators and add the new item locators to the table; crawl new items on the intranet associated with the new item locators; and add new entries in the index for the new items until a configurable number of entries in the index is reached. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
19. A non-transitory computer-readable storage device comprising instructions for crawling and indexing items on an intranet that, when executed, cause one or more processors of a search system to perform the actions of:
-
maintaining a table including item locators and an importance rank for each of the item locators; crawling items on an intranet; adding entries in an index for each of the crawled items; during crawling, discovering new item locators and adding the new item locators to the table; crawling new items on the intranet associated with the new item locators; and adding new entries in the index for the new items until a configurable number of entries in the index is reached. - View Dependent Claims (20, 21, 22, 23, 24, 25, 26, 27)
-
Specification