Apparatus and method to support management of uniform resource locators and/or contents of database servers
First Claim
Patent Images
1. A method of finding a Uniform Resource Locator (URL) that points to a most updated authoritative source of information contained in database systems, the method comprising:
- crawling websites to determine likely publicly available records;
processing the likely publicly available records to determine a unique list of URLs each of which point to information content of crawled web sites that are likely to be the most updated authoritative source of the information content and wherein processing the likely publicly available records comprises applying an algorithm to information content of each crawled website to determine a likelihood of each crawled website as being the most authoritative source for providing specific information content.
0 Assignments
0 Petitions
Accused Products
Abstract
Methods and Systems for finding a Uniform Resource Locator (URL) that points to a most updated authoritative source of information contained in database systems, including crawling websites to determine likely publicly available records and processing the likely publicly available records to determine a unique list of URLs each of which point to information content of crawled web sites that are likely to be the most updated authoritative source of the information content.
-
Citations
28 Claims
-
1. A method of finding a Uniform Resource Locator (URL) that points to a most updated authoritative source of information contained in database systems, the method comprising:
-
crawling websites to determine likely publicly available records;
processing the likely publicly available records to determine a unique list of URLs each of which point to information content of crawled web sites that are likely to be the most updated authoritative source of the information content and wherein processing the likely publicly available records comprises applying an algorithm to information content of each crawled website to determine a likelihood of each crawled website as being the most authoritative source for providing specific information content. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 12)
submitting at least an updated portion of the unique list of URLs to an Internet search engine.
-
-
3. The method according to claim 2, wherein the Internet search engine provides at least one URL from the submitted unique lists of URLs for a user after a search query.
-
4. The method according to claim 1, further comprising:
registering web entities as authoritative sources for specific information content.
-
5. The method according to claim 4, further comprising:
sending specific information content of a registered web entity to an Internet search engine.
-
6. The method according to claim 5, wherein the specific information content of the registered web entity is provided to a user after a search query.
-
7. The method according to claim 1, wherein processing the likely publicly available records includes performing a sanity check to detect information content which may be at least one of politically incorrect content and offensive content to a user.
-
8. The method according to claim 1, wherein the information contained within the database systems relate to Notes Storage Facility (.NSF) files containing records.
-
9. The method according to claim 8, wherein the information contained in the Notes Storage Facility records are identified by a ReplicaID and UNiversal ID.
-
12. The system according to claim 4 further comprising a memory storing a module configured to parse content of each location and determine a likelihood of the records containing private information.
-
10. A system for obtaining a URL representing an authoritative source of information comprising:
-
a web crawler configured to search websites to compile a list content and location of records available for public viewing; and
a processor configured to apply an algorithm to information content of each searched website to determine a likelihood of each searched website as being a most authoritative source for providing specific information content, the processor further configured to reduce the list of content and location into a unique list of URLs, each of which point to specific information content provided by the most likely authoritative source. - View Dependent Claims (11, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
-
-
23. An information directory system accessible worldwide via the Internet, comprising:
-
at least one website having a virtual container containing a list of virtualized URLs that are retrievable by an Internet entity, the virtualized URLs previously processed to identify a most likely authoritative source of content specific information stored in data base systems, a memory, accessible by the virtual container of the website, wherein the memory stores a virtualized replica of content specific information of at least one website of a register authoritative user, and a first module stored in the memory, the module configured to call an algorithm to determine a likelihood of whether a query to the commercial website, looking for specific information content, is likely to be one of a machine or crawler accessing the commercial website, or a human accessing the commercial website. - View Dependent Claims (24, 25, 26, 27)
-
-
28. A method for virtualizing URLs identifying information sources in an Internet comprising:
-
separating a URL into a combination of DNS and HTTP protocols;
rearranging the DNS and HTTP protocols of the URL; and
rewriting the URL as a unique identifier that allows seamless switching between the DNS and HTTP protocols.
-
Specification