System and method of analyzing web content
First Claim
1. A computer-implemented method of categorizing a uniform resource locator (URL) based on web content associated with the URL using at least one electronic hardware processor configured to implement a database management module, the method comprising:
- associating a first categorization priority with a first URL collection method of a plurality of URL collection methods, and associating a second categorization priority with a second URL collection method of the plurality of URL collection methods, the second URL connection method being a different type of collection method than the first URL collection method, wherein each of the plurality of collection methods is performed using at least one electronic hardware processor, and each collection method comprises one of a web crawler, a Domain Name Server (DNS) database, and a honey client;
assigning a particular categorization priority to each of a plurality of URLs, the categorization priority assigned to each particular URL based on whether the first or the second URL collection method identifies the particular URL of the plurality of URLs, wherein the first categorization priority is assigned to the particular URL in response to the first URL collection method identifying the particular URL and the second categorization priority is assigned to the particular URL in response to the second URL collection method identifying the particular URL;
categorizing the plurality of URL'"'"'s in an order that is based on a difference between the first and second categorization priorities as assigned to each of the plurality of URLs;
storing the categorization of the plurality of URL'"'"'s in a categorization database, a second database derived from the categorization database configured for queries to categorize at least the plurality of URLs when requested by workstations, the workstations being separate from the one or more hardware processors configured to implement the database management module.
9 Assignments
0 Petitions
Accused Products
Abstract
Computer-implemented methods and systems for categorizing a uniform resource locator (URL) based on web content associated with the URL are disclosed. In one aspect, a method includes identifying a first URL using a first URL collection method, assigning, using an electronic processor, a first categorization priority to the first URL based on the first URL being identified using the first URL collection method, categorizing, the first URL based on the first categorization priority, identifying a second URL using a second URL collection method, assigning, using an electronic processor, a second categorization priority different than the first categorization priority based on the second URL having been identified using the second URL collection method; and categorizing, using an electronic processor, the second URL based on the second categorization priority.
-
Citations
20 Claims
-
1. A computer-implemented method of categorizing a uniform resource locator (URL) based on web content associated with the URL using at least one electronic hardware processor configured to implement a database management module, the method comprising:
-
associating a first categorization priority with a first URL collection method of a plurality of URL collection methods, and associating a second categorization priority with a second URL collection method of the plurality of URL collection methods, the second URL connection method being a different type of collection method than the first URL collection method, wherein each of the plurality of collection methods is performed using at least one electronic hardware processor, and each collection method comprises one of a web crawler, a Domain Name Server (DNS) database, and a honey client; assigning a particular categorization priority to each of a plurality of URLs, the categorization priority assigned to each particular URL based on whether the first or the second URL collection method identifies the particular URL of the plurality of URLs, wherein the first categorization priority is assigned to the particular URL in response to the first URL collection method identifying the particular URL and the second categorization priority is assigned to the particular URL in response to the second URL collection method identifying the particular URL; categorizing the plurality of URL'"'"'s in an order that is based on a difference between the first and second categorization priorities as assigned to each of the plurality of URLs; storing the categorization of the plurality of URL'"'"'s in a categorization database, a second database derived from the categorization database configured for queries to categorize at least the plurality of URLs when requested by workstations, the workstations being separate from the one or more hardware processors configured to implement the database management module. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A computer system for categorizing a uniform resource locator (URL), the system comprising one or more hardware processors configured to implement a database management module to:
-
associate a first categorization priority with a first URL collection method of a plurality of URL collection methods, and associate a second categorization priority with a second URL collection method of the plurality of URL collection methods, the second URL connection method being a different type of collection method than the first URL collection method, wherein each of the plurality of collection methods are performed using at least one electronic hardware processor, and each collection method comprises one of a web crawler, a Domain Name Server (DNS) database, and a honey client; assign a particular categorization priority to each of a plurality of URLs, the categorization priority assigned to each particular URL based on whether the first or the second URL collection method identifies the particular URL of the plurality of URLs, wherein the first categorization priority is assigned to the particular URL in response to the first URL collection method identifying the particular URL and the second categorization priority is assigned to the particular URL in response to the second URL collection method identifying the particular URL; categorize the plurality of URL'"'"'s in an order that is based on a difference between the first and second categorization priorities as assigned to each of the plurality of URLs; store the categorization of the plurality of URL'"'"'s in a categorization database, a second database derived from the categorization database configured for queries to categorize URLs requested by workstations, the workstations separate from the one or more hardware processors configured to implement the database management module. - View Dependent Claims (12, 13, 14)
-
-
15. A computer-implemented system using at least one electronic hardware processor configured to implement a database management module for identifying uniform resource locators (URLs) associated with malicious content, the system comprising:
-
an electronic hardware processor; and an electronic hardware memory for storing computer executable instructions that, when executed by the electronic hardware processor, cause the electronic hardware processor to associate a first categorization priority with a first URL collection method of a plurality of URL collection methods, and associate a second categorization priority with a second URL collection method of the plurality of URL collection methods, the second URL connection method being different than the first URL collection method, wherein each of the plurality of collection methods is performed using at least one electronic hardware processor, and each collection method comprises one of a web crawler, a Domain Name Server (DNS) database, and a honey client; assign a particular categorization priority to each of a plurality of URLs, the categorization priority assigned to each particular URL based on whether the first or the second URL collection method identifies the particular URL, wherein the first categorization priority is assigned to the particular URL in response to the first URL collection method identifying the particular URL and the second categorization priority is assigned to the particular URL in response to the second URL collection method identifying the particular URL; categorize the plurality of URL'"'"'s in an order that is based on a difference between the first and second categorization priorities; store the categorization of the plurality of URL'"'"'s in a categorization database, the categorization database configured for queries to categorize URLs requested by workstations, the workstations being separate from the one or more hardware processors configured to implement the database management module. - View Dependent Claims (16)
-
-
17. A non-transitory computer readable storage medium comprising instructions that when executed cause one or more hardware processors configured to implement a database management module to perform a method of categorizing a uniform resource locator (URL) based on web content associated with the URL, the method comprising:
-
associating a first categorization priority with a first URL collection method of a plurality of URL collection methods, and associating a second categorization priority with second URL collection method of the plurality of URL collection methods, the second URL connection method being different than the first URL collection method, wherein each of the plurality of collection methods is performed using at least one electronic hardware processor, and each collection method comprises one of a web crawler, a Domain Name Server (DNS) database, and a honey client; assigning a particular categorization priority to each of a plurality of URLs, the categorization priority assigned to each particular URL based on whether the first or the second URL collection method identifies the particular URL of the plurality of URLs, wherein the first categorization priority is assigned to the particular URL in response to the first URL collection method identifying the particular URL and the second categorization priority is assigned to the particular URL in response to the second URL collection method identifying the particular URL; categorizing the plurality of URL'"'"'s in an order that is based on a difference between the first and second categorization priorities; storing the categorization of the plurality of URL'"'"'s in a categorization database, the categorization database configured for queries to categorize URLs requested by workstations, the workstations separate from the one or more hardware processors configured to implement the database management module. - View Dependent Claims (18, 19, 20)
-
Specification