Probabilistic Association Based Method and System for Determining Topical Relatedness of Domain Names
First Claim
1. A method for calculating relatedness scores, which are indicative of relatedness of pairs of domain names requested by clients, the method comprising:
- receiving domain name system (DNS) traffic data, wherein the DNS traffic data includes at least domain names requested by clients and identities of the clients requesting the domain names;
generating sequences of the domain names based on the received DNS traffic data;
collecting co-occurrence counts for queried pairs of domain names;
applying a probabilistic association estimate to the collected counts to determine the relatedness scores of the queried pairs of domain names; and
storing the determined relatedness scores.
1 Assignment
0 Petitions
Accused Products
Abstract
Systems, computer software and methods for calculating relatedness scores which are indicative of relatedness of pairs of domain names requested by clients are described. The method includes receiving DNS traffic data, wherein the DNS traffic data includes at least domain names requested by clients and identities of the clients requesting the domain names, generating sequences of the domain names based on the received DNS traffic data, collecting co-occurrence counts for queried pairs of domain names, applying a probabilistic association estimate to the collected counts to determine the relatedness scores of the queried pairs of domain names, and storing the determined relatedness scores.
-
Citations
19 Claims
-
1. A method for calculating relatedness scores, which are indicative of relatedness of pairs of domain names requested by clients, the method comprising:
-
receiving domain name system (DNS) traffic data, wherein the DNS traffic data includes at least domain names requested by clients and identities of the clients requesting the domain names; generating sequences of the domain names based on the received DNS traffic data; collecting co-occurrence counts for queried pairs of domain names; applying a probabilistic association estimate to the collected counts to determine the relatedness scores of the queried pairs of domain names; and storing the determined relatedness scores. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A server for calculating relatedness scores, which are indicative of a relatedness of pairs of domain names requested by clients, the server comprising:
-
an input/output interface configured to receive domain name system (DNS) traffic data, wherein the DNS traffic data includes at least domain names requested by clients and identities of the clients requesting the domain names; a processor connected to the input/output interface and configured to, generate sequences of the domain names based on the received DNS traffic data, collect co-occurrence counts for queried pairs of domain names, and apply a probabilistic association estimate to the collected counts to determine the relatedness scores of the queried pairs of domain names; and a memory connected to the processor and configured to store the determined relatedness scores. - View Dependent Claims (10, 11, 12, 13)
-
-
14. A computer readable medium storing computer executable instructions, wherein the instructions, when executed, implement a method for calculating relatedness scores, which are indicative of a relatedness of pairs of domain names requested by clients, the method comprising:
-
providing a system comprising distinct software modules, wherein the distinct software modules comprise a domain name system (DNS) traffic module, a sequence module, a co-occurrence module, and a probabilistic association estimate module; receiving at the DNS traffic module DNS traffic data, wherein the DNS traffic data includes at least domain names requested by clients and identities of the clients requesting the domain names; generating by the sequence module sequences of the domain names based on the received DNS traffic data; collecting co-occurrence counts for queried pairs of domain names in the co-occurrence module; applying, in the probabilistic association estimate module, a probabilistic association estimate to the collected counts to determine the relatedness scores of the queried pairs of domain names; and storing the determined relatedness scores. - View Dependent Claims (15, 16, 17, 18, 19)
-
Specification