Distributional Similarity Based Method and System for Determining Topical Relatedness of Domain Names
First Claim
1. A method for calculating relatedness scores of domain names, which are indicative of relatedness of pairs of domain names requested by clients, the method comprising:
- receiving domain name system (DNS) traffic data, wherein the DNS traffic data includes at least domain names requested by the clients and identities of the clients requesting the domain names;
generating, based on the identities of the clients, vectors including the requested domain names, wherein entries in the vectors correspond to client sessions in which the client has requested the domain names;
reducing a dimensionality of the vectors by applying a dimensionality reduction method for generating reduced vectors;
applying a similarity metric to the reduced vectors to calculate the relatedness scores; and
storing the relatedness scores of the domain names.
1 Assignment
0 Petitions
Accused Products
Abstract
Systems, computer software and methods for calculating relatedness scores of domain names, which are indicative of relatedness of pairs of domain names requested by clients are described. The method includes receiving DNS traffic data, where the DNS traffic data includes at least domain names requested by the clients and identities of the clients requesting the domain names; generating, based on the identities of the clients, vectors including the requested domain names, where entries in the vectors correspond to client sessions in which the client has requested the domain names; reducing a dimensionality of the vectors by applying a dimensionality reduction method for generating reduced vectors; applying a similarity metric to the reduced vectors to calculate the relatedness scores; and storing the relatedness scores of the domain names.
48 Citations
20 Claims
-
1. A method for calculating relatedness scores of domain names, which are indicative of relatedness of pairs of domain names requested by clients, the method comprising:
-
receiving domain name system (DNS) traffic data, wherein the DNS traffic data includes at least domain names requested by the clients and identities of the clients requesting the domain names; generating, based on the identities of the clients, vectors including the requested domain names, wherein entries in the vectors correspond to client sessions in which the client has requested the domain names; reducing a dimensionality of the vectors by applying a dimensionality reduction method for generating reduced vectors; applying a similarity metric to the reduced vectors to calculate the relatedness scores; and storing the relatedness scores of the domain names. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A server for calculating relatedness scores of domain names, which are indicative of relatedness of pairs of domain names requested by clients, the server comprising:
-
an input/output interface configured to receive domain name system (DNS) traffic data, wherein the DNS traffic data includes at least domain names requested by the clients and identities of the clients requesting the domain names; a processor connected to the input/output interface and configured to, generate, based on the identities of the clients, vectors including the requested domain names, wherein entries in the vectors correspond to client sessions in which the client has requested the domain names, reduce a dimensionality of the vectors by applying a dimensionality reduction method for generating reduced vectors, and apply a similarity metric to the reduced vectors to calculate the relatedness scores; and a memory connected to the processor and configured to store the relatedness scores of the domain names. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A computer readable medium including computer executable instructions, wherein the instructions, when executed, implement a method for calculating relatedness scores of domain names, which are indicative of relatedness of pairs of domain names requested by clients, the method comprising:
-
providing a system comprising distinct software modules, wherein the distinct software modules comprise a domain name system (DNS) traffic module, a vector generating module, and a mathematical module; receiving DNS traffic data via the DNS traffic module, wherein the DNS traffic data includes at least domain names requested by the clients and identities of the clients requesting the domain names; generating in the vector generating module, based on the identities of the clients, vectors including the requested domain names, wherein entries in the vectors correspond to client sessions in which the client has requested the domain names; reducing in the mathematical module dimensionality of the vectors by applying a dimensionality reduction method for generating reduced vectors; applying a similarity metric to the reduced vectors to calculate the relatedness scores; and storing the relatedness scores of the domain names. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification