Predicting geographic location associated with network address
First Claim
1. A computer system for generating a decision tree for predicting a geographic location based on an internet protocol (IP) address associated with a user, the computer system comprising:
- a data store containing a decision tree that is constructed to produce a predicted geographic location for an IP address associated with the user; and
a computing device in communication with the data store, wherein the computing device is configured to;
obtain a first set of sample data which includes IP addresses and user profile information, wherein the user profile information includes geographic location information correlated with an IP address;
train the decision tree with the first set of sample data, wherein the decision tree includes a number of leaf nodes and each leaf node of the decision tree associates at least one IP address and a predicted geographic location;
obtain a second set of sample data which includes an IP address and user profile information including geographic location information correlated with an IP address;
prune the trained decision tree with the second set of sample data;
obtain a third set of sample data which includes IP addresses and user profile information including geographic location information correlated with an IP address; and
qualify the decision tree with the third set of sample data, wherein after qualification, each leaf node is assigned a confidence number, wherein the confidence number indicates a degree of accuracy of the predicted geographic location associated with at least one IP address corresponding to the leaf node.
1 Assignment
0 Petitions
Accused Products
Abstract
A decision tree is provided as a machine learning classifier to predict a user attribute, such as a geographical location of a user, based on a network address. More specifically, the decision tree is constructed via machine learning on a set of sample data that reflects a relationship between a network address and a user attribute of a “known user” whose profile information is recognizable. For a given network address, the decision tree can be used as a machine learning classifier to predict the most likely user attribute of a potential user. With the predicted attribute, a network service can target a group of potential users for various campaigns without recognizing the identities of the potential users.
85 Citations
37 Claims
-
1. A computer system for generating a decision tree for predicting a geographic location based on an internet protocol (IP) address associated with a user, the computer system comprising:
-
a data store containing a decision tree that is constructed to produce a predicted geographic location for an IP address associated with the user; and a computing device in communication with the data store, wherein the computing device is configured to; obtain a first set of sample data which includes IP addresses and user profile information, wherein the user profile information includes geographic location information correlated with an IP address; train the decision tree with the first set of sample data, wherein the decision tree includes a number of leaf nodes and each leaf node of the decision tree associates at least one IP address and a predicted geographic location; obtain a second set of sample data which includes an IP address and user profile information including geographic location information correlated with an IP address; prune the trained decision tree with the second set of sample data; obtain a third set of sample data which includes IP addresses and user profile information including geographic location information correlated with an IP address; and qualify the decision tree with the third set of sample data, wherein after qualification, each leaf node is assigned a confidence number, wherein the confidence number indicates a degree of accuracy of the predicted geographic location associated with at least one IP address corresponding to the leaf node. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer-implemented method for predicting a geographic location of a user who accesses a network-based service and whose identity is not recognized, the computer-implemented method comprising:
-
obtaining, with a computer, an internet protocol (IP) address of a user device with which the user interacts with the network-based service; and providing, with the computer, the IP address of the user device to a classifier, and receiving from the classifier a predicted geographic location for the IP address provided; wherein the classifier is a decision tree having a number of leaf nodes, wherein each leaf node associates at least one IP address with a predicted geographic location. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A computer system for determining a user group for a user whose identity is not recognized, the computer system comprising:
-
a data store containing a classifier which is constructed to produce a predicted geographic location for a given internet protocol (IP) address; and a computing device in communication with the data store, wherein the computing device is configured to; obtain an IP address which is generated from an interaction of a user device with a network-based service; provide the IP address of the user device to the classifier; receive, from the classifier, a predicted geographic location associated with the IP address of the user device; and determine a user group that corresponds to the predicted geographic location; wherein the classifier is a decision tree having a number of paths, wherein the decision tree includes a number of leaf nodes and each leaf node of the decision tree associates at least one IP address with a predicted geographic location. - View Dependent Claims (21, 22, 23, 24)
-
-
25. A method of operating a computer to generate a decision tree for predicting a geographic location based on an internet protocol (IP) address associated with a user, the method comprising:
-
obtaining, with the computer, a first set of sample data which includes IP addresses and user profile information, wherein the user profile information includes geographic location information correlated with an IP address; training, with the computer, a decision tree with the first set of sample data, wherein each leaf node of the decision tree associates at least one IP address and a predicted geographic location; obtaining, with the computer, a second set of sample data which includes an IP address and user profile information including geographic location information correlated with an IP address; pruning, with the computer, the trained decision tree with the second set of sample data; obtaining, with the computer, a third set of sample data which includes IP addresses and user profile information including geographic location information correlated with an IP address; and qualifying the decision tree with the third set of sample data, wherein after qualification, each leaf node is assigned a confidence number, wherein the confidence number indicates a degree of accuracy of the predicted geographic location associated with at least one IP address corresponding to the leaf node. - View Dependent Claims (26, 27, 28, 29, 30, 31, 32)
-
-
33. A non-transitory, computer-readable medium having computer-executable components encoded thereon that, in response to execution by a computer, cause the computer to map a predicted geographic location of a user to an internet protocol (IP) address, the computer-executable components comprising:
-
a request process component for receiving information which is generated from a user'"'"'s activities over a network, wherein the information includes an IP address associated with the user; and a classifier component for predicting a geographic location associated with the identified the IP address that is associated with the user; wherein the predicted geographic location of the user is obtained from the classifier component; and wherein the classifier component is a decision tree in which the decision tree includes a number of leaf nodes and each leaf node of the decision tree associates at least one IP address with a predicted geographic location. - View Dependent Claims (34, 35, 36)
-
-
37. A method of operating a computer to provide information targeted to a user when the user clicks on a webpage, the method comprising:
-
receiving, with the computer, clickstream information which is generated as the user clicks on the webpage, wherein the clickstream information includes an internet protocol address of a user device; identifying, with the computer, the internet protocol address from the clickstream information; providing, with the computer, the identified internet protocol address to a classifier; receiving, with the computer, a predicted geographic location from the classifier, wherein the classifier is a decision tree trained to predict a geographic location for a given internet protocol address, and wherein the decision tree includes a number of leaf nodes and each leaf node of the decision tree associates at least one IP address with a predicted geographic location; obtaining, with the computer, information targeted to a group of users in the predicted geographic location; and transmitting, with the computer, the obtained information to the user device.
-
Specification