System and method for evaluating clustering in case control data
Abstract
A method and system to evaluate clustering in case control data for a plurality of individuals taking into account dynamic location information. A set of space time coordinates for each individual is established. The set of space time coordinates indicates a geographic location of a residence of the individual at a beginning time and an ending time. A case control identifier for each individual is established. For at least one case individual whose case control identifier has the first value, a spatially and temporally local case-control cluster statistic is established as a function of the set of space time coordinates of each individual, the case control identifier, and the neighbor relationship values between the one case individual and the other individuals. Dynamic location information for exposure sources is used to establish a focused case-control cluster statistic as a function of the set of space time coordinates of each exposure source, the space time coordinates of each individual, the case control identifier, and the neighbor relationship values between the case individuals, the other individuals and the exposure sources.
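The focused statistic described in the abstract can be illustrated with a minimal sketch. It is not the patented formula. It assumes, purely for illustration, that an individual is a neighbor of an exposure source when the two lie within a fixed distance of each other at a given time, and that the statistic is a count of such case neighbors summed over the sampled times. The names `position_at`, `focused_statistic`, and `radius` are hypothetical.

```python
from math import hypot

def position_at(track, t):
    """Return (x, y) at time t from a track of (t_begin, t_end, x, y) records.

    Each record is treated as the half-open interval [t_begin, t_end).
    """
    for t_begin, t_end, x, y in track:
        if t_begin <= t < t_end:
            return (x, y)
    return None

def focused_statistic(source_track, individuals, times, radius):
    """Count cases located within `radius` of a moving source, summed over times.

    `individuals` is a list of (is_case, track) pairs, with is_case 1 or 0.
    The distance-threshold neighbor rule is an assumption of this sketch.
    """
    total = 0
    for t in times:
        src = position_at(source_track, t)
        if src is None:
            continue
        for is_case, track in individuals:
            pos = position_at(track, t)
            if pos is not None and hypot(pos[0] - src[0], pos[1] - src[1]) <= radius:
                total += is_case  # controls contribute 0
    return total

# Hypothetical data: an exposure source that relocates at t = 5.
source = [(0, 5, 0.0, 0.0), (5, 10, 10.0, 0.0)]
individuals = [
    (1, [(0, 10, 1.0, 0.0)]),  # case near the source's first location
    (1, [(0, 10, 9.0, 0.0)]),  # case near the source's second location
    (0, [(0, 10, 0.0, 1.0)]),  # control near the source's first location
]
focused_total = focused_statistic(source, individuals, [0, 5], 2.0)
```

Because both the source and the individuals carry time-stamped locations, the count changes when either one moves.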
27 Claims
1. A method of evaluating clustering in case control data for a plurality of individuals taking into account dynamic location information, comprising:
establishing a set of space time coordinates for each individual, the set of space time coordinates being indicative of a geographic location of a residence of the individual at a beginning time and an ending time;
establishing a case control identifier for each individual, the case control identifier having a first control value if the individual is a case and a second control value if the individual is not a case;
establishing a neighbor relationship value between each individual and the other individuals, wherein the neighbor relationship value between one individual and another individual has a first relationship value if the one individual and the another individual are neighbors according to a set of predetermined criteria, and a second relationship value if they are not neighbors; and
for at least one case individual whose case control identifier has the first value, establishing a spatially and temporally local case-control cluster statistic as a function of the set of space time coordinates of each individual, the case control identifier, and the neighbor relationship values between the one case individual and the other individuals.
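The steps of claim 1 can be sketched in code. This is a minimal illustration under stated assumptions, not the claimed implementation. The claim leaves the neighbor criteria open ("a set of predetermined criteria"), so the sketch assumes a simple distance threshold. It also assumes the local statistic is the number of case neighbors of a case individual, summed over a list of sample times. The names `Residence`, `Individual`, `neighbor_value`, `local_statistic`, and `max_dist` are hypothetical.

```python
from dataclasses import dataclass, field
from math import hypot

@dataclass(frozen=True)
class Residence:
    """One space time coordinate: a location held over [t_begin, t_end)."""
    x: float
    y: float
    t_begin: float
    t_end: float

@dataclass
class Individual:
    ident: str
    is_case: int  # case control identifier: 1 = case, 0 = not a case
    residences: list = field(default_factory=list)

def location_at(ind, t):
    """Return the (x, y) residence of `ind` at time t, or None if unknown."""
    for r in ind.residences:
        if r.t_begin <= t < r.t_end:
            return (r.x, r.y)
    return None

def neighbor_value(a, b, t, max_dist):
    """Return 1 if a and b are neighbors at time t, else 0.

    Assumed criterion: both have a known residence at t and live within
    max_dist of each other. An individual is not its own neighbor.
    """
    if a is b:
        return 0
    pa, pb = location_at(a, t), location_at(b, t)
    if pa is None or pb is None:
        return 0
    return 1 if hypot(pa[0] - pb[0], pa[1] - pb[1]) <= max_dist else 0

def local_statistic(ind, people, times, max_dist):
    """Case neighbors of `ind`, summed over times; zero when `ind` is not a case."""
    return ind.is_case * sum(
        neighbor_value(ind, other, t, max_dist) * other.is_case
        for t in times
        for other in people
    )

# Hypothetical data: B moves away from A at t = 5.
a = Individual("A", 1, [Residence(0.0, 0.0, 0, 10)])
b = Individual("B", 1, [Residence(1.0, 0.0, 0, 5), Residence(10.0, 10.0, 5, 10)])
c = Individual("C", 0, [Residence(0.0, 1.0, 0, 10)])
people = [a, b, c]
q_a = local_statistic(a, people, [0, 5], 2.0)
```

In this example B counts toward A's statistic only while B still lives nearby, which is the effect of using residential histories rather than a single static address.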
11. A method of evaluating clustering in case control data for a plurality of individuals taking into account dynamic location information, comprising:
establishing a set of space time coordinates for each individual, the set of space time coordinates being indicative of a geographic location of a residence of the individual at a beginning time and an ending time;
establishing a case control identifier for each individual, the case control identifier having a first control value if the individual is a case and a second control value if the individual is not a case;
establishing a neighbor relationship value between each individual and the other individuals, wherein the neighbor relationship value between one individual and another individual has a first relationship value if the one individual and the another individual are neighbors according to a set of predetermined criteria, and a second relationship value if they are not neighbors;
for at least one case individual whose case control identifier has the first value, establishing a spatially and temporally local case-control cluster statistic as a function of the set of space time coordinates of each individual, the case control identifier, and the neighbor relationship values between the one case individual and the other individuals;
establishing a probability of another individual being a case;
establishing a global statistic for spatial clustering of cases at a time, t, as a function of the case control identifiers and a neutral model of spatially heterogeneous population density;
establishing a sum of the global statistic for spatial clustering over times, T+1;
establishing a first test statistic as a function of the global statistic for spatial clustering, the first test statistic being indicative of whether cases tend to cluster through time around a specific case;
identifying a focus individual, where cases may be clustering about the focus individual;
establishing a lifeline for the focus individual, the lifeline including the set of space time coordinates for the focus individual;
establishing a second test statistic representing a count of neighbors of the focus individual who are cases at a focus time; and
establishing a third test statistic as a function of the second test statistic, the third test statistic representing a count of neighbors of the focus individual who are cases between the beginning time and the ending time.
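The additional steps of claim 11 can be sketched as follows. This is an illustration under stated assumptions, not the claimed formulas. It assumes the neighbor relationship values are already available as one 0/1 matrix per sample time, that the global statistic at a time counts ordered case-case neighbor pairs, and that the neutral model is simulated by drawing each individual's case status independently from a per-individual case probability. All function names, `case_prob`, and the Monte Carlo procedure are hypothetical.

```python
import random

def global_statistic(eta_t, cases):
    """Ordered case-case neighbor pairs at one time (each pair counted twice)."""
    n = len(cases)
    return sum(
        cases[i] * eta_t[i][j] * cases[j]
        for i in range(n) for j in range(n) if i != j
    )

def summed_global_statistic(eta_by_time, cases):
    """Sum of the global statistic over all sampled times."""
    return sum(global_statistic(eta_t, cases) for eta_t in eta_by_time)

def focus_count(eta_t, cases, focus):
    """Neighbors of the focus individual who are cases at one time."""
    return sum(eta_t[focus][j] * cases[j] for j in range(len(cases)) if j != focus)

def focus_count_over_time(eta_by_time, cases, focus):
    """Focus counts accumulated over all sampled times."""
    return sum(focus_count(eta_t, cases, focus) for eta_t in eta_by_time)

def monte_carlo_p_value(eta_by_time, cases, case_prob, n_sim=999, seed=0):
    """Share of simulated labelings at least as clustered as the observed one.

    Assumed neutral model: individual i is a case with probability case_prob[i].
    """
    rng = random.Random(seed)
    observed = summed_global_statistic(eta_by_time, cases)
    hits = 0
    for _ in range(n_sim):
        simulated = [1 if rng.random() < p else 0 for p in case_prob]
        if summed_global_statistic(eta_by_time, simulated) >= observed:
            hits += 1
    return (hits + 1) / (n_sim + 1)

# Hypothetical data: four individuals, two sample times, symmetric neighbor matrices.
eta_t0 = [[0, 1, 0, 0], [1, 0, 0, 0], [0, 0, 0, 1], [0, 0, 1, 0]]
eta_t1 = [[0, 1, 0, 0], [1, 0, 1, 0], [0, 1, 0, 0], [0, 0, 0, 0]]
eta_by_time = [eta_t0, eta_t1]
cases = [1, 1, 0, 1]
q_sum = summed_global_statistic(eta_by_time, cases)
focus_total = focus_count_over_time(eta_by_time, cases, focus=1)
p_value = monte_carlo_p_value(eta_by_time, cases, [0.5] * 4)
```

Keeping the neighbor matrices separate from the case labels makes it cheap to re-evaluate the statistics under many simulated labelings, since only the `cases` vector changes between simulations.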
14. A system for evaluating clustering in case control data for a plurality of individuals taking into account dynamic location information, comprising:
a database for storing the case control data; and
a computer coupled to the database for establishing a set of space time coordinates for each individual as a function of the case control data, the set of space time coordinates being indicative of a geographic location of a residence of the individual at a beginning time and an ending time, for establishing a case control identifier for each individual, the case control identifier having a first control value if the individual is a case and a second control value if the individual is not a case, for establishing a neighbor relationship value between each individual and the other individuals, wherein the neighbor relationship value between one individual and another individual has a first relationship value if the one individual and the another individual are neighbors according to a set of predetermined criteria, and a second relationship value if they are not neighbors, and, for at least one case individual whose case control identifier has the first value, for establishing a spatially and temporally local case-control cluster statistic as a function of the set of space time coordinates of each individual, the case control identifier, and the neighbor relationship values between the one case individual and the other individuals.
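The database element of claim 14 can be illustrated with a small sketch of how case control data with residential histories might be stored and queried. The claim does not specify a schema or a database product. The table names, column names, and the use of SQLite here are assumptions made only for illustration.

```python
import sqlite3

# In-memory database with a hypothetical two-table schema.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE individual (
    id      INTEGER PRIMARY KEY,
    is_case INTEGER NOT NULL CHECK (is_case IN (0, 1))
);
CREATE TABLE residence (
    individual_id INTEGER NOT NULL REFERENCES individual(id),
    x       REAL NOT NULL,
    y       REAL NOT NULL,
    t_begin REAL NOT NULL,
    t_end   REAL NOT NULL
);
""")

# Hypothetical data: individual 2 changes residence at t = 5.
conn.executemany("INSERT INTO individual VALUES (?, ?)", [(1, 1), (2, 1), (3, 0)])
conn.executemany(
    "INSERT INTO residence VALUES (?, ?, ?, ?, ?)",
    [
        (1, 0.0, 0.0, 0, 10),
        (2, 1.0, 0.0, 0, 5),
        (2, 10.0, 10.0, 5, 10),
        (3, 0.0, 1.0, 0, 10),
    ],
)

def snapshot(conn, t):
    """Return (id, is_case, x, y) for every individual with a residence at time t.

    Each residence is treated as the half-open interval [t_begin, t_end).
    """
    rows = conn.execute(
        """
        SELECT i.id, i.is_case, r.x, r.y
        FROM individual AS i
        JOIN residence AS r ON r.individual_id = i.id
        WHERE r.t_begin <= ? AND ? < r.t_end
        ORDER BY i.id
        """,
        (t, t),
    )
    return rows.fetchall()

locations_at_5 = snapshot(conn, 5)
```

A snapshot like this gives the computer the locations and case control identifiers it needs to derive neighbor relationship values at a chosen time.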
24. A system for evaluating clustering in case control data for a plurality of individuals taking into account dynamic location information, comprising:
a database for storing case control data;
a computer coupled to the database for establishing a set of space time coordinates for each individual as a function of the case control data, the set of space time coordinates being indicative of a geographic location of a residence of the individual at a beginning time and an ending time, for establishing a case control identifier for each individual, the case control identifier having a first control value if the individual is a case and a second control value if the individual is not a case, for establishing a neighbor relationship value between each individual and the other individuals, wherein the neighbor relationship value between one individual and another individual has a first relationship value if the one individual and the another individual are neighbors according to a set of predetermined criteria, and a second relationship value if they are not neighbors, for at least one case individual whose case control identifier has the first value, for establishing a spatially and temporally local case-control cluster statistic as a function of the set of space time coordinates of each individual, the case control identifier, and the neighbor relationship values between the one case individual and the other individuals, for establishing a probability of another individual being a case, for establishing a global statistic for spatial clustering of cases at a time, t, as a function of the case control identifiers and a neutral model of spatially heterogeneous population density, for establishing a sum of the global statistic for spatial clustering over times, T+1, for establishing a first test statistic as a function of the global statistic for spatial clustering, the first test statistic being indicative of whether cases tend to cluster through time around a specific case, for identifying a focus individual, where cases may be clustering about the focus individual, for establishing a lifeline for the focus individual, the lifeline including the set of space time coordinates for the focus individual, for establishing a second test statistic representing a count of neighbors of the focus individual who are cases at a focus time, and for establishing a third test statistic as a function of the second test statistic, the third test statistic representing a count of neighbors of the focus individual who are cases between the beginning time and the ending time.
Specification