Systems and methods for evaluating a group detection tool
First Claim
1. A method of evaluating a group detection tool without knowledge regarding a composition of real-world groups to which individuals in an observed population belong, comprising:
- receiving, using a microprocessor, a set of data comprising a collection of asserted links between pairs of individuals;
creating, using the microprocessor, a list of the individuals present in the collection of asserted links, each individual on the list being assigned a unique numerical identifier;
creating, using the microprocessor, a square co-occurrence matrix describing the collection of asserted links, the square co-occurrence matrix comprising a plurality of rows and a plurality of columns each containing the list of individuals;
inserting, using the microprocessor, a link indicator in each cell of the square co-occurrence matrix, the link indicator having a first value if a first individual associated with the row and a second individual associated with the column are linked and a second value if they are not;
identifying, using the microprocessor, a plurality of square sub-co-occurrence matrices in the square co-occurrence matrix;
generating, using the microprocessor, a statistic for a characteristic of at least one class of the plurality of square sub-co-occurrence matrices;
identifying, using the microprocessor, one of the plurality of sub-co-occurrence matrices for each of the at least one class of square sub-co-occurrence matrices; and
evaluating, using the microprocessor, the performance of the group detection tool by;
applying the group detection tool to the set of data, anddetermining the extent to which a plurality of individuals in the at least one square sub-co-occurrence matrix are associated in one or more groups generated by the group detection tool.
7 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods for evaluating a group detection tool are described. One described method includes a set of data including a collection of asserted links between pairs of individuals, creating a sorted list of the individuals present in the collection of asserted links, and creating a square co-occurrence matrix describing the collection of asserted links, the square co-occurrence matrix including a plurality of rows and a plurality of columns each containing the sorted list of individuals. The method also includes inserting a link indicator in each cell of the square co-occurrence matrix, the link indicator having a first value if a first individual associated with the row and a second individual associated with the column are linked and a second value if they are not, identifying a plurality of square sub-co-occurrence matrices in the square co-occurrence matrix, generating a statistic for a characteristic of at least one class of the plurality of square sub-co-occurrence matrices, and identifying one of the plurality of sub-co-occurrence matrices for each of the at least one class of square sub-co-occurrence matrices. The method further includes evaluating the performance of the group detection tool by: applying the group detection tool to the set of data, and determining the extent to which a plurality of individuals in the at least one square sub-co-occurrence matrix are associated in one or more groups generated by the group detection tool.
-
Citations
30 Claims
-
1. A method of evaluating a group detection tool without knowledge regarding a composition of real-world groups to which individuals in an observed population belong, comprising:
-
receiving, using a microprocessor, a set of data comprising a collection of asserted links between pairs of individuals; creating, using the microprocessor, a list of the individuals present in the collection of asserted links, each individual on the list being assigned a unique numerical identifier; creating, using the microprocessor, a square co-occurrence matrix describing the collection of asserted links, the square co-occurrence matrix comprising a plurality of rows and a plurality of columns each containing the list of individuals; inserting, using the microprocessor, a link indicator in each cell of the square co-occurrence matrix, the link indicator having a first value if a first individual associated with the row and a second individual associated with the column are linked and a second value if they are not; identifying, using the microprocessor, a plurality of square sub-co-occurrence matrices in the square co-occurrence matrix; generating, using the microprocessor, a statistic for a characteristic of at least one class of the plurality of square sub-co-occurrence matrices; identifying, using the microprocessor, one of the plurality of sub-co-occurrence matrices for each of the at least one class of square sub-co-occurrence matrices; and evaluating, using the microprocessor, the performance of the group detection tool by; applying the group detection tool to the set of data, and determining the extent to which a plurality of individuals in the at least one square sub-co-occurrence matrix are associated in one or more groups generated by the group detection tool. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. A non-transitory computer-readable storage medium with an executable program stored thereon, wherein the program instructs a microprocessor to perform the following steps for evaluating a group detection tool without knowledge regarding a composition of real-world groups to which individuals in an observed population belong:
-
receiving a set of data comprising a collection of asserted links between pairs of individuals; creating a list of the individuals present in the collection of asserted links, each individual on the list being assigned a unique numerical identifier creating a square co-occurrence matrix describing the collection of asserted links, the square co-occurrence matrix comprising a plurality of rows and a plurality of columns each containing the list of individuals; inserting a link indicator in each cell of the square co-occurrence matrix, the link indicator having a first value if a first individual associated with the row and a second individual associated with the column are linked and a second value if they are not; identifying a plurality of square sub-co-occurrence matrices in the square co-occurrence matrix; generating a statistic for a characteristic of at least one class of the plurality of square sub-co-occurrence matrices; identifying one of the plurality of sub-co-occurrence matrices for each of the at least one class of square sub-co-occurrence matrices; and evaluating the performance of the group detection tool by; applying the group detection tool to the set of data, and determining the extent to which a plurality of individuals in the at least one square sub-co-occurrence matrix are associated in one or more groups generated by the group detection tool. - View Dependent Claims (19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30)
-
Specification