Structured relevance—a mechanism to reveal why data is related
Abstract
A machine receives a group of members of a data set. The machine identifies key symbols from the members of the group or the data set. The machine then calculates, for each key symbol, a weighted magnitude for the key symbol in the group. The machine can then sort the key symbols according to their weighted magnitudes, and filter out common key symbols. The uncommon key symbols, as sorted according to their weighted magnitudes, can form a name for the group.
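The steps in the abstract can be sketched in a few lines of Python. This is only an illustrative sketch under stated assumptions, not the patented implementation: the weighted magnitude formula (occurrences divided by the number of members, multiplied by a maximum value for the symbol) follows the wording of the claims, while the representation of members as lists of symbols, the `max_value` table, the `common_symbols` set, and the function names are all hypothetical.

```python
from collections import Counter

def weighted_magnitudes(members, max_value):
    """Return {key symbol: weighted magnitude} for one group.

    members:   list of members, each a list of key symbols (assumed representation)
    max_value: dict mapping a key symbol to its maximum value (assumed to be supplied)
    """
    total_members = len(members)
    # Count every occurrence of each key symbol across the group's members.
    occurrences = Counter(symbol for member in members for symbol in member)
    return {
        symbol: (count / total_members) * max_value[symbol]
        for symbol, count in occurrences.items()
    }

def name_group(members, max_value, common_symbols, top_n=2):
    """Sort key symbols by weighted magnitude, drop common ones, join the rest."""
    magnitudes = weighted_magnitudes(members, max_value)
    # Highest magnitude first; ties are broken alphabetically (an assumption).
    ranked = sorted(magnitudes, key=lambda s: (-magnitudes[s], s))
    uncommon = [s for s in ranked if s not in common_symbols]
    return " ".join(uncommon[:top_n])

members = [["the", "solar", "panel"], ["the", "solar", "cell"], ["the", "wind", "panel"]]
max_value = {symbol: 1.0 for member in members for symbol in member}
print(name_group(members, max_value, common_symbols={"the"}))  # panel solar
```

Here "the" has the highest weighted magnitude but is filtered out as common, so the name is built from the next-ranked symbols.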
18 Claims
1. An apparatus, comprising:
a computer;
an input port to receive a group including members of a data set;
a key symbol identifier to identify a key symbol for the group;
a weighted magnitude calculator to calculate a weighted magnitude for each key symbol based on a number of occurrences for each key symbol in the members divided by a total number of members for the group to form a quotient that is multiplied by a maximum value for that key symbol; and
a sorter to sort the key symbols based on the weighted magnitude for each key symbol;
wherein the weighted magnitudes of the sorted key symbols are used to represent relationships for each member of the group relative to remaining ones of the members, and wherein the group includes a group strength, the group strength is a combination of:
a group order for the group, a group weight for the group, and a group distance for the group,
and wherein the combination is calculated to ensure that the group order is more significant than the group weight and the group weight is more significant than the group distance,
and wherein the group weight is a maximum number of hops that must be made from any member within the group to reach a core of the data set and the core is a particular set of members of the data set such that each member is the nearest neighboring member of each other member,
and wherein the group order is determined by a total number of the members within the group,
and wherein the group distance is a calculated average distance for each member of the group to the nearest neighboring member for that member,
and wherein the group strength is compared against other calculated group strengths for other groups to determine a strength between the group and the other groups relative to one another.
Dependent claims: 2, 3, 4, 5, 6
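One way to make the group order more significant than the group weight, and the group weight more significant than the group distance, is to compare the three values lexicographically. The sketch below uses a Python named tuple for that purpose. It is an assumption for illustration only: the claim does not say how the combination is computed, the `GroupStrength` type is hypothetical, and treating larger values as stronger (`reverse=True`) is likewise an assumption.

```python
from typing import NamedTuple

class GroupStrength(NamedTuple):
    # Field order sets precedence: tuples compare left to right.
    order: int       # total number of members in the group
    weight: int      # maximum number of hops from any member to the core
    distance: float  # average distance from each member to its nearest neighbor

a = GroupStrength(order=5, weight=2, distance=0.9)
b = GroupStrength(order=5, weight=3, distance=0.1)
c = GroupStrength(order=4, weight=9, distance=0.0)

# Assumption: larger tuples are ranked first. Order decides first,
# weight only breaks ties in order, distance only breaks ties in both.
ranked = sorted([a, b, c], reverse=True)
print(ranked == [b, a, c])  # True
```

Because `c` has a smaller order than `a` and `b`, its large weight never comes into play; `b` outranks `a` on weight alone, regardless of distance.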
7. A method, comprising:
using a processor, identifying a group including members of a data set;
identifying key symbols in the group, the group including at least one member of the data set including other members;
determining a weighted magnitude for each key symbol;
sorting the key symbols based on a weighted magnitude for each of the key symbols, the sorted key symbols; and
defining a group strength for the group as a combination of:
a group order for the group, a group weight for the group, and a group distance for the group,
wherein the group order is a total number of members present within the group,
the group weight is a maximum number of hops that must be made from any member within the group to reach a core of the data set and the core is a particular set of members of the data set such that each member is the nearest neighboring member of each other member,
the group distance is a calculated average distance for each member of the group to the nearest neighboring member for that member,
and wherein the combination is calculated to ensure that the group order is more significant than the group weight and the group weight is more significant than the group distance
and comparing the group strength against other calculated strengths for other groups for determining a strength between the group and the other groups;
wherein the weighted magnitudes of the sorted key symbols represent relationships for each member of the group relative to remaining ones of the members of the group.
Dependent claims: 8, 9, 10, 11, 12
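The group order and group distance defined in the method claim can be illustrated with a short sketch. The assumptions here are not from the claim: members are modeled as coordinate tuples, distance is Euclidean, the nearest neighbor is searched over the whole data set rather than only within the group, and the function names are hypothetical.

```python
import math

def nearest_neighbor_distance(i, points):
    """Distance from points[i] to its closest other point in the data set."""
    return min(math.dist(points[i], p) for j, p in enumerate(points) if j != i)

def group_order(group):
    """Total number of members in the group (group holds indices into points)."""
    return len(group)

def group_distance(group, points):
    """Average, over the group's members, of each member's nearest-neighbor distance."""
    return sum(nearest_neighbor_distance(i, points) for i in group) / len(group)

points = [(0, 0), (0, 1), (0, 3), (4, 3)]  # the data set
group = [0, 1, 2]                          # indices of the group's members

print(group_order(group))  # 3
```

For this data the nearest-neighbor distances of the three members are 1, 1, and 2, so `group_distance(group, points)` evaluates to their average, 4/3.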
13. An article comprising a non-transitory storage medium, said non-transitory storage medium having stored thereon instructions, that, when executed by a machine, result in:
identifying a group including a particular member of a data set having multiple other members;
identifying key symbols in the group;
determining a weighted magnitude for each key symbol based at least in part on a number of occurrences for each key symbol in the members divided by a total number of members for the group to form a quotient that is multiplied by a maximum value for that key symbol;
sorting the key symbols according to weighted magnitudes for each of the key symbols; and
defining a group strength for the group as a combination of:
a group order for the group, a group weight for the group, and a group distance for the group,
wherein the group order is a total number of members present within the group,
the group weight is a maximum number of hops that must be made from any member within the group to reach a core of the data set and the core is a particular set of members of the data set such that each member is the nearest neighboring member of each other member,
the group distance is a calculated average distance for each member of the group to the nearest neighboring member for that member,
and wherein the combination is calculated to ensure that the group order is more significant than the group's weight and the group's weight is more significant than the group's distance,
and comparing the group strength against other calculated strengths for other groups for determining a strength between the group and the other groups;
wherein the weighted magnitudes of the sorted key symbols represent relationships for each member of the group relative to remaining ones of the members of the group.
Dependent claims: 14, 15, 16, 17, 18
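The group weight in the claims depends on a core whose members are each other's nearest neighbors. The sketch below shows one possible reading, with several assumptions that the claims do not state: a "hop" is taken to mean moving from a member to its nearest neighbor, the core is taken to be a mutual nearest-neighbor pair reached by following those hops, distance is Euclidean, and all function names are hypothetical.

```python
import math

def nearest_neighbor(i, points):
    """Index of the point closest to points[i]; ties go to the lowest index."""
    return min((j for j in range(len(points)) if j != i),
               key=lambda j: math.dist(points[i], points[j]))

def hops_to_core(i, nn):
    """Follow nearest-neighbor links until reaching a mutual nearest-neighbor pair."""
    hops = 0
    # i is in a core pair when its nearest neighbor points straight back at it.
    while nn[nn[i]] != i:
        i = nn[i]
        hops += 1
    return hops

def group_weight(group, points):
    """Maximum number of hops any member of the group needs to reach a core."""
    nn = [nearest_neighbor(i, points) for i in range(len(points))]
    return max(hops_to_core(i, nn) for i in group)

points = [(0, 0), (0, 1), (0, 3), (0, 7)]  # the data set
print(group_weight([0, 2, 3], points))     # 2
```

Points 0 and 1 are mutual nearest neighbors and form the core in this example. Point 2 reaches it in one hop and point 3 in two, so the weight of the group is 2.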
Specification