Processes for calculating item distances and performing item clustering
First Claim
Patent Images
1. A computer-implemented method of measuring the distance between a first item and a second item, the method comprising:
- identifying a first set of one or more paths in a hierarchical browse structure from a root node of the hierarchical browse structure to the first item, said hierarchical browse structure comprising multiple levels of browse nodes arranged in a hierarchy, each browse node representing a respective item category;
identifying a second set of one or more paths in the hierarchical browse structure from the root node to the second item;
identifying a plurality of shared browse nodes for the first and second items, said shared browse nodes being browse nodes that occur in both the first set of paths and the second set of paths, each shared browse node representing a respective item category in which the first and second items are both categorized;
calculating data values for particular shared browse nodes;
calculating a distance value that represents the distance between the first item and the second item such that the distance value is dependent upon at least (a) the number of said shared browse nodes, (b) the total number of browse nodes in the first and second sets of paths, and (c) said data values, such that different amounts of weight are given to different shared browse nodes in calculating the distance value; and
storing the distance value in computer storage.
1 Assignment
0 Petitions
Accused Products
Abstract
Computer-implemented processes are disclosed for clustering items, and for using item clusters to generate and/or present item recommendations. One process involves calculating distances between items based on how the items are categorized within a hierarchical browse structure. These distance calculations may then be used as a basis for forming clusters of items.
-
Citations
29 Claims
-
1. A computer-implemented method of measuring the distance between a first item and a second item, the method comprising:
-
identifying a first set of one or more paths in a hierarchical browse structure from a root node of the hierarchical browse structure to the first item, said hierarchical browse structure comprising multiple levels of browse nodes arranged in a hierarchy, each browse node representing a respective item category; identifying a second set of one or more paths in the hierarchical browse structure from the root node to the second item; identifying a plurality of shared browse nodes for the first and second items, said shared browse nodes being browse nodes that occur in both the first set of paths and the second set of paths, each shared browse node representing a respective item category in which the first and second items are both categorized; calculating data values for particular shared browse nodes; calculating a distance value that represents the distance between the first item and the second item such that the distance value is dependent upon at least (a) the number of said shared browse nodes, (b) the total number of browse nodes in the first and second sets of paths, and (c) said data values, such that different amounts of weight are given to different shared browse nodes in calculating the distance value; and storing the distance value in computer storage. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A computer system, comprising:
-
a computer data repository that stores a representation of a hierarchical browse structure for browsing a repository of items, said hierarchical browse structure comprising multiple levels of browse nodes, each browse node representing a particular item category; a distance calculation system comprising a machine, said distance calculation system configured to calculate distance values representing distances between particular items such that the distance value for a pair of items, item 1 and item 2, depends on (a) a number of browse nodes that are shared by both item 1 and item 2 in the hierarchical browse structure, and (2) weight values calculated for said shared browse nodes, said weight values calculated such that the shared browse nodes are not all weighted equally; and a clustering system configured to use at least the distance values calculated by the distance calculation system to form clusters of related items. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21)
-
-
22. A computer-readable medium having stored thereon executable instructions which, when executed by a computer system, cause the computer system to:
-
identify a set of shared browse nodes for a first item and a second item that are represented in a hierarchical browse structure, said hierarchical browse structure comprising multiple levels of browse nodes, each browse node representing an item category, said first item accessible via a first set of one or more paths from a root node of the hierarchical browse structure, said second item accessible via a second set of one or more paths from the root node, said shared browse nodes being browse nodes that occur in both the first set of paths and the second set of paths, said first and second sets of paths reflecting categorizations of the first and second items, respectively, in the hierarchical browse structure; calculate data values for particular ones of said shared browse nodes; and calculate and store a distance value that represents a distance between the first item and the second item such that the distance value is dependent upon at least (a) the number of said shared browse nodes, (b) the total number of browse nodes in the first and second sets of paths, and (c) said data values, such that different amounts of weight are given to different shared browse nodes in calculating the distance value. - View Dependent Claims (23, 24, 25, 26, 27, 28, 29)
-
Specification