Summarization and communication of large data sets
First Claim
1. A computer-implemented method, comprising:
generating a structure for a plurality of web pages of a web site, the structure comprising a plurality of nodes representing the plurality of web pages;
determining a traffic pattern between a first web page, a second web page, and a third web page of the plurality of web pages, the traffic pattern determined based on web traffic information associated with the web site;
generating a visualization of the traffic pattern based on recursive grouping of a plurality of geometric shapes that correspond to the plurality of nodes, wherein generating the visualization comprises:
grouping, into a first group, geometric shapes representing the first web page and the second web page, wherein the grouping is based on the traffic pattern indicating that a volume of web traffic between the first web page and the second web page exceeds a first threshold, the first group having a first geometric shape that contains the geometric shapes;
setting a first size and a first label for the first geometric shape based on the web traffic between the first web page and the second web page; and
grouping, into a second group, the first geometric shape and a geometric shape representing the third web page, wherein the grouping is based on the traffic pattern indicating that a volume of web traffic between the third web page and web pages in the first group is lower than the first threshold and exceeds a second threshold, wherein the second group has a second geometric shape that includes the first geometric shape of the first group and the geometric shape of the third web page; and
providing the visualization for display in a user interface to indicate the traffic pattern, wherein the visualization simultaneously displays (i) the first geometric shape of the first group as containing the geometric shapes of the first web page and the second web page and (ii) the second geometric shape of the second group as containing the first geometric shape of the first group and the geometric shape of the third web page, wherein the visualization sizes the geometric shape according to the first size and labels the first geometric shape according to the first label.
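The two-threshold grouping recited in this claim can be illustrated with a minimal Python sketch. It is not the patented implementation: the `Shape` class, the function name `group_three_pages`, the label format, and the choice to sum the third page's traffic over both pages of the first group are all assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Shape:
    """Hypothetical geometric shape: a label, a size, and contained shapes."""
    label: str
    size: float
    children: list = field(default_factory=list)


def group_three_pages(traffic, first, second, third, t1, t2):
    """Nest three pages by traffic volume (assumed: t1 > t2).

    `traffic` maps frozenset({page_a, page_b}) to a traffic volume.
    Returns the outermost Shape, or None if no first group forms.
    """
    def vol(a, b):
        return traffic.get(frozenset((a, b)), 0)

    pair_volume = vol(first, second)
    if pair_volume <= t1:
        return None  # traffic does not exceed the first threshold

    # First group: its shape is sized and labeled from the pair's traffic.
    inner = Shape(
        label=f"{first} + {second}",
        size=pair_volume,
        children=[Shape(first, 1.0), Shape(second, 1.0)],
    )

    # Assumption: traffic between the third page and the first group
    # is the sum over both pages in that group.
    outer_volume = vol(third, first) + vol(third, second)
    if not (t2 < outer_volume < t1):
        return inner  # third page is not grouped at this level

    # Second group: contains the first group's shape and the third page.
    return Shape(
        label=f"({inner.label}) + {third}",
        size=pair_volume + outer_volume,
        children=[inner, Shape(third, 1.0)],
    )
```

With volumes of 900 between "home" and "products" and a combined 300 between "help" and those two pages, thresholds of 500 and 150 would place "home" and "products" in the inner shape and "help" beside it in the outer shape.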
Abstract
Techniques for providing information about large data sets may be provided. For example, a summary of the data sets and of patterns between the data sets may be presented. Traffic associated with a network-based resource that includes a number of documents may be an example of large data sets. The traffic may be analyzed and traffic patterns may be determined. A structure may be generated based on the traffic patterns and may use nodes to represent the documents. Further, a visualization of the structure may be presented. The visualization may include recursive clusters of the nodes, where the clusters may be labeled based on the respective clustered nodes.
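As a rough illustration of how traffic patterns might be derived from raw traffic, the sketch below counts navigations between pairs of documents. The input format (one ordered list of visited pages per session) and the function name `transition_counts` are assumptions; the abstract does not specify how the traffic is recorded or analyzed.

```python
from collections import Counter


def transition_counts(sessions):
    """Count navigations between each unordered pair of pages.

    `sessions` is a hypothetical input: an iterable of page paths,
    each listing the pages one visitor viewed in order.
    """
    counts = Counter()
    for path in sessions:
        # Pair each page with the page visited immediately after it.
        for src, dst in zip(path, path[1:]):
            if src != dst:  # ignore reloads of the same page
                counts[frozenset((src, dst))] += 1
    return counts
```

The resulting counts could then serve as the per-pair traffic volumes that a grouping or clustering step compares against thresholds.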
14 Citations
20 Claims
1. A computer-implemented method, as recited in full under First Claim above.
Dependent claims: 2, 3, 4, 5, 17, 18, 19, 20
6. A system for providing traffic information associated with a web site, comprising:
a processor;
a memory communicatively coupled to the processor and bearing instructions that, upon execution by the processor, cause the system to at least:
generate a tree structure for a web site based on traffic information associated with web pages of the web site, the tree structure comprising nodes and branches, each node located at a branch and representing a web page;
generate a visualization of the traffic information based on recursive clustering of a plurality of geometric shapes that correspond to the nodes, wherein generate the visualization comprises:
determine a first cluster by identifying first web pages for the first cluster, the first cluster including first nodes from the tree structure, the first nodes representing the first web pages, the first web pages identified based on the traffic information indicating a first frequency of navigation between the first web pages, the first cluster having a first geometric shape that contains first geometric shapes representing the first web pages;
determine a second cluster by identifying second web pages for the second cluster, the second cluster including second nodes from the tree structure, the second nodes representing the second web pages, the second web pages identified based on the traffic information indicating a second frequency of navigation between the second web pages, the second cluster having a second geometric shape that contains second geometric shapes representing the second web pages, and
include the first geometric shape of the first cluster in the second geometric shape of the second cluster based on the traffic information indicating that navigation within the first web pages occurs more frequently than navigation between the first web pages and the second web pages; and
provide the visualization for display, wherein the visualization displays the second geometric shape of the second group as containing the geometric shape of the first group, displays the first geometric shapes in the first geometric shape, and displays the second geometric shapes in the second geometric shape.
Dependent claims: 7, 8, 9, 10
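One way to picture the recursive clustering in this claim is to merge pages at successively lower navigation-frequency thresholds, so that tightly linked pages end up in inner clusters that looser clusters contain. The sketch below is an assumption-laden illustration, not the claimed system: the name `nested_clusters`, the nested-list representation of the tree, and the single-link merge rule are all hypothetical choices.

```python
def nested_clusters(pages, freq, thresholds):
    """Build nested clusters from pairwise navigation frequencies.

    `freq` maps frozenset({page_a, page_b}) to a navigation frequency.
    Each item in the result is a page name (str) or a list of items
    (a cluster), so the nesting forms a tree.
    """
    def members(item):
        # Flatten an item into the page names it contains.
        if isinstance(item, str):
            return [item]
        return [page for child in item for page in members(child)]

    def linked(a, b, threshold):
        # Assumed rule: two items are linked if any page pair exceeds it.
        return any(
            freq.get(frozenset((p, q)), 0) > threshold
            for p in members(a)
            for q in members(b)
        )

    items = list(pages)
    # Highest threshold first: the most frequent navigation forms the
    # innermost clusters, which later passes wrap in outer clusters.
    for threshold in sorted(thresholds, reverse=True):
        groups = []
        for item in items:
            touching, rest = [], []
            for group in groups:
                hit = any(linked(item, other, threshold) for other in group)
                (touching if hit else rest).append(group)
            merged = [x for group in touching for x in group] + [item]
            groups = rest + [merged]
        # A group of one is left unwrapped; larger groups become clusters.
        items = [g[0] if len(g) == 1 else g for g in groups]
    return items
```

For pages "home", "products", "cart", and "help", with frequent navigation among the first three and only moderate navigation between "home" and "help", thresholds of 500 and 150 yield one outer cluster that contains the inner three-page cluster alongside "help".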
11. A computer-implemented method, comprising:
providing an interface configured to present traffic information associated with a plurality of web pages of a web site; and
causing the interface to present a visualization of the traffic information using a plurality of geometric shapes representative of the plurality of web pages, wherein:
two or more geometric shapes are presented in a first geometric shape of a same first group based on determining that traffic volume between two or more web pages corresponding to the two or more geometric shapes is larger than a first threshold,
a geometric shape is presented in a second geometric shape of a second group and different from the first geometric shape of the first group based on determining that traffic volume between a web page corresponding to the geometric shape and web pages corresponding to geometric shapes presented in the first geometric shapes of the first group is smaller than the first threshold,
the second geometric shape of the second group includes the first geometric shape of the first group based on the determining that traffic volume between web pages corresponding to geometric shapes presented in the second geometric shape of the second group and not in the first geometric shape of the first group and the web pages corresponding to the geometric shapes presented in the first geometric shape of the first group is larger than a second threshold, and
the visualization displays the second geometric shape of the second group as containing the first geometric shape of the first group and displays the plurality of geometric shapes.
Dependent claims: 12, 13, 14, 15, 16
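The containment relationship this claim describes (a second shape displayed as containing the first shape, which in turn contains page shapes) can be sketched as a simple layout pass. This is only an illustration under stated assumptions: rectangles are used as the geometric shapes, children split their parent's width equally rather than by traffic volume, and the names `layout` and `pad` are hypothetical.

```python
def layout(item, x, y, w, h, pad=4.0, out=None):
    """Assign a rectangle to every group and page in a nested grouping.

    `item` is a page name (str) or a list of items (a group). Returns a
    list of (name, x, y, width, height) tuples, parents before children,
    with every child rectangle inset inside its parent by `pad`.
    """
    if out is None:
        out = []
    name = item if isinstance(item, str) else "group"
    out.append((name, x, y, w, h))
    if not isinstance(item, str):
        # Split the padded interior into equal-width side-by-side slots.
        child_w = (w - pad * (len(item) + 1)) / len(item)
        for i, child in enumerate(item):
            child_x = x + pad + i * (child_w + pad)
            layout(child, child_x, y + pad, child_w, h - 2 * pad, pad, out)
    return out
```

Calling `layout([["home", "products"], "help"], 0, 0, 100, 50)` produces an outer rectangle, an inner group rectangle holding "home" and "products", and a "help" rectangle beside that inner group, all of which a rendering layer could then draw.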
Specification