SUMMARIZATION AND COMMUNICATION OF LARGE DATA SETS
First Claim
1. A computer-implemented method, comprising:
- generating a structure for a plurality of documents hosted on a network-based resource, the structure comprising a plurality of nodes representing the plurality of documents;
determining a traffic pattern between a first document, a second document, and a third document of the plurality of documents, the traffic pattern determined based on traffic information associated with the network-based resource;
grouping, based on the traffic pattern, a first node representing the first document and a second node representing the second document in a first group;
grouping, based on the traffic pattern, the first group and a third node representing the third document in a second group; and
displaying graphical indications in a user interface to indicate the traffic pattern by indicating the grouping of the first node and second node in the first group and by indicating the grouping of the first group and third node in the second group.
2 Assignments
0 Petitions
Accused Products
Abstract
Techniques for providing information about large data sets may be provided. For example, a summary of the data sets and of patterns between the data sets may be presented. Traffic associated with a network-based resource that includes a number of documents may be an example of large data sets. The traffic may be analyzed and traffic patterns may be determined. A structure may be generated based on the traffic patterns and may use nodes to represent the documents. Further, a visualization of the structure may be presented. The visualization may include recursive clusters of the nodes, where the clusters may be labeled based on the respective clustered nodes.
-
Citations
20 Claims
-
1. A computer-implemented method, comprising:
-
generating a structure for a plurality of documents hosted on a network-based resource, the structure comprising a plurality of nodes representing the plurality of documents; determining a traffic pattern between a first document, a second document, and a third document of the plurality of documents, the traffic pattern determined based on traffic information associated with the network-based resource; grouping, based on the traffic pattern, a first node representing the first document and a second node representing the second document in a first group; grouping, based on the traffic pattern, the first group and a third node representing the third document in a second group; and displaying graphical indications in a user interface to indicate the traffic pattern by indicating the grouping of the first node and second node in the first group and by indicating the grouping of the first group and third node in the second group. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A system for providing traffic information associated with a web site, comprising:
-
a processor; a memory communicatively coupled to the processor and bearing instructions that, upon execution by the processor, cause the system to at least; generate a tree structure for a web site based on traffic information associated with web pages of the web site, the tree structure comprising nodes and branches, each node located at a branch and representing a web page; determine a first cluster by identifying, based on the traffic information, first web pages for the first cluster, the first cluster including first nodes from the tree structure, the first nodes representing the first web pages; and determine a second cluster by identifying, based on the traffic information, second web pages for the second cluster, the second cluster including second nodes from the tree structure, the second nodes representing the second web pages, wherein the first cluster and the second cluster indicate that navigation within the first web pages occurs more frequently than navigation between the first web pages and the second web pages. - View Dependent Claims (8, 9, 10, 11)
-
-
12. A computer-implemented method, comprising:
-
providing an interface configured to present traffic information associated with a plurality of web pages of a website; and causing the interface to present the traffic information using a plurality of nodes representative of the plurality of web pages, wherein; two or more nodes are presented in a same first group based on determining that traffic volume between two or more web pages corresponding to the two or more nodes is larger than a first threshold, and a node is presented in a second group different from the first group based on determining that traffic volume between a web page corresponding to the node presented in the second group and web pages corresponding to nodes presented in the first group is smaller than the first threshold. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20)
-
Specification