Latency-based routing and load balancing in a network
First Claim
Patent Images
1. A method, comprising:
- receiving, at a first network node, data indicative of a plurality of relative lag times for instantiation of a specific content item at a first plurality of network nodes, the specific content item being common for each network node of the first plurality of network nodes;
receiving, at the first network node, network performance data related to servicing a request for the specific content item for each network node of a second plurality of network nodes;
receiving a request for the specific content item from at least one network node of the second plurality of network nodes; and
selecting a second network node of the first plurality of network nodes to service the request for the specific content item, the second network node being configured based at least on the received network performance data and a first lag time for instantiation of the specific content item at the second network node with respect to a second lag time for instantiation of the specific content item at at least one other network node of the first plurality of network nodes.
3 Assignments
0 Petitions
Accused Products
Abstract
Aspects of the disclosure relate to routing queries to a network repository and load balancing in a network. For a network repository having a plurality of content storage sites, relative replication latency of data among a pair of content storages sites can be monitored. Data indicative of such replication latency can be distributed among the content storage sites and can be provided, for example, to a network node in a system layer. A traffic and control manager can determine routing pathways for queries based at least in part on the relative replication latency data and performance conditions of network nodes.
-
Citations
19 Claims
-
1. A method, comprising:
-
receiving, at a first network node, data indicative of a plurality of relative lag times for instantiation of a specific content item at a first plurality of network nodes, the specific content item being common for each network node of the first plurality of network nodes; receiving, at the first network node, network performance data related to servicing a request for the specific content item for each network node of a second plurality of network nodes; receiving a request for the specific content item from at least one network node of the second plurality of network nodes; and selecting a second network node of the first plurality of network nodes to service the request for the specific content item, the second network node being configured based at least on the received network performance data and a first lag time for instantiation of the specific content item at the second network node with respect to a second lag time for instantiation of the specific content item at at least one other network node of the first plurality of network nodes. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A method, comprising:
-
providing, by a first network node, control signaling among the first network node and a first plurality of network nodes configured in a data replication topology in a data layer; receiving, by the first network node, data indicative of a plurality of relative replication latencies for instantiation of a specific data item at at least two of the first plurality of network nodes in response to the control signaling; generating, by the first network node, a data structure having the data indicative of the plurality of relative replication latencies for instantiation of the specific data item; receiving a request for the specific data item from at least one network node of a second plurality of network nodes; and selecting a second network node of the first plurality of network nodes to service the request for the specific data item based at least on accessing the data structure to select the second network node of the first plurality of network nodes based on a first relative replication latency for instantiation of the specific data item at the second network node of the first plurality of network nodes with respect to a second relative replication latency for instantiation of the specific content at at least one other network node of the first plurality of network nodes. - View Dependent Claims (7, 8, 9, 10, 11)
-
-
12. A system, comprising:
-
a first plurality of network nodes in an application layer; a second plurality of network nodes in a data layer, wherein at least one network node in the first plurality of network nodes is configured to provide data indicative of a plurality of relative replication latencies of a specific data item at the second plurality of network nodes in the data layer; and a first network node of the first plurality of network nodes in the application layer, the first network node being configured to; receive network performance data of at least one network node of the second plurality of network nodes to service a request for the specific data item, receive a request for the specific data item from at least one network node of the first plurality of network nodes, and in response to the network performance data of at least one network node of the second plurality of network nodes being below a threshold, select a second network node of the second plurality of network nodes to service the request for the specific data item based at least on a first relative replication latency for instantiation of the specific data item at the second network node of the second plurality of network nodes with respect to a second relative replication latency for instantiation of the specific content at at least one other network node of the second plurality of network nodes. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19)
-
Specification