technical: what might i mean by multipath? 1.via an overlay network 2.end-points are multi- homed,...

38
Technical: what might I mean by multipath? 1. Via an overlay network 2. End-points are multi- homed, and expose multiple IP addresses; routing works as it does now 3. End-systems set a path selector (e.g. 2 bits in the IP header), and multipath-aware routers use this to choose one of several available routes

Upload: emerson-cockrill

Post on 15-Dec-2015

214 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

Technical: what might I mean by multipath?

1. Via an overlay network

2. End-points are multi-homed, and expose multiple IP addresses; routing works as it does now

3. End-systems set a path selector (e.g. 2 bits in the IP header), and multipath-aware routers use this to choose one of several available routes

Page 2: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

Political: what can multipath achieve?

• It’s happening anyway, and we might as well make sure it happens right

• It will increase competition between network providers

• It allows end-users to signal what sort of service they want

• It increases the capacity of the network• End-user performance is protected against link

failures• Multihoming is a simple and elegant way to support

mobility and address aggregation• Routing tables can be smaller• Nice maths

Page 3: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

Mathematical models of multipath resource allocation

Damon Wischik, UCL

Partners from Trilogy

Damon Wischik, UCL

Partners from Trilogy

Page 4: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

4

• What is the capacity of a network in which traffic flows can be split over multiple paths?→ Generalized cut constraints

• What sort of rate-control algorithms might make full use of this capacity?→ Resource pooling; fairness and stability

• What routing support does there need to be, in order to make use of the capacity?→ Power of two choices

• Is there a conflict between routing at the end-systems, and traffic engineering in the network?→ Wardrop equilibrium

Page 5: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

I. Generalized cut constraints

“Resource pooling in queueing networks with dynamic routing”, Laws, 1992

“Loss Networks”, Kelly, 1991

Page 6: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

The multipath rate allocation problem

6

i2{1,2,3} are the flows

xi is the arrival rate of flow i

j2{1,2,3,4} are the resources

Cj is the capacity at resource j

p2{1,…,5} are the paths

Yp is the rate allocated to path p

For what arrival rates x is it possible to allocate rates y without overloading the resources?

Page 7: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

Generalized cut constraints

7

x1+x2 · C1

x1+x3 · C3

x3 · C4

x1+x2+x3· C1+C4

x1+2x3 · C3+C4

x1+x2· C1+C2

x1+2x3 · C3+C4

Page 8: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

Generalized cut constraints

8

x1+x2 · C1

x1+x3 · C3

x3 · C4

x1+x2+x3· C1+C4

x1+2x3 · C3+C4

x1+x2· C1+C2

x1+2x3 · C3+C4

a cut constraint

a generalized cut constraint

Page 9: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

9

Generalized cut constraints

• Theorem. The set X of admissible arrival rates can always be written

X = {x : ¸(k)¢x · ¹(k)¢C for all k}for a suitable set of variables ¸(k) and ¹(k)

• These constraints are called generalized cut constraints, corresponding to virtual resources

• Consider the linear program: “what is the least amount of extra capacity I need to add to be able to serve x?” The generalized cut constraints are the extreme points of the dual feasible set.

• There are algorithms for computing the generalized cut constraints

Page 10: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

Capacity region

x1

x2

x3

These plots show the capacity region, i.e. the set of admissible arrival rates.

Multipath will increase the capacity region. By how much? For which flows does multipath bring the greatest benefit?

Page 11: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

Questions

• Given Internet topology, what multipaths do we need to ensure that the generalized cut constraints are as broad as possible?– e.g. if each flow could have only two paths,

what should they be?– if you had complete flexibility, how big is the

admissible region?

Q

Page 12: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

II. Resource pooling

“State space collapse and diffusion approximation for a network operating under a proportional fair sharing policy”, Kang, Kelly, Lee, Williams

“Heavy traffic analysis of optimal scheduling algorithms for switched networks”, Shah, Wischik

Page 13: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

Example of resource pooling

The total capacity of the four links is 36, and it can be evenly shared between the flows, even though none of the flows can balance its load across all four links. This is called resource pooling.

When a link fails, the flows should automatically re-balance themselves so that service is maintained. Link failures will mean there is less path diversity, and this may break resource pooling.

Page 14: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

“No constraints but the network itself”

x1

x2

x3

What are the characteristics of a good rate allocation algorithm?

A good algorithm will have decent performance whenever the arrival rates are admissible.

It will not add extra constraints. It will only ‘see’ the generalized cut constraints.

x1+x2· C1+C2

x1+2x3 · C3+C4

Page 15: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

Revision of basic queueing theory

• Consider a M/G/1 processor sharing queue

• Jobs arise as a Poisson process, and job size distribution is arbitrary

• Let the utilization be ½

• Then the number of active jobs has a geometric distribution with mean ½/(1-½)

Page 16: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

A critically loaded network

x1

x2

x3

Let x1, x2, x3 be arrival rates for the arrival of TCP sessions, assumed to be Poisson arrivals. Let C1,…,C4 be link speeds divided by mean flow size.

Suppose that one generalized cut constraint is tight, while the other is not.

x1+x2 ¿ C1+C2

x1+2x3 ¼ C3+C4

Page 17: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

Virtual queues at virtual resources

x1

x2

x3

Define a virtual queue V as follows:• For arrivals of type 1, put in one token• Every arrival of type 3, put in two tokens• Serve this queue at rate C3+C4

Define the workload W for the critical constraint —• the number of jobs of type 1,• + 2 x number of jobs of type 3

Then W¸V. A perfect algorithm would achieve W=V, by ensuring that neither resource 3 nor resource 4 is underutilized whenever V is non-empty. If either of these becomes underutilized then W>V.

A modified TCP achieves W=V. In this case V has a geometric distribution, and there are

• ¼ [x1/(x1+2x3)] W jobs on route 1• ¼ 0 jobs on route 2• ¼ [x1/(x1+2x3)] W jobs on route 3

x1+x2 ¿ C1+C2

x1+2x3 ¼ C3+C4

Page 18: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

Resource pooling

x1

x2

x3

In effect, the system only ‘sees’ the virtual resources, and each of them behaves like a standard processor-sharing link. This is as good as possible.

To achieve this, whenever a virtual resource is non-empty none of the real resources that comprise it should go idle.

This is called resource pooling.

(In this example, resource pooling is a consequence of the fact that one virtual resource is heavily loaded. This is heavy traffic theory.)

Page 19: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

The multipath packet scheduling problem

19

Packets are sent on a route, chosen at the ingress. Servers have a choice of which routes to serve.

Should packets be held back until there is a clear path, or should they commit to a route straight away?

How should servers decide which packets to serve?

Q

Page 20: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

III. Fairness and stability of congestion control

“Overlay TCP for multi-path routing and congestion control”, Han, Shakkottai, Hollot, Srikant, Towsley, 2003.

“Stability of end-to-end algorithms for joint routing and rate control”, Kelly, Voice, 2005

Page 21: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

Unfairness of per-path congestion control

• When some flows have several paths available, and other flows do not, it is fairer to allocate capacity to users rather than to paths

• Therefore we should not use standalone TCP congestion control on each path individually

• In maths: we want the congestion control to solve a user-centric utility maximization problem

Page 22: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

Stability and responsiveness

• When a link fails, or when flows come and go, rates should rebalance gradually

• Otherwise the network will be susceptible to route flap / oscillations

• In maths: the dynamical system describing the algorithm should be stable

12

10

8

6

4

2

Page 23: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

IV. Power of two choices

“Queueing system with selection of the shortest of two queues: an asymptotic approach”, Vvedenskaya, Dobrushin, Karpelevich, 1996

“The power of two choices in randomized load balancing”, Mitzenmacher, 1996

Page 24: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

A simple example of load-balancing

Packets arrive as a random process. When they arrive they look at m queues, selected at random, and join the shorter queue

What is the resulting distribution of queue sizes?

Page 25: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

A simple example of load-balancing

Packets arrive as a random process. When they arrive they look at m queues, selected at random, and join the shorter queue

What is the resulting distribution of queue sizes?

Page 26: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

A simple example of load-balancing

The larger m is, the more uniform the queue sizes should be. Surprisingly, m=2 is enough to get most of the benefit.

If m=1 then Prob(Q>q)=½q

If m¸2 then Prob(Q>q)=½(mq-1)/(m-1)

(Here ½ is the average load, arrivals are Poisson, service is exponential.)

Out[118]=Out[117]= Out[155]=

m=1 m=2 m=5

A sample of queue sizes, for different values of m

Page 27: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

Two paths good…

• Recall resource pooling:This is what happens when all the real resources that make up a virtual resource are kept fully occupied as long as possible. It means that the system is operating at peak efficiency.

• Perhaps, if each flow could balance itself across two real resources in every virtual resource, then we will achieve resource pooling.

x1x2

x3

x1+x2· C1+C2

x1+2x3 · C3+C4

Page 28: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

…as long as they are really random choices

If the choices are structured, e.g. each packet selects from a set of size m, and the sets are chosen at random from some arbitrary balanced distribution, then we need m=log(#queues) to get load balancing

“Balls and bins with structure: balanced allocations on hypergraphs”, Godfrey, 2008

!

Page 29: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

Two choices, in a loss network

0 10 20 30 40 50 600 .0

0 .1

0 .2

0 .3

0 .4

0 .5

0 2000 4000 6000 8000 10 0000 .0

0 .1

0 .2

0 .3

0 .4

0 .5

Several parallel links.

Each call tries up to m links before giving up

A fully-connected graph.

Model 1. Each call tries the direct link, and if that is busy it fails

Model 2. Each call tries the direct link, and if that is busy it tries a 2-hop link

load

dropprob.

load

dropprob.

100%

100%

Page 30: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

Two choices, in a loss network

0 10 20 30 40 50 600 .0

0 .1

0 .2

0 .3

0 .4

0 .5

0 2000 4000 6000 8000 10 0000 .0

0 .1

0 .2

0 .3

0 .4

0 .5

• The benefit of two choices is only apparent in critically loaded systems(in a model with equal load and no failures)

• Maybe multipath is only really useful for coping with traffic surges and link failures

• It is important to make sure that the alternative paths do not consume too many resources

Page 31: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

Questions

• What traffic mix / routing will give us load balancing?– Maybe, if 50% of flows using a given virtual resource

have two choices of which real resource to use, this will be enough

– Is BitTorrent doing all this already?– How can we formulate this question properly?

• What is the relationship between heavy traffic resource pooling and the example of ‘power of two choices’?– How does ‘power of two choices’ work in networks,

i.e. not just an array of homogeneous resources?– One theory looks at the traffic matrix that leads to

resource pooling, the other looks at the routing choice that leads to resource pooling. How can we integrate these two?

Q

Page 32: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

V. Wardrop equilibrium

“Partially optimal routing”, Acemoglu, Johari, Ozdaglar, 2007

“How bad is selfish routing?”, Roughgarden, Tardos, 2002

Page 33: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

Wardrop equilibrium

• Suppose the total traffic x is made up of many users who each can choose how to split their traffic, and they make their choice selfishly so as to minimize their congestion cost

• Let the congestion cost at resource 1 be C1(y1+y3) etc. Let the congestion cost of a path be the sum of the congestion costs along the route

• Then, if some path has greater congestion cost than others, no one will send any traffic over that path

• The resulting allocation is called a Wardrop equilibrium

Page 34: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

Braess’s paradox

• Braess discovered a situation where by removing resource 3 we end up with lower congestion costs for all users

• This is called Braess’s paradox

• It is not an uncommon problem!

10z

10z

50+z

50+z

6z

each path costs 92

Page 35: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

Braess’s paradox

• Braess discovered a situation where by removing resource 3 we end up with lower congestion costs for all users

• This is called Braess’s paradox

• It is not an uncommon problem!

10z

10z

50+z

50+z

6z

each path costs 83

Page 36: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

Generalized Braess’s paradox

• Suppose an AS owns the middle of the network, and does load-balancing

• This will decrease the net congestion cost of traversing the middle of the network

• This may INCREASE the average congestion costs to users, by an extension of Braess’s paradox

• The relative cost C(selfish users & load-balancing)/C(selfish users)

can be arbitrarily large

• Plus other perverse outcomes...

Page 37: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

Questions

• Why do these problems not show up in the earlier model of capacity constraints? Is it because the model of congestion costs is fishy?

• The difference seems to be to do with congestion signals: the signals to which users respond are in this case not in alignment with the costs.

• It may be that the network has much more routing flexibility than do individual users. Does this lead to a different answer? How can we fit it into the model?

Q

Page 38: Technical: what might I mean by multipath? 1.Via an overlay network 2.End-points are multi- homed, and expose multiple IP addresses; routing works as it

More questions

• Are there appropriate ways of signalling congestion, such that user and network incentives align? For example, if BitTorrent responded appropriately, could it do a much better job of traffic engineering?

• Multipath should lead to more competition, which should force prices closer to costs. What are appropriate models of this? Is P2P already enough to force competition?

Q