ultra low queuing delay for all applications · pdf fileultra low queuing delay for all...

29
1 Ultra Low Queuing Delay for ALL Applications http://riteproject.eu/dctth IETF Journal: http://www.internetsociety.org/publications/ietf-journal-november-2015/ultra-low- delay-for-all Koen De Schepper , Inton Tsang . Olga Bondarenko, Bob Briscoe . [email protected] November, 2015

Upload: truongtu

Post on 08-Mar-2018

222 views

Category:

Documents


3 download

TRANSCRIPT

Page 1: Ultra Low Queuing Delay for ALL Applications · PDF fileUltra Low Queuing Delay for ALL Applications ... Alcatel-Lucent 7750 RGW Mark: 20% ... (bits) • L4S threshold T

1

Ultra Low Queuing Delay for ALL Applications

http://riteproject.eu/dctth

IETF Journal: http://www.internetsociety.org/publications/ietf-journal-november-2015/ultra-low-

delay-for-all

Koen De Schepper, Inton Tsang .

Olga Bondarenko, Bob Briscoe .

[email protected]

November, 2015

Page 2: Ultra Low Queuing Delay for ALL Applications · PDF fileUltra Low Queuing Delay for ALL Applications ... Alcatel-Lucent 7750 RGW Mark: 20% ... (bits) • L4S threshold T

2

Super-Fast Internet?

Fast Devices with

Interactive Applications

Fiber-Fast Low Latency Networks

Nearby Data Centers

with Huge Processing Power

Page 3: Ultra Low Queuing Delay for ALL Applications · PDF fileUltra Low Queuing Delay for ALL Applications ... Alcatel-Lucent 7750 RGW Mark: 20% ... (bits) • L4S threshold T

3

Super-Fast User Experience ???

Fast Devices with

Interactive Applications Nearby Data Centers

with Huge Processing Power

Old Timer: Classic TCP Congestion Control

Q Q

Fiber-Fast Low Latency Networks

Q Q

Page 4: Ultra Low Queuing Delay for ALL Applications · PDF fileUltra Low Queuing Delay for ALL Applications ... Alcatel-Lucent 7750 RGW Mark: 20% ... (bits) • L4S threshold T

4

Better TCP Exists, but is not compatible with the current Internet

Fast Devices with

Interactive Applications Nearby Data Centers

with Huge Processing Power

Old Timer: Classic TCP Congestion Control

Q Q

Fiber-Fast Low Latency Networks

Q Q

L4S: Low Loss, Low Latency & Scalable TCP CC

Page 5: Ultra Low Queuing Delay for ALL Applications · PDF fileUltra Low Queuing Delay for ALL Applications ... Alcatel-Lucent 7750 RGW Mark: 20% ... (bits) • L4S threshold T

5

We have high bandwidth

­ but not predictable high speed & low latency

We will demonstrate the cause:

­ The presence of ‘classic’ TCP (Reno and Cubic)

­ The solution is ‘scalable’ TCP

Video link: <tbd>

Comparison of Classic TCP and Scalable TCP

Cloud-hosted Interactive Panoramic Video

Page 6: Ultra Low Queuing Delay for ALL Applications · PDF fileUltra Low Queuing Delay for ALL Applications ... Alcatel-Lucent 7750 RGW Mark: 20% ... (bits) • L4S threshold T

6

Demo:

­ real broadband network

­ real base round trip (7ms)

­ heavy background traffic (6 parallel file downloads)

­ DCTCP is available on Linux and Windows Server, used ‘as is’

Not Diffserv

­ not QoS at the expense of other traffic

­ every packet of every app: ultra-low delay

­ only IP header identifier, no transport header, no DPI

­ only 2 queues, no FQ (queue per flow)

Also if all TCP traffic is scalable TCP

­ there are no quality problems

Comparison of Classic TCP and Scalable TCP

Cloud-hosted Interactive Panoramic Video

Page 7: Ultra Low Queuing Delay for ALL Applications · PDF fileUltra Low Queuing Delay for ALL Applications ... Alcatel-Lucent 7750 RGW Mark: 20% ... (bits) • L4S threshold T

7

Comparison of Classic TCP and a Scalable TCP

Cloud-hosted Interactive Panoramic Video

Rendered Views

Requested Views

Rendered Views

Requested Views

DCTCP

Cubic

DCTCP

Cubic

Dual Queue

Coupled AQM

40Mbps

RTT=7ms

DSLAM SR/BNG SR

Panoramic 7K video

• 30 Mbps to stream

• single view = 4 Mbps

Interaction NW latencies:

• DCTCP: 7 … 10 msec

• Cubic : 7 … 100 msec

+ File Downloads

Page 8: Ultra Low Queuing Delay for ALL Applications · PDF fileUltra Low Queuing Delay for ALL Applications ... Alcatel-Lucent 7750 RGW Mark: 20% ... (bits) • L4S threshold T

8

We're not asking DISPATCH to dispatch something

We'll further show/explain the benefits to applications

But this involves

­ 3 wgs in the transport area

­ changes to hosts and network (bottlenecks) before it will bring benefits

Our question for DISPATCH, as 'customers' of the transport area:

­ Are the benefits useful?

­ Enough to overcome the chicken and egg deployment problem?

Why DISPATCH

Page 9: Ultra Low Queuing Delay for ALL Applications · PDF fileUltra Low Queuing Delay for ALL Applications ... Alcatel-Lucent 7750 RGW Mark: 20% ... (bits) • L4S threshold T

9

L4S-IP Classifier

Priority scheduler

L4S

Classic

Mark

Drop/ Mark

Coupled AQM needed for Compatibility

Simple Solution in the network:

DualQ AQM provides Low Latency Marking and Compatibility

Compatible to Scalable

p

Scalable-only AQM is very simple

Allows Independent Migration

Page 10: Ultra Low Queuing Delay for ALL Applications · PDF fileUltra Low Queuing Delay for ALL Applications ... Alcatel-Lucent 7750 RGW Mark: 20% ... (bits) • L4S threshold T

10

Data Centre

Demo on a Real BB Residential Testbed

xDSL

RTT = 7ms

Alcatel-Lucent 7302

Alcatel-Lucent 7750

VPRN

VLAN be

Alcatel-Lucent 7750

Alcatel-Lucent 7750

Alcatel-Lucent 7750

RGW

Mark:

20%

Drop:

4%

DualQ Coupled AQM

Utilization: 100%

File downloads 4

Web browsing [Items/s]

* 0 10 100

File downloads 2

Web browsing [Items/s]

* 0 10 100

DCTCP Client

Cubic Client

6,6Mbps

6,6Mbps

Cubic DCTCP

BNG

All values are measured from a live traffic capture

Page 11: Ultra Low Queuing Delay for ALL Applications · PDF fileUltra Low Queuing Delay for ALL Applications ... Alcatel-Lucent 7750 RGW Mark: 20% ... (bits) • L4S threshold T

11

Demo on a Real BB Residential Testbed

Page 12: Ultra Low Queuing Delay for ALL Applications · PDF fileUltra Low Queuing Delay for ALL Applications ... Alcatel-Lucent 7750 RGW Mark: 20% ... (bits) • L4S threshold T

12

Improved Web browsing experience

1 second timeout on lost connection setup packets

200ms timeout on lost last packets

Page 13: Ultra Low Queuing Delay for ALL Applications · PDF fileUltra Low Queuing Delay for ALL Applications ... Alcatel-Lucent 7750 RGW Mark: 20% ... (bits) • L4S threshold T

13

HTTP Adaptive Streaming (HAS) Experiments

Linux DCTCP

Linux Reno

AQM Server

PIE/FQ-Codel/DualQ

DSLAM SR/BNG SR Windows DCTCP/Ctcp

File downloads 4

Web browsing [Items/s]

* 0 10 100

Linux DCTCP Client

6,6Mbps

File downloads 2

Web browsing [Items/s]

* 0 10 100

Linux Reno Client

6,6Mbps

HAS Smooth Streaming

Windows DCTCP/Ctcp Client

20Mbps

RTT=20ms

Page 14: Ultra Low Queuing Delay for ALL Applications · PDF fileUltra Low Queuing Delay for ALL Applications ... Alcatel-Lucent 7750 RGW Mark: 20% ... (bits) • L4S threshold T

14

0

500

1000

1500

2000

Qu

alit

y [k

bp

s]

PIE: Ctcp + 10 Reno

HTTP Adaptive Streaming (HAS) Experiments

DualQ: DCTCP + 10 DCTCP FQ-Codel: Ctcp + 10 Reno

0

2000

4000

6000

Bit

rate

[kb

ps]

0

2000

4000

6000

0 200 400 600

Bit

rate

[kb

ps]

time

0 200 400 600

time

0 200 400 600

time

HAS

10 D

ow

nlo

ads

HAS

Page 15: Ultra Low Queuing Delay for ALL Applications · PDF fileUltra Low Queuing Delay for ALL Applications ... Alcatel-Lucent 7750 RGW Mark: 20% ... (bits) • L4S threshold T

15

Using Just a Scalable TCP connection

for Real-Time and Interactive Services?

Promising results

Prototyping / Evaluation by Applications

­ DCTCP is available in Linux and Windows Server

­ Collaboration

­ Try it for your application…

Focus on Application

Not on Transport Functionality

[email protected]

[email protected]

Page 16: Ultra Low Queuing Delay for ALL Applications · PDF fileUltra Low Queuing Delay for ALL Applications ... Alcatel-Lucent 7750 RGW Mark: 20% ... (bits) • L4S threshold T

16

­ Are the benefits useful?

­ Enough to overcome the chicken and egg deployment problem?

­ Support for adoption in the Transport Area?

Questions

Page 17: Ultra Low Queuing Delay for ALL Applications · PDF fileUltra Low Queuing Delay for ALL Applications ... Alcatel-Lucent 7750 RGW Mark: 20% ... (bits) • L4S threshold T

17

Backup

Page 18: Ultra Low Queuing Delay for ALL Applications · PDF fileUltra Low Queuing Delay for ALL Applications ... Alcatel-Lucent 7750 RGW Mark: 20% ... (bits) • L4S threshold T

18

Front-end

Server DCTCP

Back-end Server

Back-end Server

Back-end Server

RGW

DCTCP = Low Loss, Low Latency, and Throughput Scalability (L4S)

Reno

Cubic

Reno/Cubic

Reno

Cubic

Cloud Access Home

Large queues for high throughput and low drop

= Poor Latency

= Bad for interactive applications

ECN = No drop

ECN++ = Small queues

& Low latency & High throughput

Page 19: Ultra Low Queuing Delay for ALL Applications · PDF fileUltra Low Queuing Delay for ALL Applications ... Alcatel-Lucent 7750 RGW Mark: 20% ... (bits) • L4S threshold T

19

DCTCP

RGW

DCTCP to the HOME ?

DCTCP available on Windows Server and Linux 3.18

used internally in the data center

Windows and Linux 3.18 have DCTCP implementations ready

Cloud Access Home

Cubic

Reno

DCTCP

DCTCP

Page 20: Ultra Low Queuing Delay for ALL Applications · PDF fileUltra Low Queuing Delay for ALL Applications ... Alcatel-Lucent 7750 RGW Mark: 20% ... (bits) • L4S threshold T

20

What can be done in the Network?

Small Queues

AQM

Low Utilization

Low RTT Fairness

High Drop

No Burst Resilience

Lower Latency ?

Very High Link Speeds

Lots of Memory in Qs

Very Low Drop

High Fairness Variations

Very Slow Up to Speed

Lower Latency ?

No Solution in the Network only

Page 21: Ultra Low Queuing Delay for ALL Applications · PDF fileUltra Low Queuing Delay for ALL Applications ... Alcatel-Lucent 7750 RGW Mark: 20% ... (bits) • L4S threshold T

21

What can be done in the End-Systems?

Fix in TCP ?

High Utilization

High RTT Fairness

Reliable Stable Throughput

No Drop

Burst Resilient

Fast Up to Speed

Very Lower Latency !

Not compatible with Classic TCP

Need for Network support

Fix in Applications ? Same constraints

No Solution in the End-Systems only

Page 22: Ultra Low Queuing Delay for ALL Applications · PDF fileUltra Low Queuing Delay for ALL Applications ... Alcatel-Lucent 7750 RGW Mark: 20% ... (bits) • L4S threshold T

22

IP-ECN Classifier

Strict priority

scheduler

L4S

Classic

Dual Queue Coupled AQM

Concept 1: DualQ

DualQ to preserve low latency for L4S traffic

Page 23: Ultra Low Queuing Delay for ALL Applications · PDF fileUltra Low Queuing Delay for ALL Applications ... Alcatel-Lucent 7750 RGW Mark: 20% ... (bits) • L4S threshold T

23

ECN Classifier

Strict priority

scheduler

L4S

Classic

ECN marker

Drop

Coupled AQM to control priority traffic

Dual Queue Coupled AQM

Concept 2: Coupled AQM

p

DualQ to preserve low latency for L4S traffic

Page 24: Ultra Low Queuing Delay for ALL Applications · PDF fileUltra Low Queuing Delay for ALL Applications ... Alcatel-Lucent 7750 RGW Mark: 20% ... (bits) • L4S threshold T

24

ECN Classifier

Strict priority

scheduler

L4S

Classic

ECN marker

Drop

Coupled AQM for equal rate

Dual Queue Coupled AQM

Concept 3: Don’t Think Twice to mark

p

DualQ to preserve low latency for L4S traffic Mark if rand() < p

Drop if max(rand(), rand()) < p

pr 1

pr 1

Think twice to drop

L4S (DCTCP)

Classic (Reno / Cubic)

DON’T Think twice to mark

Page 25: Ultra Low Queuing Delay for ALL Applications · PDF fileUltra Low Queuing Delay for ALL Applications ... Alcatel-Lucent 7750 RGW Mark: 20% ... (bits) • L4S threshold T

25

&& =

2

2

k

LC

pp

4 parameters:

• L4S slope SL (bits)

• L4S threshold T (Q size)

• Classic slope SC (bits)

• EWMA value f (bits)

k= SL - SC QC=(qC-QC)

>>f+qC

ECN Classifier

Strict priority

scheduler

pL

pC

L4S

Classic

ECN marker

(QC<<SC) > max(R1, R2)

Drop

R1 R2

qL>T

qC

(qC<<SL) > R

R

qL

Coupling:

Detailed Implementation

2 Details L4S AQM if no Classic traffic

Classic smoothing

NO SMOOTHING for L4S

Page 26: Ultra Low Queuing Delay for ALL Applications · PDF fileUltra Low Queuing Delay for ALL Applications ... Alcatel-Lucent 7750 RGW Mark: 20% ... (bits) • L4S threshold T

26

Home

Data Centre

Demo on a Real BB Residential Testbed

xDSL

RTT = 7ms

Alcatel-Lucent 7302

Alcatel-Lucent 7750

VPRN

VLAN be

RED, PIE, FQ_Codel, DualQ

AQM Server

40Mbps Shaper

Alcatel-Lucent 7750

Alcatel-Lucent 7750

Alcatel-Lucent 7750

RGW Cubic

DCTCP Cubic DCTCP

BNG

Page 27: Ultra Low Queuing Delay for ALL Applications · PDF fileUltra Low Queuing Delay for ALL Applications ... Alcatel-Lucent 7750 RGW Mark: 20% ... (bits) • L4S threshold T

27

Opportunity to support a new TCP CC family

Classic Congestion Controller Family

• Reno with

• X RTTs per drop event

­ for Reno: X=45 RTTs on 40Mbps 20ms

X= 5600 RTTs on 1Gbps 100ms

gets worse in the future (also for Cubic)

• Designed for Drop based networks in the 80’

• Was known not to scale to higher throughputs

L4S Congestion Controller Family

• L4S with

• C marked packets per RTT

­ (C=2 for DCTCP)

frequent feedback is better control!

• Design now for the future, using ECN better,

to provide low loss, low latency and scalability

• DCTCP is big step forward and can be

improved with incremental evolution

pr 1 pr 1

Page 28: Ultra Low Queuing Delay for ALL Applications · PDF fileUltra Low Queuing Delay for ALL Applications ... Alcatel-Lucent 7750 RGW Mark: 20% ... (bits) • L4S threshold T

28

QUEUE SIZE AT DEQUEUE

1 TCP RENO FLOW (STEADY STATE)

Average Q size

p

0 50 100

q size [packets]

dctcp

reno

Pdf [%]

42

36

30

24

18

12

6

0

12

10

8

6

4

2

0

Pdf in 1s interval [%]

Page 29: Ultra Low Queuing Delay for ALL Applications · PDF fileUltra Low Queuing Delay for ALL Applications ... Alcatel-Lucent 7750 RGW Mark: 20% ... (bits) • L4S threshold T

29

QUEUE SIZE AT DEQUEUE

1 DCTCP FLOW (STEADY STATE)

0 50 100

q size [packets]

dctcp

reno

Pdf [%]

42

36

30

24

18

12

6

0

42

36

30

24

18

12

6

0

Pdf in 1s interval [%]

Instant Q size

p