scaling the s in sdn at azure 2016 - albert...model across multiple asics hardware/software...

19
Scaling the S in SDN at Azure Albert Greenberg Distinguished Engineer & Director of Engineering Microsoft Azure Networking

Upload: others

Post on 02-Jun-2020

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Scaling the S in SDN at Azure 2016 - Albert...model across multiple ASICs Hardware/software disaggregation. Independent evolution of hardware and software. OCP Contribution March 2016

Scaling the S in SDN at Azure

Albert Greenberg

Distinguished Engineer & Director of Engineering

Microsoft Azure Networking

Page 2: Scaling the S in SDN at Azure 2016 - Albert...model across multiple ASICs Hardware/software disaggregation. Independent evolution of hardware and software. OCP Contribution March 2016

2013: SDN in the Cloud2014: Host SDN and Express Route

2015: Contoller and SmartNIC

Coming soon: Containers and SONiC

ExpressRoute Partners

Page 3: Scaling the S in SDN at Azure 2016 - Albert...model across multiple ASICs Hardware/software disaggregation. Independent evolution of hardware and software. OCP Contribution March 2016

SDN in Windows Server 2016 Apps

Azure Resource Manager Azure Resource Manager

MICROSOFT AZUREAZURE STACK

Consistency

Bringing Cloud-scale Technology to Enterprise on Premises with Windows Server 2016

Create Virtual Networks, Load Balancers, Security Groups, and more from Azure in Private Cloud

Supports Future Protocols – OVSDB, VxLAN for open integration of control + data plane, using VFP, Azure’s Virtual Switch

Page 4: Scaling the S in SDN at Azure 2016 - Albert...model across multiple ASICs Hardware/software disaggregation. Independent evolution of hardware and software. OCP Contribution March 2016

What’s New In Data Plane?

Improving Scale

Page 5: Scaling the S in SDN at Azure 2016 - Albert...model across multiple ASICs Hardware/software disaggregation. Independent evolution of hardware and software. OCP Contribution March 2016

PacketDirect

Hos

t

PacketDirect Client (VFP, virtual appliance, …)

NIC - PacketDirect Provider

Q1 Q2

CPU

PD Buffers managed by PD client

PacketDirect Platform

CPUs managed by PD client

Queues managed by PD client

CPUCPU CPU

New Windows direct I/O model for accelerating virtual networking and NFV

Port of VFP (Azure Virtual Switch) to PD: • Up to 2x improvement in

throughput/core for heavy VNET workloads

• Up to 4x PPS improvement E2ESupported on major merchant silicon NICs

Initially for public cloud, coming to Windows Server 2016 soon!

Page 6: Scaling the S in SDN at Azure 2016 - Albert...model across multiple ASICs Hardware/software disaggregation. Independent evolution of hardware and software. OCP Contribution March 2016

HW is needed for scale, perf, and COGS at 40G+ • Multi-year ASIC cycle too slow • Need agility and serviceability

– SDN • SmartNIC combines agility of

SDN with speed+COGS of HW

Roll out Hardware as we do Software!

SmartNIC – Bringing the Flexibility and Serviceability of the Cloud to Hardware

Blade

SmartNIC

NIC ASIC

FPGA

CPU

ToR

Bump in the Wire: Reconfigurable FPGA +

NIC ASIC

Page 7: Scaling the S in SDN at Azure 2016 - Albert...model across multiple ASICs Hardware/software disaggregation. Independent evolution of hardware and software. OCP Contribution March 2016

What’s New In Control Plane?

SDN for Containers

Page 8: Scaling the S in SDN at Azure 2016 - Albert...model across multiple ASICs Hardware/software disaggregation. Independent evolution of hardware and software. OCP Contribution March 2016

Containers Improve Density

Container 1 Container 2 Container 3

VM OS files and libraries

Container 4 Container 5 Container 6

10X utilization: 100s of VMs per host to 1000s of Containers per VM, each with its own network

VM 1 VM 2

Host Host

Page 9: Scaling the S in SDN at Azure 2016 - Albert...model across multiple ASICs Hardware/software disaggregation. Independent evolution of hardware and software. OCP Contribution March 2016

Goals for Container ManagementSpeed • Instantly provision container

network

Scale • 100s of 1000s of containers in same

virtual network

Seamless DevOps • Common APIs for VMs and Containers

Page 10: Scaling the S in SDN at Azure 2016 - Albert...model across multiple ASICs Hardware/software disaggregation. Independent evolution of hardware and software. OCP Contribution March 2016

Open SDN Solution for Containers

Principles • Consistent solution: Works across VMs

and containers • Cross platform: Works on Linux and

Windows • Open API: Works with Mesos, Docker,

Chronos, KubernetesHow: New primitives • Network namespace isolation extended

to VFP: tremendous scale without being limited by OS/port limitations

• SDN policies: ACLs, IPAM, Routes, QOS …extended to containers

Page 11: Scaling the S in SDN at Azure 2016 - Albert...model across multiple ASICs Hardware/software disaggregation. Independent evolution of hardware and software. OCP Contribution March 2016

Open SDN for Containers in Public Cloud

NAT-based Container Network Underlay Container Network

VM

Container 1 Container 2

Bridge

eth0 Eth0172.16.0.2 172.16.0.3

172.16.0.1 bridge0

vswitch

Eth010.0.0.1

Container policies

VM policies

VM (Linux / Windows)

Container 1 Container 2

Bridge

eth0

Veth veth

eth010.0.0.2 10.0.0.3

bridge0

VFP

Eth010.0.0.1

namespace

NATvethveth

namespace

Container +

VM policies

Page 12: Scaling the S in SDN at Azure 2016 - Albert...model across multiple ASICs Hardware/software disaggregation. Independent evolution of hardware and software. OCP Contribution March 2016

On Prem Azure

GW

DemoNetwork Controller

Windows VM (Media Client)

Ubuntu VM

Container 1 (PHP server)

Ubuntu VM

Container 2 (PHP server)

Page 13: Scaling the S in SDN at Azure 2016 - Albert...model across multiple ASICs Hardware/software disaggregation. Independent evolution of hardware and software. OCP Contribution March 2016

What’s New In Physical Network?

Software for Open Networking in the Cloud (SONiC)

Page 14: Scaling the S in SDN at Azure 2016 - Albert...model across multiple ASICs Hardware/software disaggregation. Independent evolution of hardware and software. OCP Contribution March 2016

GoalsVelocity • Feature agility weekly • Bug mitigation daily

Serviceability • Modular, fine-grained hitless upgrades

and restarts • .deb/.rpm/container preferred over

reload OS image

Reliability • Improvements with zero customer

impact.

Page 15: Scaling the S in SDN at Azure 2016 - Albert...model across multiple ASICs Hardware/software disaggregation. Independent evolution of hardware and software. OCP Contribution March 2016

OCP Contribution Dec 2014

OCP Acceptance July 2015

Common API and behavior model across multiple ASICs

Hardware/software disaggregation.

Independent evolution of hardware and software.

OCP Contribution March 2016

Provides L2/L3 functionalities ready for cloud

Loosely-coupled modular design

Separation of states and logic

SAI SONiC

Page 16: Scaling the S in SDN at Azure 2016 - Albert...model across multiple ASICs Hardware/software disaggregation. Independent evolution of hardware and software. OCP Contribution March 2016

SONiC Completes the Open-Source Switch Stack

Chassis Supplier

ASIC Supplier

Linux

OCP

Various

ASIC

Switch Hardware

TransceiverFan/Power/LED

Network Applications

ASIC SDK

SAI

Ker

nel

Use

r Spa

ceSwitch State Service

Network device drivers

netdev ASIC PCI driverHW

Peripheral drivers

PAL - sysfs

SDIPAS

Object Library

Object Library

Linux

Page 17: Scaling the S in SDN at Azure 2016 - Albert...model across multiple ASICs Hardware/software disaggregation. Independent evolution of hardware and software. OCP Contribution March 2016

SONiC Commercial Switch

Quagga -> Gobgpd Vendor BGP

BGP session

SONiC Demo: Servicing BGP

Receiver

Sender

Page 18: Scaling the S in SDN at Azure 2016 - Albert...model across multiple ASICs Hardware/software disaggregation. Independent evolution of hardware and software. OCP Contribution March 2016

Unlocks Simpler, Best in Breed TechnologyEnables Ecosystem • Old Vs. New techniques

• Monitoring: Push Vs. Pull • Identity: AD Vs. TACACS • Configuration: Ansible Vs.

CLIMonitoring example • SNMP pulling not very efficient • Mapping counters to MIB and

back? • Streaming: faster, direct • Marching towards direct cloud

integration

Page 19: Scaling the S in SDN at Azure 2016 - Albert...model across multiple ASICs Hardware/software disaggregation. Independent evolution of hardware and software. OCP Contribution March 2016

Want to be a part of Scaling the S in SDN? We’re Hiring!

[email protected]