troubleshooting for intent-based...

37
Troubleshooting for Intent-based Networking Joon-Myung Kang and Mario A. Sánchez Hewlett Packard Labs Open Networking Summit 2017

Upload: ngodat

Post on 22-Jul-2018

222 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

Troubleshooting for Intent-based NetworkingJoon-Myung Kang and Mario A. SánchezHewlett Packard Labs

Open Networking Summit 2017

Page 2: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

Intent-based Networking

Policy Graph Abstraction and Demo

Troubleshooting and Demo

QnA

2

Page 3: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

Software-Defined Networking

Application Plane(SDN Apps)

Control Plane(OpenDaylight, ONOS, etc.)

Infrastructure (Data) Plane(Cloud/IT/SDN/NFV)

Open APIs

Program Languages

Abstraction

SDN Northbound Interfaces

Infrastructure Control InterfacesVendor specific

Low-level specificsManual operations

3

Page 4: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

Software-Defined Networking

Application Plane(SDN Apps)

Control Plane(OpenDaylight, ONOS, etc.)

Infrastructure (Data) Plane(Cloud/IT/SDN/NFV)

Open APIs

Program Languages

Abstraction

SDN Northbound Interfaces

Infrastructure Control InterfacesVendor specific

Low-level specificsManual operations

4

Page 5: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

Intent-based Networking

Application Plane(SDN Apps)

Control Plane(OpenDaylight, ONOS, etc.)

Infrastructure (Data) Plane(Cloud/IT/SDN/NFV)

INTENT North Bound Interface

Infrastructure Control Interfaces

− Application Plane says “What” (doesn’t care how)− Control Plane reasons “How” (doesn’t care why)

Intent− “what”, not “how” (non-prescriptive)− Is portable− Is universal− Is compose-able− Is invariant− Is scale-able

Source: Dave Lenrow, “Intent As The Common Interface to Network Resources,” Intent Based Network Summit 2015 ONF Boulder: Intent NBI

Intent“I want my headache

to stop”

Prescription“Give me two

aspirins”

5

Page 6: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

Intent-based NetworkingExamples

WEB/Gold/Working Hour

No connect/Wireless

Configure new guest WiFi

6

Page 7: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

Intent-based NetworkingExamples

WEB/Gold/Working Hour

No connect/Wireless INVISIBLE

Configure new guest WiFi

7

Page 8: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

Intent-based NetworkingOpen Source Efforts– ONF Open Source SDN Boulder

– Define Intent North Bound Interface (NBI)– http://opensourcesdn.org/projects/project-boulder-intent-northbound-interface-nbi/– https://community.opensourcesdn.org/wg/IntentNBI/dashboard

– OpenDaylight NIC– Network Intent Composition– Manage and direct network services and network resources based on

the given “Intent”– https://wiki.opendaylight.org/view/Network_Intent_Composition:Main

– ONOS Intent Framework– Allows applications to specify their network control desires in form

of policy rather than mechanism (Intent)– https://wiki.onosproject.org/display/ONOS/Intent+Framework

ONF Intent NBI – Definition and Principles, Draft Version 6, Sep. 2016

https://wiki.opendaylight.org/view/Network_Intent_Composition:Graph

8

Page 9: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

Policy Graph Abstraction (PGA)PGA overviewTroubleshooting for Intent-based Networking

9

Page 10: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

PGA is RealPublic resources

ACM SIGCOMM 2015 London, UK

Research Paper and Demo Running System and Open Source Contributions

OpenStack Summit 2015, 2016

OpenDaylight Summit 2015, 2016

10

Page 11: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

Policy Management in Practice

11

Page 12: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

Policy Graph Abstraction (PGA)

Mktg&Cmp-B&Normal

Engg&Cmp-A&Normal

HTTP Web&Cloud

DNS

DB&Cloud

RemedyService

Engg&Cmp-A&Qn

Mktg&Cam-B&Qn

Ping,SSH

HTTP

monitor

SQL, monitorsync,

monitor

monitor

DNS DNS

*

*

BC

BC

BCLBFW

BCLBFW

DPIDPI

BC

BC

graph composition

QuarantinedRemedyService

Policy sources Graph abstraction Unified, conflict-free policy graph Deploy

12

Page 13: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

PGA Example − Label namespace across cloud services and network, capturing overlap vs. disjoint relations between labels

13

CPU Utilization

> 90% <= 90%

Page 14: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

PGA Example − Label namespace across cloud services and network, capturing overlap vs. disjoint relations between labels − 4 individual input policies

(a) Departments admin

Engg. MktgPing,SSH

Cloud

monitor

QuarantinedRemedyService

*

(b) Application admin

(d) Cloud operator(c) SDN app: HPE Net Protector

Campus Cloud*

*

HTTPEmpl Web

SQLsync

DBLB

Normal DNSDNS

(a) Enterprise IT admin

DPI FW BC

BC

Cmp-AMktg

Empl App

Web CampusDBNet

Protector

Status

Normal Qn

Tenant

Location

Engg: Campus-A Mktg: Campus-BApplication: CloudEmpl: Net protector

Engg

Label NamespaceLabel Mappings

disjoint

Cloud

Cmp-B

14

Page 15: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

PGA Example − 4 individual input policies− Label namespace across cloud services and network, capturing overlap vs. disjoint relations between labels − Proactive, automatic composition − Scalable algorithm: 13 mins to compose 20K ACL + service chain policies

(a) Departments admin

Engg. MktgPing,SSH

Cloud

monitor

QuarantinedRemedyService

*

(b) Application admin

(d) Cloud operator(c) SDN app: HPE Net Protector

Campus Cloud*

*

HTTPEmpl Web

SQLsync

DBLB

Normal DNSDNS

(a) Enterprise IT admin

DPI FW BC

BC

Mktg&Cmp-B&Normal

Engg&Cmp-A&Normal

HTTP Web&Cloud

DNS

DB&Cloud

RemedyService

Engg&Cmp-A&Qn

Mktg&Cam-B&Qn

Ping,SSH

HTTP

monitor

SQL, monitorsync,

monitor

monitor

DNS DNS

*

*

BC

BC

BCLBFW

BCLBFW

DPIDPI

BC

BC

compose

Cmp-AMktg

Empl App

Web CampusDBNet

Protector

Status

Normal Qn

Tenant

Location

Engg: Campus-A Mktg: Campus-BApplication: CloudEmpl: Net protector

Engg

Label NamespaceLabel Mappings

disjoint

Cloud

Cmp-B

15

Page 16: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

PGACurrent status

PGA implementation and impact− PGA model, composition, deployment, and tool to convert ACL policy configuration to PGA

intent specification

− PGA prototype for OpenStack (Juno ~ Newton)

− PGA Intent APIs and graph compiler contributed to ODL/NIC Beryllium release

− Troubleshooting for intent based policy management

− Conflict detection

− Composition correctness verification

− Intent addition/modification/deletion

16

Page 17: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

Live DemoPGA BasicOperations

17

Page 18: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

PGA Demo

18

Page 19: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

TroubleshootingWith Intent-based Networking

Page 20: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

Network debugging/troubleshooting a difficult task

Picture sources: http://simplearchitectures.blogspot.com/2013/08/addressing-data-center-complexity.htmlhttp://www.cisco.com/c/en/us/td/docs/solutions/Enterprise/Data_Center/ServerFarmSec_2-1/ServSecDC/8_NIDS.html

WEB NO CONNECT

Picture source: http://www.ntstn.com/category/troubleshooting/network-troubleshooting

Policy

Networkping traceroute

tcpdump

SNMP

sflow

Page 21: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

Systematic troubleshooting

–Know intent of the operator–Check network behavior against operator intent

Intent-based networking

–Policy is a first-class citizen–Intent explicitly expressed at policy layer–Forwarding semantics explicitly defined–Code compiles policy description into lower-level configuration

Difficult to achievein legacy networks

Opportunity to rethink network

debugging

Page 22: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

Intent-based Networking

Application Plane(SDN Apps)

Controller Plane(OpenDaylight, ONOS, etc.)

Infrastructure (Data) Plane(Cloud/IT/SDN/NFV)

INTENT North Bound Interface

Infrastructure Control Interfaces

– Control Apps – Specify routing/access control policies

– Logical view – Simplified/abstract representation of network

– Physical view – One-to-one correspondence with the physical network

– Controller’s job to configure the network devices (OpenFlow)

Page 23: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

Intent-based Networking

Application Plane(SDN Apps)

Controller Plane(OpenDaylight, ONOS, etc.)

Infrastructure (Data) Plane(Cloud/IT/SDN/NFV)

INTENT North Bound Interface

Infrastructure Control Interfaces

– Control Apps – Specify routing/access control policies

– Logical view – Simplified/abstract representation of network

– Physical view – One-to-one correspondence with the physical network

– Controller’s job to configure the network devices (OpenFlow)

• Each layer performs one piece of translation process

• Every layer should correctly map to every other layer

• Most errors in SDN are mistranslations between layers

Page 24: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

Checking network behavior against intent

–Early debugging tools for OpenFlow-enabled networks –Ndb, OFRewind, NetSight, netwatch, netshark, nprof…–Easier to discover the source of network problems

[Faulty device firmware, inconsistent flow rules, faulty routing…]

–Testing and verification complement network troubleshooting and debugging[Loop freedom, black holes, performance of OpenFlow switches…]

Too low level!

Page 25: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

Knowing the operator’s intent

Does the Actual Network Behavior Match the Policy?

–If NO…Match the symptoms to responsible system component

–If YES…The policy itself is the problem, a human must resolve the discrepancy

–If unwanted behavior persists & all state layers are equivalent:–The configured policy must not match the operator’s intent

Page 26: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

Troubleshooting System

Composed graph

User/App1 User/App2 User/AppnUser Intents

Input graphs

Infrastructure Controllers

PGA

Results

Metadata

GUIQuery

Query Examples– Reachability/Connectivity checking

– Can A talk to B?

– Security vulnerability or Risk assessment

– Addition/removal/edition correctnessTroubleshootingSystem

Page 27: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

Troubleshooting ExamplesReachability

–Can A talk to B?–What EPG do nodes belong to?–Is there an edge connecting both EPGs?–What security groups should be checked?–What middleboxes should be checked?

Page 28: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

Troubleshooting example Troubleshooting network connectivity (reachability)

(a) Departmentsadmin

Engg. MktgPing,SSH

Cloud

monitor

QuarantinedRemedyService

*

(b) Application admin

(d) Cloud operator(c) SDN app: HP Net Protector

Campus Cloud*

*

HTTPEmpl Web

SQLsync

DBLB

Normal DNSDNS

(a) Enterprise IT admin

DPI FW BC

BC

Mktg&Cmp-B&Normal

Engg&Cmp-A&Normal

HTTP Web&Cloud

DNS

DB&Cloud

RemedyService

Engg&Cmp-A&Qn

Mktg&Cam-B&Qn

Ping,SSH

HTTP

monitor

SQL, monitorsync,

monitor

monitor

DNS DNS

*

*

BC

BC

BCLBFW

BCLBFW

DPIDPI

BC

BC

compose

Cmp-A

Status

Tenant

Empl App

Mktg

Web DB Campus Cloud Net Protector

Normal Qn

Location

Engg: Campus-A Mktg: Campus-BApplication: CloudEmpl: Net protector

Cmp-B

Engg

Label Namespace

Label Mappings

disjoint

web

Engg client HR site

Page 29: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

Troubleshooting example Intent addition/modification/removal

(a) Departmentsadmin

Engg. MktgPing,SSH

Cloud

monitor

QuarantinedRemedyService

*

(b) Application admin

(d) Cloud operator(c) SDN app: HP Net Protector

Campus Cloud*

*

HTTPEmpl Web

SQLsync

DBLB

Normal DNSDNS

(a) Enterprise IT admin

DPI FW BC

BC

Mktg&Cmp-B&Normal

Engg&Cmp-A&Normal

HTTP Web&Cloud

DNS

DB&Cloud

RemedyService

Engg&Cmp-A&Qn

Mktg&Cam-B&Qn

Ping,SSH

HTTP

monitor

SQL, monitorsync,

monitor

monitor

DNS DNS

*

*

BC

BC

BCLBFW

BCLBFW

DPIDPI

BC

BC

Cmp-A

Status

Tenant

Empl App

Mktg

Web DB Campus Cloud Net Protector

Normal Qn

Location

Engg: Campus-A Mktg: Campus-BApplication: CloudEmpl: Net protector

Cmp-B

Engg

Label Namespace

Label Mappings

disjoint

compare

29

Page 30: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

Troubleshooting example Risk Assessment

Indicator may be composed using different data points:e.g. # of compromised hops; # of network functions traversed, etc.

What if a host from “Web&Cloud” is compromised?

What EPGs might be able to reach host ‘x’ (through intermediate host compromise)?

Page 31: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

Troubleshooting Demo

MarketingEmployee

Campus Admin

10.10.20.1

Connectivity Problem

Intent edition

Remote desktopconnection

Page 32: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

PGA and Troubleshooting Demo

32

Page 33: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

Summary

–Intent-based Networking is beneficial to simplify network control & management

33

Page 34: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

Summary

–Intent-based Networking is beneficial to simplify network control & management–Policy Graph Abstraction (PGA) is one of the well-defined intent-based

management framework and we presented possible troubleshooting examples

34

Page 35: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

Summary

–Intent-based Networking is beneficial to simplify network control & management–Policy Graph Abstraction (PGA) is one of the well-defined intent-based

management framework and we presented possible troubleshooting examples–Intent-level troubleshooting can help to easily do troubleshooting network

problems

35

Page 36: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

Summary

–Intent-based Networking is beneficial to simplify network control & management–Policy Graph Abstraction (PGA) is one of the well-defined intent-based

management framework and we presented possible troubleshooting examples–Intent-level troubleshooting can help to easily identify network problems

–What’s next– More More More practical experiences from network operators/administrators/developers…

36

Page 37: Troubleshooting for Intent-based Networkingevents17.linuxfoundation.org/sites/events/files/slides/ONS2017... · Software-Defined Networking. ... Infrastructure (Data) Plane (Cloud/IT/SDN/NFV)

Thank [email protected]@hpe.com

37