new workloads and - open compute project · 2018. 5. 29. · new workloads and the evolving network...

13

Upload: others

Post on 11-Sep-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: NEW WORKLOADS AND - Open Compute Project · 2018. 5. 29. · NEW WORKLOADS AND THE EVOLVING NETWORK Uri Cummings Connectivity Group Chief Technical Officer, Data Center Group
Page 2: NEW WORKLOADS AND - Open Compute Project · 2018. 5. 29. · NEW WORKLOADS AND THE EVOLVING NETWORK Uri Cummings Connectivity Group Chief Technical Officer, Data Center Group

N E W W O R K L O A D S A N DT H E E V O L V I N G N E T W O R KUri Cummings

Connectivity Group Chief Technical Officer, Data Center Group

Intel Corporation

[email protected]

Page 3: NEW WORKLOADS AND - Open Compute Project · 2018. 5. 29. · NEW WORKLOADS AND THE EVOLVING NETWORK Uri Cummings Connectivity Group Chief Technical Officer, Data Center Group

3

By 2021,

95% of all data

center traffic

will be based in

the cloud.

Source:: Cisco Global Cloud Index, 2016-2021

Datacenter Traffic

Cloud Datacenter

Traditional Datacenter

Zett

abyt

es Pe

r Yea

r

Page 4: NEW WORKLOADS AND - Open Compute Project · 2018. 5. 29. · NEW WORKLOADS AND THE EVOLVING NETWORK Uri Cummings Connectivity Group Chief Technical Officer, Data Center Group

4

Data center network inflections

Cloud Scale:the big get bigger

Critical workloads redefine the network

Pervasive offloadsDistributed intelligence

Page 5: NEW WORKLOADS AND - Open Compute Project · 2018. 5. 29. · NEW WORKLOADS AND THE EVOLVING NETWORK Uri Cummings Connectivity Group Chief Technical Officer, Data Center Group

Data and the Emergence of accelerator networks

5

Emerging workloads redefining the networkNetworks reconfigured for east-west, jitter reduction, and accelerators

Network: Flexibly-connected topology that enables endpoints to share data (e.g., arbitrary non-cyclic topologies)

Scale-Out Fabric: Specialized form of a network, with a fully-connected topology, that is engineered and tuned to sustain performance & latency at high scale while reducing jitter (e.g., Cloud Ethernet and HPC Fabric)

• • • • • • • • • • • • Comms

Act

ivit

y

Task

Comms

Task Task

Comms ● ● ●Time

Task

Traditional Networks and datacenter FabricsNode-focused:

Accelerator scaling fabric:

•Light-weight network that interconnects a cluster of local accelerators with high-bandwidth, low-latency links•Lowest power/bit•Code & connectivity optimized for locality•Emerging Need: Data-lakes

•Used for high-bandwidth memory sharing/coordination at larger scale•Allows separation of accelerator vs. general communications, minimizing interference in the scaling fabric.

Act

ivit

y

Time

Comms ● ● ●

Task TaskTask

CommsComms

Page 6: NEW WORKLOADS AND - Open Compute Project · 2018. 5. 29. · NEW WORKLOADS AND THE EVOLVING NETWORK Uri Cummings Connectivity Group Chief Technical Officer, Data Center Group

6

The motivation driving acceleration and offload

Objective: Grow acceleration capabilities for network-related workloads

Specialized infrastructure

Reduce utilitycycles in CPU

Xeon Cores

Monetized Work

Infra.

Agility Infrastructure Acceleration Application Acceleration

Cloud optimizes atthe speed of Software

ContinuousIntegration

LowerDatacenter Tax

Enhance GrowingApplications

CPU

NIC

TOR

Infrastructure

Monetized work

Page 7: NEW WORKLOADS AND - Open Compute Project · 2018. 5. 29. · NEW WORKLOADS AND THE EVOLVING NETWORK Uri Cummings Connectivity Group Chief Technical Officer, Data Center Group

7

Emerging use case : bare-metal server

Bare metal servers Offer new usage models—full server rentals

Improve infrastructure security in cloud

Bolster performance, reduce cost/bit by removing bottlenecks

Page 8: NEW WORKLOADS AND - Open Compute Project · 2018. 5. 29. · NEW WORKLOADS AND THE EVOLVING NETWORK Uri Cummings Connectivity Group Chief Technical Officer, Data Center Group

8

Cloud-era data center network

Pod PCIeEthernet Inner Loop

TOR(Leaf)

Core(Spine)

TOR(Leaf)

CPU

Chipset

LAN

Node(Shelf)

Core(Spine)

Control CPU

Ethernet Switch

Control CPU

Accelerator Pool(Appliance)

CPU

Chipset

Control CPU

Ethernet Switch

Control CPU

Ethernet Switch

SoC

LAN

Ethernet Switch

CPU

Chipset

NIC NIC

Switch defines network architecture

Single mode becoming ubiquitous

Copper Optics >100G<25G

Bandwidth to the node is exploding• East-West communication• ML, DL, application & network acceleration

• Bandwidth and radix• Power efficiency• Reliability

NIC is innovation pointfor platform offloads

NICNIC

SwitchSwitch

Page 9: NEW WORKLOADS AND - Open Compute Project · 2018. 5. 29. · NEW WORKLOADS AND THE EVOLVING NETWORK Uri Cummings Connectivity Group Chief Technical Officer, Data Center Group

9

Motivating smart NIC

Programmable assets FPGA

CPUs

Flexible state machines

Networking assets Broad market NIC

business

Open source contribs

DPDK

Cores/Compute• Xeon/Core• Atom • Graphics/Media• QAT/Crypto/Comp.• Special purpose

(accelerators)

FPGA• Arria10• Stratix10• Flexible tile

architecture

Host interfaces• PCIe Gen3, 4 and 5• IAL (I/0, Memory,

Cache)• UPI/Coherent• On-package (Rlink)

Interface tech.• DMA• SVM/Coherent• Hardware queue

engines• Virtual device

manager

NETWORK• MAC/PHY/SerDes• RDMA (RoCE, iWARP)• OPA for HPC• Packer parsers• P4 engines• Schedulers• Switches

package• BGA: Low-cost to

large footprint• Package MCP &

Interfaces (Rlink)

software• Compilers, tools• WOS• OTC/Intel Clear Linux• P4 API working group• Libraries/Standards

bodies

Storage Memory• NVMe

Integrating end-to-end capabilities in the platform

Page 10: NEW WORKLOADS AND - Open Compute Project · 2018. 5. 29. · NEW WORKLOADS AND THE EVOLVING NETWORK Uri Cummings Connectivity Group Chief Technical Officer, Data Center Group

10

OCP NIC v3.0 enables the smart NIC

Community development since Feb 2017 solved major challenges

Improved serviceability

Manage NIC / platform transition

Ready for PCIe Gen4 and Gen5

Enlarge PCB space for smart NIC use case

Allow higher TDP ASICs

Improve mechanical interface

Single connector Dual connector

small large

Low profile—similar profile to OCP NIC2.0 card

Larger PCB width to support additional NIC

Up to 16 PCIe lanes Up to 32 PCIe lanes

80W max 150W max

Joint effort across system/NIC/Connector suppliers and end users

Page 11: NEW WORKLOADS AND - Open Compute Project · 2018. 5. 29. · NEW WORKLOADS AND THE EVOLVING NETWORK Uri Cummings Connectivity Group Chief Technical Officer, Data Center Group

11

Significant OCP contributions in…

10GbE SFP+

40GbE QSFP+

25GbE SFP28

>25 OCP-compliant NIC and PHY SKUs, covering 1GbE, 10GbE, 25GbE, and 40GbE

>1M Intel-branded ports shipped in OCP form factor (excluding partner shipments)

Fully compliant with 100G CWDM4-OCP, CWDM4 MSA, QSFP, and CAUI-4 specs

500m, 2km, or 10km reach on duplex single mode fiber

OCP Ethernet Network Adapters 100G Silicon Photonics

CWDM4-OCP 500m

CWDM4 2km CWDM4 10km

Page 12: NEW WORKLOADS AND - Open Compute Project · 2018. 5. 29. · NEW WORKLOADS AND THE EVOLVING NETWORK Uri Cummings Connectivity Group Chief Technical Officer, Data Center Group

Intel 400G CWDM8 QSFP-DD optical module

400G Transmitter: Optical Output

400G Receiver: Host Side Electrical Output

Technology for 400G is working in the lab

1271nm

1291nm

1311nm

1331nm

1351nm

1371nm

1391nm

1411nm

12

Page 13: NEW WORKLOADS AND - Open Compute Project · 2018. 5. 29. · NEW WORKLOADS AND THE EVOLVING NETWORK Uri Cummings Connectivity Group Chief Technical Officer, Data Center Group

13