fpgas: the key to accelerating high-speed storage systems · 2020-06-25 · example of line rate...
TRANSCRIPT
![Page 1: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data](https://reader033.vdocuments.net/reader033/viewer/2022050510/5f9ac763ba355806ce47b255/html5/thumbnails/1.jpg)
Salil Raje
Executive Vice President & GM
Xilinx Data Center Business
FPGAs: The Key to Accelerating
High-Speed Storage Systems
![Page 2: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data](https://reader033.vdocuments.net/reader033/viewer/2022050510/5f9ac763ba355806ce47b255/html5/thumbnails/2.jpg)
SSDs Have Been a Game Changer for Storage
1
10
100
1000
10000
1 2 3
MIC
RO
SE
CO
ND
S
LATENCY
>> 2
![Page 3: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data](https://reader033.vdocuments.net/reader033/viewer/2022050510/5f9ac763ba355806ce47b255/html5/thumbnails/3.jpg)
VIDEOANALYTICS
MACHINE
LEARNING
FINANCIALLIFE
SCIENCES
DATABASE Apps
NetworkStorage
Compute
Explosion of UnstructuredData
Data Filtering
EncryptionDecompression
Compression
>> 3
![Page 4: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data](https://reader033.vdocuments.net/reader033/viewer/2022050510/5f9ac763ba355806ce47b255/html5/thumbnails/4.jpg)
Continuously Evolving Standards
Data Filtering
EncryptionDecompression
CompressionHadoop
Spark
Aerospike
RocksDB
Cassandra
Foundation DB
GZip
zSTD
Huffman
LZ
Zipline
Brotli
LZ
Brotli
Zipline
DES
AES-XST
SHA1-256
Block chain
>> 4
![Page 5: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data](https://reader033.vdocuments.net/reader033/viewer/2022050510/5f9ac763ba355806ce47b255/html5/thumbnails/5.jpg)
Controller
Bottlenecks Remain for Data Intensive Applications
PCIe
DRAMDRAM
Excessive data transfers
High latency
Limited BW
CPU not optimized for
these tasks
Processor-centric architecture
Flash
Compute
CPU
>> 5
![Page 6: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data](https://reader033.vdocuments.net/reader033/viewer/2022050510/5f9ac763ba355806ce47b255/html5/thumbnails/6.jpg)
Controller
Emergence of Computational Storage as the Solution
PCIe
Computational storage architecture
CPU
Compute
acceleration
close to storage
Reduces required bandwidth
Reduces latency
More available
CPU cyclesComputeDRAMDRAM
Flash
>> 6
![Page 7: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data](https://reader033.vdocuments.net/reader033/viewer/2022050510/5f9ac763ba355806ce47b255/html5/thumbnails/7.jpg)
Growing Industry Momentum for Computational Storage
>> 7
![Page 8: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data](https://reader033.vdocuments.net/reader033/viewer/2022050510/5f9ac763ba355806ce47b255/html5/thumbnails/8.jpg)
How FPGAs Address the
Computational Storage Problem
![Page 9: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data](https://reader033.vdocuments.net/reader033/viewer/2022050510/5f9ac763ba355806ce47b255/html5/thumbnails/9.jpg)
˃ Flash controllers
˃ Storage SystemsCache-offload
Storage System & Switching connectivity
Data Reduction
FPGAs in Storage Today
>> 9
![Page 10: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data](https://reader033.vdocuments.net/reader033/viewer/2022050510/5f9ac763ba355806ce47b255/html5/thumbnails/10.jpg)
˃ Flexible, fully customizable architecture adapts to specific applications
Massive parallelism, I/O and customizable data path
˃ Performance, power and latency of dedicated HW + reconfigurability of SW
˃ More economical than ASIC/ASSP for many applications
FPGA Advantages for Computational Storage
FPGA FPGA FPGA
Encryption Accelerator Decryption Accelerator Analytics Accelerator
>> 10
![Page 11: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data](https://reader033.vdocuments.net/reader033/viewer/2022050510/5f9ac763ba355806ce47b255/html5/thumbnails/11.jpg)
Architecture easily adapts to latest compression algorithms
FPGA Advantages for Changing Standards
FPGA FPGA FPGA
Gzip Accelerator Brotli Accelerator Zipline Accelerator
>> 11
![Page 12: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data](https://reader033.vdocuments.net/reader033/viewer/2022050510/5f9ac763ba355806ce47b255/html5/thumbnails/12.jpg)
Example of Analytics Acceleration
Airline traffic in the USA from 1970 to Present
Flight Data — 1.2B EntriesAirport Data — 500M EntriesPlanes Data — 700M Entries
Q1: “Which cities originate the most flights with >10min delays? Q2: “Which airport in the Bay Area has the worst record?
Computational
Storage Drives
Scan, filter,
Hash-Agg
SSDCtrl
1x
4x
7x
13x
0
2
4
6
8
10
12
14
1
Re
lati
ve
P
erf
orm
an
ce
# FPGA Accelerators
QUERY PERFORMANCE
None 1 2 41
4FPGA
>> 12
![Page 13: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data](https://reader033.vdocuments.net/reader033/viewer/2022050510/5f9ac763ba355806ce47b255/html5/thumbnails/13.jpg)
Example of Line Rate Hadoop Compression Acceleration
The challenge: Ingest real-timeretail sales data during peak shopping season
FPGA
1x
20x
0 5 10 15 20
Intel Skylake-SP 6152 @2.10GHz CPU (Ubuntu 16.04),
GB/s compression per CPU core = .0229. Alveo U50 =
10GB/s
CPU
FPGA
CPU
vs.
CPU can’t keep up with line-rate data ingestionmaking compression impractical
>> 13
![Page 14: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data](https://reader033.vdocuments.net/reader033/viewer/2022050510/5f9ac763ba355806ce47b255/html5/thumbnails/14.jpg)
FPGA-based Data Compression Enables Server Consolidation
With 192TB (uncompressed)
2x Dual CPU Servers
Without Compression
Acceleration
50% Reduction in Nodes
40% Lower Cost
2x Accelerators, 96 TB (compressed)
Single Socket Server
Intel Skylake-SP 6152 @2.10GHz CPU (Ubuntu 16.04), GB/s compression per CPU core = .0229. Alveo U50 = 10GB/s, Assume 2:1
compression
With FPGA Compression
Acceleration
+
>> 14
![Page 15: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data](https://reader033.vdocuments.net/reader033/viewer/2022050510/5f9ac763ba355806ce47b255/html5/thumbnails/15.jpg)
Computational Storage
Deployment Options
![Page 16: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data](https://reader033.vdocuments.net/reader033/viewer/2022050510/5f9ac763ba355806ce47b255/html5/thumbnails/16.jpg)
˃ Integrated Accelerator and Flash
˃Benefits:
Easy to implement- plug & play
Adding capacity adds accelerators + performance
Ability to optimize BW between accelerator and flash
Ability to customize FTL for specific workloads
˃Vendors at FMS:
Samsung
Scaleflux
Computational Storage Drive (CSD)
PCIe
DRAMDRAM
CPU
Controller
Flash
FPGA
Controller
Flash
FPGA
Controller
Flash
FPGAFPGA FPGA FPGA
>> 16
![Page 17: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data](https://reader033.vdocuments.net/reader033/viewer/2022050510/5f9ac763ba355806ce47b255/html5/thumbnails/17.jpg)
˃ Accelerator and Storage on same PCIe
subsystem
˃ Benefits:
SSD vendor independence
Plugs into standard slot
PCIe Peer-to-peer transfers for high bandwidth and low latency
˃ Vendors at FMS:
Bittware
Eideticom
Xilinx
Computational Storage Processor (CSP)
Peer-to-Peer Acceleration
PCIe
DRAMDRAM
CPU
FPGA
FPGA
>> 17
![Page 18: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data](https://reader033.vdocuments.net/reader033/viewer/2022050510/5f9ac763ba355806ce47b255/html5/thumbnails/18.jpg)
˃ Accelerator in-line with storage
˃ Benefits:
SSD vendor independence
Independently scale accelerators and SSDs
Ability to optimize BW between accelerator and SSDs
˃ Vendors at FMS:
Bittware
Computational Storage Array (CSA)
PCIe
DRAMDRAM
CPU
FPGA
>> 18
![Page 19: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data](https://reader033.vdocuments.net/reader033/viewer/2022050510/5f9ac763ba355806ce47b255/html5/thumbnails/19.jpg)
Future Directions
![Page 20: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data](https://reader033.vdocuments.net/reader033/viewer/2022050510/5f9ac763ba355806ce47b255/html5/thumbnails/20.jpg)
Current Data Center Architecture: Fixed Resources, Sub-optimal Utilization
Ethernet
CPU
CPU
CPU
CPU
CPU
CPU
CPU
CPU
CPU
Accel
Accel
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
Accel
Accel
Accel
Accel
Accel
Accel
Accel
CPU
CPU
CPU
CPU
CPU
CPU
CPU
CPU
CPU
Accel
Accel
Accel
Accel
Accel
Accel
Accel
Accel
Accel
CPU
CPU
CPU
CPU
CPU
CPU
CPU
CPU
CPU
Accel
Accel
Accel
Accel
Accel
Accel
Accel
Accel
Accel
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
>> 20
![Page 21: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data](https://reader033.vdocuments.net/reader033/viewer/2022050510/5f9ac763ba355806ce47b255/html5/thumbnails/21.jpg)
Ethernet
Accel
Accel
Accel
Accel
Accel
Accel
Accel
Accel
Accel
Accel
Accel
Accel
Accel
Accel
Accel
Accel
Accel
Accel
Accel
Accel
Accel
Accel
Accel
Accel
Accel
Accel
Accel
Future Data Center : Disaggregated and Composable
Workload 1
Challenge: Reduced Bandwidth and Increased Latency
Workload 2
Workload 3
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
CPU
CPU
CPU
CPU
CPU
CPU
CPU
CPU
CPU
CPU
CPU
CPU
CPU
CPU
CPU
CPU
CPU
CPU
>> 21
![Page 22: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data](https://reader033.vdocuments.net/reader033/viewer/2022050510/5f9ac763ba355806ce47b255/html5/thumbnails/22.jpg)
˃ Enables composability without
significant performance penalty
˃ Benefits
Performance and latency benefits of computational storage
Scale compute / storage independently
Higher density per rack
Lowest TCO
˃ Vendors at FMS:
Xilinx
Introducing Composable Storage Acceleration
NVMe over Fabrics
PCIe
>> 22
![Page 23: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data](https://reader033.vdocuments.net/reader033/viewer/2022050510/5f9ac763ba355806ce47b255/html5/thumbnails/23.jpg)
SSD
SSD
Ethernet
Storage
Accel
Storage
Accel
Storage
Accel
Storage
Accel ˃ Moves some compute next to the data
˃ Network traffic reduced
˃ Latency improved
˃ Higher utilization with composable
infrastructure
Future DC: Composable + Adaptable Computational Storage
Reduced network traffic
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
Storage
Accel
FPGA
>> 23
![Page 24: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data](https://reader033.vdocuments.net/reader033/viewer/2022050510/5f9ac763ba355806ce47b255/html5/thumbnails/24.jpg)
CPU
CPU
CPU
CPU
CPU
CPU
CPU
CPU
CPU
Ethernet
Smart
NIC
Smart
NIC
Smart
NIC
Smart
NIC
Smart
NIC
Smart
NIC
Smart
NIC
Smart
NIC
Smart
NIC
˃ Enables low latency high bandwidths acceleration of
network interface workloads.
˃ Enables significantly higher packets per second
˃ Offloads network functions from the CPU
Future DC: Composable + Adaptable Network Acceleration
FPGA
>> 24
![Page 25: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data](https://reader033.vdocuments.net/reader033/viewer/2022050510/5f9ac763ba355806ce47b255/html5/thumbnails/25.jpg)
Ethernet
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
˃ Customizable acceleration up to
100x faster than CPUs for:
Video transcoding
ML inferencing
Financial modeling
…
Future DC: Composable + Adaptive Compute Acceleration
FPGA
>> 25
![Page 26: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data](https://reader033.vdocuments.net/reader033/viewer/2022050510/5f9ac763ba355806ce47b255/html5/thumbnails/26.jpg)
CPU
CPU
CPU
CPU
CPU
CPU
CPU
CPU
CPU
SSD
SSD
Ethernet
Smart
NIC
Smart
NIC
Smart
NIC
Smart
NIC
Smart
NIC
Smart
NIC
Smart
NIC
Smart
NIC
Smart
NIC
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Compute
Accel
Storage
Accel
Storage
Accel
Storage
Accel
Storage
Accel
˃ Composable accelerated
storage, networking and
compute
˃ Optimized for each workload
˃ Optimal infrastructure
utilization
Future DC: Composable + Distributed Adaptive Acceleration
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
SSD
Storage
Accel
>> 26
![Page 27: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data](https://reader033.vdocuments.net/reader033/viewer/2022050510/5f9ac763ba355806ce47b255/html5/thumbnails/27.jpg)
FPGAs are Key to Accelerating High-Speed Storage Systems
Computational storage addresses a broad range of application bottlenecks
Offers data center operators >5x performance boost and up to 2x reduction of TCO
Xilinx is leading the way in distributed adaptive acceleration
5x
2x
>> 27
![Page 28: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data](https://reader033.vdocuments.net/reader033/viewer/2022050510/5f9ac763ba355806ce47b255/html5/thumbnails/28.jpg)
˃Visit Xilinx in booth 313
˃Visit our partners
Alpha Data, Bittware, Burlywood, Codelucida, GigaIO, Echo Streams, Eideticom, Everspin Technologies, IP-Maker, Mobiveil, Pliops, PLDA, Scaleflux, Smart IOPS, Samsung, SMART Modular, Toshiba Memory America, Western Digital
˃Visit our Computational Storage microsite
www.xilinx.com/computational-storage
˃Join SNIA working group for Computational Storage
Computational Storage in Action
>> 28
![Page 29: FPGAs: The Key to Accelerating High-Speed Storage Systems · 2020-06-25 · Example of Line Rate Hadoop Compression Acceleration The challenge: Ingest real-time retail sales data](https://reader033.vdocuments.net/reader033/viewer/2022050510/5f9ac763ba355806ce47b255/html5/thumbnails/29.jpg)
Adaptable.
Intelligent.