storage for data platforms - red hatpeople.redhat.com/~flucifre/talks/storage for data...

Post on 31-Jul-2020

7 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

STORAGE FOR DATA PLATFORMS

Federico Lucifredi & Kyle Bader

OpenStack Summit, Vancouver 2018

INSERT DESIGNATOR, IF NEEDED2

COMMON ARCHITECTURAL MODEL -PUBLIC OR PRIVATE CLOUD

PUBLIC CLOUD (AWS) PRIVATE CLOUD

AWS EC2 PROVISIONING

OPENSTACKPROVISIONING

AWS S3SHARED DATASETS

CEPH S3/SWIFTSHARED DATASETS

Hadoop

Presto

Spark Hadoop

Presto

Spark

THREE PILLARS FOR DATA PLATFORMS

Object Persistent Block

Low latency local block

OBJECT

● Shared persistence layer across compute clusters

● Extreme scalability○ Capacity○ Throughput

● Economical

OBJECT WORKLOADS

PERSISTENT BLOCK

● Arbitrary block capacity

● Capacity based IOPS

● Persistence helps with management of database lifecycles

○ Detach and reattach on larger instance○ Snapshots for backups and copy-on-write for secondaries

PERSISTENT BLOCK WORKLOADS

LOW LATENCY LOCAL BLOCK

● For distributed applications that tolerate instance failures

● Big data workloads○ Scratch data○ Intermediate spill data○ Buffers

LOW LATENCY LOCAL BLOCK

INSERT DESIGNATOR, IF NEEDED10

COMMON ARCHITECTURAL MODEL -PUBLIC OR PRIVATE CLOUD

PUBLIC CLOUD (AWS) PRIVATE CLOUD

AWS EC2 PROVISIONING

OPENSTACKPROVISIONING

AWS S3SHARED DATASETS

CEPH S3/SWIFTSHARED DATASETS

Hadoop

Presto

Spark Hadoop

Presto

Spark

THANK YOU!

COME TO SEE US AT THE RED HAT BOOTH

top related