new adventures in storage: cloud storage and cdmi paul

12
New adventures in storage: cloud storage and CDMI Paul Millar On behalf of the dCache team. CHEP 2015: Spring Okinawa

Upload: others

Post on 24-Apr-2022

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: New adventures in storage: cloud storage and CDMI Paul

New adventures in storage: cloud storage and CDMI

Paul MillarOn behalf of the dCache team.

CHEP 2015: Spring Okinawa

Page 2: New adventures in storage: cloud storage and CDMI Paul

What is dCache?LHC data stored on each storage system

Source: BDII (2014-11-14)

dCache (96 PB)

DPM (34 PB)

EOS (0 PB)

StoRM (20 PB)

CASTOR (14 PB)

BeStMan (7.6 PB)

Globus FTP (6.1 PB)

ARC (0.01 PB)

xrootd (22 PB)

Collaborations

Core team

Student mentor programme

dCache

dCache

1.5 FTEs

2 FTEs

5 FTEs

3 students

Page 3: New adventures in storage: cloud storage and CDMI Paul

Not just HEP:

● EGI FedCloud● Indigo DataCloud● HPC access to data via metadata

… are pushing for CDMI

Page 4: New adventures in storage: cloud storage and CDMI Paul

What is CDMI?● Industry standard protocol for cloud storage,● Optional functional building blocks:

filesystem, object store, protocol discovery, storing metadata, metadata querying language, queues, multi­tenant, accounting, retention polices, snapshots, …

● Community­proposed extensions:

“Scrubbing” metadata, file locality, server­side scatter­gather, job management, …

● Commercially available (DDN, NetApp)

Page 5: New adventures in storage: cloud storage and CDMI Paul

CDMI supports● Writing data with different cost/retention guarantees: tick.● Managing latency guarantees: tick.● Third­party copy: tick.● Notification when missing files: partial­tick [logging].● Notification when files deleted: partial­tick [logging].● Discover file popularity: partial­tick.● Storage accounting: tick.● Access monitoring: partial­tick (logging open/close).

Page 6: New adventures in storage: cloud storage and CDMI Paul

What is dCache plan?

● Adding initial CDMI support this summer,

filesystem­like access

Based on collaboration with HTW Berlin,● Next steps:

● Object store,● Storing metadata,● Querying metadata.

Page 7: New adventures in storage: cloud storage and CDMI Paul

Opportunities for HEP?● Community profiles for CDMI:

Map existing usage to CDMI● Simplify small (Tier­2/3) storage: just buy a CDMI box.● Extensions for additional use­cases:

Support any current usage not already in CDMI.

Add new functionality:● Register alternative locations (promote self­healing)● “Smart” vector reads (c.f., OpeNDAP),

Page 8: New adventures in storage: cloud storage and CDMI Paul

Thanks for listening … any questions?

Page 9: New adventures in storage: cloud storage and CDMI Paul

Backup slides

Page 10: New adventures in storage: cloud storage and CDMI Paul

Not just WLCG● JADE

(SMHB)● LOFAR

(Radio Astronomy)● Fermilab: Intensity Frontier

(Neutrino, Muon experiments)● DESY: “unlimited” DropBox­like service● Commercial partners

… etc, etc ...

Page 11: New adventures in storage: cloud storage and CDMI Paul

The scientific cloud vision

NFS

WebDAV

HPC & Grid Clusters

Low latency access

DropBox-like storage

Devices synchronise with storage

Remote access

Rich access via web- browser

Bulk WAN transfer

Moving huge datasets

CDMI

HTTPFTP

Cloud storage

Standard back- end for clusters and portals

dCacheNFS

Fast data ingestStandard devices at high data rates

Page 12: New adventures in storage: cloud storage and CDMI Paul

What are the benefits● Standard Interface:

“just buy a box”, clients come for free …● Push work to storage:

metadata operations, data retention, queues, ...● Scale better:

CRUD operations, object store, …● Can discover things:

Alternative protocols, file usage statistics, …