future plans at ral tier 1 shaun de witt. introduction current set-up short term plans final...

13
Future Plans at RAL Tier 1 Shaun de Witt

Upload: amy-williamson

Post on 17-Jan-2018

218 views

Category:

Documents


0 download

DESCRIPTION

Current Infrastructure Common CMS ATLAS Gen LHcB Common Services (nsd, Cupv, vmgr, vdqm) Instances Disk Layer (Tape and Disk ‘Pools’) TapeLayer (at least 1 dedicated drive)

TRANSCRIPT

Page 1: Future Plans at RAL Tier 1 Shaun de Witt. Introduction Current Set-Up Short term plans Final Configuration How we get there… How we plan/hope/pray to

Future Plans at RAL Tier 1

Shaun de Witt

Page 2: Future Plans at RAL Tier 1 Shaun de Witt. Introduction Current Set-Up Short term plans Final Configuration How we get there… How we plan/hope/pray to

Introduction• Current Set-Up• Short term plans• Final Configuration• How we get there…• How we plan/hope/pray to use CEPH

Page 3: Future Plans at RAL Tier 1 Shaun de Witt. Introduction Current Set-Up Short term plans Final Configuration How we get there… How we plan/hope/pray to

Current InfrastructureCommo

n

CMSATLAS Gen LHcB

Common Services(nsd, Cupv, vmgr, vdqm)

Instances

Disk Layer(Tape and Disk ‘Pools’)

TapeLayer(at least 1 dedicated drive)

Page 4: Future Plans at RAL Tier 1 Shaun de Witt. Introduction Current Set-Up Short term plans Final Configuration How we get there… How we plan/hope/pray to

ATLAS Instance Explodednsd01

nsd02SRM01 SRM02 SRM03 SRM04

HeadNode01RH

StagerTapeGateway

HeadNode02TransferMgr

HeadNode03TransferMgr

NSDXroot Mgr

atlasTape atlasDataDisk atlasScratchDisk

Xroot proxy

x12x1

Page 5: Future Plans at RAL Tier 1 Shaun de Witt. Introduction Current Set-Up Short term plans Final Configuration How we get there… How we plan/hope/pray to

Database ConfigurationRepack

Nameserver

Cupv…

CMS SRMCMS STGR

LHCb SRMLHCb STGR ATLAS SRM ATLAS

STGRGen SRMGen STGR

RepackNameserve

rCupv

CMS SRMCMS STGR

LHCb SRMLHCb STGR ATLAS SRM ATLAS

STGRGen SRMGen STGR

Primaries

Standbys

DataGuard

Page 6: Future Plans at RAL Tier 1 Shaun de Witt. Introduction Current Set-Up Short term plans Final Configuration How we get there… How we plan/hope/pray to

Short Term Plans…• Improve tape cache performance for

ATLAS– Tape rates limited by disk– Currently heavy IO (read/write from

grid/tape)– Currently configured with 10(7) ‘small’

server in RAID6– Would RAID-1(0) help?

Page 7: Future Plans at RAL Tier 1 Shaun de Witt. Introduction Current Set-Up Short term plans Final Configuration How we get there… How we plan/hope/pray to

The Future

Swift/S3 XROOT/gridFTP

CASTOR

Page 8: Future Plans at RAL Tier 1 Shaun de Witt. Introduction Current Set-Up Short term plans Final Configuration How we get there… How we plan/hope/pray to

What is • Erasure-coded CEPH High-throughput

Objectstore• EC uses 16+3• ALL user data planned to use erasure

coding (no replication)• S3/SWIFT recommended interfaces

– Xroot and gridFTP for legacy support• … more later

Page 9: Future Plans at RAL Tier 1 Shaun de Witt. Introduction Current Set-Up Short term plans Final Configuration How we get there… How we plan/hope/pray to

The Plan…• Move

– The data• Modify• Merge

Page 10: Future Plans at RAL Tier 1 Shaun de Witt. Introduction Current Set-Up Short term plans Final Configuration How we get there… How we plan/hope/pray to

The ‘Plan’ – Phase 1• Current disk purchases are usable for

CEPH and classic CASTOR• Start moving atlasScratchDisk over to echo

– Lifetime of files should be ~2 weeks– Allows us to monitor production use of echo

• Work with VOs to migrate relevant data using FTS

• Maintain classic CASTOR

Page 11: Future Plans at RAL Tier 1 Shaun de Witt. Introduction Current Set-Up Short term plans Final Configuration How we get there… How we plan/hope/pray to

The Plan – Phase 2• Once all disk-only (on echo?)

– Consolidate to single castor instance• With single shared diskpool• Tape drive dedication…(Tim knows)• Clear all diskcopies• Single 2/3 node RAC + stdby

– Common headnodes supporting all services– Maintain 3-4 SRMs

• Will probably be phased in

Page 12: Future Plans at RAL Tier 1 Shaun de Witt. Introduction Current Set-Up Short term plans Final Configuration How we get there… How we plan/hope/pray to

Accessing ECHO• VOs use gridFTP and xroot ATM

– Write using 1 protocol, read using either• But not S3/SWIFT

– Proposed gridFTP url (writing?)• gsiftp://gateway.domain.name/<pool>/<file>• Steers transfer to pool• Certificate (VO and Role) for AA

– Xroot URL• As suggested by Seb…

– But what about access?

Page 13: Future Plans at RAL Tier 1 Shaun de Witt. Introduction Current Set-Up Short term plans Final Configuration How we get there… How we plan/hope/pray to

The Known Unknowns• S3/SWIFT interoperability?• Will CASTOR/CEPH support EC pools?• Partial writes verboten?• Do users need to supply• Support for CEPH plugins