Belle II Data Handling System Kihyeon Cho * (on behalf of the Belle II Computing Group) *Presenter High Energy Physics Team KISTI (Korea Institute of Science and Technology Information) October 28, 2010 NCC Seminar, Goyang, Korea 1

Belle II Data Handling System

Kihyeon Cho*

(on behalf of the Belle II Computing Group)

*Presenter

High Energy Physics Team

KISTI (Korea Institute of Science and Technology Information)

October 28, 2010

NCC Seminar, Goyang, Korea

Contents

• High Energy Physics Team @ KISTI
• Belle II Experiment
• Belle II Data Handling System
  – Meta-data system
  – Data cache system
  – To test Large Scale Data Handling: with Belle Data, with Belle II Data (Random data)
• The interaction between HLT and Storage
• Summary

High Energy Physics Team @ KISTI

High Energy Physics Team (Physicists)
• Kihyeon Cho – P.I.
• Soo-hyeon Nam – Senior Researcher
• Junghyun Kim – Senior Researcher
• YoungJin Kim – Postdoc
• Taegil Bae – Postdoc
• Jihye Moon – Researcher

Former members
• Yongseok Oh – Prof. of Physics @ KNU
• Daejung Kong – Contract Prof. @ KNU
• Ilsung Cho – Contract Prof. @ KyungHee U.
• Minho Jeung – Company
• Hyunwoo Kim – Computing Team @ KISTI

High Energy Physics Team @ KISTI

[Figure: e-Science diagram with Theory, Computing, and Experiment]

• Goal: to probe the Standard Model and search for New Physics
• Experiments: CDF, Belle/Belle II (cf. LHCb)

K. Cho and H.W. Kim, JKPS (2009)

e-Science @ KISTI

International collaborative research
• LHC @ CERN, Europe
• RHIC @ BNL, USA
• Belle*/Belle II* @ KEK, Japan
• CDF* @ FNAL, USA
• IN2P3 @ France
• France-Korea Particle Physics Laboratory

*Collaboration
• Korean partner of the CDF group at the France-Korea Particle Physics Laboratory (Kihyeon Cho)
• Chair of the Belle II Data Handling working group (Kihyeon Cho)

High Energy Physics in the LHC era

Three approaches to New Physics (new particles and new interactions)

• Energy frontier experiments (LHC, ILC, …): Higgs, SUSY, dark matter, new understanding of space-time…
• Lepton physics (ν exp., μ LFV, τ LFV, gμ−2, …): neutrino mixing/masses, lepton number non-conservation…
• Quark flavor physics (Super-B Factory, K exp., etc.): CP asymmetry, baryogenesis, left-right symmetry, new sources of flavor mixing…

Facilities shown: LHC, ILC, J-PARC, KEKB upgrade

M. Yamakuchi, Belle II meeting (2008)

Heavy Flavor Physics Experiments

                                     Belle/Belle II                         CDF              LHCb
Year                                 1998-2010 (Belle); 2014- (Belle II)    2001-            2009-
Place                                KEK, Japan                             Fermilab, USA    CERN, Europe
Collaboration (nat./inst./members)   13/47/~300 (Belle II)                  15/63/620        15/54/730
σ                                    1 nb (10 GeV)                          150 μb (2 TeV)   300~500 μb (7~14 TeV)
Current luminosity                   1 ab⁻¹                                 8 fb⁻¹           180 nb⁻¹

Belle vs. Belle II

Belle             Content          Belle II
1998~2010         Time schedule    2014~
1 ab⁻¹            Luminosity       50 ab⁻¹
1 billion         BB̄ events        50 billion
CP measurement    Goal             New Physics

To handle 50 times more data and to use grids ⇒ New data handling system

Luminosity vs. Physics

[Figure: major achievements expected at Super B Factory]

Start: 2012 ⇒ 2014

Belle II Time schedule

Hara San

Count down

• 2014 April – Collider turn on
• 2012 February – Ready for Computing
• 2011 Summer – Rough final system
• 2010 Summer – Prototype system

Spec & Test

• 2010.5 PAC – Answers for questions (Computing & Pixel)

Belle II Computing

⇒ To handle the data, we need a new data handling system.

Belle II Data Handling Group (Chair: Kihyeon Cho)

• KISTI
  – HEP Team: JungHyun Kim, Kihyeon Cho, YoungJin Kim, …
  – AMGA Team: Soonwook Hwang, Sunil Ahn, HanGi Kim, Tae-sang Huh, …
• Melbourne: Tom Fifield, Martin Sevior, …
• Krakow: Maciej Sitarz, Mitosz Zdybat, Rafat Grzymkowski, Henryk Polka, …
• KEK, Karlsruhe, etc.

Belle II Data Handling Group meeting

First and third Thursday, 5:00 PM (KEK time)

Belle II computing model

[Figure: Belle II computing model. Sites: KEK, grid sites, local resources, and cloud. Tasks: raw data storage and processing; MC production and ntuple production; ntuple analysis; MC production on cloud (optional). Resources: tape, CPU, disk. Data: raw data, mDST data, mDST MC, ntuples. Components: AMGA, DIRAC, UI, and client (gbast2), with data and tools at each site.]

Data Handling Outlines

[Figure: data handling plan covering KEK and grid sites, with DIRAC]

Belle II metadata system

To construct the DH system for the Belle II experiment:
• To improve the scalability and performance
• To run on a grid farm

⇒ AMGA (ARDA Metadata Grid Application)

[Figure: components shown are AMGA, Data Cache, and DIRAC]

Belle II data cache system

• We make a simple data tool that is not based on a database.
• Event-driven meta-data catalog ⇒ Condition-driven meta-data catalog
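The move from an event-driven to a condition-driven catalog can be pictured with a small sketch. This is only one possible reading of the slide, not the group's actual tool: the row layout `(exp, run, stream, event, lfn)` and the function names `to_condition_catalog` and `files_for` are hypothetical. The idea shown is that per-event rows collapse into one entry per condition, so a lookup touches far fewer records and needs no database.

```python
from collections import defaultdict

def to_condition_catalog(event_rows):
    """Collapse per-event rows (exp, run, stream, event, lfn) into one entry
    per (exp, run, stream) condition. The row layout is an assumption."""
    catalog = defaultdict(lambda: {"files": set(), "n_events": 0})
    for exp, run, stream, _event, lfn in event_rows:
        entry = catalog[(exp, run, stream)]
        entry["files"].add(lfn)
        entry["n_events"] += 1
    return dict(catalog)

def files_for(catalog, exp, run, stream):
    """Return the sorted file names recorded for one condition, or an empty list."""
    entry = catalog.get((exp, run, stream))
    return sorted(entry["files"]) if entry else []
```

With such a structure, a query such as `files_for(catalog, 7, 1, 0)` is a single dictionary lookup in memory, whatever the number of events behind that condition.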

Large Scale data DH test with Belle Data

Input
• # of files: 2013 files
• # of events: 12 M events
• Luminosity: 5792 pb⁻¹
• What queries? Run #, exp #, stream #, ...

Output
We search for the files of interest using one table of the metadata system while changing the number of parallel processes. The search remains linear and stable up to 50 simultaneous parallel processes.
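A test of this kind can be sketched as follows. The sketch uses Python's built-in `sqlite3` as a stand-in, not AMGA, and all names in it are assumptions: the table layout, the file names under `/belle/demo/`, and the helpers `build_catalog`, `search`, and `parallel_search`. It fills one table with 2013 made-up file entries and runs the same exp/stream query from a chosen number of simultaneous clients.

```python
import sqlite3
from concurrent.futures import ThreadPoolExecutor

def build_catalog(path, n_files=2013):
    """Create a one-table, file-level catalog filled with made-up metadata."""
    con = sqlite3.connect(path)
    con.execute(
        "CREATE TABLE files (lfn TEXT, exp INTEGER, run INTEGER, stream INTEGER)"
    )
    rows = [
        (f"/belle/demo/file{i:05d}.mdst", 7 + i % 3, i // 3, i % 6)
        for i in range(n_files)
    ]
    con.executemany("INSERT INTO files VALUES (?, ?, ?, ?)", rows)
    con.commit()
    con.close()

def search(path, exp, stream):
    """One client search: open a private connection and select matching files."""
    con = sqlite3.connect(path)
    try:
        cur = con.execute(
            "SELECT lfn FROM files WHERE exp = ? AND stream = ? ORDER BY run",
            (exp, stream),
        )
        return [row[0] for row in cur]
    finally:
        con.close()

def parallel_search(path, n_clients, exp=7, stream=0):
    """Run the same search from n_clients threads and collect every result."""
    with ThreadPoolExecutor(max_workers=n_clients) as pool:
        return list(pool.map(lambda _: search(path, exp, stream), range(n_clients)))
```

Timing `parallel_search` for 1, 10, 20, and 50 clients against a catalog file created with `build_catalog` would give a curve comparable in shape to the one described, although absolute numbers from SQLite on one machine say nothing about AMGA on a grid.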

Large Scale data DH test with Belle II Data (randomly generated)

Input: 70,000 files (140 TB)

• With one table and multi-processing
  – Generation rate: 400 files/sec
• With 30 tables and multi-processing
  – Generation rate: 400 files/sec

The search remains linear and stable up to 50 simultaneous parallel processes. The performance is almost the same with one table as with 30 tables.
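The quoted figures imply two simple derived numbers, worked out below. The only assumptions are decimal units (1 TB = 1000 GB) and that the 400 files/sec figure is the sustained rate at which file entries are written.

```python
# Figures quoted on the slide
N_FILES = 70_000
TOTAL_SIZE_TB = 140
GENERATION_RATE = 400  # file entries written per second

# Decimal units assumed: 1 TB = 1000 GB
avg_file_size_gb = TOTAL_SIZE_TB * 1000 / N_FILES
fill_time_s = N_FILES / GENERATION_RATE

print(f"average file size: {avg_file_size_gb:.1f} GB")      # 2.0 GB
print(f"time to register all files: {fill_time_s:.0f} s")   # 175 s
```

So the test sample corresponds to files of about 2 GB each, and registering all 70,000 entries takes roughly three minutes at the quoted rate.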

The interface between HLT and Storage ⇒ To apply AMGA

[Figure: the Belle II computing model diagram, with Detector, DAQ, and HLT added and LFC shown alongside AMGA. Rates marked on the figure: 30 kHz and 6 kHz.]

• We assume two files/sec for both reading from and writing to AMGA.
• Read-write optimization for meta-data
• Generation (writing only): 400 files/sec
• To test: reading performance at 1 Hz, 2 Hz, 10 Hz, 50 Hz, and 100 Hz
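The assumed load of 2 files/sec is 200 times below the 400 files/sec measured for writing, so the open question is the read side. A read test at fixed rates could be driven by a small pacing loop like the sketch below. It is an illustration only: `run_at_rate` and its `query` argument are hypothetical names, and `query` stands for whatever call performs one metadata read.

```python
import time

def run_at_rate(query, rate_hz, n_requests, clock=time.monotonic, sleep=time.sleep):
    """Call query() n_requests times, paced at rate_hz, and return the
    achieved rate in Hz. If query() is slower than the pacing period,
    the achieved rate drops below rate_hz."""
    period = 1.0 / rate_hz
    start = clock()
    for i in range(n_requests):
        # Wait until the scheduled start time of request i.
        delay = start + i * period - clock()
        if delay > 0:
            sleep(delay)
        query()
    # Let the last period finish so a fast query() reports exactly rate_hz.
    remaining = start + n_requests * period - clock()
    if remaining > 0:
        sleep(remaining)
    elapsed = clock() - start
    return n_requests / elapsed
```

Calling `run_at_rate(query, rate, n)` for each rate in 1, 2, 10, 50, and 100 Hz and comparing the returned value with the requested rate shows where the reads stop keeping up. The `clock` and `sleep` parameters exist so the loop can be checked with a simulated clock.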

Products: Conference talks

1. The Advanced Data Searching System with AMGA at the Belle II Experiment, J. H. Kim, CCP2009, Kaohsiung, Taiwan, Dec. 2009
2. Data Cache System at Belle II Experiment, K. Cho, CCP2010, Trondheim, Norway, June 2010
3. Belle II Data Management System, K. Cho, CHEP 2010, Taipei, Taiwan, Nov. 2010

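Key achievement: leading an international experiment that probes the origin of the universe, with the results published in an SCI journal

• Kihyeon Cho serves as chair of the Belle II DH working group, which has about 30 members from 12 institutes in 9 countries.
• The system was developed under KISTI's lead, and the results were published in the international SCI journal Computer Physics Communications (IF: 1.958), with Junghyun Kim as first author and Kihyeon Cho as corresponding author.
• International collaborative research: participation ⇒ leadership ⇒ research results
• Service pipeline of cooperation between departments ⇒ listed as co-authors
  – AMGA: Infrastructure Technology Development Department (Sunil Ahn, Soonwook Hwang)
  – Farm: Global Hub Center (Beobkyun Kim, Jinseung Yu, Heejun Yoon, Haengjin Jang)

S. Ahn, K. Cho, S.W. Hwang, J. Kim* et al., JKPS 56 (2010.10.15)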

Press coverage (2010.04.07)

Belle II DH Group – Daeduk net (2010.4.6)

http://www.hellodd.com

Press coverage (2010.10.25): 13 times

The 3rd Belle II Computing Workshop

• Date: November 22-24 (Mon.-Wed.), 2010
  – 1st day (Monday): Off-line and HLT software
  – 2nd day (Tuesday): Distributed Computing and Data Handling
  – 3rd day (Wednesday): morning, overflow; afternoon, excursion
• Place: KISTI, Daejeon, Korea
• As chair of the Belle II DH group, the HEP team @ KISTI hosts the workshop.
• Held just after the Belle II General Meeting @ KEK on Nov. 17-20

Summary

To handle 50 times more data than Belle, the Belle II Data Handling Group works on:

• Metadata system and data cache system based on grids
• Test of Large Scale Data Handling
  – Belle II data
  – Belle data at KEK
• Application of AMGA at HLT
• Data Handling and Job Management for Belle II Grid

⇒ This is a great success!
⇒ Keep going

Thank you.
