belle ii data handling system kihyeon cho * (on behalf of the belle ii computing group) *presenter...
Post on 19-Dec-2015
219 views
TRANSCRIPT
Belle II Data Handling System
Kihyeon Cho*
(on behalf of the Belle II Computing Group)
*PresenterHigh Energy Physics Team
KISTI (Korea Institute of Science and Technology Information)
October 28, 2010NCC Seminar, Goyang, Korea
1
Kihyeon Cho
Contents
High Energy Physics Team @ KISTIBelle II ExperimentBelle II Data Handling System
Meta-data system Data cache system To test Large Scale Data Handling
With Belle Data With Belle II Data (Random data)
The interaction between HLT and StorageSummary
2
Kihyeon Cho
High Energy Physics Team @ KISTI
3
High Energy Physics Team (Physicists) • Kihyeon Cho – P.I.• Soo-hyeon Nam - Senior Researcher • Junghyun Kim - Senior Researcher • YoungJin Kim - Postdoc • Taegil Bae - Postdoc• Jihye Moon - Researcher
Former members• Yongseok Oh – Prof. of Physics @ KNU • Daejung Kong - Contract Prof. @ KNU • Ilsung Cho - Contract Prof. @ KyungHee U. • Minho Jeung - Company • Hyunwoo Kim - Computing Team @ KISTI
Kihyeon Cho
e-Sci-ence
Theory
Comput-ing
Ex-peri-men
t
To probe the Standard Model and search for New Physics
CDFBelle/Belle II
cf. LHCb
High Energy Physics Team @ KISTI
K.Cho and H.W.Kim, JKPS (2009)
Kihyeon Cho5
LHC@CERN,Europe
RHIC@BNL, USA
Belle*/Belle II* @ KEK, Japan
CDF*@FNAL, USA
e-Science @KISTI
IN2P3@France
국제 공동연구
한•불 입자물리연구소
*Collaboration• 한불입자물리연구소 CDF 그룹 한국 파트너 ( 조기현 )• Belle II Data Handling 워킹그룹장 ( 조기현 )
Kihyeon Cho
Three approachesto
New Physics
New particlesand new interactions
Leptonphysics
Energy frontier experiments
LHC, ILC, …
LHC
ILC
Higgs, SUSY, Dark matter, New understanding of space-time…
n exp., m LFV, t LFV,gm-2, …
Neutrino mixing/masses, Lepton number non-
conservation…
J-PARC
Quark flavorphysics
High Energy Physics in the LHC era
KEKB upgrade
CP asymmetry, Baryogenesis,Left-right symmetry, New sources
of flavor mixing…
Super-B Factory, K exp., etc.
M. Yamakuchi, Belle II meeting (2008)
Kihyeon Cho
Heavy Flavor Physics Experi-ments
7
Belle/Belle II CDF LHCb
Year 1998-2010 (Belle)2014 – (Belle II)
2001- 2009-
Place KEK, Japan Fermilab, USA CERN, Europe
Collabora-tion
13/47/~300(Belle II)(Nat./Ins./member)
15/63/620 15/54/730
σ 1 nb (10GeV)
150 μb (2TeV)
300~500 μb(7~14TeV)
CurrentLuminosity
1 ab-1 8 fb-1 180nb-1
Kihyeon Cho
Belle Content Belle II
1998~2010 Time Schedule 2014~
1 ab-1 Luminosity 50 ab-1
1 Billion events 50 Billion
CP measurement Goal New Physics
Belle
Belle II
8
Belle vs. Belle II
To handle 50 times more data and to use grids ⇒ New data handling system
BB
Kihyeon Cho9
Major achievements Expected at Super B Factory
Start 2012 => 2014
Luminosity vs. Physics
Kihyeon Cho 10
Belle II Time schedule
Hara San
Kihyeon Cho
Count down
2014 April – Collider turn on2012 February – Ready for Computing2011 Summer – Rough final system2010 Summer – Prototype system
Spec & Test
2010.5 PAC – Answers for questionsComputing & Pixel
11
Kihyeon Cho
Belle II Computing
12=> To handle data, we need new data handling system.
Kihyeon Cho
Belle II Data Handling Group(Chair: Kihyeon Cho)
KISTIHEP Team
– JungHyun Kim, Kihyeon Cho, YoungJin Kim, …
AMGA Team – Soonwook Hwang, Sunil Ahn, HanGi Kim, Tae-sang Huh, …
Melbourne Tom Fifield, Martin Sevior, …
KrokowMaciej Sitarz, Mitosz Zdybat, Rafat Grzymkowski, Henryk Polka, …
KEK, Karlsruhe, etc…
13
Kihyeon Cho
Belle II Data Handling Group meeting
14
First and Third Thurs-day 5:00 PM (KEK Time)
Kihyeon Cho
KEK
Grid Site Grid Site
Local Resources Local Resources Local Resources Local Resources
Ntuple Analysis
MC ProductionAnd Ntuple Production
Raw Data StorageAnd Processing
Cloud
MC Production(optional)
AMGA
DIRAC
UI
TapeCPU
Disk
Raw DatamDST Data
mDST MC
Ntuples
Data
Tools
Data
Tools
Data Tools
Data
Tools
Client
gbast2
Belle II computing model
Kihyeon Cho
Data Handling Outlines
16
KEK Grid sites
plan
DIRAC
Kihyeon Cho
To construct the DH system for Belle II experiment• To improve the scalability
and performance
• To run based on grid farm
⇒ AMGA (Arda Metadata Catalog for Grid Application)
AMGA
Data Cache
17
Belle II metadata system
DIRAC
DIRAC
Kihyeon Cho
• We make the simple data tool which is not based on database.
18
Belle II data cache system
Event-driven meta-data catalog ⇒ Condition-driven meta-data catalog
Kihyeon Cho
Large Scale data DH test with Belle Data
We perform searching for the interesting files with a table of meta-system and changing number of parallel processing.The linearity of search is stable up to 50 parallel simultaneous processing.
19
• # of files: 2013 files• # of events: 12 M events• # of luminosity: 5792 pb-1
• What queries? - run #, exp#, stream#...
InputOutput
Kihyeon Cho 20
Large Scale data DH test with Belle II Data (Random generating)
• With a table and multi-processing• Generating time: 400 files/sec
• With 30 multi-tables and multi-process-ing
• Generating time: 400 files/sec
Input: 70,000 files (140TB) The linearity of search is stable up to 50 parallel simultane-ous processing.It is almost same between using a table and using 30 multi-tables.
Kihyeon Cho
KEK
Grid Site Grid Site
Local Resources Local Resources Local Resources Local Resources
Ntuple Analysis
MC ProductionAnd Ntuple Production
Raw Data StorageAnd Processing
Cloud
MC Production(optional)
Detector
DAQ
HLT
LFCAMGA
AMGA
DIRAC
UI
TapeCPU
Disk
Raw DatamDST Data
mDST MC
Ntuples
Data
Tools
Data
Tools
Data Tools
Data
Tools
Client
gbast2
The interface between HLT and Storage=> To apply AMGA
We assume two files/sec for both reading and writing for AMGA.Read-write optimization for meta-data
Generating for writing only 400 files/secTo test reading performance for 1Hz, 2Hz, 10Hz, 50Hz and 100 Hz
30kHz
6kHz
Kihyeon Cho
ProductsConference talks
1. The Advanced Data Searching System with AMGA at the Belle II Experiment
J. H. KimCCP2009, Gaushung, Taiwan, Dec., 2009
2. Data Cache System at Belle II experimentK. ChoCCP2010, Trondheim, Norway, June 2010
3. Belle II Data Management system K. ChoCHEP 2010, Taipei, Taiwan, Nov. 2010
22
대표 성과 우주 기원 밝히는 국제실험 주도 그 연구 성과 SCI 논문 출판
9 개국 12 개 연구기관의 30 여명의 Belle II DH 워킹그룹장 ( 조기현 ) 수행
KISTI 주도로 시스템 개발 그 연구성과 김정현 ( 제 1 저자 ), 조기현 ( 교신저자 ) 로 해외 SCI 저널
Computer physics Communication(IF: 1.958) 에 출판 국제공동연구 참여 => 주도 => 연구성과 도출
실간의 협력 서비스 파이프 라인 => 공동저자 등재 AMGA 관련 기반기술 개발실 ( 안선일 , 황순욱 ) 팜 관련 글로벌허브센터 ( 김법균 , 유진승 , 윤희준 , 장행진 )
S. Ahn, K.Cho, S.W.Hwang, J.Kim* et al, JKPS 56 (2010.10.15)
언론 보도 (2010.04.07)
Kihyeon Cho
Belle II DH Group – Daeduk net (2010.4.6)
25
http://www.hellodd.com
언론보도 (2010.10.25) 13 회
26
Kihyeon Cho
The 3rd Belle II Computing Workshop
Date: November 22-24 (Mon-Wed.) 2010 1st day (Monday): Off-line and HLT software 2nd day (Tuesday): Distributed Computing and Data Handling 3rd day (Wednesday)
Morning – Overflow Afternoon- Excursion
Place: KISTI, Daejeon, Korea As a chair of Belle II DH group, HEP team@ KISTI hosts Just after Belle II General Meeting @KEK, on Nov. 17-20
27
Kihyeon Cho
Summary
Metadata system and data cache system based on GridsTest of Large Scale Data Handling
Belle II DataBelle Data at KEK
Application of AMGA at HLTData Handling and Job Management for Belle II Grid
Þ This is a great success!Þ Keep going
28
In order to handle 50 times more data of Belle, Belle II Data Handling Group works on: