amga training in egi tf 2011...amga training in egi tf 2011 • place : st clair 4, lyon conference...

22
AMGA Training in EGI TF 2011 Place : St Clair 4, Lyon Conference Centre Date: 2011.09.19, Monday Presenters SoonwooK Hwang, Taesang Huh, Geunchul Park, KISTI Timetable Session 1 (14:00 ~ 15:30) Overview of AMGA Hands-on Exercise on AMGA CLI client tool Coffee Break (15:30 ~ 16:00) Session 2 (16:00 ~ 17:30) Overview of AMGA GUI client called AMGA Manager Hands-on Exercise on AMGA Manager 1

Upload: others

Post on 04-Jun-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: AMGA Training in EGI TF 2011...AMGA Training in EGI TF 2011 • Place : St Clair 4, Lyon Conference Centre • Date: 2011.09.19, Monday • Presenters –SoonwooK Hwang, Taesang Huh,

AMGA Training in EGI TF 2011

• Place : St Clair 4, Lyon Conference Centre

• Date: 2011.09.19, Monday

• Presenters

– SoonwooK Hwang, Taesang Huh, Geunchul Park, KISTI

• Timetable

– Session 1 (14:00 ~ 15:30)

Overview of AMGA

Hands-on Exercise on AMGA CLI client tool

– Coffee Break (15:30 ~ 16:00)

– Session 2 (16:00 ~ 17:30)

Overview of AMGA GUI client called AMGA Manager

Hands-on Exercise on AMGA Manager

1

Page 2: AMGA Training in EGI TF 2011...AMGA Training in EGI TF 2011 • Place : St Clair 4, Lyon Conference Centre • Date: 2011.09.19, Monday • Presenters –SoonwooK Hwang, Taesang Huh,

Overview of AMGA(the EMI Metadata Software Product)

KISTI Supercomputing Center

Soonwook Hwang

2011. 09. 19

Page 3: AMGA Training in EGI TF 2011...AMGA Training in EGI TF 2011 • Place : St Clair 4, Lyon Conference Centre • Date: 2011.09.19, Monday • Presenters –SoonwooK Hwang, Taesang Huh,

3

Contents

• A Brief Overview of AMGA

• Main Features

• AMGA Client Tools

• AMGA Usage in Applications

• References

Page 4: AMGA Training in EGI TF 2011...AMGA Training in EGI TF 2011 • Place : St Clair 4, Lyon Conference Centre • Date: 2011.09.19, Monday • Presenters –SoonwooK Hwang, Taesang Huh,

4

A Brief Overview of AMGA

• The EMI metadata software product

• KISTI is responsible for the maintenance and evolution of AMGA in EMI

• EMI service to handle metadata on the Grid– Access to Metadata for files distributed on the Grid

– A simplified general access to relational data stored in database systems.

Page 5: AMGA Training in EGI TF 2011...AMGA Training in EGI TF 2011 • Place : St Clair 4, Lyon Conference Centre • Date: 2011.09.19, Monday • Presenters –SoonwooK Hwang, Taesang Huh,

5

History of AMGA (1/2)

• 2004 – the ARDA project evaluated existing Metadata Services from HEP experiments – AMI (ATLAS), RefDB (CMS), Alien Metadata Catalogue (ALICE)

– Similar goals, similar concepts

– Each designed for a particular application domain

Reuse outside intended domain difficult

– Several technical limitations: large answers, scalability, speed, lack of flexibility

• ARDA proposed an interface for Metadata access on the GRID – Based on requirements of LHC experiments

– But generic - not bound to a particular application domain

– Designed jointly with the gLite/EGEE team

Page 6: AMGA Training in EGI TF 2011...AMGA Training in EGI TF 2011 • Place : St Clair 4, Lyon Conference Centre • Date: 2011.09.19, Monday • Presenters –SoonwooK Hwang, Taesang Huh,

6

History of AMGA (2/2)

What is AMGA ? (ARDA Metadata Grid Application)

• Began as prototype to evaluate the Metadata Interface

– Evaluated by community since the beginning:

– Matured quickly thanks to users feedback

• Now part of gLite middleware and EMI distribution

• Requirements from HEP community

– Millions of files, 6000+ users, 200+ computing centres

– Mainly (real-only) file metadata

– Main concerns : scalability, performance, fault-tolerance, Support for Hierarchical Collection

• Requirements from Biomed community

– Smaller scale than HEP

– Main concerns : Security

Page 7: AMGA Training in EGI TF 2011...AMGA Training in EGI TF 2011 • Place : St Clair 4, Lyon Conference Centre • Date: 2011.09.19, Monday • Presenters –SoonwooK Hwang, Taesang Huh,

7

Metadata User Requirements

• I want to

– store some metadata information, e.g., about

files

In a structured way

– query a system about those information

– keep information about jobs running on the

Grid

I want my jobs to have read/write access to those

information using the grid proxy certificate

– NOT use a database

Page 8: AMGA Training in EGI TF 2011...AMGA Training in EGI TF 2011 • Place : St Clair 4, Lyon Conference Centre • Date: 2011.09.19, Monday • Presenters –SoonwooK Hwang, Taesang Huh,

8

Metadata Concepts in AMGA

• Entries

– Representation of real world entities, which we are attaching metadata to describe

them

• Attributes

– Type : Int, float, string

– Name/Key : the name of the attribute

– Value : Value of an entry’s attributes

• Schema

– a set of attributes

• Collections

– A set of entries associated with schema

– Can be organized hierarchically containing sub collections

• Query

– SELECT ... WHERE ... clause in SQL-like or SQL query language

Page 9: AMGA Training in EGI TF 2011...AMGA Training in EGI TF 2011 • Place : St Clair 4, Lyon Conference Centre • Date: 2011.09.19, Monday • Presenters –SoonwooK Hwang, Taesang Huh,

9

AMGA Analogies to RDBMS & File System

• Analogy to the RDBMS – Collection table

– Schema table schema

– Attribute schema column

– Entry table row/record

• Analogy to file system– Collection Directory

– Entry File

Page 10: AMGA Training in EGI TF 2011...AMGA Training in EGI TF 2011 • Place : St Clair 4, Lyon Conference Centre • Date: 2011.09.19, Monday • Presenters –SoonwooK Hwang, Taesang Huh,

10

Main Features

• AMGA server implemented as a C++ multiprocess server– back-end : Oracle, PostgreSQL, MySQL, SQLite

– front-end : TCP Streaming, WS-DAIR (SOAP)

• Streamed Bulk Operations

• Import existing databases

• Support for both AMGA SQL-like and Native SQL Query

TCP

Streaming

WS-DAIR

(SOAP)

MD

Server

Oracle

Postgre

SQL

MySQL

SQLite

AMGA Metadata Server

Client

Client

Page 11: AMGA Training in EGI TF 2011...AMGA Training in EGI TF 2011 • Place : St Clair 4, Lyon Conference Centre • Date: 2011.09.19, Monday • Presenters –SoonwooK Hwang, Taesang Huh,

11

Main Features

Security in AMGA

• Integration with Grid Security :

Grid proxy authentication and

VOMS authorization

• Secure client connection using

SSL

• Authorization using ACLS with

support for user and group

management

Page 12: AMGA Training in EGI TF 2011...AMGA Training in EGI TF 2011 • Place : St Clair 4, Lyon Conference Centre • Date: 2011.09.19, Monday • Presenters –SoonwooK Hwang, Taesang Huh,

Main Features

Replication in AMGA

• Metadata collections can be replicated to improve reliability,

scalability and performance

– Partial/full Replication:

– Master-Slave & Asynchronous communication model

12

Page 13: AMGA Training in EGI TF 2011...AMGA Training in EGI TF 2011 • Place : St Clair 4, Lyon Conference Centre • Date: 2011.09.19, Monday • Presenters –SoonwooK Hwang, Taesang Huh,

Main Features

Federation in AMGA

• A mechanism to integrate distributed metadata

– Provide a user with a virtualized view on metadata

• Two types: server side, client side

13

Page 14: AMGA Training in EGI TF 2011...AMGA Training in EGI TF 2011 • Place : St Clair 4, Lyon Conference Centre • Date: 2011.09.19, Monday • Presenters –SoonwooK Hwang, Taesang Huh,

14

AMGA Client Tools

• AMGA CLI (Command Line Interface) tools

– mdclient

Interactive CLI tool to interact with AMGA server

– mdcli

Non-interactive, one-shoot command line tool

used in the shell script to access AMGA metadata

• AMGA GUI Client

– AMGA manager

Easy-to-use, general purposed GUI interface to interact with

AMGA

• Programming API– Support for C/C++, Java, Python, Perl, PHP API

Page 15: AMGA Training in EGI TF 2011...AMGA Training in EGI TF 2011 • Place : St Clair 4, Lyon Conference Centre • Date: 2011.09.19, Monday • Presenters –SoonwooK Hwang, Taesang Huh,

AMGA Manager

15

• Easy-to-use, GUI client tool

– Interactive, easier and more intuitive metadata handling

Page 16: AMGA Training in EGI TF 2011...AMGA Training in EGI TF 2011 • Place : St Clair 4, Lyon Conference Centre • Date: 2011.09.19, Monday • Presenters –SoonwooK Hwang, Taesang Huh,

AMGA Usage in Applications

• Has been adopted by many user communities for their

metadata services including MDM, Belle II, WISDOM, e-

Health-Child and so on

• In 2010, downloaded 607 times from ~110 different sites

around the world

16

Medical Data Manager

Digital Library

Climate Research

Drug Discovery

High Energy Physics

# of AMGA Download in 2010

Page 17: AMGA Training in EGI TF 2011...AMGA Training in EGI TF 2011 • Place : St Clair 4, Lyon Conference Centre • Date: 2011.09.19, Monday • Presenters –SoonwooK Hwang, Taesang Huh,

AMGA Usage in Biomed

• Medical Data Manager – MDM– Store and access medical images

and associated metadata on the Grid

• Strong security requirements– Patient data is sensitive

– Data must be encrypted

– Metadata access must be restricted to authorized users

• AMGA used as metadata server– Demonstrates authentication, fine-

grained access control and encrypted access

– Used as a simplified DB

ImagesGUID Date

PatientID Doctor

DoctorName Hospital

Patient

Page 18: AMGA Training in EGI TF 2011...AMGA Training in EGI TF 2011 • Place : St Clair 4, Lyon Conference Centre • Date: 2011.09.19, Monday • Presenters –SoonwooK Hwang, Taesang Huh,

18

AMGA Use Case in Belle-II

Metadata download

E25 E27 E31 E21

E07 E09 E11 E13

E19 E21 E23 E17

E13 E15 E17 E19

AMGA server 1

AMGA server 2

AMGA server 3

AMGA server 4

E07

E13

E19

E25

local

AMGA server1

local

AMGA server2

Tier 0 : KEK

Tier 1

s1

Personal AMGAWeb Service

Metadata publish

metadata accessmetadata replicationmetadata redirection

Page 19: AMGA Training in EGI TF 2011...AMGA Training in EGI TF 2011 • Place : St Clair 4, Lyon Conference Centre • Date: 2011.09.19, Monday • Presenters –SoonwooK Hwang, Taesang Huh,

AMGA Usage as Task Manager in WISDOM

• WISDOM was International initiative to deploy large-

scale in-silico docking jobs on the grid

19

Page 20: AMGA Training in EGI TF 2011...AMGA Training in EGI TF 2011 • Place : St Clair 4, Lyon Conference Centre • Date: 2011.09.19, Monday • Presenters –SoonwooK Hwang, Taesang Huh,

AMGA Usage as Checkpoint Server in QCD

• QCD simulation is a long-running job lasting ~10 days

to complete

– Application-level checkpointing is required to successfully deploy

and run such long-duration jobs on the grid

20

Page 21: AMGA Training in EGI TF 2011...AMGA Training in EGI TF 2011 • Place : St Clair 4, Lyon Conference Centre • Date: 2011.09.19, Monday • Presenters –SoonwooK Hwang, Taesang Huh,

References

• AMGA Web Site

– http://amga.web.cern.ch/amga/

• EMI Software Repository

– http://emisoft.web.cern.ch/emisoft/index.html

21

Page 22: AMGA Training in EGI TF 2011...AMGA Training in EGI TF 2011 • Place : St Clair 4, Lyon Conference Centre • Date: 2011.09.19, Monday • Presenters –SoonwooK Hwang, Taesang Huh,

2222

[email protected]