GRACE Project IST-2001-38100 EGAAP meeting – Den Haag, 25/11/2004 Giuseppe Sisto – Telecom Italia Lab


Page 1: GRACE  Project IST-2001-38100

GRACE Project IST-2001-38100

EGAAP meeting – Den Haag, 25/11/2004

Giuseppe Sisto – Telecom Italia Lab

Page 2: GRACE  Project IST-2001-38100


Overview

• Project title: GRACE (Grid Search and Categorization Engine)

• FP5

• 1/9/2002 - 28/2/2005 (Duration: 30 months)

• GRACE Website http://www.grace-ist.com

Page 3: GRACE  Project IST-2001-38100

Partners

• Both industrial and scientific:
– Telecom Italia (TI)
– GL Europe 2006 (GL)
– European Organization for Nuclear Research (CERN)
– Sheffield Hallam University (SHU)
– Stockholm University Library
– Stuttgart University Library

Page 4: GRACE  Project IST-2001-38100


Project Objectives

• Research and Knowledge Management tool

• Distributed search and categorization engine based on Grid Technology that enables just‑in‑time, flexible allocation of data and computational resources

• Handles unstructured textual information (text files, documents, Web pages, text stored in databases) in a Grid environment

Page 5: GRACE  Project IST-2001-38100

Application Workflow

[Workflow diagram. Labels: Frontend, Application, Grid. Processing steps shown: Query Submitting, Query Routing, Querying, Search Results Parsing, Downloading, Text Normalizing, Categorizing Search Results, Displaying. Content sources: External Content Sources, Internal Content Sources.]
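The step names in the workflow diagram can be read as a pipeline of stages. The following is a minimal sketch only: the stage order, the `run_workflow` and `normalize_text` names, and the shape of the state dictionary are all assumptions made for illustration, since the slide gives only the labels and not how the stages are wired together.

```python
from typing import Any, Callable, Dict, List

State = Dict[str, Any]

# Stage names are taken from the diagram; the ORDER below is an assumption.
STAGE_ORDER: List[str] = [
    "QuerySubmitting",
    "QueryRouting",
    "Querying",
    "SearchResultsParsing",
    "Downloading",
    "TextNormalizing",
    "CategorizingSearchResults",
    "Displaying",
]


def normalize_text(state: State) -> State:
    """Hypothetical stand-in for TextNormalizing: collapse whitespace, lowercase."""
    docs = [" ".join(str(doc).split()).lower() for doc in state["documents"]]
    return {**state, "documents": docs}


def run_workflow(
    query: str,
    handlers: Dict[str, Callable[[State], State]],
    documents: List[str],
) -> State:
    """Run every stage in order; stages without a handler are passed through."""
    state: State = {"query": query, "documents": list(documents), "trace": []}
    for name in STAGE_ORDER:
        handler = handlers.get(name)
        if handler is not None:
            state = handler(state)
        # Record which stages were visited, in order.
        state["trace"] = state["trace"] + [name]
    return state


result = run_workflow(
    "grid search",
    {"TextNormalizing": normalize_text},
    ["  Grid   Technology "],
)
```

Keeping each stage as a plain function over a shared state makes it easy to swap a local implementation for one that submits a Grid job instead.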

Page 6: GRACE  Project IST-2001-38100

Requirements

• No special requirements to run GRACE Grid jobs:
– no specific software to be installed

• Current solution does not require the application to be permanently installed on the Grid
– At present, it is sent through the job InputSandbox together with the input files

• Need to provide transparent use of the Grid and easy access to the application during the validation phase. This could raise certification issues.

GRACE therefore adopted a solution based on the use of a generic application certificate
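As an illustration of shipping the application through the InputSandbox, the sketch below generates a job description in the JDL syntax used by EDG/LCG-2 middleware. The file names (`grace_job.sh`, `query.txt`, `results.xml`) and the `build_jdl` helper are hypothetical; the slides do not show GRACE's actual job descriptions.

```python
from typing import Iterable, List


def build_jdl(
    executable: str,
    arguments: str,
    input_files: Iterable[str],
    output_files: Iterable[str],
    vo: str = "gilda",
) -> str:
    """Return JDL text for a job whose executable travels in the InputSandbox."""

    def quoted_list(items: List[str]) -> str:
        return "{" + ", ".join(f'"{item}"' for item in items) + "}"

    # The executable is listed in the sandbox, so nothing has to be
    # pre-installed on the worker node (assumption: it is a local file).
    sandbox_in = [executable] + list(input_files)
    sandbox_out = ["std.out", "std.err"] + list(output_files)
    lines = [
        f'Executable = "{executable}";',
        f'Arguments = "{arguments}";',
        'StdOutput = "std.out";',
        'StdError = "std.err";',
        f"InputSandbox = {quoted_list(sandbox_in)};",
        f"OutputSandbox = {quoted_list(sandbox_out)};",
        f'VirtualOrganisation = "{vo}";',
    ]
    return "\n".join(lines)


# Hypothetical file names, for illustration only.
jdl = build_jdl("grace_job.sh", "query.txt", ["query.txt"], ["results.xml"])
print(jdl)
```

The generated text would be saved to a `.jdl` file and handed to the middleware's job submission tool from a User Interface machine.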

Page 7: GRACE  Project IST-2001-38100


GRACE Web Interface

Page 8: GRACE  Project IST-2001-38100

Timing

• Collaboration with EDG and EGEE in order to be constantly aligned with their plans and developments

[Timeline figure covering 2001 to 2005. Project phases: Requirements, Design, Implementation, Verification, Deployment, Basic Tests, Validation, Evaluation. Testbeds: TB0, TB1, TB2, TB3. Middleware: Globus 1, EDG 1, EDG 1.4, EDG 2, EGEE TB, LCG-2, OGSA. Date markers: Jan01, Sep02, Mar04, Apr04, Feb05.]

Page 9: GRACE  Project IST-2001-38100


Current Status

• GRACE application prototype has been implemented based on LCG-2 Grid middleware

• Two GRACE nodes have been added to the GILDA testbed with the support of the GILDA team

• Infrastructure:

– 2 sites: Turin (TI) and Milan (GL). An additional GRACE Grid site at Sheffield (SHU) is being explored

– Partners’ Registration Authorities set up for TI in Turin and GL in Milan. SHU RA to be discussed

– Software configured for GILDA VO, in collaboration with GILDA support team

– The necessary user and server certificates have been obtained.

– GRACE Grid sites tested and certified in GILDA

Page 10: GRACE  Project IST-2001-38100

Current Status

Grid Elements running in TILAB:
– Computing Element (CE): cetilab.tilab.com
– Worker Nodes (WN): wntilab.tilab.com, wn1tilab.tilab.com, wn2tilab.tilab.com
– Storage Element (SE): setilab.tilab.com
– User Interface (UI): uitilab.tilab.com

Grid Elements running in GL2006:
– Computing Element (CE): gridgl001.gl2006europe.com
– Worker Nodes (WN): gridgl011.gl2006europe.com, gridgl012.gl2006europe.com, gridgl013.gl2006europe.com
– Storage Element (SE): gridgl002.gl2006europe.com
– User Interface (UI): gridgl003.gl2006europe.com

Page 11: GRACE  Project IST-2001-38100

Current Status

• Application:
– GRACE Grid jobs tested on GILDA
– GRACE Grid jobs integrated with GRACE application interface through EDG Java API calls
– First prototype demo available
– The application is still unstable and needs further testing and optimization, as well as the integration of other functionalities

Page 12: GRACE  Project IST-2001-38100

Commitment in the next 6 Months

By the end of the project (02/2005):
– Final prototype integrating all GRACE functionalities
– Output of validation and evaluation activity

Objectives and future steps for the next few months are:
– Integrate a number of relevant content sources related to the selected Knowledge Domains: Physics, Computer Science and Engineering. Other Knowledge Domains can easily be added afterwards
– Add SHU site to the testbed
– Complete performance tests
– Run and complete the final validation and evaluation phase

Support with the configuration of the GRACE site in Sheffield and with the finalization of GRACE performance tests on GILDA would be helpful to the project

Page 13: GRACE  Project IST-2001-38100

Collaborations

• GIR (GGF Grid Information Retrieval)
– Similar approach regarding architecture
– GRACE is more powerful as far as multilingualism and semantics/ontologies are concerned; GIR is more aligned with the GGF/OASIS evolution
– Exchange of documentation (D1.1 User Requirements)
– GRACE representative in GIR

• EGEE

• Crossgrid
– Possible integration of Crossgrid tools to increase efficiency and improve performance

Page 14: GRACE  Project IST-2001-38100

Added Value for GRACE Community to run on EGEE

• GRACE doesn’t have the necessary resources and expertise to deploy a fully functional Grid infrastructure

• EGEE/GILDA is the project's main middleware and infrastructure provider

• Running at a Europe-wide scale enables just-in-time, flexible allocation of computational and data storage resources, making terabytes of information distributed across a large number of geographically distant locations highly accessible

Page 15: GRACE  Project IST-2001-38100


Interest of EGEE in supporting the Application

• GRACE is contributing to the infrastructure with the integration of some GRACE Grid nodes

• One of the first applications outside the EGEE pilot application group to have integrated with the EGEE infrastructure

– GRACE has provided concrete and early feedback on the usage of the Grid

• GRACE merges the interests of user scientific communities of different domains

– GRACE can contribute to the dissemination of Grid awareness through new communities

Page 16: GRACE  Project IST-2001-38100


Expected Outcomes

• At the end of the project, GRACE will produce a fully functional prototype together with installation, configuration and usage instructions

• GRACE represents a first experience of Information Retrieval over the Grid and will provide feedback to the IR and KM research communities

Page 17: GRACE  Project IST-2001-38100

Next?

• Toolkit Improvement (robustness, scalability,…)

• Exploitation
– Content Providers (Infoproviders, Libraries, Research Institutes,…)
– Public Administration

• Grid Infrastructure?

Page 18: GRACE  Project IST-2001-38100


Contact Persons

• Project Coordinator: Maurizio Cecchi (TI) [email protected]

• Project Mailing List: [email protected]

• Grid integration: Roberta Faggian (CERN) [email protected]

• TI Grid infrastructure: Giuseppe Sisto (TI) [email protected]

• GL Grid infrastructure: Lorenzo Gianoli (GL) [email protected]

• SHU Grid infrastructure: Mehrdad Naderi (SHU) [email protected]