Pedro Andrade > IT-GD > D4Science Pedro Andrade CERN European Organization for Nuclear Research GD Group Meeting 27 October 2007 CERN (Switzerland)
2 Outline Who I am What is D4Science VREs, Infrastructure, gCube, Example CERN and D4Science Status What I do in GD D4Science responsibilities Interactions within GD (EGEE and ETICS) Pedro Andrade > IT-GD > D4Science 27 October 2008, CERN
3 Who I am
4 Short CV Name: Pedro Manuel Rodrigues de Sousa Andrade you already know… or maybe just 20% of it Nationality: Portuguese you can easily guess looking at the name Born: Lagares, Portugal you don’t know… even google knows very little Studied: Porto, Portugal you know at least the wine and the football club… I hope Pedro Andrade > IT-GD > D4Science 27 October 2008, CERN
5 Short CV PORTO University of Porto Student of Informatics and Computing Engineering 5 years GENEVA CERN Trainee at the ATLAS experiment 3 month PORTO University of Porto Researcher at the University Grid Lab 1 year GENEVA CERN Fellow at IT-EGE Staff at IT-GD 4 years Pedro Andrade > IT-GD > D4Science 27 October 2008, CERN
6 What is D4Science…
7 Introduction Pedro Andrade > IT-GD > D4Science 27 October 2008, CERN
8 VREs Virtual Research Environments many virtual organizations Repository, Catalogue, Search service few large institutions 1996 consumer Digital Library few small institutions 2001 consumer 2006 consumer & resource provider Digital Library Management System many small institutions 2003 consumer & data provider Pedro Andrade > IT-GD > D4Science 27 October 2008, CERN
9 VRE VREs are time-defined distributed environments to satisfy concrete collaboration needs of distributed user communities VRE users are organized in dynamic VOs (short & long lived) All infrastructure elements are represented as VRE resources VRE concepts: Support distributed cooperative activities Dynamic environment managed on demand Produce re-usable new knowledge using specialized tools Support heterogeneous content sources Pedro Andrade > IT-GD > D4Science 27 October 2008, CERN
10 VRE - Infrastructure The definition, hosting and operation of VREs is supported by a distributed infrastructure, composed by different types of resources: Hardware: D4Science gCube nodes EGEE gLite nodes (CE & SE) Services & Applications: gCube services External services and applications Collections & Auxiliary Resources: Data, Content, Metadata Schemas, Mappings, Transformation programs Three dimensional vision of sharing Pedro Andrade > IT-GD > D4Science 27 October 2008, CERN
11 VRE - gCube gCube is the software implementing VREs. It is composed by: Runtime environment Container of gCube Services (GT4, Java WS-Core) Framework to develop new gCube Services VRE management services IS, SW Repository, Broker, VRE Manager, etc VRE specialized services Content, Metadata, Search, Index, Annotation, etc Exploiting gLite computing and storage facilities VRE presentation services Based on portal/portlets technology Pedro Andrade > IT-GD > D4Science 27 October 2008, CERN
12 1)VRE Definition Example Pedro Andrade > IT-GD > D4Science 27 October 2008, CERN
13 Example 2)VRE Generation gCube VRE Manager service remotely deploys sufficient number of running instances gCube Metadata Management service applies requested metadata transformations gCube Index Management services creates necessary index (geo, full text, etc) … Pedro Andrade > IT-GD > D4Science 27 October 2008, CERN
14 Example 3)VRE Made Available Pedro Andrade > IT-GD > D4Science 27 October 2008, CERN
15 D4Science Pedro Andrade > IT-GD > D4Science 27 October 2008, CERN
16 D4Science collaboration with CERN EGEE-III Providing resources to EGEE infrastructure Exploiting the gLite middleware Participating in EGEE events Collaboration started in EGEE-I (with DILIGENT) ETICS2 Exploiting ETICS for gCube build and test activities Collaboration started in ETICS1 (with DILIGENT) CERN Library Possible collaboration in future projects D4Science & CERN Pedro Andrade > IT-GD > D4Science 27 October 2008, CERN
17 Status December 09 Project End November 09 2 CommunitiesThird MilestoneNew gCube November 08 2 CommunitiesSecond MilestoneNew gCube June 08 1 Community First Milestone Old gCube January 08 Project Start Pedro Andrade > IT-GD > D4Science 27 October 2008, CERN
18 Status Software Projects org.gcube 24 subsystems org.gcore 2 subsystems Software Components 160 software components 51 services 84 libraries 25 portlets 65 distribution components Infrastructure Sites CNR (Pisa, IT) NKUA (Athens, GR) ESA (Rome, IT) UNIBASEL (Basel, CH) Infrastructure Size 22 gCube nodes 103 Running Instances 44 gLite WNs 18 TB gLite Storage Pedro Andrade > IT-GD > D4Science 27 October 2008, CERN
19 What I do in IT-GD…
20 D4Science Responsibilities SA1 “Infrastructure Operation” Activity coordination Planning of infrastructure sites, resources, deployment Definition of infrastructure operational procedures Support, installation, upgrade, security, certification Supervision of infrastructure support activity Definition and execution of infrastructure monitoring tools Coordination with EGEE D4Science VO, D4Science gLite nodes Certification of gCube nodes Pedro Andrade > IT-GD > D4Science 27 October 2008, CERN
21 D4Science Responsibilities SA3 “Integration, Testing and Distribution” Responsible for “Deployment Testing” task Definition of deployment test strategy Deployment test execution and reporting Coordinate with ETICS Member of Release Managers group Definition of software release procedure Management of software release cycles Pedro Andrade > IT-GD > D4Science 27 October 2008, CERN
22 D4Science Responsibilities Quality Assurance Task Force Definition and implementation of project quality plan EU Reporting Deliverables Quarterly reports, Year reports Project reviews PEB representative Pedro Andrade > IT-GD > D4Science 27 October 2008, CERN
23 Interactions within GD (EGEE) Coordination between projects’ objectives and plans Coordination of D4S contribution to EGEE infrastructure OMC Support tickets related to gLite exploitation by gCube GGUS Usage of SAM to gather information about gLite services status Adoption of GridMap technology for monitoring D4S infrastructure Monitoring Possible experimentation of new gLite services PPS Coordination between D4S sites and EGEE ROCs ROCs Discussion of D4S VO in EGEE infrastructure OAG Pedro Andrade > IT-GD > D4Science 27 October 2008, CERN
24 Interactions within GD (ETICS) Coordination between projects’ objectives and plans Requirements, feedback, bug submission ETICS Pedro Andrade > IT-GD > D4Science 27 October 2008, CERN
25 Conclusion I hope that now you understand a bit better: What is D4Science doing Why you see me walking around The common activities between our work Pedro Andrade > IT-GD > D4Science 27 October 2008, CERN
26 Links D4Science Site: Distribution Site: Pedro Andrade > IT-GD > D4Science 27 October 2008, CERN
27 Thanks ! Questions ?