DataGrid Kimmo Soikkeli Ilkka Sormunen. What is DataGrid? DataGrid is a project that aims to enable access to geographically distributed computing power.

Slides:



Advertisements
Similar presentations
The Quantum Chromodynamics Grid James Perry, Andrew Jackson, Matthew Egbert, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
Advertisements

Stephen Burke - WP8 Status - 9/5/2002 Partner Logo WP8 Status Stephen Burke, PPARC/RAL.
Stephen Burke - WP8 Status - 14/2/2002 Partner Logo WP8 Status Stephen Burke, PPARC/RAL.
Data Management Expert Panel - WP2. WP2 Overview.
C. Grimme, A. Papaspyrou Scheduling in C3-Grid AstroGrid-D Workshop Project: C3-Grid Collaborative Climate Community Data and Processing Grid Scheduling.
High Performance Computing Course Notes Grid Computing.
Job Submission The European DataGrid Project Team
WP 1 Grid Workload Management Massimo Sgaravatto INFN Padova.
INFSO-RI Enabling Grids for E-sciencE EGEE Middleware The Resource Broker EGEE project members.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
GGF Toronto Spitfire A Relational DB Service for the Grid Peter Z. Kunszt European DataGrid Data Management CERN Database Group.
Milos Kobliha Alejandro Cimadevilla Luis de Alba Parallel Computing Seminar GROUP 12.
Workload Management Massimo Sgaravatto INFN Padova.
Grid and High Energy Physics Paula Eerola Lunarc, Artist’s view on Grid, by Ursula Wilby, Sydsvenskan
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
EU 2nd Year Review – Jan – WP9 WP9 Earth Observation Applications Demonstration Pedro Goncalves :
QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.
Riccardo Bruno INFN.CT Sevilla, Sep 2007 The GENIUS Grid portal.
GRACE Project IST EGAAP meeting – Den Haag, 25/11/2004 Giuseppe Sisto – Telecom Italia Lab.
5 November 2001F Harris GridPP Edinburgh 1 WP8 status for validating Testbed1 and middleware F Harris(LHCb/Oxford)
Workload Management WP Status and next steps Massimo Sgaravatto INFN Padova.
Grid Computing - AAU 14/ Grid Computing Josva Kleist Danish Center for Grid Computing
1 School of Computer, National University of Defense Technology A Profile on the Grid Data Engine (GridDaEn) Xiao Nong
F.Fanzago – INFN Padova ; S.Lacaprara – LNL; D.Spiga – Universita’ Perugia M.Corvo - CERN; N.DeFilippis - Universita' Bari; A.Fanfani – Universita’ Bologna;
Computational grids and grids projects DSS,
SLICE Simulation for LHCb and Integrated Control Environment Gennady Kuznetsov & Glenn Patrick (RAL) Cosener’s House Workshop 23 rd May 2002.
QCDGrid Progress James Perry, Andrew Jackson, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
Enabling Grids for E-sciencE ENEA and the EGEE project gLite and interoperability Andrea Santoro, Carlo Sciò Enea Frascati, 22 November.
A Grid Computing Use case Datagrid Jean-Marc Pierson.
Grid Technologies  Slide text. What is Grid?  The World Wide Web provides seamless access to information that is stored in many millions of different.
INFSO-RI Enabling Grids for E-sciencE Project Gridification: the UNOSAT experience Patricia Méndez Lorenzo CERN (IT-PSS/ED) CERN,
DataGrid is a project funded by the European Union VisualJob Demonstation EDG 1.4.x 2003 The EU DataGrid How the use of distributed resources can help.
DataGrid WP1 Massimo Sgaravatto INFN Padova. WP1 (Grid Workload Management) Objective of the first DataGrid workpackage is (according to the project "Technical.
Nadia LAJILI User Interface User Interface 4 Février 2002.
PNPI HEPD seminar 4 th November Andrey Shevel Distributed computing in High Energy Physics with Grid Technologies (Grid tools at PHENIX)
Grid Workload Management Massimo Sgaravatto INFN Padova.
- Distributed Analysis (07may02 - USA Grid SW BNL) Distributed Processing Craig E. Tull HCG/NERSC/LBNL (US) ATLAS Grid Software.
The European DataGrid Project Team The EU DataGrid.
7April 2000F Harris LHCb Software Workshop 1 LHCb planning on EU GRID activities (for discussion) F Harris.
November SC06 Tampa F.Fanzago CRAB a user-friendly tool for CMS distributed analysis Federica Fanzago INFN-PADOVA for CRAB team.
Grid checkpointing in the European DataGrid Project Alessio Gianelle – INFN Padova Rosario Peluso – INFN Padova Francesco Prelz – INFN Milano Massimo Sgaravatto.
The project of application for network computing in seismology --The prototype of SeisGrid Chen HuiZhong, Ze Ren Zhi Ma, Hu Bin Institute.
Introduction to Grid Computing Ed Seidel Max Planck Institute for Gravitational Physics
Replica Management Services in the European DataGrid Project Work Package 2 European DataGrid.
CLRC and the European DataGrid Middleware Information and Monitoring Services The current information service is built on the hierarchical database OpenLDAP.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
1 Media Grid Initiative By A/Prof. Bu-Sung Lee, Francis Nanyang Technological University.
GRIDS Center Middleware Overview Sandra Redman Information Technology and Systems Center and Information Technology Research Center National Space Science.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
T3 analysis Facility V. Bucard, F.Furano, A.Maier, R.Santana, R. Santinelli T3 Analysis Facility The LHCb Computing Model divides collaboration affiliated.
David Adams ATLAS DIAL/ADA JDL and catalogs David Adams BNL December 4, 2003 ATLAS software workshop Production session CERN.
Grid User Interface for ATLAS & LHCb A more recent UK mini production used input data stored on RAL’s tape server, the requirements in JDL and the IC Resource.
DGC Paris WP2 Summary of Discussions and Plans Peter Z. Kunszt And the WP2 team.
High-Performance Computing Lab Overview: Job Submission in EDG & Globus November 2002 Wei Xing.
Open Grid Services for Earth Observation Pedro Gonçalves.
INFN - Ferrara BaBar Meeting SPGrid: status in Ferrara Enrica Antonioli - Paolo Veronesi Ferrara, 12/02/2003.
Grid Workload Management (WP 1) Massimo Sgaravatto INFN Padova.
The DataGrid Project NIKHEF, Wetenschappelijke Jaarvergadering, 19 December 2002
+ Support multiple virtual environment for Grid computing Dr. Lizhe Wang.
Stephen Burke – Sysman meeting - 22/4/2002 Partner Logo The Testbed – A User View Stephen Burke, PPARC/RAL.
Developing GRID Applications GRACE Project
INFN/IGI contributions Federated Clouds Task Force F2F meeting November 24, 2011, Amsterdam.
Antonio Fuentes RedIRIS Barcelona, 15 Abril 2008 The GENIUS Grid portal.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
DataGrid France 12 Feb – WP9 – n° 1 WP9 Earth Observation Applications.
Grid Computing: Running your Jobs around the World
The EDG Testbed Deployment Details
GGF OGSA-WG, Data Use Cases Peter Kunszt Middleware Activity, Data Management Cluster EGEE is a project funded by the European.
Introduction to Grid Technology
Presentation transcript:

DataGrid Kimmo Soikkeli Ilkka Sormunen

What is DataGrid? DataGrid is a project that aims to enable access to geographically distributed computing power and storage facilities belonging to different institutions. DataGrid is a project that aims to enable access to geographically distributed computing power and storage facilities belonging to different institutions. This will provide resources to process huge amounts of data coming from scientific experiments. This will provide resources to process huge amounts of data coming from scientific experiments. Project started (led by CERN) Project started (led by CERN) Project is funded by the European Union. Project is funded by the European Union.

Problems, which DataGrid tries to solve Different institutions may use different computing and storage systems and will also have local security rules. Different institutions may use different computing and storage systems and will also have local security rules. Researchers need to access all of the resources in a uniform, transparent and easy way. Researchers need to access all of the resources in a uniform, transparent and easy way. To use resources effectively, the user needs effective and dependable information systems that allow automatic resource discovery and allocation. To use resources effectively, the user needs effective and dependable information systems that allow automatic resource discovery and allocation.

DataGrid Applications High Energy Physics (HEP), led by CERN (Switzerland) High Energy Physics (HEP), led by CERN (Switzerland) Biology and Medical Image processing, led by CNRS (France) Biology and Medical Image processing, led by CNRS (France) Earth Observations (EO), led by the ESA/ESRIN (Italy) Earth Observations (EO), led by the ESA/ESRIN (Italy)

Virtual Organization Institutions and individuals belonging to the same community and working at the same scientific problems would greatly benefit from putting together their resources. Institutions and individuals belonging to the same community and working at the same scientific problems would greatly benefit from putting together their resources. Virtual Organization: concept which has been formulated to describe all those distributed communities willing to share their resources in order to achieve common goals. Virtual Organization: concept which has been formulated to describe all those distributed communities willing to share their resources in order to achieve common goals.

Work Packages 12 Work Packages 12 Work Packages - WP 1: Work Scheduling - WP 2: Data Management Working Groups - Testbed - Application - Middleware - Infrastructure 4 Working Groups - Testbed - Application - Middleware - Infrastructure

Middleware The Grid software is often called middleware because it is mid-level software that provides services to users and to the applications. The Grid software is often called middleware because it is mid-level software that provides services to users and to the applications. The DataGrid project is developing a new Grid middleware based on the Globus toolkit. The DataGrid project is developing a new Grid middleware based on the Globus toolkit.

The DataGrid Testbed Testbed: made up of one or more sites. Each site contains a certain number of machines, each one playing a different role. Testbed: made up of one or more sites. Each site contains a certain number of machines, each one playing a different role. First DataGrid TestBed released in mid- November First DataGrid TestBed released in mid- November New software modules have been developed and they have been used to set up a large European testbed that is now fully operational. New software modules have been developed and they have been used to set up a large European testbed that is now fully operational.

The Resource Broker: module that receives users' requests and queries the Information Index to find suitable resources. The Resource Broker: module that receives users' requests and queries the Information Index to find suitable resources. The Information Index: keeps information about the available resources. The Information Index: keeps information about the available resources. The Replica Manager: coordinates file replication across the testbed from one Storage Element to another. The Replica Manager: coordinates file replication across the testbed from one Storage Element to another. The Replica Catalog: keeps information about file replicas. The Replica Catalog: keeps information about file replicas.

The Computing Element: module which receives job requests and delivers them to the Worker Nodes, which will perform the real work. The Computing Element: module which receives job requests and delivers them to the Worker Nodes, which will perform the real work. The Worker Node: module installed on the machines which will process input data. The Worker Node: module installed on the machines which will process input data. The Storage Element: module installed on the machines which will provide storage space to the testbed. The Storage Element: module installed on the machines which will provide storage space to the testbed. The User Interface: module that allows users to access all the DataGrid service. The User Interface: module that allows users to access all the DataGrid service.

Submitting jobs The user specifies their requirements in a file using the Job Description Language (JDL). Example "myjob.jdl" The user specifies their requirements in a file using the Job Description Language (JDL). Example "myjob.jdl" The User creates a proxy process issuing the command: The User creates a proxy process issuing the command:"grid-proxy-init" The User submits their job issuing the command: The User submits their job issuing the command: "dg-job-submit myjob.jdl" The Resource Broker reads the user's requirements, finds suitable resources and finds the input data files. The Resource Broker reads the user's requirements, finds suitable resources and finds the input data files.

Submitting jobs The Resouces Broker submits the job to the selected Computing Element. Each submitted job is assigned a unique identifier. The Resouces Broker submits the job to the selected Computing Element. Each submitted job is assigned a unique identifier. The user can query the status of her job issuing the command "dg-job-status JobId" The user can query the status of her job issuing the command "dg-job-status JobId" When the status of the job is "Output Ready" the user can retrieve the output issuing the command: "dg-job-get-output dJobId" When the status of the job is "Output Ready" the user can retrieve the output issuing the command: "dg-job-get-output dJobId"

Web Interfaces, simulations tools and demonstrators Map Center Map Center - web based monitoring tool Genius Genius - web-based GUI for job Submission OptorSim OptorSim DataGrid Demonstrator DataGrid Demonstrator