Induction: What is e-Science and Grid computing? – April 26-28, 2004. What is e-Science and Grid computing? Dave Berry, NeSC. EGEE is funded by the European Union.

Presentation transcript:

Induction: What is e-Science and Grid computing? –April 26-28, What is e-Science and Grid computing? Dave Berry, NeSC EGEE is funded by the European Union under contract IST

Induction: What is e-Science and Grid computing? –April 26-28, What is e-Science and Grid computing? EGEE Training Team EGEE is funded by the European Union under contract IST

Acknowledgements
This talk includes slides from previous tutorials and talks delivered by:
- the EDG training team
- Roberto Barbera, INFN
- Ian Foster, Argonne National Laboratory
- Jeffrey Grethe, SDSC
- the National e-Science Centre
Prepared by Dave Berry, NeSC

Goals of this module
- To introduce the concepts of e-Science and Grid computing
- Assuming no previous knowledge

Overview
- Motivation for Grid Computing
- The idea of e-Science
- Global drivers for Grid and e-Science
- Some examples
- The basic ideas of Grid technology

What is Grid computing?
“Coordinated resource sharing and problem solving in dynamic, multi-institutional virtual organizations” (I. Foster)
- Resources are controlled by their owners
- The Grid infrastructure provides access to collaborators
A Virtual Organisation is:
- People from different institutions working to achieve a common goal
- Sharing distributed processing and data resources
Enabling people to work together on challenging projects
- Science, Engineering, Medicine, …
- Public service, commerce too!

The Grid Vision
- The Grid: networked data processing centres, with “middleware” software as the “glue” of resources
- Researchers perform their activities regardless of geographical location, interact with colleagues, and share and access data
- Scientific instruments and experiments provide huge amounts of data

The Bad Old Days
“We speak piously of taking measurements and making small studies that will add another brick to the temple of science. Most such bricks just lie around the brickyard.”
Platt, J.R. (1964) Strong Inference. Science. 146:

The Grid Metaphor

Overview
- Motivation for Grid Computing
- The idea of e-Science
- Global drivers for Grid and e-Science
- Some examples
- The basic ideas of Grid technology

The Emergence of e-Science
Invention and exploitation of advanced computational methods:
- To generate, curate and analyse research data
  - From experiments, observations and simulations
  - Quality management, preservation and reliable evidence
- To develop and explore models and simulations
  - Computation and data at extreme scales
  - Trustworthy, economic, timely and relevant results
- To enable dynamic distributed virtual organisations
  - Facilitating collaboration with information and resource sharing
  - Security, reliability, accountability, manageability and agility

Why use Grids for Science?
- Scale of the problems
- Science is increasingly done through distributed global collaborations enabled by the internet
- Grids provide access to:
  - Very large data collections
  - Terascale computing resources
  - High performance visualisation
  - Connected by high-bandwidth networks
- e-Science is more than Grid & Web Services
  - It is what you do with them that counts

The Emergence of Global Knowledge Communities
Slide from Ian Foster’s SSDBM 2003 keynote

Within and Between Many Disciplines
- High Energy Physics
- Earthquake prediction
- Climatology
- Biosciences, Genetics
- Earth Observation
- Astronomy
- Composite materials research
- Engineering design
- Social sciences

Connecting people: Access Grid
[Figure labels: Microphones, Cameras]

Overview
- Motivation for Grid Computing
- The idea of e-Science
- Global drivers for Grid and e-Science
- Some examples
- The basic ideas of Grid technology

Global Drivers of e-Science
- Collaboration
  - Enabling people to work together on challenging projects
- Digital technology – exponential growth
  - Ubiquity & cost reduction
  - Performance increase
  - “Data deluge”
- Consequential investment
  - EU e-Infrastructure
  - UK e-Science
  - USA cyberinfrastructure
  - Industry

Exponential Growth
- Gilder’s Law (32X in 4 yrs)
- Storage Law (16X in 4 yrs)
- Moore’s Law (5X in 4 yrs)
[Figure: performance per dollar spent against number of years, with curves for optical fibre (bits per second), chip capacity (# transistors) and data storage (bits per sq. inch), each labelled with its doubling time in months]
Source: “Triumph of Light”, George Stix, Scientific American, January 2001
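The growth factors quoted on this slide can be turned back into doubling times with a little arithmetic. The sketch below is only an illustration: it assumes pure exponential growth over a 48-month window, and the pairing of each law with a technology (Gilder with optical fibre, Storage with data storage, Moore with chip capacity) is the conventional reading rather than something the surviving slide text states. The function and dictionary names are made up for this example.

```python
import math

# Growth factors over a 4-year window, as quoted on the slide.
# The technology named in each key is an assumed (conventional) pairing.
GROWTH_OVER_4_YEARS = {
    "Gilder's Law (optical fibre)": 32,
    "Storage Law (data storage)": 16,
    "Moore's Law (chip capacity)": 5,
}


def implied_doubling_time_months(factor: float, window_months: float = 48.0) -> float:
    """Return T such that 2 ** (window_months / T) == factor."""
    return window_months / math.log2(factor)


doubling_times = {
    name: implied_doubling_time_months(factor)
    for name, factor in GROWTH_OVER_4_YEARS.items()
}

for name, months in doubling_times.items():
    print(f"{name}: doubles roughly every {months:.1f} months")
```

Under these assumptions a 16X gain in four years corresponds to doubling every 12 months, and 32X to a little under 10 months, which is why network and storage capacity were expected to outpace processor capacity.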

Example: Astronomy
No. & sizes of data sets as of mid-2002, grouped by wavelength
- 12 waveband coverage of large areas of the sky
- Total about 200 TB data
- Doubling every 12 months
- Largest catalogues near 1B objects
Data and images courtesy Alex Szalay, Johns Hopkins University

How Different 2004 is from 1994
- Enormous quantities of data: Petabytes
  - For an increasing number of communities
  - Gating step is not collection but analysis
- Ubiquitous Internet: >100 million hosts
  - Collaboration & resource sharing the norm
  - Security and Trust are crucial issues
- Ultra-high-speed networks: >10 Gb/s
  - Global optical networks
  - Bottlenecks: last kilometre & firewalls
- Huge quantities of computing: >100 Top/s
  - Moore’s law gives us all supercomputers
  - Organising their effective use is the challenge
- Moore’s law everywhere
  - Instruments, detectors, sensors, scanners, …
  - Organising their effective use is the challenge

Overview
- Motivation for Grid Computing
- The idea of e-Science
- Global drivers for Grid and e-Science
- Some examples
- The basic ideas of Grid technology

Example: Earth Observation
ESA missions:
- About 100 Gbytes of data per day (ERS 1/2)
- 500 Gbytes for the next ENVISAT mission (2002)
Grid contribution to EO:
- Enhance the ability to access high level products
- Allow reprocessing of large historical archives
- Improve complex Earth science applications (data fusion, data mining, modelling …)
Source: L. Fusco, June 2001; Federico Carminati, EU review presentation, 1 March 2002

Example: BioInformatics
Medical image similarity search:
1. Query the medical image database and retrieve a patient image
2. Compute similarity measures over the database images (submit 1 job per image)
3. Retrieve most similar cases
[Figure labels: medical images, exam, image, patient key, ACL, metadata, similar images, low score images]
Bio-informatics:
- Phylogenetics
- Search for primers
- Statistical genetics
- Bio-informatics web portal
- Parasitology
- Data-mining on DNA chips
- Geometrical protein comparison
Medical imaging:
- MR image simulation
- Medical data and metadata management
- Mammographies analysis
- Simulation platform for PET/SPECT
[Legend: applications deployed; applications tested on EDG; applications under preparation]
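The three-step image search on this slide (retrieve a patient image, compute a similarity measure against every database image with one job per image, keep the most similar cases) can be sketched as follows. This is a toy illustration, not the application shown in the talk: the feature vectors, the image identifiers, the `similarity` score and the use of a local thread pool in place of grid job submission are all assumptions made for the example.

```python
from concurrent.futures import ThreadPoolExecutor
import math

# Hypothetical stand-in for the image database: image id -> feature vector.
DATABASE = {
    "img-a": [0.10, 0.80, 0.30],
    "img-b": [0.12, 0.79, 0.33],
    "img-c": [0.90, 0.10, 0.50],
    "img-d": [0.15, 0.70, 0.25],
}


def similarity(query, candidate):
    """Toy similarity score in (0, 1]: 1 / (1 + Euclidean distance)."""
    return 1.0 / (1.0 + math.dist(query, candidate))


def find_similar(query, database, top_k=2):
    # Step 2: one independent "job" per database image.
    # Threads stand in for jobs submitted to the grid.
    with ThreadPoolExecutor() as pool:
        futures = {
            image_id: pool.submit(similarity, query, features)
            for image_id, features in database.items()
        }
        scores = {image_id: f.result() for image_id, f in futures.items()}
    # Step 3: keep the highest-scoring cases, discard the low-score images.
    return sorted(scores, key=scores.get, reverse=True)[:top_k]


# Step 1: retrieve the patient image (here simply a feature vector).
patient_image = [0.10, 0.78, 0.30]
most_similar = find_similar(patient_image, DATABASE)
print(most_similar)
```

Because each comparison is independent of the others, the work splits naturally into one job per image, which is what makes this kind of search a good fit for a pool of distributed compute resources.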

Example: High-Energy Physics
ATLAS, CMS, LHCb
- ~6-8 PetaBytes / year
- ~10^8 events/year
- ~10^3 batch and interactive users

Example: Wearable Devices Easy Plug and Play of Sensors Wireless connection using Positioning information from GPS Mobile medical technologies on a distributed Grid Sensor bus GPS aerial

Example: Medical Development
Example of HealthGRID application
- Preparation and follow-up of medical missions in developing countries
- Support to local medical centres in terms of second diagnosis, patient follow-up and e-learning
- 2 missions (Ibagué & Chuxiong) with the French NPO « Chaîne de l’Espoir » used as test cases
The grid impact:
- Improved telemedicine services
- Federation of patient databases
- Interactive e-learning (high bandwidth network required)
[Figure labels: Ibagué, Hand surgery, Medical centre, Clermont-Ferrand/Paris, Chuxiong, Interactive e-learning, Video-conferences, Patient data, Request for 2nd diagnostic]

Overview
- Motivation for Grid Computing
- The idea of e-Science
- Global drivers for Grid and e-Science
- Some examples
- The basic ideas of Grid technology

Key concept
The ability to negotiate resource-sharing arrangements among a set of participating parties (providers and consumers) and then to use the resulting resource pool for some purpose. (I. Foster)

Grids vs. Distributed Applications
Distributed applications already exist, but they tend to be specialised systems intended for a single purpose or user group
Grids go further and take into account:
- Different kinds of resources
  - Not always the same hardware, data and applications
- Different kinds of interactions
  - User groups or applications want to interact with Grids in different ways
- Dynamic nature
  - Resources and users added/removed/changed frequently

Main Services of a Grid Architecture
Service providers
- Publish the availability of their services via information systems
- Such services may come and go or change dynamically
- E.g. a testbed site that offers x CPUs and y GB of storage
Service brokers
- Register and categorize published services and provide search capabilities
- E.g. 1) Resource Broker selects the best site for a “job”; 2) catalogues of data held at each testbed site
Service requesters
- Single sign-on: log into the grid once
- Use brokering services to find a needed service and employ it
- E.g. CMS physicists submit a simulation job that needs 12 CPUs for 6 hours and 15 GB, which gets scheduled, via the Resource Broker, on the CERN testbed site
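The provider, broker and requester roles on this slide can be illustrated with a small matchmaking sketch. It is a toy model only: the class names, fields, site capacities and the "most free CPUs wins" selection policy are assumptions made for this example, and do not describe how any real Resource Broker chooses a site. The job mirrors the slide's example of 12 CPUs for 6 hours with 15 GB of storage.

```python
from dataclasses import dataclass


@dataclass
class Site:
    """A service provider's published offer (hypothetical fields)."""
    name: str
    free_cpus: int
    free_storage_gb: int


@dataclass
class Job:
    """A requester's job description (hypothetical fields)."""
    owner: str
    cpus: int
    hours: int
    storage_gb: int


class ResourceBroker:
    """Toy broker: registers published sites and matches jobs to them."""

    def __init__(self):
        self.sites = {}

    def publish(self, site: Site) -> None:
        # Providers publish (or re-publish) what they currently offer.
        self.sites[site.name] = site

    def withdraw(self, name: str) -> None:
        # Services may come and go dynamically.
        self.sites.pop(name, None)

    def select_site(self, job: Job):
        # Assumed policy: among sites that satisfy the request,
        # prefer the one with the most free CPUs.
        candidates = [
            s for s in self.sites.values()
            if s.free_cpus >= job.cpus and s.free_storage_gb >= job.storage_gb
        ]
        if not candidates:
            return None
        return max(candidates, key=lambda s: s.free_cpus)


broker = ResourceBroker()
broker.publish(Site("CERN", free_cpus=64, free_storage_gb=500))
broker.publish(Site("SiteB", free_cpus=8, free_storage_gb=1000))   # too few CPUs
broker.publish(Site("SiteC", free_cpus=32, free_storage_gb=10))    # too little storage

job = Job(owner="cms-physicist", cpus=12, hours=6, storage_gb=15)
chosen = broker.select_site(job)
print(chosen.name if chosen else "no matching site")
```

The requester never names a site: it describes what it needs, and the broker resolves that against whatever the providers have published at that moment.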

Complex Infrastructure
- Users want access to compute power and data
  - With security, reliability, trust, …
- This requires a complex infrastructure
  - Registries
  - Brokers
  - Administration
  - Policy
  - Negotiation
  - Etc.
- Users shouldn’t need to know the details
  - Portals
  - Problem-solving environments

Mammography: Computation
- Mammograms have different appearances, depending on image settings and acquisition systems
- Compute power can address several issues
[Figure labels: Standard Mammo Format, Temporal mammography, Computer Aided Detection, 3D View]

Mammography: Data
The logical view of this information is as a single resource
[Figure: Grid linking data and images; a table of patient records (Patient, Age, …, Image) referencing image files 1.dcm to 4.dcm; labels: Data, DICOM, Compute, Standard Mammo Format, Data Mining, CADe, CADi]

Mammography: Non-Functional
[Figure: Grid serving epidemiology, teaching, training, diagnosis and screening; non-functional concerns: ethics, legal, security, performance, manageability, scalability, auditability, …; annotations: anonymisation, 256MB & 5 secs response, lossless compression, encryption, ~100 centres, systems administration, non-repudiation]

Grid security
- Resource providers are essentially “opening themselves up” to itinerant users
- Secure access to resources is required
- X.509 Public Key Infrastructure
  - User’s identity has to be certified by (mutually recognized) national Certification Authorities (CAs)
  - Resources (node machines) have to be certified by CAs
- Temporary delegation from users to processes to be executed “in user’s name” (proxy and myproxy certificates)
- Commonly agreed policies for accessing resources and handling users’ rights across different domains within Virtual Organizations
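The idea of temporary delegation can be illustrated with a toy model of a credential chain. This sketch performs no cryptography at all: there are no keys or signatures, and it does not use any real X.509 or proxy-certificate library. It only shows the two checks the slide implies, namely that a proxy is short-lived and that its chain must lead back to a CA the resource trusts. The CA name, subject strings, timestamps and function names are all hypothetical.

```python
from dataclasses import dataclass
from typing import Optional

TRUSTED_CAS = {"Example National CA"}  # hypothetical CA name


@dataclass(frozen=True)
class Credential:
    subject: str
    issuer: str
    not_after: int                          # expiry as an abstract timestamp
    parent: Optional["Credential"] = None   # the credential that issued this one


def issue_proxy(user_cred: Credential, lifetime: int, now: int) -> Credential:
    """Delegate: create a short-lived proxy issued by the user's own credential."""
    expiry = min(now + lifetime, user_cred.not_after)  # a proxy cannot outlive its issuer
    return Credential(
        subject=user_cred.subject + "/proxy",
        issuer=user_cred.subject,
        not_after=expiry,
        parent=user_cred,
    )


def is_acceptable(cred: Credential, now: int) -> bool:
    """Walk the chain: every link unexpired, and the root issued by a trusted CA."""
    while True:
        if now > cred.not_after:
            return False
        if cred.parent is None:
            return cred.issuer in TRUSTED_CAS
        if cred.issuer != cred.parent.subject:
            return False
        cred = cred.parent


# A user certified by a (hypothetical) national CA delegates to a process.
user = Credential(subject="/O=Example/CN=Alice", issuer="Example National CA", not_after=1_000)
proxy = issue_proxy(user, lifetime=12, now=100)
print(is_acceptable(proxy, now=105), is_acceptable(proxy, now=200))
```

The short lifetime is the point of the design: a process acting “in user’s name” carries only the proxy, so a leaked proxy stops being useful once it expires, while the user's long-lived credential never leaves the user.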

Summary
[Figure label: Internet]

Questions?