04/02/2019 The Use of Grid Technology in Large Scale Data Processing Collaboration Environments S.G. Ansari S.G. Ansari 04/02/2019.

Slides:



Advertisements
Similar presentations
Particle physics – the computing challenge CERN Large Hadron Collider –2007 –the worlds most powerful particle accelerator –10 petabytes (10 million billion.
Advertisements

Research Councils ICT Conference Welcome Malcolm Atkinson Director 17 th May 2004.
E-Science Update Steve Gough, ITS 19 Feb e-Science large scale science increasingly carried out through distributed global collaborations enabled.
Systems Engineering in a System of Systems Context
Grid S.G. Ansari 15 June June June 2015 VSWG – Observatoire de Genève Variability detection, period search with GaiaGrid S. Ansari, L. Eyer,
Grid S.G. Ansari 16 June June June 2015 GaiaGrid – A three Year Experience Salim Ansari Toulouse 20 th October, 2005.
What is Grid Computing? Grid Computing is applying the resources of many computers in a network to a single entity at the same time;  Usually to a scientific.
Architectural Design Establishing the overall structure of a software system Objectives To introduce architectural design and to discuss its importance.
Astronomical GRID Applications at ESAC Science Archives and Computer Engineering Unit Science Operations Department ESA/ESAC.
Computer Science Perspective Ludek Matyska Faculty of Informatics, Masaryk University, Brno and also CESNET, Prague.
The Gaia mission Data reduction activities in the UK Floor van Leeuwen, IoA.
DISTRIBUTED COMPUTING
CS 390- Unix Programming Environment CS 390 Unix Programming Environment Topics to be covered: Distributed Computing Fundamentals.
Instrumentation of the SAM-Grid Gabriele Garzoglio CSC 426 Research Proposal.
The roots of innovation Future and Emerging Technologies (FET) Future and Emerging Technologies (FET) The roots of innovation Proactive initiative on:
GridPP Deployment & Operations GridPP has built a Computing Grid of more than 5,000 CPUs, with equipment based at many of the particle physics centres.
Research Networks and Astronomy Richard Schilizzi Joint Institute for VLBI in Europe
SEEK Welcome Malcolm Atkinson Director 12 th May 2004.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
GRID ARCHITECTURE Chintan O.Patel. CS 551 Fall 2002 Workshop 1 Software Architectures 2 What is Grid ? "...a flexible, secure, coordinated resource- sharing.
Distribution and components. 2 What is the problem? Enterprise computing is Large scale & complex: It supports large scale and complex organisations Spanning.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
CEOS Working Group on Information Systems and Services - 1 Data Services Task Team Discussions on GRID and GRIDftp Stuart Doescher, USGS WGISS-15 May 2003.
EC Review – 01/03/2002 – WP9 – Earth Observation Applications – n° 1 WP9 Earth Observation Applications 1st Annual Review Report to the EU ESA, KNMI, IPSL,
2. WP9 – Earth Observation Applications ESA DataGrid Review Frascati, 10 June Welcome and introduction (15m) 2.WP9 – Earth Observation Applications.
E. Solano. GAIA Meeting, Menorca, Oct 2009 GAIA and the Virtual Observatory Enrique Solano, LAEX/CAB (INTA-CSIC) Spanish VO Principal Investigator.
Function BIRN The ability to find a subject who may have participated in multiple experiments and had multiple assessments done is a critical component.
ETICS An Environment for Distributed Software Development in Aerospace Applications SpaceTransfer09 Hannover Messe, April 2009.
IT-DSS Alberto Pace2 ? Detecting particles (experiments) Accelerating particle beams Large-scale computing (Analysis) Discovery We are here The mission.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No Support to scientific.
DataGrid France 12 Feb – WP9 – n° 1 WP9 Earth Observation Applications.
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
EGI-InSPIRE RI An Introduction to European Grid Infrastructure (EGI) March An Introduction to the European Grid Infrastructure.
Ian Bird, CERN WLCG Project Leader Amsterdam, 24 th January 2012.
EGI… …is a Federation of over 300 computing and data centres spread across 56 countries in Europe and worldwide …delivers advanced computing.
Virtual Laboratory Amsterdam L.O. (Bob) Hertzberger Computer Architecture and Parallel Systems Group Department of Computer Science Universiteit van Amsterdam.
A consolidated review of multiple analyses using JMP Clinical
Bob Jones EGEE Technical Director
Use of Cloud Computing for Implementation of e-Governance Services
Semantic Web - caBIG Abstract: 21st century biomedical research is driven by massive amounts of data: automated technologies generate hundreds of.
AENEAS WP6 first conference call
Clouds , Grids and Clusters
Grid site as a tool for data processing and data analysis
SuperB and its computing requirements
European Middleware Initiative (EMI)
INTAROS WP5 Data integration and management
LifeWatch, costing and funding
The INES Archive in the era of Virtual Observatories
Can Statistical monitoring really improve data integrity?
ELIXIR: Potential areas for collaboration with e-Infrastructures
EGEE support for HEP and other applications
Distribution and components
Presented by Sam Supervised by Prof. Michael Lyu
Cognitus: A Science Case for HPC in the Nordic Region
Connecting the European Grid Infrastructure to Research Communities
Systems Analysis and Design 5th Edition Chapter 8. Architecture Design
OCR Level 3 Cambridge Technicals in IT
The Globus Toolkit™: Information Services
Grid Services B.Ramamurthy 12/28/2018 B.Ramamurthy.
Packet Classification with Evolvable Hardware Hash Functions
GEO-XIII Plenary St. Petersburg Russian Federation
Chapter 17: Client/Server Computing
Large Scale Distributed Computing
Future EU Grid Projects
Brian Matthews STFC EOSCpilot Brian Matthews STFC
CHAIN KoM – Rome, 13 December 2010
Chapter 5 Architectural Design.
ACTRIS – EMEP, THE WAY FORWARD
OU BATTLECARD: Oracle WebCenter Training
Presentation transcript:

04/02/2019 The Use of Grid Technology in Large Scale Data Processing Collaboration Environments S.G. Ansari S.G. Ansari 04/02/2019

The Human Eye Resolution: 2 car headlights 3 km away 04/02/2019 The Human Eye Resolution: 2 car headlights 3 km away 1 Eye pixel has an angular resolution of 3’ 3600/3x3 = 400 pixels/deg2 Angle of Eyesight = 45° 6000 deg2  2.5 x 106 pixels At a shutter of 20 Hz and 3x8 bit/picture element Raw data rate of 1 Gb/s 150 Mbytes/sec ! U. Bastian Univ. Heidelberg S.G. Ansari 04/02/2019

04/02/2019 S.G. Ansari 04/02/2019

The ESA Astronomical Data Volume 04/02/2019 The ESA Astronomical Data Volume C. Arviset (ESAC) ESA Science Data involve a large number of Data Centres distributed all across Europe. Access to these data is also widely distributed S.G. Ansari 04/02/2019

The Gaia Challenge An example of a Collaboration Environment 04/02/2019 The Gaia Challenge An example of a Collaboration Environment Managing 1 Petabyte of data Astrometry of 1 billion stars over 5-year time span 100 positional determinations per star to yield micro-arcsecond accuracy two photometric systems and 1 Radial Velocity spectrometer calibration data S.G. Ansari 04/02/2019

Collaborative Tasks Scientific: 04/02/2019 Collaborative Tasks Scientific: Shell Tasks involve the whole Gaia community Shell Tasks may be developed by autonomous groups, independent of a core team Shell Tasks deliver “derived” data Shell Tasks can be collaborative tools Shell Tasks are building blocks for data analysis. They may be combined to address more complex processing tasks Technical: Shell Tasks can be modular Shell Tasks access the Gaia Database to work on a subset of data Shell Task results can be independently validated. Less interaction with the core data. Shell Tasks could be developed in multiple set programming languages S.G. Ansari 04/02/2019

The Gaia Virtual Organisation 04/02/2019 Core Tasks RVS Quick Looks Photometry Astrometry ABS Minor Planets Variable Stars Fundamental Algos The Gaia Virtual Organisation Some 20 institutes collaborate on establishing a relevant set of tasks for the Gaia Data Processing S.G. Ansari 04/02/2019

The Grid The Grid is: It is ideal for the Shell Tasks 04/02/2019 The Grid The Grid is: A resource sharing concept Used to augment computational resources whenever and wherever needed Ideal to build a collaborative environment, where users can share algorithms and analyse data It is ideal for the Shell Tasks S.G. Ansari 04/02/2019

04/02/2019 The Grid Architecture The best current example of a Grid implementation is Google! Applications Middleware Infrastructure S.G. Ansari 04/02/2019

Abandon the geographical distribution 04/02/2019 Where do we go from here? The Grid exercise is relevant to very huge amounts of number crunching Network latency adds unnecessary overheads to the problem CPU is cheap Abandon the geographical distribution HOWEVER S.G. Ansari 04/02/2019

The Virtual Collaboration Aspect 04/02/2019 The Virtual Collaboration Aspect Grid infrastructure is ideal for large collaboration environments: 10 sites or more. Data can be distributed with a single central “master copy”. Virtual Organisations are the answer for future Collaborations S.G. Ansari 04/02/2019

04/02/2019 The Future As the quantity of data increases, so must the quality of its organisation and analysis Our scientific tools must reflect the changing ways with which we do conventional science Our interaction with the data must evolve. Analysis tools must become more human-friendly and intuitive S.G. Ansari 04/02/2019