Héméra: Scientific Challenges using Grid’5000 https://www.grid5000.fr/Hemera Christian Perez INRIA, France.

Slides:



Advertisements
Similar presentations
1 From Grids to Service-Oriented Knowledge Utilities research challenges Thierry Priol.
Advertisements

European Workshop on Grid-based Virtual Organisations & collaborative e-Enterprise applications Toan NGUYEN May 30th, 2003 London (UK) Business models,
SLA-Oriented Resource Provisioning for Cloud Computing
1 Coven a Framework for High Performance Problem Solving Environments Nathan A. DeBardeleben Walter B. Ligon III Sourabh Pandit Dan C. Stanzione Jr. Parallel.
Green Cloud Computing Hadi Salimi Distributed Systems Lab, School of Computer Engineering, Iran University of Science and Technology,
Workshop on HPC in India Grid Middleware for High Performance Computing Sathish Vadhiyar Grid Applications Research Lab (GARL) Supercomputer Education.
Workload Management Workpackage Massimo Sgaravatto INFN Padova.
The CrossGrid project Juha Alatalo Timo Koivusalo.
Are P2P Data-Dissemination Techniques Viable in Today's Data- Intensive Scientific Collaborations? Samer Al-Kiswany – University of British Columbia joint.
Milos Kobliha Alejandro Cimadevilla Luis de Alba Parallel Computing Seminar GROUP 12.
The new The new MONARC Simulation Framework Iosif Legrand  California Institute of Technology.
Workload Management Massimo Sgaravatto INFN Padova.
NGNS Program Managers Richard Carlson Thomas Ndousse ASCAC meeting 11/21/2014 Next Generation Networking for Science Program Update.
Beyond Music Sharing: An Evaluation of Peer-to-Peer Data Dissemination Techniques in Large Scientific Collaborations Thesis defense: Samer Al-Kiswany.
Community Manager A Dynamic Collaboration Solution on Heterogeneous Environment Hyeonsook Kim  2006 CUS. All rights reserved.
Overview of grid / cloud research in France Michel DAYDÉ Scientific Delegate at INS2/CNRS in charge of HPC / Grid / cloud Université de Toulouse - IRIT.
WORKFLOWS IN CLOUD COMPUTING. CLOUD COMPUTING  Delivering applications or services in on-demand environment  Hundreds of thousands of users / applications.
Design and Implementation of a Single System Image Operating System for High Performance Computing on Clusters Christine MORIN PARIS project-team, IRISA/INRIA.
Computer Science Perspective Ludek Matyska Faculty of Informatics, Masaryk University, Brno and also CESNET, Prague.
4.x Performance Technology drivers – Exascale systems will consist of complex configurations with a huge number of potentially heterogeneous components.
Optimized Java computing as an application for Desktop Grid Olejnik Richard 1, Bernard Toursel 1, Marek Tudruj 2, Eryk Laskowski 2 1 Université des Sciences.
A Lightweight Platform for Integration of Resource Limited Devices into Pervasive Grids Stavros Isaiadis and Vladimir Getov University of Westminster
N. GSU Slide 1 Chapter 02 Cloud Computing Systems N. Xiong Georgia State University.
DISTRIBUTED COMPUTING
Ocean Observatories Initiative Common Execution Infrastructure (CEI) Overview Michael Meisinger September 29, 2009.
A Metadata Based Approach For Supporting Subsetting Queries Over Parallel HDF5 Datasets Vignesh Santhanagopalan Graduate Student Department Of CSE.
INFSO-RI Enabling Grids for E-sciencE EGEODE VO « Expanding GEosciences On DEmand » Geocluster©: Generic Seismic Processing Platform.
Wireless Networks Breakout Session Summary September 21, 2012.
A Framework for Elastic Execution of Existing MPI Programs Aarthi Raveendran Tekin Bicer Gagan Agrawal 1.
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
Large-scale Deployment in P2P Experiments Using the JXTA Distributed Framework Gabriel Antoniu, Luc Bougé, Mathieu Jan & Sébastien Monnet PARIS Research.
Large Scale Sky Computing Applications with Nimbus Pierre Riteau Université de Rennes 1, IRISA INRIA Rennes – Bretagne Atlantique Rennes, France
TESTBED FOR FUTURE INTERNET SERVICES TEFIS at the EU-Canada Future Internet Workshop, March Annika Sällström – Botnia Living Lab at Centre for.
The Grid System Design Liu Xiangrui Beijing Institute of Technology.
Heavy and lightweight dynamic network services: challenges and experiments for designing intelligent solutions in evolvable next generation networks Laurent.
Suzhen Lin, A. Sai Sudhir, G. Manimaran Real-time Computing & Networking Laboratory Department of Electrical and Computer Engineering Iowa State University,
Service - Oriented Middleware for Distributed Data Mining on the Grid ,劉妘鑏 Antonio C., Domenico T., and Paolo T. Journal of Parallel and Distributed.
Performance evaluation of component-based software systems Seminar of Component Engineering course Rofideh hadighi 7 Jan 2010.
Issues Autonomic operation (fault tolerance) Minimize interference to applications Hardware support for new operating systems Resource management (global.
Scenarios for a Learning GRID Online Educa Nov 30 – Dec 2, 2005, Berlin, Germany Nicola Capuano, Agathe Merceron, PierLuigi Ritrovato
1 Computing Challenges for the Square Kilometre Array Mathai Joseph & Harrick Vin Tata Research Development & Design Centre Pune, India CHEP Mumbai 16.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
ICCS WSES BOF Discussion. Possible Topics Scientific workflows and Grid infrastructure Utilization of computing resources in scientific workflows; Virtual.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Centre d’Excellence en Technologies de l’Information et de la Communication Evolution dans la gestion d’infrastructure de type Cloud (SDI)
1 Grid Activity Summary » Grid Testbed » CFD Application » Virtualization » Information Grid » Grid CA.
7. Grid Computing Systems and Resource Management
Overview and Comparison of Software Tools for Power Management in Data Centers Msc. Enida Sheme Acad. Neki Frasheri Polytechnic University of Tirana Albania.
An Overview of Scientific Workflows: Domains & Applications Laboratoire Lorrain de Recherche en Informatique et ses Applications Presented by Khaled Gaaloul.
Hemera KickOff October 5th, 2010 Working Group B5 Efficient management of very large volumes of information for data- intensive applications Gabriel Antoniu,
Euro-Par, HASTE: An Adaptive Middleware for Supporting Time-Critical Event Handling in Distributed Environments ICAC 2008 Conference June 2 nd,
PARALLEL AND DISTRIBUTED PROGRAMMING MODELS U. Jhashuva 1 Asst. Prof Dept. of CSE om.
Distributed Geospatial Information Processing (DGIP) Prof. Wenwen Li School of Geographical Sciences and Urban Planning 5644 Coor Hall
Efficient Opportunistic Sensing using Mobile Collaborative Platform MOSDEN.
INTRODUCTION TO GRID & CLOUD COMPUTING U. Jhashuva 1 Asst. Professor Dept. of CSE.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Daniele Lezzi Execution of scientific workflows on federated multi-cloud infrastructures IBERGrid Madrid, 20 September 2013.
ScotGRID is the Scottish prototype Tier 2 Centre for LHCb and ATLAS computing resources. It uses a novel distributed architecture and cutting-edge technology,
XtreemOS IP project is funded by the European Commission under contract IST-FP Scientific coordinator Christine Morin, INRIA Presented by Ana.
Leverage Big Data With Hadoop Analytics Presentation by Ravi Namboori Visit
Grid Institute Scientific Council, September 10, 2008
Workload Management Workpackage
Clouds , Grids and Clusters
Similarities between Grid-enabled Medical and Engineering Applications
GSAF Grid Storage Access Framework
Sky Computing on FutureGrid and Grid’5000
Sky Computing on FutureGrid and Grid’5000
L. Glimcher, R. Jin, G. Agrawal Presented by: Leo Glimcher
Presentation transcript:

Héméra: Scientific Challenges using Grid’ Christian Perez INRIA, France

2 Outline of the talk Overview of Héméra Some Scientific Challenges of Héméra Conclusion

3 Motivations Scientific issues  Large scale, volatile, complex systems Performance, fault tolerance, scalability, data storage, programming models, algorithms, resource management, etc. Methodological challenges Positioning  Mathematics,  Simulation  Emulation  Experimental testbed (Grid’5000)  Production environment

4 Hemera: An INRIA Large Scale Initiative Goals  Demonstrate ambitious up-scaling techniques for large scale distributed computing by carrying out several dimensioning experiments on the Grid’5000 infrastructure  Animate the scientific community around Grid’5000  Enlarge the Grid’5000 community by helping newcomers to make use of Grid’5000. Open to everyone in France and to collaborators Structure  A direction committee Very similar to the GIS Grid’5000  Scientific challenges  Working groups Resources  4 PhDs & 3 Post-docs since 2010  Two engineers Daniel Balouek (01/ /2013) Laurent Pouilloux (09/ /2014)

5 Hemera: Current Participant List 1. ACADIE - Assistance à la Certification d’Applications DIstribuées et Embarquées 2. ALGORILLE - Algorithms for the Grid 3. APO - Algorithmes Parallèles et Optimisation 4. ASAP - As Scalable As Possible: foundations of large scale dynamic distributed systems 5. ASCOLA - Aspect and composition languages 6. ASTRE - Architecture, Systèmes, Temps-Réel, Embarqués 7. AVALON - Algorithms and Software Architectures for Service Oriented Platforms 8. CC-IN2P3 - Equipe de recherche du Centre de Calcul de l'IN2P3 9. CEPAGE - Chercher et Essaimer dans les Plates-formes A Grande Echelle 10. DOLPHIN - Parallel Cooperative Multi-criteria Optimization 11. GRAND-LARGE - Global parallel and distributed computing 12. ICPS - Scientific Parallel Computing and Imaging 13. KERDATA - Cloud and Grid Storage for Very Large Distributed Data 14. OASIS - Active objects, semantics, Internet and security 15. MAESTRO - Models for the performance analysis and the control of networks 16. MESCAL - Middleware efficiently scalable 17. MINC - MIcro et Nanosystèmes pour les Communications sans fils 18. MRS – Modélisation et contrôle des Réseaux et Signaux 19. MYRIADS - Design and Implementation of Autonomous Distributed Systems 20. REGAL - Large-Scale Distributed Systems and Applications 21. ROMA - Resource Optimization: Models, Algorithms, and scheduling 22. RUNTIME - Efficient runtime systems for parallel architectures 23. SAGE - Simulations and Algorithms on Grids for Environment 24. ZENITH - Scientific Data Management

6 Hemera: Current Participant List Avalon, Roma, CC-IN2P3 o o o o o o o o o o Acadie, Apo, Astree Minc, MRS Algorille ICPS Asap, KerData, Myriads, Sage Ascola Cepage, Runtime Dolphin Grand-Large, Regal Mescal Oasis, Maestro o Zenith o

7 Outline of the talk Overview of Héméra Scientific Challenges of Héméra Conclusion

8 List of Working Groups Transparent, Safe and Efficient Large Scale Computing Energy Efficient Large Scale Experimental Distributed Systems  Green Luxembourg – 2013, January ( Bring Grids Power to Internet-Users thanks to Virtualization Technologies Efficient exploitation of highly heterogeneous and hierarchical large- scale systems Efficient management of very large volumes of information for data- intensive applications Completing challenging experiments on Grid’5000 Modeling Large Scale Systems and Validating their Simulators Network metrology and traffic characterization

9 List of Open Challenges Network  Traffic Awareness System  Energy Profiling of Large Scale Applications  Robustness of Large Systems in Presence of High Churn  Orchestrating Experiments on the gLite Production Grid Middleware  Large Scale Virtual Machine Deployment & Management Programming Paradigm  Large Scale Computing for Combinatorial Optimization Problems  Scalable Distributed Processing Using the MapReduce Paradigm Application Domain Specific  Multi-parametric Intensive Stochastic Simulations for Hydrogeology  Thinking GRID for Electromagnetic Simulation of Oversized Structures

10 List of Open Challenges Network  Traffic Awareness System  Energy Profiling of Large Scale Applications  Robustness of Large Systems in Presence of High Churn  Orchestrating Experiments on the gLite Production Grid Middleware  Large Scale Virtual Machine Deployment & Management Programming Paradigm  Large Scale Computing for Combinatorial Optimization Problems  Scalable Distributed Processing Using the MapReduce Paradigm Application Domain Specific  Multi-parametric Intensive Stochastic Simulations for Hydrogeology  Thinking GRID for Electromagnetic Simulation of Oversized Structures

11 Energy Profiling of Large Scale Applications (Energy) Leaders  Laurent Lefèvre (AVALON), Jean-Marc Pierson (IRIT), Jean-Marc Menaud (ASCOLA) Issues  Reduce energy consumption of large-scale infrastructure  Management of physical resources & virtualized resources Objective  Handle energy efficiency aspects of large scale applications deployed on multiple sites Roadmap  Model (complex) energy consumptions of systems and applications Need to profile applications  Develop software to log, store and expose energy usage Make use of the G5K energy sensing infrastructure  Experiments on large scale and heterogeneous infrastructure

12 Orchestrating Experiments on the gLite Production Grid Middleware (Orchestration) Leaders  Lucas Nussbaum (ALGORILLE), Frédéric Suter (AVALON) Issues  Production Grid Middleware Objective  Explore the use of the Grid’5000 testbed as a test environment for production grid software such as gLite and other related services Roadmap  Define a detailed procedure to deploy the gLite middleware on Grid’5000  Define reusable services Control of a large number of nodes, data management, experimental condition emulations, load and fault injection, instrumentation and monitoring, etc.  Develop experiment orchestration middleware  Perform large-scale experiments involving the gLite middleware and applications from production grids

Dynamic management of virtual machines (DynVM) Leader  Adrien Lebre (ASCOLA) Issue  Dynamic allocation of resources on a large amount of virtual machines Objective  Optimization of resource exploitation  Nation-wide management of virtual machines over Grid’5000  Use of a decision module for an efficient scheduling Roadmap  Perform a series of large scale experiments on Grid’

Dynamic management of virtual machines (DynVM) Experiment  Set of a nation-wide virtual network among 5 sites on 11 clusters (KaVLAN)  Simulation of charge by injecting load in the VM  Implementation of a global scenario Results  Management of Virtual Machines on 512 Physical Machines VM can be moved on another site using live migration capabilities  Implementation of portable and re-usable scripts Resource reservation Management of a large virtual network Communication between the nodes 14

15 Scalable Distributed Processing Using the MapReduce Paradigm (Map-Red) Leaders  Gabriel Antoniu (KERDATA), Gilles Fedak (AVALON) Issues  Distributed data-intensive applications (Peta-bytes)  Data storage layer Efficient, fine-grain, high throughput accesses to huge files Heavy concurrent access to the same file (R/W) Data location awareness Volatility Objective  Ultra-scalable MapReduce-based data processing on various platforms: Clouds: BlobSeer Desktop computing: BitDew Roadmap  Advanced data & meta-data management techniques  Scheduling issues Data & computation, heterogeneity, replication, etc.

16 Multi-Parametric Intensive Stochastic Simulations for Hydrogeology (Hydro) Leader  Jocelyne Erhel (SAGE) Issues  Groundwater resource management & remediation  Limited knowledge Highly heterogeneous and fractured geological formations  Numerical models Probabilistic data + uncertainty quantification methods  Stochastic framework (multiple simulations)  Large size geological domain to discretize Objective  Efficient execution of multi-parametric heavy computation simulations Roadmap  Study how to program, deploy & schedule the application  Validate the approach for increasing level of parallelism for 2D problems then 3D problems © Micas Project, INRIA Rennes

Multi-Parametric Intensive Stochastic Simulations for Hydrogeology (Hydro) Task done by D. Balouek (Hemera Engineer)  Automatize the deployment of the application  Execution of multi-parametric heavy computation simulations 200+ experiments on Grid’5000  From a few minutes up to 60 hours  Total experiment time > core-hours  Release of platform-dedicated scripts  Production & analysis of results on relevant metrics More information on  17

18 Conclusion Experimental platforms (and observation instruments) are essential in the computer science methodology - like in other sciences! Many research kinds may need Grid’5000  HPC, Grids (Classical/Desktop), Clouds, Distributed, Green, etc Network, System, Middleware, Applications  A validation tool for applications/middleware before going to production Still a difficult tool to use  Interface: A lot of know-how  Methodology: Not yet recognized benefits Hemera  Target to solve challenges & to structure the French community   Mi-term evaluation 2013, February 11 th Do not forget to put Grid’5000 tag in HAL!