Supporting Simulations on the Cloud using Workflows & Virtual Machines Gary Polhill Macaulay Land Use Research Institute Edoardo Pignotti Computing Science,

Slides:



Advertisements
Similar presentations
1 Applications Virtualization in VPC Nadya Williams UCSD.
Advertisements

Virtualization and Cloud Computing. Definition Virtualization is the ability to run multiple operating systems on a single physical system and share the.
Towards Autonomic Adaptive Scaling of General Purpose Virtual Worlds Deploying a large-scale OpenSim grid using OpenStack cloud infrastructure and Chef.
Jharrod LaFon (HPC-3) Jim Williams (HPC-3) 2011 Computer System, Cluster, and Networking Summer Institute Russell Husted (MTU) Derek Walker (NCA&TSU) Povi.
IARP/EURON Workshop on Robotics for Risky Interventions and Environmental Surveillance Mobile robot simulators and their application to hazardous and.
Nikolay Tomitov Technical Trainer SoftAcad.bg.  What are Amazon Web services (AWS) ?  What’s cool when developing with AWS ?  Architecture of AWS 
Aleksi Kallio CSC – IT Center for Science Chipster and collaboration with other bioinformatics platforms.
INTRODUCTION TO CLOUD COMPUTING CS 595 LECTURE 4.
Virtualization for Cloud Computing
Marlene Galea.  The JDK (Java Development Kit)  An IDE (Integrated Development Environment) ◦ Different IDEs include:  JCreator  NetBeans  BlueJ.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of Atmosphere.
Reproducible Environment for Scientific Applications (Lab session) Tak-Lon (Stephen) Wu.
To run the program: To run the program: You need the OS: You need the OS:
A Semantic Workflow Mechanism to Realise Experimental Goals and Constraints Edoardo Pignotti, Peter Edwards, Alun Preece, Nick Gotts and Gary Polhill School.
Cloud Computing Why is it called the cloud?.
Utility Computing Casey Rathbone 1http://cyberaide.org.edu.
Sikuli Ivailo Dinkov QA Engineer PhoneX Team Telerik QA Academy.
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
Installation and Integration of Virtual Clusters onto Pragma Grid NAIST Nara, Japan Kevin Lam 06/28/13.
1 port BOSS on Wenjing Wu (IHEP-CC)
ICT Day Term 4,  Virtualisation is growing in usage.  Current CPU’s are designed to support Virtualisation.  Businesses are looking at virtualisation.
Connecting OurGrid & GridSAM A Short Overview. Content Goals OurGrid: architecture overview OurGrid: short overview GridSAM: short overview GridSAM: example.
PrimoGENI Tutorial Miguel Erazo, Neil Goldman, Nathanael Van Vorst, and Jason Liu Florida International University Other project participants: Julio Ibarra.
Promoting Open Source Software Through Cloud Deployment: Library à la Carte, Heroku, and OSU Michael B. Klein Digital Applications Librarian
Tools and Utilities for parallel and serial codes in ENEA-GRID environment CRESCO Project: Salvatore Raia SubProject I.2 C.R. ENEA-Portici. 11/12/2007.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of Atmosphere.
DiProNN Resource Management System (DiProNN = Distributed Programmable Network Node) Tomáš Rebok Faculty of Informatics MU, Brno Czech.
Presented by: Sanketh Beerabbi University of Central Florida COP Cloud Computing.
Large Scale Sky Computing Applications with Nimbus Pierre Riteau Université de Rennes 1, IRISA INRIA Rennes – Bretagne Atlantique Rennes, France
Contents 1.Introduction, architecture 2.Live demonstration 3.Extensibility.
The New Zealand Institute for Plant & Food Research Limited Use of Cloud computing in impact assessment of climate change Kwang Soo Kim and Doug MacKenzie.
Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Usage of virtualization in gLite certification Andreas Unterkircher.
VIRTUAL MACHINES AND OPEN SOURCE SOFTWARE Installing multiple Operating Systems.
Nature Reviews/2012. Next-Generation Sequencing (NGS): Data Generation NGS will generate more broadly applicable data for various novel functional assays.
NA61/NA49 virtualisation: status and plans Dag Toppe Larsen CERN
Tools and Deployment University of Illinois at Urbana-Champaign.
The Eucalyptus Open-source Cloud Computing System Daniel Nurmi Rich Wolski, Chris Grzegorczyk, Graziano Obertelli, Sunil Soman, Lamia Youseff, Dmitrii.
Data Science Background and Course Software setup Week 1.
A scalable and flexible platform to run various types of resource intensive applications on clouds ISWG June 2015 Budapest, Hungary Tamas Kiss,
1 Grid Activity Summary » Grid Testbed » CFD Application » Virtualization » Information Grid » Grid CA.
1 Adventures in Web Services for Large Geophysical Datasets Joe Sirott PMEL/NOAA.
| nectar.org.au NECTAR TRAINING Module 4 From PC To Cloud or HPC.
Application Programming in Cloud via Swift Swift Tutorial, CCGrid 2013, Hour 2 Ketan Maheshwari.
Selenium server By, Kartikeya Rastogi Mayur Sapre Mosheca. R
Java State Explorer by: Richard Sherman Stephanie Taylor.
PARALLEL AND DISTRIBUTED PROGRAMMING MODELS U. Jhashuva 1 Asst. Prof Dept. of CSE om.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of Atmosphere.
Unit 2 VIRTUALISATION. Unit 2 - Syllabus Basics of Virtualization Types of Virtualization Implementation Levels of Virtualization Virtualization Structures.
Information Initiative Center, Hokkaido University North 11, West 5, Sapporo , Japan Tel, Fax: General.
WP5 – Infrastructure Operations Test and Production Infrastructures StratusLab kick-off meeting June 2010, Orsay, France GRNET.
SCI-BUS is supported by the FP7 Capacities Programme under contract nr RI CloudBroker usage Zoltán Farkas MTA SZTAKI LPDS
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of Atmosphere.
Accessing the VI-SEEM infrastructure
Virtualization for Cloud Computing
Cloud Technology and the NGS Steve Thorn Edinburgh University (Matteo Turilli, Oxford University)‏ Presented by David Fergusson.
CSC227: Operating Systems
LIGHTWEIGHT CLOUD COMPUTING FOR FAULT-TOLERANT DATA STORAGE MANAGEMENT
Use of HLT farm and Clouds in ALICE
MIRACLE Cloud-based reproducible data analysis and visualization for outputs of agent-based models Xiongbing Jin, Kirsten Robinson, Allen Lee, Gary Polhill,
Virtualisation for NA49/NA61
Dag Toppe Larsen UiB/CERN CERN,
Brad Sutton Assoc Prof, Bioengineering Department
LIGHTWEIGHT CLOUD COMPUTING FOR FAULT-TOLERANT DATA STORAGE MANAGEMENT
Dag Toppe Larsen UiB/CERN CERN,
StratusLab Tutorial (Bordeaux, France)
Virtualisation for NA49/NA61
Tools and Services Workshop Overview of Atmosphere
Abstract Machine Layer Research in VGrADS
Versatile HPC: Comet Virtual Clusters for the Long Tail of Science SC17 Denver Colorado Comet Virtualization Team: Trevor Cooper, Dmitry Mishin, Christopher.
The Globus Toolkit™: Information Services
Presentation transcript:

Supporting Simulations on the Cloud using Workflows & Virtual Machines Gary Polhill Macaulay Land Use Research Institute Edoardo Pignotti Computing Science, University of Aberdeen

Overview  The SwarmCloud project  Social simulation & the Cloud  A virtualised simulation environment  Facilitating access with workflows  Workflows on the Grid/Cloud  Conclusions

The SwarmCloud Project  Goal: to enable the FEARLUS simulation model to be run on National Grid Service infrastructure  What’s so hard about that?  FEARLUS is written in Obj-C for the Swarm libraries— notoriously difficult to install  Numbers of pre-requisite libraries  Can be sensitive to particular versions of libraries and Unix commands  Typical Grid use-cases involve running a script  e.g. Model calibration  Other software environments for social simulation exhibit the same issues (barrier to adoption)

How can social simulation benefit from the Cloud?  Enabling large-scale distributed simulations  e.g. Fraser et al. (2009 Science), Birkin et al. (2009 SSCoRe), Tang & Bennett (2009 US-IALE)  More challenging because of managing agent interactions across node boundaries  Enabling multiple runs of smaller-scale models for:  Exploring parameter space  Experimenting with algorithmic variants  Trying different scenarios  Increasing sensitivity (more runs with different seeds)  ‘Embarrassingly Parallel’

Software dependencies  Swarm software dependencies shown  ‘worst-case’  Java not immune  Pre-requisite jars can be required (with version sensitivity)  JRE version sensitivity  Obvious problems for programs/libraries using JNI

The solution: A virtualised simulation environment: simulationBox  We used a lightweight version of Centos 5.2 to create a virtualised simulation environment containing Swarm, MASON and RePast  Compressed image ~1GB in size   The simulation environment was created using Sun’s VirtualBox virtualisation software  Binaries available under a Personal Use and Evaluation Licence 

Virtualisation on the Cloud  The virtualisation approach relies on the Cloud host making virtualisation facilities available  The National Grid Service is testing a virtualisation service based on Xen/EUCALYPTUS at Oxford  Xen: Open Source standard for virtualisation   EUCALYPTUS (Elastic Utility Computing Architecture Linking Your Programs To Useful Systems): Open-source system for implementing clouds  Provides an authentication layer, and manages virtual machines using ec2 tools 

Facilitating access: Workflows  Virtualisation provides a replicable environment for compiling and running a social simulation  ‘User interface’ not very friendly  Access via ssh  Kepler workflow tool used to design simulation experiments

Upper Deeside Case-study

Workflows on the CLOUD  We have developed a number of Kepler workflow activities based on the ec2 tools:  Authenticate a user on the Cloud  Upload simulationBox to a node manager  Create instances of simulationBox on the Cloud  Query for available simulationBox instances on the Cloud  Compile a new simulation model on a simulationBox instance  Run the model multiple time across different VM instances  Collect and analyse results, saving them on the local machine  Our Kepler libraries can be downloaded from

Compile a simulation model

Run an experiment

Conclusions  For Cloud infrastructure supplying virtualisation services it is possible to run social simulations with specific computing requirements  Access is facilitated through workflows that can be shared amongst people  Website:  ESSA 2009 Tutorial on simulationBox  Large Scale Social Simulation Special Interest Group (ESSA)  SwarmFest 2010