© UC Regents 2010 Extending Rocks Clusters into Amazon EC2 Using Condor Philip Papadopoulos, Ph.D University of California, San Diego San Diego Supercomputer.

Slides:



Advertisements
Similar presentations
1 Applications Virtualization in VPC Nadya Williams UCSD.
Advertisements

Cloud Computing Mick Watson Director of ARK-Genomics The Roslin Institute.
University of Notre Dame
EHarmony in Cloud Subtitle Brian Ko. eHarmony Online subscription-based matchmaking service Available in United States, Canada, Australia and United Kingdom.
GLOBAL VIRTUAL CLUSTER DEPLOYMENT THROUGH A CONTENT DELIVERY NETWORK Pongsakorn U-chupala, Kohei Ichikawa (NAIST) Luca Clementi, Philip Papadopoulos (UCSD)
Cloud Computing Imranul Hoque. Today’s Cloud Computing.
An Approach to Secure Cloud Computing Architectures By Y. Serge Joseph FAU security Group February 24th, 2011.
Building on the BIRN Workshop BIRN Systems Architecture Overview Philip Papadopoulos – BIRN CC, Systems Architect.
S. Gadomski, "ATLAS computing in Geneva", journee de reflexion, 14 Sept ATLAS computing in Geneva Szymon Gadomski description of the hardware the.
Network+ Guide to Networks, Fourth Edition Chapter 10 Netware-Based Networking.
What is Cloud Computing?. Why call it “Cloud” Computing?
Cloud computing Tahani aljehani.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of Atmosphere.
Amazon EC2 Quick Start adapted from EC2_GetStarted.html.
Virtualization A way To Begin with Virtual Reality… - Rahul Khanwani.
Red Hat Installation. Installing Red Hat Linux is the process of copying operating system files from a CD, DVD, or USB flash drive to hard disk(s) on.
Virtual Clusters Supporting MapReduce in the Cloud Jonathan Klinginsmith School of Informatics and Computing.
Cloud and Virtualization Panel Philip Papadopoulos UC San Diego.
© UC Regents 2010 More Nuts and Bolts in Virtualization.
Rocks Clusters SUN HPC Consortium November 2004 Federico D. Sacerdoti Advanced CyberInfrastructure Group San Diego Supercomputer Center.
1 The Virtual Reality Virtualization both inside and outside of the cloud Mike Furgal Director – Managed Database Services BravePoint.
Rocks cluster : a cluster oriented linux distribution or how to install a computer cluster in a day.
INTRODUCTION TO CLOUD COMPUTING CS 595 LECTURE 7 2/23/2015.
Personal Cloud Controller (PCC) Yuan Luo 1, Shava Smallen 2, Beth Plale 1, Philip Papadopoulos 2 1 School of Informatics and Computing, Indiana University.
Cloud Computing. What is Cloud Computing? Cloud computing is a model for enabling convenient, on-demand network access to a shared pool of configurable.
Virtualization. Virtualization  In computing, virtualization is a broad term that refers to the abstraction of computer resources  It is "a technique.
High Performance Computing Cluster OSCAR Team Member Jin Wei, Pengfei Xuan CPSC 424/624 Project ( 2011 Spring ) Instructor Dr. Grossman.
ISG We build general capability Introduction to Olympus Shawn T. Brown, PhD ISG MISSION 2.0 Lead Director of Public Health Applications Pittsburgh Supercomputing.
So, Jung-ki Distributed Computing System LAB School of Computer Science and Engineering Seoul National University Implementation of Package Management.
Rocks ‘n’ Rolls An Introduction to Programming Clusters using Rocks © 2008 UC Regents Anoop Rajendra.
Ceph Storage in OpenStack Part 2 openstack-ch,
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of Atmosphere.
Introduction to Cloud Computing
การติดตั้งและทดสอบการทำคลัสเต อร์เสมือนบน Xen, ROCKS, และไท ยกริด Roll Implementation of Virtualization Clusters based on Xen, ROCKS, and ThaiGrid Roll.
Virtualization Paul Krzyzanowski Distributed Systems Except as otherwise noted, the content of this presentation is licensed.
Cloud Computing & Amazon Web Services – EC2 Arpita Patel Software Engineer.
Presented by: Sanketh Beerabbi University of Central Florida COP Cloud Computing.
1 Evolution of OSG to support virtualization and multi-core applications (Perspective of a Condor Guy) Dan Bradley University of Wisconsin Workshop on.
Grids, Clouds and the Community. Cloud Technology and the NGS Steve Thorn Edinburgh University Matteo Turilli, Oxford University Presented by David Fergusson.
Large Scale Sky Computing Applications with Nimbus Pierre Riteau Université de Rennes 1, IRISA INRIA Rennes – Bretagne Atlantique Rennes, France
Eucalyptus: An Open-source Infrastructure for Cloud Computing Rich Wolski Eucalyptus Systems Inc.
SoCal Infrastructure OptIPuter Southern California Network Infrastructure Philip Papadopoulos OptIPuter Co-PI University of California, San Diego Program.
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
Marc Fiuczynski Princeton University Marco Yuen University of Victoria PlanetLab & Clusters.
EVGM081 Multi-Site Virtual Cluster: A User-Oriented, Distributed Deployment and Management Mechanism for Grid Computing Environments Takahiro Hirofuchi,
Cluster Software Overview
1 Grid Activity Summary » Grid Testbed » CFD Application » Virtualization » Information Grid » Grid CA.
Ian Gable HEPiX Spring 2009, Umeå 1 VM CPU Benchmarking the HEPiX Way Manfred Alef, Ian Gable FZK Karlsruhe University of Victoria May 28, 2009.
Tool Integration with Data and Computation Grid “Grid Wizard 2”
Arun Madhavan Graduate Assistant, iPlant Collaborative Experiences with Eucalyptus.
The Hungarian ClusterGRID Project Péter Stefán research associate NIIF/HUNGARNET
Purdue RP Highlights TeraGrid Round Table May 20, 2010 Preston Smith Manager - HPC Grid Systems Rosen Center for Advanced Computing Purdue University.
Cloud computing: IaaS. IaaS is the simplest cloud offerings. IaaS is the simplest cloud offerings. It is an evolution of virtual private server offerings.
PRAGMA 25 Working Group Updates Resources Working Group Yoshio Tanaka (AIST) Phil Papadopoulos (UCSD) Most slides by courtesy of Peter, Nadya, Luca, and.
Course 03 Basic Concepts assist. eng. Jánó Rajmond, PhD
KAASHIV INFOTECH – A SOFTWARE CUM RESEARCH COMPANY IN ELECTRONICS, ELECTRICAL, CIVIL AND MECHANICAL AREAS
Unit 2 VIRTUALISATION. Unit 2 - Syllabus Basics of Virtualization Types of Virtualization Implementation Levels of Virtualization Virtualization Structures.
Building on virtualization capabilities for ExTENCI Carol Song and Preston Smith Rosen Center for Advanced Computing Purdue University ExTENCI Kickoff.
© 2007 UC Regents1 Rocks – Present and Future The State of Things Open Source Grids and Clusters Conference Philip Papadopoulos, Greg Bruno Mason Katz,
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of Atmosphere.
Introduction to Operating Systems Concepts
Chapter 6: Securing the Cloud
Cloud Technology and the NGS Steve Thorn Edinburgh University (Matteo Turilli, Oxford University)‏ Presented by David Fergusson.
C Loomis (CNRS/LAL) and V. Floros (GRNET)
GWE Core Grid Wizard Enterprise (
Quattor in Amazon Cloud
Tools and Services Workshop Overview of Atmosphere
Lecture 14 Virtual Memory and the Alpha Memory Hierarchy
Haiyan Meng and Douglas Thain
Different types of Linux installation
Presentation transcript:

© UC Regents 2010 Extending Rocks Clusters into Amazon EC2 Using Condor Philip Papadopoulos, Ph.D University of California, San Diego San Diego Supercomputer Center California Institute for Telecommunications and Information Technology (Calit2)

© UC Regents 2010 Triton Resource Large Memory PSDAF 256 GB & 512 GB Nodes (32 core) 8TB Total 128 GB/sec ~ 9TF x28 Shared Resource Cluster 16 GB/Node 4 - 8TB Total 256 GB/sec ~ 20 TF x256 Background: So, You want to build a cluster? Campus Research Network UCSD Research Labs Large Scale Storage (Working on RFP) 2 – 4 PB GB/sec 3000 – 6000 disks

© UC Regents 2010 © 2009 UC Regents3 The Modern “Cluster” Architecture is Not Just an MPI Cluster Standard Compute Cluster Each logical Configuration is a Rocks Appliance CAMERA Bioinformatics

© UC Regents © 2005 UC Regents4 Technology transfer of commodity clustering to application scientists – “make clusters easy” Rocks is a cluster on a CD – Clustering software (PBS, SGE, Ganglia, Condor, … ) – Highly programmatic software configuration management – Put CDs in Raw Hardware, Drink Coffee, Have Cluster. Extensible using “Rolls” Large user community – Over 1PFlop of known clusters – Active user / support list of users – Estimate > 2000 installed cluster Active Development – 2 software releases per year – Code Development at SDSC – Other Developers (UCSD, Univ of Tromso, External Rolls Supports Redhat Linux, Scientific Linux, Centos and Solaris Can build Real, Virtual, and Hybrid Combinations Rocks

© UC Regents © 2005 UC Regents5 Rocks Breaks Apart the Software Stack into Rolls

© UC Regents Rolls on a Simple Cluster

© UC Regents Condor Roll Condor (updating to 7.4.2) Integration with Rocks command line to do basic Condor configuration customization To build a Condor Cluster with Rocks – Base, OS, Kernel, Condor Roll – Gives you local collector, scheduler Basic, Working Configuration that can be customized as required.

© UC Regents 2010 Virtual Cluster 2 Virtual Cluster 1 Virtual Clusters in Rocks Today Physical Hosting Cluster “Cloud Provider” Require: 1.Virtual Frontend 2.Nodes w/disk 3.Private Network 4.Power Virtual Clusters: May overlap one another on physical HW Need network isolation May be larger or smaller than physical hosting cluster

© UC Regents 2010 How Rocks Treats Virtual Hardware It’s just another piece of HW. – If RedHat supports it, so does Rocks Allows mixture of real and virtual hardware in the same cluster – Because Rocks supports heterogeneous HW clusters Re-use of all of the software configuration mechanics – E.g., a compute appliance is compute appliance Virtual HW must meet minimum HW Specs – 1GB memory – 36GB Disk space* – Private-network Ethernet – + Public Network on Frontend * Not strict – EC2 images are 10GB

© UC Regents Rocks Hybrid: Linux/Solaris/Physical/Virtual Xen VM Phys Linux Solaris

© UC Regents Basic EC2 Amazon Machine Images (AMIs) S3 – Simple Storage Service EBS – Elastic Block Store Amazon Cloud Storage Elastic Compute Cloud (EC2) Copy AMI & Boot AMIs are copied from S3 and booted in EC2 to create a “running instance” When instance is shutdown, all changes are lost – Can save as a new AMI

© UC Regents Basic EC2 AMI (Amazon Machine Image) is copied from S3 to EC2 for booting – Can boot multiple copies of an AMI as a “group” – Not a cluster, all running instances are independent If you make changes to your AMI while running and want them saved – Must repack to make a new AMI Or use Elastic Block Store (EBS) on a per-instance basis

© UC Regents Some Challenges in EC2 1.Defining the contents of your Virtual Machine (Software Stack) 2.Understanding limitations and execution model 3.Debugging when something goes wrong 4.Remembering to turn off your VM – Smallest 64-bit VM is ~$250/month running 7x24

© UC Regents What’s in the AMI? Tar file of a / file system – Cryptographically signed so that Amazon can open it, but other users cannot – Split into 10MB chunks, stored in S3 Amazon boasts more than 2000 public machine images – What’s in a particular image? – How much work is it to get your software part of an existing image? There are tools for booting and monitoring instances. Defining the software contents is “an exercise left to the reader”

© UC Regents The EC2 Roll Take a Rocks appliance and make it compatible with EC2: – 10GB disk partition (single) – DHCP for network – ssh key management – Other small adjustments Create an AMI bundle on local cluster – rocks create ec2 bundle Upload a bundled image into EC2 – rocks upload ec2 bundle Mini-tutorial on getting started with EC2 and Rocks

© UC Regents 2010 Putting all together: Virtual Cluster Experiment Nimrod – Monash University Rocks® – UC San Diego Condor – U. Wisconsin Amazon EC2 – Brought to you by Visa®

© UC Regents 2010 Virtual Cluster 2 Virtual Cluster 1 Virtual Clusters in Rocks Today Physical Hosting Cluster “Cloud Provider” Require: 1.Virtual Frontend 2.Nodes w/disk 3.Private Network 4.Power Virtual Clusters: May overlap one another on physical HW Need network isolation May be larger or smaller than physical hosting cluster

© UC Regents 2010 Nimrod.rockscluster.org Extended Cluster Experiment in PRAGMA Amazon EC2 Cloud Rocks – Created VM Monash eScience and Grid Engineering Laboratory NIMROD – Parameter Sweep/Optimization fiji.rocksclusters.org Hosting Cluster for Job Management

© UC Regents 2010 Extended Cluster Using Condor

© UC Regents 2010 Can Log into the Running VM

© UC Regents Steps to Make this Work Build Local Cluster with appropriate rolls – Rocks + Xen Roll + EC2 Roll + Condor Roll (+ NIMROD + … ) Create local appliance as VM using standard Rocks tools – Set ec2_enable attribute to build it as an EC2-Compatible VM – Build and test locally Bundle, Upload, Register as an EC2 AMI – Rocks command line tools Boot with appropriate meta data to register automatically with your local collector.  ec2-run-instances -t m1.large ami-219d7248 -d "condor:landphil.rocksclusters.org:40000:40050" – Requires one-time EC2 firewall settings Use your extended Condor Pool PREPARATION RUN

© UC Regents Summary Easily Extend your Condor pool into EC2 – Others can do this as well – Condor supports the public/private network duality of EC2 Have your software on both local cluster and remote VM in EC2 Mix and match – Local Physical, Local Virtual, Remote Virtual If you use Rocks, does not take extra effort