Magellan: Experiences from a Science Cloud Lavanya Ramakrishnan.


Magellan Overview
Mission: determine the appropriate role for private cloud computing for mid-range, tightly coupled computational models.

Layout
Describe experiences with the cloud software stack
– Eucalyptus
– MapReduce: Hadoop
Early science use cases and their impact on application design and development
Detail specific requirements for scientific use

Experience with Private Cloud Software
Eucalyptus (1.6.2)
– open source IaaS (infrastructure as a service) software
– API compatible with Amazon
– support for Elastic Block Store, Elastic IP addresses

Experiences with Eucalyptus
Scalability
– all VM network traffic is routed through a single cluster controller node
  * pro: good for security
  * con: network bottleneck, restricts scalability
– the number of concurrent VMs is limited by a messaging size limit
Image Management
– requires system administration skills
– users must create, manage and upload correct images
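The scalability point above can be made concrete with back-of-the-envelope arithmetic: if every VM's traffic crosses one cluster controller link, each VM's share of that link shrinks as VMs are added. The sketch below assumes an even split among VMs that all transmit at once; the function name, the 10 Gb/s link speed and the VM counts are hypothetical illustrations, not Magellan measurements.

```python
def per_vm_bandwidth_gbps(link_gbps: float, num_vms: int) -> float:
    """Even share of a single shared link when all VMs transmit at once.

    Assumption: the cluster controller link is the only bottleneck and
    bandwidth is divided equally. Real traffic is burstier than this.
    """
    if num_vms <= 0:
        raise ValueError("num_vms must be positive")
    return link_gbps / num_vms


if __name__ == "__main__":
    LINK_GBPS = 10.0  # hypothetical controller uplink, not a measured value
    for n in (1, 10, 100, 400):
        share = per_vm_bandwidth_gbps(LINK_GBPS, n)
        print(f"{n:4d} VMs -> {share:.3f} Gb/s per VM")
```

Under these assumptions the per-VM share falls linearly with the number of VMs, which is why a single routing node restricts how far the cluster can scale.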

Experiences with Eucalyptus
Co-existence with other services
– Eucalyptus uses a number of system services and assumes it has complete control of the system
Allocation and Accounting
– hard to ensure fairness, since allocation is first come, first served
Logging and Monitoring
– limited support
– recovery: a loss of IP address assignments requires restarting all running instances
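The fairness concern under first come, first served allocation can be illustrated with a small simulation. This is a minimal sketch, not Eucalyptus code: the function names, users and slot counts are hypothetical, and the round-robin policy is shown only as one possible contrast.

```python
from collections import Counter, deque


def allocate_fcfs(requests, capacity):
    """Grant requests strictly in arrival order until capacity runs out.

    requests: list of (user, slots) tuples in arrival order.
    Returns a Counter of slots granted per user.
    """
    granted = Counter()
    free = capacity
    for user, slots in requests:
        take = min(slots, free)
        if take:
            granted[user] += take
        free -= take
    return granted


def allocate_round_robin(requests, capacity):
    """Hand out one slot at a time, cycling over users with unmet demand."""
    demand = Counter()
    for user, slots in requests:
        demand[user] += slots
    granted = Counter()
    free = capacity
    queue = deque(demand)  # users in order of first arrival
    while free > 0 and queue:
        user = queue.popleft()
        if granted[user] < demand[user]:
            granted[user] += 1
            free -= 1
            if granted[user] < demand[user]:
                queue.append(user)  # still wants more, go to the back
    return granted


if __name__ == "__main__":
    # Hypothetical workload: three users competing for 8 VM slots.
    reqs = [("alice", 8), ("bob", 4), ("carol", 4)]
    print("FCFS:       ", dict(allocate_fcfs(reqs, 8)))
    print("Round robin:", dict(allocate_round_robin(reqs, 8)))
```

With this workload the first requester takes every slot under first come, first served, while the round-robin policy spreads the same 8 slots across all three users.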

Experiences with Hadoop
File System Access
(1) Hadoop considers only the data locality of a single file and does not handle applications that might have multiple input sets.
(2) HDFS does not expose a POSIX interface, which makes it difficult for legacy applications to use the file system directly.
Configuration
(1) Hadoop has a number of site-specific and job-specific parameters that are hard to tune for optimal performance.
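The data locality limitation can be sketched as follows. This is not Hadoop's scheduler; it is a simplified, hypothetical model in which a task is placed on the node holding the most local bytes, first considering one input file only and then considering every input set. All names, file names and sizes are assumptions made for illustration.

```python
from collections import defaultdict


def best_node_all_inputs(block_locations):
    """Pick the node holding the most local bytes across all inputs.

    block_locations: {input_name: {node: local_bytes}}
    Ties are broken by node name so the result is deterministic.
    """
    local = defaultdict(int)
    for per_node in block_locations.values():
        for node, nbytes in per_node.items():
            local[node] += nbytes
    return max(sorted(local), key=local.__getitem__)


def best_node_single_file(block_locations, primary):
    """Pick a node by looking at one input only, ignoring the others."""
    return best_node_all_inputs({primary: block_locations[primary]})


if __name__ == "__main__":
    # Hypothetical placement of two input sets, sizes in MB.
    inputs = {
        "events.dat": {"node1": 128, "node2": 64},
        "calibration.dat": {"node2": 256, "node3": 256},
    }
    print(best_node_single_file(inputs, "events.dat"))  # node1
    print(best_node_all_inputs(inputs))                 # node2
```

In this example the single-file view places the task on node1, which holds none of the second input, whereas scoring all inputs together prefers node2.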

Application Case Studies
STAR – streamed real-time data analysis
Details
STAR performed real-time analysis of data coming from Brookhaven National Laboratory
It needs on-demand access to computing resources to process real-time data
Clouds serve as a platform for this application

Application Design and Development
Image creation and management
– system administration skills
– determining what goes on the image, etc.
Data management
– need to manage data storage properly
Performance and reliability need to be considered

Unique Needs and Features of a Science Cloud
Science clouds need access to legacy data sets in HPC centers.
Science clouds need MapReduce implementations that account for characteristics of scientific data and analysis methods.
Science clouds need preinstalled, pre-tuned application software stacks.
Science clouds need customizations for site-specific policies.

Conclusions
Current-day cloud computing solutions have gaps for science
– performance, reliability, stability
– programming models are difficult for legacy apps
HPC centers can adopt some of the technologies and mechanisms
– support for data-intensive workloads
– allow custom software environments
– provide different levels of service