Cloud Computing Open source cloud infrastructures Keke Chen.

Slides:



Advertisements
Similar presentations
Cloud computing is used to describe a variety of computing concepts that involve a large number of computers connected through a real-time communication.
Advertisements

By Fletcher Liverance For Dr. Jin, CS49995 February 5 th 2012.
© 2012 IBM Corporation Architecture of Quantum Folsom Release Yong Sheng Gong ( 龚永生 ) gongysh #openstack-dev Quantum Core developer.
B. Ramamurthy 4/17/ Overview of EC2 Components (fig. 2.1) 10..* /17/20152.
EHarmony in Cloud Subtitle Brian Ko. eHarmony Online subscription-based matchmaking service Available in United States, Canada, Australia and United Kingdom.
Amazon Web Services and Eucalyptus
Using EC2 with HTCondor Todd L Miller 1. › Introduction › Submitting an EC2 job (user tutorial) › New features and other improvements › John Hover talking.
Lecture 12: Cloud Computing-C Amazon Web Service Tutorial.
Cloud Computing Imranul Hoque. Today’s Cloud Computing.
1 NETE4631 Cloud deployment models and migration Lecture Notes #4.
Infrastructure as a Service (IaaS) Amazon EC2
An Approach to Secure Cloud Computing Architectures By Y. Serge Joseph FAU security Group February 24th, 2011.
1 Security on OpenStack 11/7/2013 Brian Chong – Global Technology Strategist.
Chien-Chung Shen Google Compute Engine Chien-Chung Shen
Introduction to DoC Private Cloud
Amazon EC2 Quick Start adapted from EC2_GetStarted.html.
Cisco and OpenStack Lew Tucker VP/CTO Cloud Computing Cisco Systems,
Tanenbaum 8.3 See references
Introduction to Amazon Web Services (AWS)
Utility Computing Casey Rathbone 1http://cyberaide.org.edu.
A Brief Overview by Aditya Dutt March 18 th ’ Aditya Inc.
Eucalyptus on FutureGrid: A case for Eucalyptus 3 Sharif Islam, Javier Diaz, Geoffrey Fox Gregor von Laszewski Indiana University.
INTRODUCTION TO CLOUD COMPUTING CS 595 LECTURE 7 2/23/2015.
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
Customized cloud platform for computing on your terms !
Building service testbeds on FIRE D5.2.5 Virtual Cluster on Federated Cloud Demonstration Kit August 2012 Version 1.0 Copyright © 2012 CESGA. All rights.
Nimbus & OpenNebula Young Suk Moon. Nimbus - Intro Open source toolkit Provides virtual workspace service (Infrastructure as a Service) A client uses.
Cloud Computing. What is Cloud Computing? Cloud computing is a model for enabling convenient, on-demand network access to a shared pool of configurable.
Accessing the Amazon Elastic Compute Cloud (EC2) Angadh Singh Jerome Braun.
+ CS 325: CS Hardware and Software Organization and Architecture Cloud Architectures.
The Blue Book pages 19 onwards
Creating an EC2 Provisioning Module for VCL Cameron Mann & Everett Toews.
K. Liu, Q. Huang, J. Xia, Z. Li, P. Lostritto, Chapter 4 How to use cloud computing?, In Spatial Cloud Computing: a practical approach, edited by.
Cloud Standard API and Contextualization
1 NETE4631 Amazon Cloud Offerings Lecture Notes #6.
Presented by: Sanketh Beerabbi University of Central Florida COP Cloud Computing.
COMS E Cloud Computing and Data Center Networking Sambit Sahu
Eucalyptus: An Open-source Infrastructure for Cloud Computing Rich Wolski Eucalyptus Systems Inc.
Eucalyptus 3 (&3.1). Eucalyptus 3 Product Overview – Govind Rangasamy.
Magellan: Experiences from a Science Cloud Lavanya Ramakrishnan.
The Eucalyptus Open-source Cloud Computing System Daniel Nurmi Rich Wolski, Chris Grzegorczyk, Graziano Obertelli, Sunil Soman, Lamia Youseff, Dmitrii.
Cloud Computing is a Nebulous Subject Or how I learned to love VDF on Amazon.
Vignesh Ravindran Sankarbala Manoharan. Infrastructure As A Service (IAAS) is a model that is used to deliver a platform virtualization environment with.
Arun Madhavan Graduate Assistant, iPlant Collaborative Experiences with Eucalyptus.
OpenStack overview of the project Belmiro Daniel Rodrigues Moreira CERN IT-PES-PS January 2011 Disclaimer: This presentation reflects the experience and.
1 TCS Confidential. 2 Objective: In this session we will be able to learn  What is Openstack?  History  Capabilities  Openstack as IaaS  Advantages.
Launch Amazon Instance. Amazon EC2 Amazon Elastic Compute Cloud (Amazon EC2) provides resizable computing capacity in the Amazon Web Services (AWS) cloud.
INTRODUCTION TO HADOOP. OUTLINE  What is Hadoop  The core of Hadoop  Structure of Hadoop Distributed File System  Structure of MapReduce Framework.
Chapter 11 – Cloud Application Development. Contents Motivation. Connecting clients to instances through firewalls. Cloud Computing: Theory and Practice.
OpenStack Demo Kashif Mohammad University of Oxford 1 HepSysMan.
Introductory Tutorial: OpenStack, Chef, Hadoop, Hbase, Pig I590 Data Science Curriculum Big Data Open Source Software and Projects September Geoffrey.
Co-ordination & Harmonisation of Advanced e-Infrastructures for Research and Education Data Sharing Grant.
© 2015 MetricStream, Inc. All Rights Reserved. AWS server provisioning © 2015 MetricStream, Inc. All Rights Reserved. By, Srikanth K & Rohit.
GETTING STARTED WITH AWS AND PYTHON. OUTLINE  Intro to Boto  Installation and configuration  Working with AWS S3 using Bot  Working with AWS SQS using.
SCI-BUS is supported by the FP7 Capacities Programme under contract nr RI CloudBroker usage Zoltán Farkas MTA SZTAKI LPDS
INTRODUCTION TO AMAZON WEB SERVICES (EC2). AMAZON WEB SERVICES  Services  Storage (Glacier, S3)  Compute (Elastic Compute Cloud, EC2)  Databases (Redshift,
OpenStack.
Investigation authentication using AAF for the CVL on NeCTAR
Usage of Openstack Cloud Computing Architecture in COE Seowon Jung Systems Administrator, COE
Oracle Solaris Zones Study Purpose Only
OpenStack Ani Bicaku 18/04/ © (SG)² Konsortium.
OPNFV Arno Installation & Validation Walk-Through
Introduction to Apache
AWS Cloud Computing Masaki.
* Introduction to Cloud computing * Introduction to OpenStack * OpenStack Design & Architecture * Demonstration of OpenStack Cloud.
Different types of Linux installation
The Blue Book pages 19 onwards
OpenStack Summit Berlin – November 14, 2018
Presentation transcript:

Cloud Computing Open source cloud infrastructures Keke Chen

Outline  Project 3  Eucalyptus  OpenStack

Project 3: using AWS  Tasks (work from nimbus17 or your own PC) Create AWS account and setup the environment Try basic EC2 commands Start a hadoop cluster on EC2, using the hadoopEC2 tool Read the code of hadoopEC2 to understand how to interact with EC2 in shell scripts

Starting hadoop cluster on EC2  Read  Setup Check src/contrib/ec2/bin/hadoop-ec2- env.sh  You don’t need to change anything there You should setup your own environment variables in.profile,.login, or.bashrc  AWS_ACCOUNT_ID, AWS_ACCESS_KEY_ID, AWS_SECRET_ACCESS_KEY

Starting hadoop on EC2  copy $HADOOP_HOME/src/contrib/ec2 to your own directory  % bin/hadoop-ec2 launch-cluster your- cluster-name #ofslaves  % bin/hadoop-ec2 login your-cluster-name  Test your cluster /usr/local/hadoop-* Hadoop fsck /  Diagnose problems (understand the hadoop setup) hadoop-on-ubuntu-linux-single-node-cluster/ hadoop-on-ubuntu-linux-single-node-cluster/

Read the source of the EC2 tool  Check the script hadoop-ec2 and learn how to automatically launch instances Pass initialization scripts to instances Change Hadoop configuration  Answer some questions

Make your own AMI  install a recent Hadoop version e.g., 1.0.x in the AMI  HadoopEC2 provides some scripts but they need to be revised to work with the current setting

Experiment with HDFS and S3  Hadoop can use either HDFS or S3 as the storage for MapReduce.  You need to learn the performance difference for these two options How to configure Hadoop to use S3 Conduct a simple experiment to compare the performance of different storage

Most popular open-source AWS equivalence  Eucalyptus Started by UCSB researchers, now a company  OpenStack Started by NASA, now an open source platform

Eucalyptus  Compatible to AWS APIs (EC2, S3, mainly) Thus, Boto library can be used, too A good example for understanding how AWS works

 Paper “The Eucalyptus Open-source Cloud-computing System” How VM instances are managed How to provide virtual network (like elastic IP) How to provide data storage (like S3) A very brief description, but we can get something

System Design Data center CLC: cloud controller Walrus: storage controller similar to S3 CC: cluster controllerNC: node controller

Components: Node Controller  Make queries to discover physical resources # of cores Size of memory Available disk space State of VM instances  Propagate the information to Cluster Controller DescribeResource DescribeInstances  Run/terminate instances CLC  CC  NC  hypervisor (Xen)

Node controller  Start an instance Copy instance image from walrus or local cache Create endpoint in the virtual network overlay Instruct hypervisor to boot the instance  Stop an instance Instruct hypervisor to terminate the VM Tear down the virtual network endpoint Clean up the files associated with the instance

Cluster Controller  Gather/report information of NCs Through the interface provided by NCs Report the summary to CLC  Schedule incoming instance “run” requests to specific NCs  Control the virtual network overlay

Virtual network overlay  VM instance interconnectivity (between different nodes/networks) Not very well mentioned in Xen Connectivity, isolation and performance  At least one of a set of VMs be exposed externally Map the public IP to that instance  Restricted communication VMs in the same set can talk to each other VMs from different sets should be isolated

Virtual network overlay Each VM has a private IP; one VM in the set also has a public IP VLAN tag defines the subnet – to isolate sets of VMs Cluster Controller serves as the router between VM subnets - CC uses Linux iptable control traffics - Use iptable Network Address Translation (NAT) to define the map from Public IP to private IP

Storage Controller (Walrus)  Provide SOAP/REST interfaces Compatible with S3 – you can use S3 tools  Use Walrus to stream data in/out of the cloud  Store VM images (same as AMI) Root file system, kernel image, ramdisk image  No locking for object writes Conflict writes – late write overwrites the earlier

 Provides the same tool Amazon uses Generate AMI  Maintains a cache of images  Authentication is applied when NC accesses images

Cloud Controller  A collection of web services Resource services Data services Interface services

Cloud Controller: resource services  Receive user requests  Interact with CCs to allocate/deallocate  System Resource State (SRS) is maintained by querying CCs CCs will collect information from NCs  Follows a “transactional” operation Reservation, VM creation  commit Or errors  rollback  Realizing SLAs

Cloud Controller: data services  Handles the creation, modification, interrogation, and storage of stateful system and user data There is a system database…  Users can query the services Discover resource info (images, clusters) Manipulate abstract parameters(keypairs, security groups, network definitions) Recall some of AWS interfaces…

Cloud Controller: interface services  User-visible interfaces Programmatic interfaces (SOAP/REST) Web interface  Handling authentication  Provide system management tools

OpenStack

 Originated at NASA, with Rackspace  Driven by an open community process  Multiple hypervisors: Xen, KVM, ESXi, Hyper-V  First release: Oct 2010

Components  Nova – Compute (equivalent to EC2)  Swift – object storage (S3)  Image service (AMI)  Networking (virtual network)  Block storage (Elastic block storage)  Identity  Dashboard (AWS web console) -- mostly implemented with python

Fastest Growing Global Open Source Community COMPANIES TOTAL CONTRIBUTORS AVERAGE MONTHLY CONTRIBUTORS CODE CONTRIBUTIONS 1, , ,149 INDIVIDUAL MEMBERS COUNTRIES 121 As of July 2013

Global Community Countries with members

Developer Growth Contributors per month (ohloh)

1 Million+ Lines of Code Lines of code (ohloh)

Ecosystem Growth Participating Companies