Clouds , Grids and Clusters

Slides:



Advertisements
Similar presentations
Summary of Cloud Computing (CC) from the paper Abovce the Clouds: A Berkeley View of Cloud Computing (Feb. 2009)
Advertisements

UC Berkeley Above the Clouds A Berkeley View of Cloud Computing 1 UC Berkeley RAD Lab.
High Performance Computing Course Notes Grid Computing.
INTRODUCTION TO CLOUD COMPUTING CS 595 LECTURE 6 2/13/2015.
1 On Death, Taxes, & the Convergence of Peer-to-Peer & Grid Computing Adriana Iamnitchi Duke University “Our Constitution is in actual operation; everything.
MTA SZTAKI Hungarian Academy of Sciences Grid Computing Course Porto, January Introduction to Grid portals Gergely Sipos
Seminar Grid Computing ‘05 Hui Li Sep 19, Overview Brief Introduction Presentations Projects Remarks.
CLOUD Demystifying the Jesse Dunietz SASS Talk A Survey of Cloud Computing.
1-2.1 Grid computing infrastructure software Brief introduction to Globus © 2010 B. Wilkinson/Clayton Ferner. Spring 2010 Grid computing course. Modification.
4b.1 Grid Computing Software Components of Globus 4.0 ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson, slides 4b.
Globus Computing Infrustructure Software Globus Toolkit 11-2.
Grid and Cloud Computing Anda Iamnitchi CIS 6930 Spring 2011
Cloud Computing: Overview 1. This lecture What is cloud computing? What are its essential characteristics? Why cloud computing? Classification/service.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
INTRODUCTION TO CLOUD COMPUTING Cs 595 Lecture 5 2/11/2015.
An Introduction to Cloud Computing. The challenge Add new services for your users quickly and cost effectively.
Includes slides borrowed freely from The Globus team Building Grid Services and The Globus Toolkit ® CISE : Globus Tutorial Anda Iamnitchi.
Kate Keahey Argonne National Laboratory University of Chicago Globus Toolkit® 4: from common Grid protocols to virtualization.
Cloud Computing – The Cloud Dr. Jie Liu. Definition  Cloud computing is Web-based processing, whereby shared resources, software, and information are.
Grid Computing Anda Iamnitchi Federated Distributed Systems, Fall ‘06 Including slides adapted from presentations by Ian Foster, Lee Liming, Paul Jeffreys.
A Brief Overview by Aditya Dutt March 18 th ’ Aditya Inc.
Cloud Computing: Overview
Cloud Computing Saneel Bidaye uni-slb2181. What is Cloud Computing? Cloud Computing refers to both the applications delivered as services over the Internet.
CEG7380 Cloud Computing Lecture 1
Cloud computing is the use of computing resources (hardware and software) that are delivered as a service over the Internet. Cloud is the metaphor for.
An emerging computing paradigm where data and services reside in massively scalable data centers and can be ubiquitously accessed from any connected devices.
SDN : What We’ve Learned Martìn Casado I’ve. Outline SDN : a History SDN : a Definition SDN : What I’ve Learned.
Introduction to Cloud Computing
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
Advanced Topic in Information Systems: Cloud Computing Hadi Salimi Distributed Systems Lab, School of Computer Engineering, Iran University of Science.
Integrated Financial Applications using Intuit’s PaaS Solution George Chiramattel, Intuit.
© Asite Solutions Ltd Cloud Computing World Forum A Global Software as a Service Platform for Supply Chain Collaboration Tony Ryan CEOwww.asite.com.
Institute For Digital Research and Education Implementation of the UCLA Grid Using the Globus Toolkit Grid Center’s 2005 Community Workshop University.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
1 Cloud Computing Advisor : Cho-Chin Lin Student : Chien-Chen Lai.
Introduction to Grids By: Fetahi Z. Wuhib [CSD2004-Team19]
Cloud Computing is a Nebulous Subject Or how I learned to love VDF on Amazon.
Securing the Grid & other Middleware Challenges Ian Foster Mathematics and Computer Science Division Argonne National Laboratory and Department of Computer.
3/12/2013Computer Engg, IIT(BHU)1 CLOUD COMPUTING-1.
1 TCS Confidential. 2 Objective : In this session we will be able to learn:  What is Cloud Computing?  Characteristics  Cloud Flavors  Cloud Deployment.
INTRODUCTION TO GRID & CLOUD COMPUTING U. Jhashuva 1 Asst. Professor Dept. of CSE.
Clouds, Grids and Clusters Prepared by M.Chandana Department of CSE Engineered for Tomorrow Course code: 10CS845.
Above the Clouds: A Berkeley View of Cloud Computing Annajiat Alim Rasel, P Shimul Bala, P Raquibul Bari, P Annajiat Alim.
Chapter 1 Characterization of Distributed Systems
Unit 3 Virtualization.
Accessing the VI-SEEM infrastructure
Grid and Cloud Computing
Chapter 6: Securing the Cloud
Lecture 2. Ecosystem, CAP, and Challenges
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING CLOUD COMPUTING
By: Raza Usmani SaaS, PaaS & TaaS By: Raza Usmani
The Future? Or the Past and Present?
Clouds , Grids and Clusters
SuperComputing 2003 “The Great Academia / Industry Grid Debate” ?
Cloud computing-The Future Technologies
Prepared by: Assistant prof. Aslamzai
An Introduction to Cloud Computing
Globus —— Toolkits for Grid Computing
Recap: introduction to e-science
Study course: “Computing clusters, grids and clouds” Andrey Y. Shevel
University of Technology
CEG7380 Cloud Computing Lecture 1
Introduction to Cloud Computing
Cloud Computing Dr. Sharad Saxena.
Above the Clouds A Berkeley View of Cloud Computing
Excelian Grid as a Service Offers Compute Power for a Variety of Scenarios, with Infrastructure on Microsoft Azure and Costs Aligned to Actual Use MICROSOFT.
Cloud Management Mechanisms
Cloud Computing and its Implementation
Syllabus and Introduction Keke Chen
Views of Cloud Computing
Presentation transcript:

Clouds , Grids and Clusters Course code: 10CS845 Clouds , Grids and Clusters Engineered for Tomorrow Prepared by M .Chandana Department of CSE

Grid: Resource-Sharing Environment Users: 1000s from 10s institutions Well-established communities Resources: Computers, data, instruments, storage, applications Owned/administered by institutions Applications: data- and compute-intensive processing Approach: common infrastructure

Grids vs. P2P Systems Functionality & infrastructure Grids P2P Large scale Weaker trust assumptions Ease of integration No centralized authority Intermittent resource/user participation Diversity in: Shared resources Sharing characteristics Variable technical support Infrastructure (sharable services) Support for diverse applications Grids P2P Scale & volatility On Death, Taxes, and the Convergence of Grid and P2P Systems, Foster and Iamnitchi, IPTPS’03

Grid: Definitions Definition 1: Infrastructure that provides dependable, consistent, pervasive, and inexpensive access to high-end computational capabilities (1998) Definition 2: A system that coordinates resources not subject to centralized control, using open, general-purpose protocols to deliver nontrivial Quality of Service (2002)

An Example: The Globus Toolkit - Initially developed at Argonne National Lab/University of Chicago and ISI/University of Southern California

How It Started While helping to build/integrate a diverse range of distributed applications, the same problems kept showing up over and over again. Too hard to keep track of authentication data (ID/password) across institutions Too hard to monitor system and application status across institutions Too many ways to submit jobs Too many ways to store & access files and data Too many ways to keep track of data Too easy to leave “dangling” resources lying around (robustness)

grid architecture in a nutshell

Forget Homogeneity! Trying to force homogeneity on users is futile. Everyone has their own preferences, sometimes even dogma. The Internet provides the model…

From Theory to Practice

Building a Grid (in Practice) Building a Grid system or application is currently an exercise in software integration. Define user requirements Derive system requirements or features Survey existing components Identify useful components Develop components to fit into the gaps Integrate the system Deploy and test the system Maintain the system during its operation This should be done iteratively, with many loops and eddys in the flow.

How it Really Happens Compute Server Simulation Tool Compute Server Web Browser Web Portal Registration Service Camera Telepresence Monitor Data Viewer Tool Camera Database service Chat Tool Data Catalog Database service Credential Repository Database service Certificate authority Users work with client applications Application services organize VOs & enable access to other services Collective services aggregate &/or virtualize resources Resources implement standard access & management interfaces

How it Really Happens (without Globus) Compute Server Simulation Tool B Compute Server Web Browser Web Portal Registration Service Camera Telepresence Monitor Data Viewer Tool Application Developer 10 Off the Shelf 12 Globus Toolkit Grid Community Camera C Database service Chat Tool Data Catalog D Database service Credential Repository E Database service Certificate authority Users work with client applications Application services organize VOs & enable access to other services Collective services aggregate &/or virtualize resources Resources implement standard access & management interfaces

How it Really Happens (with Globus) Globus GRAM Compute Server Simulation Tool Globus GRAM Compute Server Web Browser CHEF Globus Index Service Camera Telepresence Monitor Data Viewer Tool Application Developer 2 Off the Shelf 9 Globus Toolkit 4 Grid Community Camera Globus DAI Database service CHEF Chat Teamlet Implementations are provided by a mix of Application-specific code “Off the shelf” tools and services Tools and services from the Globus Toolkit Tools and services from the Grid community (compatible with GT) Glued together by… Application development System integration Globus MCS/RLS Globus DAI Database service MyProxy Globus DAI Database service Certificate Authority Users work with client applications Application services organize VOs & enable access to other services Collective services aggregate &/or virtualize resources Resources implement standard access & management interfaces

What Is the Globus Toolkit? The Globus Toolkit is a collection of solutions to problems that frequently come up when trying to build collaborative distributed applications. Not turnkey solutions, but building blocks and tools for application developers and system integrators. Some components (e.g., file transfer) go farther than others (e.g., remote job submission) toward end-user relevance. To date, the Toolkit has focused on simplifying heterogeneity for application developers. The goal has been to capitalize on and encourage use of existing standards (IETF, W3C, OASIS, GGF). The Toolkit also includes reference implementations of new/proposed standards in these organizations.

How To Use the Globus Toolkit By itself, the Toolkit has surprisingly limited end user value. There’s very little user interface material there. You can’t just give it to end users (scientists, engineers, marketing specialists) and tell them to do something useful! The Globus Toolkit is useful to application developers and system integrators. You’ll need to have a specific application or system in mind. You’ll need to have the right expertise. You’ll need to set up prerequisite hardware/software. You’ll need to have a plan.

Globus Toolkit Components Credential Management G T 4 Python WS Core [contribution] C WS Core Community Scheduler Framework Delegation Service WS Authentication Authorization Reliable File Transfer OGSA-DAI [Tech Preview] Grid Resource Allocation Mgmt (WS GRAM) Monitoring & Discovery System (MDS4) Java WS Core Community Authorization Service G T 3 Replica Location XIO Web Services Components Pre-WS Authentication Authorization GridFTP Grid Resource Allocation Mgmt (Pre-WS GRAM) Monitoring & Discovery System (MDS2) C Common Libraries G T 2 Non-WS Components Security Data Management Execution Management Information Services Common Runtime

From Grids to Cloud Computing Logical steps: Make the grids public Provide much simpler interfaces (and more limited control) Charge usage of resources Instead of relying on implicit incentives from science collaborations Ideally, a “pay-as-you-go” rate In reality: Different history Cloud computing as utility computing (1966 paper) However, the promise of cloud computing finds a great user base in science grids due to: Intense computations Huge amounts of storage needs Much of the Grid research community is now working on clouds How much of that is only rebranding is useful to understand

Outline What is Cloud Computing? Why now? Cloud killer apps Economics for users Economics for providers Challenges and opportunities Implications Case study: Amazon Web Services 19

What is Cloud Computing? Old idea: Software as a Service (SaaS) Def: delivering applications over the Internet Recently: “[Hardware, Infrastructure, Platform] as a service” Poorly defined so we avoid all “X as a service” Utility Computing: pay-as-you-go computing Illusion of infinite resources No up-front cost Fine-grained billing (e.g. hourly) Cloud computing: a new term for the long-held dream of utility computing (first defined in 1966) Refers to both the application delivered as services over the Internet and the hardware and software systems in the datacenters that provide those services. 20

Why Now? Experience with very large datacenters Unprecedented economies of scale Other factors Pervasive broadband Internet Fast x86 virtualization Pay-as-you-go billing model Standard software stack 21

Spectrum of Clouds Instruction Set VM (Amazon EC2, 3Tera) Bytecode VM (Microsoft Azure) Framework VM Google AppEngine, Force.com Lower-level, Less management Higher-level, More management EC2 Azure AppEngine Force.com 22

Cloud Killer Applications Mobile and web applications Extensions of desktop software Matlab, Mathematica Batch processing / MapReduce Oracle at Harvard, Hadoop at NY Times 23

Economics of Cloud Users Pay by use instead of provisioning for peak Demand Capacity Time Resources Demand Capacity Time Resources Unused resources Static data center Data center in the cloud 24 24

Economics of Cloud Users Risk of over-provisioning: underutilization Demand Capacity Time Resources Unused resources Static data center 25

Economics of Cloud Users Heavy penalty for under-provisioning Resources Demand Capacity Time (days) 1 2 3 Resources Demand Capacity Time (days) 1 2 3 Lost revenue Resources Demand Capacity Time (days) 1 2 3 Lost users 26

Economics of Cloud Providers (1) 5-7x economies of scale [Hamilton 2008] Resource Cost in Medium Data Centers Very Large Data Centers Ratio Network $95 / Mbps / month $13 / Mbps / month 7.1x Storage $2.20 / GB / month $0.40 / GB / month 5.7x Administration ≈140 servers/admin >1000 servers/admin 27

Economics of Cloud Providers (2) Price per KWH Where Possible Reasons Why 3.6¢ Idaho Hydroelectric power; not sent long distance. 10.0¢ California Electricity transmitted long distance over the grid; limited transmission lines in Bay Area; no coal fired electricity allowed in California. 18.0¢ Hawaii Must ship fuel to generate electricity. Price of kilowatt-hours of electricity by region.

Economics of Cloud Providers (3) Extra benefits Amazon: utilize off-peak capacity Microsoft: sell .NET tools Google: reuse existing infrastructure

Adoption Challenges Challenge Opportunity Availability: Outages DDoS Multiple providers & Data Centers Data lock-in Standardization Data Confidentiality and Auditability Encryption, VLANs, Firewalls; Geographical Data Storage 30

Growth Challenges Challenge Opportunity Data transfer bottlenecks FedEx-ing disks, Data Backup/Archival - Mailing disks is already provided by Amazon Performance unpredictability Improved VM support, flash memory, scheduling VMs Scalable storage Invent scalable store Bugs in large distributed systems Invent Debugger that relies on Distributed VMs Scaling quickly Invent Auto-Scaler that relies on ML; Snapshots 31

Policy and Business Challenges Opportunity Reputation Fate Sharing Offer reputation-guarding services like those for email Software Licensing Pay-for-use licenses; Bulk use sales 32

Long Term Implications Application software: Cloud & client parts, disconnection tolerance Infrastructure software: Resource accounting, VM awareness Hardware systems: Containers, energy proportionality 33

Some Views On Cloud Computing “The interesting thing about Cloud Computing is that we’ve redefined Cloud Computing to include everything that we already do. . . . I don’t understand what we would do differently in the light of Cloud Computing other than change the wording of some of our ads.” Larry Ellison (Oracle’s CEO), quoted in the Wall Street Journal, September 26, 2008

“A lot of people are jumping on the [cloud] bandwagon, but I have not heard two people say the same thing about it. There are multiple definitions out there of the cloud.” Andy Isherwood, Hewlett-Packard’s Vice President of European Software Sales, quoted in ZDnet News, December 11, 2008

“It’s stupidity. It’s worse than stupidity: it’s a marketing hype campaign. Somebody is saying this is inevitable — and whenever you hear somebody saying that, it’s very likely to be a set of businesses campaigning to make it true.” Richard Stallman, quoted in The Guardian, September 29, 2008