Cloud Computing – Issues, Research and Implementations


Cloud Computing – Issues, Research and Implementations. 30th International Conference on Information Technology Interfaces, June 2008. Mladen A. Vouk, Department Head and Professor of Computer Science, North Carolina State University. 20101109, Mike Hsiao

Mladen A. Vouk, “Cloud Computing - Issues, Research and Implementations,” in Proc. 30th International Conference on Information Technology Interfaces (ITI 2008), June 2008, pp. 31-40. Mladen A. Vouk, “Cloud Computing - Issues, Research and Implementations,” Journal of Computing and Information Technology, Vol 16 (4), 2008, pp. 235-246.

Outline: Introduction; Cloud Computing; An Implementation; Research Issues; Cyberinfrastructure; Concepts (Service-oriented Architecture, Components, Workflows, Virtualization); Users (Developers, Authors, Service Composition, End-users); An Implementation; Research Issues; Conclusions

Virtual Computing Laboratory (VCL) - 1 http://vcl.ncsu.edu VCL is an award-winning open source implementation of a secure production-level on-demand utility computing and services oriented technology for wide-area access to solutions based on virtualized resources, including computational, storage and software resources.

Virtual Computing Laboratory (VCL) - 2 NCSU provides a remote access service that allows you to reserve a computer with a desired set of applications for yourself, and remotely access it over the Internet. This service is powered by the Apache Software Foundation's Virtual Computing Lab (VCL) software. You can use all your favorite applications such as Matlab, Maple, SAS, Solidworks, and many others. Linux, Solaris and numerous Windows environments are now available to all NC State students and faculty. https://cwiki.apache.org/VCL/

Virtual Computing Laboratory (VCL) - 3 Connecting to a Windows VCL environment requires a Remote Desktop Connection client. Connecting to a Unix VCL environment such as Linux or Solaris requires an ssh client plus an X Window System server.
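The client requirements on this slide can be captured as a small lookup. This is only an illustrative sketch: the table and the function name `required_clients` are hypothetical and are not part of the VCL software.

```python
# Hypothetical lookup: which client software a user needs, by the
# operating system family of the VCL environment being reserved.
REQUIRED_CLIENTS = {
    "windows": ["Remote Desktop Connection client"],
    "linux": ["ssh client", "X Window System server"],
    "solaris": ["ssh client", "X Window System server"],
}


def required_clients(os_family: str) -> list:
    """Return the client software needed to reach an environment."""
    key = os_family.lower()
    if key not in REQUIRED_CLIENTS:
        raise ValueError(f"unknown OS family: {os_family!r}")
    return REQUIRED_CLIENTS[key]


print(required_clients("Linux"))
```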

Virtual Computing Laboratory (VCL) - 4 One of the primary goals of VCL is to deliver a dedicated compute environment to a user for a limited time. The conceptual overview shows that remote users connect to the VCL Scheduling Application (the web VCL portal) and request access to a desired application environment.
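The idea of a dedicated environment for a limited time can be sketched as a toy scheduler. This is not the VCL scheduling application: the class names, the first-free-node policy and the blade names are all assumptions made for illustration.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class Reservation:
    user: str
    image: str        # requested application environment
    node: str         # machine dedicated to this user for the window
    start: datetime
    end: datetime


class Scheduler:
    """Toy stand-in for a reservation scheduler (hypothetical API)."""

    def __init__(self, nodes):
        self.nodes = list(nodes)
        self.reservations = []

    def _is_free(self, node, start, end):
        # Free if no reservation on this node overlaps [start, end).
        return all(r.node != node or r.end <= start or r.start >= end
                   for r in self.reservations)

    def reserve(self, user, image, start, hours):
        end = start + timedelta(hours=hours)
        for node in self.nodes:
            if self._is_free(node, start, end):
                res = Reservation(user, image, node, start, end)
                self.reservations.append(res)
                return res
        return None  # nothing available in the requested window


sched = Scheduler(["blade-1", "blade-2"])
nine = datetime(2008, 6, 23, 9, 0)
a = sched.reserve("alice", "matlab", nine, 2)   # 09:00-11:00
b = sched.reserve("bob", "sas", nine, 1)        # 09:00-10:00
c = sched.reserve("carol", "maple", nine, 1)    # both blades busy
d = sched.reserve("carol", "maple", nine + timedelta(hours=1), 1)
print(a.node, b.node, c, d.node)
```

Each user holds a whole node for the reserved window, and the node becomes available to others once the window ends.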

Virtual Computing Laboratory (VCL) – 5 The VCL Infrastructure. Web Server (the VCL Portal): Linux, Apache, PHP. The PHP-based web application is the heart of VCL and provides tools to request, manage and govern all VCL resources. All transactions occur with the database. Database Server: Linux, MySQL. Holds all data related to VCL reservations, access controls, machine and environment inventory, log history, etc.

Virtual Computing Laboratory (VCL) – 6 Management node(s) (the processing engine): Linux, VCLD (Perl), image library. Any given set of blades or virtual machines is managed by exactly one management node. Typically, 80 to 120 physical computer nodes (blades) run under one management node. A physical computer node can run either a bare-metal environment or a virtual machine hypervisor.
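The rule that every node has exactly one management node, with roughly 80 to 120 nodes each, amounts to a partition of the node pool. The sketch below assumes a simple fill-in-order policy and made-up names (`assign_nodes`, `mgmt-1`, `blade-0`); the figure of 1000 blades is the one quoted later in this deck for NC State.

```python
def assign_nodes(node_names, max_per_manager=120):
    """Map each compute node to exactly one management node."""
    if max_per_manager < 1:
        raise ValueError("max_per_manager must be positive")
    return {
        name: f"mgmt-{index // max_per_manager + 1}"
        for index, name in enumerate(node_names)
    }


blades = [f"blade-{i}" for i in range(1000)]

# Upper end of the typical range: 120 blades per management node.
dense = assign_nodes(blades, max_per_manager=120)
# Lower end of the typical range: 80 blades per management node.
sparse = assign_nodes(blades, max_per_manager=80)

print(len(set(dense.values())), len(set(sparse.values())))
```

Under these assumptions a pool of 1000 blades needs between 9 and 13 management nodes.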

Introduction This paper discusses the concept of “cloud” computing, some of the issues it tries to address, related research topics, and a “cloud” implementation available today. The term “cloud” became popular around October 2007, when IBM and Google announced a collaboration in this area; IBM's “Blue Cloud” announcement followed shortly after.

Cloud Computing “Cloud” computing embraces cyberinfrastructure, virtualization, distributed computing, grid computing, utility computing, and networking, web and software services. It implies a service-oriented architecture, reduced information technology overhead for the end-user, greater flexibility, reduced total cost of ownership, on-demand services and many other things.

1 Service-oriented Architecture SOA is not a new concept. Examples of some of the first network-based service-oriented architectures are remote procedure calls (RPC), DCOM and Object Request Brokers (ORBs) based on the CORBA specifications. In an SOA environment, end-users request an (integrated) IT service at the desired functional, quality and capacity level, and receive it either at the time requested or at a specified later time. Examples of such services: software applications, web-based services, personal and business “desktop” computing, high-performance computing.

2 Workflows An IT-assisted workflow represents a series of structured activities and computations that arise in information-assisted problem solving. A workflow can be represented by a directed graph of data flows. In the context of “cloud computing”, the key question is whether the underlying infrastructure supports the workflow-oriented view of the world.
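To make the directed-graph view concrete, here is a minimal sketch using Python's standard `graphlib` module (Python 3.9 or later). The step names are invented for illustration; each step maps to the set of steps whose output it consumes.

```python
from graphlib import TopologicalSorter

# Hypothetical workflow: step -> set of steps it depends on.
workflow = {
    "fetch_data": set(),
    "clean_data": {"fetch_data"},
    "run_model": {"clean_data"},
    "make_plots": {"run_model"},
    "write_report": {"run_model", "make_plots"},
}

# A valid execution order runs every step after the steps it depends on.
order = list(TopologicalSorter(workflow).static_order())
print(order)
```

An infrastructure that supports the workflow-oriented view needs to be able to schedule steps in such an order and move data along the edges.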

3 Virtualization Virtualization allows abstraction and isolation of lower-level functionality and of the underlying hardware.

Users CI: System or cyberinfrastructure developers

User Example (K-20 and continuing education) Support large numbers of users that range from very naive to very sophisticated (millions of student contact hours per year). Support construction and delivery of content and curricula for these users. For that, the system needs to provide support and tools for thousands of instructors, teachers, professors, and others that serve the students. Generate adequate content diversity, quality, and range. This may require many hundreds of authors. Be reliable and cost-effective to operate and maintain. The effort to maintain the system should be relatively small, although introduction of new paradigms and solutions may require a considerable start-up development effort.

Developers Cyberinfrastructure developers are responsible for the development and maintenance of the Cloud framework. They develop and integrate system hardware, storage, networks, interfaces, administration and management software, communications and scheduling algorithms, services authoring tools, workflow generation and resource access algorithms and software. They must be experts in specialized areas such as networks, computational hardware, storage, low-level middleware, operating system imaging, and similar areas. They are also responsible for keeping the complexity of the framework away from higher-level users through judicious abstraction, layering and middleware. Grid computing is a good example.

Authors (1/2) Service authors are developers of individual base-line “images” and services that may be used directly, or may be integrated into more complex service aggregates and workflows by service provisioning and integration experts. An image is a tangible abstraction of the software stack. It comprises any base-line operating system, a hypervisor layer, the desired middleware or application that runs on that OS, and any end-user access solution that is appropriate (e.g., ssh, web, RDP, VNC, etc.).
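One way to picture an image as a software stack is a small record type. The `Image` class below is a hypothetical sketch, not a VCL data structure; its fields simply follow the parts the slide lists.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class Image:
    """Hypothetical description of a cloud image as a software stack."""
    name: str
    operating_system: str
    hypervisor: Optional[str] = None   # None when no hypervisor layer is used
    software: Tuple[str, ...] = ()     # middleware and applications
    access: Tuple[str, ...] = ()       # e.g. "ssh", "web", "RDP", "VNC"

    def layers(self):
        """Return the layers present, in the order the slide lists them."""
        stack = [("operating system", self.operating_system)]
        if self.hypervisor:
            stack.append(("hypervisor", self.hypervisor))
        stack.extend(("software", item) for item in self.software)
        stack.extend(("access", item) for item in self.access)
        return stack


stats_seat = Image("stats-seat", "Linux", software=("SAS",), access=("ssh",))
print(stats_seat.layers())
```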

Authors (2/2) Some of the functionalities a cloud framework must provide for authors are image creation tools, image and service management tools, service brokers, service registration and discovery tools, security tools, provenance collection tools, cloud component aggregation tools, resource mapping tools, license management tools, fault-tolerance and fail-over mechanisms, and so on [44]. It is important to note that the authors will NOT be cloud framework experts; the authoring tools and interfaces must therefore be appliances: easy to learn and easy to use, so that authors can concentrate on image and service development rather than struggle with the intricacies of the cloud infrastructure. [44] Mladen Vouk et al., “‘Powered by VCL’ – Using Virtual Computing Laboratory (VCL) Technology to Power Cloud Computing,” Proceedings of the 2nd International Conference on Virtual Computing (ICVCI), 15–16 May 2008, RTP, NC, pp. 1–10.

Service Composition 1/2 Similarly, services integration and provisioning experts should be able to focus on creating the composite and orchestrated solutions an end-user needs. They sample and combine existing services and images, customize them, update existing services and images, and develop new composites. They may oversee the usage of the services, and may collect and manage service usage information and statistics.

Service Composition 2/2 Their expertise may range from workflow automation through a variety of tools and languages, to domain expertise needed to understand what aggregates of services, if any, the end-user needs.

End-users They require appropriately reliable and timely service delivery, easy-to-use interfaces, collaborative support, information about their services. For example,

An Implementation

NC State Cloud 1/3 Access to NC State Cloud reservations and management is either through a web portal, or through an API. Authentication, resource availability, image and other information are kept in a database. NC State undifferentiated resources are currently about 1000 IBM BladeCenter blades. Its differentiated services are teaching lab computers that are adopted into VCL when they are not in use (e.g., at night).

NC State Cloud 2/3 We serve about 60,000 to 100,000 “seat” reservation requests (mostly of the on-demand or “now” type) per semester, for a student and faculty population of more than 30,000. A typical single-seat user reservation lasts 1-2 hours. We currently have about 150 production images and another 450 or so other images. Most of the images serve single-user seats and HPC cycles, with a smaller number focused on environment and workflow-based services. Examples: loading a number of blades with Hadoop-based images [18] implements a Google-like map/reduce environment; loading an environment or group composed of Globus-based (open source Grid software) images constructs a subcloud for grid-based computing.
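For readers unfamiliar with the map/reduce paradigm mentioned on this slide, the following single-process sketch shows the three phases on a word count. It is plain Python and does not use the Hadoop API; the function names and sample documents are illustrative only.

```python
from collections import defaultdict
from itertools import chain


def map_phase(document):
    """Map: emit a (word, 1) pair for every word in one document."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle(pairs):
    """Shuffle: group all emitted values by key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Reduce: combine the values for each key, here by summing."""
    return {key: sum(values) for key, values in groups.items()}


documents = ["cloud computing", "grid computing", "cloud services"]
mapped = chain.from_iterable(map_phase(doc) for doc in documents)
counts = reduce_phase(shuffle(mapped))
print(counts)
```

In a real deployment the map and reduce calls would run in parallel on separate blades, with the framework handling the shuffle between them.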

NC State Cloud 3/3 A typical NC State bare-metal blade serves about 25 student seats – a 25:1 ratio – considerably better than traditional labs at 5:1 to 10:1. Hypervisors and server-apps can increase utilization by another factor of 2 to 40, depending on the application and user profile.
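The ratios on this slide can be worked through as simple arithmetic. The sketch below assumes the hypervisor factor multiplies the 25 seats per blade directly, which is one reading of the slide rather than a stated result; the function name is hypothetical.

```python
def seats_per_blade(base_seats, utilization_factor=1):
    """Seats served by one blade, scaled by an assumed utilization factor."""
    return base_seats * utilization_factor


bare_metal = seats_per_blade(25)        # 25:1 ratio from the slide
low = seats_per_blade(25, 2)            # factor of 2
high = seats_per_blade(25, 40)          # factor of 40

# Improvement of bare-metal blades over traditional labs (5:1 to 10:1).
gain_vs_best_lab = bare_metal / 10
gain_vs_worst_lab = bare_metal / 5

print(low, high, gain_vs_best_lab, gain_vs_worst_lab)
```

Under that reading, one blade would serve between 50 and 1000 seats with hypervisors or server applications, and bare-metal blades alone are 2.5 to 5 times better than traditional labs.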

Research Issues The economy of scale and the economics of image and service construction depend to a large extent on the ease of construction and the mobility of these images, not only within a cloud but also among different clouds. Cloud provenance data, and metadata management in general, is an open issue. How can provenance information be collected in a standardized and seamless way, with minimal overhead? How can it be stored permanently so that one can come back to it at any time? How can it be presented to the user in a logical manner?

Provenance. Cloud Process provenance: dynamics of control flows and their progression, execution information, code performance tracking, etc. Cloud Data provenance: dynamics of data and data flows, file locations, application input/output information, etc. Cloud Workflow provenance: structure, form, evolution, ..., of the workflow itself. System (or Environment) provenance: system information, O/S, compiler versions, loaded libraries, environment variables, etc.
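As an illustration of the system (or environment) provenance category, the sketch below gathers a few of the listed items for a running Python process using only the standard library. The record layout and the function name `collect_system_provenance` are assumptions, not a standard provenance format.

```python
import json
import os
import platform
import sys
from datetime import datetime, timezone


def collect_system_provenance(env_keys=("PATH", "HOME")):
    """Collect a small, JSON-serializable system provenance record."""
    return {
        "collected_at": datetime.now(timezone.utc).isoformat(),
        "os": platform.system(),
        "os_release": platform.release(),
        "machine": platform.machine(),
        "python_version": platform.python_version(),
        "python_compiler": platform.python_compiler(),
        # Top-level modules loaded so far, standing in for "loaded libraries".
        "loaded_modules": sorted(
            name for name in list(sys.modules) if "." not in name
        ),
        # Only selected variables; a missing variable is recorded as None.
        "environment": {key: os.environ.get(key) for key in env_keys},
    }


record = collect_system_provenance()
print(json.dumps(record, indent=2)[:400])
```

Restricting the environment variables to a chosen list keeps the record small and avoids storing secrets by accident, which touches on the overhead and storage questions raised earlier.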

Research Issues Image portability, and by implication the image format, is also an issue. Security: for end-users to feel comfortable with a “cloud” solution that holds their software, data and processes, there must be considerable assurance of its security. This raises the issues of end-to-end service isolation through VPN and SSH tunnels and VLANs, and of the guarantees that the data and the images keep their integrity in the “cloud”.
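One common building block for the integrity guarantee mentioned here is a cryptographic digest recorded when an image is stored and re-checked when it is loaded. The sketch below uses SHA-256 from Python's standard library; it is a generic technique offered as an example, not something the slides say VCL does, and the function names are hypothetical.

```python
import hashlib
import hmac


def sha256_digest(data: bytes) -> str:
    """Return the hex SHA-256 digest of the given bytes."""
    return hashlib.sha256(data).hexdigest()


def is_intact(data: bytes, expected_digest: str) -> bool:
    """Check data against a previously recorded digest."""
    # compare_digest avoids leaking information through timing.
    return hmac.compare_digest(sha256_digest(data), expected_digest)


image_bytes = b"example image contents"
recorded = sha256_digest(image_bytes)          # stored at creation time

print(is_intact(image_bytes, recorded))                 # True
print(is_intact(image_bytes + b" tampered", recorded))  # False
```

A digest only detects modification; it says nothing about who made it, so the recorded value itself must be kept somewhere an attacker cannot change.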

utilization

Conclusions This paper discusses the concept of “cloud” computing, the issues it tries to address, related research topics, and a “cloud” implementation based on VCL technology.

Comments The construction of an image should resemble a systems analysis (SA) process. Different roles, and interfaces between the “layers”, are needed. The authors do not discuss security in the cloud.