Working With Cloud - 3.

Slides:



Advertisements
Similar presentations
Ed Duguid with subject: MACE Cloud
Advertisements

SERVICE LEVEL AGREEMENTS The Technical Contract Within the Master Agreement.
Business Continuity Section 3(chapter 8) BC:ISMDR:BEIT:VIII:chap8:Madhu N PIIT1.
© 2009 EMC Corporation. All rights reserved. Introduction to Business Continuity Module 3.1.
CLOUD COMPUTING AN OVERVIEW & QUALITY OF SERVICE Hamzeh Khazaei University of Manitoba Department of Computer Science Jan 28, 2010.
SLA Basics Describes a set of non functional requirements of the service. Example : RTO time – Return to Operation Time if case of failure SLO – Service.
WORKSHOP SLA SPECIFICATION BY ETIENNE WERY Lawyer at the Paris & Brussels Bar, Partner ULYS IT OUTSOURCING SUMMIT 27/11/2003.
Supervisor : Mr. Hadi Salimi Advanced Topics in Information Systems Mazandaran University of Science and Technology February 4, 2011 Survey on Cloud Computing.
A T AXONOMY AND S URVEY OF C LOUD C OMPUTING S YSTEMS Reporter: Steven Chen Date: 2010/10/27 1.
INTRODUCTION TO CLOUD COMPUTING CS 595 LECTURE 4.
SOFTWARE AS A SERVICE PLATFORM AS A SERVICE INFRASTRUCTURE AS A SERVICE.
Plan Introduction What is Cloud Computing?
CLOUD COMPUTING For Beginners.
Introduction to Cloud Computing
MIGRATING INTO A CLOUD P. Sai Kiran. 2 Cloud Computing Definition “It is a techno-business disruptive model of using distributed large-scale data centers.
CHAPTER FIVE Enterprise Architectures. Enterprise Architecture (Introduction) An enterprise-wide plan for managing and implementing corporate data assets.
Module 13: Network Load Balancing Fundamentals. Server Availability and Scalability Overview Windows Network Load Balancing Configuring Windows Network.
Introduction to Cloud Computing
M.A.Doman Short video intro Model for enabling the delivery of computing as a SERVICE.
Plan  Introduction  What is Cloud Computing?  Why is it called ‘’Cloud Computing’’?  Characteristics of Cloud Computing  Advantages of Cloud Computing.
What is the cloud ? IT as a service Cloud allows access to services without user technical knowledge or control of supporting infrastructure Best described.
Practical IT Research that Drives Measurable Results 1Info-Tech Research Group Get Moving with Server Virtualization.
Task Performance Group Provides Cutting-Edge E-Commerce B2B EDI Integration Using MegaXML SaaS Solution on Microsoft Azure Cloud Platform MICROSOFT AZURE.
Enabling Grids for E-sciencE Agreement-based Workload and Resource Management Tiziana Ferrari, Elisabetta Ronchieri Mar 30-31, 2006.
Microsoft Azure Speed >> Economics Scale. Microsoft Azure Speed >> Economics Scale.
Stamatia Bibi1, Dimitris Katsaros2, Panayiotis Bozanis2
Chapter 1 Characterization of Distributed Systems
Delivering on the Promise of a Virtualized Dynamic Data Center
Univa Grid Engine Makes Work Management Automatic and Efficient, Accelerates Deployment of Cloud Services with Power of Microsoft Azure MICROSOFT AZURE.
Understanding The Cloud
Roles and Boundaries – 1/2
Workload Distribution Architecture
Scalable Web Apps Target this solution to brand leaders responsible for customer engagement and roll-out of global marketing campaigns. Implement scenarios.
VENDOR ON-BOARDING PROCESS
IOT Critical Impact on DC Design
New Heights by Guiding Them into the Cloud
Cloud Adoption Framework
Logo here Module 3 Microsoft Azure Web App. Logo here Module Overview Introduction to App Service Overview of Web Apps Hosting Web Applications in Azure.
ITIL:Service Design IT Services Management Martin Sarnovský
<Name of Product>Pilot Closeout Meeting <Customer Name>
How to prepare for the End of License of Windows Server 2012/R2
Scalable Web Apps Target this solution to brand leaders responsible for customer engagement and roll-out of global marketing campaigns. Implement scenarios.
Hyper-V Cloud Proof of Concept Kickoff Meeting <Customer Name>
Chapter 21: Cloud Computing and Related Security Issues
Service level Agreements
Chapter 22: Cloud Computing Technology and Security
Envisioning: Service Provider Organization
05 | Monitor and manage database implementations on Azure
Capitalising on the Cloud Applying new practices to deliver business
Intelledox Infiniti Helps Organizations Digitally Transform Paper and Manual Business Processes into Intuitive, Guided User Experiences on Azure MICROSOFT.
Kickoff Presentation Date of Presentation Presenter Name
Logsign All-In-One Security Information and Event Management (SIEM) Solution Built on Azure Improves Security & Business Continuity MICROSOFT AZURE APP.
Why? (or … am I really in the right track?)
Dell Data Protection | Rapid Recovery: Simple, Quick, Configurable, and Affordable Cloud-Based Backup, Retention, and Archiving Powered by Microsoft Azure.
Specialized Cloud Mechanisms
3 Cloud Computing.
Brandon Hixon Jonathan Moore
Cloud Computing Architecture
Working With Cloud - 2.
Specialized Cloud Architectures
Cloud Computing Architecture
NIST Cloud Computing Reference Architecture
Zendos Tecnologia Utilizes the Powerful, Scalable
Implementation of a small-scale desktop grid computing infrastructure in a commercial domain    
Capitalize on Your Business’s Technology
Harrison Howell CSCE 824 Dr. Farkas
Presentation transcript:

Working With Cloud - 3

Service Quality Metrics and SLA Models

Introduction Service-level agreements (SLAs) are a focal point of negotiations, contract terms, legal obligations, and runtime metrics and measurements. SLAs formalize the guarantees put forth by cloud providers, and correspondingly influence or determine the pricing models and payment terms. SLAs set cloud consumer expectations and are integral to how organizations build business automation around the utilization of cloud-based IT resources. The guarantees made by a cloud provider to a cloud consumer are often carried forward, in that the same guarantees are made by the cloud consumer organization to its clients, business partners, or whomever will be relying on the services and solutions hosted by the cloud provider.

Service Quality Metrics SLAs use service quality metrics to express measurable QoS characteristics, for example: Availability – up-time, outages, service duration. Reliability – minimum time between failures, guaranteed rate of successful responses. Performance – capacity, response time, and delivery time guarantees. Scalability – capacity fluctuation and responsiveness guarantees. Resiliency – mean-time to switchover and recovery.

Service Quality Metrics (2) Each service quality metric is ideally defined using the following characteristics: Quantifiable – The unit of measure is clearly set, absolute, and appropriate so that the metric can be based on quantitative measurements. Repeatable – The methods of measuring the metric need to yield identical results when repeated under identical conditions. Comparable – The units of measure used by a metric need to be standardized and comparable. For example, a service quality metric cannot measure smaller quantities of data in bits and larger quantities in bytes. Easily Obtainable – The metric needs to be based on a non- proprietary, common form of measurement that can be easily obtained and understood by cloud consumers.

Service Availability Metrics – Availability Rate Metric The overall availability of an IT resource is usually expressed as a percentage of up-time. For example, an IT resource that is always available will have an up-time of 100%. Description – percentage of service up-time Measurement – total up-time / total time Frequency – weekly, monthly, yearly Cloud Delivery Model – IaaS, PaaS, SaaS Example – minimum 99.5% up-time

Service Availability Metrics – Outage Duration Metric This service quality metric is used to define both maximum and average continuous outage service-level targets. Description – duration of a single outage Measurement – date/time of outage end – date/time of outage start Frequency – per event Cloud Delivery Model – IaaS, PaaS, SaaS Example – 1 hour maximum, 15 minute average”

Service Reliability Metrics - MTBF Reliability focuses on how often the service performs as expected, which requires the service to remain in an operational and available state. Mean-Time Between Failures (MTBF) Metric include: Description – expected time between consecutive service failures Measurement – Σ, normal operational period duration / number of failures Frequency – monthly, yearly Cloud Delivery Model – IaaS, PaaS Example – 90 day average”

Service Reliability Metrics – Reliability Rate Metric Overall reliability is more complicated to measure and is usually defined by a reliability rate that represents the percentage of successful service outcomes. This metric measures the effects of non-fatal errors and failures that occur during up-time periods. For example, an IT resource’s reliability is 100% if it has performed as expected every time it is invoked, but only 80% if it fails to perform every fifth time. Description – percentage of successful service outcomes under pre-defined conditions Measurement – total number of successful responses / total number of requests Frequency – weekly, monthly, yearly Cloud Delivery Model – SaaS Example – minimum 99.5%”

Service Performance Metrics Network capacity metric – Mbps/Gbps Storage device capacity metric – Gbytes/Terabytes/etc. Server capacity metric – CPU, RAM, storage Web app capacity metric – requests per mins (SaaS) Instance starting time metric – time to initialize new instance (IaaS, SaaS) Response time metric – million seconds (SaaS) Completion time metric – time required to complete a sync. task

Service Scalability Metrics Service scalability metrics are related to IT resource elasticity capacity, which is related to the maximum capacity that an IT resource can achieve, as well as measurements of its ability to adapt to workload fluctuations. For example, a server can be scaled up to a maximum of 128 CPU cores and 512 GB of RAM, or scaled out to a maximum of 16 load- balanced replicated instances. Metrics include Storage scalability (horizontal) metric – in respond to increased workload Server scalability (horizontal) metric – in respond to increased workload Server scalability (vertical) metric – in respond to workload fluctuations

Service Resiliency Metrics The ability of an IT resource to recover from operational disturbances is often measured using service resiliency metrics. When resiliency is described within or in relation to SLA resiliency guarantees, it is often based on redundant implementations and resource replication over different physical locations, as well as various disaster recovery systems. Resiliency metrics can be applied in three different phases to address the challenges and events that can threaten the regular level of a service: Design Phase – Metrics that measure how prepared systems and services are to cope with challenges. Operational Phase – Metrics that measure the difference in service levels before, during, and after a downtime event or service outage, which are further qualified by availability, reliability, performance, and scalability metrics. Recovery Phase – Metrics that measure the rate at which an IT resource recovers from downtime, such as the meantime for a system to log an outage and switchover to a new virtual server.

Service Resiliency Metrics (2) Mean-Time to Switchover (MTSO) Metric Description – the time expected to complete a switchover from a severe failure to a replicated instance in a different geographical area Measurement – (date/time of switchover completion – date/time of failure) / total number of failures Frequency – monthly, yearly Cloud Delivery Model – IaaS, PaaS, SaaS Example – 10 minute average Mean-Time System Recovery (MTSR) Metric Description – time expected for a resilient system to perform a complete recovery from a severe failure Measurement – (date/time of recovery – date/time of failure) / total number of failures Example – 120 minute average

SLA Guidelines – best practices and recommendations Mapping business cases to SLAs Working with cloud and on-premise SLAs Understanding the scope of an SLA Understanding the scope of SLA monitoring Documenting guarantees at appropriate granularity Defining penalties for non-compliance

SLA Guidelines – best practices and recommendations Incorporating non-measurable requirements – e.g. security and privacy requirements Disclosure of compliance verification and management Inclusion of specific metric formulas Considering independent SLA monitoring Archiving SLA data Disclosing cross-cloud dependencies