Stand Tall, and Carry a Precision Micrometer: Observations on Creating a Measurement Model for Virtual Machines David Boyes Sine Nomine Associates.

Slides:



Advertisements
Similar presentations
© 2007 Open Grid Forum Grids in the IT Data Center OGF 21 - Seattle Nick Werstiuk October 16, 2007.
Advertisements

Key Metrics for Effective Storage Performance and Capacity Reporting.
Proposal by CA Technologies, IBM, SAP, Vnomic
SDN + Storage.
Segmentation and Paging Considerations
PlanetLab Operating System support* *a work in progress.
Department of Industrial Management Engineering 1.Introduction ○Usability evaluation primarily summative ○Informal intuitive evaluations by designers even.
Silberschatz, Galvin and Gagne  2002 Modified for CSCI 399, Royden, Operating System Concepts Operating Systems Lecture 19 Scheduling IV.
Ethernet and switches selected topics 1. Agenda Scaling ethernet infrastructure VLANs 2.
Place your chosen image here. The four corners must just cover the arrow tips. For covers, the three pictures should be the same size and in a straight.
By Sean Danko.  What is Virtualization  How does Virtualization Work  History of Virtualization  Why Should I Virtualize  Infrastructure  Advantages.
Copyright © 1998 Wanda Kunkle Computer Organization 1 Chapter 2.1 Introduction.
PSY 1950 Confidence and Power December, Requisite Quote “The picturing of data allows us to be sensitive not only to the multiple hypotheses that.
Monitoring and Pollutant Load Estimation. Load = the mass or weight of pollutant that passes a cross-section of the river in a specific amount of time.
Sources of Data Levin and Fox Ch. 1: The Experiment The Survey Content Analysis Participant Observation Secondary Analysis 1.
CPE 731 Advanced Computer Architecture Snooping Cache Multiprocessors Dr. Gheith Abandah Adapted from the slides of Prof. David Patterson, University of.
Local and Regional Economic Analysis Tools
Unit-1 Introduction | VTU NOTES | QUESTION PAPERS | NEWS | VTU RESULTS | FORUM | BOOKSPAR ANDROID APP.
Resource Management in Virtualization-based Data Centers Bhuvan Urgaonkar Computer Systems Laboratory Pennsylvania State University Bhuvan Urgaonkar Computer.
Measuring zSeries System Performance Dr. Chu J. Jong School of Information Technology Illinois State University 06/11/2012 Sponsored in part by Deer &
The Microcomputer System In this presentation you will: Investigate the microprocessor Title Page.
Virtual LAN Design Switches also have enabled the creation of Virtual LANs (VLANs). VLANs provide greater opportunities to manage the flow of traffic on.
1 Validation & Verification Chapter VALIDATION & VERIFICATION Very Difficult Very Important Conceptually distinct, but performed simultaneously.
A Cloud is a type of parallel and distributed system consisting of a collection of inter- connected and virtualized computers that are dynamically provisioned.
Virtualization. Virtualization  In computing, virtualization is a broad term that refers to the abstraction of computer resources  It is "a technique.
Infrastructure for Better Quality Internet Access & Web Publishing without Increasing Bandwidth Prof. Chi Chi Hung School of Computing, National University.
Cloud Computing Energy efficient cloud computing Keke Chen.
So What? Operations Management EMBA Summer TARGET You are, aspire to be, or need to communicate with an executive that does not have direct responsibility.
Presenter : Ching-Hua Huang 2013/9/16 Visibility Enhancement for Silicon Debug Cited count : 62 Yu-Chin Hsu; Furshing Tsai; Wells Jong; Ying-Tsai Chang.
Methodology for producing the revised back series of population estimates for Julie Jefferies Population and Demography Division Office for.
Methodology Matters: Doing Research in the Behavioral and Social Sciences ICS 205 Ha Nguyen Chad Ata.
Server Virtualization
Securing the Network Infrastructure. Firewalls Typically used to filter packets Designed to prevent malicious packets from entering the network or its.
A Few Things You Don’t Know You Have To Worry About Yet David Boyes CMG Philadelphia, Nov 2008.
Chapter 11 Statistical Techniques. Data Warehouse and Data Mining Chapter 11 2 Chapter Objectives  Understand when linear regression is an appropriate.
11 CLUSTERING AND AVAILABILITY Chapter 11. Chapter 11: CLUSTERING AND AVAILABILITY2 OVERVIEW  Describe the clustering capabilities of Microsoft Windows.
Modeling Virtualized Environments in Simalytic ® Models by Computing Missing Service Demand Parameters CMG2009 Paper 9103, December 11, 2009 Dr. Tim R.
7 Strategies for Extracting, Transforming, and Loading.
1EMC CONFIDENTIAL—INTERNAL USE ONLY FAST VP and Exchange Server 2010 Don Turner Consultant Systems Integration Engineer Microsoft TPM.
Capacity Planning - Managing the hardware resources for your servers.
Emir Halepovic, Jeffrey Pang, Oliver Spatscheck AT&T Labs - Research
3/12/2013Computer Engg, IIT(BHU)1 CLOUD COMPUTING-1.
Development of a QoE Model Himadeepa Karlapudi 03/07/03.
10-1 MGMG 522 : Session #10 Simultaneous Equations (Ch. 14 & the Appendix 14.6)
Accuracy vs. Precision. 2 minutes: Write what you think accuracy and precision mean 1 minute: discuss at your table what you think.
Cloud Computing – UNIT - II. VIRTUALIZATION Virtualization Hiding the reality The mantra of smart computing is to intelligently hide the reality Binary->
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
Chapter 7 Memory Management Eighth Edition William Stallings Operating Systems: Internals and Design Principles.
Capacity Planning in a Virtual Environment Chris Chesley, Sr. Systems Engineer
The accuracy of averages We learned how to make inference from the sample to the population: Counting the percentages. Here we begin to learn how to make.
Unit 2 VIRTUALISATION. Unit 2 - Syllabus Basics of Virtualization Types of Virtualization Implementation Levels of Virtualization Virtualization Structures.
Comparison of VPS Hosting and Cloud Hosting Features.
Partner Billing and Reporting
OPERATING SYSTEMS CS 3502 Fall 2017
Software Architecture in Practice
Architecture & System Performance
Architecture & System Performance
Systems Design: Activity Based Costing
Virtualization overview
Measuring Service in Multi-Class Networks
Oracle Solaris Zones Study Purpose Only
Measuring Social Life: How Many? How Much? What Type?
Overview Introduction VPS Understanding VPS Architecture
Chapter 7 Capacity Planning and Management
Spatial Data Analysis: Intro to Spatial Statistical Concepts
Cloud Computing Architecture
Spatial Data Analysis: Intro to Spatial Statistical Concepts
Replicated Binary Designs
Systems Design: Activity Based Costing
Poverty Maps for Sri Lanka
Presentation transcript:

Stand Tall, and Carry a Precision Micrometer: Observations on Creating a Measurement Model for Virtual Machines David Boyes Sine Nomine Associates

Introduction Observations on the problem – Accuracy – Fairness (particularly on chargeback information) Variations in Standard Techniques A Few Old Things are New Again Model Approximation Types Some thoughts on capacity planning and projection Bringing it all back together Q&A

Measurement in Virtual Machines Measurement takes place in multiple locations: – Within each virtual machine – At the supporting virtualization system level Numbers resulting from traditional measurement techniques are false and misleading Accurate measurement demands: – Correlation between virtual machine view and supporting system view – Correction of counts to accommodate multiple workloads

Factors in Virtualization Models Traditional Resource Utilization Factors – CPU – I/O – Storage (RAM and Disk) – Network Traffic Correction Factors – Total/Virtual CPU Ratio – I/O allocation to specific virtual machine – Allocation of storage resources and time element for occupancy – VLAN and traffic sampling

“ Classic ” Summary of Samples Utilization Chargeback assigns unit cost to each element – Simple arithmetic, right? – NO!!

Observed Problems False Data from Instrumentation Relative difficulty in building correlation between 1 st and 2 nd level observation Missing identification of application or virtual machine specific data in accounting and performance data streams Re-socialization of “ shared resources ” Inability of performance tooling to account for external costs

Virtual and Total Resource Mesurement are No Longer the Same In virtual machines, we have to capture the cost of instruction simulation and the operation of the virtualization environment – True cost measured by difference in CPU measured inside virtual machine vs CPU measured in hypervisor or “ host ” – Requires correlation of host measurement against “ inside ” measurement Clocks don ’ t always match! Also true for all the other factors! How can we get data for one machine separated from the entire mass?

Implications for Chargeback and Management What appears to be a “ fair share ” does not actually reflect real utilization – Most critical observation reflected in relatively non- scalable function (I/O, network) – Users want to pay only for what they use – Direct impact on capacity planning What ’ s a lad to do?

Borrowing From the Phone Company This isn ’ t a new problem either in the performance world or the billing world – the phone companies have had it for ages in reconciliation of cross-network charges. Can we borrow some ideas here? – Rating vs simple measurement – Peak-leveling models – Fuzzy correlation

Rating Vs Simple Sampling By using a correction factor based on correlation period rather than simple sums, we can modify the measurement according to business rules – Relaxes the requirement for precision timestamping and clock correlation – Allows workload costing feedback for management tooling in shared environment – User favorite: easy revaluation of data in case of dispute

Example Correction of CPU resource utilization effected by T/V ratio: Assumption still rests on ability of host instrumentation to report statistics by virtual machine Similar technique for other variables – Note sum for individual measures should be close to total amount per interval per processor (MP systems > 100%)

Example Note that current non-zSeries systems are weak on separation of data for individual partitions – work ongoing in DTMF, CIM/SNMP and WS-I workgroups to address additional granularity for virtual systems – Competing prototypes in pSeries LPAR and Sun Domain Mgr

Projection and Confidence Levels Goal: +/-.5% nominal Realistic expectation at this stage: 5-7% Projection at this point still weak on data.

Projection and Confidence Levels Tendency is toward under-correction (ie, overestimation of consumption) – Good if you ’ re a service provider! – If linked to auto-provisioning (eWLM, Superdome, etc) will trigger early provisioning of additional resources Model may be fine-tuned by adjusting rating interval: – Optimax for most transaction-oriented servers in sec intervals – Optimax for compute-intensive servers in sec intervals

Data Correlation Use of rating engine stream allows correlation requirement to be less stringent – Still some requirement for “ near ” timing, but buckets are large enough that most virtual machine monitors cannot span an interval.

Summary Virtual machine modeling presents a combination of old and new problems Additional sophistication in instrumentation will be sufficient for a truly representative model A reasonably accurate simulation can be provided by adjusting measurement based on rated intervals instead of simple accumulation

Q&A

Contact Info David Boyes Sine Nomine Associates