CompSci 296.2 Self-Managing Systems Shivnath Babu.

Slides:



Advertisements
Similar presentations
Welcome to Middleware Joseph Amrithraj
Advertisements

The Lucernex Cloud: A software-as-a-service solution delivered via the Cloud What is the Cloud? Cloud Computing is the future of all software applications,
FUTURE TECHNOLOGIES Lecture 13.  In this lecture we will discuss some of the important technologies of the future  Autonomic Computing  Cloud Computing.
1 Copyright © 2010 AQA and its licensors. All rights reserved. Introduction to the new specification GCSE Computer Science Paul Varey.
Cloud Computing Resource provisioning Keke Chen. Outline  For Web applications statistical Learning and automatic control for datacenters  For data.
Fabián E. Bustamante, Winter 2006 Autonomic Computing The vision of autonomic computing, J. Kephart and D. Chess, IEEE Computer, Jan Also - A.G.
Key Challenges in Information Processing James Hamilton Microsoft SQL Server
Copyright ©2003 Digitask Consultants Inc., All rights reserved Storage Area Networks Digitask Seminar April 2000 Digitask Consultants, Inc.
Proactive Prediction Models for Web Application Resource Provisioning in the Cloud _______________________________ Samuel A. Ajila & Bankole A. Akindele.
Yingping Huang and Gregory Madey University of Notre Dame A W S utonomic eb-based imulation Presented by Tariq M. King Published by the IEEE Computer Society.
CS 501: Software Engineering Fall 2000 Lecture 14 System Architecture I Data Intensive Systems.
Enterprise Business Processes and Applications (IS 6006) Masters in Business Information Systems 10 th Feb 2009 Fergal Carton Business Information Systems.
Principles of Marketing
Library Automation and Digital Libraries Class #5 LBSC 690 Information Technology.
Chapter 13 The First Component: Computer Systems.
Oracle Database Administration. Rana Almurshed 2 course objective After completing this course you should be able to: install, create and administrate.
Introduction and Review : Educational Technology 1
Software engineering on semantic web and cloud computing platform Xiaolong Cui Computer Science.
Towards Autonomic Hosting of Multi-tier Internet Services Swaminathan Sivasubramanian, Guillaume Pierre and Maarten van Steen Vrije Universiteit, Amsterdam,
Copyright Hendela System Consultants, Inc. All Rights Reserved. Hendela System Consultants, Inc. Services and Projects Art Hendela, President Hendela.
Introduction. Readings r Van Steen and Tanenbaum: 5.1 r Coulouris: 10.3.
Imagine sitting at your desk at work and using a Web browser to view live pictures of every room in your house. You see the plumber or the electrician.
1 Autonomic Computing An Introduction Guenter Kickinger.
Dynamic and Decentralized Approaches for Optimal Allocation of Multiple Resources in Virtualized Data Centers Wei Chen, Samuel Hargrove, Heh Miao, Liang.
Everything you need to know.  High Blood Pressure can be when your heart beats too many times in one minute or your heart could beat the correct amount.
CS492: Special Topics on Distributed Algorithms and Systems Fall 2008 Lab 3: Final Term Project.
Texas A&M University Department of Computer Science Sameh S. Sharkawi
Global NetWatch Copyright © 2003 Global NetWatch, Inc. Factors Affecting Web Performance Getting Maximum Performance Out Of Your Web Server.
Presentation Content Our service catalog Remote DBA Service Proactive DBA Service Why use Citagus’ Managed Solutions Benefits Our Value Proposition.
Classification of Computers
Distributed Computing Rik Sarkar. Distributed Computing Old style: Use a computer for computation.
M.A.Doman Short video intro Model for enabling the delivery of computing as a SERVICE.
SIOS – Comprehensive High Availability Options for your VMware Environment.
Information Technology HARDWARE Dr. GUVEN Aerospace Engineer (P.hD) Nuclear Science and Technology Engineer (M.Sc)
ENSI Swiss Federal Nuclear Safety Inspectorate ENSI ENSI Management System IAEA Regulatory Cooperation Forum, 18 September 2015 Dr. Hans Wanner, Director.
The Grid System Design Liu Xiangrui Beijing Institute of Technology.
Introduction Complex and large SW. SW crises Expensive HW. Custom SW. Batch execution Structured programming Product SW.
Introduction. Readings r Coulouris, Dollimore and Kindberg Distributed Systems: Concepts and Design Edn. 3 m Note: All figures from this book.
CompSci Self-Managing Systems Shivnath Babu.
IBM Channel Marketing Speed Sheet – IBM Global Services ENABLEMENT DEPLOYMENT NameDescriptionDateAnnouncement New pricing structure- Support Line for Linux.
© 2010 IBM Corporation Smarter Systems for a Smarter Planet Presenter Name – Presenter Title MM/DD/Year.
© Copyright IBM Corporation 2013 June 2013 IBM Integrated System Test Page 1 IBM Integrated Solutions Test Enterprise Test Series: Ideal Stack Testing.
9 Systems Analysis and Design in a Changing World, Fourth Edition.
Comprehensive Scientific Support Of Large Scale Parallel Computation David Skinner, NERSC.
CompSci Self-Managing Systems Shivnath Babu.
Measuring End-User Availability on the Web: Practical Experience Matthew Merzbacher (visiting research scientist) Dan Patterson (undergraduate) Recovery-Oriented.
1 CEG 2400 Fall 2012 Network Servers. 2 Network Servers Critical Network servers – Contain redundant components Power supplies Fans Memory CPU Hard Drives.
CompSci Self-Managing Systems Shivnath Babu.
Cloud Computing for Business Cloud Computing Services Cloud Computing Services.
Cloud Computing PaaS Platform as a Service Specialist Level Complete Certification Kit - Study Guide Book and Online Course 1 This program is designed.
AUTONOMIC COMPUTING B.Akhila Priya 06211A0504. Present-day IT environments are complex, heterogeneous in terms of software and hardware from multiple.
Introduction to High Availability
An Introduction to Attendance on Demand
Embracing Failure: A Case for Recovery-Oriented Computing
Oracle Database Administration
Large Distributed Systems
Maximum Availability Architecture Enterprise Technology Centre.
Regulating Data Flow in J2EE Application Server
MIGRATING TO NEW TECHNOLOGY
System Management MAS.
Developing Library, Research, and Information Literacy Skills
CS 351d Human-computer interaction Lecture 01 Introduction
Advanced Operating Systems
CompSci Self-Managing Systems
PITCH DECK.
Peer to Peer networks and Performance
Unit# 5: Internet and Worldwide Web
Topic:- Autonomic Computing Presented By :- Neera Gupta
CS 580 Human-computer interaction Lecture 01 Introduction
Building Topic/Trend Detection System based on Slow Intelligence
Presentation transcript:

CompSci Self-Managing Systems Shivnath Babu

2

3 Motivation Systems are becoming hard to manage Increasing size (both software and hardware)

4 Motivation WAN Clients Web server Application servers Database servers

5 Motivation WAN Clients Web server Application servers Database servers WAN

6 Motivation Systems are becoming hard to manage Increasing size (both software and hardware) Increasing heterogeneity (e.g., Grid systems) 24 x 7 operation 5 nines availability (system is down at most 5 minutes and 15 seconds per year)

7 Motivation

8 Downtime Costs (per Hour) Brokerage operations$6,450,000 Credit card authorization$2,600,000 Ebay (1 outage 22 hours)$225,000 Amazon.com$180,000 Package shipping services$150,000 Home shopping channel$113,000 Catalog sales center$90,000 Airline reservation center$89,000 Cellular service activation$41,000 On-line network fees$25,000 ATM service fees$14,000 Sources: InternetWeek 4/3/ Fibre Channel: A Comprehensive Introduction, R. Kembel 2000, p.8. ”...based on a survey done by Contingency Planning Research."

9 Motivation System administration cost is increasing

10 Cost of Storage Administration

11 Motivation System administration cost is increasing –Recently, $1 storage  $9 administration cost [Fujitsu] –Up to 75% of overall database ownership cost is for administration [Aberdeen] –Up to 80% of Information Technology (IT) budgets spent on maintenance [McKinsey]

12 Motivation System administration time & effort is increasing

13 Time Distribution for Database Mgmt.

14Motivation System administration time & effort is increasing – >40% of computer system outages caused by operator/administrator error Causes of system crashes Time ( ) % of System Crashes System management Software failure Hardware failure Other 53% 18% 10%

15 Global Storage Service Site Failures Hardware0% 28% Network 22% Human 41% Unknown 9% SW

16Motivation System administration time & effort is increasing – >40% of computer system outages caused by operator error System is too difficult to understand Decisions need to be made quickly, under pressure Not enough well-trained operators Changes are frequent –E.g., workload, hardware, people, data

17 The Real Problem … The obstacle is complexity … Dealing with it is the single most important challenge facing the IT industry Paul Horn, Director of Research, IBM

18 The Solution Let the system deal with the complexity of management Computer-science-wide push towards Self-Managing Systems IBM calls this new field Autonomic Computing

19 Autonomic Computing (IBM) Computer systems that can regulate themselves much in the same way as our autonomic nervous system regulates and protects our bodies Paul Horn, Director of Research, IBM

20 Autonomic Nervous System

21 Autonomic Nervous System Tells you heart how fast to beat, checks your blood’s sugar and oxygen levels, and controls your pupils so the right amount of light reaches your eyes as you read these words, monitors your temperature and adjusts your blood flow and skin functions to keep it at 98.6ºF Is autonomic -you can make a mad dash for the train without having to calculate how much faster to breathe and pump your heart, or if you’ll need that little dose of adrenaline to make it through the doors before they close

22 Autonomic Computing (IBM)

23 What will we do in this class? Read research papers Listen to guest lectures Goal of the class: Give structure to this field, e.g., Concretely defining problems that arise in this setting Identifying algorithms and techniques useful in this domain Proposing guidelines for designers of future systems and software Semester-long project

24 Outline Part 1: Motivating Factors, Problems, and Applications –From Internet services, database management, computational grids, weather analysis and prediction, oil reservoir optimization, and others Part 2: Algorithms and Techniques –Control theory, machine learning, performance modeling, stochastic optimization, massive data management, data integration, building blocks in systems, and others Part 3: Putting everything together, implications, and future work

25Evaluation Class participation 25% Project 75%

26 Resources Google keywords –Autonomic computing –Self-managing systems IBM autonomic computing web page IBM Journal special issue on autonomic computing Berkeley ROC project

27 In the next class Read an overview paper on self-managing systems Summary of work in this area Sample projects