CS526 Dr.Chow1 HIGH AVAILABILITY LINUX VIRTUAL SERVER By P. Jaya Sunderam and Ankur Deshmukh.

Slides:



Advertisements
Similar presentations
PHANI VAMSI KRISHNA.MADDALI. BASIC CONCEPTS.. FILE SYSTEMS: It is a method for storing and organizing computer files and the data they contain to make.
Advertisements

Silberschatz and Galvin  Operating System Concepts Module 16: Distributed-System Structures Network-Operating Systems Distributed-Operating.
Distributed System Structures Network Operating Systems –provide an environment where users can access remote resources through remote login or file transfer.
© 2011 VMware Inc. All rights reserved High Availability Module 7.
Objektorienteret Middleware Presentation 2: Distributed Systems – A brush up, and relations to Middleware, Heterogeneity & Transparency.
Agent Caching in APHIDS CPSC 527 Computer Communication Protocols Project Presentation Presented By: Jake Wires and Abhishek Gupta.
Distributed components
Network Operating Systems Users are aware of multiplicity of machines. Access to resources of various machines is done explicitly by: –Logging into the.
Dr. Zahid Anwar. Simplified Architecture of Linux Cluster Simplified Architecture of a Single Computer Simplified architecture of an enterprise cluster.
OCT1 Principles From Chapter One of “Distributed Systems Concepts and Design”
HA LVS Coda - Devjani Sinha. High Availability-HA n critical commercial applications move on the Internet n An elegant highly available system may have.
Distributed Systems Fall 2009 Replication Fall 20095DV0203 Outline Group communication Fault-tolerant services –Passive and active replication Highly.
Hands-On Microsoft Windows Server 2003 Administration Chapter 5 Administering File Resources.
Lesson 1: Configuring Network Load Balancing
16: Distributed Systems1 DISTRIBUTED SYSTEM STRUCTURES NETWORK OPERATING SYSTEMS The users are aware of the physical structure of the network. Each site.
5.1 © 2004 Pearson Education, Inc. Exam Managing and Maintaining a Microsoft® Windows® Server 2003 Environment Lesson 5: Working with File Systems.
Computer Science Lecture 16, page 1 CS677: Distributed OS Last Class:Consistency Semantics Consistency models –Data-centric consistency models –Client-centric.
PRASHANTHI NARAYAN NETTEM.
11 SERVER CLUSTERING Chapter 6. Chapter 6: SERVER CLUSTERING2 OVERVIEW  List the types of server clusters.  Determine which type of cluster to use for.
Copyright © 2002 Wensong Zhang. Page 1 Free Software Symposium 2002 Linux Virtual Server: Linux Server Clusters for Scalable Network Services Wensong Zhang.
Microsoft Load Balancing and Clustering. Outline Introduction Load balancing Clustering.
H-1 Network Management Network management is the process of controlling a complex data network to maximize its efficiency and productivity The overall.
10/02/2004ELFms meeting1 Linux Virtual Server Miroslav Siket FIO-FS.
Module 13: Network Load Balancing Fundamentals. Server Availability and Scalability Overview Windows Network Load Balancing Configuring Windows Network.
Microsoft Internet Security and Acceleration (ISA) Server 2004 is an advanced packet checking and application-layer firewall, virtual private network.
Oracle10g RAC Service Architecture Overview of Real Application Cluster Ready Services, Nodeapps, and User Defined Services.
INSTALLING MICROSOFT EXCHANGE SERVER 2003 CLUSTERS AND FRONT-END AND BACK ‑ END SERVERS Chapter 4.
TRƯỜNG ĐẠI HỌC CÔNG NGHỆ Bộ môn Mạng và Truyền Thông Máy Tính.
Submitted by: Shailendra Kumar Sharma 06EYTCS049.
© 2005 Mt Xia Technical Consulting Group - All Rights Reserved. HACMP – High Availability Introduction Presentation November, 2005.
IMPROUVEMENT OF COMPUTER NETWORKS SECURITY BY USING FAULT TOLERANT CLUSTERS Prof. S ERB AUREL Ph. D. Prof. PATRICIU VICTOR-VALERIU Ph. D. Military Technical.
Cluster Reliability Project ISIS Vanderbilt University.
© 2008 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Introduction to HP Availability Manager.
High Availability NFS on Linux Winson Wang Hewlett-Packard Company Cupertino, CA Tel:
MCTS Guide to Microsoft Windows Server 2008 Applications Infrastructure Configuration (Exam # ) Chapter Three Configuring Windows Server 2008 Storage.
 Protocols used by network systems are not effective to distributed system  Special requirements are needed here.  They are in cases of: Transparency.
Advanced Computer Networks Topic 2: Characterization of Distributed Systems.
Introduction to dCache Zhenping (Jane) Liu ATLAS Computing Facility, Physics Department Brookhaven National Lab 09/12 – 09/13, 2005 USATLAS Tier-1 & Tier-2.
VERITAS Cluster Server for Solaris Event Notification.
Beowulf Software. Monitoring and Administration Beowulf Watch 
Chapter 10 Chapter 10: Managing the Distributed File System, Disk Quotas, and Software Installation.
Plug-in for Singleton Service in Clustered environment and improving failure detection methodology Advisor:By: Dr. Chung-E-WangSrinivasa c Kodali Department.
Presented By: Samreen Tahir Coda is a network file system and a descendent of the Andrew File System 2. It was designed to be: Highly Highly secure Available.
 Load balancing is the process of distributing a workload evenly throughout a group or cluster of computers to maximize throughput.  This means that.
1 © 2005 Cisco Systems, Inc. All rights reserved. 111 © 2004, Cisco Systems, Inc. All rights reserved. CNIT 221 Security 2 ver.2 Module 8 City College.
Copyright © George Coulouris, Jean Dollimore, Tim Kindberg This material is made available for private study and for direct.
11 CLUSTERING AND AVAILABILITY Chapter 11. Chapter 11: CLUSTERING AND AVAILABILITY2 OVERVIEW  Describe the clustering capabilities of Microsoft Windows.
High Availability in DB2 Nishant Sinha
The CoBFIT Toolkit PODC-2007, Portland, Oregon, USA August 14, 2007 HariGovind Ramasamy IBM Zurich Research Laboratory Mouna Seri and William H. Sanders.
Cloud Computing Computer Science Innovations, LLC.
You there? Yes Network Health Monitoring Heartbeats are sent to monitor health status of network interfaces Are sent over all cluster.
Enterprise Computing with Jini Technology Mark Stang and Stephen Whinston Jan / Feb 2001, IT Pro presented by Alex Kotchnev.
VCS Building Blocks. Topic 1: Cluster Terminology After completing this topic, you will be able to define clustering terminology.
Operating Systems Distributed-System Structures. Topics –Network-Operating Systems –Distributed-Operating Systems –Remote Services –Robustness –Design.
Truly Distributed File Systems Paul Timmins CS 535.
PERFORMANCE MANAGEMENT IMPROVING PERFORMANCE TECHNIQUES Network management system 1.
High Availability Clusters in Linux Sulamita Garcia EDS Unix Specialist
High Availability 24 hours a day, 7 days a week, 365 days a year…
File System Implementation
High Availability Linux (HA Linux)
Maximum Availability Architecture Enterprise Technology Centre.
Cluster Communications
VIRTUAL SERVERS Presented By: Ravi Joshi IV Year (IT)
Introduction to Networks
Replication Middleware for Cloud Based Storage Service
GARRETT SINGLETARY.
Fault Tolerance Distributed Web-based Systems
Server Cluster and LVS based Cluster
Presentation transcript:

CS526 Dr.Chow1 HIGH AVAILABILITY LINUX VIRTUAL SERVER By P. Jaya Sunderam and Ankur Deshmukh

CS526 Dr.Chow2 WHAT IS LINUX VIRTUAL SERVER? Highly scalable server built on cluster of real servers. Users see only a single IP address. Front end of the real servers is the load balancer. Schedules requests to the different servers and make parallel services of the cluster to appear as a virtual service on a single IP address. Scalability is achieved by transparently adding or removing a node in the cluster. Forms the basic building block of HA-LVS.

CS526 Dr.Chow3 CONTD..

CS526 Dr.Chow4 CONCEPT OF HIGH AVAILABILITY What is High Availability? Providing high availability of services for critical commercial applications Provided by detecting node or daemon failures and reconfiguring the system appropriately so that the workload can be taken over by the remaining nodes in the cluster. How it is used in LVS? Principle : One load balancer is the master while the other is slave. If master fails then the slave takes up the master’s pending tasks. Also checks the health status of the real servers.

CS526 Dr.Chow5 LVS to HA-LVS The LVS by itself is not configured to be highly available. Patch with other services to make it highly available. The high availability can be provided by using any of these services in conjunction with each other: 1. mon 2. heartbeat 3. ldirectord 4. Coda fault tolerant file system

CS526 Dr.Chow6 THE OVERALL PICTURE

CS526 Dr.Chow7 FAILURES INTO ACCOUNT BY HA-LVS HA-LVS handles two types of failures Director Failure: When the load balancer fails or crashes Handled by having a redundant director available One director maintains all the session and IP information On failover this information is handed over to the redundant director 2. Real server Failure: When the service to the real server fails or crashes Handled by using a signal running on the director that checks the health of real servers after every constant time interval. If a certain service is down the it is removed from the directors “ipvsadm” table. When the service comes back again the it is again added back to the director’s “ipvsadm” table

CS526 Dr.Chow8 HEARTBEAT/FAKE The "fake" software is used for the backup to takeover the IP addresses of the load balancer when the load balancer fails The "heartbeat" code is used to detect the status of the load balancer to activate/deactivate the "fake" on the backup server. Two heartbeat daemons run on the primary and the backup, they heartbeat the message like "I'm alive" each other through the serial line periodically. When the heart code daemon of the backup cannot hear the "I'm alive" message from the primary in the defined time, it activates the fake to take over the Virtual IP address to provide the load-balancing service. When it receives the "I'm alive" message from the primary later, it deactivate the fake to release the Virtual IP address, and the primary comes back to work again.

CS526 Dr.Chow9 MON General purpose resource monitoring system. Used for monitoring services and triggering alerts in events of failure which are viewed as two different tasks. In case of HA-LVS used to check the real servers health i.e. to check which services are down and which services are running, by sending alerts after certain time intervals. The fping.monitor is configured to detect whether the server nodes is alive every t seconds, and the relative service monitor is also configured to detect the service daemons on all the nodes every m seconds. An alert was written to remove/add a rule in the linux virtual server table while detecting the server node or daemon is down/up. Therefore, the load balancer can automatically mask service daemons or servers failure and put them into service when they are back.

CS526 Dr.Chow10 CODA Coda is a fault-tolerant distributed file system so that the files are easily available and easy to manage. It was developed by the Carnegie-Melon University in It has many features that are very desirable for network file systems. 1. disconnected operation for mobile computing 2. Is freely available under a liberal license 3. high performance through client side persistent caching 4. server replication 5. security model for authentication, encryption and access control 6. continued operation during partial network failures in server network 7. network bandwidth adaptation 8. good scalability 9. well defined semantics of sharing, even in the presence of network failures

CS526 Dr.Chow11 IMPLEMENTATION Heartbeat implementationHeartbeat implementation Overview of /etc/ha.d/ha.cf Key parameters usedKey parameters used Keep aliveKeep alive Dead timeDead time Warn timeWarn time UdpUdp Medium –Ethernet,serial consoleMedium –Ethernet,serial console Node –Name of machine in clusterNode –Name of machine in cluster Watchdog –Reboots system if heart doesn’t beat for a specified amt of timeWatchdog –Reboots system if heart doesn’t beat for a specified amt of time

CS526 Dr.Chow12 Heartbeat

CS526 Dr.Chow13 CONTINUED.. Mon implementation ServerServer ClientClient SERVER SERVER Runs on Load balancerRuns on Load balancer Monitors services for availability, sends alarms upon failuresMonitors services for availability, sends alarms upon failures CLIENT CLIENT Runs on the real serverRuns on the real server Sends commands to mon daemon and shows the results of the corresponding repliesSends commands to mon daemon and shows the results of the corresponding replies

CS526 Dr.Chow14 Mon

CS526 Dr.Chow15 References Masters thesis of C Prakash at UCCS thesis of C Prakash at UCCS