High-Availability Linux: Reliability, Availability, Serviceability



What are HA clusters?
• High-availability clusters are groups of computers that support server applications that can be reliably used with a minimum of downtime.
• They use redundant computers that continue to provide service when a system component fails.

• Failover: the process by which an HA cluster detects a hardware or software fault and restarts the application on another system without administrative intervention.
• HA designs emphasize a layered approach to redundancy.
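The failover idea above can be sketched in a few lines of Python. This is only an illustration, not how Linux-HA implements it: the `Node` class, the `healthy` flag (standing in for real fault detection), and the `failover` function are all hypothetical names introduced here.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Node:
    name: str
    healthy: bool = True  # assumption: stands in for real hardware/software fault detection
    running: List[str] = field(default_factory=list)


def failover(nodes: List[Node], app: str) -> Optional[Node]:
    """Make sure `app` runs on a healthy node and return that node."""
    current = next((n for n in nodes if app in n.running), None)
    if current is not None and current.healthy:
        return current  # the application is fine where it is
    if current is not None:
        current.running.remove(app)  # fault detected on the hosting node
    for candidate in nodes:
        if candidate.healthy:
            candidate.running.append(app)  # restart on a surviving node
            return candidate
    return None  # no healthy node is left to take over


nodes = [Node("node-a", running=["web"]), Node("node-b")]
nodes[0].healthy = False  # simulate a fault on the active node
target = failover(nodes, "web")
print(target.name if target else "service unavailable")  # prints "node-b"
```

No administrator is involved: the move to node-b is triggered only by the detected fault.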

• The primary software of Linux-HA is called Heartbeat.
• There is no fixed limit on the number of nodes, so it can be used with clusters of any size.
• Resources are monitored in parallel, and a resource can be shifted from one node to another if its original node fails.
• Failed nodes are automatically removed from the cluster.
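One common way to decide that a node has failed is to expect a periodic heartbeat message from it and to drop it from the membership list once it has been silent for too long. The sketch below assumes that approach; the `Membership` class, its `deadtime` parameter, and the explicit `now` timestamps are illustrative choices, not the actual Heartbeat code.

```python
from typing import Dict, List


class Membership:
    """Tracks the last heartbeat time seen from each node (illustrative only)."""

    def __init__(self, deadtime: float) -> None:
        self.deadtime = deadtime  # seconds of silence before a node is declared failed
        self.last_seen: Dict[str, float] = {}

    def heartbeat(self, node: str, now: float) -> None:
        """Record that `node` was heard from at time `now`."""
        self.last_seen[node] = now

    def remove_failed(self, now: float) -> List[str]:
        """Drop every node silent for longer than `deadtime`; return their names."""
        failed = sorted(n for n, t in self.last_seen.items() if now - t > self.deadtime)
        for n in failed:
            del self.last_seen[n]
        return failed

    def members(self) -> List[str]:
        return sorted(self.last_seen)


cluster = Membership(deadtime=3.0)
for name in ("node-a", "node-b", "node-c"):
    cluster.heartbeat(name, now=0.0)
cluster.heartbeat("node-a", now=4.0)  # node-b stays silent
cluster.heartbeat("node-c", now=4.0)
removed = cluster.remove_failed(now=5.0)
print(removed, cluster.members())  # ['node-b'] ['node-a', 'node-c']
```

Passing timestamps in explicitly keeps the example deterministic; a real daemon would read a clock and receive heartbeats over the network.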

• Integrates with many popular software packages, including Apache, DB2, Oracle, and PostgreSQL.
• Includes a GUI for easier control and monitoring of clusters and their resources.

• Heartbeat was originally capable of handling only two nodes at a time.
• It did not include resource monitoring.
• It later switched to a layered design that supports n-node clusters.

• The project was split into several separate packages.
• Pacemaker: the cluster resource manager component, which handles resource management and node failure.
• Heartbeat: now refers only to the messaging layer used for communication within the cluster, among its individual nodes.

• Resource Agents: a standardized interface to a cluster resource, used to translate cluster operations into resource-specific actions and to report the success or failure of each.
• Cluster Glue: a set of libraries, tools, and utilities for use with Heartbeat and Pacemaker; it includes everything not covered by Heartbeat (messaging), Pacemaker (resource management), or Resource Agents (cluster operations).
• Local Resource Manager: similar to Pacemaker, but it exists solely for one node in the cluster and is thus not "aware" of the status of the rest of the cluster.
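To make the "standardized interface" of a resource agent more concrete, here is a minimal Python sketch. Real resource agents are typically executable scripts invoked with an action name; the `ResourceAgent` and `DummyService` classes and the `run_action` helper below are hypothetical and only mirror that shape. The numeric return codes follow the OCF convention (0 for success, 3 for an unimplemented action, 7 for "not running").

```python
from abc import ABC, abstractmethod

# Return codes following the OCF resource agent convention.
OCF_SUCCESS = 0
OCF_ERR_UNIMPLEMENTED = 3
OCF_NOT_RUNNING = 7


class ResourceAgent(ABC):
    """Uniform operations a cluster manager can call on any resource."""

    @abstractmethod
    def start(self) -> int: ...

    @abstractmethod
    def stop(self) -> int: ...

    @abstractmethod
    def monitor(self) -> int: ...


class DummyService(ResourceAgent):
    """Hypothetical resource that only remembers whether it was started."""

    def __init__(self) -> None:
        self.running = False

    def start(self) -> int:
        self.running = True
        return OCF_SUCCESS

    def stop(self) -> int:
        self.running = False
        return OCF_SUCCESS

    def monitor(self) -> int:
        return OCF_SUCCESS if self.running else OCF_NOT_RUNNING


def run_action(agent: ResourceAgent, action: str) -> int:
    """Dispatch an action name to the agent and return its status code."""
    handlers = {"start": agent.start, "stop": agent.stop, "monitor": agent.monitor}
    handler = handlers.get(action)
    return handler() if handler else OCF_ERR_UNIMPLEMENTED


svc = DummyService()
codes = [run_action(svc, a) for a in ("monitor", "start", "monitor", "stop", "reload")]
print(codes)  # [7, 0, 0, 0, 3]
```

Because every resource answers the same small set of actions with the same codes, the resource manager does not need to know anything specific about the service behind the agent.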