High Availability NFS on Linux Winson Wang Hewlett-Packard Company Cupertino, CA Tel: 408.447.4537.

Slides:



Advertisements
Similar presentations
Introduction to DBA.
Advertisements

Serverless Network File Systems. Network File Systems Allow sharing among independent file systems in a transparent manner Mounting a remote directory.
Distributed Storage March 12, Distributed Storage What is Distributed Storage?  Simple answer: Storage that can be shared throughout a network.
High Availability Group 08: Võ Đức Vĩnh Nguyễn Quang Vũ
High Availability 24 hours a day, 7 days a week, 365 days a year… Vik Nagjee Product Manager, Core Technologies InterSystems Corporation.
Oracle Data Guard Ensuring Disaster Recovery for Enterprise Data
CS-550: Distributed File Systems [SiS]1 Resource Management in Distributed Systems: Distributed File Systems.
Network-Attached Storage
Objektorienteret Middleware Presentation 2: Distributed Systems – A brush up, and relations to Middleware, Heterogeneity & Transparency.
1 © Copyright 2010 EMC Corporation. All rights reserved. EMC RecoverPoint/Cluster Enabler for Microsoft Failover Cluster.
Silberschatz, Galvin and Gagne ©2009 Operating System Concepts – 8 th Edition File-System Interface.
Chapter 10: File-System Interface
CS526 Dr.Chow1 HIGH AVAILABILITY LINUX VIRTUAL SERVER By P. Jaya Sunderam and Ankur Deshmukh.
Introduction to MC/ServiceGuard H6487S I.02 Module 2.
NFS. The Sun Network File System (NFS) An implementation and a specification of a software system for accessing remote files across LANs. The implementation.
1© Copyright 2011 EMC Corporation. All rights reserved. EMC RECOVERPOINT/ CLUSTER ENABLER FOR MICROSOFT FAILOVER CLUSTER.
Module – 7 network-attached storage (NAS)
CHAPTER Introduction to LANs. MODULE Purpose and Use of a Network.
National Manager Database Services
NETWORK FILE SYSTEM (NFS) By Ameeta.Jakate. NFS NFS was introduced in 1985 as a means of providing transparent access to remote file systems. NFS Architecture.
Network File System CIS 238. NFS (Network File System) The most commercially successful and widely available remote file system protocol Designed and.
Installing Samba Vicki Insixiengmay Jonathan Krieger.
1 COP 4343 Unix System Administration Unit 16: file server – samba.
1 © Talend 2014 Service Locator Talend ESB Training 2014 Jan Bernhardt Zsolt Beothy-Elo
Presented by: Alvaro Llanos E.  Motivation and Overview  Frangipani Architecture overview  Similar DFS  PETAL: Distributed virtual disks ◦ Overview.
SANPoint Foundation Suite HA Robert Soderbery Sr. Director, Product Management VERITAS Software Corporation.
Networked File System CS Introduction to Operating Systems.
1 Network File Sharing. 2 Module - Network File Sharing ♦ Overview This module focuses on configuring Network File System (NFS) for servers and clients.
Oracle10g RAC Service Architecture Overview of Real Application Cluster Ready Services, Nodeapps, and User Defined Services.
INSTALLING MICROSOFT EXCHANGE SERVER 2003 CLUSTERS AND FRONT-END AND BACK ‑ END SERVERS Chapter 4.
IMPROUVEMENT OF COMPUTER NETWORKS SECURITY BY USING FAULT TOLERANT CLUSTERS Prof. S ERB AUREL Ph. D. Prof. PATRICIU VICTOR-VALERIU Ph. D. Military Technical.
New SA Training Topic 8: File System Access  Our organization uses a variety of services for accessing files on remote systems  WWW  FTP  CIFS/SMB.
Samba version What is the Samba? Samba is a suite of programs which work together to allow clients to access to a server's filespace and printers.
© 2008 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Introduction to HP Availability Manager.
Page 1 of John Wong CTO Twin Peaks Software Inc. Mirror File System A Multiple Server File System.
What's New in HP-UX, Linux, and Disaster-Tolerant Clustering Solutions? John Foxcroft HA Clusters Training Consultant Hewlett-Packard Cupertino, California.
TELE 301 Lecture 10: Scheduled … 1 Overview Last Lecture –Post installation This Lecture –Scheduled tasks and log management Next Lecture –DNS –Readings:
Chapter 20 Distributed File Systems Copyright © 2008.
What is a Distributed File System?? Allows transparent access to remote files over a network. Examples: Network File System (NFS) by Sun Microsystems.
Chapter 10: File-System Interface Silberschatz, Galvin and Gagne ©2005 Operating System Concepts – 7 th Edition, Jan 1, 2005 Chapter 10: File-System.
FailSafe SGI’s High Availability Solution Mayank Vasa MTS, Linux FailSafe Gatekeeper
Advanced Computer Networks Topic 2: Characterization of Distributed Systems.
Sys Admin Course NFS and SAMBA Fourie Joubert. Sys Admin Course NFS NFS is the Network File System It allows Linux systems to share a file system, or.
NFS : Network File System SMU CSE8343 Prof. Khalil September 27, 2003 Group 1 Group members: Payal Patel, Malka Samata, Wael Faheem, Hazem Morsy, Poramate.
1 Linux Networking and Security Chapter 5. 2 Configuring File Sharing Services Configure an FTP server for anonymous or regular users Set up NFS file.
11 CLUSTERING AND AVAILABILITY Chapter 11. Chapter 11: CLUSTERING AND AVAILABILITY2 OVERVIEW  Describe the clustering capabilities of Microsoft Windows.
GLOBAL EDGE SOFTWERE LTD1 R EMOTE F ILE S HARING - Ardhanareesh Aradhyamath.
High Availability in DB2 Nishant Sinha
Distributed File Systems Architecture – 11.1 Processes – 11.2 Communication – 11.3 Naming – 11.4.
Distributed File Systems 11.2Process SaiRaj Bharath Yalamanchili.
High Availability Technologies for Tier2 Services June 16 th 2006 Tim Bell CERN IT/FIO/TSI.

Distributed File Systems Group A5 Amit Sharma Dhaval Sanghvi Ali Abbas.
Storage Networking. Storage Trends Storage grows %/year, gets more complicated It’s necessary to pool storage for flexibility Intelligent storage.
CEG 2400 FALL 2012 Linux/UNIX Network Operating Systems.
Distributed Systems: Distributed File Systems Ghada Ahmed, PhD. Assistant Prof., Computer Science Dept. Web:
OSIsoft High Availability PI Replication Colin Breck, PI Server Team Dave Oda, PI SDK Team.
DISTRIBUTED FILE SYSTEM- ENHANCEMENT AND FURTHER DEVELOPMENT BY:- PALLAWI(10BIT0033)
Failover and High Availability
High Availability 24 hours a day, 7 days a week, 365 days a year…
File System Implementation
High Availability Linux (HA Linux)
Filesystem Caching (FS-Cache)
High Availability Planning
Maximum Availability Architecture Enterprise Technology Centre.
Multiple Processor Systems
Samba.
NFS.
Multiple Processor and Distributed Systems
Lecture 4: File-System Interface
Presentation transcript:

High Availability NFS on Linux Winson Wang Hewlett-Packard Company Cupertino, CA Tel:

Agenda Network File Systems on Linux Clustering Concepts Used in MC/ServiceGuard-Linux Toolkits for High Availability Network File Systems Toolkit Basic/Advanced Features Examples and Scenarios

Network File Systems on Linux Existing Network File Systems on Linux –Coda File System –NFS –SMB (CIFS) –AFS (Andrew File System) –NCP (Novell NetWare Core Protocol) Two most widely used Network File Systems –NFS (serve Unix-based users) –CIFS (serve Windows users)

NFS Basics Available on all Unix-Based Servers Versions –NFSv3, NFSv2, NFSv4 –Linux kernel 2.4 supports NFSv3 & NFSv2 Features –Seamless access –Security –Invulnerable to system crash or reboot –High performance

NFS Characteristics Remote procedure calls Retransmissions of messages Idempotent operations – executing operation again does not change the outcome A stateless server NFS file handle to identify files Caching on the client Maintaining Unix file system semantics

NFS File Handle A key data passed between Server and Client Encode NFS file information –A file system identifier, an index number of a mounted local file system –The inode number of the file within the file system –An inode generation number –Other information (listed in the /usr/include/linux/nfsd/nfsfh.h)

Maintaining Unix file system semantics Inode Generation Number Hidden.nfs files on client File and record locking –Need to maintain a state about the locked files –A separate locking daemon (lockd) for all clients

NFS States Utility (daemons) states are in files under /var/lib directory Mount and export file system states are in three files –rmtab –rtab –xtab File lock states are in files (file name is each client IP address) –statd/sm/{client-ip-address}

NFS Utilities and State Files /etc/exports /var/lib/nfs/etab/var/lib/nfs/rmtab/var/lib/nfs/xtab /proc/fs/nfs/exports exportfs rpc.mountd rpc.nfsd : Data flow

SAMBA Basics Provide SMB, CIFS services on Unix- based systems –File & print server –Authentication and Authorization –Name resolution –Service announcement

Clustering Concepts Used in MC/ServiceGuard Introducing MC/ServiceGuard High Availability with MC/ServiceGuard Features of MC/ServiceGuard Benefits of MC/ServiceGuard How MC/ServiceGuard Works MC/ServiceGuard Packages

Introducing MC/ServiceGuard root PKG-A mirror root mirror PKG-B PKG-A mirror PKG-B mirror Server AServer B hub

High Availability with MC/ServiceGuard root PKG-A mirror root mirror PKG-B PKG-A mirror PKG-B mirror Server AServer B hub

Features of MC/ServiceGuard Highly Available Cluster (applications recover to alternate node in < 60 seconds) LAN failure protection (fast local switch to standby LAN adapter using same IP address) Application Packages allow all resources for a package to be defined in one place Automatic cluster reconfiguration after a node failure Intelligent cluster reconfiguration after a node failure No idle resources Facilitates online hardware and software updates

Benefits of MC/ServiceGuard Applications remain available to users, even after a hardware or software failure LAN card failures do not cause an application outage Applications can be moved easily and transparently without client reconfiguration No manual user intervention is needed to recover from a node failure Data integrity is preserved during a node failure Every node runs a production application Applications available during hardware and software upgrades Flexible load balancing during failover

How MC/ServiceGuard Works Bridge PKG-A Disk PKG-B Disk PKG-C Disk PKG-D Disk PKG-E Disk PKG-F Disk PKG-A Mirror PKG-B Mirror PKG-C Mirror PKG-D Mirror PKG-E Mirror PKG-F Mirror Bridge Root1Root2Root3 Mirror LAN 0 LAN 1 Node 1Node 2 Node 3 PKG-A PKG-B PKG-C PKG-D PKG-E PKG-F

MC/ServiceGuard Packages Simple, flexible Package group all the resources for an application, including: –Volume groups –IP addresses –Services (processes) Packages are the entities that are managed and moved within a cluster environment PKG-A PKG-B PKG-C Service_Processes: App_Daemon_1 App_Daemon_2 Volume Groups: /dev/vg01 /dev/vg02 IP Addresses: Service_Processes: App_Daemon_1 App_Daemon_2 Volume Groups: /dev/vg01 /dev/vg02 IP Addresses:

Distributing Application Packages Balancing workload Redistributing applications after a node failure Minimizing impact on remaining nodes PKG-A PKG-B PKG-C PKG-D PKG-E PKG-F

Toolkits for High Availability Network File Systems Running under MC/ServiceGuard environment App configuration and interface with Package Control Script (a ServiceGuard component) Provide HA functions and configurations Configuration for Workload Distribution Toolkit for NFS –Bash script (for basic functions) –C code (for advanced functions) Toolkit for SAMBA –Bash script (for basic functions)

Package Control Script Package resources configuration –Mirror disks –Volume groups and file systems –Package (re-locatable) IP addresses Resources control functions on package Start & Stop –Start/stop mirroring disk (MD) process –Activate/Deactivate disk volume groups –Mount/un-mount file systems –Enable/Disable IP addresses –Start/stop package service processes –Invoke toolkit script

NFS Toolkit Configurations –Export File Systems (Export NFS directories that will be mounted by NFS client users.) –NFS Monitoring Service (Monitoring script) Functions for NFS Start & Stop –Start/stop NFS server (daemons) –Export/Un-export file systems –Start/stop monitor process –Synchronize NFS server states Current clients mount states (rmtab)

NFS daemons Daemons to be started & monitored –portmap – protmapper –rpc.mountd – mount daemon –nfsd – nfs daemon –lockd – lock daemon –rpc.statd – state (lockd uses it) –rpc.rquotad

rmtab Synchronization Monitoring rmtab (every 5 seconds) Use shared storage as a media Toolkit imports/exports rmtab data to/from shared storage PKG-A Export rmtab data Import rmtab data Package failove

NFS Toolkit Architecture NFS toolkit Package Control Script Data Definition Start/Stop Functions HA NFS Script Data Definition Start/Stop Functions NFS Monitor Script Monitor Processes Monitoring Functions Execute rmtab sync program rmtab sync monitor process MonitorExecute Start/Stop process Execute rmtab Monitor Read/Write

SAMBA Toolkit Configurations –SAMBA base configuration file –A sub-configuration file for a SAMBA package –For workload distribution –SAMBA Monitoring Service (Monitoring script) Functions for SAMBA Start & Stop –Start/stop SAMBA server (daemons) –Start/stop Monitor Process –Generate/regenerate SAMBA configuration file (for workload distribution) –Restart SAMBA server (for workload distribution)

SAMBA daemons Daemons to be started & monitored –smbd –nmbd

SAMBA Toolkit Architecture Toolkit Package Control Script Data Definition Start/Stop Functions Main Script Local Variables HA Functions Monitor Script Monitored Processes Monitor Functions runstart/stop Toolkit Interface Script Application Server Flags Start/Stop App Server start/stop Configuration Script User Configuration Data data import SAMBA Server start/stopmonitor

Sample Configuration hub NFS-ANFS-BNFS-C hub heartbeat data Node 1Node 2 Node 3 NFS-ANFS-BNFS-C Disk Array IP-A IP-BIP-C

NFS Toolkit Limitation NFS failover –Very similar to system restart (server crashed and reboot) –Start on the standby node –Lost kernel & system state info –Lost daemons state info Client may experience a momentary hang Access continues when failover completes

NFS Failover Lost kernel & system states –Inconsistent file handle between Server and Client (Server fails to decode the file handle) –Client may get error message during file read/write. Input/Output error, Stale NFS file handle Write error, Stale NFS file handle –Workaround: give a retry. (i.e. re-open the file.)

NFS Failover (cont) Lost daemons states –Current client’s mount states –Exported file system states –File lock states Toolkit capabilities –Synchronize client’s mount states –Export file systems –(Does not synchronize file lock states. Client need to reclaim the file lock)

Q & A