Download presentation
Presentation is loading. Please wait.
Published byLucy Hampton Modified over 6 years ago
1
Module 7: Server Cluster Maintenance and Troubleshooting
2
Overview Cluster Maintenance Troubleshooting Cluster Service
3
Cluster Maintenance Backup Restoring the First Node
Restoring Cluster Disks Restoring the Second Node Evicting a Node
4
Backup Backing Up the System State Backing Up the Local Disk
Backing Up the Cluster Disk
5
Restoring the First Node
Steps For Restoring a Server Cluster: Restore the first node Restore the cluster disks Restore the second node Perform node testing
6
Restoring Cluster Disks
Restoring Disk Signature Files Restoring the Data on the Cluster Disk Restoring the Cluster Configuration Files
7
Restoring the Second Node
Restoring the Remaining Node(s) of a Cluster Perform Node Testing
8
Evicting a Node Steps for Evicting a Node Back up both nodes
Verify backup Move all groups to the remaining node Stop Cluster service on the node to be removed Evict the node Unplug the server from the shared bus
9
Troubleshooting Cluster Service
Troubleshooting Tools Examining the Cluster Log Troubleshooting Network Communications SCSI Configuration Problems Group and Resource Failures Quorum Log Corruption
10
Troubleshooting Tools
Disk Manager Task Manager Performance Monitor Network Monitor Dr. Watson Services Snap-in
11
Examining the Cluster Log
Copy of cluster - Wordpad Creates a new cluster group 000003b b4::2000/10/02-19:44: [CS] Cluster Service started – Cluster Node Vers 000003b b4::2000/10/02-19:44: OS Version 000003b f0::2000/10/02-19:44: [CS] Service Starting… 000003b f0::2000/10/02-19:44: [EP] Initialization… 000003b f0::2000/10/02-19:44: [DM]: Initialization 000003b f0::2000/10/02-19:44: [DM]: Loading cluster database form D:\WINNT\clu 000003b f0::2000/10/02-19:44: [DM] DmpStartFlusher: Entry 000003b f0::2000/10/02-19:44: [DM] DmpStartFlusher: thread created 000003b f0::2000/10/02-19:44: [NM] Initializing… 000003b f0::2000/10/02-19:44: [NM] Local node name = SERVER1. 000003b f0::2000/10/02-19:44: [NM] Local node ID = 1. 000003b f0::2000/10/02-19:44: [NM] Creating object for node 1 (SERVER1) 000003b f0::2000/10/02-19:44: [NM] Initializing networks. 000003b f0::2000/10/02-19:44: [NM] Initializing network interfaces. 000003b f0::2000/10/02-19:44: [NM] Initializing complete. 000003b f0::2000/10/02-19:44: [NM] Starting worker thread… 000003b f0::2000/10/02-19:44: [API] Initializing 000003b f0::2000/10/02-19:44: [FM] Worker thread running 000003b f0::2000/10/02-19:44: [LM] :LMInitialize Entry. 000003b f0::2000/10/02-19:44: [LM] :TimerActInitialize Entry. 000003b f0::2000/10/02-19:44: [CS] Service Domain Account = 000003b f0::2000/10/02-19:44: [CS] Initializing RPC server. 000003b f0::2000/10/02-19:44: [INIT] Attempting to join cluster MYCLUSTER 000003b f0::2000/10/02-19:44: [JOIN] Spawning thread to connect to sponsor 10. 000003b f0::2000/10/02-19:44: [JOIN] Spawning thread to connect to sponsor 169 File Edit View Insert Format Help The IDs of the process and thread issuing the log entry timestamp event description In the above slide, when I printed this page in hard copy, the two bottom callouts are really difficult to read in hard copy. The box is transparent, so “The Ids of the process and thread…etc” are printed on top of the cluster log, and you can see the list of numbers underneath the boxes. I would try to have the graphic artist fix this.
12
Troubleshooting Network Communications
Troubleshooting Node-to-Node Communication Verify RPC Communication’s Verify Cluster Heartbeats Troubleshooting Client-to-Node Communications Check NetBT Cache with Nbtstat Ping IP Address WINS Static Mappings
13
SCSI Configuration Problems
SCSI Controllers SCSI Terminiation SCSI Cabling
14
Group and Resource Failures
Cluster Administrator – [MYCLUSTER (MYCLUSTER)] File View Window Help For Help, press F1 MYCLUSTER Groups Cluster Group Mygroup SQL Group Resources Cluster Configuration SERVER1 SERVER2 Name State Owner Reso Cluster IP Address Online SERVER2 IP Ad Cluster Name Online SERVER2 Netw Disk W: Online SERVER2 Physi Printer Spooler Online SERVER2 Print Public Failed SERVER2 File S NUM
15
Quorum Log Corruption Reset the Quorum Log
Clussvc –debug -resetquorumlog Delete the Quorum Log -noquorumlogging
16
Lab A: Cluster Maintenance
17
Review Cluster Maintenance Troubleshooting Cluster Service
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.