Presentation is loading. Please wait.

Presentation is loading. Please wait.

Module 7: Server Cluster Maintenance and Troubleshooting

Similar presentations


Presentation on theme: "Module 7: Server Cluster Maintenance and Troubleshooting"— Presentation transcript:

1 Module 7: Server Cluster Maintenance and Troubleshooting

2 Overview Cluster Maintenance Troubleshooting Cluster Service

3 Cluster Maintenance Backup Restoring the First Node
Restoring Cluster Disks Restoring the Second Node Evicting a Node

4 Backup Backing Up the System State Backing Up the Local Disk
Backing Up the Cluster Disk

5 Restoring the First Node
Steps For Restoring a Server Cluster: Restore the first node Restore the cluster disks Restore the second node Perform node testing

6 Restoring Cluster Disks
Restoring Disk Signature Files Restoring the Data on the Cluster Disk Restoring the Cluster Configuration Files

7 Restoring the Second Node
Restoring the Remaining Node(s) of a Cluster Perform Node Testing

8 Evicting a Node Steps for Evicting a Node Back up both nodes
Verify backup Move all groups to the remaining node Stop Cluster service on the node to be removed Evict the node Unplug the server from the shared bus

9 Troubleshooting Cluster Service
Troubleshooting Tools Examining the Cluster Log Troubleshooting Network Communications SCSI Configuration Problems Group and Resource Failures Quorum Log Corruption

10 Troubleshooting Tools
Disk Manager Task Manager Performance Monitor Network Monitor Dr. Watson Services Snap-in

11 Examining the Cluster Log
Copy of cluster - Wordpad Creates a new cluster group 000003b b4::2000/10/02-19:44: [CS] Cluster Service started – Cluster Node Vers 000003b b4::2000/10/02-19:44: OS Version 000003b f0::2000/10/02-19:44: [CS] Service Starting… 000003b f0::2000/10/02-19:44: [EP] Initialization… 000003b f0::2000/10/02-19:44: [DM]: Initialization 000003b f0::2000/10/02-19:44: [DM]: Loading cluster database form D:\WINNT\clu 000003b f0::2000/10/02-19:44: [DM] DmpStartFlusher: Entry 000003b f0::2000/10/02-19:44: [DM] DmpStartFlusher: thread created 000003b f0::2000/10/02-19:44: [NM] Initializing… 000003b f0::2000/10/02-19:44: [NM] Local node name = SERVER1. 000003b f0::2000/10/02-19:44: [NM] Local node ID = 1. 000003b f0::2000/10/02-19:44: [NM] Creating object for node 1 (SERVER1) 000003b f0::2000/10/02-19:44: [NM] Initializing networks. 000003b f0::2000/10/02-19:44: [NM] Initializing network interfaces. 000003b f0::2000/10/02-19:44: [NM] Initializing complete. 000003b f0::2000/10/02-19:44: [NM] Starting worker thread… 000003b f0::2000/10/02-19:44: [API] Initializing 000003b f0::2000/10/02-19:44: [FM] Worker thread running 000003b f0::2000/10/02-19:44: [LM] :LMInitialize Entry. 000003b f0::2000/10/02-19:44: [LM] :TimerActInitialize Entry. 000003b f0::2000/10/02-19:44: [CS] Service Domain Account = 000003b f0::2000/10/02-19:44: [CS] Initializing RPC server. 000003b f0::2000/10/02-19:44: [INIT] Attempting to join cluster MYCLUSTER 000003b f0::2000/10/02-19:44: [JOIN] Spawning thread to connect to sponsor 10. 000003b f0::2000/10/02-19:44: [JOIN] Spawning thread to connect to sponsor 169 File Edit View Insert Format Help The IDs of the process and thread issuing the log entry timestamp event description In the above slide, when I printed this page in hard copy, the two bottom callouts are really difficult to read in hard copy. The box is transparent, so “The Ids of the process and thread…etc” are printed on top of the cluster log, and you can see the list of numbers underneath the boxes. I would try to have the graphic artist fix this.

12 Troubleshooting Network Communications
Troubleshooting Node-to-Node Communication Verify RPC Communication’s Verify Cluster Heartbeats Troubleshooting Client-to-Node Communications Check NetBT Cache with Nbtstat Ping IP Address WINS Static Mappings

13 SCSI Configuration Problems
SCSI Controllers SCSI Terminiation SCSI Cabling

14 Group and Resource Failures
Cluster Administrator – [MYCLUSTER (MYCLUSTER)] File View Window Help For Help, press F1 MYCLUSTER Groups Cluster Group Mygroup SQL Group Resources Cluster Configuration SERVER1 SERVER2 Name State Owner Reso Cluster IP Address Online SERVER2 IP Ad Cluster Name Online SERVER2 Netw Disk W: Online SERVER2 Physi Printer Spooler Online SERVER2 Print Public Failed SERVER2 File S NUM

15 Quorum Log Corruption Reset the Quorum Log
Clussvc –debug -resetquorumlog Delete the Quorum Log -noquorumlogging

16 Lab A: Cluster Maintenance

17 Review Cluster Maintenance Troubleshooting Cluster Service


Download ppt "Module 7: Server Cluster Maintenance and Troubleshooting"

Similar presentations


Ads by Google