Troubleshooting beyond what you understand

Presentation transcript:

Troubleshooting beyond what you understand
Or: How to figure out what’s broken so you can get some help from the real owner, because your stuff never breaks. Right?
Ryan McCauley
#597 – Phoenix 2017

Ryan McCauley
- VB6/VB.NET developer for 10 years
- Full-time DBA/T-SQL dev for 6 years
- Currently Data and Reporting Manager at CableONE
- Microsoft Certified Professional (MCTS – SQL 2008 DBA)
- Active on Experts-Exchange and StackOverflow
- Twitter: @SQLRyan
- Blog: www.trycatchfinally.net
- Email: Ryan@KilaniMcCauley.com
SQL SATURDAY | #597 | PHOENIX 2017

It Was a Dark and Stormy Night
- Also, applications are broken somewhere…
- Talk about the rotating DNS (backup NIC issue)
- Connections to SQL Server intermittent, but even
- Information comes in slowly; learn from it

Agenda Today
- Ground rules
- Techniques
- Major symptoms
- Common confusion
- Next steps

Ground Rules

Ground Rules
- Never say “randomly”; say “intermittent”
- It’s not just your components
- Consider their interaction and what’s around them
- You can always influence
- “Intermittent” is something you don't yet understand, but it always has a cause
- When you say “random”, you're saying you can't own it because it's not in your control
- Given the same inputs, the behavior of computers is always consistent
- See everything as something you own and can influence; you’re not helpless

Ground Rules
- Something always changed! Always! Just maybe not on purpose
- Don’t take anything for granted! Both in this class and in troubleshooting
- Monitoring only has a single perspective
- Only trust what you’ve verified

Techniques

Techniques
- Figure out what it’s not
- If that’s true, what else would be true?
- Make the problem as small as possible
- Need to isolate it to prove it
- Does it work at all? Where can you connect from?
- Myers-Briggs: S (focus on resolving the examples) vs. N (every example needs to fit a pattern first)
- Small problem: you need to isolate it to prove it, especially to others
- Reproduce the problem in a second location with as much different as possible
- It is hard to test a whole system; you need to test components
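One way to "figure out what it's not" is to write down which components each attempt passed through and whether it worked, then keep only what every failure shares and no success touched. The sketch below is illustrative only: the `suspects` helper and the component names (`client`, `lb_node`, `db`) are hypothetical and do not come from the talk.

```python
from typing import Dict, List, Set, Tuple

# One observation: the components an attempt passed through, and whether it worked.
Observation = Tuple[Dict[str, str], bool]


def suspects(observations: List[Observation]) -> Set[Tuple[str, str]]:
    """Return (component, value) pairs present in every failure and in no success."""
    failures = [set(parts.items()) for parts, worked in observations if not worked]
    successes = [set(parts.items()) for parts, worked in observations if worked]
    if not failures:
        return set()
    # Start from what all failures have in common...
    common = set.intersection(*failures)
    # ...then rule out anything that also appears in a working attempt.
    for seen_working in successes:
        common -= seen_working
    return common


if __name__ == "__main__":
    # Hypothetical test matrix: two clients, two load balancer nodes, one database.
    attempts = [
        ({"client": "office", "lb_node": "A", "db": "sql1"}, True),
        ({"client": "office", "lb_node": "B", "db": "sql1"}, False),
        ({"client": "vpn", "lb_node": "B", "db": "sql1"}, False),
        ({"client": "vpn", "lb_node": "A", "db": "sql1"}, True),
    ]
    print(suspects(attempts))
```

An empty result means the recorded components do not explain the failures yet, which is a hint to capture more of what surrounds the service.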

Techniques
- Is it consistent?
- Can you find somewhere it’s not broken?
- Shared vs. dedicated components
- VMs can dramatically complicate things
- Time it takes when it does run: does it vary? Is it quick or slow?
- Are the same sources always broken?
- DAC FTP issue: 1 server takes 0.5 seconds, the other 7 take 12-14 seconds, even for a failed login
- Which components are shared vs. dedicated? VMs complicate this because everything is shared and live migration is seamless
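The FTP example (one server answering in 0.5 seconds, the others in 12-14 seconds) suggests timing the same action against every target and comparing. A minimal sketch follows; the helper names `time_call` and `slow_outliers` and the "5 times the fastest" threshold are assumptions made for illustration, not something the talk prescribes.

```python
import statistics
import time
from typing import Callable, Dict, List


def time_call(action: Callable[[], object], repeats: int = 3) -> List[float]:
    """Run `action` several times and return each wall-clock duration in seconds.

    Failed attempts are timed too: a slow *failed* login is still a data point.
    """
    durations = []
    for _ in range(repeats):
        start = time.perf_counter()
        try:
            action()
        except Exception:
            pass  # only the elapsed time matters here
        durations.append(time.perf_counter() - start)
    return durations


def slow_outliers(timings: Dict[str, List[float]], factor: float = 5.0) -> Dict[str, float]:
    """Return targets whose median time exceeds `factor` times the fastest median."""
    if not timings:
        return {}
    medians = {name: statistics.median(values) for name, values in timings.items()}
    fastest = min(medians.values())
    return {name: m for name, m in medians.items() if m > fastest * factor}


if __name__ == "__main__":
    # Hypothetical measurements shaped like the example in the slide.
    measured = {
        "server1": [0.5, 0.5, 0.6],
        "server2": [12.0, 13.0, 14.0],
        "server3": [12.5, 12.1, 13.9],
    }
    print(slow_outliers(measured))
```

Using the fastest target as the baseline answers "can you find somewhere it's not broken?" directly: whatever the fast target has that the slow ones lack is worth checking first.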

Simplify everything!
- How they get to your service
- Things your service depends on
- Your service
- Customers

Major symptom – cheat sheet

Major Symptoms, part 1
- Never works: firewall or app not listening
- Intermittently not accessible: what’s changing? Load balancer/cluster?
- Always slow but consistent: hardware config/resource; likely not load on shared components

Major Symptoms, part 2
- Intermittent/inconsistent slowness: hardware bottleneck or shared resource?
- Unchanging or predictable: more likely configuration
- Shifting or unpredictable: more likely capacity somewhere
- VM as shared component: harder to see the impact
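The cheat sheet can be read as a small decision procedure over a series of attempts. The sketch below is one possible encoding, not the speaker's: the function name `classify_symptom`, the 2-second `slow_after` cutoff, and the `max_spread` ratio are all assumptions chosen for illustration. Each sample is a response time in seconds, or `None` when the attempt failed outright.

```python
from typing import Dict, List, Optional

# Symptom -> where to look first, paraphrased from the cheat sheet.
CHEAT_SHEET: Dict[str, str] = {
    "never works": "firewall, or application not listening",
    "intermittently not accessible": "what is changing? load balancer or cluster?",
    "always slow but consistent": "hardware configuration or resources",
    "intermittent slowness": "hardware bottleneck or shared resource",
    "no symptom observed": "keep collecting samples",
}


def classify_symptom(
    samples: List[Optional[float]],
    slow_after: float = 2.0,   # assumed cutoff for "slow", in seconds
    max_spread: float = 0.5,   # assumed tolerance for "consistent"
) -> str:
    """Map a series of attempts onto one of the cheat-sheet symptoms."""
    if not samples:
        raise ValueError("need at least one sample")
    if slow_after < 0:
        raise ValueError("slow_after must not be negative")
    succeeded = [s for s in samples if s is not None]
    if not succeeded:
        return "never works"
    if len(succeeded) < len(samples):
        return "intermittently not accessible"
    if max(succeeded) <= slow_after:
        return "no symptom observed"
    # max(succeeded) > slow_after >= 0 here, so the division is safe.
    spread = (max(succeeded) - min(succeeded)) / max(succeeded)
    if min(succeeded) > slow_after and spread <= max_spread:
        return "always slow but consistent"
    return "intermittent slowness"


if __name__ == "__main__":
    symptom = classify_symptom([0.3, 9.0, 0.4])
    print(symptom, "->", CHEAT_SHEET[symptom])
```

The thresholds need tuning per service; the point is only that each symptom is distinguishable from measurements, without guessing.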

Common Confusion

Common Confusion
- Login failures vs. firewall timeouts
- Ever used TCPING? Know common ports!
- Firewall rules: when are they evaluated?
- People blame “Kerberos” as a catch-all
- Ping isn’t the same as making sure the path is open! Ping doesn’t use a TCP port at all
- Talk about subnets/VLANs
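Because ping (ICMP) says nothing about a TCP port, a direct connection attempt is the more useful test. The following is a minimal sketch of a TCPING-style check using Python's standard `socket` module; the function name `probe_tcp` and the host name in the usage line are hypothetical. A quick "refused" usually means the host answered but nothing is listening on that port, while a "timeout" is more typical of a firewall silently dropping packets (or an unreachable host); treat both readings as hints rather than proof.

```python
import socket


def probe_tcp(host: str, port: int, timeout: float = 3.0) -> str:
    """Classify a TCP connection attempt as 'open', 'refused', 'timeout', or 'error'."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return "open"
    except ConnectionRefusedError:
        return "refused"   # host reachable, port actively rejected
    except socket.timeout:
        return "timeout"   # no answer at all within `timeout` seconds
    except OSError:
        return "error"     # e.g. name resolution failed or no route


if __name__ == "__main__":
    # 1433 is the default port for a default SQL Server instance;
    # the host name is a placeholder.
    print(probe_tcp("db01.example.com", 1433))
```

A login failure can only happen after this check returns "open", which is what separates the two cases in the first bullet.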

Slightly less dark and stormy…
- Back to the beginning…
- Resolve the DNS issue
- If time, talk about either:
  - Firewall timeouts when we moved reporting servers (5 minutes)
  - Mis-aligned disks on clusters = consistently slow read times
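The transcript does not spell out the DNS details. Assuming the server name resolved to more than one address and only some of them accepted connections, a check along these lines would expose the pattern. This is a sketch under that assumption; `resolve_all` and `check_each_address` are hypothetical helper names and the host name in the usage line is a placeholder.

```python
import socket
from typing import Callable, Dict, List


def resolve_all(host: str, port: int) -> List[str]:
    """Return every distinct address the resolver currently gives for `host`."""
    infos = socket.getaddrinfo(host, port, type=socket.SOCK_STREAM)
    # Each entry is (family, type, proto, canonname, sockaddr); sockaddr[0] is the IP.
    return sorted({info[4][0] for info in infos})


def check_each_address(
    host: str,
    port: int,
    timeout: float = 3.0,
    resolver: Callable[[str, int], List[str]] = resolve_all,
) -> Dict[str, bool]:
    """Try a TCP connection to each resolved address separately.

    Connecting by name only tests whichever address the client happens to pick,
    which is how one bad address can look like an intermittent failure.
    """
    results: Dict[str, bool] = {}
    for address in resolver(host, port):
        try:
            with socket.create_connection((address, port), timeout=timeout):
                results[address] = True
        except OSError:
            results[address] = False
    return results


if __name__ == "__main__":
    print(check_each_address("db01.example.com", 1433))
```

A mix of `True` and `False` for the same name points at the name resolution layer rather than at SQL Server itself.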

Next Steps

Next Steps
- Learn about what you don’t know: shadowing, training, ask!
- Specialized knowledge is not required, but can help
- If you don’t understand a concept, ask
- It’s not resolved until you understand why!
- Root cause analysis is critical
- Don’t let “root cause analysis” be “it’s not happening anymore” or “it resolved itself”; it’s not resolved until you know it’s not going to happen again!

Thanks for attending! Please visit the sponsors and complete an evaluation!

The Sponsors!