1 Root-Cause Network Troubleshooting Optimizing the Process Tim Titus CTO, PathSolutions.

Slides:



Advertisements
Similar presentations
Ethernet Switch Features Important to EtherNet/IP
Advertisements

Network Devices Repeaters, hubs, bridges, switches, routers, NICs.
© 2008 Cisco Systems, Inc. All rights reserved.Cisco ConfidentialPresentation_ID 1 Chapter 3: VLANs Routing & Switching.
Chabot College Chapter 2 Review Questions Semester IIIELEC Semester III ELEC
Chapter 13 Network Design and Management
© 2008 Cisco Systems, Inc. All rights reserved.Cisco ConfidentialPresentation_ID 1 Chapter 3: VLANs Routing & Switching.
11 TROUBLESHOOTING Chapter 12. Chapter 12: TROUBLESHOOTING2 OVERVIEW  Determine whether a network communications problem is related to TCP/IP.  Understand.
Click to edit Master subtitle style Chapter 19: Network Troubleshooting Instructor:
Introduction to Network Analysis and Sniffer Pro
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public 1 Version 4.1 Troubleshooting Working at a Small-to-Medium Business or ISP – Chapter 9.
© 2008 Cisco Systems, Inc. All rights reserved.Cisco ConfidentialPresentation_ID 1 W. Schulte Chapter 5: Inter-VLAN Routing Routing And Switching.
© 2008 Cisco Systems, Inc. All rights reserved.Cisco ConfidentialPresentation_ID 1 Chapter 9: Troubleshooting the Network Connecting Networks.
© 2008 Cisco Systems, Inc. All rights reserved.Cisco ConfidentialPresentation_ID 1 Chapter 5: Inter-VLAN Routing Routing & Switching.
QoS Solutions Confidential 2010 NetQuality Analyzer and QPerf.
1 Version 3 Module 8 Ethernet Switching. 2 Version 3 Ethernet Switching Ethernet is a shared media –One node can transmit data at a time More nodes increases.
Chapter 15 Chapter 15: Network Monitoring and Tuning.
NetFlow Analyzer Drilldown to the root-QoS Product Overview.
Readying Data Networks for Converged Services Presented by: Jaye Armstrong-CCIE Director of Data Services.
Performance Management (Best Practices) REF: Document ID
Check Disk. Disk Defragmenter Using Disk Defragmenter Effectively Run Disk Defragmenter when the computer will receive the least usage. Educate users.
Troubleshooting Software Tools vs. Professional Test Equipment.
Chapter 11 Extending LANs: Fiber Modems, Repeaters, Bridges, & Switches Hub Bridge Switch.
Data Communications and Networks Chapter 2 - Network Technologies - Circuit and Packet Switching Data Communications and Network.
© 2008 Cisco Systems, Inc. All rights reserved.Cisco ConfidentialPresentation_ID 1 Chapter 5: Inter-VLAN Routing Routing And Switching.
1/28/2010 Network Plus Unit 5 Section 2 Network Management.
Ensuring Quality in the Contact Center Joseph Dumont Product Manager Contact Center Services Empirix, Inc.
Semester 1 Module 8 Ethernet Switching Andres, Wen-Yuan Liao Department of Computer Science and Engineering De Lin Institute of Technology
Hands-on Networking Fundamentals
Chapter 4: Managing LAN Traffic
Characterizing the Existing Internetwork PART 1
© 2008 Cisco Systems, Inc. All rights reserved.Cisco ConfidentialPresentation_ID 1 Chapter 3: Implementing VLAN Security Routing And Switching.
Net Optics Confidential and Proprietary Net Optics appTap Intelligent Access and Monitoring Architecture Solutions.
Performance Management (Best Practices) REF: Document ID
© 2011 Cisco and/or its affiliates. All rights reserved. 1 High Performance Network Analysis Enterprise Operate Practice Cisco Services Andrew Wojtkowiak.
Module 8: Ethernet Switching
Network Instruments VoIP Analysis. VoIP Basics  What is VoIP?  Packetized voice traffic sent over an IP network  Competes with other traffic on the.
Cisco – Semester 4 – Chapter 7
Network Troubleshooting
1 © 2003, Cisco Systems, Inc. All rights reserved. CCNA 3 v3.0 Module 4 Switching Concepts.
© 2008 Cisco Systems, Inc. All rights reserved.Cisco ConfidentialPresentation_ID 1 Chapter 3: Implementing VLAN Security Routing And Switching.
Performance Management (Best Practices) REF: Document ID
1 Root-Cause VoIP Troubleshooting Optimizing the Process Tim Titus CTO, PathSolutions.
Chapter 8 1 Chap 8 – Network Troubleshooting Learning Objectives Establish a network baseline Describe troubleshooting methodologies and troubleshooting.
Network Troubleshooting
1 Microsoft Windows 2000 Network Infrastructure Administration Chapter 4 Monitoring Network Activity.
© 2008 Cisco Systems, Inc. All rights reserved.Cisco ConfidentialPresentation_ID 1 Chapter 3: VLANs Routing & Switching.
CISCO NETWORKING ACADEMY Chabot College ELEC Ethernet Switches.
Performance Management (Best Practices) REF: Document ID
Network management Network management refers to the activities, methods, procedures, and tools that pertain to the operation, administration, maintenance,
Confidential Rapid Troubleshooting for Data, VoIP, and Video VoIP Performance Manager.
Confidential Rapid Troubleshooting for Data, VoIP and Video VoIP & Network Performance Management Solutions.
+ Lecture#8: VLAN Asma AlOsaimi Topics VLAN Segmentation VLAN Implementation VLAN Security and Design 3.0.
1 Root-Cause Network Troubleshooting Optimizing the Process Tim Titus CTO PathSolutions.
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public ITE PC v4.0 Chapter 1 1 ISP Help Desk Working at a Small-to-Medium Business or ISP – Chapter.
Connectors, Repeaters, Hubs, Bridges, Switches, Routers, NIC’s
Network Devices Repeaters, hubs, bridges, switches, routers, NIC’s.
Chapter 9 Optimizing Network Performance
Instructor Materials Chapter 8: Network Troubleshooting
Instructor Materials Chapter 6: VLANs
Planning and Troubleshooting Routing and Switching
Chapter 5: Inter-VLAN Routing
Configuring EtherChannels and Switch Troubleshooting
2018 Valid Cisco Exam Dumps IT-Dumps
Routing and Switching Essentials v6.0
CCNA Routing and Switching Routing and Switching Essentials v6.0
Chapter 3: Implementing VLAN Security
Network Performance Manager
Discovering Problems Network Anomalies can occur anytime, anyplace for a wide variety of reasons 3:09pm 20% Packet loss due to duplex mismatch 2:36pm Over-utilized.
Chapter 15: Network Monitoring and Tuning
Connectors, Repeaters, Hubs, Bridges, Switches, Routers, NIC’s
Presentation transcript:

1 Root-Cause Network Troubleshooting Optimizing the Process Tim Titus CTO, PathSolutions

2 Business disconnect Why is troubleshooting so hard? Troubleshooting methodology Tool selection Finding the root-cause Achieving Total Network Visibility Agenda

3 You’re responsible for the entire network Most network engineers know less about their network’s health and performance than their user community You can’t manage what you can’t measure -- Peter Drucker Business Disconnect

4 Business Reasons Networks are getting more complex Less staff remains to support the network Technical Reasons Proper methodology is not utilized Wrong tools are employed Why is Troubleshooting so Hard?

5 What graduates a junior level Engineer to a senior level Engineer is their troubleshooting methodology Troubleshooting Methodologies

6 “Do something to try to fix the problem” Reboot the device Change the network settings Replace hardware Re-install the OS Bad Methodology

7 Collect information Verify Original Problem is Solved and no new problems exist Create hypothesis Test hypothesis Implement fix Document fix Notify users Undo changes Good Methodology

8 Types of Tools Cable Testers Packet analyzers/capture Application Performance Monitoring (APM) Flow collectors SNMP Collectors Tool Selection

9 Results 4.3db of loss NEXT detected Cable Tester Actual VoIP Call You have information about Layer 1 on one link in the network Using a cable tester to solve a call quality problem Cable Testers

10 Good for: Confirming physical issues on one link in the network Bad for: Finding physical issues on the network Determining application usage Finding bandwidth limitations Finding device limitations Cable Testers

11 Results of VoIP Call Latency: 127ms Jitter: 87ms Packet loss: 8.2% Packet Capture Actual VoIP Call You have confirmation that there is a problem, but no idea which device or link caused the packet loss Using a sniffer to solve a call quality problem Packet Capture

12 Good for: Confirming packet loss (Are we missing packets?) Confirming packet contents issues (No QoS tagging on packets when there should be) Determining application-level issues (Source and destination IP and ports used for a session) Bad for: Finding physical, data-link, or network issues Finding bandwidth limitations Finding device limitations Packet Capture

13 Results of Simulation Latency: 127ms Jitter: 87ms Packet loss: 8.2% Agent Simulated VoIP Call You have knowledge of the experience across the network, but no understanding of the source or cause of the problem. Using APM to determine performance through the network Agent Application Performance Monitoring

14 Good for: Measuring user experience across the network (Are we having problems right now?) Bad for: Finding physical, data-link, or network issues Finding bandwidth limitations Finding device limitations Application Performance Monitoring

15 Results of Flow SourceIP: :80 DestinationIP: :3411 Packets: 251 Bytes: 19,386 Flow Collector Actual VoIP Call You have knowledge of a transfer across the network, but no recognition if there were any problems with the transfer. Using a flow collector to determine usage of the network Flow Record Flow Collectors

16 Good for: Determining communications across the network Who is using a link? When do they use it? What do they use it for? Bad for: Finding physical, data-link, or network issues Finding bandwidth limitations Finding device limitations Flow Collectors

17 Results of Collection WAN link is overloaded at 2:35pm SNMP Collector Actual VoIP Call You have data about conditions on some parts of the network, but no analysis of the problem or correlation to events Collecting information from switches and routers to discover faults SNMP Collectors

18 Good for: Tracking packet loss per interface/device (Are we dropping packets on a link? why?) Monitoring device and link resource limitations (Are we over-utilizing a link? Is the router CPU pegged?) Bad for: Determining who is using the network Finding application layer problems SNMP Collectors

19 Step 1: Identify the involved endpoints and where they are connected into the network Poor Quality VoIP Call Finding the Root-Cause

20 Step 2: Identify the full layer-2 path through the network from the first phone to the second phone Finding the Root-Cause

21 Step 3: Investigate involved switch and router health (CPU & Memory) for acceptable levels Finding the Root-Cause

22 Step 4: Investigate involved interfaces for: VLAN assignment DiffServe/QoS tagging Queuing configuration 802.1p Priority settings Duplex mismatches Cable faults Half-duplex operation Broadcast storms Incorrect speed settings Over-subscription TRANSIENT PROBLEM WARNING: If the error condition is no longer occurring when this investigation is performed, you may not catch the problem Finding the Root-Cause

23 In a perfect world, you want: Monitoring of:  Every switch, router, and link in the entire infrastructure  All error counters on the interfaces  QoS configuration and performance Continuous collection of information Automatic layer-1, 2, and 3 mapping from any IP endpoint to any other IP endpoint Problems identified in plain-English for rapid remediation This is what PathSolutions TotalView does Optimizing the Methodology

24 Install TotalView Result: One location is able to monitor all devices and links in the entire network for performance and errors All Switches and Routers are queried for information Deployment

25 Broad: All ports on all routers & switches Continuous: Health collected every 5 minutes Deep: 18 different error counters collected and analyzed Network Prescription engine provides plain- English descriptions of errors: Total Network Visibility® “This interface is dropping 12% of its packets due to a cable fault”

26 Establish Baseline of Network Health 7% Loss from cabling fault 12% Loss from Alignment Errors 11% Loss from Jumbo Frame Misconfiguration 28% Loss from Duplex mismatch Results Within 12 Minutes

27 Repair Issues 7% Loss from cabling fault 12% Loss from Alignment Errors 28% Loss from Duplex mismatch Results Within 12 Minutes 11% Loss from Jumbo Frame Misconfiguration

28 11:32am 100% Transmit utilization 15% Loss from discards Latency & Jitter penalty incurred 7:56am 18% Loss from Cable Fault 12:02pm 12% Loss from Collisions Path Analysis Report

29 Demo

30 Don’t turtle your network

31 With it, you will always have an easy way to map out your network on any white board! Free Network Equipment Magnet Set

32 Must be present to win Hourly iPad mini giveaways Booth #1951 Today’s Show Times 1:00 2:00 3:00 4:00

33 Special Guest at 1pm: Brian Chee on Station Aloha One of our more interesting implementations. Located 3 miles beneath the ocean surface. How exactly do you manage its performance? Attend this special session and discover what it’s like to manage our most unique network!

34