CIT 470: Advanced Network and System Administration

Slides:



Advertisements
Similar presentations
NAGIOS AND CACTI NETWORK MANAGEMENT AND MONITORING SYSTEMS.
Advertisements

Network Management Workshop intERlab at AIT Thailand March 11-15, 2008 Network Operations and Network Management.
CIT 470: Advanced Network and System AdministrationSlide #1 CIT 470: Advanced Network and System Administration Debugging.
ADVANCED FUNCTIONALITY & TROUBLESHOOTING. Page 2 Agenda Main topics Advanced Policy Manager Server configuration Resolving Apache Web Server security.
20-753: Fundamentals of Web Programming 1 Lecture 11: Web Server Case Study Fundamentals of Web Programming Lecture 11: Web Server Case Study.
Implementing a Highly Available Network
ManageEngine TM Applications Manager 8 Monitoring Custom Applications.
Measuring Performance Chapter 12 CSE807. Performance Measurement To assist in guaranteeing Service Level Agreements For capacity planning For troubleshooting.
MCTS Guide to Microsoft Windows Server 2008 Network Infrastructure Configuration Chapter 8 Introduction to Printers in a Windows Server 2008 Network.
Chapter 11: Maintaining and Optimizing Windows Vista
Open Source Software Tool Paige Asmann LI 815 Fall 2012.
Monitoring System Monitors Basics Monitor Types Alarms Actions RRD Charts Reports.
Voyager Server Security and Monitoring Best practices and tools.
OpStor - A multi vendor storage resource management and capacity forecasting software.
IBM Software Group Washington Area Informix User Group Forum 2004 The DB2 DBA Checklist Dwaine R Snow, DB2 & Informix.
New SA Training Topic 9: Logging, Monitoring, and Performance  Logging  Windows – “Auditing”  Linux – syslog  Monitoring  MRTG  Big Brother  Performance.
CIT 470: Advanced Network and System AdministrationSlide #1 CIT 470: Advanced Network and System Administration Help Desks.
CIT 470: Advanced Network and System AdministrationSlide #1 CIT 470: Advanced Network and System Administration Performance Monitoring.
CIT 470: Advanced Network and System AdministrationSlide #1 CIT 470: Advanced Network and System Administration Performance Monitoring.
Josh Riggs Utilizing Open Source Network Monitoring.
CIT 470: Advanced Network and System AdministrationSlide #1 CIT 470: Advanced Network and System Administration People.
Network Management Tool Amy Auburger. 2 Product Overview Made by Ipswitch Affordable alternative to expensive & complicated Network Management Systems.
Overview of MSS System Human Actors Non-Human Actors In-house developed components Third party products.
Module 10: Monitoring ISA Server Overview Monitoring Overview Configuring Alerts Configuring Session Monitoring Configuring Logging Configuring.
TELE 301 Lecture 10: Scheduled … 1 Overview Last Lecture –Post installation This Lecture –Scheduled tasks and log management Next Lecture –DNS –Readings:
Guide to Linux Installation and Administration, 2e1 Chapter 10 Managing System Resources.
workshop eugene, oregon What is network management? System & Service monitoring  Reachability, availability Resource measurement/monitoring.
Computer Emergency Notification System (CENS)
Network Monitoring System for the UNIX Lab Bradley Kita Capstone Project Mentor: Dr C. David Shaffer Fall 2004/Spring 2005.
Linux Networking and Security
Network Monitoring Manage your business without blowing your budget. Learn how the Calhoun ISD utilizes free “Open Source” tools for real-time monitoring.
CIT 470: Advanced Network and System AdministrationSlide #1 CIT 470: Advanced Network and System Administration Change and Configuration Management.
1 Implementing Monitoring and Reporting. 2 Why Should Implement Monitoring? One of the biggest complaints we hear about firewall products from almost.
CIT 470: Advanced Network and System AdministrationSlide #1 CIT 470: Advanced Network and System Administration Disaster Recovery.
1 Oracle Enterprise Manager Slides from Dominic Gélinas CIS
CIT 470: Advanced Network and System AdministrationSlide #1 CIT 470: Advanced Network and System Administration Change and Configuration Management.
CIT 470: Advanced Network and System AdministrationSlide #1 CIT 470: Advanced Network and System Administration System Monitoring.
Monitoring and Managing Server Performance. Server Monitoring To become familiar with the server’s performance – typical behavior Prevent problems before.
Network Management Mechanisms Two major network management protocols: Simple Network Management Protocol (SNMP) Common Management Information Protocol.
Monitoring Alfresco with Nagios/Icinga Toni de la Fuente Alfresco Senior Solutions Engineer Blog: blyx.com
Difference between External and Internal Server Monitoring.
CIT 470: Advanced Network and System AdministrationSlide #1 CIT 470: Advanced Network and System Administration Services.
Alerting With MySQL and Nagios Sheeri Cabral Senior DB Admin/Architect
2008 Taipei, Taiwan An Introduction APRICOT 2008 Network Management Workshop February – Taipei, Taiwan Hervey Allen & Phil.
Network Monitoring Kathryn Hockman. What is Network Monitoring Network Monitoring involves the use of a system that is constantly monitoring a network.
Network Monitoring Sebastian Büttrich, NSRC / IT University of Copenhagen Last edit: February 2012, ICTP Trieste
Nagios - introduction Dhruba Raj Bhandari ( CCNA ) p Additions by Phil Regnauld.
'08 Rabat An Introduction AfNOG 2008 Network Management Workshop June 1-2 – Rabat, Morocco Hervey Allen & Phil Regnauld.
Administering the SOWN Network David R Newman & Chris Malton.
Network Management Workshop March – Bangkok, Thailand
Smarter Technology for Better Business
Understanding the New PTC System Monitor (PSM/Dynatrace) Application’s Capabilities and Advanced Usage Stephen Vaillancourt PTC Technical Support –Technical.
Application or server monitoring
Quick Look on dCache Monitoring at FNAL
Network Operations and Network Management
MCTS Guide to Microsoft Windows 7
Network Monitoring System
CIT 470: Advanced Network and System Administration
CIT 470: Advanced Network and System Administration
CIT 470: Advanced Network and System Administration
CIT 470: Advanced Network and System Administration
CIT 470: Advanced Network and System Administration
Nate Nelson I*LEVEL, Inc.
Backup Monitoring – EMC NetWorker
Backup Monitoring – EMC NetWorker
APACHE WEB SERVER.
CIT 470: Advanced Network and System Administration
Network Monitoring System
CIT 470: Advanced Network and System Administration
Presentation transcript:

CIT 470: Advanced Network and System Administration System Monitoring CIT 470: Advanced Network and System Administration

CIT 470: Advanced Network and System Administration Topics Why monitoring? Historical monitoring Real-time monitoring Monitoring techniques Monit Web-based monitoring tools CIT 470: Advanced Network and System Administration

CIT 470: Advanced Network and System Administration Why Monitoring? “If you aren’t monitoring a service, you can’t manage it.” CIT 470: Advanced Network and System Administration

CIT 470: Advanced Network and System Administration Why Monitoring? Rapidly detect and fix problems. Identify the source of problems. Predict and avoid future problems. Document an SA’s achievements. CIT 470: Advanced Network and System Administration

Historical Monitoring Record long-term system statistics. Uptime. Performance. Security. Utilizations. Examples Web server uptime was 99.99% last year, compared to 99.9% the previous year. Peak network usage is 8 MBps, up from 5 MBps. Uses Capacity planning. Planning for reliability or security improvements. CIT 470: Advanced Network and System Administration

Historical Monitoring Processes Polling Take measurements at regular intervals. Store database of measurements. Graph summaries of collected data. Measurement Tools iostat vmstat ps sar CIT 470: Advanced Network and System Administration

CIT 470: Advanced Network and System Administration Real-time Monitoring Alert SA to failures as they happen. Discover problems before customer does. Shorter outages. Better reputation. Real-time Monitor components Monitoring system (poll or alert). Notification system. CIT 470: Advanced Network and System Administration

Real-time Monitoring Techniques Polling Poll systems and applications for status. Ex: ping critical servers every 5 minutes. Alerting Many systems can send alerts to monitoring system when they detect a problem. Ex: RAID array logs a disk failure. CIT 470: Advanced Network and System Administration

CIT 470: Advanced Network and System Administration Notification Types of notification Email Paging Phone call Reliability Notification system should not depend on system being monitored. Email can fail or have long delays. Pages are susceptible to third party failures and monitoring. CIT 470: Advanced Network and System Administration

CIT 470: Advanced Network and System Administration Escalation What if the SA is on vacation? Notifications need to be transferrable. Static: reconfigure notifier before vacation. Dynamic: configurable set of receipients. Ex: If SA doesn’t respond in 1 hour, notify manager. CIT 470: Advanced Network and System Administration

CIT 470: Advanced Network and System Administration Types of monitoring Availability Watch for outages in network, host, apps. Ex: cannot reach mail server. Capacity Check thresholds for CPU, mem, disk, network. Ex: mail spool disk is 95% full CIT 470: Advanced Network and System Administration

CIT 470: Advanced Network and System Administration Active Monitoring Active monitoring systems can fix problems. Respond faster than a human can. Can typically only implement temporary fix. Can’t fix all problems: bad disk, out of paper. Risks Reliability: Test active responses thoroughly before deployment. Security: Active monitor typically needs admin access on all monitored systems. CIT 470: Advanced Network and System Administration

CIT 470: Advanced Network and System Administration Levels of Testing Check server is pingable. Verifies connectivity from monitor only. Check that application is up. Make a TCP connection to service port. Check process or service list. End-to-end testing. Entire transaction as customer would do. Ex: send and receive an e-mail message. CIT 470: Advanced Network and System Administration

CIT 470: Advanced Network and System Administration Running monit Starting monit [-v] Status monit status monit summary (also provides web interface on port 2812) Stopping monit quit CIT 470: Advanced Network and System Administration

CIT 470: Advanced Network and System Administration Global configuration set daemon 60 set logfile syslog facility log_daemon set alert root@domain set mailserver my-server set httpd port 2812 address localhost allow localhost allow admin:monit CIT 470: Advanced Network and System Administration

CIT 470: Advanced Network and System Administration Monitoring a Process check process apache with pidfile "/usr/local/apache/logs/httpd.pid" start = “/etc/init.d/httpd start" stop = "/etc/init.d/httpd stop" if failed port 80 and protocol http and request "/cgi-bin/printenv" then restart if cpu usage is greater than 60 percent for 2 cycles then alert if cpu usage > 98% for 5 cycles then restart if 2 restarts within 3 cycles then timeout Example from http://www.tildeslash.com/monit/doc/ CIT 470: Advanced Network and System Administration

CIT 470: Advanced Network and System Administration Monitoring a File # Rotate log if it gets too big check file access_log with path /var/log/access_log if size > 100 Mb then exec "/usr/sbin/logrotate -f rotate_apache_now“ # Restart Apache if config changes check file httpd.conf with path /usr/local/apache/conf/httpd.conf if changed checksum then exec "/usr/local/apache/bin/apachectl graceful" Example from http://www.tildeslash.com/monit/doc/ CIT 470: Advanced Network and System Administration

CIT 470: Advanced Network and System Administration Monitoring CPU check system localhost if loadavg (1min) > 5 then alert if loadavg (5min) > 3 then alert if memory usage > 80% then alert if cpu usage (user) > 80% then alert CIT 470: Advanced Network and System Administration

CIT 470: Advanced Network and System Administration Monitoring a Disk check device rootfs with path / if space usage > 90% then alert check device varfs with path /var CIT 470: Advanced Network and System Administration

Monitoring Remote Hosts # Ping the host to see if it’s up check host foo with address foo.com if failed icmp type echo with timeout 15 seconds then alert # Detailed test, accessing web services check host foo with address foo if failed port 80 protocol http and request “/status” then alert if failed port 443 type TCPSSL and protocol http with timeout 15 seconds then alert Example from http://www.tildeslash.com/monit/doc/ CIT 470: Advanced Network and System Administration

Monitoring Tools Ganglia Cacti Nagios Zabbix Hyperic HQ Munin ZenOSS OpenNMS GroundWork God Monit

CIT 470: Advanced Network and System Administration Nagios CIT 470: Advanced Network and System Administration

CIT 470: Advanced Network and System Administration Nagios Network Maps CIT 470: Advanced Network and System Administration

CIT 470: Advanced Network and System Administration Nagios Graphs CIT 470: Advanced Network and System Administration

CIT 470: Advanced Network and System Administration Zabbix Graphs CIT 470: Advanced Network and System Administration

CIT 470: Advanced Network and System Administration References Mark Burgess, Principles of System and Network Administration, Wiley, 2000. Aeleen Frisch, Essential System Administration, 3rd edition, O’Reilly, 2002. Mike Loukides and Gian-Paolo D. Musumeci, System Performance Tuning, 2nd edition, O’Reilly, 2003. Monit doc, http://mmonit.com/monit/documentation/monit.pdf Evi Nemeth et al, UNIX System Administration Handbook, 3rd edition, Prentice Hall, 2001. Wikipedia, http://en.wikipedia.org/wiki/Comparison_of_network_monitoring_systems CIT 470: Advanced Network and System Administration