CIT 470: Advanced Network and System AdministrationSlide #1 CIT 470: Advanced Network and System Administration System Monitoring.

Slides:



Advertisements
Similar presentations
ActiveXperts Network Monitor Monitors servers, workstations and devices for availability Alerts and corrects.
Advertisements

Network Management Workshop intERlab at AIT Thailand March 11-15, 2008 Network Operations and Network Management.
CIT 470: Advanced Network and System AdministrationSlide #1 CIT 470: Advanced Network and System Administration Debugging.
MCTS GUIDE TO MICROSOFT WINDOWS 7 Chapter 10 Performance Tuning.
ADVANCED FUNCTIONALITY & TROUBLESHOOTING. Page 2 Agenda Main topics Advanced Policy Manager Server configuration Resolving Apache Web Server security.
1 Chapter Overview Understanding Printer Administration Managing Printers Managing Documents Administering Printers Using a Web Browser Troubleshooting.
20-753: Fundamentals of Web Programming 1 Lecture 11: Web Server Case Study Fundamentals of Web Programming Lecture 11: Web Server Case Study.
Implementing a Highly Available Network
ManageEngine TM Applications Manager 8 Monitoring Custom Applications.
1 ITC242 – Introduction to Data Communications Week 12 Topic 18 Chapter 19 Network Management.
NGOP J.Fromm K.Genser T.Levshina M.Mengel V.Podstavkov.
Measuring Performance Chapter 12 CSE807. Performance Measurement To assist in guaranteeing Service Level Agreements For capacity planning For troubleshooting.
MCTS Guide to Microsoft Windows Server 2008 Network Infrastructure Configuration Chapter 8 Introduction to Printers in a Windows Server 2008 Network.
Chapter 11: Maintaining and Optimizing Windows Vista
Monitoring System Monitors Basics Monitor Types Alarms Actions RRD Charts Reports.
Maintaining and Updating Windows Server 2008
Voyager Server Security and Monitoring Best practices and tools.
OpStor - A multi vendor storage resource management and capacity forecasting software.
© 2006, The Technology FirmWWW.THETECHFIRM.COM 1 WINDOWS XP SUPPORT TOOLS.
IBM Software Group Washington Area Informix User Group Forum 2004 The DB2 DBA Checklist Dwaine R Snow, DB2 & Informix.
New SA Training Topic 9: Logging, Monitoring, and Performance  Logging  Windows – “Auditing”  Linux – syslog  Monitoring  MRTG  Big Brother  Performance.
1 Network Statistic and Monitoring System Wayne State University Division of Computing and Information Technology Information Technology.
CIT 470: Advanced Network and System AdministrationSlide #1 CIT 470: Advanced Network and System Administration Help Desks.
CIT 470: Advanced Network and System AdministrationSlide #1 CIT 470: Advanced Network and System Administration Help Desks.
MCTS Guide to Microsoft Windows 7
CIT 470: Advanced Network and System AdministrationSlide #1 CIT 470: Advanced Network and System Administration Performance Monitoring.
CIT 470: Advanced Network and System AdministrationSlide #1 CIT 470: Advanced Network and System Administration Performance Monitoring.
Josh Riggs Utilizing Open Source Network Monitoring.
Course Presentation EEL5881, Fall, 2003 Project: Network Reliability Tests Project: Network Reliability Tests Team: Gladiator Team: Gladiator Shuxin Li.
CIT 470: Advanced Network and System AdministrationSlide #1 CIT 470: Advanced Network and System Administration People.
Module 7: Fundamentals of Administering Windows Server 2008.
Network Management Tool Amy Auburger. 2 Product Overview Made by Ipswitch Affordable alternative to expensive & complicated Network Management Systems.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment, Enhanced Chapter 11: Monitoring Server Performance.
Overview of MSS System Human Actors Non-Human Actors In-house developed components Third party products.
Module 10: Monitoring ISA Server Overview Monitoring Overview Configuring Alerts Configuring Session Monitoring Configuring Logging Configuring.
TELE 301 Lecture 10: Scheduled … 1 Overview Last Lecture –Post installation This Lecture –Scheduled tasks and log management Next Lecture –DNS –Readings:
Guide to Linux Installation and Administration, 2e1 Chapter 10 Managing System Resources.
workshop eugene, oregon What is network management? System & Service monitoring  Reachability, availability Resource measurement/monitoring.
Computer Emergency Notification System (CENS)
Network Monitoring System for the UNIX Lab Bradley Kita Capstone Project Mentor: Dr C. David Shaffer Fall 2004/Spring 2005.
Linux Networking and Security
What’s New in WatchGuard XCS v9.1 Update 1. WatchGuard XCS v9.1 Update 1  Enhancements that improve ease of use New Dashboard items  Mail Summary >
Network Monitoring Manage your business without blowing your budget. Learn how the Calhoun ISD utilizes free “Open Source” tools for real-time monitoring.
CIT 470: Advanced Network and System AdministrationSlide #1 CIT 470: Advanced Network and System Administration Change and Configuration Management.
1 Implementing Monitoring and Reporting. 2 Why Should Implement Monitoring? One of the biggest complaints we hear about firewall products from almost.
1 Periodic Processes and the cron Daemon The cron daemon is where all timed events are initiated. The cron system is serviced by the cron daemon. What.
Business Data Communications, Fourth Edition Chapter 11: Network Management.
CIT 470: Advanced Network and System AdministrationSlide #1 CIT 470: Advanced Network and System Administration Disaster Recovery.
1 Oracle Enterprise Manager Slides from Dominic Gélinas CIS
CIT 470: Advanced Network and System AdministrationSlide #1 CIT 470: Advanced Network and System Administration Change and Configuration Management.
Monitoring and Managing Server Performance. Server Monitoring To become familiar with the server’s performance – typical behavior Prevent problems before.
Network Management Mechanisms Two major network management protocols: Simple Network Management Protocol (SNMP) Common Management Information Protocol.
Maintaining and Updating Windows Server 2008 Lesson 8.
Difference between External and Internal Server Monitoring.
CIT 470: Advanced Network and System AdministrationSlide #1 CIT 470: Advanced Network and System Administration Services.
1 Major Printer Administration Tasks Managing printers Managing documents Troubleshooting printers Performing tasks requiring the Manage Printers permission.
Nagios - introduction Dhruba Raj Bhandari ( CCNA ) p Additions by Phil Regnauld.
CIT 470: Advanced Network and System Administration
Network Operations and Network Management
Network Monitoring System
CIT 470: Advanced Network and System Administration
CIT 470: Advanced Network and System Administration
CIT 470: Advanced Network and System Administration
CIT 470: Advanced Network and System Administration
CIT 470: Advanced Network and System Administration
Nate Nelson I*LEVEL, Inc.
CIT 470: Advanced Network and System Administration
Network Monitoring System
CIT 470: Advanced Network and System Administration
Presentation transcript:

CIT 470: Advanced Network and System AdministrationSlide #1 CIT 470: Advanced Network and System Administration System Monitoring

CIT 470: Advanced Network and System AdministrationSlide #2 Topics 1.Why monitoring? 2.Historical monitoring 3.Real-time monitoring 4.Monitoring techniques 5.Monit 6.Performance monitoring. 7.Performance tuning.

CIT 470: Advanced Network and System AdministrationSlide #3 Why Monitoring? “If you aren’t monitoring a service, you can’t manage it.”

CIT 470: Advanced Network and System AdministrationSlide #4 Why Monitoring? 1.Rapidly detect and fix problems. 2.Identify the source of problems. 3.Predict and avoid future problems. 4.Document an SA’s achievements.

CIT 470: Advanced Network and System AdministrationSlide #5 Historical Monitoring Record long-term system statistics. Uptime. Performance. Security. Utilizations. Examples Web server uptime was 99.99% last year, compared to 99.9% the previous year. Peak network usage is 8 MBps, up from 5 MBps last year. Uses Capacity planning. Planning for reliability or security improvements.

CIT 470: Advanced Network and System AdministrationSlide #6 Historical Monitoring Processes Polling Take measurements at regular intervals. Store database of measurements. Graph summaries of collected data. Measurement Tools iostat vmstat ps sar

CIT 470: Advanced Network and System AdministrationSlide #7 Real-time Monitoring Alert SA to failures as they happen. Discover problems before customer does. Shorter outages. Better reputation. Real-time Monitor components Monitoring system (poll or alert). Notification system.

CIT 470: Advanced Network and System AdministrationSlide #8 Real-time Monitoring Techniques Polling Poll systems and applications for status. Ex: ping critical servers every 5 minutes. Alerting Many systems can send alerts to monitoring system when they detect a problem. Ex: RAID array logs a disk failure.

CIT 470: Advanced Network and System AdministrationSlide #9 Notification Types of notification 1. 2.Paging 3.Phone call Reliability 1.Notification system should not depend on system being monitored. 2. can fail or have long delays. 3.Pages are susceptible to third party failures and monitoring.

CIT 470: Advanced Network and System AdministrationSlide #10 Escalation What if the SA is on vacation? Notifications need to be transferrable. Static: reconfigure notifier before vacation. Dynamic: configurable set of receipients. Ex: If SA doesn’t respond in 1 hour, notify manager.

CIT 470: Advanced Network and System AdministrationSlide #11 Types of monitoring Availability Watch for outages in network, host, apps. Ex: cannot reach mail server. Capacity Check thresholds for CPU, mem, disk, network. Ex: mail spool disk is 95% full

CIT 470: Advanced Network and System AdministrationSlide #12 Active Monitoring Active monitoring systems can fix problems. 1.Respond faster than a human can. 2.Can typically only implement temporary fix. 3.Can’t fix some problems: bad disk, out of paper. Risks Reliability: Test active responses thoroughly before deployment. Security: Active monitor typically needs admin access on all monitored systems.

CIT 470: Advanced Network and System AdministrationSlide #13 Levels of Testing 1.Check server is pingable. Verifies network connectivity from monitor only. 2.Check that application is up. Make a TCP connection to service port. Check process or service list. 3.End-to-end testing. Entire transaction as customer would do. Ex: send and receive an message.

CIT 470: Advanced Network and System AdministrationSlide #14 Running monit Starting monit [-v] Status monit status monit summary (also provides web interface on port 2812) Stopping monit quit

CIT 470: Advanced Network and System AdministrationSlide #15 Global configuration set daemon 60 set logfile syslog facility log_daemon set alert set httpd port 2812 address localhost allow localhost allow admin:monit

CIT 470: Advanced Network and System AdministrationSlide #16 Monitoring a Process check process apache with pidfile "/usr/local/apache/logs/httpd.pid" start = “/etc/init.d/httpd start" stop = "/etc/init.d/httpd stop" if failed port 80 and protocol http and request "/cgi-bin/printenv" then restart if cpu usage is greater than 60 percent for 2 cycles then alert if cpu usage > 98% for 5 cycles then restart if 2 restarts within 3 cycles then timeout

CIT 470: Advanced Network and System AdministrationSlide #17 Monitoring a File # Rotate log if it gets too big check file access_log with path /var/log/access_log if size > 100 Mb then exec "/usr/sbin/logrotate -f rotate_apache_now“ # Restart Apache if config changes check file httpd.conf with path /usr/local/apache/conf/httpd.conf if changed checksum then exec "/usr/local/apache/bin/apachectl graceful"

CIT 470: Advanced Network and System AdministrationSlide #18 Monitoring CPU check system localhost if loadavg (1min) > 5 then alert if loadavg (5min) > 3 then alert if memory usage > 80% then alert if cpu usage (user) > 80% then alert

CIT 470: Advanced Network and System AdministrationSlide #19 Monitoring a Disk check device rootfs with path / if space usage > 90% then alert check device varfs with path /var if space usage > 90% then alert

CIT 470: Advanced Network and System AdministrationSlide #20 Monitoring Remote Hosts # Ping the host to see if it’s up check host foo with address foo.com if failed icmp type echo with timeout 15 seconds then alert # Detailed test, accessing web services check host foo with address foo if failed port 80 protocol http and request “/status” then alert if failed port 443 type TCPSSL and protocol http with timeout 15 seconds then alert

CIT 470: Advanced Network and System AdministrationSlide #21 References 1.Mark Burgess, Principles of System and Network Administration, Wiley, Aeleen Frisch, Essential System Administration, 3 rd edition, O’Reilly, Mike Loukides and Gian-Paolo D. Musumeci, System Performance Tuning, 2 nd edition, O’Reilly, Monit doc, 5.Evi Nemeth et al, UNIX System Administration Handbook, 3 rd edition, Prentice Hall, 2001.