Download presentation
Presentation is loading. Please wait.
Published byTracey Barker Modified over 8 years ago
1
Streamlining Monitoring Infrastructure in IT-DB-IMS Charles Newey charlie@assemblyco.de › 19.08.2015
2
Part 1: Log Monitoring Charles Newey (charlie@assemblyco.de)2 Log Monitoring: Getting, processing, analysing, visualising, and reacting to information in log files …at scale? Log monitoring IRL Image: "Counting the rings" - Sam Beebe
3
Part 1: Log Monitoring Charles Newey (charlie@assemblyco.de)3 › Proprietary? › Splunk › Open-source? › Elasticsearch › Logstash › Kibana Also known as the “ELK stack” Elk- Wikimedia
4
Part 1: Log Monitoring Charles Newey (charlie@assemblyco.de)4 › Elasticsearch › High-performance scalable search engine › Logstash › Log transport and processing daemon › Logstash-Forwarder › Lightweight log shipper › Kibana › Visualisation dashboard for Elasticsearch Also known as the “ELK stack” Elk- Wikimedia
5
Part 1: Log Monitoring Charles Newey (charlie@assemblyco.de)5 › I’d never done anything with Puppet before › Puppet code is very easy to write … badly “The moment that Puppet goes wrong” DevOps Reactions
6
Part 2: Metric Collection and Visualisation › Need to store and visualise metrics for many machines › What needs to be measured? › Load average (1, 5, 15 min) › CPU temperature › Network connectivity (latency between gateways, etc.) › … and lots more › OpenTSDB! › A time-series database running on top of HBase › Data replicated 3 times (inside HDFS) Charles Newey (charlie@assemblyco.de)6
7
Part 2: Metric Collection and Visualisation › Metric collection agents? › tcollector (Python) › scollector (Go) › Building and distributing RPMs with Koji › Visualising with GNUPlot and Grafana Charles Newey (charlie@assemblyco.de)7
8
Part 3: WebLogic Log Analysis › WebLogic’s logging architecture isn’t designed well… › Parsing WebLogic log files is difficult › What are the implications of this? › Lots of regular expressions! Charles Newey (charlie@assemblyco.de)8 Source: XKCD
9
Part 3: WebLogic Log Analysis It was a challenge getting Logstash to parse WebLogic logs… › If you want logs to be readable and useful: › Don’t use several different formats for log messages › Don’t use several different (localised) formats for timestamps › Don’t make several different processes log to the same file › Don’t write multiline Java stack traces to a file… why? › Line-oriented tools won’t work (sed/awk/grep/etc) › Multiline logs must be parsed by a single Logstash thread Charles Newey (charlie@assemblyco.de)9
10
Part 3: WebLogic Log Analysis › Most importantly… › Don’t do all of these things at once Charles Newey (charlie@assemblyco.de)10
11
WebLogic Exceptions Dashboard Charles Newey (charlie@assemblyco.de) 11
12
Impact › Log files are searchable, easy to visualise and analyse › Can analyse downtime, traffic spikes, load spikes, etc › System metrics can be collected and stored indefinitely on HDFS for search, visualisation and diagnosis › WebLogic exceptions can be pinpointed, searched and visualised in NRT, making it easy to report to developers Charles Newey (charlie@assemblyco.de)12
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.