Problem Diagnosis & VISUALIZATION

Slides:



Advertisements
Similar presentations
 Open source software framework designed for storage and processing of large scale data on clusters of commodity hardware  Created by Doug Cutting and.
Advertisements

Mapreduce and Hadoop Introduce Mapreduce and Hadoop
Based on the text by Jimmy Lin and Chris Dryer; and on the yahoo tutorial on mapreduce at index.html
UC Berkeley Online System Problem Detection by Mining Console Logs Wei Xu* Ling Huang † Armando Fox* David Patterson* Michael Jordan* *UC Berkeley † Intel.
A Hadoop Overview. Outline Progress Report MapReduce Programming Hadoop Cluster Overview HBase Overview Q & A.
Cloud Computing Resource provisioning Keke Chen. Outline  For Web applications statistical Learning and automatic control for datacenters  For data.
Look Who’s Talking: Discovering Dependencies between Virtual Machines Using CPU Utilization HotCloud 10 Presented by Xin.
 Need for a new processing platform (BigData)  Origin of Hadoop  What is Hadoop & what it is not ?  Hadoop architecture  Hadoop components (Common/HDFS/MapReduce)
U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Virtualization in Data Centers Prashant Shenoy
16: Distributed Systems1 DISTRIBUTED SYSTEM STRUCTURES NETWORK OPERATING SYSTEMS The users are aware of the physical structure of the network. Each site.
7/14/2015EECS 584, Fall MapReduce: Simplied Data Processing on Large Clusters Yunxing Dai, Huan Feng.
UC Berkeley Improving MapReduce Performance in Heterogeneous Environments Matei Zaharia, Andy Konwinski, Anthony Joseph, Randy Katz, Ion Stoica University.
Google Distributed System and Hadoop Lakshmi Thyagarajan.
Copyright © 2012 Cleversafe, Inc. All rights reserved. 1 Combining the Power of Hadoop with Object-Based Dispersed Storage.
Dr. G Sudha Sadhasivam Professor, CSE PSG College of Technology Coimbatore INTRODUCTION TO HADOOP.
Take An Internal Look at Hadoop Hairong Kuang Grid Team, Yahoo! Inc
Hadoop, Hadoop, Hadoop!!! Jerome Mitchell Indiana University.
Advanced Topics: MapReduce ECE 454 Computer Systems Programming Topics: Reductions Implemented in Distributed Frameworks Distributed Key-Value Stores Hadoop.
Research on cloud computing application in the peer-to-peer based video-on-demand systems Speaker : 吳靖緯 MA0G rd International Workshop.
THE HOG LANGUAGE A scripting MapReduce language. Jason Halpern Testing/Validation Samuel Messing Project Manager Benjamin Rapaport System Architect Kurry.
Introduction and Overview Questions answered in this lecture: What is an operating system? How have operating systems evolved? Why study operating systems?
CS525: Special Topics in DBs Large-Scale Data Management Hadoop/MapReduce Computing Paradigm Spring 2013 WPI, Mohamed Eltabakh 1.
Presented by CH.Anusha.  Apache Hadoop framework  HDFS and MapReduce  Hadoop distributed file system  JobTracker and TaskTracker  Apache Hadoop NextGen.
MapReduce: Hadoop Implementation. Outline MapReduce overview Applications of MapReduce Hadoop overview.
Hadoop/MapReduce Computing Paradigm 1 Shirish Agale.
Introduction to Hadoop and HDFS
f ACT s  Data intensive applications with Petabytes of data  Web pages billion web pages x 20KB = 400+ terabytes  One computer can read
Automated Problem Diagnosis for Production Systems Soila P. Kavulya Scott Daniels (AT&T), Kaustubh Joshi (AT&T), Matti Hiltunen (AT&T), Rajeev Gandhi (CMU),
Benchmarking MapReduce-Style Parallel Computing Randal E. Bryant Carnegie Mellon University.
Apache Hadoop Daniel Lust, Anthony Taliercio. What is Apache Hadoop? Allows applications to utilize thousands of nodes while exchanging thousands of terabytes.
Presented by: Katie Woods and Jordan Howell. * Hadoop is a distributed computing platform written in Java. It incorporates features similar to those of.
CS525: Big Data Analytics MapReduce Computing Paradigm & Apache Hadoop Open Source Fall 2013 Elke A. Rundensteiner 1.
IBM Research ® © 2007 IBM Corporation Introduction to Map-Reduce and Join Processing.
Hadoop/MapReduce Computing Paradigm 1 CS525: Special Topics in DBs Large-Scale Data Management Presented By Kelly Technologies
Hadoop & Neptune Feb 김형준.
Experiments in Utility Computing: Hadoop and Condor Sameer Paranjpye Y! Web Search.
{ Tanya Chaturvedi MBA(ISM) Hadoop is a software framework for distributed processing of large datasets across large clusters of computers.
Next Generation of Apache Hadoop MapReduce Owen
INTRODUCTION TO HADOOP. OUTLINE  What is Hadoop  The core of Hadoop  Structure of Hadoop Distributed File System  Structure of MapReduce Framework.
By: Joel Dominic and Carroll Wongchote 4/18/2012.
1 Student Date Time Wei Li Nov 30, 2015 Monday 9:00-9:25am Shubbhi Taneja Nov 30, 2015 Monday9:25-9:50am Rodrigo Sanandan Dec 2, 2015 Wednesday9:00-9:25am.
BIG DATA/ Hadoop Interview Questions.
Hadoop. Introduction Distributed programming framework. Hadoop is an open source framework for writing and running distributed applications that.
Map reduce Cs 595 Lecture 11.
Hadoop Aakash Kag What Why How 1.
Introduction to Distributed Platforms
INTRODUCTION TO BIGDATA & HADOOP
What is Apache Hadoop? Open source software framework designed for storage and processing of large scale data on clusters of commodity hardware Created.
Software Architecture in Practice
Chapter 14 Big Data Analytics and NoSQL
Applying Control Theory to Stream Processing Systems
DISTRIBUTED SYSTEMS Principles and Paradigms Second Edition ANDREW S
Introduction to MapReduce and Hadoop
Introduction to HDFS: Hadoop Distributed File System
Understanding Real World Data Corruptions in Cloud Systems
Hadoop Clusters Tess Fulkerson.
Software Engineering Introduction to Apache Hadoop Map Reduce
MapReduce Computing Paradigm Basics Fall 2013 Elke A. Rundensteiner
Replication Middleware for Cloud Based Storage Service
Ministry of Higher Education
MapReduce Simplied Data Processing on Large Clusters
The Basics of Apache Hadoop
湖南大学-信息科学与工程学院-计算机与科学系
EECS 498 Introduction to Distributed Systems Fall 2017
Hadoop Basics.
Hadoop Technopoints.
Overview of big data tools
Lecture 16 (Intro to MapReduce and Hadoop)
Distributed Systems CS
MapReduce: Simplified Data Processing on Large Clusters
Presentation transcript:

Problem Diagnosis & VISUALIZATION Jiaqi Tan, Soila Kavulya, Xinghao Pan, Mike Kasick, Keith Bare, Eugene Marinelli, Rajeev Gandhi Priya Narasimhan Carnegie Mellon University

My Background 15 years working in fault-tolerant middleware Developed transparent fault-tolerant middleware Eternal, Immune Standards: Fault-Tolerant CORBA, Portable Interceptors for CORBA Hard problems addressed Strongly consistent replication & recovery Application-transparent fault tolerance Resolving conflicts between real-time & fault tolerance Overcoming nondeterminism, unrealistic assumptions Zero-downtime large-scale software upgrades

And Onto Automated Problem Diagnosis Diagnosing problems Creates major headaches for administrators Worsens as scale and system complexity grows Goal: automate it and get proactive Failure detection and prediction Problem determination (or “fingerpointing”) How: Instrumentation plus statistical analysis Carnegie Mellon University Priya Narasimhan © September 18September 18

Challenges in Problem Analysis Challenging in large-scale networked environment Can have multiple failure manifestations with a single root cause Can have multiple root causes for a single failure manifestation Problems and/or their manifestations can “travel” among communicating components A lot of information from multiple sources – what to use? what to discard? Automated fingerpointing Automatically discover faulty node in a distributed system Carnegie Mellon University Priya Narasimhan © September 18September 18

Exploration Current explorations Studied Hadoop PVFS Lustre Open-source implementation of Map/Reduce (Yahoo!), popular cloud-computing platform PVFS High-performance file system (Argonne National Labs) Lustre High-performance file system (Sun Microsystems) Studied Various types of problems Various kinds of instrumentation Various kinds of data-analysis techniques Carnegie Mellon University Priya Narasimhan © September 18September 18

Hadoop 101 Map/Reduce tasks JobTracker TaskTracker NameNode DataNode SLAVE NODES MASTER NODE Map/Reduce tasks JobTracker TaskTracker NameNode DataNode HDFS blocks Carnegie Mellon University Priya Narasimhan © September 18September 18

Why? Hadoop is fault-tolerant Heartbeats: detect lost nodes Speculative re-execution: recover work due to lost/laggard nodes Hadoop’s fault-tolerance can mask performance problems Nodes alive but slow Target failures for our diagnosis Performance degradations (slow, hangs) Carnegie Mellon University Priya Narasimhan © September 18September 18

Goals, Non-Goals Diagnose faulty Master/Slave node to user/admin Target production environment Don’t instrument Hadoop or applications additionally Use Hadoop logs as-is (white-box strategy) Use OS-level metrics (black-box strategy) Work for various workloads and under workload changes Support online and offline diagnosis Non-goals (for now) Tracing problem to offending line of code Carnegie Mellon University Priya Narasimhan © September 18September 18

Target Hadoop Clusters 4000-processor Yahoo!’s M45 cluster Production environment (managed by Yahoo!) Offered to CMU as free cloud-computing resource Diverse kinds of real workloads, problems in the wild Massive machine-learning, language/machine-translation Permission to harvest all logs and OS data each week 100-node Amazon’s EC2 cluster Production environment (managed by Amazon) Commercial, pay-as-you-use cloud-computing resource Workloads under our control, problems injected by us gridmix, nutch, pig, sort, randwriter Can harvest logs and OS data of only our workloads Carnegie Mellon University Priya Narasimhan © September 18September 18

Some Performance Problems Studied Fault Description Resource contention CPU hog External process uses 70% of CPU Packet-loss 5% or 50% of incoming packets dropped Disk hog 20GB file repeatedly written to Disk full Application bugs Source: Hadoop JIRA HADOOP-1036 Maps hang due to unhandled exception HADOOP-1152 Reduces fail while copying map output HADOOP-2080 Reduces fail due to incorrect checksum HADOOP-2051 Jobs hang due to unhandled exception HADOOP-1255 Infinite loop at Nameode Studied Hadoop Issue Tracker (JIRA) from Jan-Dec 2007 Carnegie Mellon University Priya Narasimhan © September 18September 18

Hadoop: Instrumentation SLAVE NODES MASTER NODE Map/Reduce tasks JobTracker TaskTracker NameNode DataNode OS data OS data HDFS blocks Hadoop logs Hadoop logs Carnegie Mellon University Priya Narasimhan © September 18September 18

How About Those Metrics? White-box metrics (from Hadoop logs) Event-driven (based on Hadoop’s activities) Durations Map-task durations, Reduce-task durations, ReduceCopy-durations, etc. System-wide dependencies between tasks and data blocks Heartbeat information: Heartbeat rates, Heartbeat-timestamp skew between the Master and Slave nodes Black-box metrics (from OS /proc) 64 different time-driven metrics (sampled every second) Memory used, context-switch rate, User-CPU usage, System-CPU usage, I/O wait time, run-queue size, number of bytes transmitted, number of bytes received, pages in, pages out, page faults Carnegie Mellon University Priya Narasimhan © September 18September 18

Log-Analysis Approach SALSA: Analyzing Logs as StAte Machines [USENIX WASL 2008] Extract state-machine views of execution from Hadoop logs Distributed control-flow view of logs Distributed data-flow view of logs Diagnose failures based on statistics of these extracted views Control-flow based diagnosis Control-flow + data-flow based diagnosis Perform analysis incrementally so that we can support it online Carnegie Mellon University Priya Narasimhan © September 18September 18

Applying SALSA to Hadoop Data-flow view: transfer of data to other nodes Control-flow view: state orders, durations Map outputs to Reduce tasks on other nodes [t] Launch Reduce task : [t] Reduce is idling, waiting for Map outputs [t] Repeat until all Map outputs copied [t] Start Reduce Copy (of completed Map output) [t] Finish Reduce Copy [t] Reduce Merge Copy [t] Launch Map task : [t] Copy Map outputs [t] Map task done Incoming Map outputs for this Reduce task Carnegie Mellon University Priya Narasimhan © September 18September 18

Distributed Control+Data Flow Distributed control-flow Causal flow of task execution across cluster nodes, i.e., Reduces waiting on Maps via Shuffles Distributed data-flow Data paths of Map outputs shuffled to Reduces HDFS data blocks read into and written out of jobs Job-centric data-flows: Fused Control+Data Flows Correlate paths of data and execution Create conjoined causal paths from data source before, to data destination after, processing Helps to trace correlated performance problems http://www.pdl.cmu.edu/ <your name here> © September 18September 18

What Else Do We Do? Analyze black-box data with similar intuition Derive PDFs and use a clustering approach Distinct behavior profiles of metric correlations Compare them across nodes Technique called Ganesha [HotMetrics 2009] Analyze heartbeat traffic Compare heartbeat durations across nodes Compare heartbeat-timestamp skews across nodes Different metrics, different viewpoints, different algorithms Carnegie Mellon University Priya Narasimhan © September 18September 18

Carnegie Mellon University Priya Narasimhan © September 18September 18

Putting the Elephant Together JobTracker Durations views TaskTracker heartbeat timestamps TaskTracker Durations views JobTracker heartbeat timestamps Job-centric data flows Black-box resource usage BliMEy: Blind Men and the Elephant Framework [CMU-CS-09-135 ] Carnegie Mellon University Priya Narasimhan © September 18September 18

Visualization To uncover Hadoop’s execution in an insightful way To reveal outcome of diagnosis on sight To allow developers/admins to get a handle as the system scales Value to programmers [HotCloud 2009] Allows them to spot issues that might assist them in restructuring their code Allows them to spot faulty nodes Carnegie Mellon University Priya Narasimhan © September 18September 18

Visualization(heatmaps) CPU Hog on node 1 visible on Map-task durations Carnegie Mellon University Priya Narasimhan © September 18September 18

Visualization (heatmaps) TaskTracker hang on node 1 not visible on Map-task durations TaskTracker hang on node 1 visible on Reduce-task durations http://www.pdl.cmu.edu/ Priya Narasimhan © September 18September 18

Visualizations (swimlanes) Long-tailed map Delaying overall job completion time Carnegie Mellon University Priya Narasimhan © September 18September 18

Current Developments State-machine extraction + visualization being implemented for the Hadoop Chukwa project Collaboration with Yahoo! Web-based visualization widgets for HICC (Hadoop Infrastructure Care Center) “Swimlanes” currently available in Chukwa trunk (CHUKWA-279) http://www.pdl.cmu.edu/ <your name here> © September 18September 18

Briefly: Diagnosis for PVFS PVFS: High-performance file system Focus on different kinds of problems Disk-related and network-related problems Motivated by anecdotal experiences of Argonne National Labs researchers Exploit the peer-similarity principle across I/O servers and metadata servers Diagnosis based on different instrumentation Black-box OS performance metrics, system-call traces http://www.pdl.cmu.edu/ Priya Narasimhan © September 18September 18

Briefly: Online Fingerpointing ASDF: Automated System for Diagnosing Failures Can incorporate any number of different data sources Can use any number of analysis techniques to process this data Can support online or offline analyses for Hadoop Currently plugging in our white-box & black-box algorithms Carnegie Mellon University Priya Narasimhan © September 18September 18

Example: ASDF for Hadoop config file data-analysis modules fpt-core administrator Maps/Reduces TaskTracker JobTracker Hadoop Log data-collection modules config file data-analysis modules data-collection modules fpt-core administrator http://www.pdl.cmu.edu/ Priya Narasimhan © September 18September 18

Briefly: Diagnosis-Driven Recovery The Problem: Naïve recovery might not mitigate the problem Problem might escalate Problem might still remain and show up later What if recovery were informed by diagnosis, in real-time? Preemptively migrate tasks to non-faulty servers, based on monitored data Carnegie Mellon University Priya Narasimhan © September 18September 18

Hard Problems Understanding the limits of black-box fingerpointing What failures are outside the reach of a black-box approach? What are the limits of “peer” comparison? What other kinds of black-box instrumentation exist? Scalability Scaling to run across large systems and understanding “growing pains” Visualization Helping system administrators visualize problem diagnosis Trade-offs More instrumentation and more frequent data can improve accuracy of diagnosis, but at what performance cost? Virtualized environments Do these environments help/hurt problem diagnosis? Carnegie Mellon University Priya Narasimhan © September 18September 18

Summary Automated problem diagnosis Current targets: Hadoop, PVFS, Lustre Initial set of failures Real-world bug databases, problems in the wild Short-term: Transition techniques into Hadoop code-base working with Yahoo! Long-term Scalability, scalability, scalability, …. Expand fault study Improve visualization, working with users Additional details USENIX WASL 2008 (white-box log analysis) USENIX HotCloud 2009 (visualization) USENIX HotMetrics 2009 (black-box metric analysis) HotDep 2009 (black-box analysis for PVFS) Carnegie Mellon University Priya Narasimhan © September 18September 18

For more information: http://www.ece.cmu.edu/~fingerpointing priya@cs.cmu.edu Carnegie Mellon University September 18