Causal Models, Learning Algorithms and their Application to Performance Modeling Jan Lemeire Parallel Systems lab November 15 th 2006.

Slides:



Advertisements
Similar presentations
Bayesian network for gene regulatory network construction
Advertisements

Autonomic Scaling of Cloud Computing Resources
Learning Causal Models of Multivariate Systems and the Value of it for the Performance Modeling of Computer Programs Jan Lemeire December 19 th 2007 Supervisor:
When are Graphical Causal Models not Good Models? CAPITS 2008 Jan Lemeire September 12 th 2008.
CSE 473/573 Computer Vision and Image Processing (CVIP) Ifeoma Nwogu Lecture 27 – Overview of probability concepts 1.
Probabilistic Reasoning Bayesian Belief Networks Constructing Bayesian Networks Representing Conditional Distributions Summary.
Probabilistic analog of clustering: mixture models
1 Chapter 5 Belief Updating in Bayesian Networks Bayesian Networks and Decision Graphs Finn V. Jensen Qunyuan Zhang Division. of Statistical Genomics,
Bayesian Network and Influence Diagram A Guide to Construction And Analysis.
Outline 1)Motivation 2)Representing/Modeling Causal Systems 3)Estimation and Updating 4)Model Search 5)Linear Latent Variable Models 6)Case Study: fMRI.
Week 11 Review: Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution.
Rutgers CS440, Fall 2003 Review session. Rutgers CS440, Fall 2003 Topics Final will cover the following topics (after midterm): 1.Uncertainty & introduction.
Introduction of Probabilistic Reasoning and Bayesian Networks
Overview of Inference Algorithms for Bayesian Networks Wei Sun, PhD Assistant Research Professor SEOR Dept. & C4I Center George Mason University, 2009.
Using Markov Blankets for Causal Structure Learning Jean-Philippe Pellet Andre Ellisseeff Presented by Na Dai.
From: Probabilistic Methods for Bioinformatics - With an Introduction to Bayesian Networks By: Rich Neapolitan.
Regulatory Network (Part II) 11/05/07. Methods Linear –PCA (Raychaudhuri et al. 2000) –NIR (Gardner et al. 2003) Nonlinear –Bayesian network (Friedman.
1Causality & MDL Causal Models as Minimal Descriptions of Multivariate Systems Jan Lemeire June 15 th 2006.
5/25/2005EE562 EE562 ARTIFICIAL INTELLIGENCE FOR ENGINEERS Lecture 16, 6/1/2005 University of Washington, Department of Electrical Engineering Spring 2005.
1 gR2002 Peter Spirtes Carnegie Mellon University.
Today Logistic Regression Decision Trees Redux Graphical Models
Bayesian Networks Alan Ritter.
Computer vision: models, learning and inference Chapter 10 Graphical Models.
Statistical Learning: Pattern Classification, Prediction, and Control Peter Bartlett August 2002, UC Berkeley CIS.
1 Bayesian Networks Chapter ; 14.4 CS 63 Adapted from slides by Tim Finin and Marie desJardins. Some material borrowed from Lise Getoor.
Causal Modeling for Anomaly Detection Andrew Arnold Machine Learning Department, Carnegie Mellon University Summer Project with Naoki Abe Predictive Modeling.
1 Day 2: Search June 9, 2015 Carnegie Mellon University Center for Causal Discovery.
If we measured a distribution P, what is the tree- dependent distribution P t that best approximates P? Search Space: All possible trees Goal: From all.
Bayes Net Perspectives on Causation and Causal Inference
Machine Learning CUNY Graduate Center Lecture 21: Graphical Models.
Overview G. Jogesh Babu. Probability theory Probability is all about flip of a coin Conditional probability & Bayes theorem (Bayesian analysis) Expectation,
A Brief Introduction to Graphical Models
Using Bayesian Networks to Analyze Expression Data N. Friedman, M. Linial, I. Nachman, D. Hebrew University.
Bayesian Learning By Porchelvi Vijayakumar. Cognitive Science Current Problem: How do children learn and how do they get it right?
第十讲 概率图模型导论 Chapter 10 Introduction to Probabilistic Graphical Models
Aprendizagem Computacional Gladys Castillo, UA Bayesian Networks Classifiers Gladys Castillo University of Aveiro.
Using Bayesian Networks to Analyze Whole-Genome Expression Data Nir Friedman Iftach Nachman Dana Pe’er Institute of Computer Science, The Hebrew University.
Introduction to Bayesian Networks
Generalizing Variable Elimination in Bayesian Networks 서울 시립대학원 전자 전기 컴퓨터 공학과 G 박민규.
Ch 8. Graphical Models Pattern Recognition and Machine Learning, C. M. Bishop, Revised by M.-O. Heo Summarized by J.W. Nam Biointelligence Laboratory,
Course files
Computing & Information Sciences Kansas State University Data Sciences Summer Institute Multimodal Information Access and Synthesis Learning and Reasoning.
INTERVENTIONS AND INFERENCE / REASONING. Causal models  Recall from yesterday:  Represent relevance using graphs  Causal relevance ⇒ DAGs  Quantitative.
Slides for “Data Mining” by I. H. Witten and E. Frank.
The famous “sprinkler” example (J. Pearl, Probabilistic Reasoning in Intelligent Systems, 1988)
1 BN Semantics 1 Graphical Models – Carlos Guestrin Carnegie Mellon University September 15 th, 2008 Readings: K&F: 3.1, 3.2, –  Carlos.
Dependency Networks for Collaborative Filtering and Data Visualization UAI-2000 발표 : 황규백.
Lecture 2: Statistical learning primer for biologists
Latent regression models. Where does the probability come from? Why isn’t the model deterministic. Each item tests something unique – We are interested.
Data Mining and Decision Support
Learning Bayes Nets Based on Conditional Dependencies Oliver Schulte Department of Philosophy and School of Computing Science Simon Fraser University Vancouver,
Machine Learning – Lecture 11
1 CMSC 671 Fall 2001 Class #20 – Thursday, November 8.
1 Learning P-maps Param. Learning Graphical Models – Carlos Guestrin Carnegie Mellon University September 24 th, 2008 Readings: K&F: 3.3, 3.4, 16.1,
CAUSAL REASONING FOR DECISION AIDING SYSTEMS COGNITIVE SYSTEMS LABORATORY UCLA Judea Pearl, Mark Hopkins, Blai Bonet, Chen Avin, Ilya Shpitser.
04/21/2005 CS673 1 Being Bayesian About Network Structure A Bayesian Approach to Structure Discovery in Bayesian Networks Nir Friedman and Daphne Koller.
1Causal Performance Models Causal Models for Performance Analysis of Computer Systems Jan Lemeire TELE lab May 24 th 2006.
1 BN Semantics 2 – Representation Theorem The revenge of d-separation Graphical Models – Carlos Guestrin Carnegie Mellon University September 17.
1 BN Semantics 1 Graphical Models – Carlos Guestrin Carnegie Mellon University September 15 th, 2006 Readings: K&F: 3.1, 3.2, 3.3.
Bayesian Brain Probabilistic Approaches to Neural Coding 1.1 A Probability Primer Bayesian Brain Probabilistic Approaches to Neural Coding 1.1 A Probability.
1 Day 2: Search June 9, 2015 Carnegie Mellon University Center for Causal Discovery.
1 Day 2: Search June 14, 2016 Carnegie Mellon University Center for Causal Discovery.
1 BN Semantics 3 – Now it’s personal! Parameter Learning 1 Graphical Models – Carlos Guestrin Carnegie Mellon University September 22 nd, 2006 Readings:
CS 2750: Machine Learning Directed Graphical Models
Probability Theory and Parameter Estimation I
Markov Properties of Directed Acyclic Graphs
CSCI 5822 Probabilistic Models of Human and Machine Learning
Center for Causal Discovery: Summer Short Course/Datathon
Causal Models Lecture 12.
Chapter 14 February 26, 2004.
Presentation transcript:

Causal Models, Learning Algorithms and their Application to Performance Modeling Jan Lemeire Parallel Systems lab November 15 th 2006

Pag. 2 Causal Performance Models Overview I. Causal Models II. Learning Algorithms III. Performance Modeling IV. Extensions

Pag. 3 Causal Performance Models I. Multivariate Analysis Variables Probabilistic model of joint distribution? Relational information? A priori unknown relations Experimental data

Pag. 4 Causal Performance Models A. Representation of distributions Factorization Reduction of factorization complexity Bayesian Network Ordering 1Ordering 2

Pag. 5 Causal Performance Models Conditional independence Qualitative property: P(rain|quality of speech)=P(rain) ? Markov condition in graph Variable becomes independent from all its non-descendants by conditioning on its direct parents. – graphical d-separation criterion B. Representation of Independencies

Pag. 6 Causal Performance Models Faithfulness Faithfulness: Joint Distribution  Directed Acyclic Graph Conditional independencies  d-separation Theorem: if a faithful graph exists, it is the minimal factorization. Independence-map: All independencies in the Bayesian network appear in the distribution

Pag. 7 Causal Performance Models Definition through interventions causal model + Conditional Probability Distributions + Causal Markov Condition = Bayesian network C. Representation of Causal Mechanisms Model of the underlying physical mechanisms

Pag. 8 Causal Performance Models Reductionism Causal modeling = reductionism Canonical representation: unique, minimal, independent Building block = P(X i |parents(X i )) Whole theory is based on this modularity Intervention = change of block

Pag. 9 Causal Performance Models Ultimate motivation for causality Model = canonical representation able to explain all qualitative properties (independencies) close to reality If causal mechanisms are unrelated model is faithful

Pag. 10 Causal Performance Models II. Learning Algorithms Two types: Constraint-based based on the independencies Scoring-based searches set of all models, give a score of how good they represent distribution

Pag. 11 Causal Performance Models Step 1: Adjacency search Property: adjacent nodes do not become independent Algorithm: start with full-connected graph check for marginal independencies check for conditional independencies

Pag. 12 Causal Performance Models Step 2: Orientation Property: V-structure can be recognized Algorithm: look for v-structures derived rules

Pag. 13 Causal Performance Models Assumptions General statistical assumptions: No selection bias Random sample Sufficient data for correctness of statistical tests Underlying network is faithful Causal sufficiency No unknown common causes

Pag. 14 Causal Performance Models Criticism Definition causality? About predicting the effect of changes to the system Faithfulness assumption Eg.: accidental cancellation Causal Markov Condition “All relations are causal” Learning algorithms are not robust Statistical tests make mistakes

Pag. 15 Causal Performance Models Part III: Performance Analysis High-Performance computing 1 processor parallel system Performance Questions: Performance prediction System-dependency? Parameter-dependency? Reasons of bad performance? Effect of Optimizations?

Pag. 16 Causal Performance Models Causal modeling (cf. COMO lab, VUB) Representation form Close to reality Learning algorithms TETRAD tool (open-source, java) PhD??

Pag. 17 Causal Performance Models Performance Models Aim performance analysis Support software developer High-performance applications Expected properties offer insight into causes performance degradation prediction estimate effect of optimizations reusable submodels separate application and system-dependency reason under uncertainty causal models

Pag. 18 Causal Performance Models Integrated in statistical analysis Statistical characteristics Regression analysis Probability table compression Outlier detection Iterative process 1. Perform additional experiments 2. Extract additional characteristics 3. Indicate exceptions 4. Analyze the divergences of the data points with the current hypotheses

Pag. 19 Causal Performance Models A. Model construction Model of computation time of LU decom- position algorithm elementsize (redundant variable) is sufficient for influence datatype -> cache misses regression analysis on submodels X=f(parents) analysis of parameters

Pag. 20 Causal Performance Models B. Detection of unexpected dependencies Point-to-point communication performance background communication

Pag. 21 Causal Performance Models C. Finding explanations for outliers Exceptional data in communication performance measurements Probability table compression => derived variable Interesting features

Pag. 22 Causal Performance Models IV. Complexity of Performance Data Mixture discrete and continuous variables Mutual Information & Kernel Density Estimation Non-linear relations Mutual Information & Kernel Density Estimation Deterministic relations Augmented models & Complexity criterion Context variables Work in progress Context-specific independencies Work in progress

Pag. 23 Causal Performance Models A. Information-theoretic Dependency Entropy of random variable X Discretized entropy for continuous variable Mutual Information

Pag. 24 Causal Performance Models B. Kernel Density Estimation See applets Trade-off maximal entropy <> typicalness Conclusions Limited number data points needed Discretization of continuous data justified Form-free dependency measure

Pag. 25 Causal Performance Models C. Deterministic relations Y=f(X) Y becomes independent from Z conditioned on X ~ violation of the intersection condition (Pearl ’88) Not faithfully describable Solution: augmented causal model - add regularity to model - adapt inference algorithms

Pag. 26 Causal Performance Models The Complexity Criterion Select simplest relation X & Y contain equivalent information about Z

Pag. 27 Causal Performance Models Augmented causal model Consistent models under Complexity Increase assumption Restrict conditional independencies Generalize d-separation Reestablish faithfulness {

Pag. 28 Causal Performance Models Theory works! Deterministic A B Probabilistic

Pag. 29 Causal Performance Models Conclusions Benefit of the integration of statistical techniques Causal modeling is a challenge –wants to know the inner from the outer More information – –