The Visual Causality Analyst: An Interactive Interface for Causal Reasoning Jun Wang, Stony Brook University Klaus Mueller, Stony Brook University, SUNY.

Slides:



Advertisements
Similar presentations
CS188: Computational Models of Human Behavior
Advertisements

Bayesian network for gene regulatory network construction
A Tutorial on Learning with Bayesian Networks
Discovering Cyclic Causal Models by Independent Components Analysis Gustavo Lacerda Peter Spirtes Joseph Ramsey Patrik O. Hoyer.
© Department of Statistics 2012 STATS 330 Lecture 32: Slide 1 Stats 330: Lecture 32.
Weakening the Causal Faithfulness Assumption
Outline 1)Motivation 2)Representing/Modeling Causal Systems 3)Estimation and Updating 4)Model Search 5)Linear Latent Variable Models 6)Case Study: fMRI.
Forecasting Using the Simple Linear Regression Model and Correlation
Exact Inference in Bayes Nets
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Experiments We measured the times(s) and number of expanded nodes to previous heuristic using BFBnB. Dynamic Programming Intuition. All DAGs must have.
Graduate Center/City University of New York University of Helsinki FINDING OPTIMAL BAYESIAN NETWORK STRUCTURES WITH CONSTRAINTS LEARNED FROM DATA Xiannian.
Introduction of Probabilistic Reasoning and Bayesian Networks
Causal Networks Denny Borsboom. Overview The causal relation Causality and conditional independence Causal networks Blocking and d-separation Excercise.
Rafi Bojmel supervised by Dr. Boaz Lerner Automatic Threshold Selection for conditional independence tests in learning a Bayesian network.
Using Markov Blankets for Causal Structure Learning Jean-Philippe Pellet Andre Ellisseeff Presented by Na Dai.
Integrating Bayesian Networks and Simpson’s Paradox in Data Mining Alex Freitas University of Kent Ken McGarry University of Sunderland.
Regulatory Network (Part II) 11/05/07. Methods Linear –PCA (Raychaudhuri et al. 2000) –NIR (Gardner et al. 2003) Nonlinear –Bayesian network (Friedman.
Bayesian Network Representation Continued
Nemours Biomedical Research Statistics April 2, 2009 Tim Bunnell, Ph.D. & Jobayer Hossain, Ph.D. Nemours Bioinformatics Core Facility.
Project Update: Law Enforcement Resource Allocation (LERA) Visualization System Michael Welsman-Dinelle April Webster.
Modeling Gene Interactions in Disease CS 686 Bioinformatics.
1 gR2002 Peter Spirtes Carnegie Mellon University.
1 4. Multiple Regression I ECON 251 Research Methods.
© 2000 Prentice-Hall, Inc. Chap Forecasting Using the Simple Linear Regression Model and Correlation.
Causal Models, Learning Algorithms and their Application to Performance Modeling Jan Lemeire Parallel Systems lab November 15 th 2006.
Causal Modeling for Anomaly Detection Andrew Arnold Machine Learning Department, Carnegie Mellon University Summer Project with Naoki Abe Predictive Modeling.
1 Day 2: Search June 9, 2015 Carnegie Mellon University Center for Causal Discovery.
Review for Final Exam Some important themes from Chapters 9-11 Final exam covers these chapters, but implicitly tests the entire course, because we use.
Assessing Survival: Cox Proportional Hazards Model Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Bayes Net Perspectives on Causation and Causal Inference
Machine Learning CUNY Graduate Center Lecture 21: Graphical Models.
Summary of the Bayes Net Formalism David Danks Institute for Human & Machine Cognition.
Using Bayesian Networks to Analyze Expression Data N. Friedman, M. Linial, I. Nachman, D. Hebrew University.
Midterm Review Rao Vemuri 16 Oct Posing a Machine Learning Problem Experience Table – Each row is an instance – Each column is an attribute/feature.
Bayesian networks Classification, segmentation, time series prediction and more. Website: Twitter:
A Comparison Between Bayesian Networks and Generalized Linear Models in the Indoor/Outdoor Scene Classification Problem.
Using Bayesian Networks to Analyze Whole-Genome Expression Data Nir Friedman Iftach Nachman Dana Pe’er Institute of Computer Science, The Hebrew University.
Introduction to Bayesian Networks
Markov Random Fields Probabilistic Models for Images
Learning Linear Causal Models Oksana Kohutyuk ComS 673 Spring 2005 Department of Computer Science Iowa State University.
Module networks Sushmita Roy BMI/CS 576 Nov 18 th & 20th, 2014.
Ch 8. Graphical Models Pattern Recognition and Machine Learning, C. M. Bishop, Revised by M.-O. Heo Summarized by J.W. Nam Biointelligence Laboratory,
Course files
Learning the Structure of Related Tasks Presented by Lihan He Machine Learning Reading Group Duke University 02/03/2006 A. Niculescu-Mizil, R. Caruana.
Learning With Bayesian Networks Markus Kalisch ETH Zürich.
Exploratory studies: you have empirical data and you want to know what sorts of causal models are consistent with it. Confirmatory tests: you have a causal.
Lecture 2: Statistical learning primer for biologists
Linear Discriminant Analysis and Logistic Regression.
Exact Inference in Bayes Nets. Notation U: set of nodes in a graph X i : random variable associated with node i π i : parents of node i Joint probability:
Visual Correlation Analysis of Numerical and Categorical Data on the Correlation Map Zhiyuan Zhang, Kevin T. McDonnell, Erez Zadok, Klaus Mueller.
Pattern Recognition and Machine Learning
A comparative approach for gene network inference using time-series gene expression data Guillaume Bourque* and David Sankoff *Centre de Recherches Mathématiques,
04/21/2005 CS673 1 Being Bayesian About Network Structure A Bayesian Approach to Structure Discovery in Bayesian Networks Nir Friedman and Daphne Koller.
1Causal Performance Models Causal Models for Performance Analysis of Computer Systems Jan Lemeire TELE lab May 24 th 2006.
1 Day 2: Search June 9, 2015 Carnegie Mellon University Center for Causal Discovery.
Slide Slide 1 Chapter 10 Correlation and Regression 10-1 Overview 10-2 Correlation 10-3 Regression 10-4 Variation and Prediction Intervals 10-5 Multiple.
An Algorithm to Learn the Structure of a Bayesian Network Çiğdem Gündüz Olcay Taner Yıldız Ethem Alpaydın Computer Engineering Taner Bilgiç Industrial.
1Causal Inference and the KMSS. Predicting the future with the right model Stijn Meganck Vrije Universiteit Brussel Department of Electronics and Informatics.
1 Day 2: Search June 14, 2016 Carnegie Mellon University Center for Causal Discovery.
Inferring Regulatory Networks from Gene Expression Data BMI/CS 776 Mark Craven April 2002.
Irina Rish IBM T.J.Watson Research Center
Markov Properties of Directed Acyclic Graphs
CSCI 5822 Probabilistic Models of Human and Machine Learning
CSCI 5822 Probabilistic Models of Human and Machine Learning
Center for Causal Discovery: Summer Short Course/Datathon
Introduction to Logistic Regression
An Algorithm for Bayesian Network Construction from Data
Graduate School of Information Sciences, Tohoku University
Visual Causality Analysis Made Practical
Presentation transcript:

The Visual Causality Analyst: An Interactive Interface for Causal Reasoning Jun Wang, Stony Brook University Klaus Mueller, Stony Brook University, SUNY Korea 10/28/2015Jun Wang and Klaus Mueller, Stony Brook University

Causality “Any relationship that cannot be defined from the distribution alone” [Pearl, 2010] Counterfactuals A causes B means: If A didn’t happen (change), B would not happen (change) All relations between variables in a system form a Causal Network 10/28/2015Jun Wang and Klaus Mueller, Stony Brook University

Causal Networks Causal networks can be represented as Bayesian belief networks Directed Acyclic Graphs (DAGs) Augmented with conditional probability distributions CPT, CPD, Linear Regression, Logistic Regression, etc. Probabilistic Dependency and Causal Dependency Thus causal networks can be learned as Bayesian networks But with added constraints and assumptions 10/28/2015Jun Wang and Klaus Mueller, Stony Brook University

Structure Learning Score-based algorithms Search through the space of possible structures (models) with some scoring function. K2 [Cooper & Herskowitz, 1992] GBPS [Spirtes & Meek, 1995] BDe metric [Heckerman et al. 1995] Sparse Candidate [Friedman et al. 1999] Exact [Koivisto & Sood, 2004][Silander & Myllymaki, 2006] GES [Chickering, 2002] GIES [Hauser & Bühlmann, 2012] … Constraint-based algorithms Find a graph that satisfies all the constraints implied by the data distribution. SGS [Spirtes et al. 2000] PC [Spirtes et al. 2000][Meek, 1995] TPDA [Cheng et al. 1997] Heuristic two-phase [Wang & Chan, 2010] TC [Pellet & Elisseeff, 2008] … 10/28/2015Jun Wang and Klaus Mueller, Stony Brook University

Structure Learning Score-based algorithmsConstraint-based algorithms Build structure constrained by conditional independence/dependence calculated from data distributions Such conditional dependencies imply causal dependence and counterfactuals 10/28/2015Jun Wang and Klaus Mueller, Stony Brook University

Conditional Independence and D-separation 10/28/2015Jun Wang and Klaus Mueller, Stony Brook University

D-separation 10/28/2015Jun Wang and Klaus Mueller, Stony Brook University Faithfulness Assumption There is a graph capable to express all CI relations in data. Causal Sufficiency No hidden confounder or selection bias. Chain of CausationConfounding Collision (V-structure) Collider

TC Algorithm [Pellet & Elisseeff, 2008] Start from an empty graph, 1.For each pair of variables in dataset, test for CI conditioning on all other variables. Connect the pair if they are dependent. Output: Moral Graph 2.For each pair of connected variables, search for colliders in variables forming triangles with them. Require a number of CI test exponential to the number of potential colliders 3.Orient V-structures and propagate. Output: Partial DAG 10/28/2015Jun Wang and Klaus Mueller, Stony Brook University

CI Test 10/28/2015Jun Wang and Klaus Mueller, Stony Brook University

Correlations of Categorical & Numerical Variables 10/28/2015Jun Wang and Klaus Mueller, Stony Brook University XYZXYXY XzXz A1526 A3726 B7182 B8282 B9382

Level Value Mapping of Categorical Variables 10/28/2015Jun Wang and Klaus Mueller, Stony Brook University Variable pair categorical/numerical PairwiseGlobal origin/horsepower origin/weight origin/displacement origin/mpg origin/timeTo60mph

Causality in Practical Application CI tests require good data quality to make correct judgements. Satisfaction of causal assumptions cannot be guaranteed. Hard to manage all causal relations when variable number is large. Cannot alter the learned structure and test hypotheses. Solution A Visual Analytical System! 10/28/2015Jun Wang and Klaus Mueller, Stony Brook University

The Visual Causality Analyst 10/28/2015Jun Wang and Klaus Mueller, Stony Brook University Running on auto mpg dataset [UCI Machine Learning Repository, 2013]

The Causality Analyst Analytical Stages 1.Data preparation Mapping levels of categorical variables 2.Structure Learning Learn causal structures with the TC algorithm 3.Regression Analysis Quantify causal relations with linear and logistic regression analyses Make dummy variables out of categorical variables 4.Visual Analytics with the Causal Graph Interactive analysis with visual feedback 10/28/2015Jun Wang and Klaus Mueller, Stony Brook University

Visualization Patterns Vertices: variables Color: type of the variable ( numerical categorical) Edges: causal relations Direction Marks: direction and qualities of causal relation positive negative multiple Opacity: (maximum) causal strength measured by regression coefficients, scaled and enhanced by Dashed line: relation with unknown direction 10/28/2015Jun Wang and Klaus Mueller, Stony Brook University

Regression Analysis Linear regression analysis Numerical dependent variable p-value, F-statistics, R-squared, etc. Logistic regression analysis Categorical dependent variable p-value, Deviance, Likelihood, etc. 10/28/2015Jun Wang and Klaus Mueller, Stony Brook University

Case 1: Auto MPG dataset [UCI Machine Learning Repository, 2013] 10/28/2015Jun Wang and Klaus Mueller, Stony Brook University The complete causal graph Filter edges with 0.4 coefficient threshold The causal chain related to mpg 8 variables, 392 observations

Case 1: Auto MPG dataset [UCI Machine Learning Repository, 2013] 10/28/2015Jun Wang and Klaus Mueller, Stony Brook University The added causal relationRegression view of mpg before adding the edge Regression view of mpg after adding the edge

Case 2: Sales Campaign Dataset 10/28/2015Jun Wang and Klaus Mueller, Stony Brook University The causal graphAll relations related to PipeRevn Regression view of PipeRevn and Cost 10 variables, 600 observations

Future Work Analytical visualization Visualize goodness of fitting for regression models of each node as node stroke thickness e.g. F-test score or Deviance, Automatic predictor analysis Automatic predictor analysis Fit data on existed structure Scoring the graph structure according to the dataset Causal inference within data clusters Integrate tools like Illustrative Parallel Coordinates [McDonnell and Klaus, 2008] Causality from time series data Time series chain graph and Granger causality graphs [Eichler, 2008] 10/28/2015Jun Wang and Klaus Mueller, Stony Brook University

Other Potential Future Work More sophisticated CI test equivalence Data cleaning, e.g. outlier detection and removal Handling big data, e.g. incremental visualization Causal analysis involving interventional data … 10/28/2015Jun Wang and Klaus Mueller, Stony Brook University

Summary Causality and Causal Network Constraint-based Structural Learning Value Mapping of Categorical Variables The Visual Causal Analyst Analytical Stages Visualization of Causal Graph with Statistical Assessment Interactive Analysis with Visual Feedback Prototype with Many Potential Future Work 10/28/2015Jun Wang and Klaus Mueller, Stony Brook University

Thanks for attending my talk! 10/28/2015Jun Wang and Klaus Mueller, Stony Brook University