Temporal Causal Modeling with Graphical Granger Methods

Slides:



Advertisements
Similar presentations
Sequential learning in dynamic graphical model Hao Wang, Craig Reeson Department of Statistical Science, Duke University Carlos Carvalho Booth School of.
Advertisements

Statistics for Improving the Efficiency of Public Administration Daniel Peña Universidad Carlos III Madrid, Spain NTTS 2009 Brussels.
Yinyin Yuan and Chang-Tsun Li Computer Science Department
Factorial Mixture of Gaussians and the Marginal Independence Model Ricardo Silva Joint work-in-progress with Zoubin Ghahramani.
StatisticalDesign&ModelsValidation. Introduction.
Structural Equation Modeling
Analysis of the time-varying cortical neural connectivity in the newborn EEG: a time-frequency approach Amir Omidvarnia, Mostefa Mesbah, John M. O’Toole,
Forecasting Using the Simple Linear Regression Model and Correlation
Towards Twitter Context Summarization with User Influence Models Yi Chang et al. WSDM 2013 Hyewon Lim 21 June 2013.
Prediction with Regression
Unit Roots & Forecasting
Exploiting Sparse Markov and Covariance Structure in Multiresolution Models Presenter: Zhe Chen ECE / CMR Tennessee Technological University October 22,
The Price Relationship Study of Stock You-Sheng Liu 2011/01/04.
Tokyo Research Laboratory © Copyright IBM Corporation 2009 | 2009/04/ | SDM 2009 / Anomaly-Detection Proximity-Based Anomaly Detection using Sparse Structure.
Sparse Word Graphs: A Scalable Algorithm for Capturing Word Correlations in Topic Models Ramesh Nallapati Joint work with John Lafferty, Amr Ahmed, William.
Additional Topics in Regression Analysis
Multivariate Data Analysis Chapter 4 – Multiple Regression.
A Sparsification Approach for Temporal Graphical Model Decomposition Ning Ruan Kent State University Joint work with Ruoming Jin (KSU), Victor Lee (KSU)
Gehlavij Mohammadi  Basic definition  Graphical models for multivariate time series  Theory from TSC graph to causality graph  Degrees of causality.
MACHINE LEARNING 6. Multivariate Methods 1. Based on E Alpaydın 2004 Introduction to Machine Learning © The MIT Press (V1.1) 2 Motivating Example  Loan.
Bayes Classifier, Linear Regression 10701/15781 Recitation January 29, 2008 Parts of the slides are from previous years’ recitation and lecture notes,
Analysis of Individual Variables Descriptive – –Measures of Central Tendency Mean – Average score of distribution (1 st moment) Median – Middle score (50.
Causal Modeling for Anomaly Detection Andrew Arnold Machine Learning Department, Carnegie Mellon University Summer Project with Naoki Abe Predictive Modeling.
Linear Regression Models Powerful modeling technique Tease out relationships between “independent” variables and 1 “dependent” variable Models not perfect…need.
Autocorrelation Lecture 18 Lecture 18.
Absolute error. absolute function absolute value.
1 Tetrad: Machine Learning and Graphcial Causal Models Richard Scheines Joe Ramsey Carnegie Mellon University Peter Spirtes, Clark Glymour.
Dynamical Correlation: A New Method to Quantify Synchrony Siwei Liu 1, Yang Zhou 1, Richard Palumbo 2, & Jane-Ling Wang 1 1 UC Davis; 2 University of Rhode.
Correlated Topic Models By Blei and Lafferty (NIPS 2005) Presented by Chunping Wang ECE, Duke University August 4 th, 2006.
Computer vision: models, learning and inference Chapter 19 Temporal models.
Statistical Modeling with SAS/STAT Cheng Lei Department of Electrical and Computer Engineering University of Victoria April 9, 2015.
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
Brain Mapping Unit The General Linear Model A Basic Introduction Roger Tait
Bayesian networks Classification, segmentation, time series prediction and more. Website: Twitter:
Sparse Inverse Covariance Estimation with Graphical LASSO J. Friedman, T. Hastie, R. Tibshirani Biostatistics, 2008 Presented by Minhua Chen 1.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Repeated Measurements Analysis. Repeated Measures Analysis of Variance Situations in which biologists would make repeated measurements on same individual.
Big Data at Home Depot KSU – Big Data Survey Course Steve Einbender Advanced Analytics Architect.
New Measures of Data Utility Mi-Ja Woo National Institute of Statistical Sciences.
Gaussian Processes Li An Li An
Hilbert Space Embeddings of Conditional Distributions -- With Applications to Dynamical Systems Le Song Carnegie Mellon University Joint work with Jonathan.
Paper Reading Dalong Du Nov.27, Papers Leon Gu and Takeo Kanade. A Generative Shape Regularization Model for Robust Face Alignment. ECCV08. Yan.
Chapter 16 Social Statistics. Chapter Outline The Origins of the Elaboration Model The Elaboration Paradigm Elaboration and Ex Post Facto Hypothesizing.
CpSc 881: Machine Learning
Chapter 10: Determining How Costs Behave 1 Horngren 13e.
CS Statistical Machine learning Lecture 12 Yuan (Alan) Qi Purdue CS Oct
1 Chapter 8: Model Inference and Averaging Presented by Hui Fang.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 14 l Time Series: Understanding Changes over Time.
Multi-label Prediction via Sparse Infinite CCA Piyush Rai and Hal Daume III NIPS 2009 Presented by Lingbo Li ECE, Duke University July 16th, 2010 Note:
Granger Causality for Time-Series Anomaly Detection By Zhangzhou.
Regularized Least-Squares and Convex Optimization.
Institute of Statistics and Decision Sciences In Defense of a Dissertation Submitted for the Degree of Doctor of Philosophy 26 July 2005 Regression Model.
Forecasting. Model with indicator variables The choice of a forecasting technique depends on the components identified in the time series. The techniques.
1 Autocorrelation in Time Series data KNN Ch. 12 (pp )
Financial Econometrics Lecture Notes 4
General Linear Model & Classical Inference
The general linear model and Statistical Parametric Mapping
CH 5: Multivariate Methods
Overview of Supervised Learning
CHAPTER 16 ECONOMIC FORECASTING Damodar Gujarati
A Non-Parametric Bayesian Method for Inferring Hidden Causes
The General Linear Model (GLM)
Estimating Networks With Jumps
Migration and the Labour Market
The general linear model and Statistical Parametric Mapping
The General Linear Model (GLM)
Multivariate Methods Berlin Chen
Multivariate Methods Berlin Chen, 2005 References:
Signal Processing on Graphs: Performance of Graph Structure Estimation
Longitudinal Data & Mixed Effects Models
Presentation transcript:

Temporal Causal Modeling with Graphical Granger Methods SIGKDD 07 August 13, 2007 Andrew Arnold (Carnegie Mellon University) Yan Liu (IBM T.J. Watson Research) Naoki Abe (IBM T.J. Watson Research)

Talk Outline Introduction and motivation Overview of Granger causality Graphical Granger methods Exhaustive Granger Lasso Granger SIN Granger Vector auto-regression (VAR) Experimental results

A Motivating Example: Key Performance Indicator Data (KPI) in Corporate Index Management [S&P] Variables Company HAL Year 1999 2000 2001 Quarter 4 1 2 3 Revenue ($M) 6.24 6.54 5.82 3.89 4.1 4.41 3.6 Revenue-to-RD 2.185704 1.734358 1.381822 0.416212 0.843057 0.906083 0.930714 Revenue-to-RD CAGR -0.61429 -0.47757 -0.32646 Innovation Index 0.517621 0.578062 0.567874 0.98624 0.696722 0.679335 .734627 Innovation Index CAGR 0.346008 0.175194 .229845 CapEx to Revenue 0.152292 0.258789 0.111111 0.63592 1.33114 1.389658 0.009722 compound annual growth rate Time

KPI Case Study: Temporal Causal Modeling for Identifying Levers of Corporate Performance How can we leverage information in temporal data to assist causal modeling and inference ? Key idea: A cause necessarily precedes its effects… Variables Company HAL Year 1999 2000 2001 Quarter 4 1 2 3 Revenue ($M) 6.24 6.54 5.82 3.89 4.1 4.41 3.6 Revenue-to-RD 2.185704 1.734358 1.381822 0.416212 0.843057 0.906083 0.930714 Revenue-to-RD CAGR -0.61429 -0.47757 -0.32646 Innovation Index 0.517621 0.578062 0.567874 0.98624 0.696722 0.679335 .734627 Innovation Index CAGR 0.346008 0.175194 .229845 CapEx to Revenue 0.152292 0.258789 0.111111 0.63592 1.33114 1.389658 0.009722 compound annual growth rate Time

Granger Causality Granger causality Introduced by the Nobel prize winning economist, Clive Granger [Granger ‘69] Definition: a time series x is said to “Granger cause” another time series y, if and only if: regressing for y in terms of past values of both y and x is statistically significantly better than regressing y on past values of y only Assumption: no common latent causes

Variable Space Expansion & Feature Space Mapping Fewer edges Fewer variables

Graphical Granger Methods Exhaustive Granger Test all possible univariate Granger models independently Lasso Granger Use L1-normed regression to choose sparse multivariate regression models [Meinshausen & Buhlmann, ‘06] SIN Granger Do matrix inversion to find correlations between features across time [Drton & Perlman, ‘04] Vector auto-regression (VAR) Fit data to linear-normal time series model [Gilbert, ‘95] Too many variables. How can we condense.

Exhaustive Granger vs. Lasso Granger Too many variables. How can we condense.

Baseline methods: SIN and VAR

Empirical Evaluation of Competing Methods Evaluation by simulation Sample data from synthetic (linear normal) causal model Learn using a number of competing methods Compare learned graphs to original model Measure similarity of output graph to original graph in terms of Precision of predicted edges Recall of predicted edges F1 of predicted edges Parameterize performance analysis Randomly sample graphs from parameter space Lag; Features; Affinity; Noise; Samples per feature; Samples per feature per lag Conditioning to see interaction effects E.g. Effect of # features when samples_per_feature_per_lag is small vs large

Experiment 1A: Performance vs. Factors - Random sampling all factors -

Experiment 1’s Efficiency

Experiment 1B: Performance vs. Factors - Fixing other factors - 13 13

Experiment 1C: Performance vs Experiment 1C: Performance vs. Factors - Detail: Parametric Conditioning -

Experiment 2: Learned Graphs

Experiment 3: Real World Data Output Graphs on the Corporate KPI Data