Causality Workbenchclopinet.com/causality Results of the Causality Challenge Isabelle Guyon, Clopinet Constantin Aliferis and Alexander Statnikov, Vanderbilt.

Slides:



Advertisements
Similar presentations
Active Learning Challenge Active Learning Challenge Isabelle Guyon (Clopinet, California) Gavin Cawley (University of East Anglia,
Advertisements

Alexander Statnikov1, Douglas Hardin1,2, Constantin Aliferis1,3
CORRELATIONAL RESEARCH o What are the Uses of Correlational Research?What are the Uses of Correlational Research? o What are the Requirements for Correlational.
1 Statistical Modeling  To develop predictive Models by using sophisticated statistical techniques on large databases.
Correlation and regression Dr. Ghada Abo-Zaid
Statistics 100 Lecture Set 7. Chapters 13 and 14 in this lecture set Please read these, you are responsible for all material Will be doing chapters
Can causal models be evaluated? Isabelle Guyon ClopiNet / ChaLearn
Lecture 5: Causality and Feature Selection Isabelle Guyon
Using Markov Blankets for Causal Structure Learning Jean-Philippe Pellet Andre Ellisseeff Presented by Na Dai.
The bumpy road of the search for a (good) cause
Causality Workbenchclopinet.com/causality The LOCANET task (Pot-luck challenge, NIPS 2008) Isabelle Guyon, Clopinet Alexander Statnikov, Vanderbilt Univ.
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 13 Introduction to Linear Regression and Correlation Analysis.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 4 Choosing a Research Design.
Feature selection methods from correlation to causality Isabelle Guyon NIPS 2008 workshop on kernel learning.
Linear Regression and Correlation Analysis
Review: The Logic Underlying ANOVA The possible pair-wise comparisons: X 11 X 12. X 1n X 21 X 22. X 2n Sample 1Sample 2 means: X 31 X 32. X 3n Sample 3.
Chapter 13 Introduction to Linear Regression and Correlation Analysis
1 gR2002 Peter Spirtes Carnegie Mellon University.
Lecture 6: Causal Discovery Isabelle Guyon
Correlational Methods and Statistics. Correlation  Nonexperimental method that describes a relationship between two variables.  Allow us to make predictions.
Causal Modeling for Anomaly Detection Andrew Arnold Machine Learning Department, Carnegie Mellon University Summer Project with Naoki Abe Predictive Modeling.
Relationships Among Variables
Bayes Net Perspectives on Causation and Causal Inference
RESULTS OF THE WCCI 2006 PERFORMANCE PREDICTION CHALLENGE Isabelle Guyon Amir Reza Saffari Azar Alamdari Gideon Dror.
Chapter 2: The Research Enterprise in Psychology
Difference Two Groups 1. Content Experimental Research Methods: Prospective Randomization, Manipulation Control Research designs Validity Construct Internal.
Attention Deficit Hyperactivity Disorder (ADHD) Student Classification Using Genetic Algorithm and Artificial Neural Network S. Yenaeng 1, S. Saelee 2.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
VCE Further Maths Least Square Regression using the calculator.
Causality Workbenchclopinet.com/causality Cause-Effect Pair Challenge Isabelle Guyon, ChaLearn IJCNN 2013 IEEE/INNS.
Baseline Methods for the Feature Extraction Class Isabelle Guyon Best BER=1.26  0.14% - n0=1000 (20%) – BER0=1.80% GISETTE Best BER=1.26  0.14% - n0=1000.
Model of Prediction Error in Chaotic and Web Driven Business Environment Franjo Jović*, Alan Jović ** * Faculty of Electrical Engineering, University of.
Feature Selection and Causal discovery Isabelle Guyon, Clopinet André Elisseeff, IBM Zürich Constantin Aliferis, Vanderbilt University.
ArrayCluster: an analytic tool for clustering, data visualization and module finder on gene expression profiles 組員:李祥豪 謝紹陽 江建霖.
Participation in the NIPS 2003 Challenge Theodor Mader ETH Zurich, Five Datasets were provided for experiments: ARCENE: cancer diagnosis.
315 Feature Selection. 316 Goals –What is Feature Selection for classification? –Why feature selection is important? –What is the filter and what is the.
Biomedical Research Objective 2 Biomedical Research Methods.
L 1 Chapter 12 Correlational Designs EDUC 640 Dr. William M. Bauer.
1 Women’s Studies. 2 First Wave Feminism Mary Wollstonecraft’s (1792) A Vindication of the Rights of Women is one of the first written works to be called.
Introduction to Linear Regression
Causality challenge #2: Pot-Luck
Random Sets Approach and its Applications Basic iterative feature selection, and modifications. Tests for independence & trimmings (similar to HITON algorithm).
Jeff Howbert Introduction to Machine Learning Winter Regression Linear Regression.
Lecture 5: Causality and Feature Selection Isabelle Guyon
273 Discovery of Causal Structure Using Causal Probabilistic Network Induction AMIA 2003, Machine Learning Tutorial Constantin F. Aliferis & Ioannis Tsamardinos.
Designing multiple biometric systems: Measure of ensemble effectiveness Allen Tang NTUIM.
Competitions in machine learning: the fun, the art, and the science Isabelle Guyon Clopinet, Berkeley, California
Tell Me What You See and I will Show You Where It Is Jia Xu 1 Alexander G. Schwing 2 Raquel Urtasun 2,3 1 University of Wisconsin-Madison 2 University.
Presented By Dr / Said Said Elshama  Distinguish between validity and reliability.  Describe different evidences of validity.  Describe methods of.
Lecture 10: Correlation and Regression Model.
Research Methods. Research methods Naturalistic observation (Observational) Correlational research Experimentation Comparative method Cost-Benefit analysis.
Computational Approaches for Biomarker Discovery SubbaLakshmiswetha Patchamatla.
Guest lecture: Feature Selection Alan Qi Dec 2, 2004.
Challenges in causality: Results of the WCCI 2008 challenge Isabelle Guyon, Clopinet Constantin Aliferis and Alexander Statnikov, Vanderbilt Univ. André.
Evaluating VR Systems. Scenario You determine that while looking around virtual worlds is natural and well supported in VR, moving about them is a difficult.
276 Causal Discovery Methods Using Causal Probabilistic Networks MEDINFO 2004, T02: Machine Learning Methods for Decision Support and Discovery Constantin.
Chapter Thirteen Bivariate Correlation and Regression Chapter Thirteen.
The Visual Causality Analyst: An Interactive Interface for Causal Reasoning Jun Wang, Stony Brook University Klaus Mueller, Stony Brook University, SUNY.
Feature Selction for SVMs J. Weston et al., NIPS 2000 오장민 (2000/01/04) Second reference : Mark A. Holl, Correlation-based Feature Selection for Machine.
Learning Photographic Global Tonal Adjustment with a Database of Input / Output Image Pairs.
Advanced Gene Selection Algorithms Designed for Microarray Datasets Limitation of current feature selection methods: –Ignores gene/gene interaction: single.
 Basic assumption: events are ___________ by some lawful order  Goals: › _________ and description › Understanding and _________ › ____________ and.
Introduction to translational and clinical bioinformatics Connecting complex molecular information to clinically relevant decisions using molecular.
Correlation.
Lecture 3: Causality and Feature Selection
L. Isella, A. Karvounaraki (JRC) D. Karlis (AUEB)
Quantitative design: Ungraded review questions
Feature Selection Ioannis Tsamardinos Machine Learning Course, 2006
Junheng, Shengming, Yunsheng 11/09/2018
Presentation transcript:

Causality Workbenchclopinet.com/causality Results of the Causality Challenge Isabelle Guyon, Clopinet Constantin Aliferis and Alexander Statnikov, Vanderbilt Univ. André Elisseeff and Jean-Philippe Pellet, IBM Zürich Gregory F. Cooper, Pittsburg University Peter Spirtes, Carnegie Mellon

Causality Workbenchclopinet.com/causality Causal discovery Which actions will have beneficial effects? …your health? …climate changes? … the economy? What affects…

Causality Workbenchclopinet.com/causality The system Systemic causality External agent

Causality Workbenchclopinet.com/causality Feature Selection X Y Predict Y from features X 1, X 2, … Select most predictive features.

Causality Workbenchclopinet.com/causality X Y Causation Predict the consequences of actions: Under “manipulations” by an external agent, some features are no longer predictive. Y

Causality Workbenchclopinet.com/causality Challenge Design

Causality Workbenchclopinet.com/causality Available data A lot of “observational” data. Correlation  Causality! Experiments are often needed, but: –Costly –Unethical –Infeasible This challenge, semi-artificial data: –Re-simulated data –Real data with artificial “probes”

Causality Workbenchclopinet.com/causality Four tasks Toy datasets Challenge datasets

Causality Workbenchclopinet.com/causality On-line feed-back

Causality Workbenchclopinet.com/causality Difficulties Violated assumptions: –Causal sufficiency –Markov equivalence –Faithfulness –Linearity –“Gaussianity” Overfitting (statistical complexity): –Finite sample size Algorithm efficiency (computational complexity): –Thousands of variables –Tens of thousands of examples

Causality Workbenchclopinet.com/causality Evaluation Fulfillment of an objective Prediction of a target variable Predictions under manipulations Causal relationships: Existence Strength Degree

Causality Workbenchclopinet.com/causality Setting Predict a target variable (on training and test data). Return the set of features used. Flexibility: –Sorted or unsorted list of features –Single prediction or table of results Complete entry = xxx0, xxx1, xxx2 results (for at least one dataset).

Causality Workbenchclopinet.com/causality Metrics Results ranked according to the test set target prediction performance “Tscore”: We also assess directly the feature set with a “Fscore”, not used for ranking.

Causality Workbenchclopinet.com/causality Toy Examples

Causality Workbenchclopinet.com/causality Lung Cancer SmokingGenetics Coughing Attention Disorder Allergy AnxietyPeer Pressure Yellow Fingers Car Accident Born an Even Day Fatigue LUCAS 0 : natural Causality assessment with manipulations

Causality Workbenchclopinet.com/causality LUCAS 1 : manipulated Lung Cancer Smoking Genetics Coughing Attention Disorder Allergy AnxietyPeer Pressure Yellow Fingers Car Accident Born an Even Day Fatigue Causality assessment with manipulations

Causality Workbenchclopinet.com/causality Lung Cancer SmokingGenetics Coughing Attention Disorder Allergy AnxietyPeer Pressure Yellow Fingers Car Accident Born an Even Day Fatigue LUCAS 2 : manipulated Causality assessment with manipulations

Causality Workbenchclopinet.com/causality Goal driven causality We define: V=variables of interest (e.g. MB, direct causes,...) We assess causal relevance: Fscore=f(V,S) Participants return: S=selected subset (ordered or not).

Causality Workbenchclopinet.com/causality Causality assessment without manipulation?

Causality Workbenchclopinet.com/causality Using artificial “probes” Lung Cancer SmokingGenetics Coughing Attention Disorder Allergy AnxietyPeer Pressure Yellow Fingers Car Accident Born an Even Day Fatigue LUCAP 0 : natural Probes P1P1 P2P2 P3P3 PTPT

Causality Workbenchclopinet.com/causality Probes Lung Cancer SmokingGenetics Coughing Attention Disorder Allergy AnxietyPeer Pressure Yellow Fingers Car Accident Born an Even Day Fatigue P1P1 P2P2 P3P3 PTPT LUCAP 1&2 : manipulated Using artificial “probes”

Causality Workbenchclopinet.com/causality Scoring using “probes” What we can compute (Fscore): –Negative class = probes (here, all “non-causes”, all manipulated). –Positive class = other variables (may include causes and non causes). What we want (Rscore): –Positive class = causes. –Negative class = non-causes. What we get (asymptotically): Fscore = (N TruePos /N Real ) Rscore (N TrueNeg /N Real )

Causality Workbenchclopinet.com/causality Results

Causality Workbenchclopinet.com/causality Challenge statistics Start: December 15, End: April 30, 2000 Total duration: 20 weeks. Last (complete) entry ranked: Number of ranked submissions Number of ranked entrants

Causality Workbenchclopinet.com/causality Learning curves

Causality Workbenchclopinet.com/causality AUC distribution

Causality Workbenchclopinet.com/causality REGED

Causality Workbenchclopinet.com/causality SIDO

Causality Workbenchclopinet.com/causality CINA

Causality Workbenchclopinet.com/causality MARTI

Causality Workbenchclopinet.com/causality Pairwise comparisons

Causality Workbenchclopinet.com/causality Top ranking methods According to the rules of the challenge: –Yin Wen Chang: SVM => best prediction accuracy on REGED and CINA. Prize: $400 donated by Microsoft. –Gavin Cawley: Causal explorer + linear ridge regression ensembles => best prediction accuracy on SIDO and MARTI. Prize: $400 donated by Microsoft. According to pairwise comparisons: –Jianxin Yin and Prof. Zhi Geng’s group: Partial Orientation and Local Structural Learning => best on Pareto front, new original causal discovery algorithm. Prize: free WCCI 2008 registration.

Causality Workbenchclopinet.com/causality Pairwise comparisons REGEDSIDO CINA MARTI

Causality Workbenchclopinet.com/causality Conclusion We have found good correlation between causation and prediction under manipulations. Several algorithms have demonstrated effectiveness of discovering causal relationships. We still need to investigate what makes then fail in some cases. We need to capitalize on the power of classical feature selection methods.