Causal Data Mining Richard Scheines Dept. of Philosophy, Machine Learning, & Human-Computer Interaction Carnegie Mellon.

Slides:



Advertisements
Similar presentations
Pat Langley Computational Learning Laboratory Center for the Study of Language and Information Stanford University, Stanford, California
Advertisements

Florida International University COP 4770 Introduction of Weka.
Topic Outline Motivation Representing/Modeling Causal Systems
Weka & Rapid Miner Tutorial By Chibuike Muoh. WEKA:: Introduction A collection of open source ML algorithms – pre-processing – classifiers – clustering.
Classification: Definition Given a collection of records (training set ) –Each record contains a set of attributes, one of the attributes is the class.
IS5152 Decision Making Technologies
An Overview of Machine Learning
Lecture Notes for Chapter 4 Introduction to Data Mining
1. Abstract 2 Introduction Related Work Conclusion References.
Rosa Cowan April 29, 2008 Predictive Modeling & The Bayes Classifier.
Collaborative Filtering in iCAMP Max Welling Professor of Computer Science & Statistics.
Decision Tree Rong Jin. Determine Milage Per Gallon.
COMP 328: Midterm Review Spring 2010 Nevin L. Zhang Department of Computer Science & Engineering The Hong Kong University of Science & Technology
Department of Computer Science, University of Waikato, New Zealand Eibe Frank WEKA: A Machine Learning Toolkit The Explorer Classification and Regression.
Searching for Patterns: Sean Early PSLC Summer School 2007 Question: Which is a better predictor of performance in a cognitive tutor, error rate or assistance.
Kernel Methods Part 2 Bing Han June 26, Local Likelihood Logistic Regression.
Causal Modeling for Anomaly Detection Andrew Arnold Machine Learning Department, Carnegie Mellon University Summer Project with Naoki Abe Predictive Modeling.
Educational Data Mining Overview John Stamper PSLC Summer School /25/2011 1PSLC Summer School 2011.
Review Rong Jin. Comparison of Different Classification Models  The goal of all classifiers Predicating class label y for an input x Estimate p(y|x)
12 -1 Lecture 12 User Modeling Topics –Basics –Example User Model –Construction of User Models –Updating of User Models –Applications.
Chapter 5 Data mining : A Closer Look.
Educational Data Mining Ryan S.J.d. Baker PSLC/HCII Carnegie Mellon University Richard Scheines Professor of Statistics, Machine Learning, and Human-Computer.
Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305.
Machine Learning Usman Roshan Dept. of Computer Science NJIT.
Data Mining. 2 Models Created by Data Mining Linear Equations Rules Clusters Graphs Tree Structures Recurrent Patterns.
1 Tetrad: Machine Learning and Graphcial Causal Models Richard Scheines Joe Ramsey Carnegie Mellon University Peter Spirtes, Clark Glymour.
CS490D: Introduction to Data Mining Prof. Chris Clifton April 14, 2004 Fraud and Misuse Detection.
Intrusion Detection Jie Lin. Outline Introduction A Frame for Intrusion Detection System Intrusion Detection Techniques Ideas for Improving Intrusion.
Machine Learning CUNY Graduate Center Lecture 1: Introduction.
Data Mining Joyeeta Dutta-Moscato July 10, Wherever we have large amounts of data, we have the need for building systems capable of learning information.
Using Bayesian Networks to Analyze Expression Data N. Friedman, M. Linial, I. Nachman, D. Hebrew University.
Machine Learning1 Machine Learning: Summary Greg Grudic CSCI-4830.
Algorithms: The Basic Methods Witten – Chapter 4 Charles Tappert Professor of Computer Science School of CSIS, Pace University.
1 Causal Data Mining Richard Scheines Dept. of Philosophy, Machine Learning, & Human-Computer Interaction Carnegie Mellon.
Introduction to SQL Server Data Mining Nick Ward SQL Server & BI Product Specialist Microsoft Australia Nick Ward SQL Server & BI Product Specialist Microsoft.
Data Mining Teaching experience at the FIB. What is Data Mining? A broad set of techniques and algorithms brought from machine learning and statistics.
Data Mining Practical Machine Learning Tools and Techniques Chapter 4: Algorithms: The Basic Methods Section 4.6: Linear Models Rodney Nielsen Many of.
7.4 – Sampling Distribution Statistic: a numerical descriptive measure of a sample Parameter: a numerical descriptive measure of a population.
1 Statistical Techniques Chapter Linear Regression Analysis Simple Linear Regression.
1 Generative and Discriminative Models Jie Tang Department of Computer Science & Technology Tsinghua University 2012.
Copyright © 2004 Pearson Education, Inc.. Chapter 27 Data Mining Concepts.
Learning from observations
1 Improving quality of graduate students by data mining Asst. Prof. Kitsana Waiyamai, Ph.D. Dept. of Computer Engineering Faculty of Engineering, Kasetsart.
Chapter 11 Statistical Techniques. Data Warehouse and Data Mining Chapter 11 2 Chapter Objectives  Understand when linear regression is an appropriate.
Overview of the final test for CSC Overview PART A: 7 easy questions –You should answer 5 of them. If you answer more we will select 5 at random.
Data Mining: Knowledge Discovery in Databases Peter van der Putten ALP Group, LIACS Pre-University College LAPP-Top Computer Science February 2005.
Data Mining and Decision Support
Chapter 6. Classification and Prediction Classification by decision tree induction Bayesian classification Rule-based classification Classification by.
DECISION TREES Asher Moody, CS 157B. Overview  Definition  Motivation  Algorithms  ID3  Example  Entropy  Information Gain  Applications  Conclusion.
Oct 29th, 2001Copyright © 2001, Andrew W. Moore Bayes Net Structure Learning Andrew W. Moore Associate Professor School of Computer Science Carnegie Mellon.
A Decision Support Based on Data Mining in e-Banking Irina Ionita Liviu Ionita Department of Informatics University Petroleum-Gas of Ploiesti.
Machine Learning Usman Roshan Dept. of Computer Science NJIT.
Data-Driven Education
Usman Roshan Dept. of Computer Science NJIT
Experience Report: System Log Analysis for Anomaly Detection
CSE 4705 Artificial Intelligence
EDUCAUSE Annual Conference
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
RESEARCH APPROACH.
Prediction as Data Mining Task
Data Mining Lecture 11.
Basic Intro Tutorial on Machine Learning and Data Mining
Causal Data Mining Richard Scheines
3.1.1 Introduction to Machine Learning
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Prepared by: Mahmoud Rafeek Al-Farra
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Educational Data Mining Success Stories
Searching for Graphical Causal Models of Education Data
What is Artificial Intelligence?
Presentation transcript:

Causal Data Mining Richard Scheines Dept. of Philosophy, Machine Learning, & Human-Computer Interaction Carnegie Mellon

1. Predictive Data Mining Finding predictive relationships in data –What feature of student behavior predicts learning –Who will default on credit cards –Who will get an “A” in your course –Which HS students will do well at CMU –Do students cluster by “learning style”

Causal Data Mining Finding causal relationships in data –What feature of student behavior causes learning –What will happen when we make everyone take a reading quiz before each class –What will happen when we program our tutor to intervene to give hints after an error

Predictive Data Mining X1X2X3..XkY M F F N2.812M Data Mining Search Predictive Model Y = f(X1, X2, …Xk)

Predictive Data Mining Data Mining Search Predictive Model Y = f(X1, X2, …Xk) Model Classes 1.Simple Regression 2.Locally Weighted Regression 3.Logistic Regression 4.Neural Nets 5.Vector Support Machines 6.Decision Trees 7.Bayes Net 8.Naïve Bayes Classifier 9.Independent Components 10.Clustering 11.Etc.

Predictive Data Mining Predictive Model under Constraints Y = f(X1, X2, …Xk), e.g., f  Additive functions Data Mining Search

Predictive Data Mining Predictive Model under Constraints Y = f(X1, X2, …Xk), Or Probability Model under Constraints: P(Y | X1, X2, …, Xk), where P  Gaussian, with mean 0 Data Mining Search

Predictive Data Mining Decision Tree Search

Predictive Data Mining ≠ Causal Data Mining P(Y | X1, X2, …, Xk)  P(Y | X1 set, X2, …, Xk) Conditioning is not the same as intervening Teeth Slides

Causal Discovery Statistical Data  Causal Structure Background Knowledge - X 2 before X 3 - no unmeasured common causes Statistical Inference

Causal Discovery Software TETRAD IV

Full Semester Online Course in Causal & Statistical Reasoning

Course is tooled to record certain events:  Logins, page requests, print requests, quiz attempts, quiz scores, voluntary exercises attempted, etc. Each event was associated with attributes:  Time  student-id  Session-id

Printing and Voluntary Comprehension Checks: >