1 Center for Causal Discovery: Summer Workshop - 2015 June 8-11, 2015 Carnegie Mellon University.

Slides:



Advertisements
Similar presentations
Probability models- the Normal especially.
Advertisements

Structural Equation Modeling
Nemours Biomedical Research Statistics March 19, 2009 Tim Bunnell, Ph.D. & Jobayer Hossain, Ph.D. Nemours Bioinformatics Core Facility.
Hypothesis Testing Steps of a Statistical Significance Test. 1. Assumptions Type of data, form of population, method of sampling, sample size.
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 13 Introduction to Linear Regression and Correlation Analysis.
Introduction to Hypothesis Testing CJ 526 Statistical Analysis in Criminal Justice.
Inferences About Means of Single Samples Chapter 10 Homework: 1-6.
Inferences About Means of Single Samples Chapter 10 Homework: 1-6.
1 SOC 3811 Basic Social Statistics. 2 Announcements  Assignment 2 Revisions (interpretation of measures of central tendency and dispersion) — due next.
Understanding Statistics in Research
Linear Regression and Correlation Analysis
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Business Statistics - QBM117 Interval estimation for the slope and y-intercept Hypothesis tests for regression.
Testing the Difference Between Means (Small Independent Samples)
Introduction to Hypothesis Testing CJ 526 Statistical Analysis in Criminal Justice.
Using Statistics in Research Psych 231: Research Methods in Psychology.
1 Richard Scheines Carnegie Mellon University Causal Graphical Models II: Applications with Search.
1 Day 2: Search June 9, 2015 Carnegie Mellon University Center for Causal Discovery.
1 © Lecture note 3 Hypothesis Testing MAKE HYPOTHESIS ©
Psyc 235: Introduction to Statistics DON’T FORGET TO SIGN IN FOR CREDIT!
1/2555 สมศักดิ์ ศิวดำรงพงศ์
1 Tetrad: Machine Learning and Graphcial Causal Models Richard Scheines Joe Ramsey Carnegie Mellon University Peter Spirtes, Clark Glymour.
Let’s flip a coin. Making Data-Based Decisions We’re going to flip a coin 10 times. What results do you think we will get?
Confidence Intervals for the Regression Slope 12.1b Target Goal: I can perform a significance test about the slope β of a population (true) regression.
Inference for Linear Regression Conditions for Regression Inference: Suppose we have n observations on an explanatory variable x and a response variable.
X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ μ.
1 Power and Sample Size in Testing One Mean. 2 Type I & Type II Error Type I Error: reject the null hypothesis when it is true. The probability of a Type.
Chapter 8 Introduction to Hypothesis Testing
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
LECTURE 19 THURSDAY, 14 April STA 291 Spring
Inferential Statistics 2 Maarten Buis January 11, 2006.
+ Chapter 12: Inference for Regression Inference for Linear Regression.
1 Tutorial: Causal Model Search Richard Scheines Carnegie Mellon University Peter Spirtes, Clark Glymour, Joe Ramsey, others.
© The McGraw-Hill Companies, Inc., Chapter 11 Correlation and Regression.
1 Center for Causal Discovery: Summer Workshop June 8-11, 2015 Carnegie Mellon University.
Independent Samples 1.Random Selection: Everyone from the Specified Population has an Equal Probability Of being Selected for the study (Yeah Right!)
Part 2: Model and Inference 2-1/49 Regression Models Professor William Greene Stern School of Business IOMS Department Department of Economics.
+ Chapter 12: More About Regression Section 12.1 Inference for Linear Regression.
Learning Linear Causal Models Oksana Kohutyuk ComS 673 Spring 2005 Department of Computer Science Iowa State University.
PY 603 – Advanced Statistics II TR 12:30-1:45pm 232 Gordon Palmer Hall Jamie DeCoster.
Chapter 9 Introduction to the t Statistic. 9.1 Review Hypothesis Testing with z-Scores Sample mean (M) estimates (& approximates) population mean (μ)
Statistics for Business and Economics 8 th Edition Chapter 11 Simple Regression Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Statistical Testing of Differences CHAPTER fifteen.
Unit 8 Section 8-3 – Day : P-Value Method for Hypothesis Testing  Instead of giving an α value, some statistical situations might alternatively.
Ex St 801 Statistical Methods Inference about a Single Population Mean.
Monday, October 22 Hypothesis testing using the normal Z-distribution. Student’s t distribution. Confidence intervals.
Simple examples of the Bayesian approach For proportions and means.
Applied Quantitative Analysis and Practices LECTURE#25 By Dr. Osman Sadiq Paracha.
Inferential Statistics Inferential statistics allow us to infer the characteristic(s) of a population from sample data Slightly different terms and symbols.
Aim: What is the P-value method for hypothesis testing? Quiz Friday.
Education 793 Class Notes Inference and Hypothesis Testing Using the Normal Distribution 8 October 2003.
Monday, October 21 Hypothesis testing using the normal Z-distribution. Student’s t distribution. Confidence intervals.
Descriptive and Inferential Statistics Descriptive statistics The science of describing distributions of samples or populations Inferential statistics.
The Idea of the Statistical Test. A statistical test evaluates the "fit" of a hypothesis to a sample.
Chapter 9 Hypothesis Testing Understanding Basic Statistics Fifth Edition By Brase and Brase Prepared by Jon Booze.
1 Day 2: Search June 14, 2016 Carnegie Mellon University Center for Causal Discovery.
Tetrad 1)Main website: 2)Download:
Ex St 801 Statistical Methods Part 2 Inference about a Single Population Mean (HYP)
Inference about the slope parameter and correlation
Chapter 8 Hypothesis Testing with Two Samples.
Workshop Files
Workshop Files
Statistical Tests P Values.
Center for Causal Discovery: Summer Short Course/Datathon
Center for Causal Discovery: Summer Short Course/Datathon
Correlation and Regression
Statistical Power.
Presentation transcript:

1 Center for Causal Discovery: Summer Workshop June 8-11, 2015 Carnegie Mellon University

Outline 1)Motivation 2)Representing/Modeling Causal Systems 3)Estimation and Model fit 4)Hands on with Real Data 2

3 Estimation

4

5 Tetrad Demo and Hands-on 1)Select Template: “Estimate from Simulated Data” 2)Build the SEM shown below – all error standard deviations = 1.0 (go into the Tabular Editor) 3)Generate simulated data N=1000 4)Estimate model. 5)Save session as “Estimate1”

6 Estimation

7 Coefficient inference vs. Model Fit Coefficient Inference: Null: coefficient = 0, e.g.,  X1  X3 = 0 p-value = p(Estimated value  X1  X3 ≥.4788 |  X1  X3 = 0 & rest of model correct) Reject null (coefficient is “significant”) when p-value < ,  usually =.05

8 Coefficient inference vs. Model Fit Coefficient Inference: Null: coefficient = 0, e.g.,  X1  X3 = 0 p-value = p(Estimated value  X1  X3 ≥.4788 |  X1  X3 = 0 & rest of model correct) Reject null (coefficient is “significant”) when p-value < < ,  usually =.05, Model fit: Null: Model is correctly specified (constraints true in population) p-value = p(f(Deviation(  ml,S)) ≥ | Model correctly specified)

9 Coefficient inference vs. Model Fit >.05 Can’t reject 0, insignificant edge Can’t reject correct specification, model may be correctly specified Model fit  2 df  X1  X3 coefficient <.05 Can reject correct specification, Model not correctly specified Can reject 0 Significant edge p-value Null:  X1  X3 = 0 Null: Model is correctly specified

10 Model Fit X1X2X3 X11 X2 11 1 X3  1*  2 22 1 X1X2X3 X11 X2.61 X Specified ModelTrue Model  1 = r X1,X2 = ~.6  2 = r X2,X3 = ~.5  X1,X3 =  1   2 = ~.3 Implied Covariance MatrixPopulation Covariance Matrix

11 Model Fit X1X2X3 X11 X2 11 1 X3  1*  2 22 1 X1X2X3 X11 X2.61 X Specified Model True Model Population Covariance Matrix Unless r X1,X3 = r X1,X2 r X2,X3 Implied Covariance Matrix Estimated Covariance Matrix ≠ Sample Covariance Matrix

12 Model Fit X1X2X3 X11 X2 11 1 X3  1*  2 22 1 X1X2X3 X11 X2.61 X Specified Model True Model Population Covariance Matrix Implied Covariance Matrix Model fit: Null: Model is correctly specified (constraints true in population)  X1,X3 =  X1,X2  X2,X3 p-value = p(f(Deviation(  ml,S)) ≥  2 | Model correctly specified)

13

14 Tetrad Demo and Hands-on 1)Create two DAGs with the same variables – each with one edge flipped, and attach a SEM PM to each new graph (copy and paste by selecting nodes, Ctl-C to copy, and then Ctl-V to paste) 2)Estimate each new model on the data produced by original graph 3)Check p-values of: a)Edge coefficients b)Model fit 4)Save session as: “estimation2”

15 Break

16 Charitable Giving What influences giving? Sympathy? Impact? "The Donor is in the Details", Organizational Behavior and Human Decision Processes, Issue 1, 15-23, C. Cryder, with G. Loewenstein, R. Scheines. N = 94 TangibilityCondition[1,0]Randomly assigned experimental condition Imaginability[1..7]How concrete scenario I Sympathy[1..7]How much sympathy for target Impact[1..7]How much impact will my donation have AmountDonated[0..5]How much actually donated

Theoretical Hypothesis 17 Hypothesis 2

18 Tetrad Demo and Hands-on 1)Load charity.txt (tabular – not covariance data) 2)Build graph of theoretical hypothesis 3)Build SEM PM from graph 4)Estimate PM, check results

19 Foreign Investment Does Foreign Investment in 3 rd World Countries inhibit Democracy? Timberlake, M. and Williams, K. (1984). Dependence, political exclusion, and government repression: Some cross-national evidence. American Sociological Review 49, N = 72 POdegree of political exclusivity CVlack of civil liberties ENenergy consumption per capita (economic development) FIlevel of foreign investment

Case Study: Foreign Investment Alternative Models There is no model with testable constraints (df > 0) that is not rejected by the data, in which FI has a positive effect on PO.

21 Tetrad Demo and Hands-on 1)Load tw.txt (this IS covariance data) 2)Do a regression 3)Build an alternative hypothesis, Graph - SEM PM, SEM IM 4)Estimate PM, check results

22 Hands On Lead and IQ Lead:Lead concentration in baby teeth CIQ:child’s IQ score at 7 PIQ:Parent’s average IQ MED:mother’s education (years) NLB: number of live births prior to child MAB:mother’s age at birth of child FAB:father’s age at birth of child

23 Hands On Lead and IQ 1)Load leadiq1.tet 2)Specify different hypotheses, test the model fit on each 3)See if you can find a model (without using search), that is not rejected by the data