Center for Causal Discovery: Summer Short Course/Datathon

Slides:



Advertisements
Similar presentations
Probability models- the Normal especially.
Advertisements

Structural Equation Modeling
The Multiple Regression Model.
Topic Outline Motivation Representing/Modeling Causal Systems
Outline 1)Motivation 2)Representing/Modeling Causal Systems 3)Estimation and Updating 4)Model Search 5)Linear Latent Variable Models 6)Case Study: fMRI.
Chapter 12 Simple Regression
Hypothesis Testing Steps of a Statistical Significance Test. 1. Assumptions Type of data, form of population, method of sampling, sample size.
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
Inferences About Means of Single Samples Chapter 10 Homework: 1-6.
Inferences About Means of Single Samples Chapter 10 Homework: 1-6.
1 SOC 3811 Basic Social Statistics. 2 Announcements  Assignment 2 Revisions (interpretation of measures of central tendency and dispersion) — due next.
1 Center for Causal Discovery: Summer Workshop June 8-11, 2015 Carnegie Mellon University.
1 Day 2: Search June 9, 2015 Carnegie Mellon University Center for Causal Discovery.
1 © Lecture note 3 Hypothesis Testing MAKE HYPOTHESIS ©
1 Tetrad: Machine Learning and Graphcial Causal Models Richard Scheines Joe Ramsey Carnegie Mellon University Peter Spirtes, Clark Glymour.
Using Lock5 Statistics: Unlocking the Power of Data
Inferential Statistics 2 Maarten Buis January 11, 2006.
+ Chapter 12: Inference for Regression Inference for Linear Regression.
1 Tutorial: Causal Model Search Richard Scheines Carnegie Mellon University Peter Spirtes, Clark Glymour, Joe Ramsey, others.
© The McGraw-Hill Companies, Inc., Chapter 11 Correlation and Regression.
1 Center for Causal Discovery: Summer Workshop June 8-11, 2015 Carnegie Mellon University.
Welcome to Econ 420 Applied Regression Analysis Study Guide Week Six.
Independent Samples 1.Random Selection: Everyone from the Specified Population has an Equal Probability Of being Selected for the study (Yeah Right!)
Part 2: Model and Inference 2-1/49 Regression Models Professor William Greene Stern School of Business IOMS Department Department of Economics.
+ Chapter 12: More About Regression Section 12.1 Inference for Linear Regression.
Inferential Statistics Body of statistical computations relevant to making inferences from findings based on sample observations to some larger population.
Discussion of time series and panel models
Environmental Modeling Basic Testing Methods - Statistics III.
Education 793 Class Notes Inference and Hypothesis Testing Using the Normal Distribution 8 October 2003.
Monday, October 21 Hypothesis testing using the normal Z-distribution. Student’s t distribution. Confidence intervals.
1 Day 2: Search June 9, 2015 Carnegie Mellon University Center for Causal Discovery.
4-1 MGMG 522 : Session #4 Choosing the Independent Variables and a Functional Form (Ch. 6 & 7)
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
1 Day 2: Search June 14, 2016 Carnegie Mellon University Center for Causal Discovery.
Chapter 9 Introduction to the t Statistic
Methods of Presenting and Interpreting Information Class 9.
CHAPTER 12 More About Regression
Inference about the slope parameter and correlation
Chapter 11 – Test of Independence - Hypothesis Test for Proportions of a Multinomial Population In this case, each element of a population is assigned.
AP Biology Intro to Statistics
Day 3: Search Continued Center for Causal Discovery June 15, 2015
8-1 of 23.
Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.
Chapter 8 Hypothesis Testing with Two Samples.
CHAPTER 12 More About Regression
AP Biology Intro to Statistics
Chapter 6 Predicting Future Performance
Workshop Files
Further Inference in the Multiple Regression Model
Workshop Files
Making Data-Based Decisions
Statistical Tests P Values.
Review: What influences confidence intervals?
Center for Causal Discovery: Summer Short Course/Datathon
Discrete Event Simulation - 4
Monday, October 19 Hypothesis testing using the normal Z-distribution.
Reasoning in Psychology Using Statistics
Reasoning in Psychology Using Statistics
Correlation and Regression
CHAPTER 6 Statistical Inference & Hypothesis Testing
Tutorial 1: Misspecification
CHAPTER 12 More About Regression
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 12 More About Regression
Chapter 6 Predicting Future Performance
Making Inferences about Slopes
Statistical Power.
Presentation transcript:

Center for Causal Discovery: Summer Short Course/Datathon - 2018 June 11-15, 2018 Carnegie Mellon University

Outline Models  Data Representing/Modeling Causal Systems Estimation and Model fit Hands on with Real Data Models  Data Markov Axiom and D-separation Model Equivalence Model Search

Estimation

Estimation

Tetrad Demo and Hands-on Build the standardized SEM IM shown below Generate simulated data N=1000 Estimate model. Save session as “Estimate1”

Estimation

Coefficient inference vs. Model Fit Coefficient Inference: Null: coefficient = 0, e.g., bX1 X3 = 0 p-value = p(Estimated value bX1 X3 ≥ .4788 | bX1 X3 = 0 & rest of model correct) Reject null (coefficient is “significant”) when p-value < a, a usually = .05

Coefficient inference vs. Model Fit Coefficient Inference: Null: coefficient = 0, e.g., bX1 X3 = 0 p-value = p(Estimated value bX1 X3 ≥ .4788 | bX1 X3 = 0 & rest of model correct) Reject null (coefficient is “significant”) when p-value < < a, a usually = .05, Model fit: Null: Model is correctly specified (constraints true in population) p-value = p(f(Deviation(Sml,S)) ≥ 5.7137 | Model correctly specified)

Coefficient inference vs. Model Fit bX1 X3 coefficient Model fit c2 df Null: bX1 X3 = 0 Null: Model is correctly specified p-value Can reject 0 Significant edge Can reject correct specification, Model not correctly specified < .05 Can’t reject 0, insignificant edge Can’t reject correct specification, model may be correctly specified p-value > .05

Model Fit Specified Model True Model Implied Covariance Matrix Population Covariance Matrix X1 X2 X3 1 b1 b1*b2 b2 X1 X2 X3 1 .6 .3 .5 b1 = rX1,X2 = ~ .6 b2 = rX2,X3 = ~ .5 b1 b2 = ~ .3 = rX1,X3

Model Fit ≠ rX1,X3 = ~ .5 Specified Model True Model Implied Covariance Matrix Population Covariance Matrix X1 X2 X3 1 b1 b1*b2 b2 X1 X2 X3 1 .6 .5 b1 = rX1,X2 = ~ .6 b2 = rX2,X3 = ~ .5 b1 b2 = ~ .3 = rX1,X3 ≠ rX1,X3 = ~ .5

Model Fit Specified Model True Model Implied Covariance Matrix Population Covariance Matrix X1 X2 X3 1 b1 b1*b2 b2 X1 X2 X3 1 .6 .5 Unless rX1,X3 = rX1,X2 rX2,X3 Estimated Covariance Matrix ≠ Sample Covariance Matrix

Model Fit Specified Model True Model Implied Covariance Matrix Population Covariance Matrix X1 X2 X3 1 b1 b1*b2 b2 X1 X2 X3 1 .6 .5 Model fit: Null: Model is correctly specified (constraints true in population) rX1,X3 = rX1,X2 rX2,X3 p-value = p(f(Deviation(Sml,S)) ≥ c2 | Model correctly specified)

Tetrad Demo and Hands-on Create two DAGs with the same variables – each with one edge flipped, and attach a SEM PM to each new graph (copy and paste by selecting nodes, Ctl-C to copy, and then Ctl-V to paste) Estimate each new model on the data produced by original graph Check p-values of: Edge coefficients Model fit Save session as: “estimation2”

Charitable Giving What influences giving? Sympathy? Impact? "The Donor is in the Details", Organizational Behavior and Human Decision Processes, Issue 1, 15-23, C. Cryder, with G. Loewenstein, R. Scheines. N = 94 TangibilityCondition [1,0] Randomly assigned experimental condition Imaginability [1..7] How concrete scenario I Sympathy [1..7] How much sympathy for target Impact [1..7] How much impact will my donation have AmountDonated [0..5] How much actually donated 1

Theoretical Hypothesis

Tetrad Demo and Hands-on Load charity.txt (tabular – not covariance data) Build graph of theoretical hypothesis Build SEM PM from graph Estimate PM, check results

Foreign Investment Does Foreign Investment in 3rd World Countries inhibit Democracy? Timberlake, M. and Williams, K. (1984). Dependence, political exclusion, and government repression: Some cross-national evidence. American Sociological Review 49, 141-146. N = 72 PO degree of political exclusivity CV lack of civil liberties EN energy consumption per capita (economic development) FI level of foreign investment 1

Tetrad Demo and Hands-on Load tw.txt (this IS covariance data) Do a regression Manually build an alternative hypotheses, Graph - SEM PM, SEM IM Estimate PM, check results

College Plans (Shaw,1979) 10,000 HS Seniors – Madison, WI Sex: 1,2 IQ: 1,2,3,4 SES: 1,2,3,4 PE: 1,2 CP: 1,2

College Plans (Hands-on) Load in College_plans_data Use file: College_plans_data Set Variables = discrete Attach Data  Graph Build plausible graph Add PM (input Graph and Data) Add Estimator Estimate Model