Statistics for Social and Behavioral Sciences Part IV: Causality Inference for Slope and Correlation Section 9.5 Prof. Amine Ouazad.

Slides:



Advertisements
Similar presentations
Chapter 12 Inference for Linear Regression
Advertisements

BA 275 Quantitative Business Methods
Statistics for Social and Behavioral Sciences Session #16: Confidence Interval and Hypothesis Testing (Agresti and Finlay, from Chapter 5 to Chapter 6)
Statistics for Social and Behavioral Sciences Part IV: Causality Randomized Experiments, ANOVA Chapter 12, Section 12.1 Prof. Amine Ouazad.
Inference for Regression
Statistics for Social and Behavioral Sciences Session #11: Random Variable, Expectations (Agresti and Finlay, Chapter 4) Prof. Amine Ouazad.
Objectives (BPS chapter 24)
Statistics for Social and Behavioral Sciences Session #9: Linear Regression and Conditional distribution Probabilities (Agresti and Finlay, Chapter 9)
PSY 307 – Statistics for the Behavioral Sciences
Chapter 10 Simple Regression.
The Simple Regression Model
Review Chapter 1-3. Exam 1 25 questions 50 points 90 minutes 1 attempt Results will be known once the exam closes for everybody.
T-test.
Linear Regression and Correlation Analysis
Lecture 16 – Thurs, Oct. 30 Inference for Regression (Sections ): –Hypothesis Tests and Confidence Intervals for Intercept and Slope –Confidence.
STAT 101 Dr. Kari Lock Morgan Exam 2 Review.
Chapter 7 Forecasting with Simple Regression
Statistics for Social and Behavioral Sciences Part IV: Causality Association and Causality Session 22 Prof. Amine Ouazad.
Chapter 12 Section 1 Inference for Linear Regression.
Midterm 1 Well done !! Mean 80.23% Median 84.6% Standard deviation of ppt. 5 th percentile is 53.
Correlation & Regression
Statistics for Social and Behavioral Sciences Session #15: Interval Estimation, Confidence Interval (Agresti and Finlay, Chapter 5) Prof. Amine Ouazad.
Statistics for Social and Behavioral Sciences Session #17: Hypothesis Testing: The Confidence Interval Method and the T-Statistic Method (Agresti and Finlay,
Active Learning Lecture Slides
Statistics for Social and Behavioral Sciences Part IV: Causality Multivariate Regression Chapter 11 Prof. Amine Ouazad.
Statistics for the Social Sciences Psychology 340 Fall 2013 Thursday, November 21 Review for Exam #4.
Introduction to Linear Regression and Correlation Analysis
Inference for regression - Simple linear regression
Statistics for Social and Behavioral Sciences Session #18: Literary Analysis using Tests (Agresti and Finlay, from Chapter 5 to Chapter 6) Prof. Amine.
Statistics for Social and Behavioral Sciences Session #14: Estimation, Confidence Interval (Agresti and Finlay, Chapter 5) Prof. Amine Ouazad.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
Statistics for Social and Behavioral Sciences
1 G Lect 10a G Lecture 10a Revisited Example: Okazaki’s inferences from a survey Inferences on correlation Correlation: Power and effect.
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
Statistics for Social and Behavioral Sciences Session #6: The Regression Line C’ted (Agresti and Finlay, Chapter 9) Prof. Amine Ouazad.
QMS 6351 Statistics and Research Methods Regression Analysis: Testing for Significance Chapter 14 ( ) Chapter 15 (15.5) Prof. Vera Adamchik.
+ Chapter 12: Inference for Regression Inference for Linear Regression.
Statistics for Social and Behavioral Sciences Part IV: Causality Multivariate Regression R squared, F test, Chapter 11 Prof. Amine Ouazad.
Chapter 14 Inference for Regression AP Statistics 14.1 – Inference about the Model 14.2 – Predictions and Conditions.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Inference for regression - More details about simple linear regression IPS chapter 10.2 © 2006 W.H. Freeman and Company.
Simple Linear Regression ANOVA for regression (10.2)
Statistics for Social and Behavioral Sciences Part IV: Causality Comparison of two groups Chapter 7 Prof. Amine Ouazad.
Lecture 10 Chapter 23. Inference for regression. Objectives (PSLS Chapter 23) Inference for regression (NHST Regression Inference Award)[B level award]
Statistics: Unlocking the Power of Data Lock 5 Exam 2 Review STAT 101 Dr. Kari Lock Morgan 11/13/12 Review of Chapters 5-9.
Review Lecture 51 Tue, Dec 13, Chapter 1 Sections 1.1 – 1.4. Sections 1.1 – 1.4. Be familiar with the language and principles of hypothesis testing.
[1] Simple Linear Regression. The general equation of a line is Y = c + mX or Y =  +  X.  > 0  > 0  > 0  = 0  = 0  < 0  > 0  < 0.
Introducing Communication Research 2e © 2014 SAGE Publications Chapter Seven Generalizing From Research Results: Inferential Statistics.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 12 More About Regression 12.1 Inference for.
Statistics for Social and Behavioral Sciences Session #19: Estimation and Hypothesis Testing, Wrap-up & p-value (Agresti and Finlay, from Chapter 5 to.
Inference for regression - More details about simple linear regression IPS chapter 10.2 © 2006 W.H. Freeman and Company.
Dependent (response) Variable Independent (control) Variable Random Error XY x1x1 y1y1 x2x2 y2y2 …… xnxn ynyn Raw data: Assumption:  i ‘s are independent.
Review Chapter 1-3. Exam 1 25 questions 50 points 90 minutes 1 attempt Results will be known once the exam closes for everybody.
Lesson Testing the Significance of the Least Squares Regression Model.
Chapter 26: Inference for Slope. Height Weight How much would an adult female weigh if she were 5 feet tall? She could weigh varying amounts – in other.
Marginal Distribution Conditional Distribution. Side by Side Bar Graph Segmented Bar Graph Dotplot Stemplot Histogram.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 12 More About Regression 12.1 Inference for.
Inference for Regression (Chapter 14) A.P. Stats Review Topic #3
AP Statistics Chapter 14 Section 1.
Inference for Regression
Hypothesis Tests: One Sample
The Practice of Statistics in the Life Sciences Fourth Edition
Inference for Regression Lines
CHAPTER 29: Multiple Regression*
Section 11.2 Day 2.
CHAPTER 12 More About Regression
Chapter 14 Inference for Regression
Introductory Statistics
Presentation transcript:

Statistics for Social and Behavioral Sciences Part IV: Causality Inference for Slope and Correlation Section 9.5 Prof. Amine Ouazad

Statistics Course Outline P ART I. I NTRODUCTION AND R ESEARCH D ESIGN P ART II. D ESCRIBING DATA P ART III. D RAWING CONCLUSIONS FROM DATA : I NFERENTIAL S TATISTICS P ART IV. : C ORRELATION AND C AUSATION : T WO G ROUPS, R EGRESSION A NALYSIS Week 1 Weeks 2-4 Weeks 5-9 Weeks Multivariate regression coming up! Estimating a parameter using sample statistics. Confidence Interval at 90%, 95%, 99% Testing a hypothesis using the CI method and the t method. Sample statistics: Mean, Median, SD, Variance, Percentiles, IQR, Empirical Rule Bivariate sample statistics: Correlation, Slope Four Steps of “Thinking Like a Statistician” Study Design: Simple Random Sampling, Cluster Sampling, Stratified Sampling Biases: Nonresponse bias, Response bias, Sampling bias

Coming up “Comparison of Two Groups” Last Session. “Univariate Regression Analysis” This Session Saturday. (Section 9.5) “Association and Causality: Multivariate Regression” Thursday and Extra Session. Chapters 10 and 11. “Randomized Experiments (Cted), ANOVA”. Last Tuesday and Extra Session. Chapter 12. “Robustness Checks and Wrap Up”. Last Thursday.

Outline 1.Burger prices (y) and poverty (x) 2.Inference for the slope coefficient b 3.Inference for the correlation coefficient r Next time:Association and Causality

Burger prices and poverty Do fast food chains price their burger lower in neighborhoods that are poorer? Data on N = 167 stores in Price of the burger in each store.

Burger Prices and Percentage in Poverty in ZIP code

Recap on the slope coefficient y: dependent variable. x: explanatory or independent variable. y and x should be quantitative. Assume the relationship y i = a + b x i + e i. Slope coefficient b: An increase of x by 1 is associated with an increase of y by b. Relationship between slope and correlation:

Scatter plot Predicted values in red.

Outline 1.Burger prices (y) and poverty (x) 2.Inference for the slope coefficient b 3.Inference for the correlation coefficient r Next time:Association and Causality

Slope: Parameter vs Statistic True relationship: y =  +  x + . Parameter  would be measured if we had the entire population. On the sample, the statistic b is measured. – In general, b will not be equal to . Sampling distribution of b? Standard error of the slope coefficient b?

Confidence interval for the slope Build a confidence interval for the slope. t also provided by Table 5.1. Number of degrees of freedom df = N – 2. t statistic for the slope Built in a similar way as for previous statistics: Sampling distribution of the t statistic? Standard deviation of the t statistic? Number of degrees of freedom N-2.

Testing the null hypothesis H 0 : “  = 0”. H a : “  is different from 0” Two methods: – the confidence interval method and the t statistic method. Reject the null hypothesis at 95%: – if the 95% confidence interval does not include 0. – or if the t statistic is higher in absolute value than the t score at 95%. Same test can be performed for  using the value of a.

Standard Error of Slope b Given that the true relationship is y i =  +  x i +  i The estimated slope becomes: Realize that the source of uncertainty in b comes from the residual  i. or (efficient) (unbiased )

Two Examples b is precisely estimated.  has a low standard error. b is imprecisely estimated.  has a higher standard error. In both graphs, y = x + e SD(e) = 0.8SD(e) = 2

Back to Burgers Reading this output? Can we reject H0: “Fraction in poverty (prppov) has not impact on the burger price” ?

Outline 1.Burger prices (y) and poverty (x) 2.Inference for the slope coefficient b 3.Inference for the correlation coefficient r Next time:Association and Causality

Inference for the Correlation Coefficient True correlation  (a parameter) vs measured correlation r. H 0 : “  = 0 ” H a : “  is different from 0 “ Realize that, hence  = 0 if and only if  = 0. (Using the unbiased estimator of SD(  )). Degrees of freedom: N – 2.

Coming up: Reading : Chapter on “Comparing Two Groups”. Next chapter 9 with t tests for slope coefficients. Online quiz this weekend on this material. Session on Saturday at in the same room -> catch up for National Day. Make sure you come to sessions and recitations. For help: Amine Ouazad Office 1135, Social Science building Office hour: Tuesday from 5 to 6.30pm. GAF: Irene Paneda Sunday recitations. At the Academic Resource Center, Monday from 2 to 4pm.