Statistics for Social and Behavioral Sciences Part IV: Causality Randomized Experiments, ANOVA Chapter 12, Section 12.1 Prof. Amine Ouazad.

Slides:



Advertisements
Similar presentations
Multiple Regression W&W, Chapter 13, 15(3-4). Introduction Multiple regression is an extension of bivariate regression to take into account more than.
Advertisements

Statistics for Social and Behavioral Sciences Session #16: Confidence Interval and Hypothesis Testing (Agresti and Finlay, from Chapter 5 to Chapter 6)
Statistics for Social and Behavioral Sciences Session #11: Random Variable, Expectations (Agresti and Finlay, Chapter 4) Prof. Amine Ouazad.
Statistics for Social and Behavioral Sciences Session #9: Linear Regression and Conditional distribution Probabilities (Agresti and Finlay, Chapter 9)
July 1, 2008Lecture 17 - Regression Testing1 Testing Relationships between Variables Statistics Lecture 17.
DATA ANALYSIS I MKT525. Plan of analysis What decision must be made? What are research objectives? What do you have to know to reach those objectives?
Lecture 19: Tues., Nov. 11th R-squared (8.6.1) Review
The Simple Regression Model
Inferences About Means of Single Samples Chapter 10 Homework: 1-6.
IEEM 3201 Two-Sample Estimation: Paired Observation, Difference.
Statistics 101 Class 9. Overview Last class Last class Our FAVORATE 3 distributions Our FAVORATE 3 distributions The one sample Z-test The one sample.
Chapter 9 - Lecture 2 Computing the analysis of variance for simple experiments (single factor, unrelated groups experiments).
Lecture 13: Tues., Feb. 24 Comparisons Among Several Groups – Introduction (Case Study 5.1.1) Comparing Any Two of the Several Means (Chapter 5.2) The.
CHAPTER 10 ESTIMATION AND HYPOTHESIS TESTING: TWO POPULATIONS Prem Mann, Introductory Statistics, 7/E Copyright © 2010 John Wiley & Sons. All right reserved.
Quantitative Business Methods for Decision Making Estimation and Testing of Hypotheses.
Statistics for Social and Behavioral Sciences Part IV: Causality Association and Causality Session 22 Prof. Amine Ouazad.
Midterm 1 Well done !! Mean 80.23% Median 84.6% Standard deviation of ppt. 5 th percentile is 53.
Statistics for Social and Behavioral Sciences Session #15: Interval Estimation, Confidence Interval (Agresti and Finlay, Chapter 5) Prof. Amine Ouazad.
Statistics for Social and Behavioral Sciences Session #17: Hypothesis Testing: The Confidence Interval Method and the T-Statistic Method (Agresti and Finlay,
Statistics for Social and Behavioral Sciences Part IV: Causality Multivariate Regression Chapter 11 Prof. Amine Ouazad.
Inference for regression - Simple linear regression
Statistics for Social and Behavioral Sciences Session #18: Literary Analysis using Tests (Agresti and Finlay, from Chapter 5 to Chapter 6) Prof. Amine.
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
Statistics for Social and Behavioral Sciences Session #14: Estimation, Confidence Interval (Agresti and Finlay, Chapter 5) Prof. Amine Ouazad.
More About Significance Tests
Lecturer’s desk INTEGRATED LEARNING CENTER ILC 120 Screen Row A Row B Row C Row D Row E Row F Row G Row.
Statistics for Social and Behavioral Sciences
Today’s lesson Confidence intervals for the expected value of a random variable. Determining the sample size needed to have a specified probability of.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2015 Room 150 Harvill.
Chapter 9 Hypothesis Testing and Estimation for Two Population Parameters.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Statistics for Social and Behavioral Sciences Session #6: The Regression Line C’ted (Agresti and Finlay, Chapter 9) Prof. Amine Ouazad.
Statistics for the Behavioral Sciences Second Edition Chapter 11: The Independent-Samples t Test iClicker Questions Copyright © 2012 by Worth Publishers.
Psychology 301 Chapters & Differences Between Two Means Introduction to Analysis of Variance Multiple Comparisons.
Statistics for Social and Behavioral Sciences Part IV: Causality Multivariate Regression R squared, F test, Chapter 11 Prof. Amine Ouazad.
Chapter 10: Analyzing Experimental Data Inferential statistics are used to determine whether the independent variable had an effect on the dependent variance.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Statistics for Social and Behavioral Sciences Part IV: Causality Inference for Slope and Correlation Section 9.5 Prof. Amine Ouazad.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 13 Multiple Regression Section 13.3 Using Multiple Regression to Make Inferences.
Lecturer’s desk INTEGRATED LEARNING CENTER ILC 120 Screen Row A Row B Row C Row D Row E Row F Row G Row.
Statistics for Social and Behavioral Sciences Part IV: Causality Comparison of two groups Chapter 7 Prof. Amine Ouazad.
Statistics: Unlocking the Power of Data Lock 5 Exam 2 Review STAT 101 Dr. Kari Lock Morgan 11/13/12 Review of Chapters 5-9.
Environmental Modeling Basic Testing Methods - Statistics III.
Three Broad Purposes of Quantitative Research 1. Description 2. Theory Testing 3. Theory Generation.
Review Lecture 51 Tue, Dec 13, Chapter 1 Sections 1.1 – 1.4. Sections 1.1 – 1.4. Be familiar with the language and principles of hypothesis testing.
Aron, Aron, & Coups, Statistics for the Behavioral and Social Sciences: A Brief Course (3e), © 2005 Prentice Hall Chapter 6 Hypothesis Tests with Means.
Inferential Statistics 4 Maarten Buis 18/01/2006.
Lecturer’s desk INTEGRATED LEARNING CENTER ILC 120 Screen Row A Row B Row C Row D Row E Row F Row G Row.
June 30, 2008Stat Lecture 16 - Regression1 Inference for relationships between variables Statistics Lecture 16.
STATISTICAL INFERENCE PART VI HYPOTHESIS TESTING 1.
Statistics for Social and Behavioral Sciences Session #19: Estimation and Hypothesis Testing, Wrap-up & p-value (Agresti and Finlay, from Chapter 5 to.
MATB344 Applied Statistics I. Experimental Designs for Small Samples II. Statistical Tests of Significance III. Small Sample Test Statistics Chapter 10.
BHS Methods in Behavioral Sciences I May 9, 2003 Chapter 6 and 7 (Ray) Control: The Keystone of the Experimental Method.
Jump to first page Inferring Sample Findings to the Population and Testing for Differences.
Review Chapter 1-3. Exam 1 25 questions 50 points 90 minutes 1 attempt Results will be known once the exam closes for everybody.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Chapter 22 Inferential Data Analysis: Part 2 PowerPoint presentation developed by: Jennifer L. Bellamy & Sarah E. Bledsoe.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2016 Room 150 Harvill.
Statistics and probability Dr. Khaled Ismael Almghari Phone No:
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Fall 2015 Room 150 Harvill.
Inference about the slope parameter and correlation
REGRESSION G&W p
Estimation & Hypothesis Testing for Two Population Parameters
Part Three. Data Analysis
CHAPTER 29: Multiple Regression*
Chapter 12 Inference on the Least-squares Regression Line; ANOVA
IE 355: Quality and Applied Statistics I Confidence Intervals
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Introductory Statistics
Presentation transcript:

Statistics for Social and Behavioral Sciences Part IV: Causality Randomized Experiments, ANOVA Chapter 12, Section 12.1 Prof. Amine Ouazad

Some of their claims Does the data back these claims??

Weight Loss Diet Are weight loss diets effective at reducing weight? 1.Dr Atkins’ new diet revolution. 2.Slim Fast Plan. 3.Weight Watchers. 4.Rosemary Conley’s eat yourself slim and fitness plan.

Statistics Course Outline P ART I. I NTRODUCTION AND R ESEARCH D ESIGN P ART II. D ESCRIBING DATA P ART III. D RAWING CONCLUSIONS FROM DATA : I NFERENTIAL S TATISTICS P ART IV. : C ORRELATION AND C AUSATION : T WO G ROUPS, R EGRESSION A NALYSIS Week 1 Weeks 2-4 Weeks 5-9 Weeks Randomized Experiments, ANOVA Estimating a parameter using sample statistics. Confidence Interval at 90%, 95%, 99% Testing a hypothesis using the CI method and the t method. Sample statistics: Mean, Median, SD, Variance, Percentiles, IQR, Empirical Rule Bivariate sample statistics: Correlation, Slope Four Steps of “Thinking Like a Statistician” Study Design: Simple Random Sampling, Cluster Sampling, Stratified Sampling Biases: Nonresponse bias, Response bias, Sampling bias

Coming up “Comparison of Two Groups” Last week. “Univariate Regression Analysis” Last Saturday, Section 9.5. “Association and Causality: Multivariate Regression” Last Saturday, Chapter 10. Monday, Tuesday, Chapter 11 – R Squared, F test. “Randomized Experiments and ANOVA”. Today. Chapter 12. “Wrap Up”. Tomorrow.

Measuring the Effectiveness of Weight Loss Programs Observational data: – Collect data on a sample of N individuals. – Ask the following questions: Did you follow a weight loss program in the last 6 months? Yes/No What was your weight (kg) 6 months ago? What is your weight (kg) now? – Compute the weight loss for each individual i in the sample. t test of the difference of means.

Outline 1.BBC Diet Program 2.Results: Are Diets Effective? 3.ANOVA: Are All Diets Equally Effective? Next time:Wrap Up

Experimental Data BBC Diet Program Identified potential participants via a BBC advertising campaign (television and other form of media). – Participants had a body mass index between 27 and 40. – Lives within 30 miles of a test center. – Aged between 18 and 65. If you take part in the experiment, we will randomly assign you a diet and you will follow that diet that we randomly chose for you. Sample Size N = 210. Started in July 2002.

Outline 1.BBC Diet Program 2.Results: Are Diets Effective? 3.ANOVA: Are All Diets Equally Effective? Next time:Wrap Up

Are Diets Effective? Number of individuals in each group

Is the Atkins Diet Ineffective: “H 0 :  Atkins =0” Mean weight loss in the Atkins Diet group: 8.5 kg. Number of participants in the Atkins Diet group: 29 Standard deviation of weight loss in the Atkins Diet group: 6.0 kg 95% confidence interval for mean weight loss? [,] t statistic for mean weight loss? Can we reject the null hypothesis at 95% confidence?

Is the Atkins Diet as Effective as the Weight Watchers Diet? H 0 : “  Atkins =  Weight Watchers ” H a : “  Atkins ≠  Weight Watchers ” See session on Employment Discrimination. Test of the difference of means.

se 1 = se 2 = se = m 1 -m 2 = t = Can we reject the null hypothesis that the Atkins diet is as effective as the Weight Watchers diet? Is the Atkins Diet (1) as Effective as the Weight Watchers Diet (2) ?

Outline 1.BBC Diet Program 2.Results: Are Diets Effective? 3.ANOVA: Are All Diets Equally Effective? Next time:Wrap Up

ANalysis Of VAriance (ANOVA) Question: Are all diets equally effective? H 0 : “  Atkins =  Weight Watchers =  Slim Fast =  Conley ” H a : “At least one diet is different from the others” – Could be for instance that Slim Fast is less effective than another diet (e.g.  Slim Fast <  Conley ). – Or for instance Conley is more effective than other diets (e.g.  Conley <  Atkins 

Formally, for K means: H0 : “  1 =  2 =…=  K ” H a : “For at least one pair k,k’  k ≠  k’ ” As usual we need: – The statistic: an F statistic – The sampling distribution of the statistic under the H 0 : and its degrees of freedom. – The p value of the statistic. ANalysis Of VAriance (ANOVA)

F Statistic Reminder: Variance is the square of the standard deviation. Follows an F distribution with df 1 =G-1 and df 2 =N-G degrees of freedom. Reject the H 0 at 95% confidence level, i.e. 5% significance level, if the p value is lower than The p value is given by ‘display Ftail(df 1,df 2,F)’ in Stata.

Within group variance We know the standard deviation for each group: 6.0, 7.9, 5.7, 5.5, Therefore we know the variance in each group: The within group is approximately the average of those variances: See section 12.1 of Agresti and Finlay if interested

Between group variance Take the mean of each diet group: Consider the overall mean across all diet groups: Compute the variance of those means, weighted by the number ng of observations in each diet group: See section 12.1 of Agresti and Finlay if interested

Applying t tests and ANOVA to Experimental Data We used three different methods: – Confidence interval on a mean. – t test for the equality of a mean to 0. – t test for the equality of two means. – ANOVA for the equality of multiple means. Are diets effective?  Yes !!! But not much difference between the 4 different ones we saw.

Coming up: Wrap Up Coverage for the final just ends right after the F test. Chapter on “Association and Causality”, and “Multivariate Regression”. Make sure you come to sessions and the last recitation. ANOVA is in chapter 12. Important for your future career !!! SundayMonday Multivariate Regression Tuesday Multivariate Regression The F test Wednesday Randomized Experiments and ANOVA Thursday Wrap up RecitationEvening session 7.30pm West Administration 002 Usual class 12.45pm Usual room Evening session 7.30pm West Administration 001 Usual class 12.45pm Usual room