Data Screening: Don’t Leave Home Without It

Slides:



Advertisements
Similar presentations
10-3 Inferences.
Advertisements

Correlation and Regression
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Simple Linear Regression Estimates for single and mean responses.
Session 2. Applied Regression -- Prof. Juran2 Outline for Session 2 More Simple Regression –Bottom Part of the Output Hypothesis Testing –Significance.
Chapter 4 Describing the Relation Between Two Variables
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: What it Is and How it Works Overview What is Bivariate Linear Regression? The Regression Equation How It’s Based on r.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: What it Is and How it Works. Overview What is Bivariate Linear Regression? The Regression Equation How It’s Based on r.
Regression and Correlation
Topics: Regression Simple Linear Regression: one dependent variable and one independent variable Multiple Regression: one dependent variable and two or.
Topic 3: Regression.
Welcome to class today! Chapter 12 summary sheet Jimmy Fallon video
Slide Copyright © 2010 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Business Statistics First Edition.
Statistics for the Social Sciences Psychology 340 Spring 2005 Course Review.
1 Chapter 17: Introduction to Regression. 2 Introduction to Linear Regression The Pearson correlation measures the degree to which a set of data points.
© 2001 Prentice-Hall, Inc.Chap 14-1 BA 201 Lecture 23 Correlation Analysis And Introduction to Multiple Regression (Data)Data.
STA291 Statistical Methods Lecture 27. Inference for Regression.
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
Chapter 14 – Correlation and Simple Regression Math 22 Introductory Statistics.
Chapter 4 Correlation and Regression Understanding Basic Statistics Fifth Edition By Brase and Brase Prepared by Jon Booze.
1 Experimental Statistics - week 10 Chapter 11: Linear Regression and Correlation.
Elementary Statistics Correlation and Regression.
Correlation & Regression
Lecture 9 TWO GROUP MEANS TESTS EPSY 640 Texas A&M University.
Basic Concepts of Correlation. Definition A correlation exists between two variables when the values of one are somehow associated with the values of.
1 Multiple Regression A single numerical response variable, Y. Multiple numerical explanatory variables, X 1, X 2,…, X k.
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
Correlation & Regression Chapter 15. Correlation It is a statistical technique that is used to measure and describe a relationship between two variables.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Correlation and Regression: The Need to Knows Correlation is a statistical technique: tells you if scores on variable X are related to scores on variable.
Inferential Statistics 4 Maarten Buis 18/01/2006.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 12 More About Regression 12.1 Inference for.
LESSON 6: REGRESSION 2/21/12 EDUC 502: Introduction to Statistics.
D/RS 1013 Data Screening/Cleaning/ Preparation for Analyses.
A Clear and Present Danger to the Caribbean: Climate Change or Climate Alarmism? Reynold J. Stone Department of Food Production.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 12 More About Regression 12.1 Inference for.
Nonparametric Statistics Overview. Objectives Understand Difference between Parametric and Nonparametric Statistical Procedures Nonparametric methods.
Welcome to Applied Multiple Regression (Psych 308c)! Spring 2013 MI Professor Dale Berger ACB101 Tue Teaching.
Using the regression equation (17.6). Why regression? 1.Analyze specific relations between Y and X. How is Y related to X? 2.Forecast / Predict the variable.
Inference about the slope parameter and correlation
Covariance/ Correlation
BASIC ECONOMETRICS.
CHAPTER 12 More About Regression
12 Inferential Analysis.
Chapter 6 Predicting Future Performance
Covariance/ Correlation
Covariance/ Correlation
BIVARIATE REGRESSION AND CORRELATION
Correlation and Regression
2. Find the equation of line of regression
Lecture 14 Review of Lecture 13 What we’ll talk about today?
Nonparametric Statistics Overview
Comparing Several Means: ANOVA
Introduction to bivariate data
Anscombe’s Quartet.
Correlation and Regression
Why study statistics?.
Correlation and Regression
12 Inferential Analysis.
CHAPTER 12 More About Regression
Comparing Two Populations
Doing t-tests by hand.
Covariance/ Correlation
3 4 Chapter Describing the Relation between Two Variables
Day 68 Agenda: 30 minute workday on Hypothesis Test --- you have 9 worksheets to use as practice Begin Ch 15 (last topic)
CHAPTER 12 More About Regression
Chapter 6 Predicting Future Performance
Inference for Who? Students at I.S.U. What? Time (minutes).
Inferences 10-3.
Presentation transcript:

Data Screening: Don’t Leave Home Without It AEA Coffee Break Webinar 8 September 2011 Dale Berger dale.berger@cgu.edu http://wise.cgu.edu

Why screen data? Locate and correct errors Assure that assumptions of statistical procedures are met Become familiar with the data Context is critical

Bivariate data example Data constructed by Anscombe consists of four sets of 11 pairs of X-Y scores. Can these four sets of data be pooled? Bumble applied regression to each set separately and recorded summary statistics. Anscombe, F. J. (1973). Graphs in statistical analysis. The American Statistician, 27, 17-21.

These summary statistics are identical in all four sets Sample Size (N) 11 Mean of X 9.0 Mean of Y 7.5 Correlation 0.816 Linear Equation y′ = 3 + .5x Regression SS 27.50 Residual SS 13.75 (df = 9)

Bumble’s Conclusions: There is a strong linear relationship between X and Y, as is apparent from r = .819, F(1, 9) = 18.00, p=.00217. All four data sets are equivalent and probably were sampled from the same population. What more would you like to know?

Anscombe’s four data sets

Data Set 1

Data Set 2

Data Set 3

Data Set 4

You can see a lot by just looking ---Yogi Berra If you don’t look, you won’t see it. --- DB

The Most Important Test in Statistics The Intra-Ocular Trauma Test To err is human; to really screw up it takes a computer!

Thank you! Questions and comments? dale.berger@cgu.edu http://wise.cgu.edu