Stat 470-4 Today: Multiple comparisons, diagnostic checking, an example


After these notes, we will have looked at 1.1-1.3 (skip figures 1.2 and 1.3 and the last two paragraphs of section 1.3), 1.6 (skip matrix notation and constraints), 1.7 (Tukey method only), 1.9 (ignore H matrix notation on page 35), 2.1, and 2.2. We will not do 1.5 or 1.8. Assignment 1:

Multiple Comparisons
In the previous example, we saw that there was a significant treatment effect… so what? If an ANOVA is conducted and the analysis suggests that there is a significant treatment effect, then a reasonable question to ask is which treatments differ from one another.

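Multiple Comparisons
Would like to see if there is a difference between treatments i and j. Can use the two-sample t-test statistic to do this. For testing whether treatments i and j differ, reject the hypothesis of no difference when the absolute value of the statistic exceeds the appropriate t critical value. Perform many of these tests, one for each pair of treatments.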
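The pairwise statistic described on this slide can be sketched in code. This is a minimal illustration, assuming the usual one-way ANOVA form in which the difference of two treatment means is divided by the pooled standard error, with the pooled variance taken from all k groups on N - k degrees of freedom. The function name `pairwise_t` and the data are hypothetical, not taken from the course example.

```python
from itertools import combinations
from math import sqrt
from statistics import mean


def pairwise_t(groups):
    """Return {(i, j): t_ij} for every pair of treatment groups (0-based)."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    means = [mean(g) for g in groups]
    # Pooled within-treatment variance (the ANOVA mean square error), N - k df.
    sse = sum((y - m) ** 2 for g, m in zip(groups, means) for y in g)
    sigma_hat = sqrt(sse / (n_total - k))
    t_stats = {}
    for i, j in combinations(range(k), 2):
        se = sigma_hat * sqrt(1 / len(groups[i]) + 1 / len(groups[j]))
        t_stats[(i, j)] = (means[j] - means[i]) / se
    return t_stats


# Hypothetical data: three treatments, four observations each.
data = [
    [10.0, 12.0, 11.0, 13.0],
    [14.0, 15.0, 13.0, 16.0],
    [10.0, 11.0, 12.0, 11.0],
]
t_stats = pairwise_t(data)
for pair, t in t_stats.items():
    print(pair, round(t, 3))
```

Each statistic would then be compared with a critical value from a t table on N - k degrees of freedom. The critical value is left out here so that the sketch needs only the standard library.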

Multiple Comparisons
Perform many of these tests. With many tests, the chance of at least one false rejection grows, so the overall error rate must be controlled.
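A rough calculation shows why the error rate needs controlling. The sketch below counts the pairwise comparisons among k treatments and computes the chance of at least one false rejection, under the simplifying assumption that the tests are independent. Pairwise comparisons share data and are not truly independent, so the figure is only an illustration. The function names are hypothetical.

```python
from math import comb


def n_pairs(k):
    """Number of pairwise comparisons among k treatments."""
    return comb(k, 2)


def familywise_error(alpha, m):
    """Chance of at least one false rejection in m independent level-alpha tests."""
    return 1 - (1 - alpha) ** m


k = 5            # hypothetical number of treatments
alpha = 0.05     # per-comparison significance level
m = n_pairs(k)
fwe = familywise_error(alpha, m)
print(f"{m} comparisons, approximate overall error rate {fwe:.3f}")
```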

Tukey Method
Tests:
Confidence Interval:
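The formulas for this slide did not survive in the transcript. As a hedged reconstruction, the Tukey method is usually stated in the following form, which may differ in notation from the original slide. Here $\bar{y}_{i\cdot}$ is the mean of treatment i, $n_i$ its sample size, $\hat{\sigma}$ the square root of the ANOVA mean square error, k the number of treatments, N the total number of observations, and $q_{k,N-k,\alpha}$ the upper $\alpha$ point of the studentized range distribution.

```latex
% Pairwise statistic for treatments i and j
t_{ij} = \frac{\bar{y}_{j\cdot} - \bar{y}_{i\cdot}}
              {\hat{\sigma}\sqrt{1/n_j + 1/n_i}}

% Tukey test: declare treatments i and j different at level \alpha if
|t_{ij}| > \frac{1}{\sqrt{2}}\, q_{k,\,N-k,\,\alpha}

% Simultaneous 100(1-\alpha)\% confidence intervals for \tau_j - \tau_i
\bar{y}_{j\cdot} - \bar{y}_{i\cdot}
  \;\pm\; \frac{1}{\sqrt{2}}\, q_{k,\,N-k,\,\alpha}\,
          \hat{\sigma}\sqrt{1/n_j + 1/n_i}
```

The studentized range critical value replaces the t critical value, which is what keeps the overall error rate across all pairs at the stated level.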

Back to Example

Diagnostic Checking – Residual Analysis
To support the assumptions on which the analysis is based, we need to check for:
–effects not captured by the model
–unequal variances
–non-Normality
–sequence effects
Should do this before hypothesis testing and multiple comparisons. The data plot (limited data) shows no strong evidence of non-Normality or unequal variances.

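Diagnostic Checking
ANOVA model: $y_{ij} = \mu + \tau_i + e_{ij}$. Predicted response: $\hat{y}_{ij} = \hat{\mu} + \hat{\tau}_i = \bar{y}_{i\cdot}$, where $\bar{y}_{i\cdot}$ is the mean of the observations in treatment i. Residual: $r_{ij} = y_{ij} - \hat{y}_{ij}$, which estimates the error $e_{ij}$.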
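As a small sketch of the quantities on this slide: in the one-way layout the predicted response for every observation is its treatment mean, and the residual is the observation minus that mean. The function name `fit_one_way` and the data are hypothetical.

```python
from statistics import mean


def fit_one_way(groups):
    """Return (fitted, residuals), each shaped like the input groups."""
    # The fitted value for every observation in a group is the group mean.
    fitted = [[mean(g)] * len(g) for g in groups]
    # The residual is the observed value minus its fitted value.
    residuals = [
        [y - f for y, f in zip(g, fit)] for g, fit in zip(groups, fitted)
    ]
    return fitted, residuals


# Hypothetical data: three treatments, four observations each.
data = [
    [10.0, 12.0, 11.0, 13.0],
    [14.0, 15.0, 13.0, 16.0],
    [10.0, 11.0, 12.0, 11.0],
]
fitted, residuals = fit_one_way(data)
for fit, res in zip(fitted, residuals):
    print(fit[0], res)
```

Because each fitted value is a group mean, the residuals within every treatment sum to zero, which is a quick sanity check on the computation.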

Diagnostic Plots
Errors are assumed to be normally distributed, to be independent, and to have equal variances in each group. Each assumption has its own useful plot for checking it.

Normality Check
–Dot plot or histogram of residuals
–Normal probability plot of residuals (via software or by hand; see class handout)
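The coordinates of a normal probability plot can be computed by hand along the following lines. This sketch assumes the plotting position (i - 0.5)/n, which is one common convention and may differ from the one in the class handout. The function name and residual values are hypothetical.

```python
from statistics import NormalDist


def normal_plot_points(residuals):
    """Return (normal score, ordered residual) pairs for a probability plot."""
    n = len(residuals)
    ordered = sorted(residuals)
    std_normal = NormalDist()  # mean 0, standard deviation 1
    # Normal score for the i-th smallest residual, using (i - 0.5) / n.
    scores = [std_normal.inv_cdf((i - 0.5) / n) for i in range(1, n + 1)]
    return list(zip(scores, ordered))


# Hypothetical residuals.
res = [0.5, -1.5, 1.5, -0.5, 0.0]
points = normal_plot_points(res)
for score, r in points:
    print(round(score, 3), r)
```

If the residuals are roughly Normal, plotting the ordered residuals against the normal scores should give points close to a straight line.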

Independence Check
Plot residuals in the time sequence in which the data were collected. The X-axis denotes the sequence and the Y-axis denotes the residual values. Should observe no systematic pattern.

Independence Check
Suppose the sequence of the observations (going across rows from top to bottom in the tabled data) is 1, 2, 11, 9, 5, 7, 6, 3, 4, 12, 10, 8.
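Putting residuals into collection order for the sequence plot might look like the sketch below. The run sequence is the one given on the slide. The residual values are hypothetical stand-ins listed in table order, since the tabled data are not reproduced in these notes.

```python
def residuals_in_run_order(residuals, sequence):
    """Pair each residual with its run number and sort by run number."""
    if sorted(sequence) != list(range(1, len(sequence) + 1)):
        raise ValueError("sequence must be a permutation of 1..n")
    if len(residuals) != len(sequence):
        raise ValueError("need one run number per residual")
    return sorted(zip(sequence, residuals))


# Run order from the slide, listed in table order (across rows, top to bottom).
sequence = [1, 2, 11, 9, 5, 7, 6, 3, 4, 12, 10, 8]
# Hypothetical residuals in the same table order.
table_residuals = [-1.5, 0.5, -0.5, 1.5, -0.5, 0.5, -1.5, 1.5, -1.0, 0.0, 1.0, 0.0]

ordered = residuals_in_run_order(table_residuals, sequence)
for run, r in ordered:
    print(run, r)
```

The sorted pairs give the X (run number) and Y (residual) coordinates for the sequence plot.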

Equal Variances
A useful plot is:
Should observe:
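The plot named on this slide is not recoverable from the transcript. One common choice, assumed here, is to plot residuals against fitted values (the treatment means) and look for a similar vertical spread in every group. The sketch below builds those plot coordinates and also reports each group's sample standard deviation as a numerical companion. Function names and data are hypothetical.

```python
from statistics import mean, stdev


def residual_plot_points(groups):
    """Return (fitted value, residual) pairs for a residual-versus-fitted plot."""
    points = []
    for g in groups:
        m = mean(g)  # fitted value for every observation in this group
        points.extend((m, y - m) for y in g)
    return points


def group_spreads(groups):
    """Return the sample standard deviation of each treatment group."""
    return [stdev(g) for g in groups]


# Hypothetical data: three treatments, four observations each.
data = [
    [10.0, 12.0, 11.0, 13.0],
    [14.0, 15.0, 13.0, 16.0],
    [10.0, 11.0, 12.0, 11.0],
]
points = residual_plot_points(data)
spreads = group_spreads(data)
ratio = max(spreads) / min(spreads)
print([round(s, 3) for s in spreads], round(ratio, 3))
```

A ratio of largest to smallest standard deviation far from 1, or a fan shape in the plot, would suggest unequal variances. With only a few observations per group such summaries are noisy, so they are a guide, not a formal test.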

Equal Variances

Comments
The F-test is fairly robust – it is not very sensitive to departures from the assumption of Normal distributions. Often, simple transformations, such as the logarithm or square root, can make the Normal distribution assumption and the equal variance assumption more appropriate (Chapter 2).
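The effect of a transformation on unequal variances can be seen in a contrived case. The hypothetical data below are constructed so that the spread in each group is proportional to its mean. Real data will not behave this cleanly, so the example only illustrates the idea.

```python
from math import log
from statistics import stdev

# Hypothetical data: each group is ten times the previous one,
# so the spread grows in proportion to the mean.
raw = [
    [1.0, 2.0, 4.0],
    [10.0, 20.0, 40.0],
    [100.0, 200.0, 400.0],
]
logged = [[log(y) for y in g] for g in raw]

raw_sd = [stdev(g) for g in raw]
log_sd = [stdev(g) for g in logged]

print("raw scale SDs:", [round(s, 3) for s in raw_sd])
print("log scale SDs:", [round(s, 3) for s in log_sd])
```

On the raw scale the standard deviations differ by orders of magnitude. After taking logarithms they agree, because multiplying a group by a constant only shifts its logged values.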

Summary: Completely Randomized Design, One-Way ANOVA
Method: random assignment of treatments to experimental units. ANOVA: compare variation among treatments to variation within treatments to assess evidence of a difference among treatments. Investigate and identify differences among treatments, if any. Act on the findings.

Comment: One-Way Model
The one-way model, $y_{ij} = \mu + \tau_i + e_{ij}$, $e_{ij} \sim \mathrm{NID}(0, \sigma^2)$, can be and is applied to data obtained in ways other than a completely randomized design. Example: starting salaries for MBAs at different companies. Company is not a treatment that is applied to experimental units. Analyzing the data according to the above model can answer whether apparent differences between companies are real or could be just due to chance. The randomness involved comes from the randomness of the hiring and salary-determination processes, not the random assignment of treatments to experimental units.

General Linear Model
The ANOVA model can be viewed as a special case of the general linear model or regression model. Suppose we have a response, y, which is thought to be related to p predictors (sometimes called explanatory variables or regressors). Predictors: $x_1, x_2, \ldots, x_p$. Model: $y = \beta_0 + \beta_1 x_1 + \cdots + \beta_p x_p + e$
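To see why the ANOVA model is a special case, consider a one-way layout with three treatments written as a regression on indicator variables. The coding below, with treatment 1 as the baseline, is one of several equivalent choices and is an assumption of this sketch, not necessarily the one used in the course text. The symbols $\mu$ and $\tau_i$ denote the overall mean and treatment effects of the one-way model.

```latex
% Indicator (dummy) predictors for a one-way layout with k = 3 treatments
x_1 = \begin{cases} 1 & \text{observation is from treatment 2} \\
                    0 & \text{otherwise} \end{cases}
\qquad
x_2 = \begin{cases} 1 & \text{observation is from treatment 3} \\
                    0 & \text{otherwise} \end{cases}

% Regression form of the one-way model
y = \beta_0 + \beta_1 x_1 + \beta_2 x_2 + e

% Correspondence with y_{ij} = \mu + \tau_i + e_{ij}
\beta_0 = \mu + \tau_1, \qquad
\beta_1 = \tau_2 - \tau_1, \qquad
\beta_2 = \tau_3 - \tau_1
```

Under this coding, testing for no treatment effect amounts to testing $\beta_1 = \beta_2 = 0$.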

Example: Rainfall (Exercise 2.16)
In winter, a plastic rain gauge cannot be used to collect precipitation because it will freeze and crack. Instead, metal cans are used to collect snowfall, and the snow is allowed to melt indoors. The water is then poured into a plastic rain gauge and a measurement is recorded. An estimate of snowfall is obtained by multiplying this measurement by a fixed conversion factor. One observer questions this and decides to collect data to test the validity of this approach. For each rainfall in a summer, she measures the rainfall (i) using a plastic rain gauge and (ii) using a metal can. What is the current model being used?

Example: Rainfall (Exercise 2.16)

Seems to be a linear relationship. Will use regression to establish the linear relationship between x and y. What should the slope be?
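Fitting the line by least squares could be sketched as follows. The x and y values are hypothetical placeholders, not the exercise data, and the function name is an assumption of this sketch.

```python
from statistics import mean


def least_squares_line(x, y):
    """Return (intercept, slope) of the least squares line of y on x."""
    xbar, ybar = mean(x), mean(y)
    sxy = sum((xi - xbar) * (yi - ybar) for xi, yi in zip(x, y))
    sxx = sum((xi - xbar) ** 2 for xi in x)
    slope = sxy / sxx
    intercept = ybar - slope * xbar
    return intercept, slope


# Hypothetical paired measurements.
x = [1.0, 2.0, 3.0, 4.0]
y = [0.4, 0.9, 1.3, 1.8]
intercept, slope = least_squares_line(x, y)
print(f"intercept = {intercept:.3f}, slope = {slope:.3f}")
```

If the current practice of multiplying by a fixed factor is valid, the fitted slope should be close to that factor and the intercept close to zero.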

Example: Rainfall (Exercise 2.16)