1 Experimental Statistics - week 2 Review: 2-sample t-tests paired t-tests Thursday: Meet in 15 Clements!! Bring Cody and Smith book.

Slides:



Advertisements
Similar presentations
One-sample T-Test Matched Pairs T-Test Two-sample T-Test
Advertisements

Statistical Methods Lynne Stokes Department of Statistical Science Lecture 7: Introduction to SAS Programming Language.
Inferential Statistics
Is it statistically significant?
Chapter 8 The t Test for Independent Means Part 2: Oct. 15, 2013.
Review of the Basic Logic of NHST Significance tests are used to accept or reject the null hypothesis. This is done by studying the sampling distribution.
Comparing k Populations Means – One way Analysis of Variance (ANOVA)
Confidence Interval and Hypothesis Testing for:
Multiple regression analysis
PSY 307 – Statistics for the Behavioral Sciences
Lecture 13 – Tues, Oct 21 Comparisons Among Several Groups – Introduction (Case Study 5.1.1) Comparing Any Two of the Several Means (Chapter 5.2) The One-Way.
MARE 250 Dr. Jason Turner Hypothesis Testing II To ASSUME is to make an… Four assumptions for t-test hypothesis testing: 1. Random Samples 2. Independent.
MARE 250 Dr. Jason Turner Hypothesis Testing II. To ASSUME is to make an… Four assumptions for t-test hypothesis testing:
Regression Diagnostics Using Residual Plots in SAS to Determine the Appropriateness of the Model.
ANalysis Of VAriance (ANOVA) Comparing > 2 means Frequently applied to experimental data Why not do multiple t-tests? If you want to test H 0 : m 1 = m.
Independent Samples and Paired Samples t-tests PSY440 June 24, 2008.
Test statistic: Group Comparison Jobayer Hossain Larry Holmes, Jr Research Statistics, Lecture 5 October 30,2008.
Testing for differences between 2 means Does the mean weight of cats in Toledo differ from the mean weight of cats in Cleveland? Do the mean quiz scores.
Inferences About Process Quality
Data Analysis Statistics. Levels of Measurement Nominal – Categorical; no implied rankings among the categories. Also includes written observations and.
5-3 Inference on the Means of Two Populations, Variances Unknown
Lecture 13: Tues., Feb. 24 Comparisons Among Several Groups – Introduction (Case Study 5.1.1) Comparing Any Two of the Several Means (Chapter 5.2) The.
Summary of Quantitative Analysis Neuman and Robson Ch. 11
WINKS 7 Tutorial 5 Tutorial 5 – Creating a data set and entering data (Comparing Two Means, t-test) Permission granted for use for instruction and for.
Psy B07 Chapter 1Slide 1 ANALYSIS OF VARIANCE. Psy B07 Chapter 1Slide 2 t-test refresher  In chapter 7 we talked about analyses that could be conducted.
Inferential Statistics: SPSS
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
Experimental Statistics - week 2
1 Experimental Statistics - week 3 Statistical Inference 2-sample Hypothesis Tests Review Continued Chapter 8: Inferences about More Than 2 Population.
Jeopardy Hypothesis Testing T-test Basics T for Indep. Samples Z-scores Probability $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
STAT 5372: Experimental Statistics Wayne Woodward Office: Office: 143 Heroy Phone: Phone: (214) URL: URL: faculty.smu.edu/waynew.
Introduction to SAS Essentials Mastering SAS for Data Analytics
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 14 Comparing Groups: Analysis of Variance Methods Section 14.2 Estimating Differences.
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Statistical Inferences Based on Two Samples Chapter 9.
1 Experimental Statistics - week 4 Chapter 8: 1-factor ANOVA models Using SAS.
Week 111 Power of the t-test - Example In a metropolitan area, the concentration of cadmium (Cd) in leaf lettuce was measured in 7 representative gardens.
Introduction to SAS Essentials Mastering SAS for Data Analytics
1 Experimental Statistics - week 10 Chapter 11: Linear Regression and Correlation Note: Homework Due Thursday.
STA305 week21 The One-Factor Model Statistical model is used to describe data. It is an equation that shows the dependence of the response variable upon.
Testing Multiple Means and the Analysis of Variance (§8.1, 8.2, 8.6) Situations where comparing more than two means is important. The approach to testing.
T- and Z-Tests for Hypotheses about the Difference between Two Subsamples.
INTRODUCTION TO ANALYSIS OF VARIANCE (ANOVA). COURSE CONTENT WHAT IS ANOVA DIFFERENT TYPES OF ANOVA ANOVA THEORY WORKED EXAMPLE IN EXCEL –GENERATING THE.
PCB 3043L - General Ecology Data Analysis. OUTLINE Organizing an ecological study Basic sampling terminology Statistical analysis of data –Why use statistics?
The Completely Randomized Design (§8.3)
Analysis of Variance 1 Dr. Mohammed Alahmed Ph.D. in BioStatistics (011)
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 13 Multiple Regression Section 13.3 Using Multiple Regression to Make Inferences.
Nonparametric Statistics
ANOVA: Analysis of Variance.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 14 Comparing Groups: Analysis of Variance Methods Section 14.1 One-Way ANOVA: Comparing.
Week111 The t distribution Suppose that a SRS of size n is drawn from a N(μ, σ) population. Then the one sample t statistic has a t distribution with n.
Single-Factor Studies KNNL – Chapter 16. Single-Factor Models Independent Variable can be qualitative or quantitative If Quantitative, we typically assume.
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Experimental Statistics - week 3
- We have samples for each of two conditions. We provide an answer for “Are the two sample means significantly different from each other, or could both.
Computing with SAS Software A SAS program consists of SAS statements. 1. The DATA step consists of SAS statements that define your data and create a SAS.
Other Types of t-tests Recapitulation Recapitulation 1. Still dealing with random samples. 2. However, they are partitioned into two subsamples. 3. Interest.
CRD, Strength of Association, Effect Size, Power, and Sample Size Calculations BUSI 6480 Lecture 4.
Chapter 1 Introduction to Statistics. Section 1.1 Fundamental Statistical Concepts.
Applied Epidemiologic Analysis - P8400 Fall 2002 Lab 3 Type I, II Error, Sample Size, and Power Henian Chen, M.D., Ph.D.
Learning Objectives After this section, you should be able to: The Practice of Statistics, 5 th Edition1 DESCRIBE the shape, center, and spread of the.
Chapter 7 Inference Concerning Populations (Numeric Responses)
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Hypothesis Testing Start with a question:
This Week Review of estimation and hypothesis testing
Psychology 202a Advanced Psychological Statistics
6-1 Introduction To Empirical Models
Introduction to SAS Essentials Mastering SAS for Data Analytics
STATISTICS INFORMED DECISIONS USING DATA
Presentation transcript:

1 Experimental Statistics - week 2 Review: 2-sample t-tests paired t-tests Thursday: Meet in 15 Clements!! Bring Cody and Smith book

2 p- Value (observed value of t) p-value Suppose t = is observed from data for test above Note: “ Large negative values” of t make us believe alternative is true the probability of an observation as extreme or more extreme than the one observed when the null is true

3 Note: -- if p-value is less than or equal to  then we reject null at the  significance level -- the p-value is the smallest level of significance at which the null hypothesis would be rejected

4 Find the p-values for Examples 1 and 2

5

6 Two Independent Samples Assumptions: Measurements from each population are –Mutually Independent  Independent within each sample  Independent between samples –Normally distributed (or the Central Limit Theorem can be invoked) Analysis differs based on whether the 2 populations have the same standard deviation

7 Two Cases Population standard deviations equal –Can obtain a better estimate of the common standard deviation by combining or “pooling” individual estimates Population standard deviations unequal –Must estimate each standard deviation –Very good approximate tests are available If Unsure, Do Not Assume Equal Standard Deviations

8 Equal Population Standard Deviations Test Statistic df = n 1 + n where

9 Behrens-Fisher Problem

10 Satterthwaite’s Approximate t Statistic (Approximate t df) (i.e. approximate t)

11 Often-Recommended Strategy for Tests on Means Test whether  1 =  2 (F-test ) –If the test is not rejected, use the 2-sample t statistics, assuming equal standard deviations –If the test is rejected, use Satterthwaite’s approximate t statistic NOTE: This is Not a good strategy –the F-test is highly susceptible to non-normality Recommended Strategy: –If uncertain about whether the standard deviations are equal, use Satterthwaite’s approximate t statistic

12 Example 3: Comparing the Mean Breaking Strengths of 2 Plastics Plastic A: Plastic B: Assumptions: Mutually independent measurements Normal distributions for measurements from each type of plastic Question: Question: Is there a difference between the 2 plastics in terms of mean breaking strength?

13 Example 3 - solution

14

15 Design: 50 people: randomly assign 25 to go on diet and 25 to eat normally for next month. Assess results by comparing weights at end of 1 month. Diet: No Diet: Diet: No Diet: Run 2-sample t-test using guidelines we have discussed. Is this a good design? New diet – Is it effective?

16 Better Design: Randomly select subjects and measure them before and after 1-month on the diet. Subject Before After : : : n Difference 3 15 : -3 Procedure: Calculate differences, and analyze differences using a 1-sample test “Paired t-Test”

17 Example 4: International Gymnastics Judging Question: Do judges from a contestant’s country rate their own contestant higher than do foreign judges? Data:

18 Example 4 solution

19

Introduction to SAS Programming Language

21 Fertilizer Data Brand 1 Brand cm 54.0 cm A researcher studies the effect of two fertilizer brands on the growth of plants. Thirteen plants grown under identical conditions except that 7 plants are randomly selected to receive Brand 1 and the remaining 6 are fertilized using Brand 2. The data for this experiment are as follows where the outcome measurement is the height of the plant after 3 weeks of growth (you may assume the heights to be normally distributed):

22 The Fertilizer data set as SAS needs to see it A 51.0 A 53.3 A 55.6 A 51.0 A 55.5 A 53.0 A 52.1 B 54.0 B 56.1 B 52.1 B 56.4 B 54.0 B 52.9

23 Case 1: Data within SAS FILE : DATA one; INPUT brand$ height; DATALINES; A 51.0 A B 54.0 E 52.9 ; PROC TTEST; CLASS brand; VAR height; TITLE ‘Fertilizer Data – 2-sample t-test'; RUN; SAS file for FERTILIZER data

24 Brief Discussion of Components of the SAS File: DATA Step DATA STATEMENT - the first DATA statement names the data set whose variables are defined in the INPUT statement -- in the above, we create data set 'one' INPUT STATEMENT - 2 forms 1. Freefield - can be used when data values are separated by 1 or more blanks INPUT NAME $ AGE SEX $ SCORE; ($ indicates character variable) 2. Formatted - data occur in fixed columns INPUT NAME $ 1-20 AGE SEX $ 26 SCORE 28-30; DATALINES STATEMENT - used to indicate that the next records in the file contain the actual data and the semicolon after the data indicates the end of the data itself

25 SPECIFYING THE ANALYSIS SPECIFYING THE ANALYSIS -- PROC STATEMENTS GENERAL FORM PROC xxxxx; implies procedure is to be run on most recently created data set PROC xxxxx DATA = data set name; Note: I did not have to specify DATA=one in the above example Example PROCs: PROC REG - regression analysis PROC ANOVA - analysis of variance PROC GLM - general linear model PROC MEANS - basic statistics, t-test for H 0 :  PROC PLOT - plotting PROC TTEST - t-tests PROC UNIVARIATE - descriptive stats, box-plots, etc. PROC BOXPLOT - boxplots

26 PROC TTEST Proc TTEST data = fn ; Class … ; (specify the classification variable) Var … / options; ( specify the variable for which the means are compared) Run;

27 SAS Syntax MUSTEvery command MUST end with a semicolon –Commands can continue over two or more lines Variable names are 1-8 characters (letters and numerals, beginning with a letter or underscore), but no blanks or special characters –Note: values for character variables can exceed 8 characters Comments –Begin with *, end with ;

28 Titles and Labels TITLE ‘…’ ; –Up to 10 title lines: TITLE ‘include your title here’; –Can be placed in Data Steps or Procs LABEL name = ‘…’ ; –Can be in a DATA STEP or PROC PRINT –Include ALL labels, then a single ; Note: For class assignments, place descriptive titles and labels on the output.

29 Case 2: Data in External File : FILENAME f1 ‘complete directory/file specification’; FILENAME f1 ‘fertilizer.data'; DATA one; INFILE f1; INPUT brand$ height; PROC TTEST; CLASS brand; VAR height; TITLE ‘Fertilizer Data – 2-sample t-test'; RUN;

30 PC SAS on Campus Library BIC Student Center SAS Learning Edition $125