Dependent t-tests. Factors affecting statistical power in the t-test Statistical power ability to identify a statistically significant difference when.

Slides:



Advertisements
Similar presentations
Comparing Two Means: One-sample & Paired-sample t-tests Lesson 12.
Advertisements

Defining Characteristics
Independent t -test Features: One Independent Variable Two Groups, or Levels of the Independent Variable Independent Samples (Between-Groups): the two.
PTP 560 Research Methods Week 9 Thomas Ruediger, PT.
Review of the Basic Logic of NHST Significance tests are used to accept or reject the null hypothesis. This is done by studying the sampling distribution.
Inference for a population mean BPS chapter 18 © 2006 W. H. Freeman and Company.
Statistical Decision Making
Using Statistics in Research Psych 231: Research Methods in Psychology.
PSY 307 – Statistics for the Behavioral Sciences
Nemours Biomedical Research Statistics March 19, 2009 Tim Bunnell, Ph.D. & Jobayer Hossain, Ph.D. Nemours Bioinformatics Core Facility.
Lecture 11 Psyc 300A. Null Hypothesis Testing Null hypothesis: the statistical hypothesis that there is no relationship between the variables you are.
PSY 307 – Statistics for the Behavioral Sciences
Inferences about Means of Dependent Samples Chapter 12 Homework: 1-4, 7 Problems 3, 4, & 7: skip parts i and l, do not calculate U in part n.
Lecture 9: One Way ANOVA Between Subjects
PSYC512: Research Methods PSYC512: Research Methods Lecture 9 Brian P. Dyre University of Idaho.
Testing the Difference Between Means (Small Independent Samples)
PSY 307 – Statistics for the Behavioral Sciences Chapter 16 – One-Way ANOVA (Cont.)
PSY 307 – Statistics for the Behavioral Sciences Chapter 19 – Chi-Square Test for Qualitative Data Chapter 21 – Deciding Which Test to Use.
Today Concepts underlying inferential statistics
Chapter 14 Inferential Data Analysis
Descriptive Statistics
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Significance and Meaningfulness Effect Size & Statistical Power 1.
Jeopardy Hypothesis Testing T-test Basics T for Indep. Samples Z-scores Probability $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
Comparing Means From Two Sets of Data
Statistical Power The ability to find a difference when one really exists.
The Hypothesis of Difference Chapter 10. Sampling Distribution of Differences Use a Sampling Distribution of Differences when we want to examine a hypothesis.
Chapter 10 Comparing Two Means Target Goal: I can use two-sample t procedures to compare two means. 10.2a h.w: pg. 626: 29 – 32, pg. 652: 35, 37, 57.
Inferential Statistics 2 Maarten Buis January 11, 2006.
Where are we?. What we have covered: - How to write a primary research paper.
S519: Evaluation of Information Systems Social Statistics Inferential Statistics Chapter 10: t test.
PSY 307 – Statistics for the Behavioral Sciences Chapter 16 – One-Factor Analysis of Variance (ANOVA)
B AD 6243: Applied Univariate Statistics Hypothesis Testing and the T-test Professor Laku Chidambaram Price College of Business University of Oklahoma.
Psychology 301 Chapters & Differences Between Two Means Introduction to Analysis of Variance Multiple Comparisons.
Welcome to Econ 420 Applied Regression Analysis Study Guide Week Six.
Independent Samples 1.Random Selection: Everyone from the Specified Population has an Equal Probability Of being Selected for the study (Yeah Right!)
Independent t-tests Uses a sampling distribution of differences between means 1.
Jeopardy Hypothesis Testing t-test Basics t for Indep. Samples Related Samples t— Didn’t cover— Skip for now Ancient History $100 $200$200 $300 $500 $400.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Chapter 13 - ANOVA. ANOVA Be able to explain in general terms and using an example what a one-way ANOVA is (370). Know the purpose of the one-way ANOVA.
Review - Confidence Interval Most variables used in social science research (e.g., age, officer cynicism) are normally distributed, meaning that their.
Chapter 10 The t Test for Two Independent Samples
SPSS Workshop Day 2 – Data Analysis. Outline Descriptive Statistics Types of data Graphical Summaries –For Categorical Variables –For Quantitative Variables.
Chapter 13 Repeated-Measures and Two-Factor Analysis of Variance
T-test Testing Inferences about Population Means.
T tests comparing two means t tests comparing two means.
T Test for Two Independent Samples. t test for two independent samples Basic Assumptions Independent samples are not paired with other observations Null.
Hypothesis test flow chart frequency data Measurement scale number of variables 1 basic χ 2 test (19.5) Table I χ 2 test for independence (19.9) Table.
Student’s t test This test was invented by a statistician WS Gosset ( ), but preferred to keep anonymous so wrote under the name “Student”. This.
Chapter 11 The t-Test for Two Related Samples
Chapter 13 Understanding research results: statistical inference.
HYPOTHESIS TESTING FOR DIFFERENCES BETWEEN MEANS AND BETWEEN PROPORTIONS.
Chapter 7 Inference Concerning Populations (Numeric Responses)
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Inferential Statistics Psych 231: Research Methods in Psychology.
The Paired-Samples t Test Chapter 10. Research Design Issues >So far, everything we’ve worked with has been one sample One person = Z score One sample.
Inferential Statistics Psych 231: Research Methods in Psychology.
When the means of two groups are to be compared (where each group consists of subjects that are not related) then the excel two-sample t-test procedure.
Dependent-Samples t-Test
CJ 526 Statistical Analysis in Criminal Justice
Statistics for the Social Sciences
Comparing Populations
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Statistical Power.
Rest of lecture 4 (Chapter 5: pg ) Statistical Inferences
Presentation transcript:

dependent t-tests

Factors affecting statistical power in the t-test Statistical power ability to identify a statistically significant difference when a difference between means actually exists

Decision Table: Correct DECISION REALITYREALITY Truth is everlasting, but our ideas about truth are interchangeable

Factors affecting statistical power in the t-test  level how much risk are YOU willing to take in making a Type I error Frank & Huck (1986, RQES): Why does everyone use the 0.05 level of significance? 0.01 conservative 0.10 liberal Power

Factors affecting statistical power in the t-test  level df (number of subjects) affects variability associated with the sample mean & variability within the sample limited by time & money GREATER n = GREATER POWER (point of diminishing return)

Statistics Humour One day there was a fire in a wastebasket in the Dean's office and in rushed a physicist, a chemist, and a statistician. The physicist immediately starts to work on how much energy would have to be removed from the fire to stop the combustion. The chemist works on which reagent would have to be added to the fire to prevent oxidation. While they are doing this, the statistician is setting fires to all the other wastebaskets in the office. "What are you doing?" they demanded. "Well to solve the problem, obviously you need a large sample size" the statistician replies.

Factors affecting statistical power in the t-test  level df (number of subjects) magnitude of the mean difference how different are the treatments imposed measurement errors sampling errors SIZE OF THE TREATMENT EFFECT

Factors affecting statistical power in the t-test  level df (number of subjects) magnitude of the mean difference variability how specified is your population control of extraneous variables

Estimated Standard Error of the Difference between 2 independent means

t-test for independent samples Smaller is better

Comparing paired (correlated) measures instead of group (uncorrelated) measures Match subjects what factors (variables) might affect time to exhaustion on the exercise bike daily diet? Fitness level? Genetics? Height? Weight? Age? Regular training program?

Comparing paired (correlated) measures instead of group (uncorrelated) measures Match subjects Repeated measures measure the SAME subject under both protocols test & retest pre & posttest condition 1 & condition 2

Comparing paired (correlated) measures instead of group (uncorrelated) measures Match subjects Repeated measures Subject serves as own Control

Comparing paired (correlated) measures instead of group (uncorrelated) measures Match subjects Repeated measures Subject serves as own Control Intra-subject variability should be LESS than Inter-subject variability

Dependent t-test (paired or correlated t-test) Pairs of scores are matched same subject in 2 conditions or matched subjects Question: Does ankle bracing affect load during landing? IV: brace condition DV: Vertical GRF

Steps to dependent t-test Set  (0.05) Set sample size One randomly selected group n = 7 condition 1: Brace condition 2: No brace Set H o (null hypothesis)

Set statistical hypotheses H o Null hypothesis Any observed difference between the two conditions will be attributable to random sampling error. H A Alternative hypothesis If H o is rejected, the difference is not attributable to random sampling error perhaps brace???

Steps to independent t-test Set  (0.05) Set sample size (n = 7) Set H o Test each subject in both conditions with a standardized protocol (drop landings) Note: condition performance order is randomized across subjects

GRF data

Steps to dependent t-test Set  (0.05) Set sample size (n = 7) Set H o Test each subject in both conditions Calculate descriptive statistics of each condition scattergram mean, SD, n

Figure 1. Scattergram of vertical GRF during landing in different brace conditions (N/kg)

Descriptive statistics for atble401.sav data

Steps to dependent t-test Set  (0.05) Set sample size (n = 7) Set H o Test each subject in both conditions Calculate descriptive statistics of each condition compare the condition means

How to compare the condition means Even if the two conditions were the same (samples drawn from the same population), would not expect the statistics to be the same Need a measure of expected variability against which the mean of the difference between paired scores (X i - Y i ) could be compared

Paired scores, so the data are somewhat correlated Calculate the difference between the two conditions for each case (X i - Y i ) Calculate the Mean Difference Use the correlation among the pairs of scores to reduce the error term (denominator) used to evaluate the difference between the means

t-test for dependent (paired) samples t = M diff SE MD

GRF data  = -20 Mean Diff = -2.9

t-test for dependent (paired) samples t = M diff SE MD Standard error of the Mean difference for Paired Scores

Estimated Standard Error of the Difference between 2 dependent means ?

If r = 0, this term reduces to the same equation as for independent groups

t-test for dependent (paired) samples t = M diff SE MD df = ??

t-test for dependent (paired) samples t = M diff SE MD df = n pairs - 1

Running the dependent t-test with SPSS Enter the data as pairs atble401.sav

Reporting paired t-test outcome Table 1. Descriptive statistics of vertical ground reaction force (in N/kg) for the two conditions (n = 7)

Reporting t-test outcome * Figure 1. Mean vertical GRF in the two conditions (* p  0.05)

Reporting t-test in text Descriptive statistics of the vertical ground reaction force (VGRF) data during landing in the two braced conditions are presented in Table 1 and graphically in Figure 1. A paired t-test indicated that the mean VGRF of 10.9 (SD = 3.5) N/kg in the braced condition was significantly higher (  = 0.05) than the mean VGRF of 8.0 (4.3) N/kg in the unbraced condition (t 6 = 3.57, p = 0.012). The mean difference of 2.9 N/kg represents a 36% higher VGRF during the landings with a brace compared to without a brace.

What if you set  = 0.01? Descriptive statistics of the vertical ground reaction force (VGRF) data during landing in the two braced conditions are presented in Table 1 and graphically in Figure 1. A paired t-test indicated that the mean VGRF of 10.9 (SD = 3.5) N/kg in the braced condition was...

What if you set  = 0.01? Descriptive statistics of the vertical ground reaction force (VGRF) data during landing in the two braced conditions are presented in Table 1 and graphically in Figure 1. A paired t-test indicated that the mean VGRF of 10.9 (SD = 3.5) N/kg in the braced condition was significantly higher (  = 0.01) than the mean VGRF of 8.0 (4.3) N/kg in the unbraced condition (t 6 = 3.57, p = 0.012). The mean difference of 2.9 N/kg represents a 36% higher VGRF during the landings with a brace compared to without a brace. not

Statistics Humour A student set forth on a quest To learn which of the world’s beers was best But his wallet was dried out At the first pub he tried out With two samples he flunked the means test Gehlbach, SH (2002) Interpreting the medical literature

Summary: both t-tests are of the form: t = Standard Error Mean Difference

To increase statistical power t = Standard Error Mean Difference Maximize Minimize

Choosing which t-test to use Independent no correlation between the two groups Dependent two sets of data (pair of scores) from matched subjects or from the same subject (repeated measures) data are correlated

Time for Lunch