MARE 250 Dr. Jason Turner Hypothesis Testing II To ASSUME is to make an… Four assumptions for t-test hypothesis testing: 1. Random Samples 2. Independent.

Slides:



Advertisements
Similar presentations
PTP 560 Research Methods Week 9 Thomas Ruediger, PT.
Advertisements

Inferential Statistics
Statistical Issues in Research Planning and Evaluation
5/15/2015Slide 1 SOLVING THE PROBLEM The one sample t-test compares two values for the population mean of a single variable. The two-sample test of a population.
AP Statistics – Chapter 9 Test Review
Confidence Interval and Hypothesis Testing for:
Comparing Two Population Means The Two-Sample T-Test and T-Interval.
MARE 250 Dr. Jason Turner Analysis of Variance (ANOVA)
Hyp Test II: 1 Hypothesis Testing: Additional Applications In this lesson we consider a series of examples that parallel the situations we discussed for.
MARE 250 Dr. Jason Turner Hypothesis Testing II. To ASSUME is to make an… Four assumptions for t-test hypothesis testing:
MARE 250 Dr. Jason Turner Analysis of Variance (ANOVA) II.
BCOR 1020 Business Statistics
Analysis of Variance (ANOVA) MARE 250 Dr. Jason Turner.
MARE 250 Dr. Jason Turner Hypothesis Testing III.
MARE 250 Dr. Jason Turner Hypothesis Testing. This is not a Test… Hypothesis testing – used for making decisions or judgments Hypothesis – a statement.
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Overview of Lecture Independent and Dependent Variables Between and Within Designs.
Lecture 12 One-way Analysis of Variance (Chapter 15.2)
Independent Sample T-test Often used with experimental designs N subjects are randomly assigned to two groups (Control * Treatment). After treatment, the.
Chapter 2 Simple Comparative Experiments
Chapter 11: Inference for Distributions
Copyright © 2010 Pearson Education, Inc. Chapter 24 Comparing Means.
Chapter 9 Hypothesis Testing.
Independent Sample T-test Classical design used in psychology/medicine N subjects are randomly assigned to two groups (Control * Treatment). After treatment,
Hypothesis Testing MARE 250 Dr. Jason Turner.
5-3 Inference on the Means of Two Populations, Variances Unknown
Week 9 October Four Mini-Lectures QMM 510 Fall 2014.
Inferential Statistics
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Tests of Hypotheses Based on a Single Sample.
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
Statistical hypothesis testing – Inferential statistics I.
AM Recitation 2/10/11.
Statistics 11 Hypothesis Testing Discover the relationships that exist between events/things Accomplished by: Asking questions Getting answers In accord.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
+ Chapter 9 Summary. + Section 9.1 Significance Tests: The Basics After this section, you should be able to… STATE correct hypotheses for a significance.
Comparing Two Population Means
1 CSI5388: Functional Elements of Statistics for Machine Learning Part I.
Week 111 Power of the t-test - Example In a metropolitan area, the concentration of cadmium (Cd) in leaf lettuce was measured in 7 representative gardens.
MARE 250 Dr. Jason Turner Hypothesis Testing III.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 24 Comparing Means.
Chapter 9 Power. Decisions A null hypothesis significance test tells us the probability of obtaining our results when the null hypothesis is true p(Results|H.
Means Tests MARE 250 Dr. Jason Turner. Type of stats test called a means test Tests for differences in samples based upon their average (mean) and standard.
Hypothesis Testing A procedure for determining which of two (or more) mutually exclusive statements is more likely true We classify hypothesis tests in.
Inference and Inferential Statistics Methods of Educational Research EDU 660.
Chapter 9 Three Tests of Significance Winston Jackson and Norine Verberg Methods: Doing Social Research, 4e.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
STA 2023 Module 11 Inferences for Two Population Means.
AP Statistics Chapter 24 Comparing Means.
Week111 The t distribution Suppose that a SRS of size n is drawn from a N(μ, σ) population. Then the one sample t statistic has a t distribution with n.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
© Copyright McGraw-Hill 2004
T Test for Two Independent Samples. t test for two independent samples Basic Assumptions Independent samples are not paired with other observations Null.
MARE 250 Dr. Jason Turner Analysis of Variance (ANOVA)
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Handout Six: Sample Size, Effect Size, Power, and Assumptions of ANOVA EPSE 592 Experimental Designs and Analysis in Educational Research Instructor: Dr.
Comparing Means Chapter 24. Plot the Data The natural display for comparing two groups is boxplots of the data for the two groups, placed side-by-side.
T tests comparing two means t tests comparing two means.
Hypothesis Tests u Structure of hypothesis tests 1. choose the appropriate test »based on: data characteristics, study objectives »parametric or nonparametric.
Hypothesis Tests. An Hypothesis is a guess about a situation that can be tested, and the test outcome can be either true or false. –The Null Hypothesis.
MARE 250 Dr. Jason Turner Analysis of Variance (ANOVA)
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
When the means of two groups are to be compared (where each group consists of subjects that are not related) then the excel two-sample t-test procedure.
Lecture Notes and Electronic Presentations, © 2013 Dr. Kelly Significance and Sample Size Refresher Harrison W. Kelly III, Ph.D. Lecture # 3.
Two-Sample Hypothesis Testing
CONCEPTS OF HYPOTHESIS TESTING
Hypothesis tests for the difference between two means: Independent samples Section 11.1.
Chapter 9 Hypothesis Testing.
Defining the null and alternative hypotheses
Presentation transcript:

MARE 250 Dr. Jason Turner Hypothesis Testing II

To ASSUME is to make an… Four assumptions for t-test hypothesis testing: 1. Random Samples 2. Independent Samples 3. Normal Populations (or large samples) 4. Variances (std. dev.) are equal

When do I do the what now? “Well, whenever I'm confused, I just check my underwear. It holds the answer to all the important questions.” – Grandpa Simpson If all 4 assumptions are met: Conduct a pooled t-test - you can “pool” the samples because the variances are assumed to be equal If the samples are not independent: Conduct a paired t-test If the variances (std. dev.) are not equal: Conduct a non-pooled t-test If the data is not normal or has small sample size: Conduct a non-parametric t-test (Mann-Whitney)

When to pool, when to not-pool “"We have a pool and a pond…The pond would be good for you.” – Ty Webb Both tests are run by Minitab as “2-sample t-test” For pooled test check box – “Assume Equal Variances” For non-pooled, do not check box

Assessing Equal Variances… Equality of variance can checked by performing an F-test Often not recommended: Although pooled t-test is moderately robust to unequal variances, F test is extremely non-robust to such inequalities Pooled t-test will allow you to run an accurate test with some degree of unequal variance F-test is much more specific than pooled-t

Who did the What Now…

Assessing Equal Variances… F-test and Levene’s used to judge the equality of variances. In both tests, the null hypothesis (Ho) is that the population variances under consideration (or equivalently, the population standard deviations) are equal, and the alternative hypothesis (Ha) is that the two variances are not equal. The choice of test depends on distribution properties

What the F…? Use the F-test when the data come from a normal distribution - is not robust to departures from normality Use Levene's test when the data come from continuous, but not necessarily normal, distributions is less sensitive than the F-test, so use the F-test when your data are normal or nearly normal

When the F…? MINITAB calculates and displays a test statistic and p-value for both the F-test and Levene's test Ho: σ1 = σ2 2 population variances equal Ha: σ1 ≠ σ2 2 variances are not equal High p-values (above α-level) Fail to Reject Null - indicate no statistically significant difference between the variances (equality or homogeneity of variances) Low p-values (below α-level) Reject Null - indicate a difference between the variances (inequality of variances)

How the F…? STAT – Basic Statistics – 2-Variances Enter columns of data as before Under “Options” can modify α-level of test (but why would you do that) Note that by default, MINITAB gives you the results of both the F-test and Levene’s Must decide a priori which test you plan to utilize

Significance Level The probability of making a TYPE I Error (rejection of a true null hypothesis) is called the significance level (α) of a hypothesis test TYPE II Error Probability (β) – nonrejection of a false null hypothesis For a fixed sample size, the smaller we specify the significance level (α), the larger will be the probability (β), of not rejecting a false hypothesis

I have the POWER!!! The power of a hypothesis test is the probability of not making a TYPE II error (rejecting a false null hypothesis) t evidence to support the alternative hypothesis POWER = 1 - β Produce a power curve

We need more POWER!!! For a fixed significance level, increasing the sample size increases the power Therefore, you can run a test to determine if your sample size HAS THE POWER!!! By using a sufficiently large sample size, we can obtain a hypothesis test with as much power as we want

Power - the probability of being able to detect an effect of a given size Sample size - the number of observations in each sample Difference (effect) - the difference between μ for one population and μ for the other

Increasing the power of the test There are four factors that can increase the power of a two-sample t-test: 1.Larger effect size (difference) - The greater the real difference between m for the two populations, the more likely it is that the sample means will also be different. 2.Higher α-level (the level of significance) - If you choose a higher value for α, you increase the probability of rejecting the null hypothesis, and thus the power of the test. (However, you also increase your chance of type I error.) 3. Less variability - When the standard deviation is smaller, smaller differences can be detected. 4. Larger sample sizes - The more observations there are in your samples, the more confident you can be that the sample means represent m for the two populations. Thus, the test will be more sensitive to smaller differences.

Increasing the power of the test The most practical way to increase power is often to increase the sample size However, you can also try to decrease the standard deviation by making improvements in your process or measurement

Sample size Increasing the size of your samples increases the power of your test You want enough observations in your samples to achieve adequate power, but not so many that you waste time and money on unnecessary sampling If you provide the power that you want the test to have and the difference you want it to be able to detect, MINITAB will calculate how large your samples must be

When to pair, when to not-pair “All I got's two fives!” - Jean LaRose Test is run by Minitab directly as “paired t-test” Used when there is a natural pairing of the members of two populations Each pair consists of a member from one population and that members corresponding member in the other population Use difference between the two sample means

When to pair, when to not-pair “All I got's two fives!” - Jean LaRose Paired t-test assumptions: 1. Random Sample 2. Paired difference normally distributed; large n 3. Outliers can confound results Tests whether the difference in the pairs is significantly different from zero

Paired Test - Example For Example… If you are testing the effects of some experimental treatment upon a population e.g. – effect of new diet upon a single sample of fish However… Paired test must have equal sample sizes

When to parametric… Nonparametric procedures Statistical procedures that require very few assumptions about the underlying population. They are often used when the data are not from a normal population.

Non-Parametric Non-parametric t-test (Mann-Whitney): 1. Random Sample 2. Do not require normally distributed data 3. Outliers do not confound results Tests whether the difference in the pairs is significantly different from zero Non-parametric test are used heavily in some disciplines – although not typically in the natural sciences – often the “last resort” when data is not collected correctly, low “power”

Nonparametric tests: Less powerful than parametric tests. Thus, you are less likely to reject the null hypothesis when it is false. Often require you to modify the hypotheses. For example, most nonparametric tests concerning the population center are tests about the median rather than the mean. The test does not answer the same question as the corresponding parametric procedure. When a choice exists and you are reasonably certain that the assumptions for the parametric procedure are satisfied, then use the parametric procedure. Drawbacks of Nonparametric Tests