MARE 250 Dr. Jason Turner Hypothesis Testing III.

Slides:



Advertisements
Similar presentations
Statistical Techniques I
Advertisements

Hypothesis Testing Steps in Hypothesis Testing:
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 16 l Nonparametrics: Testing with Ordinal Data or Nonnormal Distributions.
PSY 307 – Statistics for the Behavioral Sciences Chapter 20 – Tests for Ranked Data, Choosing Statistical Tests.
Hypothesis Testing IV Chi Square.
S519: Evaluation of Information Systems
Université d’Ottawa / University of Ottawa 2001 Bio 4118 Applied Biostatistics L10.1 CorrelationCorrelation The underlying principle of correlation analysis.
Chapter Seventeen HYPOTHESIS TESTING
PSY 307 – Statistics for the Behavioral Sciences
MARE 250 Dr. Jason Turner Hypothesis Testing II To ASSUME is to make an… Four assumptions for t-test hypothesis testing: 1. Random Samples 2. Independent.
MARE 250 Dr. Jason Turner Hypothesis Testing II. To ASSUME is to make an… Four assumptions for t-test hypothesis testing:
Sample size computations Petter Mostad
Topic 2: Statistical Concepts and Market Returns
MARE 250 Dr. Jason Turner Hypothesis Testing. This is not a Test… Hypothesis testing – used for making decisions or judgments Hypothesis – a statement.
Sample Size Determination In the Context of Hypothesis Testing
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE © 2012 The McGraw-Hill Companies, Inc.
Chapter 9 Hypothesis Testing.
Chapter 9 Hypothesis Testing II. Chapter Outline  Introduction  Hypothesis Testing with Sample Means (Large Samples)  Hypothesis Testing with Sample.
PSY 307 – Statistics for the Behavioral Sciences Chapter 19 – Chi-Square Test for Qualitative Data Chapter 21 – Deciding Which Test to Use.
Today Concepts underlying inferential statistics
Independent Sample T-test Classical design used in psychology/medicine N subjects are randomly assigned to two groups (Control * Treatment). After treatment,
Hypothesis Testing MARE 250 Dr. Jason Turner.
Hypothesis Testing: Two Sample Test for Means and Proportions
Chapter 9 Hypothesis Testing II. Chapter Outline  Introduction  Hypothesis Testing with Sample Means (Large Samples)  Hypothesis Testing with Sample.
Transforming the data Modified from: Gotelli and Allison Chapter 8; Sokal and Rohlf 2000 Chapter 13.
Inferential Statistics
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Hypothesis Testing and T-Tests. Hypothesis Tests Related to Differences Copyright © 2009 Pearson Education, Inc. Chapter Tests of Differences One.
Power and Sample Size IF IF the null hypothesis H 0 : μ = μ 0 is true, then we should expect a random sample mean to lie in its “acceptance region” with.
Choosing Statistical Procedures
AM Recitation 2/10/11.
Overview of Statistical Hypothesis Testing: The z-Test
Week 9 Chapter 9 - Hypothesis Testing II: The Two-Sample Case.
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
+ Chapter 9 Summary. + Section 9.1 Significance Tests: The Basics After this section, you should be able to… STATE correct hypotheses for a significance.
Comparing Two Population Means
RMTD 404 Lecture 8. 2 Power Recall what you learned about statistical errors in Chapter 4: Type I Error: Finding a difference when there is no true difference.
1 CSI5388: Functional Elements of Statistics for Machine Learning Part I.
Chapter 9 Hypothesis Testing II: two samples Test of significance for sample means (large samples) The difference between “statistical significance” and.
Introduction to Hypothesis Testing: One Population Value Chapter 8 Handout.
MARE 250 Dr. Jason Turner Hypothesis Testing III.
Learning Objectives In this chapter you will learn about the t-test and its distribution t-test for related samples t-test for independent samples hypothesis.
Means Tests MARE 250 Dr. Jason Turner. Type of stats test called a means test Tests for differences in samples based upon their average (mean) and standard.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
FOUNDATIONS OF NURSING RESEARCH Sixth Edition CHAPTER Copyright ©2012 by Pearson Education, Inc. All rights reserved. Foundations of Nursing Research,
Inference and Inferential Statistics Methods of Educational Research EDU 660.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Chapter 13 - ANOVA. ANOVA Be able to explain in general terms and using an example what a one-way ANOVA is (370). Know the purpose of the one-way ANOVA.
Experimental Design and Statistics. Scientific Method
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Introducing Communication Research 2e © 2014 SAGE Publications Chapter Seven Generalizing From Research Results: Inferential Statistics.
© Copyright McGraw-Hill 2004
T Test for Two Independent Samples. t test for two independent samples Basic Assumptions Independent samples are not paired with other observations Null.
MARE 250 Dr. Jason Turner Analysis of Variance (ANOVA)
Applied Quantitative Analysis and Practices LECTURE#14 By Dr. Osman Sadiq Paracha.
Chapter 13 Understanding research results: statistical inference.
Homogeneity of Variance Pooling the variances doesn’t make sense when we cannot assume all of the sample Variances are estimating the same value. For two.
Hypothesis Tests u Structure of hypothesis tests 1. choose the appropriate test »based on: data characteristics, study objectives »parametric or nonparametric.
Hypothesis Tests. An Hypothesis is a guess about a situation that can be tested, and the test outcome can be either true or false. –The Null Hypothesis.
MARE 250 Dr. Jason Turner Analysis of Variance (ANOVA)
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Nonparametric Statistics Overview. Objectives Understand Difference between Parametric and Nonparametric Statistical Procedures Nonparametric methods.
McGraw-Hill/Irwin © 2003 The McGraw-Hill Companies, Inc.,All Rights Reserved. Part Four ANALYSIS AND PRESENTATION OF DATA.
Chapter 9 Introduction to the t Statistic
CHAPTER 9 Testing a Claim
Part Four ANALYSIS AND PRESENTATION OF DATA
Statistical Quality Control, 7th Edition by Douglas C. Montgomery.
Chapter 9 Hypothesis Testing.
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE
Presentation transcript:

MARE 250 Dr. Jason Turner Hypothesis Testing III

To ASSUME is to make an… Four assumptions for t-test hypothesis testing: 1. Random Samples 2. Independent Samples 3. Normal Populations (or large samples) 4. Variances (std. dev.) are equal

When do I do the what now? If all 4 assumptions are met: Conduct a pooled t-test - you can “pool” the samples because the variances are assumed to be equal If the samples are not independent: Conduct a paired t-test If the variances (std. dev.) are not equal: Conduct a non-pooled t-test If the data is not normal or has small sample size: Conduct a non-parametric t-test (Mann-Whitney)

Significance Level The probability of making a TYPE I Error (rejection of a true null hypothesis) is called the significance level (α) of a hypothesis test TYPE II Error Probability (β) – nonrejection of a false null hypothesis For a fixed sample size, the smaller we specify the significance level (α), the larger will be the probability (β), of not rejecting a false hypothesis

If H:0 is true If H:0 is false If H:0 is rejected TYPE I ERROR No Error If H:0 is not rejected No ErrorTYPE II ERROR

I have the POWER!!! The power of a hypothesis test is the probability of not making a TYPE II error (rejecting a false null hypothesis) t evidence to support the alternative hypothesis POWER = 1 - β Produce a power curve

We need more POWER!!! For a fixed significance level, increasing the sample size increases the power Therefore, you can run a test to determine if your sample size HAS THE POWER!!! By using a sufficiently large sample size, we can obtain a hypothesis test with as much power as we want

Power - the probability of being able to detect an effect of a given size Sample size - the number of observations in each sample Difference (effect) - the difference between μ for one population and μ for the other

Increasing the power of the test There are four factors that can increase the power of a two-sample t-test: 1.Larger effect size (difference) - The greater the real difference between m for the two populations, the more likely it is that the sample means will also be different. 2.Higher α-level (the level of significance) - If you choose a higher value for α, you increase the probability of rejecting the null hypothesis, and thus the power of the test. (However, you also increase your chance of type I error.) 3. Less variability - When the standard deviation is smaller, smaller differences can be detected. 4. Larger sample sizes - The more observations there are in your samples, the more confident you can be that the sample means represent m for the two populations. Thus, the test will be more sensitive to smaller differences.

Increasing the power of the test The most practical way to increase power is often to increase the sample size However, you can also try to decrease the standard deviation by making improvements in your process or measurement

Sample size Increasing the size of your samples increases the power of your test You want enough observations in your samples to achieve adequate power, but not so many that you waste time and money on unnecessary sampling If you provide the power that you want the test to have and the difference you want it to be able to detect, MINITAB will calculate how large your samples must be

MARE 250 Dr. Jason Turner Data Transformations

One advantage of using parametric statistics is that it makes it much easier to describe your data If you have established that it follows a normal distribution you can be sure that a particular set of measurements can be properly described by its mean and standard deviation If your data are not normally distributed you cannot use any of the tests that assume that it is (e.g. ANOVA, t test, regression analysis) Data Transformations

If your data are not normally distributed it is often possible to normalize it by transforming it. Transforming data to allow you to use parametric statistics is completely legitimate People often feel uncomfortable when they transform data because it seems like it artificially improves their results but this is only because they feel comfortable with linear or arithmetic scales However, there is no reason for not using other scales (e.g. logarithms, square roots, reciprocals or angles) where appropriate (See Chapter 13) Data Transformations

Different transformations work for different data types: Logarithms : Growth rates are often exponential and log transforms will often normalize them. Log transforms are particularly appropriate if the variance increases with the mean. Reciprocal : If a log transform does not normalize your data you could try a reciprocal (1/x) transformation. This is often used for enzyme reaction rate data. Square root : This transform is often of value when the data are counts, e.g. # urchins, # Honu. Carrying out a square root transform will convert data with a Poisson distribution to a normal distribution. Arcsine : This transformation is also known as the angular transformation and is especially useful for percentages and proportions Data Transformations

Which Transformation? Johnson Transformation is useful when the collected data are non-normal, but you want to apply a methodology that requires a normal distribution MINITAB selects one optimal distribution function from Johnson distribution system to transform the data to a normal distribution

Johnson Transformation

Determining the optimal transformation Johnson Transformation uses the following algorithm to determine an optimal transformation from the three families of distribution: SB, SL, and SU, where B, L, and U refer to the variable being bounded, lognormal, and unbounded Bounded - something is of finite size, and that this is the case if it is smaller than some other object that has a finite size. (Otherwise it is unbounded.) Johnson Transformation

How To? STAT – Quality Tools – Johnson Transformation Enter what variable to be transformed, and what the “new” transformed variable will be called Places transformed data into a new column in your MINITAB datasheet – can copy this into Excel and save FOREVER…