BIOL 582 Lecture Set 2 Inferential Statistics, Hypotheses, and Resampling.

Slides:



Advertisements
Similar presentations
Introduction to Hypothesis Testing
Advertisements

Our goal is to assess the evidence provided by the data in favor of some claim about the population. Section 6.2Tests of Significance.
Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Hypothesis Testing An introduction. Big picture Use a random sample to learn something about a larger population.
Chapter 12 Tests of Hypotheses Means 12.1 Tests of Hypotheses 12.2 Significance of Tests 12.3 Tests concerning Means 12.4 Tests concerning Means(unknown.
Inference Sampling distributions Hypothesis testing.
Our goal is to assess the evidence provided by the data in favor of some claim about the population. Section 6.2Tests of Significance.
Testing Hypotheses About Proportions Chapter 20. Hypotheses Hypotheses are working models that we adopt temporarily. Our starting hypothesis is called.
Chapter 10: Hypothesis Testing
Hypothesis testing Week 10 Lecture 2.
© 2010 Pearson Prentice Hall. All rights reserved Hypothesis Testing Basics.
Chapter Seventeen HYPOTHESIS TESTING
Section 7.1 Hypothesis Testing: Hypothesis: Null Hypothesis (H 0 ): Alternative Hypothesis (H 1 ): a statistical analysis used to decide which of two competing.
QM Spring 2002 Business Statistics Introduction to Inference: Hypothesis Testing.
1/55 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 10 Hypothesis Testing.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Basic Business Statistics.
Chapter 3 Hypothesis Testing. Curriculum Object Specified the problem based the form of hypothesis Student can arrange for hypothesis step Analyze a problem.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 7 th Edition Chapter 9 Hypothesis Testing: Single.
Chapter 9 Hypothesis Testing.
PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 6 Chicago School of Professional Psychology.
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
Statistical hypothesis testing – Inferential statistics I.
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
AM Recitation 2/10/11.
Statistics 11 Hypothesis Testing Discover the relationships that exist between events/things Accomplished by: Asking questions Getting answers In accord.
Testing Hypotheses I Lesson 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics n Inferential Statistics.
Chapter 10 Hypothesis Testing
Confidence Intervals and Hypothesis Testing - II
Descriptive statistics Inferential statistics
Jeopardy Hypothesis Testing T-test Basics T for Indep. Samples Z-scores Probability $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
Introduction to Biostatistics and Bioinformatics
Fundamentals of Hypothesis Testing: One-Sample Tests
Chapter 8 Introduction to Hypothesis Testing
Statistical Analysis Statistical Analysis
Testing Hypotheses Tuesday, October 28. Objectives: Understand the logic of hypothesis testing and following related concepts Sidedness of a test (left-,
1 Today Null and alternative hypotheses 1- and 2-tailed tests Regions of rejection Sampling distributions The Central Limit Theorem Standard errors z-tests.
1 Power and Sample Size in Testing One Mean. 2 Type I & Type II Error Type I Error: reject the null hypothesis when it is true. The probability of a Type.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 22 Using Inferential Statistics to Test Hypotheses.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Chapter 15 Data Analysis: Testing for Significant Differences.
Chapter 8 Introduction to Hypothesis Testing
Introduction to Hypothesis Testing: One Population Value Chapter 8 Handout.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc. Chapter Hypothesis Tests Regarding a Parameter 10.
Chapter 20 Testing hypotheses about proportions
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
Lecture 16 Section 8.1 Objectives: Testing Statistical Hypotheses − Stating hypotheses statements − Type I and II errors − Conducting a hypothesis test.
Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Statistical Inference An introduction. Big picture Use a random sample to learn something about a larger population.
Chapter 9 Hypothesis Testing 9.1 The Language of Hypothesis Testing.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
Chapter 9: Testing Hypotheses Overview Research and null hypotheses One and two-tailed tests Type I and II Errors Testing the difference between two means.
© Copyright McGraw-Hill 2004
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Education 793 Class Notes Inference and Hypothesis Testing Using the Normal Distribution 8 October 2003.
Chapter 13 Understanding research results: statistical inference.
Chapter 12 Tests of Hypotheses Means 12.1 Tests of Hypotheses 12.2 Significance of Tests 12.3 Tests concerning Means 12.4 Tests concerning Means(unknown.
Hypothesis Tests u Structure of hypothesis tests 1. choose the appropriate test »based on: data characteristics, study objectives »parametric or nonparametric.
Today: Hypothesis testing p-value Example: Paul the Octopus In 2008, Paul the Octopus predicted 8 World Cup games, and predicted them all correctly Is.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
© 2010 Pearson Prentice Hall. All rights reserved STA220: Formulating Hypotheses and Setting Up the Rejection Region.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Hypothesis Tests Regarding a Parameter
Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.
Hypothesis Testing: Hypotheses
Presentation transcript:

BIOL 582 Lecture Set 2 Inferential Statistics, Hypotheses, and Resampling

Statistics we have already learned (e.g., mean, variance, standard deviation) are descriptive statistics. Statistics that are tests for specific patterns in data using independent variables to generate hypotheses (Y vs. X) are inferential statistics (e.g., t-test, ANOVA, likelihood ratio tests). Procedure: Obtain an observed condition Compare observed condition to distribution of expected values (or their likelihood) Determine whether observed value is something interesting (i.e., it is unexpected relative to chance?) BIOL 582 Inferential Statistics

Inferential statistics compare observed value to distribution of ‘expected’ values Significance determined based upon how ‘extreme’ the observed value is relative to the distribution Resampling methods are one approach for generating expected distributions Observed value probability BIOL 582 Inferential Statistics Side Trip to learn about probability…

BIOL 582 Variant of the Fisher Exact Test Sir Ronald Aylmer Fisher (1890 –1962) Fisher’s exact test (named after Fisher for work he originally published in 1922, but also published in other ways after Is this meaningful?

BIOL 582 Variant of the Fisher Exact Test Sir Ronald Aylmer Fisher (1890 –1962) Fisher’s exact test (named after Fisher for work he originally published in 1922, but also published in other ways after Consider all possible ways to get 2 groups of 4 with these data m permutations; n values; g number of groups 28 possible ways to get 2 groups of 4!

BIOL 582 Variant of the Fisher Exact Test Sir Ronald Aylmer Fisher (1890 –1962) Fisher’s exact test (named after Fisher for work he originally published in 1922, but also published in other ways after Observed Random Do this for all 28 possible permutations and create a distribution of differences

2 ways to get 2.5 BIOL 582 Variant of the Fisher Exact Test Sir Ronald Aylmer Fisher (1890 –1962) Fisher’s exact test (named after Fisher for work he originally published in 1922, but also published in other ways after 28 possible absolute differences in averages between two groups 2/28 = 0.07 = probability of getting 2.5 from randomly sampling two groups from the same population

We can use resampling methods instead! Methods that take many samples from original data set in some specified way and evaluate the significance of the original based on these samples Approaches are nonparametric, because they generate a distribution from the data Are very flexible, and can allow for complicated designs Major variants: randomization, bootstrap, jackknife, Monte Carlo Enumerating all possible permutations becomes challenging for large data sets, several groups, and uneven group sizes. Table to the right shows the number of permutations needed for two equal sized groups BIOL 582 Resampling

Fisher’s exact test: a total enumeration of possible pairings of data (designed for contingency tables) Randomization can be used as a proxy (but for most any test statistic) Protocol Calculate observed statistic (e.g., ): E obs Reorder data set (i.e. randomly shuffle data) and recalculate statistic E rand Repeat for all possible combinations and generate distribution of possible statistics Percentage of E rand more extreme than E obs is significance level Note: E obs is treated as an iteration BIOL 582 Resampling E obs E rand E obs

Although it may not be feasible to consider all possible permutations, empirical and theoretical statistical research has shown that using MANY (e.g., > 1,000) random permutations works well. More specifically, the mean value obtained from the distribution of random values generally converges on the expected value for the NULL hypothesis. BIOL 582 Resampling E obs E rand E obs

What is a null hypothesis? In general, it’s the hypothesis that states “no relationship”, or “no difference”, which can be tested with evidence (data). If results from a hypothesis test support an alternative hypothesis (i.e., there is a relationship, or there is a difference), then the null hypothesis is rejected. The expected value for a null hypothesis states something about the population from which data are collected. It usually is presented as a difference equal to 0. BIOL 582 Null Hypotheses

Example: Imagine collecting a sample of 10 treefrogs from the Green River Biopreserve. The snout-to-vent length (SVL) is measured for each, and the sample mean is calculated. The sample mean is an estimate of the unknown, but existent, population mean. Go out and repeat the procedure. What is the expected difference in sample means between the two samples? BIOL 582 Null Hypotheses Answer: 0 Why: If the sample mean is an estimate of the population mean, and it’s accurate, then the difference in sample means should be the difference between the population mean and itself, which is 0!

BIOL 582 Null Hypotheses But there has to be some sort of expected error in sampling! A B Two samples collected from two populations Null Hypothesis (H 0 ): µ A = µ B ; or µ A - µ B = 0 H 0 implies that if the two population means are not different, sampling from both is tantamount to sampling from one. Important point: if two samples have parameter estimates that “behave” as if from one population, then randomly shuffling values between samples should indicate that the observed difference is not uncommon. If the sample parameter estimates are quite different, then the observed case should be rare compared to randomly shuffling values between samples. The best way to demonstrate this is with a physical example. Go to R

BIOL 582 The Language of Hypothesis Testing (1 – α)% Confidence intervals α is the accepted type I error rate (more later) E.g., a 95% Confidence interval is the distribution of values between 2.5 and 97.5 percentiles Imagine two confidence intervals from different populations (or samples): 25 +/ /- 5 Are the means different? Notice that the confidence intervals of the means overlap Question: do the populations have different means?

BIOL 582 The Language of Hypothesis Testing Hypothesis testing: Steps in Hypothesis Testing 1.A claim is made. 2.Evidence (sample data) is collected in order to test the claim. 3.The data are analyzed in order to support or refute the claim. Hypothesis testing is a procedure, based on sample evidence and probability, used to test claims regarding a characteristic of one or more populations. The null hypothesis, denoted H 0, is a statement to be tested. The null hypothesis is assumed true until evidence indicates otherwise. It is a statement regarding the value of a population parameter. The alternative hypothesis, denoted H a, is a claim to be tested. We are trying to find evidence for the alternative hypothesis. It is a claim about a population parameter.

BIOL 582 The Language of Hypothesis Testing Hypothesis testing: There are several ways to test a null hypothesis: 1. Equal hypothesis versus not equal hypothesis (two-sided test) H 0 : parameter = some value H a : parameter  some value 2. Equal hypothesis versus less than (one-sided test) H 0 : parameter = some value H a : parameter < some value 3. Equal hypothesis versus greater than (one-sided test) H 0 : parameter = some value H a : parameter > some value

BIOL 582 The Language of Hypothesis Testing Hypothesis testing: When we test a hypothesis, there are four possible outcomes: Four Outcomes from Hypothesis Testing 1.We reject the null when in fact the alternative is true. This decision would be correct. 2.We fail to reject the null when in fact the null is true. This decision would be correct. 3.We reject the null when in fact the null is true. This decision would be incorrect. This type of error is called a Type I error. 4.We fail to reject the null when in fact the alternative is true. This decision would be incorrect. This type of error is called a Type II error. Reality Conclusion H 0 is True H 0 is False Do not reject H 0 Correct ConclusionType II Error Reject H 0 Type I ErrorCorrect Conclusion

BIOL 582 The Language of Hypothesis Testing Hypothesis testing: Think about the Judicial Process: A person is innocent until proven guilty…. Thus, H 0 : defendant is innocent H a : defendant is guilty We have four possible outcomes: 1.The defendant is innocent and found to be innocent (i.e., null is true and the hypothesis test did not reject the null hypothesis). 2.The defendant is guilty and found to be guilty (i.e., null is false and the hypothesis test rejects the null hypothesis). 3.The defendant is really innocent but found to be guilty and sent to jail (Type I error). 4.The defendant is really guilty but found to be innocent and set free (Type II error).

BIOL 582 The Language of Hypothesis Testing Hypothesis testing: We may want to know the probability of making a Type I or Type II error.  : The probability of making a Type I error (called the level of significance)*  : The probability of making a Type II error. *  can be associated with empirical percentiles or associated with the tails of a theoretical distributions, such as standard normal or t -distribution. In both cases, it represents the probability of finding a value lower or higher than the confidence limits we describe (e.g.,  = 0.10 means we assume a 10% probability of finding a value outside of our confidence limits). In hypothesis testing,  is the probability of finding a value outside a range that is associated with the null hypothesis.

BIOL 582 The Language of Hypothesis Testing What kind of hypothesis tests can we learn? … we could systematically go through a bunch and learn them well, and learn in what situations to use them…. BORING! We will instead learn methods important to us (like linear models) and consider how to test hypotheses that are important

BIOL 582 Testing a Hypothesis Regardless, the following steps are useful to remember in hypothesis tests 1.State the null and alternative hypotheses. 2.Select the level of significance, . 3.Calculate the test statistic (e.g., Z value, t statistic, SS, means contrast, Λ 2 ) 4.Compare either 1.The critical value of the test statistic with the observed test statistic. 2.The observed statistic to a distribution of random statistics 5.State the conclusion. If the probability of finding a random statistic that exceeds the value of a test statistic associated with , is lower than , one rejects the null hypothesis In other words, if a chance outcome is rarer than our comfort level, we believe the null hypothesis is falsified.

BIOL 582 Testing a Hypothesis Now, let’s play with some data and see how this works. As we do, either on a notepad, or a document on your computer screen, always provide the following information for any test: H 0 : H a : α: Reject H 0 :? Conclusion: