Bootstrapping and Randomization Techniques Q560: Experimental Methods in Cognitive Science Lecture 15.

Slides:



Advertisements
Similar presentations
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Advertisements

Psych 5500/6500 The Sampling Distribution of the Mean Fall, 2008.
Review of the Basic Logic of NHST Significance tests are used to accept or reject the null hypothesis. This is done by studying the sampling distribution.
 What is chi-square  CHIDIST  Non-parameteric statistics 2.
Lecture 10 Non Parametric Testing STAT 3120 Statistical Methods I.
Resampling techniques Why resampling? Jacknife Cross-validation Bootstrap Examples of application of bootstrap.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 9: Hypothesis Tests for Means: One Sample.
Lecture 9: One Way ANOVA Between Subjects
One-way Between Groups Analysis of Variance
Lecture 9 Today: –Log transformation: interpretation for population inference (3.5) –Rank sum test (4.2) –Wilcoxon signed-rank test (4.4.2) Thursday: –Welch’s.
Independent Sample T-test Often used with experimental designs N subjects are randomly assigned to two groups (Control * Treatment). After treatment, the.
Today Concepts underlying inferential statistics
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 14: Non-parametric tests Marshall University Genomics.
Nonparametric and Resampling Statistics. Wilcoxon Rank-Sum Test To compare two independent samples Null is that the two populations are identical The.
Choosing Statistical Procedures
+ Quantitative Statistics: Chi-Square ScWk 242 – Session 7 Slides.
Analysis of Variance. ANOVA Probably the most popular analysis in psychology Why? Ease of implementation Allows for analysis of several groups at once.
Psy B07 Chapter 1Slide 1 ANALYSIS OF VARIANCE. Psy B07 Chapter 1Slide 2 t-test refresher  In chapter 7 we talked about analyses that could be conducted.
AM Recitation 2/10/11.
Statistics 11 Hypothesis Testing Discover the relationships that exist between events/things Accomplished by: Asking questions Getting answers In accord.
Overview of Statistical Hypothesis Testing: The z-Test
Jeopardy Hypothesis Testing T-test Basics T for Indep. Samples Z-scores Probability $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
Chapter 8 Introduction to Hypothesis Testing
Part IV Significantly Different: Using Inferential Statistics
Education 793 Class Notes T-tests 29 October 2003.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 23, Slide 1 Chapter 23 Comparing Means.
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 22 Using Inferential Statistics to Test Hypotheses.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Making decisions about distributions: Introduction to the Null Hypothesis 47:269: Research Methods I Dr. Leonard April 14, 2010.
Bootstrapping (And other statistical trickery). Reminder Of What We Do In Statistics Null Hypothesis Statistical Test Logic – Assume that the “no effect”
Inferential Statistics 2 Maarten Buis January 11, 2006.
Biostat 200 Lecture 7 1. Hypothesis tests so far T-test of one mean: Null hypothesis µ=µ 0 Test of one proportion: Null hypothesis p=p 0 Paired t-test:
9 Mar 2007 EMBnet Course – Introduction to Statistics for Biologists Nonparametric tests, Bootstrapping
Psychology 301 Chapters & Differences Between Two Means Introduction to Analysis of Variance Multiple Comparisons.
Fall 2002Biostat Nonparametric Tests Nonparametric tests are useful when normality or the CLT can not be used. Nonparametric tests base inference.
Hypothesis Testing A procedure for determining which of two (or more) mutually exclusive statements is more likely true We classify hypothesis tests in.
Inference and Inferential Statistics Methods of Educational Research EDU 660.
Chapter 6 USING PROBABILITY TO MAKE DECISIONS ABOUT DATA.
1 Psych 5500/6500 The t Test for a Single Group Mean (Part 1): Two-tail Tests & Confidence Intervals Fall, 2008.
1 Psych 5500/6500 Introduction to the F Statistic (Segue to ANOVA) Fall, 2008.
1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.
ANOVA Assumptions 1.Normality (sampling distribution of the mean) 2.Homogeneity of Variance 3.Independence of Observations - reason for random assignment.
Chapter 15 – Analysis of Variance Math 22 Introductory Statistics.
Experimental Design and Statistics. Scientific Method
Experimental Psychology PSY 433 Appendix B Statistics.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Three Frameworks for Statistical Analysis. Sample Design Forest, N=6 Field, N=4 Count ant nests per quadrat.
1 ANALYSIS OF VARIANCE (ANOVA) Heibatollah Baghi, and Mastee Badii.
Chapter Seventeen. Figure 17.1 Relationship of Hypothesis Testing Related to Differences to the Previous Chapter and the Marketing Research Process Focus.
Ch11: Comparing 2 Samples 11.1: INTRO: This chapter deals with analyzing continuous measurements. Later, some experimental design ideas will be introduced.
Single-Factor Studies KNNL – Chapter 16. Single-Factor Models Independent Variable can be qualitative or quantitative If Quantitative, we typically assume.
Biostatistics Nonparametric Statistics Class 8 March 14, 2000.
Handout Six: Sample Size, Effect Size, Power, and Assumptions of ANOVA EPSE 592 Experimental Designs and Analysis in Educational Research Instructor: Dr.
From Wikipedia: “Parametric statistics is a branch of statistics that assumes (that) data come from a type of probability distribution and makes inferences.
Hypothesis test flow chart
HYPOTHESIS TESTING FOR DIFFERENCES BETWEEN MEANS AND BETWEEN PROPORTIONS.
BIOL 582 Lecture Set 2 Inferential Statistics, Hypotheses, and Resampling.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Bootstrapping James G. Anderson, Ph.D. Purdue University.
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
 List the characteristics of the F distribution.  Conduct a test of hypothesis to determine whether the variances of two populations are equal.  Discuss.
15 Inferential Statistics.
Inferential Statistics
Some Nonparametric Methods
Doing t-tests by hand.
What are their purposes? What kinds?
Presentation transcript:

Bootstrapping and Randomization Techniques Q560: Experimental Methods in Cognitive Science Lecture 15

Parametric and Nonparametric Methods Most of the tests we have examined so far have been parametric tests. Parametric tests must make assumptions about the distribution of the population, and estimate parameters of a theoretical probability distribution from statistics Nonparametric tests (distribution free) make no parameter assumptions; parameters are built from data, not the model

Parametric and Nonparametric Methods I lied about a couple of things related to parameters… “Enumerate” An example with Fisher’s Lady Tasting Tea…

The Binomial Distribution An example graph:

Probability and Frequency Graphs Example: For the population of scores shown below, what is the probability in a random draw of obtaining a score greater than 4? p(X>4) =

The Normal Distribution Diagram:

Sample Means Let’s do an example from a very small population of 4 scores: X: 2, 4, 6, 8

Sample Means We construct a distribution of sample means for n=2. 1. Step: Write down all 16 possible samples.

Sample Means 2. Step: Draw the distribution of sample means.

General Approach Original t-test: how abnormal is our mean difference relative to all possible mean differences? We built a chance model based on CLT, assumption of normality let’s us estimate SEM This is a general technique if the population is normal, and allows us to look up critical values from a table But we can compute a custom chance model for our particular observed data. If the distributions are not normal, we have to do this.

We might want a nonparametric test if: We don’t meet the assumptions for a parametric test We want to do tests on medians They aren’t just tools to break out if assumptions fail.... Generally, npar tests: Do not make assumptions about population parameters We just want to know the likelihood of the sample data falling as they did if the treatments are equally effective (random assignment to conditions is important factor)

Resampling Techniques: Pros: More liberal (and realistic) in their assumptions Distribution free…validity of test is unaffected by whether or not population distribution is known Cons: Tend to be less powerful…require more observations than parametric test to detect a true effect Note that this is still being debated (see papers on website) Computationally intensive

Resampling Techniques: Rather than assuming imaginary populations, they focus on the data at hand Divide into estimation methods, or custom chance models (desc/inferential) 1.Bootstrapping and Jackknifing 2.Randomization (Permutation) Tests 3.Sign Tests 4.Monte Carlo Techniques

Bootstrapping Parameters Make sure you have a random sample Rather than assuming our population is normal, we assume it is distributed exactly as our sample Sample w/ replacement from your “pseudo” population Create N random samples from your sample, and determine sampling distribution Example: Dependent-t, independent-t

Randomization Tests Different philosophy: primary hypothesis is “exchangeability”...if null is true, the arrangement of Ss to conditions is random Focus is on random assignment Note that this is philosophically different from NSHT We don’t talk about parameters or populations, but the manipulation in the experiment Rather than “different populations had the same mean” we use “different treatments had the same effect” Compare the observed data to all permutations

Randomization + Independent Samples If the null is true, the groups differ only by random assignment Compute observed mean difference t* Randomly assign scores to groups again from the full sample and compute mean difference; do this a large number of times, e.g. 10,000 How unlikely is t* under this chance distribution of random assignments? (i.e., how many are more extreme?)

Randomization and Dependent Samples Make sure you have a random sample If the null is true, the difference scores should be zero; +/- signs that are not zero should be chance Under null, each score has a 50/50 chance of being +/- Compute actual mean of difference scores, D* Select scores from the sample and randomly assign them a +/- sign…do this a large number of times, and this is your chance model. How unlikely is D* under this chance model?

References (on our website): Mewhort, D. J. K. (2005). A comparison of the randomization test with the F test when error is skewed. Behavioral Research Methods, 37, · Moore, D. S., G. McCabe, W. Duckworth, and S. Sclove (2003): Bootstrap Methods and Permutation Tests · Hesterberg, T. C., D. S. Moore, S. Monaghan, A. Clipson, and R. Epstein (2005): Bootstrap Methods and Permutation Tests, software.

Examples: S and R both have nice built-in routines for computing resampling statistics; SPSS does not. Let’s do some examples using the program Resampling.exe (available on our website) Paired samples comparison Oneway ANOVA with bootstrapping Bootstrapping correlation sig test (Data files are on our Website under Lecture 15)