THE INTRODUCTORY STATISTICS COURSE: A SABER TOOTH CURRICULUM? George W. Cobb Mount Holyoke College USCOTS Columbus, OH 5/20/05.

Slides:



Advertisements
Similar presentations
Using Randomization Methods to Build Conceptual Understanding in Statistical Inference: Day 1 Lock, Lock, Lock, Lock, and Lock MAA Minicourse – Joint Mathematics.
Advertisements

Chapter 8: Estimating with Confidence
Sampling Distributions and Sample Proportions
Review of the Basic Logic of NHST Significance tests are used to accept or reject the null hypothesis. This is done by studying the sampling distribution.
Models and Modeling in Introductory Statistics Robin H. Lock Burry Professor of Statistics St. Lawrence University 2012 Joint Statistics Meetings San Diego,
A Fiddler on the Roof: Tradition vs. Modern Methods in Teaching Inference Patti Frazer Lock Robin H. Lock St. Lawrence University Joint Mathematics Meetings.
Review: What influences confidence intervals?
Business 205. Review Sampling Continuous Random Variables Central Limit Theorem Z-test.
Statistical Tests How to tell if something (or somethings) is different from something else.
1 Trust and divorce Separated or Divorced trust | No Yes | Total Low | | 247 | |
Hypothesis Testing for Population Means and Proportions
Using Simulation Methods to Introduce Statistical Inference Patti Frazer Lock Kari Lock Morgan Cummings Professor of Mathematics Assistant Professor of.
Building Conceptual Understanding of Statistical Inference with Lock 5 Dr. Kari Lock Morgan Department of Statistical Science Duke University Wake Forest.
Bootstrapping: Let Your Data Be Your Guide Robin H. Lock Burry Professor of Statistics St. Lawrence University MAA Seaway Section Meeting Hamilton College,
Bootstrapping applied to t-tests
Using Bootstrap Intervals and Randomization Tests to Enhance Conceptual Understanding in Introductory Statistics Kari Lock Morgan Department of Statistical.
Standard Error of the Mean
Sampling error Error that occurs in data due to the errors inherent in sampling from a population –Population: the group of interest (e.g., all students.
Introducing Inference with Bootstrap and Randomization Procedures Dennis Lock Statistics Education Meeting October 30,
Simulation and Resampling Methods in Introductory Statistics Michael Sullivan Joliet Junior College
1. An Overview of the Data Analysis and Probability Standard for School Mathematics? 2.
The Bootstrap Michael Sullivan Joliet Junior College
Using Simulation Methods to Introduce Inference Kari Lock Morgan Duke University In collaboration with Robin Lock, Patti Frazer Lock, Eric Lock, Dennis.
Section #4 October 30 th Old: Review the Midterm & old concepts 1.New: Case II t-Tests (Chapter 11)
CHAPTER 11: Sampling Distributions
Introduction Parameters are numerical descriptive measures for populations. For the normal distribution, the location and shape are described by  and.
Inference about Two Population Standard Deviations.
Research Methods for Counselors COUN 597 University of Saint Joseph Class # 6 Copyright © 2015 by R. Halstead. All rights reserved.
1 Today Null and alternative hypotheses 1- and 2-tailed tests Regions of rejection Sampling distributions The Central Limit Theorem Standard errors z-tests.
1 CSI5388: Functional Elements of Statistics for Machine Learning Part I.
How to Handle Intervals in a Simulation-Based Curriculum? Robin Lock Burry Professor of Statistics St. Lawrence University 2015 Joint Statistics Meetings.
1 Introduction to Estimation Chapter Concepts of Estimation The objective of estimation is to determine the value of a population parameter on the.
Building Conceptual Understanding of Statistical Inference Patti Frazer Lock Cummings Professor of Mathematics St. Lawrence University
Bootstrapping (And other statistical trickery). Reminder Of What We Do In Statistics Null Hypothesis Statistical Test Logic – Assume that the “no effect”
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 10. Hypothesis Testing II: Single-Sample Hypothesis Tests: Establishing the Representativeness.
Introducing Statistical Inference with Randomization Tests Allan Rossman Cal Poly – San Luis Obispo
Hypothesis Testing Hypothesis Testing Topic 11. Hypothesis Testing Another way of looking at statistical inference in which we want to ask a question.
Chapter 10 – Sampling Distributions Math 22 Introductory Statistics.
Chapter 7: Sample Variability Empirical Distribution of Sample Means.
IGNITING A PASSION FOR CHANGE ARSONIST OR MERE ACCELERANT? George W. Cobb Mount Holyoke College USCOTS 2013 Raleigh, NC May 18.
Analysis of Variance (ANOVA) Can compare the effects of different treatments Can make population level inferences based on sample population.
Sampling Distribution WELCOME to INFERENTIAL STATISTICS.
RDPStatistical Methods in Scientific Research - Lecture 41 Lecture 4 Sample size determination 4.1 Criteria for sample size determination 4.2 Finding the.
Experimental Psychology PSY 433 Appendix B Statistics.
Stat 1510: Sampling Distributions
Building Conceptual Understanding of Statistical Inference Patti Frazer Lock Cummings Professor of Mathematics St. Lawrence University Canton, New York.
Give your data the boot: What is bootstrapping? and Why does it matter? Patti Frazer Lock and Robin H. Lock St. Lawrence University MAA Seaway Section.
Review I A student researcher obtains a random sample of UMD students and finds that 55% report using an illegally obtained stimulant to study in the past.
+ Using StatCrunch to Teach Statistics Using Resampling Techniques Webster West Texas A&M University.
Early Inference: Using Randomization to Introduce Hypothesis Tests Kari Lock, Harvard University Eric Lock, UNC Chapel Hill Dennis Lock, Iowa State Joint.
Education 793 Class Notes Inference and Hypothesis Testing Using the Normal Distribution 8 October 2003.
Modern Approaches The Bootstrap with Inferential Example.
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
BIOL 582 Lecture Set 2 Inferential Statistics, Hypotheses, and Resampling.
Bootstraps and Scrambles: Letting a Dataset Speak for Itself Robin H. Lock Patti Frazer Lock ‘75 Burry Professor of Statistics Cummings Professor of MathematicsSt.
Simulation-based inference beyond the introductory course Beth Chance Department of Statistics Cal Poly – San Luis Obispo
Using Randomization Methods to Build Conceptual Understanding in Statistical Inference: Day 1 Lock, Lock, Lock, Lock, and Lock Minicourse – Joint Mathematics.
Patti Frazer Lock Cummings Professor of Mathematics
Simulation Based Inference for Learning
Inference for Proportions
Comparing Two Proportions
When we free ourselves of desire,
Review: What influences confidence intervals?
Using Simulation Methods to Introduce Inference
Using Simulation Methods to Introduce Inference
Sampling Distribution of the Mean
Sampling Distribution Models
CHAPTER 11: Sampling Distributions
For a permutation test, we have H0: F1(x) = F2(x) vs
Presentation transcript:

THE INTRODUCTORY STATISTICS COURSE: A SABER TOOTH CURRICULUM? George W. Cobb Mount Holyoke College USCOTS Columbus, OH 5/20/05

Times (days) Mean SD Control (standard):22, 33, Treatment (new):19, 22, 25,

Question: Why, then, is the t-test the centerpiece of the introductory statistics curriculum? Answer: The t-test is what scientists and social scientists use most often. Question: Why does everyone use the t- test? Answer: Because it’s the centerpiece of the introductory statistics curriculum.

WHAT we teach: our Ptolemaic curriculum WHAT’S WRONG with what we teach: three reasons WHY we teach it anyway: the tyranny of the computable WHAT SHOULD we teach instead: putting inference at the center WHY SHOULD we teach it: an unabashed sales pitch

WHAT WE TEACH: Our Ptolemaic Curriculum

Epicycle

Eccentric

and so it goes …

WHY IT’S WRONG Obfuscation Opportunity cost Fraud

What’s this?

Chem 101 – General chemistry I Chem 201 – General chemistry II Chem 202 – Organic chemistry I Biol 150 – Intro Biol I: form & function Biol 200 – Intro Biol II: org. development Biol. 210 – Genetics & molecular biology Biol 340 – Eukaryotic molecular genetics

WHY WE TEACH IT ANYWAY (The tyranny of the computable)

WHAT WE SHOULD TEACH Put the logic of inference at the center of our curriculum

The three Rs of inference: RANDOMIZE, REPEAT, REJECT RANDOMIZE data production –To protect against bias –To provide a basis for inference random samples let you generalize to populations random assignment supports conclusions about cause and effect REPEAT by simulation to see what’s typical –Randomized data production lets you re-randomize, over and over, to see which outcomes are typical, which are not. REJECT any model that puts your data in its tail

WHY WE SHOULD TEACH IT (A dozen reasons)

If we teach the permutation test as the central paradigm for inference, then the model matches the production process, and so it allows us to stress the connection between data production and inference; the model is simple and easily grasped; the distribution is easy to derive for simple cases (small n) by explicitly listing outcomes; the distribution is easy to obtain by physical simulation for simple situations;

If we teach the permutation test as the central paradigm for inference, then the distribution is easy to obtain by a computer simulation whose algorithm is an exact copy of the algorithm for physical simulation; expected value and standard deviation can be defined concretely by regarding the simulated distribution as data; the normal approximation is empirical rather than “theory-by-fiat;”

If we teach the permutation test as the central paradigm for inference, then the entire paradigm generalizes easily to other designs (e.g., block designs), other test statistics, and other data structures (e.g., Fisher’s exact test); it is easy and natural to teach two distinct randomization schemes, with two kinds of inferences; it offers a natural way to introduce students to computer-intensive and simulation-based methods, and so offers a natural lead-in to such topics as the bootstrap; and

If we teach the permutation test as the central paradigm for inference, then it frees up curricular space for other modern topics; last, we should do it because Fisher told us to. Actually, he said in essence that we should do it, except that we can’t, and so we have been forced to rely on approximations:

“the statistician does not carry out this very simple and very tedious process, but his conclusions have no justification beyond the fact that they agree with those which could have been arrived at by this elementary method.”