Randomization workshop eCOTS May 22, 2014 Presenters: Nathan Tintle and Beth Chance.

Presentation transcript:

Introductions. Presenters: Nathan Tintle, Dordt College; Beth Chance, Cal Poly. Participants: Telling you a little about you!

Participant profile

Over 85% are here because of AP-equivalent introductory statistics. Remainder: calculus-based introductory statistics; other statistics course.

Participant profile

Goals: What is randomization-based inference? Why would I want it in my course? How does it look in my course? Challenged with new ideas. Implementation tips and advice. Comparisons with other randomization texts. And a lot more!

Our primary goals are for you to: understand what a randomization/simulation-based approach to statistical inference is; understand why it is an increasingly popular approach to teaching introductory statistics; have experienced two concrete examples of how it works in the classroom; have a sense of one of the major curriculum options available for teaching with this approach.

Overview
Hour #1
First minutes: Welcome/introductions/overview/goals
Next minutes: What is a randomization-based curriculum? Why?*
Next minutes: Activity: Meet Doris and Buzz*
Hour #2 (after short 5-minute break)
First minutes: The ISI curriculum: What, how, and why*
Next minutes: Activity: Is yawning contagious?*
Final minutes: Cautions, implementation, assessment*
Final minutes: Next steps, class testing, ongoing discussion*
*Ask questions both during and immediately following each presentation. To ask a question, type the question into the Questions pane of the GoToWebinar control panel. Do not use the raise hands feature. During all sessions we STRONGLY encourage you to ask questions!
Note: We will post slides after the workshop, and a recording will be available via eCOTS.

What do we mean by a randomization-based curriculum and why consider it?

Overview: Why look at the content of Stat 101? Stat 101 = general algebra-based intro stats course (equivalent to AP Statistics). George Cobb's challenge about how the content might change. Randomization/simulation as an overarching approach to statistical inference. Some general trends and themes in randomization curricula to date.

Brief history of stat ed: Consensus curriculum by late 1990s, but nexus in early 1980s: Descriptive Statistics; Probability/Design/Sampling Distributions; Inference (testing and intervals). GAISE College Report (2005): six pedagogical suggestions for Stat 101: conceptual understanding, active learning, real data, statistical literacy and thinking, use technology, and use assessments for learning.

Brief history of stat ed: No real pressure to change content. Major changes: The computer changed from an institutionally owned behemoth to the individually owned desktop or laptop. As computers became ubiquitous, so did data collection, and with it the need for data analysis. Statistical practice changed as well: more computer-intensive methods, large datasets, multivariable methods, etc. Recognition of the utility of simulation to enhance student understanding of random processes.

Brief history of stat ed: Other changes directly impacting Stat 101: Stats increasing in K-12 curriculum (NCTM, Common Core, Advanced Placement) (Franklin eCOTS plenary talk). Enrollments have skyrocketed (high school, two- and four-year colleges). Stat ed research has given us more knowledge of how and what students learn in Stat 101.

Potential shortcomings: Overlap with K-12 treatment of descriptive statistics is inefficient; eventually students will also have exposure to informal inference. Although computer-intensive methods have become a central part of statistical practice, they are largely or wholly absent from the typical first course. Assessment methods developed over the last decades show that student understanding of the logic of inference is typically limited at best (Cobb 2007, TISE). The traditional first course does not devote sufficient time or space to the connections among the method of data production, the method used to analyze the data, and the scope of inference justified by the analysis. For randomization-based methods, these connections are simple and direct.

Intro Stat as a Cathedral of Tweaks (a George Cobb analogy). Boswell famously described Samuel Johnson as a cathedral of tics. Thesis: The usual normal distribution-worshipping intro course is a cathedral of tweaks. The orthodox doctrine is simple: Use the CLT to justify the normal. Use the normal to compute tail areas. Reject if |observed - expected_0| > 2 SEs. Interval = Est +/- M_95, with M_95 = 2 SEs.

The Cathedral of Tweaks
(a) z vs. t: If we know the population SD we use z; when we estimate the SD we use t.
(b) z vs. t: (a) holds except for proportions; then we use z, not t, even when we estimate the SD.
(c) Estimating the SD: For proportions, we estimate the SD for intervals, but use the null value for tests.
(d) n vs. (n-1) vs. (n-2): The SE is SD/sqrt(n); we divide by n because we have n observations. But for estimating the SD we divide by (n-1), even though there are n deviations, except that when we get to regression we use (n-2).

Still More Tweaks: If your data set is not normal, you may need to transform. If you work with small samples, there are guidelines for when you can use methods based on the normal, e.g., n > 30, or np > 5 and n(1-p) > 5.

The consequence Few students ever leave our course seeing this

The consequence The better students may get a fuzzy impression

The consequence All too many noses stay too close to the canvas, and see disconnected details

A potential solution? Randomization = simulation, bootstrapping and/or permutation tests. Use of computationally intensive methods to: estimate/approximate the null distribution for significance tests; estimate/approximate the margin of error for confidence intervals.

A potential solution: Simulation. Flip coins or spin spinners to simulate the binomial distribution instead of starting with binomial distribution theory or the normal approximation to the binomial.

Simulation example: What are the chances a basketball player is shooting free throws better in the playoffs (16/20 in game 1) than they typically do, if they typically make 50% of their free throws? Flip 20 coins to simulate the performance of the player if there is no change in free throw percentage. Repeat to assess the likelihood of such a player making 16 FTs. How often would we get a statistic like the one in the study by chance alone, that is, if the player is still a 50% shooter?
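The coin-flipping procedure on this slide can be sketched in a few lines of Python. This is a minimal illustration, not the workshop's applet: the function names, the 10,000 repetitions, and the fixed seed are assumptions made for the example, and "at least 16 made shots" is taken as the meaning of "such a statistic."

```python
import random

def simulate_made_shots(rng, n_shots=20, p_make=0.5):
    """Flip n_shots fair coins; each head counts as a made free throw."""
    return sum(rng.random() < p_make for _ in range(n_shots))

def estimate_p_value(observed=16, n_shots=20, p_make=0.5, reps=10_000, seed=1):
    """Proportion of simulated games with at least `observed` made shots."""
    rng = random.Random(seed)  # fixed seed only so the sketch is repeatable
    hits = sum(
        simulate_made_shots(rng, n_shots, p_make) >= observed
        for _ in range(reps)
    )
    return hits / reps

p_value = estimate_p_value()
print(f"Estimated chance of 16+ made shots out of 20 for a 50% shooter: {p_value:.4f}")
```

The estimate should be well under 1%, which is the sense in which 16 of 20 would be surprising for a true 50% shooter.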

A potential solution: Bootstrap. Use 1000s of resamples (with replacement) of the observed data to generate an approximate sampling distribution, which can be used to estimate the margin of error.

Bootstrap example: Gather hours slept last night for 20 students: 5, 5, 6, 6, 6, 7, 7, 7, 7, 8, 8, 8, 6.5, 6.5, 7, 7.5, 7.5, 7.5, 7, 4. Mean = 6.7 hours, SD = 1.1 hours. Find: 95% CI for population average sleep hours. Bootstrap: Model the data-gathering process: 1000 random samples of 20 with replacement, computing the sample mean each time. Keep going!

Bootstrap example (re)Sample means

Bootstrap example: How much might these sample means vary from sample to sample by chance alone? Find the middle 95% of resample means = 95% CI for the true population mean: 6.2 to 7.1 hours.
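The bootstrap steps from the last few slides can be sketched as follows, using the 20 sleep values given in the example. The function names, the seed, and the simple percentile rule used to pick the "middle 95%" are assumptions for illustration; other percentile conventions give slightly different endpoints.

```python
import random
import statistics

# Hours slept, as listed on the slide (n = 20).
hours = [5, 5, 6, 6, 6, 7, 7, 7, 7, 8, 8, 8, 6.5, 6.5, 7, 7.5, 7.5, 7.5, 7, 4]

def bootstrap_means(data, reps=1000, seed=1):
    """Resample the data with replacement `reps` times; return the resample means."""
    rng = random.Random(seed)  # fixed seed only so the sketch is repeatable
    n = len(data)
    return [statistics.mean(rng.choices(data, k=n)) for _ in range(reps)]

def percentile_interval(values, level=0.95):
    """Middle `level` of the values, using a simple order-statistic rule."""
    ordered = sorted(values)
    cut = round((1 - level) / 2 * len(ordered))  # number trimmed from each tail
    return ordered[cut], ordered[len(ordered) - cut - 1]

low, high = percentile_interval(bootstrap_means(hours))
print(f"Sample mean: {statistics.mean(hours):.2f} hours")
print(f"Bootstrap 95% interval: {low:.2f} to {high:.2f} hours")
```

Because resampling is random, the endpoints change a little with the seed, but they should land close to the 6.2 to 7.1 hours reported on the slide.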

A potential solution: Permutation tests. Compare 2 or more groups. Null: no treatment effect; the distribution of the response variable is the same in all groups. Ex.: Is a new treatment better than placebo?

A potential solution: Permutation tests. Write the values of the response variable (categorical or quantitative) on slips of paper. Shuffle the slips and re-randomize them to the two or more groups. Recompute the value of the statistic to build an empirical null distribution, and compare it to the actual statistic. How often would we get a statistic like the one in the study by chance alone if the null is true?
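The slips-of-paper procedure can be sketched in Python. The slides give no data for this example, so the treatment and placebo values below are hypothetical, invented only to make the sketch runnable. The statistic (difference in group means) and the one-sided comparison are also assumptions, chosen to match the question "is the new treatment better than placebo?"

```python
import random
import statistics

# Hypothetical responses, invented for illustration only.
treatment = [12, 15, 11, 18, 14, 16]
placebo = [10, 9, 13, 11, 8, 12]

def diff_in_means(group_a, group_b):
    """Statistic: mean of group_a minus mean of group_b."""
    return statistics.mean(group_a) - statistics.mean(group_b)

def permutation_p_value(group_a, group_b, reps=5000, seed=1):
    """Shuffle the pooled responses, re-deal them to the two groups, and
    count how often the shuffled statistic is at least the observed one."""
    rng = random.Random(seed)  # fixed seed only so the sketch is repeatable
    observed = diff_in_means(group_a, group_b)
    pooled = list(group_a) + list(group_b)  # the "slips of paper"
    n_a = len(group_a)
    as_extreme = 0
    for _ in range(reps):
        rng.shuffle(pooled)
        if diff_in_means(pooled[:n_a], pooled[n_a:]) >= observed:
            as_extreme += 1
    return observed, as_extreme / reps

observed_diff, p_value = permutation_p_value(treatment, placebo)
print(f"Observed difference in means: {observed_diff:.2f}")
print(f"Proportion of shuffles at least as large: {p_value:.4f}")
```

The shuffled statistics play the role of the empirical null distribution on the slide: each shuffle answers "what could the difference have looked like if group labels did not matter?"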

A potential solution: These methods may offer a quicker, less abstract bridge to the logic of inference while also emphasizing the scope of inference (random sampling, random assignment). They may scaffold the transition to traditional (asymptotic, theory-based) methods better than traditional theory, probability theory, etc.

General trends: Momentum behind the randomization approach to inference in the last 8-10 years: Cobb 2005 talk (USCOTS); Cobb 2007 paper (TISE); 2011 USCOTS: The Next Big Thing. New and coming-soon curricula: Lock5 (theory and randomization, more traditional sequence of topics); Tintle et al. (theory and randomization, four pillars of inference and then chapters based on type of data); CATALST (emphasis on modelling); others.

General trends: Many sessions at conferences talking about the approach, benefits, and questions/concerns. Assessment: two papers (Tintle et al. 2011, Tintle et al. 2012): better on some things, do no harm on others; more coming.

Q+A

Doris and Buzz Simulation for a single proportion

Introduction: First main example (after brief Preliminaries). Story. Questions for students: Can we prove dolphins can communicate abstract concepts? What other explanations are there? How to explain/justify to someone else? Chance model, simulation.

Three S Strategy. Statistic: Compute the statistic from the observed sample data. Simulate: Identify a model that represents a by-chance explanation. Repeatedly simulate values of the statistic that could have happened when the chance model is true. Strength of evidence: Consider whether the value of the observed statistic is unlikely to occur when the chance model is true.

Dolphin Communication. Statistic: In one set of trials, Buzz chose the correct button 15 out of 16 times. Based on these results, do you think Buzz knew which button to push, or was he just guessing?

Dolphin Communication. Simulate: coin flip = guess by Buzz; heads = correct guess; tails = wrong guess; chance of heads = probability of correct button when Buzz is just guessing; one set of 16 coin flips = one set of 16 attempts by Buzz.
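The coin-flip model for Buzz, together with the Three S Strategy, can be sketched as below. The tally of heads counts stands in for the dotplot a class would build on the board. The names, the 1,000 repetitions, and the seed are assumptions for illustration, and the exact binomial calculation at the end is added only as a cross-check; it is not part of the slides.

```python
import random
from collections import Counter
from math import comb

N_ATTEMPTS = 16        # attempts in one set of trials
OBSERVED_CORRECT = 15  # Statistic: Buzz chose correctly 15 of 16 times

def one_set(rng, n=N_ATTEMPTS, p_correct=0.5):
    """One set of 16 coin flips; heads = correct guess under 'just guessing'."""
    return sum(rng.random() < p_correct for _ in range(n))

def null_distribution(reps=1000, seed=1):
    """Simulate: tally the number of heads across many sets of 16 flips."""
    rng = random.Random(seed)  # fixed seed only so the sketch is repeatable
    return Counter(one_set(rng) for _ in range(reps))

tally = null_distribution()

# Strength of evidence: how often did guessing do at least as well as Buzz?
as_extreme = sum(count for correct, count in tally.items() if correct >= OBSERVED_CORRECT)
simulated_proportion = as_extreme / sum(tally.values())

# Cross-check: exact chance of 15 or 16 heads in 16 fair flips.
exact = sum(comb(N_ATTEMPTS, k) for k in range(OBSERVED_CORRECT, N_ATTEMPTS + 1)) / 2 ** N_ATTEMPTS

for correct in sorted(tally):
    print(f"{correct:2d} correct: {tally[correct]}")
print(f"Simulated proportion with 15+ correct: {simulated_proportion:.4f}")
print(f"Exact probability under guessing: {exact:.6f}")
```

The exact probability is 17/65536, roughly 3 in 10,000, so most runs of 1,000 simulated sets will contain no result as extreme as Buzz's.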

Dolphin Communication. Simulate: What might be on the front board in a class of 25 students. A larger class is fine!

Dolphin Communication. Simulate: Still not convinced 15 is unlikely? Go to the applet to get a very large class flipping coins with you. Click on Applets, then click One Proportion. Applets are JavaScript and so work on all platforms, including iPhones, iPads, etc.

Moving past Doris and Buzz: Null/alt hypotheses, non-50/50 null (1.2). Parameter (1.1). Strength of evidence: p-value (1.2); standardized statistic, Z (1.3). Two-sided tests, what impacts strength of evidence (1.4). Theory-based approaches (overlay normal) (1.5).
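The outline mentions a standardized statistic without giving a formula. A minimal sketch, assuming the usual one-proportion form z = (sample proportion - null proportion) / (SD of the sample proportion under the null), applied to Buzz's 15 of 16. The formula and function name are assumptions, not taken from the slides; a simulation-based course may instead use the SD of the simulated null distribution.

```python
from math import sqrt

def standardized_statistic(successes, n, null_proportion):
    """z = (p_hat - pi_0) / sqrt(pi_0 * (1 - pi_0) / n), the theory-based form."""
    p_hat = successes / n
    null_sd = sqrt(null_proportion * (1 - null_proportion) / n)
    return (p_hat - null_proportion) / null_sd

z_buzz = standardized_statistic(15, 16, 0.5)
print(f"Standardized statistic for Buzz: {z_buzz:.2f}")  # 3.50
```

A value of 3.5 says Buzz's result sits 3.5 null standard deviations above what guessing would produce on average, in line with the small simulated proportion.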

Q+A

End of hour #1. Short break: back in 1 minute!! We'll start at 38 minutes after the hour.