Download presentation
Presentation is loading. Please wait.
Published byStewart O’Brien’ Modified over 8 years ago
1
Using Simulation to Introduce Concepts of Statistical Inference Allan Rossman Cal Poly – San Luis Obispo arossman@calpoly.edu http://statweb.calpoly.edu/arossman/
2
Advertisement I will present a longer, more interactive version of this in a workshop on Saturday afternoon in Lincoln room Lunch provided! Thanks to John Wiley and Sonss Rossman Northwest Two-Year College Math Conf 2
3
3 3 Outline Who are you? Overview, motivation Four examples Advantages/merits Implementation suggestions Assessment suggestions Resources Q&A Rossman
4
Who are you? How many years have you been teaching? < 1 year 1-3 years 4-8 years 8-15 years > 15 years Northwest Two-Year College Math Conf 4Rossman
5
Who are you? How many years have you been teaching statistics? Never 1-3 years 4-8 years 8-15 years > 15 years Northwest Two-Year College Math Conf 5Rossman
6
Who are you? What is your background in statistics? No formal background A course or two Several courses but no degree Undergraduate degree in statistics Graduate degree in statistics Other Northwest Two-Year College Math Conf 6Rossman
7
Who are you? Have you used simulation in teaching statistics? Never A bit, to demonstrate probability ideas Somewhat, to demonstrate sampling distributions A great deal, as an inference tool as well as for pedagogical demonstrations Northwest Two-Year College Math Conf 7Rossman
8
88 Motivation “Ptolemy’s cosmology was needlessly complicated, because he put the earth at the center of his system, instead of putting the sun at the center. Our curriculum is needlessly complicated because we put the normal distribution, as an approximate sampling distribution for the mean, at the center of our curriculum, instead of putting the core logic of inference at the center.” – George Cobb (TISE, 2007) Northwest Two-Year College Math Conf Rossman
9
99 Example 1: Helper/hinderer? Sixteen pre-verbal infants were shown two videos of a toy trying to climb a hill One where a “helper” toy pushes the original toy up One where a “hinderer” toy pushes the toy back down Infants were then presented with the two toys from the videos Researchers noted which toy then infant chose to play with http://www.yale.edu/infantlab/socialevaluation/Helpe r-Hinderer.html http://www.yale.edu/infantlab/socialevaluation/Helpe r-Hinderer.html Northwest Two-Year College Math Conf Rossman
10
10 Example 1: Helper/hinderer? Data: 14 of the 16 infants chose the “helper” toy Two possible explanations Infants choose randomly, no genuine preference, researchers just got lucky Infants have a genuine preference for the helper toy Core question of inference: Is such an extreme result unlikely to occur by chance (random choice) alone … … if there were no genuine preference (null model)? Northwest Two-Year College Math Conf Rossman
11
11 Analysis options Could use the normal approximation to the binomial, but sample size is too small for CLT Could use a binomial probability calculation We prefer a simulation approach To illustrate “how often would we get a result like this just by random chance?” Starting with tactile simulation Northwest Two-Year College Math Conf Rossman
12
12 Strategy Students flip a fair coin 16 times Count number of heads, representing choices of helper and hinderer toys Under the null model of no genuine preference Repeat several times, combine results See how surprising it is to get 14 or more heads even with “such a small sample size” Approximate (empirical) p-value Turn to applet for large number of repetitions: www.rossmanchance.com/ISIapplets.html (One Proportion) www.rossmanchance.com/ISIapplets.html Northwest Two-Year College Math Conf Rossman
13
13 Results Pretty unlikely to obtain 14 or more heads in 16 tosses of a fair coin, so … Pretty strong evidence that pre-verbal infants do have a genuine preference for helper toy and were not just choosing at random Northwest Two-Year College Math Conf Rossman
14
Follow-up activity Facial prototyping Who is on the left – Bob or Tim? Northwest Two-Year College Math Conf 14Rossman
15
Follow-up activity Facial prototyping Does our sample result provide convincing evidence that people have a genuine tendency to assign the name Tim to the face on the left? How can we use simulation to investigate this question? What conclusion would you draw? Explain reasoning process behind conclusion Northwest Two-Year College Math Conf 15Rossman
16
16 Example 2: Dolphin therapy? Subjects who suffer from mild to moderate depression were flown to Honduras, randomly assigned to a treatment Is dolphin therapy more effective than control? Core question of inference: Is such an extreme difference unlikely to occur by chance (random assignment) alone (if there were no treatment effect)? Northwest Two-Year College Math Conf Rossman
17
17 Some approaches Could calculate test statistic, p-value from approximate sampling distribution (z, chi-square) But it’s approximate But conditions might not hold But how does this relate to what “significance” means? Could conduct Fisher’s Exact Test But there’s a lot of mathematical start-up required But that’s still not closely tied to what “significance” means Even though this is a randomization test Northwest Two-Year College Math Conf Rossman
18
18 Alternative approach Simulate random assignment process many times, see how often such an extreme result occurs 30 index cards representing 30 subjects Assume no treatment effect (null model) 13 improver cards, 17 non-improver cards Re-randomize 30 subjects to two groups of 15 and 15 Determine number of improvers in dolphin group Or, equivalently, difference in improvement proportions Repeat large number of times (turn to computer) Ask whether observed result is in tail of distribution Northwest Two-Year College Math Conf ? ? Rossman
19
19 Analysis www.rossmanchance.com/ISIapplets (Two Proportions) www.rossmanchance.com/ISIapplets Northwest Two-Year College Math Conf 19Rossman
20
20 Conclusion Experimental result is statistically significant And what is the logic behind that? Observed result very unlikely to occur by chance (random assignment) alone (if dolphin therapy was not effective) Providing evidence that dolphin therapy is more effective Northwest Two-Year College Math Conf Rossman
21
21 Example 3: Lingering sleep deprivation? Does sleep deprivation have harmful effects on cognitive functioning three days later? 21 subjects; random assignment Core question of inference: Is such an extreme difference unlikely to occur by chance (random assignment) alone (if there were no treatment effect)? Northwest Two-Year College Math Conf Rossman
22
22 One approach Calculate test statistic, p-value from approximate sampling distribution Northwest Two-Year College Math Conf Rossman
23
23 Another approach Simulate randomization process many times under null model, see how often such an extreme result (difference in group means) occurs Northwest Two-Year College Math Conf Rossman
24
Example 4: Draft lottery Rossman Northwest Two-Year College Math Conf 24
25
Closer look Rossman Northwest Two-Year College Math Conf 25 r = -0.226
26
Familiar refrain How often would such an extreme result (r 0.226) occur by chance alone from a fair, random lottery? Simulate! Rossman Northwest Two-Year College Math Conf 26
27
Simulation result Such an extreme result would virtually never occur from fair, random lottery Overwhelming evidence that lottery used was not random Rossman Northwest Two-Year College Math Conf 27
28
28 Advantages You can do this from beginning of course! Emphasizes entire process of conducting statistical investigations to answer real research questions From data collection to inference in one day As opposed to disconnected blocks of data analysis, then data collection, then probability, then statistical inference Leads to deeper understanding of concepts such as statistical significance, p-value, confidence Very powerful, easily generalized tool Flexibility in choice of test statistic (e.g. medians, odds ratio) Generalize to more than two groups Northwest Two-Year College Math Conf Rossman
29
Implementation suggestions Begin every example/activity with fundamental questions about the study/data Observational units? Variables? Types (cat/quant) and roles (expl/resp) of variables Observational study or experiment? Random sampling? Random assignment? Rossman Northwest Two-Year College Math Conf 29
30
Implementation suggestions Emphasize four pillars of inference Is there a significant effect/difference? How large is it? To what population can you generalize? Can you draw a cause/effect conclusion? Notice that last two questions highlight distinction between random sampling and random assignment Rossman Northwest Two-Year College Math Conf 30
31
31 Implementation suggestions What about normal-based methods: why? Do not ignore them! A common shape often arises for empirical randomization/sampling distributions Duh! Students will see t-tests in other courses, research literature Process of standardization has inherent value Gain intuition through formulas Northwest Two-Year College Math Conf 31Rossman
32
Implementation suggestions What about normal-based methods: how? Introduce after students have gained experience with randomization-based methods As a prediction of how simulation results would turn out Focus on standard deviation of statistic (standard error) Northwest Two-Year College Math Conf 32Rossman
33
33 Implementation suggestions What about interval estimation? Two possible simulation-based approaches Invert test Test “all” possible values of parameter, see which do not put observed result in tail Easy enough (but tedious) with one-proportion situation (sliders), but not as obvious how to do this with comparing two proportions Estimate +/- margin-of-error Could estimate margin-of-error with simulated randomization distribution Rough confidence interval as statistic + 2×(SD of statistic) Northwest Two-Year College Math Conf 33Rossman
34
34 Implementation suggestions Can we introduce SBI gradually? Yes! One class period: Use helper/hinderer activity to introduce concepts of statistical significance, p-value, could this have happened by random chance alone Two class periods: Also use dolphin therapy activity to introduce inference for comparing two groups (chance = random assignment) Three class periods: Also use sleep deprivation activity prior to two-sample t- tests (for quantitative response) Four class periods: Also use draft lottery activity (two quantitative variables) Northwest Two-Year College Math Conf 34Rossman
35
Assessment suggestions Quick assessment of understanding of class activity What did the cards represent? What did shuffling and dealing the cards represent? What implicit assumption about the two groups did the shuffling of cards represent? What observational units were represented by the dots on the dotplot? Why did we count the number of repetitions with 10 or more “successes” (that is, why 10 and why “or more”)? 35 Northwest Two-Year College Math Conf 35Rossman
36
36 Assessment suggestions Conceptual understanding of logic of inference Interpret p-value in context: Probability of observed data, or more extreme, under randomness hypothesis, if null model is true Summarize conclusion in context, and explain reasoning process Apply to new studies, new scenarios Define null model, design simulation, draw conclusion More complicated scenarios (e.g., compare 3 groups), new statistics (e.g., relative risk) Northwest Two-Year College Math Conf 36Rossman
37
37 Assessment suggestions Multiple-choice example (not simulation-based) Suppose one study finds that 30% of women sampled dream in color, compared to 20% of men. Study A sampled 100 people of each sex, whereas Study B sampled 40 people of each sex. Which study would provide stronger evidence that there is a genuine difference between men and women on this issue? A. Study A B. Study B C. The strength of evidence would be the same for these two studies Northwest Two-Year College Math Conf 37Rossman
38
38 Assessment suggestions Free response example (simulation-based) In a recent study, researchers presented young children (aged 5 to 8 years) with a choice between two toy characters who were offering stickers. One character was described as mean, and the other was described as nice. The mean character offered two stickers, and the nice character offered one sticker. Researchers wanted to investigate whether infants would tend to select the nice character over the mean character, despite receiving fewer stickers. They found that 16 of the 20 children in the study selected the nice character. Northwest Two-Year College Math Conf 38Rossman
39
39 Assessment suggestions Free response example (simulation-based) Describe (in words) the null model/hypothesis in this study. Suppose that you were to conduct a simulation analysis of this study to investigate whether the observed result provides strong evidence that children genuinely prefer the nice toy with one sticker over the mean toy with two stickers. Indicate what you would enter for the following three inputs: Probability of heads: _____ Number of tosses: _____ Number of repetitions: _____ Northwest Two-Year College Math Conf 39Rossman
40
40 Assessment suggestions Free response example (simulation-based) One of the following graphs was produced from a correct simulation analysis. The other two were produced from incorrect simulation analyses. Circle the correct one. Which of the following is closest to the p-value for this study? 5.0,.50,.05,.005 Northwest Two-Year College Math Conf 40Rossman
41
41 Assessment suggestions Free response example (simulation-based) Write an interpretation of this p-value in the context of this study (probability of what, assuming what?). Summarize your conclusion from this research study and simulation analysis. Northwest Two-Year College Math Conf 41Rossman
42
Resources Northwest Two-Year College Math Conf 42Rossman
43
Resources Northwest Two-Year College Math Conf 43Rossman
44
Resources Simulation-based inference blog: www.causeweb.org/sbi/ www.causeweb.org/sbi/ ISI applets: www.rossmanchance.com/ISIapplets.html Statkey app: lock5stat.com/statkey Northwest Two-Year College Math Conf 44Rossman
45
Thanks! Want to learn more? Workshop (with lunch) on Saturday afternoon in Lincoln room, thanks to John Wiley and Sons http://www.math.hope.edu/isi/ http://www.math.hope.edu/isi/ arossman@calpoly.edu arossman@calpoly.edu Northwest Two-Year College Math Conf 45Rossman
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.