ASA-TC 2013 1 MILO SCHIELD Augsburg College Member: International Statistical Institute US Rep: International Statistical Literacy Project Director, W.

Slides:



Advertisements
Similar presentations
Teaching activities towards Achievement Standard 2.9 Use statistical methods to make an inference. Lindsay Smith, University of Auckland Stats Day 2011.
Advertisements

Traps and pitfalls in medical statistics Arvid Sjölander.
Drawing Samples in “Observational Studies” Sample vs. the Population How to Draw a Random Sample What Determines the “Margin of Error” of a Poll?
Estimation of Sample Size
Objectives Look at Central Limit Theorem Sampling distribution of the mean.
1. Identify the variable(s) of interest (the focus) and the population of the study. 2. Develop a detailed plan for collecting data. Make sure sample.
1. Estimation ESTIMATION.
Copyright (c) Bani Mallick1 Lecture 2 Stat 651. Copyright (c) Bani Mallick2 Topics in Lecture #2 Population and sample parameters More on populations.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 6: Correlation.
Copyright (c) Bani K. Mallick1 STAT 651 Lecture #15.
Meta-analysis & psychotherapy outcome research
BPS - 3rd Ed. Chapter 131 Confidence intervals: the basics.
2015 USCOTS V1 1 Milo Schield, Augsburg College Member: International Statistical Institute US Rep: International Statistical Literacy Project Director,
Statistics for CS 312. Descriptive vs. inferential statistics Descriptive – used to describe an existing population Inferential – used to draw conclusions.
Full time and part time employment Coventry population in employment by gender Source: Annual Population Survey, Office for National Statistics
Descriptive Statistics: Part One Farrokh Alemi Ph.D. Kashif Haqqi M.D.
Chapter 5: Descriptive Research Describe patterns of behavior, thoughts, and emotions among a group of individuals. Provide information about characteristics.
Teaching Critical Thinking in a Statistics Course Prabha Betne LaGuardia Community College Mathematics Department November 18, 2006 NYSMATYC Region IV.
Second Stats Course: What should it be? Marc Isaacson Augsburg College Dept of Business Administration MILO SCHIELD Augsburg College Dept of Business Administration.
Inference for Distributions
Multiple Choice Questions for discussion
Determining Sample Size
1/15/2006 MILO SCHIELD Augsburg College Dept of Business Administration Director, W. M. Keck Statistical Literacy Project MAA Quantitative Literacy 15.
Two-Sample Inference Procedures with Means. Two-Sample Procedures with means two treatments two populationsThe goal of these inference procedures is to.
Basic statistics 11/09/13.
Estimation of Statistical Parameters
Many times in statistical analysis, we do not know the TRUE mean of a population of interest. This is why we use sampling to be able to generalize the.
Topic 5 Statistical inference: point and interval estimate
Chapter Twelve Census: Population canvass - not really a “sample” Asking the entire population Budget Available: A valid factor – how much can we.
2014-Schield-DSI-Stats-Curricullum V0A 1 by Milo Schield Member: International Statistical Institute US Rep: International Statistical Literacy Project.
Vegas Baby A trip to Vegas is just a sample of a random variable (i.e. 100 card games, 100 slot plays or 100 video poker games) Which is more likely? Win.
MAT 1000 Mathematics in Today's World. MAT 1000 Topics 1. Statistics Organize, summarize, and describe data 2. Probability Looking for patterns in uncertain.
Beak of the Finch Natural Selection Statistical Analysis.
STORIES AND STATISTICS. Prepared by Frank Swain National Coordinator for Science Training for Journalists Royal Statistical Society
Chapter 2 Data.
Chapter 1 Statistical Thinking What is statistics? Why do we study statistics.
11 Oct 2008 Quirk-PKAL 1 MILO SCHIELD, Augsburg College Director, W. M. Keck Statistical Literacy Project Vice President, National Numeracy Network Project.
Chapter 2 Notes Math Math 1680 Assignments Look over Chapter 1 and 2 before Wednesday Assignment #2: Chapter 2 Exercise Set A (all, but #7, 8, and.
1. Identify the variable(s) of interest (the focus) and the population of the study. 2. Develop a detailed plan for collecting data. Make sure sample.
Gathering Useful Data. 2 Principle Idea: The knowledge of how the data were generated is one of the key ingredients for translating data intelligently.
2015 ASA V2A 1 by Milo Schield, Augsburg College Member: International Statistical Institute US Rep: International Statistical Literacy Project Director,
Section 10.1 Confidence Intervals
BPS - 3rd Ed. Chapter 131 Confidence Intervals: The Basics.
 Producing Data: Experiments Vs. Surveys Chapter 5.
Copyright © 2010 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
Model Trendline Linear Excel 2013 V0F 1 by Milo Schield Member: International Statistical Institute US Rep: International Statistical Literacy Project.
Model Regress Linear 3Factor Excel 2013 V0F 1 by Milo Schield Member: International Statistical Institute US Rep: International Statistical Literacy Project.
Inference: Probabilities and Distributions Feb , 2012.
Regression Analysis: Part 2 Inference Dummies / Interactions Multicollinearity / Heteroscedasticity Residual Analysis / Outliers.
2015 ASA V4 1 by Milo Schield, Augsburg College Member: International Statistical Institute US Rep: International Statistical Literacy Project Director,
CHAPTER 9 Producing Data: Experiments BPS - 5TH ED.CHAPTER 9 1.
2015 StatChat2 V1 1 by Milo Schield, Augsburg College Member: International Statistical Institute US Rep: International Statistical Literacy Project Director,
Intro Stats Warm – Up 1.3 Determine the Level of Measurement of each (nominal, ordinal, interval, ratio) 1. The daily high temperatures in a city. 2. The.
Heart Disease Example Male residents age Two models examined A) independence 1)logit(╥) = α B) linear logit 1)logit(╥) = α + βx¡
BPS - 5th Ed. Chapter 91 Producing Data: Experiments.
Surveillance and Population-based Prevention Department for Prevention of Noncommunicable Diseases Displaying data and interpreting results.
11 Oct 2008 PKAL Quirk 1 MILO SCHIELD, Augsburg College Director, W. M. Keck Statistical Literacy Project Vice President, National Numeracy Network Project.
Developing A Thesis Chapter 2.1 – In Search of Good Data
Model Trendline Linear 3Factor Excel by Milo Schield Member: International Statistical Institute US Rep: International Statistical Literacy Project.
Chapter 3 Producing Data. Observational study: observes individuals and measures variables of interest but does not attempt to influence the responses.
Warm Up In May 2006, the Gallup Poll asked 510 randomly sampled adults the question “Generally speaking, do you believe the death penalty is applied fairly.
Research in Psychology Chapter Two 8-10% of Exam AP Psychology.
Copyright © 2009 Pearson Education, Inc. Chapter 2 Data.
Assessment in First Courses of Statistics: Today and Tomorrow Ann Ooms University of Minnesota.
Simulation-based inference beyond the introductory course Beth Chance Department of Statistics Cal Poly – San Luis Obispo
DESIGNING GOOD SURVEYS Laura P. Naumann Assistant Professor of Psychology Nevada State College.
Comparing Two Proportions Chapter 21. In a two-sample problem, we want to compare two populations or the responses to two treatments based on two independent.
Confidence Intervals: The Basics
What makes a first course in statistics?
Presentation transcript:

ASA-TC MILO SCHIELD Augsburg College Member: International Statistical Institute US Rep: International Statistical Literacy Project Director, W. M. Keck Statistical Literacy Project May 22, 2013 Slides at Adding Context to Introductory Statistics

ASA TC Students see less value in statistics after finishing the intro statistics course than before they started. 2.Six months after completing a statistics course, students forget half of what they learned. 3.Statistics courses are largely irrelevant—not just boring or technically difficult, but irrelevant. Enhrenberg (1954) 4.“become more difficult to provide an agreed-upon list of … topics … that all students should learn.” Pearl et al (2012). 2 Statistics Education has “issues”

ASA TC Why does Introductory Stats have these Issues? Traditional introductory statistics courses focus on variability – they are not math courses. But they don’t focus on context. Once the median is jettisoned in place of the mean, context is absent. The lack of context may explain: why students see less value after a course than before. why students forget half of what they learn in 6 mos. why students consider statistics irrelevant. why statistical educators cannot agree on topics.

ASA TC Thesis Adding context to introductory statistics will uphold context as the essence of statistics (e.g., statistics are numbers in context), more clearly separate statistics as a liberal art from mathematical statistics, improve student retention of key ideas, and improve student attitudes on the value of statistics. Consider five examples of context influencing statistics

ASA TC Influence of Context #1: Subject Bias* When asked their income, men over-stated by about 10% on average; women told the truth. When asked their weight, women understated by 10# on average; men typically told the truth. * Made-up statistics to illustrate the point.

ASA TC Influence of Context #2: Defining Groups or Conditions Number of US children with elevated lead: 27,000 in ,000 in 2010 CDC changed the standard in 2010 from 10 micrograms of lead per dl of blood to five.

ASA TC Influence of Context #3: What is taken into account The chance of a run of k heads in n flips of a fair coin depends on the context: “place pre- specified” versus “somewhere in the series.” The accuracy of a medical test depends on the context: confirming versus predicting. The predictive accuracy of a medical test depends on the context: the percentage of subjects tested that have the disease.

ASA TC Influence of Context #4: Choice of Population In predicting or explaining grade differences among first-year college students: SAT scores do a poor job for students at colleges that admit a narrow range of scores (highly selective colleges). SAT scores do a good job for students at colleges that admit a wide-range of scores.

ASA TC Influence of Context #5: Confounding The male-female difference in median* weights among 20-year-olds is 27 pounds. 27#: Male median wt: 156#; Female median wt: 129# Male median height: 70"; Female median height: 64" Median weight of 70” high females is 142# est. * The male-female difference in median weight for 20-year olds is 14 pounds after controlling for height.

ASA TC Influence of Context on Statistical Significance The foregoing shows how context can influence a statistic, but the focus of the intro statistics course is statistical significance. Q1. Can we show how each of these can influence statistical significance??? ABSOLUTELY!!! ABSOLUTELY!!! Do everything with tables and confidence intervals. Non-overlap means statistical significance. Q2. Can it be done with minimal math and time?

ASA TC Influence of Bias on Significance Response bias: Men likely to overstate income Sample bias: Rich less likely to do surveys

ASA TC Influence of Assembly on Significance Two definitions of “bullying” Two ways to combine subgroups to form groups

ASA TC Confounder Influence: Insignificance to Significance Necessary: Confounding must increase gap. Theorem: If the confidence intervals don’t overlap for the two values of the binary confounder and the order never reverses, then the confidence intervals at any standardized value will not overlap.

ASA TC Confounder Influence: Significance to Insignificance Necessary: Confounding must decrease the predictor gap. Location & age 1.5% The 95% Margin of Error Death RateCityRuralDiffCompare ALL22.7%29.4%6.7%Standard Over %30.0%1.0%smaller Under %24.0%2.0%smaller

ASA TC Conclusion #1 To uphold statistics as mathematics with a context, the introductory statistics course must be redesigned. The intro course needs much more focus on big ideas: Context (what is controlled), assembly (definitions) and bias are big ideas for non-statisticians. Randomness and statistical significance are big ideas for statisticians. Seeing how confounding, assembly and bias can influence statistical significance should be central for a “statistics-in-context” course.

ASA TC Conclusion #2 Thesis: Adding context to introductory statistics will improve student retention of key ideas, improve attitudes on the value of studying statistics, uphold context – not variability – as the essential difference between statistics and mathematics. Since this can be done with minimal math and very little time, the introductory statistics course should be re-designed as a “statistics-in-context” course!

ASA TC References ASA (2012). GAISE Report. Ehrenberg, A. S. C. (1976). We must preach what is practised: a radical review of statistical teaching. Journal of the Royal Statistical Society, Series D, 25(3),195–208. Pearl, D., Garfield, J., delMas, R., Groth, R., Kaplan, J. McGowan, H., and Lee, H.S. (2012). Connecting Research to Practice in a Culture of Assessment for Introductory College-level Statistics. ec_2012.pdf Schield, M. (2006). Presenting Confounding and Standardization Graphically. STATS Magazine, American Statistical Association. Fall pp Copy at

ASA TC Math-Stats Math is based on formulas, patterns & structure; Statistics is based on data.

ASA TC Examples “the central premise of statistical sampling theory—larger samples allow for more reliable conclusions about a population — does not translate directly to time series forecasting, where longer time series do not necessarily mean better forecasts.” Winkler (2009) “social and economic statistics, though numeric, is essentially [a] quantified history of society, not a branch of mathematics.” Winkler (2009)

ASA TC Real-life Examples vs. Context Some may point to the GAISE report (ASA 2010) recommending more real-life examples and hands- on analyses as an example of how statistics is keenly aware of context. But real-life examples (the birthday problem) don’t necessarily involve context in any significant way. Using context to deciding which test to use is quite different from seeing the influence of context on statistical significance.

ASA TC Confounder Influence: Insignificance to Significance Necessary: Confounding increases predictor gap. Increase is not always sufficient

ASA TC Influence of Context #6: Confounding The death rate among patients is typically higher at city research hospitals than at rural hospitals. The death rate among patients is typically lower at city research hospitals than at rural hospitals – for patients having similar health conditions.

ASA TC Confounder Influence: Significance to Insignificance Necessary: Confounding decreases predictor gap. Decrease is not always sufficient 1.5% The 95% Margin of Error Death RateCityRuralDiffCompare ALL22.7%29.4%6.7%Standard Over %30.0%1.0%smaller Under %24.0%2.0%smaller 0.4%The 95% Margin of Error Death RateCityRuralDiffCompare ALL22.7%29.4%6.7%Standard Over %30.0%1.0%smaller Under %24.0%2.0%smaller