Reasoning and Sense Making in Data Analysis & Statistics Beth Chance Henry Kranendonk Mike Shaughnessy.

Slides:



Advertisements
Similar presentations
C HAPTER 1.1 Analyzing Categorical Data. I NDIVIDUALS AND V ARIABLES Individuals are the objects described by a set of data. Individuals may be people,animals,
Advertisements

Copyright © Allyn & Bacon (2007) Statistical Analysis of Data Graziano and Raulin Research Methods: Chapter 5 This multimedia product and its contents.
Statistics 100 Lecture Set 7. Chapters 13 and 14 in this lecture set Please read these, you are responsible for all material Will be doing chapters
Psychology: A Modular Approach to Mind and Behavior, Tenth Edition, Dennis Coon Appendix Appendix: Behavioral Statistics.
Table of Contents Exit Appendix Behavioral Statistics.
Analysis. Start with describing the features you see in the data.
Math Alliance Project 4th Stat Session
1 Chapter 1: Sampling and Descriptive Statistics.
IB Math Studies – Topic 6 Statistics.
Designing a Continuum of Learning to Assess Mathematical Practice NCSM April, 2011.
Chapter 3 Producing Data 1. During most of this semester we go about statistics as if we already have data to work with. This is okay, but a little misleading.
Algebra 2 – Statistics Normal Distribution Unit 5
1 Hypothesis Testing In this section I want to review a few things and then introduce hypothesis testing.
Sampling Distributions
Quantitative Data Analysis Definitions Examples of a data set Creating a data set Displaying and presenting data – frequency distributions Grouping and.
Experimental Statistics I.  We use data to answer research questions  What evidence does data provide?  How do I make sense of these numbers without.
Statistical Thinking Space Shuttle O-ring Failure The night before the disaster, it was predicted that it would be a cold night (30°F). Some NASA engineers.
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: Hypotheses STAT 101 Dr. Kari Lock Morgan SECTION 4.1 Statistical test Null and alternative.
Chapter 8 Introduction to Hypothesis Testing. Hypothesis Testing Hypothesis testing is a statistical procedure Allows researchers to use sample data to.
1. An Overview of the Data Analysis and Probability Standard for School Mathematics? 2.
Notes on Statistics Education (loosely based on C. Franklin's ppt at the ASA website) Curriculum Standards (PSSM) of the National Council of Teachers of.
Hypothesis Testing II The Two-Sample Case.
Think of a topic to study Review the previous literature and research Develop research questions and hypotheses Specify how to measure the variables in.
Objective To understand measures of central tendency and use them to analyze data.
GCSE Data Handling Coursework 1 Examining the Data examine carefully the data you are given it’s important to get a feel for the raw data before you use.
Key ideas of analysis & interpretation of data Visualize data – (tables, pictures, graphs, statistics, etc. to reveal patterns & relationships). Making.
DATA DETECTIVES TWSSP Tuesday. Agenda for today Distinguishing Distributions Old Faithful – Data Detectives Activity Head Measurement Hand Span Measurement.
GAISEing into the Common Core Standards A Professional Development Seminar sponsored by the Ann Arbor Chapter of the ASA.
Data Collection & Processing Hand Grip Strength P textbook.
Chapter 1: Exploring Data AP Stats, Questionnaire “Please take a few minutes to answer the following questions. I am collecting data for my.
6.1 What is Statistics? Definition: Statistics – science of collecting, analyzing, and interpreting data in such a way that the conclusions can be objectively.
MAKING DECISIONS IN THE FACE OF VARIABILITY TWSSP Wednesday.
INTRODUCTORY STATISTICS Chapter 2 DESCRIPTIVE STATISTICS PowerPoint Image Slideshow.
Welcome to Common Core High School Mathematics Leadership
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 1 – Slide 1 of 34 Chapter 11 Section 1 Random Variables.
Warm Up Researchers deigned an observational study to investigate the accepted “normal” body temperature of 98.6F. In the study, 148 healthy men and women.
Chapter 1 The Role of Statistics. Three Reasons to Study Statistics 1.Being an informed “Information Consumer” Extract information from charts and graphs.
 Frequency Distribution is a statistical technique to explore the underlying patterns of raw data.  Preparing frequency distribution tables, we can.
Day 3: Sampling Distributions. CCSS.Math.Content.HSS-IC.A.1 Understand statistics as a process for making inferences about population parameters based.
Statistical analysis Outline that error bars are a graphical representation of the variability of data. The knowledge that any individual measurement.
PCB 3043L - General Ecology Data Analysis. OUTLINE Organizing an ecological study Basic sampling terminology Statistical analysis of data –Why use statistics?
Overview of CCSS Statistics and Probability Math Alliance September 2011.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Time series Model assessment. Tourist arrivals to NZ Period is quarterly.
Statistics in Biology. Histogram Shows continuous data – Data within a particular range.
Inference: Probabilities and Distributions Feb , 2012.
Discovering Mathematics Week 5 BOOK A - Unit 4: Statistical Summaries 1.
PCB 3043L - General Ecology Data Analysis.
1.1 Analyzing Categorical Data Pages 7-24 Objectives SWBAT: 1)Display categorical data with a bar graph. Decide if it would be appropriate to make a pie.
Week 6. Statistics etc. GRS LX 865 Topics in Linguistics.
Hypothesis Testing Introduction to Statistics Chapter 8 Feb 24-26, 2009 Classes #12-13.
Statistics is... a collection of techniques for planning experiments, obtaining data, and then organizing, summarizing, presenting, analyzing, interpreting,
Collecting  Describing  Summarizing.   Statistics is a set of data (information) that has been collected. The data can be categorical or numerical.
Chapter 3: Describing Relationships
Statistical Thinking Julia Horring & Pip Arnold TEAM Solutions University of Auckland.
Describing Data Week 1 The W’s (Where do the Numbers come from?) Who: Who was measured? By Whom: Who did the measuring What: What was measured? Where:
Chapter 13 Linear Regression and Correlation. Our Objectives  Draw a scatter diagram.  Understand and interpret the terms dependent and independent.
The rise of statistics Statistics is the science of collecting, organizing and interpreting data. The goal of statistics is to gain understanding from.
CHAPTER 12 More About Regression
Statistical analysis.
Statistical analysis.
CHAPTER 21 Developing Concepts of Data Analysis
PCB 3043L - General Ecology Data Analysis.
Understanding Randomness
Sampling Distributions
CHAPTER 21 Developing Concepts of Data Analysis
CHAPTER 12 More About Regression
The Old Faithful Geyser Prognostication
Algebra 1 Institute Overview
Samples and Populations
Presentation transcript:

Reasoning and Sense Making in Data Analysis & Statistics Beth Chance Henry Kranendonk Mike Shaughnessy

NCTM’s Series on Reasoning & Sense Making The Council has published its official position on the role and importance of focusing on students’ reasoning in the teaching of mathematics in the new publication Focus in High School Mathematics: Reasoning & Sense Making (NCTM, 2009)

Focus in High School Mathematics: Reasoning and Sense Making Purposes of the document: Giving direction and focus to high school mathematics Making reasoning and sense making the core Making connections

The Role of RSM in School Mathematics “Reasoning and sense making should occur in every mathematics classroom every day. In such an environment, teachers and students ask and answer such questions as ‘What’s going on here?’ and ‘Why do you think that?’”

Critical Need for the Reasoning and Sense Making Effort Students need opportunities to: Make conjectures Share their representations of problems Question their peers Justify their reasoning Pose further questions

Accompanying Series of FRSM Books Focus in Reasoning and Sense Making in: Statistics & Probability (Appeared, Fall 2009) Algebra (just out!) Geometry (next month) Equity in promoting Reasoning and Sense Making for ALL students (Fall) Reasoning with Technology (next spring)

Focus Focus on Statistical Reasoning igh School Mathematics: Reasoning and Sense Making

Core Statistical Concepts Involved in Reasoning and Sense Making “Notions such as distribution, center, spread, association, uncertainty, randomness, sampling, and statistical experiments are foundational concepts underlying the development of statistical reasoning.” (Guidelines for Assessment and Instruction in Statistics Education, GAISE, ASA 2006)

Central Role of Variability Statistical reasoning is also inherently different from mathematical reasoning, and effective development of it requires distinct exercises and experiences. In particular, statistical reasoning centers on a focus on making sense of and reasoning about variation in data in contextual situations.

Chapters in Reasoning and Sense Making in Statistics Country Data—A look at census data Old Faithful—Becoming a data detective Will women run faster than men in the Olympics? –Fitting models to data Starbucks Customers—Designing an observational study Memorizing Words—Is the treatment effect real? Soft Drinks and Heart Disease—Critiquing a statistical study

Our Goals Today To give you a ‘mini-experience’ from several of these investigations that enable teachers to tap student reasoning and sense making in statistics And, hope that these ‘teasers’ will intrigue you so that you will give them a try yourselves.

Country Data— A Look at Some Census Data Percentage breakdown of US Population in Five-Year Age Groups:

Comparing Countries

Table → Histogram → Boxplot

Will Women Run Faster Than Men? How can we use this data to investigate our research question?

Gap Times Female – Male (in sec) What does the above mean? How can we use this data to investigate our research question?

Time differences What trend do we see? If women were to run as fast as men, how would that be represented in this graph? How would a faster time for women be represented?

Regression Line The linear regression represents the gap. Is this line a good summary of what is happening? Why or why not?

Separate Regression Lines If women will run as fast or faster than men, how would it be presented in this graph?

Predicting the Distant Future How does the model use to represent the Olympic times breakdown? What might be the more accurate representation of the times in the future?

An Exponential Model In what way does this model represent the Olympic times now and in the future?

Eruptions of the Old Faithful Geyser— Becoming a Data Detective Data on wait times between successive eruptions (blasts) of geysers were first collected by the National Park Service and the U.S. Geological Survey in Yellowstone National Park. The data were collected to establish some baseline information that could then be used to track and compare long-term behavior of geysers.

Becoming a Data Detective In this investigation, you will put on a ‘data detective’ hat, investigate some data from Old Faithful, and conjecture about the time that someone might expect to wait for Old Faithful to erupt.

Old Faithful --Minutes Between Blasts –each row represents about 1 days’ data 1) ) ) ) ) ) ) ) ) ) ) ) ) ) ) )

Task Set Up Pick any row of these wait times so that your ‘group’ has a sample of a day of Old Faithful wait times Look over the data. Is there anything that you notice, or anything that you wonder about in your sample of data? Jot down some “notices” and “wonders.”

Graphical Representations Create at least one type of visual or graphical representation for that row of data to help to visualize any patterns in the wait times. On the basis of your data and graphs, make a decision about how long you would expect to wait between blasts of Old Faithful, and your reason why.

Examples of Student Reasoning Group A Reasoning: (made a bar graph of individual case values) “We picked a time more in the middle for our prediction, as some would be higher, some lower, but there were a lot of middle height bars that were around 70, so we think we’d have to wait about 70 minutes. “

Group B Reasoning (on stem & leaf plot) “We noticed there was a lot of variation in our data—a very wide spread—so we used the average as a middle point. We calculated the average wait time to be about 68 minutes, so we would predict we’d wait about that long—about an hour. “

Group C Reasoning 1 “On the basis of our first frequency graph we’d expect to wait about 75 minutes, because it shows most wait times for the eruption in the 75 to 79 minute range.

Group C Graph 1

Group C Reasoning 2 But then we saw that if we chose our intervals in another way we obtained something different. There is no obvious pattern here, and we thought that a person could just as easily wait about 55, or 75, or 85 minutes, because all three of those times were equally frequent in this (second) graph, each occurring 4 times.”

Group C Graph 2

Group D Reasoning “The middle 50 percent goes from 65 minutes to 82 minutes for day 2, and from about 53 minutes to 87 minutes for day 3. So, overall from the two days combined we concluded that 50 percent of the time you’d probably have to wait at least an hour, and perhaps as much as an hour and 20 minutes.”

Group D Graph

Group E Reasoning “We think we see a pattern in the data. There seems to be an up-down pattern in the wait times in day 3. It was easier to see when we connected the dots in our plot. Then we did the same thing for day 2, and the up- down pattern in wait times appears there, too. It’s not always perfect, but a long wait time is usually followed by a short time, and a short one by a long one.”

Group E Day 3 Graph

Group E Day 2 Graph

Analyzing the Progression of Student Reasoning in their Graphs Case value representations—and clustering Measures of center—mode, median, mean Looking at ‘likely range’—a sign of accounting for variation Closer look at variation over time—it’s not totally ‘random’ in this case—special cause variation vs. common cause variation Extensions—students pose and investigate new questions

Reasoning Extensions from Students’ “Wonders” Graph all the 1985 data for Old Faithful—what do you notice in the graph? What if we knew the ‘previous’ wait time? Would that give us extra information? What is Old Faithful doing now? Is it still behaving the same way?

Memorizing Words – Is the treatment effect real? Two groups of students each given a set of words to memorize – Group 1: Meaningful words – Group 2: Nonsense words We want to investigate whether giving students meaningful words increases the number of words they remember Question 1: How design such a study?

Examples of Student Reasoning Not sufficient detail – Write out a protocol that you could hand to another student and they could carry out to your exact specifications Randomizers – Really focus on what “random” means here, precise definition vs. everyday usage Too much detail – Convince them that random assignment really “works”

Examining the Data Group A: 12, 15, 12, 12, 10, 3, 7, 11, 9, 14, 9, 10, 9, 5, 13 Group B: 4, 6, 6, 5, 7, 5, 4, 7, 9, 10, 4, 8, 7, 3, 2 Mean = Mean = 5.80

What Conclusions Can We Draw from (Beyond) these Data? Is there a “genuine treatment effect”? What are some potential explanations for the differences we observe between these two groups? – Can we choose among them?

Dice Rolling

What Conclusions Can We Draw from (Beyond) these Data? Is there a “genuine treatment effect”? Could we have gotten such results just “by chance alone” from the random assignment process?

Statistical Significance Assume no treatment effect, everyone gets their same score no matter which group they were assigned to, can “luck of the draw” produce a difference in means as large as 4.27 in favor of Group A? Every score on an index card, shuffle and deal out two groups of 15, determine the difference in means (Group A – Group B)

Example Results Is 4.27 surprising?

Examples of Student Reasoning Not really going beyond the data Not considering variability Mistaking randomization distribution for confirmation of null model – Not using data

Reasoning Extensions Difference in Medians Changes in variability and sample size – Within group variability – Between group variability

Questions?