Activities that enhance the conceptual understanding of statistics

Slides:



Advertisements
Similar presentations
Statistical Significance What is Statistical Significance? What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant?
Advertisements

Statistical Significance What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant? How Do We Know Whether a Result.
Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc. Chapter Hypothesis Tests Regarding a Parameter 10.
Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc. Chapter Hypothesis Tests Regarding a Parameter 10.
© 2013 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Introductory Statistics: Exploring the World through.
© 2010 Pearson Prentice Hall. All rights reserved 8-1 Chapter Sampling Distributions 8 © 2010 Pearson Prentice Hall. All rights reserved.
Chapter 11: Random Sampling and Sampling Distributions
Objective: To test claims about inferences for proportions, under specific conditions.
Simulation and Resampling Methods in Introductory Statistics Michael Sullivan Joliet Junior College
Jeopardy Hypothesis Testing T-test Basics T for Indep. Samples Z-scores Probability $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
Go to Index Analysis of Means Farrokh Alemi, Ph.D. Kashif Haqqi M.D.
The Probability of a Type II Error and the Power of the Test
Using the Empirical Rule. Normal Distributions These are special density curves. They have the same overall shape  Symmetric  Single-Peaked  Bell-Shaped.
Introduction to Statistical Inference Probability & Statistics April 2014.
AP Statistics Chapter 9 Notes.
Significance Testing Statistical testing of the mean (z test)
Week 8 Chapter 8 - Hypothesis Testing I: The One-Sample Case.
LECTURE 19 THURSDAY, 14 April STA 291 Spring
AP Statistics Chapter 20 Notes
Confidence intervals and hypothesis testing Petter Mostad
Chapter 18 – Part I Sampling Distribution for Sample Proportion Statistic.
Hypothesis Testing. The 2 nd type of formal statistical inference Our goal is to assess the evidence provided by data from a sample about some claim concerning.
Distribution of the Sample Means
Testing Hypotheses about a Population Proportion Lecture 31 Sections 9.1 – 9.3 Wed, Mar 22, 2006.
Chapter 7: The Distribution of Sample Means. Frequency of Scores Scores Frequency.
9-3 Testing a proportion. What if you wanted to test - the change in proportion of students satisfied with the french fry helpings at lunch? - the recent.
Describing a Score’s Position within a Distribution Lesson 5.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 FINAL EXAMINATION STUDY MATERIAL III A ADDITIONAL READING MATERIAL – INTRO STATS 3 RD EDITION.
© 2010 Pearson Prentice Hall. All rights reserved Chapter Hypothesis Tests Regarding a Parameter 10.
© 2010 Pearson Prentice Hall. All rights reserved Chapter Hypothesis Tests Regarding a Parameter 10.
Test of a Population Median. The Population Median (  ) The population median ( , P 50 ) is defined for population T as the value for which the following.
Test of a Population Median. The Population Median (  ) The population median ( , P 50 ) is defined for population T as the value for which the following.
Sampling Distributions
Inference for a Single Population Proportion (p)
Tests About a Population Proportion
9.3 Hypothesis Tests for Population Proportions
Hypothesis Tests on a population mean
Activities in Introductory Statistics
Sampling Distributions
Hypothesis Tests Regarding a Parameter
Chapter 11: Simple Linear Regression
Chapter 12 Tests with Qualitative Data
Using Simulation to Enhance Statistical Concepts
StatCrunch Workshop Hector Facundo.
Week 10 Chapter 16. Confidence Intervals for Proportions
Hypothesis Tests for a Population Mean,
Two-sided p-values (1.4) and Theory-based approaches (1.5)
Sampling Distribution Models
Chapter 7 Sampling Distributions.
Chapter 9 Hypothesis Testing.
Statistics for the Social Sciences
AP Stats Check In Where we’ve been… Chapter 7…Chapter 8…
Chapter 10 Analyzing the Association Between Categorical Variables
Comparing two Rates Farrokh Alemi Ph.D.
Hypothesis tests for the difference between two proportions
One-Way Analysis of Variance
Hypothesis Tests for Proportions
Chapter 7 Sampling Distributions.
Sampling Distributions
CHAPTER 9 Testing a Claim
Hypothesis Tests for a Standard Deviation
Sampling Distributions of Proportions section 7.2
Testing Hypotheses about a Population Proportion
Chapter 7: The Distribution of Sample Means
Chapter 7 Sampling Distributions.
CHAPTER 7 Sampling Distributions
Statistics for Business and Economics
Chapter 7: Sampling Distributions
The Normal Curve Section 7.1 & 7.2.
Chapter 7 Sampling Distributions.
Presentation transcript:

Activities that enhance the conceptual understanding of statistics Michael Sullivan Joliet Junior College sullystats@gmail.com or msulliva@jjc.edu sullystats.com

Getting data The Omnipresence of Data Some data is easy to obtain (*.csv or *.txt files) Other data must be extracted from a webpage.

deviations For each of the following data sets, decide which has the higher standard deviation, if any, without doing any computation. Explain the rationale behind your choice. Then, verify your choice by computing the standard deviation by hand.

The black swan Open the data file “Daily Percentage Change in S&P500” in StatCrunch. Draw a histogram of the variable “Percentage Change.” Describe the shape of the distribution. Compute the mean and standard deviation percentage change.

The Black Swan (c) Determine the percentage change value for Use the normal model to determine the proportion of observations that should be less than three standard deviations below the mean. Redraw the histogram with a lower class limit of the first class of -10 and a class width of 0.5. In StatCrunch, there is an option called “Dividers” in the histogram dialogue box. Select the Percent radio button. Click Compute! Double-click on the left-most divider and enter the value found in part (c). What proportion of observations are less than this value? Is this close to the value given by the normal model? Ans: -3.71

The black swan (f) In his book The Black Swan, author Nassim Nicholas Taleb warns of using the normal model for financial markets. According to the normal model, what proportion of the days should the percentage change of the S&P 500 be less than -7.0%? Ans: 0.0000000073 (g) The number of days one should expect to wait before observing an event equals 1/p where p is the probability of observing the event. Use the result of part (f) to determine the number of days one should expect to wait before observing a percentage change in the S&P500 of -7.0 or less. Ans: 136,986,301 (h) There were 4023 trading days from 11/7/00 to 11/3/2016. How many of these days had a percentage change in the S&P500 of -7.0 or less? What does this result suggest?

Student’s t-Distribution

Intro to hypothesis testing The proportion of the human population that is left-handed is 0.12. Mensa is an organization of high-IQ individuals (to qualify to be a member of Mensa one’s IQ must be at the 98th percentile). In a random sample of 20 members of Mensa, it was found that 4 were left-handed. Does this result suggest that a higher proportion of Mensa members are left-handed that the general population? What is the variable of interest in this study? Is it qualitative or quantitative? What are the null and alternative hypotheses? Why can’t a fair coin be used to simulate the outcomes of this study? Use the Urn applet in StatCrunch to build a null model that could be used to simulate the outcomes of the study. Hint: There are approximately 120,000 members of Mensa worldwide. Under the assumption the null hypothesis from part (b) is true, how many of the 120,000 Mensa members would be left-handed? How many would not be left-handed? Use the Urn applet built in part (d) to simulate 1 repetition of this study. Explain what the result represents. Use the Urn applet built in part (d) to simulate 2000 more repetitions of this study. What does the result suggest? Use the binomial probability distribution function to find P(X > 4) where n = 20 and p = 0.12. Sample proportion = 4/20 = 0.2. We want to test H0: p = 0.12 versus H1: p > 0.12 120,000(0.12) = 14,400 left-handed; 120,000(0.88) = 105,600

Intro to hypothesis testing The proportion of the human population that is left-handed is 0.12. Mensa is an organization of high-IQ individuals (to qualify to be a member of Mensa one’s IQ must be at the 98th percentile). In a random sample of 120 members of Mensa, it was found that 24 were left-handed. Does this result suggest that a higher proportion of Mensa members are left-handed that the general population? Build the Urn applet again and run 2000 simulations. Be sure to select proportions. Click Analyze. Find the mean and standard deviation of the 2000 sample proportions. Determine the theoretical mean and standard deviation of the sample proportion assuming p = 0.12. Compare to part (b). Use the normal model to find the probability of observing 24 or more left-handers from a population whose proportion is 0.12.

Testing a Hypothesis about a Population Mean: Simulation   Coors Field is home to the Colorado Rockies baseball team and is located in Denver, Colorado. Denver is approximately one mile above sea level where the air is thinner. Therefore, baseballs are thought to travel farther in this stadium. Does the evidence support this belief? In a random sample of 15 homeruns hit in Coors Field, the mean distance the ball traveled was 417.6 feet. Does this represent evidence to suggest that the ball travels farther in Coors Field than it does in the other Major League ballparks?

Testing a Hypothesis about a Population Mean: Simulation

Testing a Hypothesis about a Population Mean: Simulation To do this, we will obtain 1000 simple random samples of size n = 15 from the population. Data > Sample Select the variable “Distance”. Enter 15 for the sample size. Check the radio button “Stacked with sample ID”. Open in new data table. Compute the mean of each sample by selecting Stat > Summary Stats > Columns Select Distance(Sample) for the variable; group by Sample. Only compute the mean.

Why is the P-value so much larger using Student’s t-distribution?

The Bootstrap Adjust the data so that the mean of the sample data equals the mean stated in the null hypothesis.

Obtain a 10,000 bootstrap samples Obtain a 10,000 bootstrap samples. Click Analyze to export the results to the spreadsheet.

Compute the mean and standard deviation of the bootstrapped means Compute the mean and standard deviation of the bootstrapped means. Compare the standard deviation of the bootstrapped means to s/sqrt(n) where n = 15. They should be close. Remember, bootstrap standard errors tend to be smaller (narrowness bias).

Repeat the Simulation with New Sample Data

Bootstrap of New Data