Inference Bootstrapping for comparisons. Outcomes Understand the bootstrapping process for construction of a formal confidence interval for a comparison.

Slides:



Advertisements
Similar presentations
CHAPTER 14: Confidence Intervals: The Basics
Advertisements

What does integrated statistical and contextual knowledge look like for 3.10? Anne Patel & Jake Wills Otahuhu College & Westlake Boys High School CensusAtSchool.
Inference AS Every time you over-indulge, your life shortens – expert 11:40 AM Wednesday Dec 19, 2012
Chapter 10: Estimating with Confidence
Chapter 8: Estimating with Confidence
INFERENCE What can you find out about a population by looking at a sample?
C1, L2, S1 Sampling Error Dru Rose. C1, L2, S2 I wonder what percentage of all 600 Kare Kare College students travel to school by car ? Population 600.
Analysis. Start with describing the features you see in the data.
Level 1 Multivariate Unit
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
PPDAC Cycle.
Chapter 11 Asking and Answering Questions About The Difference Between Two Population Proportions Created by Kathy Fritz.
Lecture 6. Hypothesis tests for the population mean  Similar arguments to those used to develop the idea of a confidence interval allow us to test the.
Information for teachers This PowerPoint presentation gives examples of responses for the Conclusion section of the report. Students own answers will differ.
4.1 Introducing Hypothesis Tests 4.2 Measuring significance with P-values Visit the Maths Study Centre 11am-5pm This presentation.
10.3 Estimating a Population Proportion
Confidence Intervals and Two Proportions Presentation 9.4.
Ch 8 Estimating with Confidence. Today’s Objectives ✓ I can interpret a confidence level. ✓ I can interpret a confidence interval in context. ✓ I can.
Report Exemplar. Step 1: Purpose State the purpose of your investigation. Pose an appropriate comparison investigative question and do not forget to include.
RDPStatistical Methods in Scientific Research - Lecture 11 Lecture 1 Interpretation of data 1.1 A study in anorexia nervosa 1.2 Testing the difference.
Section Inference for Experiments Objectives: 1.To understand how randomization differs in surveys and experiments when comparing two populations.
Ch 8 Estimating with Confidence. Today’s Objectives ✓ I can interpret a confidence level. ✓ I can interpret a confidence interval in context. ✓ I can.
How much do you smoke?. I Notice... That the median for the males is 13.5 cigarettes per day and the median for females is 10 cigarettes per day. This.
Advanced Higher Statistics Data Analysis and Modelling Hypothesis Testing Statistical Inference AH.
Formal Inference Multivariate Internal. Introduction This report compares if Auckland or Wellington citizens are more likely to borrow more money. The.
Is there a difference in how males and female students perceive their weight?
Inference Practice for AS2.9. PPDAC - The Problem: I wonder if the median… I expect…
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Unit 5: Estimating with Confidence Section 10.1 Confidence Intervals: The Basics.
+ “Statisticians use a confidence interval to describe the amount of uncertainty associated with a sample estimate of a population parameter.”confidence.
Carrying out a statistics investigation. A process.
Stage 1 Statistics students from Auckland university Using a sample to make a point estimate.
Confidence intervals. Estimation and uncertainty Theoretical distributions require input parameters. For example, the weight of male students in NUS follows.
CONFIDENCE INTERVALS: THE BASICS Unit 8 Lesson 1.
PPDAC Cycle.
Use statistical methods to make an inference. Michelle Dalrymple.
Synthesis and Review 2/20/12 Hypothesis Tests: the big picture Randomization distributions Connecting intervals and tests Review of major topics Open Q+A.
Sampling variability & the effect of spread of population.
Learning Objectives After this section, you should be able to: The Practice of Statistics, 5 th Edition1 DESCRIBE the shape, center, and spread of the.
8.1 Confidence Intervals: The Basics Objectives SWBAT: DETERMINE the point estimate and margin of error from a confidence interval. INTERPRET a confidence.
HYPOTHESIS TESTING FOR DIFFERENCES BETWEEN MEANS AND BETWEEN PROPORTIONS.
Statistical Thinking Julia Horring & Pip Arnold TEAM Solutions University of Auckland.
 NHANES 1000, MARIJANA USE  COMPARE WHAT HAPPENS WHEN WE CHANGE SAMPLE SIZE CENSUS AT SCHOOL, ARM SPAN LOOKING AT WHAT HAPPENS TO THE SAMPLING.
By Joanna Charteris.  posing a comparison investigative question using a given multivariate data set  selecting and using appropriate displays and summary.
Statistic Methods (3.10 – Internal 4 credits)
CHAPTER 10 Comparing Two Populations or Groups
Advanced Higher Statistics
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
Inference.
SI This workshop may not be for you... With 100% Confidence.
Writing the executive summary section of your report
Inferences to the population
Confidence Intervals: The Basics
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
Chapter 7: Sampling Distributions
CHAPTER 10 Comparing Two Populations or Groups
Chapter 8: Estimating With Confidence
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
Variance and Hypothesis Tests
CHAPTER 10 Comparing Two Populations or Groups
Sampling Distributions
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
Presentation transcript:

Inference Bootstrapping for comparisons

Outcomes Understand the bootstrapping process for construction of a formal confidence interval for a comparison situation. Understand the requirements of an investigative question, confidence interval interpretation, and conclusion at Level 8.

People with Higher IQ are Shunning Internet Explorer Read the article and highlight anything you might find suspicious.

Anatomy of a hoax Now read this article

How could you conduct an experiment that would determine whether or not there was any truth in the first article?

Comparison

The Doozer Construction Council (DCC) is looking at adding a height restriction to their workers to enhance doozer safety on the construction sites. Being a fair-minded council, the DCC want to ensure that the height restriction is not going to disadvantage female doozers in any way. The DCC have employed you to analyse a random sample of doozer heights to look at their concerns.

Comparison situation 2 Recently, Sprocket has taken to chasing doozers. Doc is quite concerned by this sudden change in Sprocket’s behaviour and wonders if the height of the doozers he is chasing has anything to do with why he is chasing them. Doc has collected information on a random sample of doozers, including their height and whether they had been chased by Sprocket in the last month. Doc has asked you to look at this data and let him know if it highlights any concerns.

Key Components to an Investigative Question – a good investigative question should include the following: VARIABLE being examined GROUPS being compared POPULATION inferences are being made about STATISTIC being estimated

VARIABLE being examined Comparison situation 1 Heights of Doozers in Fraggle Rock Comparison situation 2 Heights of Doozers in Fraggle Rock

GROUPS being compared Comparison situation 1 Heights of male and female Doozers in Fraggle Rock Comparison situation 2 Heights of Doozers in Fraggle Rock that have been chased by Sprocket and the heights of Doozers that have not been chased in the last month

POPULATION inferences are being made about Comparison situation 1 Difference in mean heights of male and female Doozers in Fraggle Rock Comparison situation 2 Difference in mean heights of Doozers in Fraggle Rock that have been chased by Sprocket and the heights of Doozers that have not been chased in the last month

STATISTIC being estimated Comparison situation 1 Mean/median heights of male and female Doozers in Fraggle Rock Comparison situation 2 Mean/median heights of Doozers in Fraggle Rock that have been chased by Sprocket and the heights of Doozers that have not been chased in the last month

I wonder what is the difference between the mean/median height of population B and the mean/median height of population A. I think the mean height of population B will be bigger because…..

Key Points The distribution of re-sample means (the bootstrap distribution) is similar to the distribution of means from repeated random sampling. Therefore we can use the bootstrap distribution to model the sampling variability in our data, and base our confidence interval on this bootstrap distribution.

We want to create a bootstrap confidence interval for the difference between mean heights of female doozers and mean heights of male doozers. We want to create a bootstrap confidence interval for the difference in mean heights between the doozers chased by Sprocket and the doozers not chased by Sprocket

iNZight

Key Components of any Confidence Interval Interpretation Strong link between the sample and the population Level of uncertainty evident eg pretty sure Population parameter identified Context clearly identified

Key Components of any Confidence Interval Interpretation Strong link between the sample and the population Level of uncertainty evident eg pretty sure Population parameter identified Context clearly identified “I’m pretty sure that the mean height difference between male and female doozers at Fraggle Rock is somewhere between mm and 2.18 mm”

Key Components of any Confidence Interval Interpretation Strong link between the sample and the population Level of uncertainty evident eg pretty sure Population parameter identified Context clearly identified “I’m pretty sure that the mean height difference between male and female doozers at Fraggle Rock is somewhere between mm and 2.18 mm”

Key Components of any Confidence Interval Interpretation Strong link between the sample and the population Level of uncertainty evident eg pretty sure Population parameter identified Context clearly identified “I’m pretty sure that the mean height difference between male and female doozers at Fraggle Rock is somewhere between mm and 2.18 mm”

Key Components of any Confidence Interval Interpretation Strong link between the sample and the population Level of uncertainty evident eg pretty sure Population parameter identified Context clearly identified “I’m pretty sure that the mean height difference between male and female doozers at Fraggle Rock is somewhere between mm and 2.18 mm”

Key Components of any Confidence Interval Interpretation - Justification “I’m pretty sure that the mean height of male doozers is between 2.18 mm taller and 0.59 mm shorter than female doozers at Fraggle Rock because the bootstrap interval has both positive and negative values.”

Comparison 2

“It is very likely that the mean height of the doozers not chased by Sprocket is between 1.05mm and 4.28mm more than the mean height of Doozers chased by Sprocket at Fraggle Rock.”

Worksheet

I can make the call that there is a difference between the median height of population A and the median height of population B if all values in the confidence interval are positive OR if all values in the confidence interval are negative. In addition, we can tell the direction of the difference by whether the confidence interval is completely positive, or completely negative.

I cannot make the call that there is a difference between the median heights of population A and population B if the confidence interval contains both negative and positive values Refer to positive and negative values and NOT that zero is in the interval

I am pretty sure that the median height difference of population B lies between 0.86 and 1.96 units more than the median height of population A

My call is…the median height of population B is bigger than the median height of population A. I can make this call because the sample median height for population B is (outside the overlap) greater than the sample upper quartile height for population A. I can also make this call because all the values in the bootstrap confidence interval (0.86, 1.96) are positive. I’m pretty sure the median height of population B is somewhere between 0.86 and 1.96 cm bigger than the median height of population A.

It is likely that the median height of population B lies between 0.4 and 1.57 units more than the median height of population A.

My call is… the median height of population B is bigger than the median height of population A. I can make this call because the distance between the sample median heights is greater than about 1/3 of the overall visible spread. I can also make this call because all the values in the bootstrap confidence interval (0.40, 1.57) are positive. I’m pretty sure the median height of population B is somewhere between 0.40 and 1.57 cm bigger than median height of population A.

It is a safe bet that …….

My call is… the median height of population B is bigger than the median height of population A. I can make this call because all the values in the bootstrap confidence interval (0.13, 1.19) are positive. I’m pretty sure the median height of population B is somewhere between 0.13 and 1.19 cm bigger than the population median for A.

I am pretty sure that the median height of population B is 0.86 units more or 0.19 units less than the median height of population A

I am not prepared to make a call as to whether population B’s median height is bigger because there are both negative and positive values in the confidence interval (-0.19, 0.86). The median height of population B could be bigger than, or it could be smaller than, or it could even be the same as the median height of population A.

I am not prepared to make a call about the difference between the heights of populations A and B because there are both negative and positive values in the confidence interval (- 0.52, 0.54). The median height of population B could be bigger than, or it could be smaller than, or it could even be the same as the median height of population A.