Stat 217 – Day 17 Review.


Last Time – Confidence interval for μ. Lab 2 – Goal is to estimate, on average, how long after the start of the party people tend to arrive. The estimate should be close to the sample mean. Need to know how much the sample mean might wander off, by random sampling chance alone, from the population mean (s/√n). Could also consider the sample median.

In general – Confidence interval. Quantitative or categorical data? If categorical, can find a confidence interval for p: 95% 2SD method with SE(p-hat), or a one-sample z-interval. If quantitative, can find a confidence interval for μ: 95% 2SD method with SE(x-bar), or a one-sample t-interval. Need to have a decent sample size for these methods (e.g., at least 10 successes and 10 failures, or 20/normal).

Example 3.2: Calculating and interpreting a “one-sample z-interval”. Observed sample proportion: 713/1034 = .69. Interval: .69 ± 1.96√(.69*(1-.69)/1034) = .69 ± .028 = (.662, .718). The standard error alone is about .014; the “margin of error” is 1.96 times that, .028, or about 2.8 percentage points. I’m 95% confident that between 66.2% and 71.8% of the population will claim to have felt an impact from the Affordable Care Act, assuming the sample was representative and there were no nonsampling errors.
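The arithmetic in Example 3.2 can be checked with a short sketch. The function name `one_sample_z_interval` is made up for illustration; the counts 713 and 1034 and the multiplier 1.96 come from the slide. Carried through, the standard error is about .014 and the margin of error (1.96 standard errors) is about .028.

```python
import math

def one_sample_z_interval(successes, n, z=1.96):
    """Return (p_hat, standard_error, margin_of_error, (lower, upper))."""
    p_hat = successes / n
    se = math.sqrt(p_hat * (1 - p_hat) / n)   # standard error of p-hat
    moe = z * se                              # margin of error
    return p_hat, se, moe, (p_hat - moe, p_hat + moe)

# Counts from Example 3.2: 713 of 1034 respondents
p_hat, se, moe, (lower, upper) = one_sample_z_interval(713, 1034)
print(f"p-hat = {p_hat:.3f}, SE = {se:.3f}, margin of error = {moe:.3f}")
print(f"95% interval: ({lower:.3f}, {upper:.3f})")
```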

Body Temperatures: Body temps for 130 healthy adults. n = 130, mean = 98.249 °F, s = .733 °F. Symmetric distribution.

Body Temperatures: Calculating and interpreting a “one-sample t-interval”. Observed sample mean: 98.249 degrees with sample standard deviation s = .733 degrees. Interval: 98.249 ± 2ish × (.733/sqrt(130)) = 98.249 ± .129 = (98.12, 98.38). “Margin of error” = .129 degrees. I’m 95% confident that the population mean healthy body temperature is between 98.12 degrees and 98.38 degrees Fahrenheit.

Body Temperatures Notice this interval is not very wide: It’s an interval for the population mean, not one person

Body Temperatures: Notice that this interval (98.12, 98.38) does not contain 98.6, so we would not consider 98.6 to be a plausible value for the population mean body temperature. Of course, 98.6 is not terribly far away. Would still like to know more about how this sample was selected before deciding what population I think it is representative of.
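The body temperature interval can be reproduced the same way. This is a minimal sketch using the slide's "2ish" multiplier (set to 2.0 here as an assumption; the exact t multiplier for 129 degrees of freedom is a little under 2). The helper name `mean_interval` is hypothetical.

```python
import math

def mean_interval(x_bar, s, n, multiplier=2.0):
    """Return (margin_of_error, (lower, upper)) for x_bar +/- multiplier * s / sqrt(n)."""
    se = s / math.sqrt(n)      # standard error of the sample mean
    moe = multiplier * se
    return moe, (x_bar - moe, x_bar + moe)

# Summary statistics from the slides: n = 130, mean 98.249, s = .733
moe, (lower, upper) = mean_interval(98.249, 0.733, 130)
print(f"margin of error = {moe:.3f}")
print(f"interval: ({lower:.2f}, {upper:.2f})")
print("98.6 inside interval?", lower <= 98.6 <= upper)
```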

Section 3.4: Factors that impact width of confidence interval. Larger sample size → narrower interval. Larger confidence level → wider interval. (Proportion closer to 0.5 → wider.) (Larger sample SD, s → wider.)
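Each of these four factors can be seen by plugging numbers into the width formulas. The sketch below is illustrative only: the function names and the example inputs are made up, and the interval for a mean uses a normal multiplier as a rough stand-in for the t multiplier.

```python
import math
from statistics import NormalDist

def z_star(confidence):
    """Normal multiplier for a given confidence level (about 1.96 for 0.95)."""
    return NormalDist().inv_cdf(0.5 + confidence / 2)

def proportion_width(p_hat, n, confidence=0.95):
    """Full width of the one-sample z-interval for a proportion."""
    return 2 * z_star(confidence) * math.sqrt(p_hat * (1 - p_hat) / n)

def mean_width(s, n, confidence=0.95):
    """Approximate full width of an interval for a mean (normal multiplier)."""
    return 2 * z_star(confidence) * s / math.sqrt(n)

print("n = 100 vs n = 400:   ", proportion_width(0.5, 100), proportion_width(0.5, 400))
print("95% vs 99% confidence:", proportion_width(0.5, 100, 0.95), proportion_width(0.5, 100, 0.99))
print("p-hat = .5 vs .9:     ", proportion_width(0.5, 100), proportion_width(0.9, 100))
print("s = 1 vs s = 2:       ", mean_width(1.0, 100), mean_width(2.0, 100))
```

Quadrupling the sample size halves the width, because n sits under a square root.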

Section 3.5: These confidence interval procedures only “work” if you have a representative sample (vs. voluntary response bias, vs. bad sampling frame) and no “nonrandom” (nonsampling) errors: people change their minds; people don’t remember correctly; people lie/social expectation; leading questions; demeanor of interviewer.

Gallup.com

Exam 1 May use one 8.5 x 11 (both sides) page of self-produced notes Mixture of multiple choice, short answer, longer questions (see quizzes, labs, investigations) Bring a calculator (not a cell phone) Access to the computer (e.g., applets) Probably a section of multiple choice questions on the computer

Exam 1 Resources Review handout Review questions/solutions Chapters 0-3 Self-check videos, self-tests, what went wrong Quiz solutions Access to pre-labs Grading comments on quizzes, investigations, labs

Exam 1 Advice: Review handout, problems online. Review labs. Work problems. Start with ideas that we have emphasized more often. Be ready to interpret and explain.

Some advice during exam: If you get stuck on a problem, move on to later parts or later problems. Try to hit the highlights in your answer (e.g., not all sources of bias, just the most serious). Be succinct (think before you write). Read the question carefully. Show all of your work and explain well (communication points; no “it”!). Read the entire question before writing anything.

Some big, big ideas Observational units, variable Probability What see in sample (descriptive) vs. saying something beyond the sample (inferential) Statistic vs. Parameter Interpretation of p-value, Statistical significance Estimation (confidence interval) Generalizability Interpretations, reasoning Properties, “what if” questions… How are you deciding this?

Main Topics: Sample from a random process (e.g., coin toss, dolphins, kissing couples). Parameter: p = probability of “success”. Statistic: sample proportion. Random sample from a finite large population (e.g., Gallup poll). Parameter: p = population proportion of “successes”. Statistic: sample proportion. Consider sampling, nonsampling biases.

Previously: When you have a random sampling method, the mean of the “could have been” statistics will be equal to the population parameter, so we will believe the sample is representative of the population. The variability of the distribution of “could have been” statistics will decrease if you increase the sample size (number of observational units per sample). Population size (assuming it’s pretty big to begin with) doesn’t really matter.
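A small simulation can illustrate all three claims. Everything here is hypothetical: the population proportion of 0.30, the population sizes, the sample sizes, and the helper name `sample_proportions` were chosen only for the sketch.

```python
import random
import statistics

def sample_proportions(pop_size, p, n, reps, rng):
    """Draw `reps` simple random samples of size n; return the sample proportions."""
    successes = round(pop_size * p)
    population = [1] * successes + [0] * (pop_size - successes)
    return [sum(rng.sample(population, n)) / n for _ in range(reps)]

rng = random.Random(217)   # fixed seed so the run is repeatable
p = 0.30                   # hypothetical population proportion

small_n = sample_proportions(10_000, p, 25, 2000, rng)
large_n = sample_proportions(10_000, p, 100, 2000, rng)
big_pop = sample_proportions(200_000, p, 100, 2000, rng)

for label, props in [("N = 10,000,  n = 25 ", small_n),
                     ("N = 10,000,  n = 100", large_n),
                     ("N = 200,000, n = 100", big_pop)]:
    print(label, "mean:", round(statistics.mean(props), 3),
          "SD:", round(statistics.stdev(props), 3))
```

The means of the "could have been" proportions should all sit near 0.30, the SD should shrink when n goes from 25 to 100, and the last two rows should have nearly the same SD even though one population is 20 times larger.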

Agree?

Main Topics: Sample from a random process or population with quantitative data. Parameter: μ = population mean. Statistic: sample mean.

Test of Significance: Test a conjecture about the parameter. Assume the null hypothesis is true. Look at the random distribution of the statistic when the null hypothesis is true, using simulation (One Proportion applet) or a normal model (Theory Based Inference applet). If the observed value is in the tail of the distribution (small p-value, large z), reject the null hypothesis. Otherwise “fail to reject” (FTR): not convincing evidence against H0.
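Both routes to a p-value can be sketched in a few lines. The data here (60 successes in 100 trials, null value 0.5, one-sided alternative "greater") are hypothetical, and the function names are made up; this mimics the idea behind the applets rather than reproducing them.

```python
import math
import random
from statistics import NormalDist

def simulated_p_value(observed_successes, n, p0, reps, rng):
    """One-sided p-value: share of null-generated samples at least as extreme."""
    extreme = 0
    for _ in range(reps):
        successes = sum(rng.random() < p0 for _ in range(n))
        if successes >= observed_successes:
            extreme += 1
    return extreme / reps

def normal_z_and_p(observed_successes, n, p0):
    """Standardized statistic and one-sided p-value from the normal model."""
    p_hat = observed_successes / n
    se_null = math.sqrt(p0 * (1 - p0) / n)   # SD of p-hat when the null is true
    z = (p_hat - p0) / se_null
    return z, 1 - NormalDist().cdf(z)

rng = random.Random(2024)   # fixed seed so the run is repeatable
p_sim = simulated_p_value(60, 100, 0.5, 10_000, rng)
z, p_norm = normal_z_and_p(60, 100, 0.5)
print(f"simulation p-value   = {p_sim:.3f}")
print(f"normal model: z = {z:.2f}, p-value = {p_norm:.3f}")
```

The two p-values should be close but not identical: the normal model is an approximation to the distribution that the simulation builds directly.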

Test of Significance – Lab 2: Sample mean = 6.9 hours < 8 hours. Possible explanations: (1) The population mean is 8 hours and we just got an unlucky random sample; a small p-value discounts this explanation. (2) Our sample is not representative of the population; random sampling would discount this explanation. (3) The population mean is actually less than 8 hours.

Make sure you recognize: Interpreting the p-value (3% of random samples … observed result … null hypothesis true) vs. evaluating the p-value (I find this p-value to be small so I reject the null hypothesis). Interpreting the confidence interval (I’m 95% confident that…this method … .6225 and .7323) vs. the confidence level (if we took thousands of random samples and built an interval from each, roughly 95% of the resulting intervals should contain the parameter).
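The meaning of the confidence level can be checked by simulation: fix a "true" parameter, draw many random samples, build an interval from each, and count how often the interval captures the parameter. The true proportion 0.68, the sample size 500, and the function names below are assumptions made for this sketch.

```python
import math
import random

def z_interval(successes, n, z=1.96):
    """One-sample z-interval for a proportion."""
    p_hat = successes / n
    moe = z * math.sqrt(p_hat * (1 - p_hat) / n)
    return p_hat - moe, p_hat + moe

def coverage(true_p, n, reps, rng):
    """Fraction of simulated intervals that contain the true proportion."""
    hits = 0
    for _ in range(reps):
        successes = sum(rng.random() < true_p for _ in range(n))
        lower, upper = z_interval(successes, n)
        if lower <= true_p <= upper:
            hits += 1
    return hits / reps

rng = random.Random(17)   # fixed seed so the run is repeatable
rate = coverage(true_p=0.68, n=500, reps=2000, rng=rng)
print(f"share of intervals containing the parameter: {rate:.3f}")
```

Any single interval either contains the parameter or it does not; the 95% describes the long-run success rate of the method.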

Confidence Interval: Want to estimate the parameter from the sample data. Could test all the possible values for the parameter and make an interval of the ones that are not rejected, but that is not practical. Other ways to estimate a CI: estimate ± 2 standard deviations (get the SD from simulation and/or from a formula); normal-based inference (TBI applet), for larger sample sizes. Interpretation: I’m 95% confident that the parameter is between these two values. The procedure works 95% of the time.
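Testing every candidate value is tedious by hand, but a loop makes the idea concrete. The sketch below reuses the counts from Example 3.2 (713 of 1034) and keeps each candidate value whose normal-model z statistic is inside ±1.96. The grid step of 0.001 and the function name `plausible_values` are assumptions for illustration.

```python
import math

def plausible_values(successes, n, z_cutoff=1.96, step=0.001):
    """Return the candidate proportions that a two-sided test would not reject."""
    p_hat = successes / n
    kept = []
    for i in range(1, round(1 / step)):
        p0 = i * step                                   # candidate parameter value
        z = (p_hat - p0) / math.sqrt(p0 * (1 - p0) / n)  # SD computed under p0
        if abs(z) < z_cutoff:
            kept.append(p0)
    return kept

kept = plausible_values(713, 1034)
print(f"plausible values run from {kept[0]:.3f} to {kept[-1]:.3f}")
```

The endpoints land close to, but not exactly on, those from estimate ± 1.96 standard errors, because each test uses the candidate value rather than the sample proportion to compute the standard deviation.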

Questions? Optional Review Session Tonight: Building 38, Room 219, starting at 7:40.