Presentation is loading. Please wait.

Presentation is loading. Please wait.

5: Introduction to estimation

Similar presentations


Presentation on theme: "5: Introduction to estimation"— Presentation transcript:

1 5: Introduction to estimation
Intro to estimation (confidence intervals) 5/24/2019 5: Introduction to estimation Intro to statistical inference Sampling distribution of the mean Confidence intervals (σ known) Student’s t distributions Confidence intervals (σ not known) Sample size requirements 5/24/2019 5: Intro to estimation

2 Statistical inference
Statistical inference  generalizing from a sample to a population with calculated degree of certainty Two forms of statistical inference Estimation  introduced this chapter Hypothesis testing  next chapter 5/24/2019 5: Intro to estimation

3 Parameters and estimates
Parameter  numerical characteristic of a population Statistics = a value calculated in a sample Estimate  a statistic that “guesstimates” a parameter Example: sample mean “x-bar” is the estimator of population mean µ Parameters and estimates are related but are not the same 5/24/2019 5: Intro to estimation

4 Parameters and statistics
Source Population Sample Notation Greek (μ, σ) Roman (x, s) Random variable? No Yes Calculated 5/24/2019 5: Intro to estimation

5 Sampling distribution of the mean
x-bar takes on different values with repeated (different) samples µ remain constant Even though x-bar is variable, it’s “behavior” is predictable The behavior of x-bar is predicted by its sampling distribution, the Sampling Distribution of the Mean (SDM) 5/24/2019 5: Intro to estimation

6 Simulation experiment
Distribution of AGE in population.sav (Fig. right) N = 600 µ = 29.5 (center) s = 13.6 (spread) Not Normal (shape) Conduct three sampling simulations For each experiment Take multiple samples of size n Calculate means Plot means  simulated SDMs Experiment A: each sample n = 1 Experiment B: each sample n = 10 Experiment C: each sample n = 30 5/24/2019 5: Intro to estimation

7 Results of simulation experiment
Findings: SDMs are centered on 29 (µ) SDMs become tighter as n increases SDMs become Normal as the n increases 5/24/2019 5: Intro to estimation

8 95% Confidence Interval for µ
Formula for a 95% confidence interval for μ when σ is known: 5/24/2019 5: Intro to estimation

9 Illustrative example Example SEM = s / n = 13.586 / 10 = 4.30
Population with σ = (known ahead of time) SRS  {21, 42, 11, 30, 50, 28, 27, 24, 52} n = 10, x-bar = 29.0 SEM = s / n = / 10 = 4.30 95% CI for µ = = xbar ± (1.96)(SEM) = 29.0 ± (1.96)(4.30) = 29.0 ± 8.4 = (20.6, 37.4) Margin of error 5/24/2019 5: Intro to estimation

10 Margin of error Margin or error  d = half the confidence interval
Surrounded x-bar with margin of error 95% CI for µ = xbar ± (1.96)(SEM) = 29.0 ± (1.96)(4.30) = 29.0 ± 8.4 point estimate margin of error 5/24/2019 5: Intro to estimation

11 Interpretation of a 95% CI
We are 95% confident the parameter will be captured by the interval. 5/24/2019 5: Intro to estimation

12 Other levels of confidence
Let a  the probability confidence interval will not capture parameter 1 – a  the confidence level Confidence level 1 – a Alpha level a z1–a/2 .90 .10 1.645 .95 .05 1.96 .99 .01 2.58 5/24/2019 5: Intro to estimation

13 (1 – a)100% confidence for μ Formula for a (1-α)100% confidence interval for μ when σ is known: 5/24/2019 5: Intro to estimation

14 Example: 99% CI, same data Same data as before
99% confidence interval for µ = x-bar ± (z1–.01/2)(SEM) = x-bar ± (z.995)(SEM) = 29.0 ± (2.58)(4.30) = 29.0 ± 11.1 = (17.9, 40.1) 5/24/2019 5: Intro to estimation

15 Confidence level and CI length
p. 5.9 demonstrates the effect of raising your confidence level  CI length increases  more likely to capture µ Confidence level CI for illustrative data CI length* 90% (21.9, 36.1) 14.2 95% (20.6, 37.4) 16.8 99% (17.9, 40.1) 22.2 * CI length = UCL – LCL 5/24/2019 5: Intro to estimation

16 Beware Prior CI formula applies only to It does not account for: SRS
Normal SDMs σ known ahead of time It does not account for: GIGO Poor quality samples (e.g., due to non-response) 5/24/2019 5: Intro to estimation

17 When σ is Not Known In practice we rarely know σ
Instead, we calculate s and use this as an estimate of σ This adds another element of uncertainty to the inference A modification of z procedures called Student’s t distribution is needed to account for this additional uncertainty 5/24/2019 5: Intro to estimation

18 Student’s t distributions
Brilliant! William Sealy Gosset ( ) worked for the Guinness brewing company and was not allowed to publish In 1908, writing under the the pseudonym “Student” he described a distribution that accounted for the extra variability introduced by using s as an estimate of σ 5/24/2019 5: Intro to estimation

19 t Distributions Student’s t distributions are like a Standard Normal distribution but have broader tails There is more than one t distribution (a family) Each t has a different degrees of freedom (df) As df increases, t becomes increasingly like z 5/24/2019 5: Intro to estimation

20 t table Each row is for a particular df
Columns contain cumulative probabilities or tail regions Table contains t percentiles (like z scores) Notation: tdf,p Example: t9,.975 = 2.26 5/24/2019 5: Intro to estimation

21 Same as z formula except replace z1-a/2 with t1-a/2 and SEM with sem
95% CI for µ, σ not known Formula for a (1-α)100% confidence interval for μ when σ is NOT known: Same as z formula except replace z1-a/2 with t1-a/2 and SEM with sem 5/24/2019 5: Intro to estimation

22 Illustrative example: diabetic weight
To what extent are diabetics over weight? Measure “% of ideal body weight” = (actual body weight) ÷ (ideal body weight) × 100% Data (n = 18): {107, 119, 99, 114, 120, 104, 88, 114, 124, 116, 101, 121, 152, 100, 125, 114, 95, 117} 5/24/2019 5: Intro to estimation

23 Interpretation of 95% CI for µ
Remember that the CI seeks to capture µ, NOT x-bar 95% confidence means that 95% of similar intervals would capture µ (and 5% would not) For the diabetic body weight illustration, we can be 95% confident that the population mean is between and 120.0 5/24/2019 5: Intro to estimation

24 Sample size requirements
Assume: SRS, Normality, valid data Let d  the margin of error (half confidence interval length) To get a CI with margin of error ±d, use: 5/24/2019 5: Intro to estimation

25 Sample size requirements, illustration
Suppose, we have a variable with s = 15 Smaller margins of error require larger sample sizes 5/24/2019 5: Intro to estimation

26 Acronyms SRS  simple random sample
SDM  sampling distribution of the mean SEM  sampling error of mean CI  confidence interval LCL  lower confidence limit UCL  lower confidence limit 5/24/2019 5: Intro to estimation


Download ppt "5: Introduction to estimation"

Similar presentations


Ads by Google