Presentation is loading. Please wait.

Presentation is loading. Please wait.

Intro to Statistical Inference

Similar presentations


Presentation on theme: "Intro to Statistical Inference"— Presentation transcript:

1 Intro to Statistical Inference
Estimation Each slide has its own narration in an audio file. For the explanation of any slide click on the audio icon to start it. Professor Friedman's Statistics Course by H & L Friedman is licensed under a  Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License. 

2 Statistical Inference involves:
Estimation Hypothesis Testing Both activities use sample statistics (for example, X̅) to make inferences about a population parameter (μ). Estimation

3 Estimation Why don’t we just use a single number (a point estimate) like, say, X̅ to estimate a population parameter, μ? The problem with using a single point (or value) is that it will very probably be wrong. In fact, with a continuous random variable, the probability that the variable is equal to a particular value is zero. So, P(X̅=μ) = 0. This is why we use an interval estimator. We can examine the probability that the interval includes the population parameter. Estimation

4 Confidence Interval Estimators
How wide should the interval be? That depends upon how much confidence you want in the estimate. For instance, say you wanted a confidence interval estimator for the mean income of a college graduate: The wider the interval, the greater the confidence you will have in it as containing the true population parameter μ. You might have That the mean income is between 100% confidence $0 and $∞ 95% confidence $35,000 and $41,000 90% confidence $36,000 and $40,000 80% confidence $37,500 and $38,500 0% confidence $38,000 (a point estimate) Estimation

5 Confidence Interval Estimators
To construct a confidence interval estimator of μ, we use: X̅ ± Zα σ /√n (1-α) confidence where we get Zα from the Z table. When we don’t know σ we should really be using a different table (future lectures will cover this) but, often, if n is large (say n≥30), we may use s instead since we assume that it is close to the value of σ. Estimation

6 Confidence Interval Estimators
To be more precise, the α is split in half since we are constructing a two-sided confidence interval. However, for the sake of simplicity, we call the z-value Zα rather than Za/2 . Estimation

7 Question You work for a company that makes smart TVs, and your boss asks you to determine with certainty the exact life of a smart TV. She tells you to take a random sample of 100 TVs. What is the exact life of a smart TV made by this company? Sample Evidence: n = 100 X̅ = years s = 2.50 years Estimation

8 Answer – Take 1 Since your boss has asked for 100% confidence, the only answer you can accurately provide is: -∞ to + ∞ years. After you are fired, perhaps you can get your job back by explaining to your boss that statisticians cannot work with 100% confidence if they are working with data from a sample. If you want 100% confidence, you must take a census. With a sample, you can never be absolutely certain as to the value of the population parameter. This is exactly what statistical inference is: Using sample statistics to draw conclusions (e.g., estimates) about population parameters. Estimation

9 The Better Answer n = 100 X̅ = 11.50 years S = 2.50 years
at 95% confidence: 11.50 ± 1.96*(2.50/√100) 11.50 ± 1.96*(.25) 11.50 ± .49 The 95% CIE is: years ---- years [Note: Ideally we should be using σ but since n is large we assume that s is close to the true population standard deviation.] Estimation

10 The Better Answer - Interpretation
We are 95% confident that the interval from years to years contains the true population parameter, μ. Another way to put this is, in 95 out of 100 samples, the population mean would lie in intervals constructed by the same procedure (same n and same α). Remember – the population parameter (μ ) is fixed, it is not a random variable. Thus, it is incorrect to say that there is a 95% chance that the population mean will “fall” in this interval. Estimation

11 EXAMPLE: Life of a Refrigerator
The sample: n = 100 X̅ = 18 years s = 4 years Construct a confidence interval estimator (CIE) of the true population mean life (µ), at each of the following levels of confidence: (a)100% (b) 99% (c) 95% (d) 90% (e) 68% Estimation

12 EXAMPLE: Life of a Refrigerator
Again, in this example, we should ideally be using σ but since n is large we assume that s is close to the true population standard deviation. It should be noted that s2 is an unbiased estimator of σ2: E(s2) = σ2 σ2 = s2 = Estimation

13 EXAMPLE: Life of a Refrigerator
(a) 100% Confidence [α = 0, Zα = ∞] 100% CIE:  −∞ years ↔ +∞ years (b) 99% Confidence α = .01, Zα = (from Z table) 18 ± (4/√100) 18 ± 1.03 99% CIE:  16.97 years ↔ years (c) 95% Confidence α = .05, Zα = 1.96 (from Z table) 18 ± 1.96 (4/√100) 18 ± 0.78 95% CIE: years ↔ years Estimation

14 EXAMPLE: Life of a Refrigerator
(d) 90% Confidence α = .10, Zα = (from Z table) 18 ± (4/√100) 18 ± 0.66 90% CIE: years ↔ years  (e) 68% Confidence α = .32, Zα =1.0 (from Z table) 18 ± 1.0 (4/√100) 18 ± 0.4 68% CIE: years ↔ years Estimation

15 Balancing Confidence and Width in a CIE
How can we keep the same level of confidence and still construct a narrower CIE? Let’s look at the formula one more time: X̅ ± Zασ/√n The sample mean is in the center. The more confidence you want, the higher the value of Z, the larger the half-width of the interval. The larger the sample size, the smaller the half- width, since we divide by √n. So, what can we do? If you want a narrower interval, take a larger sample. What about a smaller standard deviation? Of course, this depends on the variability of the population. However, a more efficient sampling procedure (e.g., stratification) may help. That topic is for a more advanced statistics course. Estimation

16 Key Points Once you are working with a sample, not the entire population, you cannot be 100% certain of population parameters. If you need to know the value of a parameter certainty, take a census. The more confidence you want to have in the estimator, the larger the interval is going to be. Traditionally, statisticians work with 95% confidence. However, you should be able to use the Z-table to construct a CIE at any level of confidence. Estimation

17 More Homework for you. Do the rest of the problems in the lecture notes. Estimation


Download ppt "Intro to Statistical Inference"

Similar presentations


Ads by Google