Estimation.

Slides:



Advertisements
Similar presentations
Estimation of Means and Proportions
Advertisements

STATISTICAL INFERENCE PART V CONFIDENCE INTERVALS 1.
Sampling: Final and Initial Sample Size Determination
Confidence Intervals This chapter presents the beginning of inferential statistics. We introduce methods for estimating values of these important population.
Statistics for Business and Economics
Chapter 11- Confidence Intervals for Univariate Data Math 22 Introductory Statistics.
1 Confidence Interval for a Mean. 2 Given A random sample of size n from a Normal population or a non Normal population where n is sufficiently large.
Statistical Concepts (continued) Concepts to cover or review today: –Population parameter –Sample statistics –Mean –Standard deviation –Coefficient of.
Chapter Topics Confidence Interval Estimation for the Mean (s Known)
Fall 2006 – Fundamentals of Business Statistics 1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter 7 Estimating Population Values.
Lecture 7 1 Statistics Statistics: 1. Model 2. Estimation 3. Hypothesis test.
Let sample from N(μ, σ), μ unknown, σ known.
Chapter 7 Estimating Population Values
8-1 Introduction In the previous chapter we illustrated how a parameter can be estimated from sample data. However, it is important to understand how.
BCOR 1020 Business Statistics
Inference on averages Data are collected to learn about certain numerical characteristics of a process or phenomenon that in most cases are unknown. Example:
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Statistical Intervals Based on a Single Sample.
Standard error of estimate & Confidence interval.
Standard Error of the Mean
Interval Estimates A point estimate gives a plausible single number estimate for a parameter. We may also be interested in a range of plausible values.
Estimation Goal: Use sample data to make predictions regarding unknown population parameters Point Estimate - Single value that is best guess of true parameter.
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Topic 5 Statistical inference: point and interval estimate
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Confidence Interval Estimation.
ESTIMATION. STATISTICAL INFERENCE It is the procedure where inference about a population is made on the basis of the results obtained from a sample drawn.
1 SAMPLE MEAN and its distribution. 2 CENTRAL LIMIT THEOREM: If sufficiently large sample is taken from population with any distribution with mean  and.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Statistical Interval for a Single Sample
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
Statistical estimation, confidence intervals
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Introduction  Populations are described by their probability distributions and parameters. For quantitative populations, the location and shape are described.
Stat 112: Notes 2 Today’s class: Section 3.3. –Full description of simple linear regression model. –Checking the assumptions of the simple linear regression.
11 Confidence Intervals – Introduction A point estimate provides no information about the precision and reliability of estimation. For example, the sample.
Chapter 8: Confidence Intervals based on a Single Sample
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 7.4 Estimation of a Population Mean  is unknown  This section presents.
Confidence Intervals for a Population Mean, Standard Deviation Unknown.
Lesoon Statistics for Management Confidence Interval Estimation.
ESTIMATION OF THE MEAN. 2 INTRO :: ESTIMATION Definition The assignment of plausible value(s) to a population parameter based on a value of a sample statistic.
Ex St 801 Statistical Methods Inference about a Single Population Mean (CI)
Chapter 8 Estimation ©. Estimator and Estimate estimator estimate An estimator of a population parameter is a random variable that depends on the sample.
Confidence Intervals. Point Estimate u A specific numerical value estimate of a parameter. u The best point estimate for the population mean is the sample.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Inferences Concerning Means.
We are interested in methods that produce an interval: Common interval methods for: Confidence intervals Prediction intervals Tolerance intervals Credibility/Probability.
1 Consider the k studies as coming from a population of effects we want to understand. One way to model effects in meta-analysis is using random effects.
Confidence Intervals.
Chapter 9 Hypothesis Testing
Introduction For inference on the difference between the means of two populations, we need samples from both populations. The basic assumptions.
More on Inference.
Confidence intervals for µ
ESTIMATION.
Introduction to Probability and Statistics Twelfth Edition
3. The X and Y samples are independent of one another.
Week 10 Chapter 16. Confidence Intervals for Proportions
More on Inference.
CONCEPTS OF ESTIMATION
Sampling Distribution
Sampling Distribution
Estimation Goal: Use sample data to make predictions regarding unknown population parameters Point Estimate - Single value that is best guess of true parameter.
Tutorial 9 Suppose that a random sample of size 10 is drawn from a normal distribution with mean 10 and variance 4. Find the following probabilities:
8.3 – Estimating a Population Mean
Confidence Intervals with Proportions
CHAPTER 12 More About Regression
IE 355: Quality and Applied Statistics I Confidence Intervals
ESTIMATION.
Chapter 8: Confidence Intervals
Chapter 8 Estimation.
Statistical inference for the slope and intercept in SLR
Uncertainty and confidence
Presentation transcript:

Estimation

Sampling Distributions Because estimators are based on random samples, they are random variates just like data! Estimators have distributions called sampling distributions Say are interested in mean Mn mass contained in bullets manufactured at a particular factory Lets use the average mass of Mn in a sample (of size n) to estimate the population mean mass: What might the distribution of look like if we take 1000 samples of size 10 over the course of a week?

Sampling Distributions Important features of an estimator’s sampling distribution: (Approximate) sampling distribution of Sampling dist mean: Sample size, n = 10 bullets Number of samples = 1000 Sampling dist s.d.:

Handy Unbiased Estimators An unbiased estimator of the mean that we always use is: Same as MLE estimate An unbiased estimator of the variance (which we will typically use as a variance estimator) is: Different from MLE estimate

Handy Unbiased Estimators An unbiased estimator for a proportion is: Heads, Success, etc, … An unbiased estimator of the standard error of p is:

Sampling Distributions Uncertainty in the estimate can be represented as standard deviation for the sampling distribution: is called the standard error of the estimator Estimated standard error of the sample average by plugging in

Interval Estimation We are interested in methods that produce an interval: Given the assumptions of the methods are satisfied, the interval covers the true value of the parameter with (approximate) probability at least 1 – a. Common interval methods for: Confidence intervals Prediction intervals Tolerance intervals Credibility/Probability intervals (Bayesian)

Confidence Intervals q is a parameter we are interested in and assume we don’t know its true value. e.g. a mean, a sd, a proportion, etc. Consider an experiment that will collect a sample of data. Then BEFORE we collect the data, we can devise procedure such that: Estimates we will get from the sample we have yet to collect

Confidence Intervals In order to get actual numerical values for and we perform the experiment and plug in the data The outcomes for this experiment are: Under the frequentist definition, probabilities (other than 0 or 1) only exist for outcomes of experiments that haven’t happened yet. After we collect data is a set of plausible values for q.

Confidence Intervals confidence is not probability Given a sample of data, the (1 − a)×100% confidence interval for a parameter estimate on the sample is: We are (1 − a)×100% confident that the true value of q is covered by The CI’s level of confidence: (1 − a)×100% is the same “number” as the CI –method’s probability of producing an interval that covers q, but… confidence is not probability

Confidence Intervals So how do we compute a (1 − a)×100% confidence interval given a set of data?? General Case: (1 − a)×100% CIs for the mean m : Sample size n, sd sX unknown and estimated: Two sided One sided, lower bound One sided, upper bound Student-t(n-1) quantiles qt(1-a/2,df=n-1) or qt(1-a,df=n-1)

Compute the Confidence Intervals A the mass of an unknown powder was determined 30 times. The Results are shown below (units: mg): 4.11, 3.70, 3.36, 3.68, 4.42, 3.23, 4.03, 4.03, 3.52, 4.75, 5.09, 3.47, 3.02, 4.24, 4.74, 4.51, 2.90, 4.15, 3.54, 3.81, 2.98, 3.82, 4.32, 3.06, 4.00, 4.05, 3.19, 3.17, 3.67, 4.37 Compute: The sample mean: The sample sd: The estimated standard error of the mean: The number of estimated standard errors that cover 95% of the sampling distribution symmetrically about the sample mean: ±

Compute the Confidence Intervals a. Sample mean = 3.83 d. For 95% , a = 0.05. 95% spread symmetrically about the mean we want t0.025, 29 and t0.975, 29 = ± 2.04523 b. Sample sd = 0.58 c. Est se of mean = 0.11 # Data from the question: x <- c(4.11, 3.70, 3.36, 3.68, 4.42, 3.23, 4.03, 4.03, 3.52, 4.75, 5.09, 3.47, 3.02, 4.24, 4.74, 4.51, 2.90, 4.15, 3.54, 3.81, 2.98, 3.82, 4.32, 3.06, 4.00, 4.05, 3.19, 3.17, 3.67, 4.37) n <- length(x) # Sample size mn <- mean(x) # Sample average (estimated mean) s <- sd(x) # Sample standard deviation se <- s/sqrt(n) # Estimated standard error of the mean alpha <- 0.05 # Level of significance conf <- 1 - alpha/2 # Level of confidence tt <- qt(p = conf, df = n-1) # t-quantile: The number of estimated standard # errors that cover conf*100% of the # sampling distribution for the mean.

Compute the Confidence Intervals e. Compute the two-sided 95% CI for the mean given this data: [ 3.83 – 2.04*0.11, 3.83 + 2.04*0.11 ] lo <- mn - tt*se hi <- mn + tt*se c(lo,hi) # Two-sided confidence interval for a set of # plauseable values for the mean given this sample. [3.61, 4.05]

Confidence Intervals For us, we can approximate the CI for any parameter we have encountered as (1 − a)×100% CIs for general parameter q : Two sided One sided, lower bound One sided, upper bound Student-t(n-1) quantiles qt(1-a/2,df=n-1) or qt(1-a,df=n-1)

Example Over a several month period the rate of attacks on a certain computer network per day were measured: 11.1, 12.3, 12.0, 11.3, 12.6, 12.9, 12.0, 13.2, 11.8, 13.2, 12.4, 10.3, 12.0, 12.1, 13.1 Compute the 90% lower confidence limit of the hack rate parameter.