Estimation Basic Concepts & Estimation of Proportions

Slides:



Advertisements
Similar presentations
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Advertisements

Statistical Estimation and Sampling Distributions
POINT ESTIMATION AND INTERVAL ESTIMATION
Chap 8-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 8 Estimation: Single Population Statistics for Business and Economics.
Class notes for ISE 201 San Jose State University
Statistical Inference Chapter 12/13. COMP 5340/6340 Statistical Inference2 Statistical Inference Given a sample of observations from a population, the.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 7-1 Chapter 7 Confidence Interval Estimation Statistics for Managers.
Evaluating Hypotheses
8 Statistical Intervals for a Single Sample CHAPTER OUTLINE
Chapter 8 Estimation: Single Population
OMS 201 Review. Range The range of a data set is the difference between the largest and smallest data values. It is the simplest measure of dispersion.
Chapter 7 Estimation: Single Population
Sampling Distributions
Lecture 7 1 Statistics Statistics: 1. Model 2. Estimation 3. Hypothesis test.
Inferences About Process Quality
Special Continuous Probability Distributions Normal Distributions
1 Dr. Jerrell T. Stracener EMIS 7370 STAT 5340 Probability and Statistics for Scientists and Engineers Department of Engineering Management, Information.
Chapter 7 Estimation: Single Population
1 Statistical Analysis - Graphical Techniques Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering EMIS 7370/5370 STAT 5340 : PROBABILITY AND.
1 Sampling and Sampling Distributions Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering EMIS 7370/5370 STAT 5340 : PROBABILITY AND STATISTICS.
1 Dr. Jerrell T. Stracener EMIS 7370 STAT 5340 Probability and Statistics for Scientists and Engineers Department of Engineering Management, Information.
Modeling Process Capability Normal, Lognormal & Weibull Models
QBM117 Business Statistics Estimating the population mean , when the population variance  2, is known.
Prof. Dr. S. K. Bhattacharjee Department of Statistics University of Rajshahi.
PROBABILITY (6MTCOAE205) Chapter 6 Estimation. Confidence Intervals Contents of this chapter: Confidence Intervals for the Population Mean, μ when Population.
Random Sampling, Point Estimation and Maximum Likelihood.
1 Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering EMIS 7370/5370 STAT 5340 : PROBABILITY AND STATISTICS FOR SCIENTISTS AND ENGINEERS Systems.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
1 SMU EMIS 7364 NTU TO-570-N Inferences About Process Quality Updated: 2/3/04 Statistical Quality Control Dr. Jerrell T. Stracener, SAE Fellow.
1 Special Continuous Probability Distributions -Normal Distributions -Lognormal Distributions Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering.
1 Estimation of Standard Deviation & Percentiles Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering EMIS 7370/5370 STAT 5340 : PROBABILITY.
1 Special Continuous Probability Distributions -Exponential Distribution -Weibull Distribution Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering.
Topic 6 - Confidence intervals based on a single sample Sampling distribution of the sample mean - pages Sampling distribution of the.
Maz Jamilah Masnan Institute of Engineering Mathematics Semester I 2015/ Sampling Distribution of Mean and Proportion EQT271 ENGINEERING STATISTICS.
Chapter 8 : Estimation.
9.9 Single Sample: Estimating of a Proportion:.p = Population proportion of successes (elements of Type A) in the population.n = sample size X = no. of.
1 Continuous Probability Distributions Continuous Random Variables & Probability Distributions Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering.
CHAPTER SEVEN ESTIMATION. 7.1 A Point Estimate: A point estimate of some population parameter is a single value of a statistic (parameter space). For.
Sampling Distributions Sampling Distribution of Sampling Distribution of Point Estimation Point Estimation Introduction Introduction Sampling Distribution.
Chapter 7 Point Estimation of Parameters. Learning Objectives Explain the general concepts of estimating Explain important properties of point estimators.
1 Standard error Estimated standard error,s,. 2 Example 1 While measuring the thermal conductivity of Armco iron, using a temperature of 100F and a power.
Confidence Interval & Unbiased Estimator Review and Foreword.
Stracener_EMIS 7305/5305_Spr08_ Reliability Data Analysis and Model Selection Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering.
Point Estimation of Parameters and Sampling Distributions Outlines:  Sampling Distributions and the central limit theorem  Point estimation  Methods.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 8-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
CHAPTER 9 Inference: Estimation The essential nature of inferential statistics, as verses descriptive statistics is one of knowledge. In descriptive statistics,
Chapter 9: One- and Two-Sample Estimation Problems: 9.1 Introduction: · Suppose we have a population with some unknown parameter(s). Example: Normal( ,
Statistics Sampling Distributions and Point Estimation of Parameters Contents, figures, and exercises come from the textbook: Applied Statistics and Probability.
ESTIMATION Prepared by: Paolo Lorenzo Bautista. Estimation  We wish to estimate a characteristic of the population, by using information from the sample.
Week 21 Order Statistics The order statistics of a set of random variables X 1, X 2,…, X n are the same random variables arranged in increasing order.
Chapter 8 Estimation ©. Estimator and Estimate estimator estimate An estimator of a population parameter is a random variable that depends on the sample.
President UniversityErwin SitompulPBST 10/1 Lecture 10 Probability and Statistics Dr.-Ing. Erwin Sitompul President University
1 Discrete Probability Distributions Hypergeometric & Poisson Distributions Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering EMIS 7370/5370.
1 Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering EMIS 7370/5370 STAT 5340 : PROBABILITY AND STATISTICS FOR SCIENTISTS AND ENGINEERS Systems.
Evaluating Hypotheses. Outline Empirically evaluating the accuracy of hypotheses is fundamental to machine learning – How well does this estimate its.
Statistics for Business and Economics 8 th Edition Chapter 7 Estimation: Single Population Copyright © 2013 Pearson Education, Inc. Publishing as Prentice.
Week 21 Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Inferences Concerning Means.
Virtual University of Pakistan
Sampling Distributions
Sampling Distribution Estimation Hypothesis Testing
-Exponential Distribution -Weibull Distribution
STATISTICS POINT ESTIMATION
STATISTICAL INFERENCE
Random Sampling Population Random sample: Statistics Point estimate
CONCEPTS OF ESTIMATION
9.9 Single Sample: Estimating of a Proportion:
Chapter 8 Estimation.
Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.
Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.
Presentation transcript:

Estimation Basic Concepts & Estimation of Proportions EMIS 7370 STAT 5340 Department of Engineering Management, Information and Systems Probability and Statistics for Scientists and Engineers Estimation Basic Concepts & Estimation of Proportions Dr. Jerrell T. Stracener

Estimation Basic Concepts & Estimation of Proportions EMIS 7370 STAT 5340 Department of Engineering Management, Information and Systems Probability and Statistics for Scientists and Engineers Estimation Basic Concepts & Estimation of Proportions Dr. Jerrell T. Stracener

Estimation Types of estimates and methods of estimation Estimation - Binomial distribution - Estimation of a Proportion - Estimation of the difference between two proportions Estimation - Normal distribution - Estimation of the mean - Estimation of the standard deviation means

Estimation Estimation - Normal distribution (continued) - Estimation of the ratio of the two standard deviations - Tolerance intervals Estimation - Lognormal distribution Estimation - Weibull distribution Estimation - Unknown distribution Types - Continuous populations - Finite populations

Types of Estimates & Methods of Estimation

Definition - Statistic A statistic is a function of only the values of a random sample, X1, X2, …, Xn. For example is a statistic

Properties of Estimates A statistic is said to be an unbiased estimator of the parameter  if If we consider all possible unbiased estimators of some parameter , the one with the smallest variance is called the most efficient estimator of 

Types of Estimates Point Estimate A function of the values of a random sample that yields a single value, i.e., a point Interval Estimate An interval, whose end points are functions of the values of a random sample, for which one can assert with a specified confidence that the interval contains the parameter being estimated

Types of Estimates & Methods of Estimation If we use a sample mean to estimate the mean of a population, a sample proportion to estimate the probability of success on an individual trial, or a sample variance to estimate the variance of a population, we are in each case using a point estimate of the parameter in question. These estimates are called point estimates since they are single numbers, single points, used, respectively, to estimate , , and 2. Since we can hardly expect the point estimates based on samples to hit the parameters they are supposed to estimate exactly ‘on the nose’, it is often desirable to give an interval rather than a single number.

Types of Estimates & Methods of Estimation We can then assert with a certain probability (or degree of confidence) that such an interval contains the parameter it is intended to estimate. For instance, when estimating the average IQ of all college students in the US, we might arrive at a point estimate of 117, or we might arrive at an interval estimate to the effect that the interval from 113 to 121 contains the ‘true’ average IQ of all college students in the US.

Interval Estimates of  for Different Samples

Method of Maximum Likelihood Given independent observations x1, x2, ..., xn from a probability density function (continuous case) f(x; ) or probability mass function (discrete case) p(x; ) the maximum likelihood estimator is that which maximizes the likelihood function. L(X1, X2, ..., Xn; ) = f(X1; )·f(X2; )·...·f(Xn; ), if x is continuous = p(X1; )·p(X2; )·...·p(Xn; ), if x is discrete

Method of Maximum Likelihood Let x1, x2, ..., xn denote observed values in a sample. In the case of a discrete random variable the interpretation is very clear. The quantity L(x1, x2, ..., xn; ), the likelihood of the sample, is the following joint probability: P(X1 = x1, X2 = x2, ... , Xn = xn) This is the probability of obtaining the sample values x1, x2, ..., xn. For the discrete case the maximum likelihood estimator is one that results in a maximum value for this joint probability, or maximizes the likelihood of the sample.

Estimation of Proportions

Estimation of Proportions Estimation of the proportion, p, based on a random sample of the Binomial Distribution B(n,p) Point Estimation Interval Estimation Approximate Method Sample Size Exact Method Estimation of the difference in two proportions P1 - P2 based on random samples from B(n, P1) and B(n, p2).

Estimation of Proportions Point Estimation Interval Estimation

Estimation - Binomial Distribution Estimation of a Proportion, p X1, X2, …, Xn is a random sample of size n from B(n, p), where Point estimate of p: where fs = # of successes _ ^

Estimation - Binomial Distribution Approximate (1 - )·100% confidence interval for p: where and where , and is the value of the standard normal random variable Z such that ^ ^ ^ ^

Note When n is small and the unknown proportion p is believed to be close to 0 or to 1, the approximate confidence interval procedure established here is unreliable and, therefore, should not be used. To be on the safe side, one should require or ^ ^

Error in Estimating p by p ^ Error in Estimating p by p error ^

Error in Estimating p by p ^ Error in Estimating p by p If p is used as an estimate of p, we can be (1 - )·100% confident that the error will not exceed (1 - )·100% confident that the error will be less than a specified amount e when the sample size is ^ ^ ^ ^ ^

Error in Estimating p by p ^ Error in Estimating p by p If p is not used as an estimate of p, we can be at least (1 - )·100% confident that the error will not exceed a specified amount e when the sample size is ^

Example In a random sample of n = 500 families owning television sets in the city of Hamilton, Canada, it is found that x = 340 subscribed to HBO. a. How large a sample is required if we want to be 95% confident that our estimate of p, p, is within 0.02? b. How large a sample is required if we want to be 95% confident that our estimate of p is within ^

Example - Solution a. Let us treat the 500 families as a preliminary sample providing an estimate p = 0.68. ^ Therefore, if we base our estimate of p on a random sample of size 2090, we can be 95% confident that our sample proportion will not differ from the true proportion by more than 0.02

Example - Solution b. We shall now assume that no preliminary sample has been taken to provide an estimate of p. Consequently, we can be at least 95% confident that our sample proportion will not differ from the true proportion by more than 0.02 if we choose a sample of size

Example: Estimation of Binomial parameter p In a random sample of n = 500 families owning television sets in the city of Hamilton, Canada, it was found that fS = 340 owned color sets. Estimate the population proportion of families with color TV sets and determine a 95% confidence interval for the actual proportion of families in this city with color sets.

Example: solution The point estimate of p is = 340/500 = 0.68. Then, an approximate 95% confidence interval for p is where and so that ^ ^ ^ ^ ^

Example: solution and an approximate 95% confidence interval for p is (0.63911, 0.72089). Therefore, our “best” (point estimate) of p is 0.68 and we are about 95% confident that p is between 0.64 and 0.72.

( ) Estimation - Binomial Population Exact (1 - )·100% Confidence Interval for p: , where F1 = and , where F2 = , and is the value of x for which P(X> )=  NOTE: Use the ‘FINV’ function in Excel to get the values of F1 and F2 ( ) U L P ,

Example A random sample of 25 vehicle records are selected for audit from a large number of county records. It is found that 5 have errors. Estimate the population proportion of vehicle records having errors in terms of a point estimate and 95% confidence interval.

Example - solution An approximate 95% confidence interval for p is where  = 0.05 and ^

Example - solution Then ^ ^ ^

Example - solution ^

Example - solution An exact 95% confidence interval for p is where

Example - solution and

Estimation - Binomial Populations Estimation of the difference between two proportions Let X11, X12, …, , and X21, X22, …, , be random samples from B(n1, p1) and B(n2, p2) respectively Point estimation of p1 – p2 = p ^ ^ ^

Estimation - Binomial Populations Approximate (1 - )·100% confidence interval for where and ^ ^ ^ ^ ^ ^ ^ ^ ^ ^

Example: Estimation of P1 - P2 A certain change in a manufacturing process for component parts is being considered. Samples are taken using both the existing and the new procedure in order to determine if the new procedure results in an improvement. If 75 of 1500 items from the existing procedure were found to be defective and 80 of 2000 items from the new procedure were found to be defective, find a confidence interval for the true difference in the fraction of defectives between the existing and the new process.

Example: solution Let p1 and p2 be the true proportions of defectives for the existing and new procedures, respectively. Hence and and the point estimate of p = p1 - p2 is = 0.05 - 0.04 = 0.01 ^ ^ ^ ^ ^

Example: solution An approximate 90% confidence interval for p = p1 - p2 is where

Example: solution ^ ^ ^ ^ ^ ^

Example: solution Then Therefore an approximate 90% confidence interval for  = p1 - p2 is (-0.0017, 0.0217). or about 93% of the length of the confidence Interval favors the new procedure