Statistical Inference Chapter 12/13. COMP 5340/6340 Statistical Inference2 Statistical Inference Given a sample of observations from a population, the.

Slides:



Advertisements
Similar presentations
Estimation of Means and Proportions
Advertisements

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 6 Point Estimation.
Point Estimation Notes of STAT 6205 by Dr. Fan.
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Week11 Parameter, Statistic and Random Samples A parameter is a number that describes the population. It is a fixed number, but in practice we do not know.
Chapter 7. Statistical Estimation and Sampling Distributions
Chapter 7 Title and Outline 1 7 Sampling Distributions and Point Estimation of Parameters 7-1 Point Estimation 7-2 Sampling Distributions and the Central.
Statistical Estimation and Sampling Distributions
Sampling: Final and Initial Sample Size Determination
Estimation  Samples are collected to estimate characteristics of the population of particular interest. Parameter – numerical characteristic of the population.
SOLVED EXAMPLES.
Copyright © Cengage Learning. All rights reserved.
Chapter 7 Introduction to Sampling Distributions
Sampling Distributions
Chapter 6 Introduction to Sampling Distributions
Chapter 7 Sampling and Sampling Distributions
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 6 Introduction to Sampling Distributions.
Stat 321 – Lecture 26 Estimators (cont.) The judge asked the statistician if she promised to tell the truth, the whole truth, and nothing but the truth?
1 STATISTICAL INFERENCE PART I EXPONENTIAL FAMILY & POINT ESTIMATION.
Copyright © Cengage Learning. All rights reserved. 6 Point Estimation.
Lecture 7 1 Statistics Statistics: 1. Model 2. Estimation 3. Hypothesis test.
8-1 Introduction In the previous chapter we illustrated how a parameter can be estimated from sample data. However, it is important to understand how.
7-1 Introduction The field of statistical inference consists of those methods used to make decisions or to draw conclusions about a population. These.
BCOR 1020 Business Statistics
Maximum likelihood (ML)
Chapter 7 Estimation: Single Population
Chapter 6. Point Estimation Weiqi Luo ( 骆伟祺 ) School of Software Sun Yat-Sen University : Office : # A313
1 Introduction to Estimation Chapter Concepts of Estimation The objective of estimation is to determine the value of a population parameter on the.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Copyright © 2009 Cengage Learning Chapter 10 Introduction to Estimation ( 추 정 )
Random Sampling, Point Estimation and Maximum Likelihood.
7-1 Introduction The field of statistical inference consists of those methods used to make decisions or to draw conclusions about a population. These.
Chapter 7 Point Estimation
1 Lecture 16: Point Estimation Concepts and Methods Devore, Ch
Chapter 7 Sampling and Sampling Distributions ©. Simple Random Sample simple random sample Suppose that we want to select a sample of n objects from a.
Lecture 4: Statistics Review II Date: 9/5/02  Hypothesis tests: power  Estimation: likelihood, moment estimation, least square  Statistical properties.
Chapter 7 Point Estimation of Parameters. Learning Objectives Explain the general concepts of estimating Explain important properties of point estimators.
CLASS: B.Sc.II PAPER-I ELEMENTRY INFERENCE. TESTING OF HYPOTHESIS.
Confidence Interval & Unbiased Estimator Review and Foreword.
1 Chapter 9: Sampling Distributions. 2 Activity 9A, pp
Point Estimation of Parameters and Sampling Distributions Outlines:  Sampling Distributions and the central limit theorem  Point estimation  Methods.
Chapter 6 parameter estimation
Sampling Theory and Some Important Sampling Distributions.
Chapter 9 Inferences Based on Two Samples: Confidence Intervals and Tests of Hypothesis.
Statistics Sampling Distributions and Point Estimation of Parameters Contents, figures, and exercises come from the textbook: Applied Statistics and Probability.
Chapter 8 Estimation ©. Estimator and Estimate estimator estimate An estimator of a population parameter is a random variable that depends on the sample.
Parameter Estimation. Statistics Probability specified inferred Steam engine pump “prediction” “estimation”
Conditional Expectation
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Inferences Concerning Means.
LEARNING OBJECTIVES After careful study of this chapter you should be able to do the following: 1.Explain the general concepts of estimating the parameters.
Sampling and Sampling Distributions
Copyright © Cengage Learning. All rights reserved.
Statistical Estimation
Sampling Distributions
ESTIMATION.
STATISTICS POINT ESTIMATION
6 Point Estimation.
Chapter 8: Fundamental Sampling Distributions and Data Descriptions:
Chapter 7 Sampling Distributions.
Chapter 7 Sampling Distributions.
Chapter 7 Sampling Distributions.
LESSON 18: CONFIDENCE INTERVAL ESTIMATION
Chapter 7 Sampling Distributions.
6 Point Estimation.
Chapter 8: Fundamental Sampling Distributions and Data Descriptions:
Chapter 7 Sampling Distributions.
Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.
Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.
Applied Statistics and Probability for Engineers
Fundamental Sampling Distributions and Data Descriptions
Presentation transcript:

Statistical Inference Chapter 12/13

COMP 5340/6340 Statistical Inference2 Statistical Inference Given a sample of observations from a population, the statistical inference consists in estimating characteristics of the population. A characteristic may be guessed to: –Be a number (point estimation) –Lay within an interval

Point Estimation

COMP 5340/6340 Statistical Inference4 Random Sampling A sample is a subset of observations out of a population (in general, the all population is not observable) For correct inference, sampling must be random

COMP 5340/6340 Statistical Inference5 Point Estimate A statistic is any function of random variables constituting one or more samples, provided that the function does not depend on any unknown parameter values A point estimate of a parameter  is a single number that can be regarded as the most plausible value of . A point estimate is obtained by selecting a suitable statistic and computing its value from a given sample. The selected statistic is called the point estimate of .

COMP 5340/6340 Statistical Inference6 Example We want to evaluate the packet loss rate on a given channel. 25 packets are sent. Let X = number of corrupted (lost) packets. The parameter to be estimated is p = the proportion of lost packets Propose an estimator

COMP 5340/6340 Statistical Inference7 Example (2) We assume that the waiting time for a bus is uniformly distributed. However, we do not know the upper limit of the probability distribution. We want to estimate this parameter of the uniform probability distribution. Propose an estimator

COMP 5340/6340 Statistical Inference8 Sampling Distribution Multiple samples can be drawn Each sample may yield a different estimate Therefore, the estimate is a random variable with a probability distribution. In order to “evaluate” an estimate, we want to have an idea of its: –Central tendency –Variability

COMP 5340/6340 Statistical Inference9 Unbiased Estimators A point estimator ˆ  is said to be an unbiased estimator of  if E(ˆ  ) =  for every possible value of . If  is not unbiased, the difference E(ˆ  ) –  is called the bias. Intuitively, an unbiased estimator is one that can equally underestimate or overestimate a given parameter.

COMP 5340/6340 Statistical Inference10 Estimators Mean Median [Max(samples)-Min(samples)]/2 Trimmed mean Xtr(10)

COMP 5340/6340 Statistical Inference11 Some Unbiased Estimators If X is a binomial random variable with parameter n and p, the sample proportion p=X/n is an unbiased estimator of p. Let X 1, X 2,…X n be a random sample from a distribution with mean  and variance  . Then the estimator is an unbiased estimator of  .

COMP 5340/6340 Statistical Inference12 Some Unbiased Estimators (2) Let X 1, X 2,…X n be a random sample from a distribution with mean  and variance  . Then the estimator is an unbiased estimator of . If the distribution is continuous and symmetric, then any trimmed mean is also an unbiased estimator.

COMP 5340/6340 Statistical Inference13 Desirable Properties of Estimators Unbiased Minimal variance –The precision of an estimator is measured by the standard error of the estimator, i.e.

Methods of Point Estimation

COMP 5340/6340 Statistical Inference15 Methods of Point Estimation Method of moments Maximum likelihood estimation (recommended for large samples)

COMP 5340/6340 Statistical Inference16 Method of Moments Reminder: Definition of moments Definition: Let X 1, X 2,…X n be a random sample from a p.m.f or p.d.f f(x). For k=1,2,3… the k th population moment or k th moment of the distribution f(x) is E(X k ). The k th sample moment is Let X 1, X 2,…X n be a random sample from a p.m.f or p.d.f f(x,  1 ….  m ) where  1 ….  m are the parameters whose values are unknown. Then, the moments estimators ˆ  1 …. ˆ  m are obtained by equating the first m sample moments to the first correspondng population moments and solving for  1 ….  m.

COMP 5340/6340 Statistical Inference17 Example 1 Let x 1, x 2,…x n represent a random sample of service time of n customers at some facility, where the underlying distributionis assumed exponential with parameter. How many parameters need to be estimated? What is the 1 st sample moment? What is the moment estimator of 

COMP 5340/6340 Statistical Inference18 Example 2 Let x 1, x 2,…x n represent a random sample from a gamma distribution with parameters  and . Reminder: E(X) =  and V(X) =  . How many parameters need to be estimated? What is the 1 st sample moment? What is the 2 nd sample moment? What are the estimators for  and  

COMP 5340/6340 Statistical Inference19 Example 3/Exercise Let x 1, x 2,…x n represent a random sample from a generalized negative binomial distribution with parameters r and p. Reminder: E(X) = ? and V(X) = ?. How many parameters need to be estimated? What is the 1 st sample moment? What is the 2 nd sample moment? What are the estimators for r and p 

COMP 5340/6340 Statistical Inference20 Maximum Likelihood Estimation Example: –10 packets are sent over a lossy channel with packet loss rate p. The 2nd, 4th, and 8th are lost. The joint p.mf. of this sample is: –f(x 1, x 2,…x 10 ;p) = (1-p)p(1-p)p(1-p)…p(1-p)(1-p) = p3(1-p)7 –For what value of p is the observed sample most likely to have occurred? –In other words, for which value of p is [ p3(1-p)7 ] maximized? –f(x 1, x 2,…x 10 ;p) is maximized for the value of p such that

COMP 5340/6340 Statistical Inference21 Maximum Likelihood Estimation Definition: - Let X 1, X 2,…X n have a joint p.m.f or p.d.f. f(x 1,x 2,…,x n,  1 ….  m ) where the parameters  1 ….  m take unknown values. -f(x 1,x 2,…,x n, ˆ  1 …. ˆ  m ) can be considered as a function of the parameters ˆ  1 …. ˆ  m and is called the likelihood function. - The maximum likelihood estimates ˆ  1 …. ˆ  m are those values for the qi that maximize the likelihood function.

COMP 5340/6340 Statistical Inference22 M.L.E Example Let x 1, x 2,…x n represent a random sample from an exponential distribution with parameter. Because of the independence, the likelihood function is a product of the individual p.d.f.’s. To maximize products, it is better to work with the ln (natural log)

COMP 5340/6340 Statistical Inference23 M.L.E Example (2) To find the value of that maximizes the likelihood function, we derive We solve the equation

Interval Estimation (Single Sample)

COMP 5340/6340 Statistical Inference25 Introduction to Confidence Interval Simple case. We are interested in the following parameter: the population mean . Assume (unrealistically) that: –The population distribution is normal –The value of the population standard deviation  is known.

COMP 5340/6340 Statistical Inference26 Introduction to Confidence Interval (2) Let x 1, x 2,…x n represent a random sample from normal distribution with mean  and standard deviation . The objective is to find a confidence interval of 95% for . What can we say of the random variable ? Probability distribution? Mean? Standard deviation? What is the standardized variable Z for Y? Using the normal distribution table: Normal 

COMP 5340/6340 Statistical Inference27 Introduction to Confidence Interval (3) Manipulating We get Substituting with the sample values

COMP 5340/6340 Statistical Inference28 100(1-  )% Confidence Interval Definition: a 100(1-  )% confidence interval for the mean  of a normal population when the value of  is known is given by 1-  0

COMP 5340/6340 Statistical Inference29 100(1-  )% Confidence Interval Common Values 1-  0 Any confidence is achievable (need normal distribution table) Common values used are 90%, 95%, and 99%

COMP 5340/6340 Statistical Inference30 Confidence, Sample Size, and Interval Length Increasing confidence increases the interval length (sample size fixed) To increase confidence without increasing interval length, one must increase sample size n 1-  0

Large Sample Confidence Intervals -Population mean - Proportion

COMP 5340/6340 Statistical Inference32 Confidence Interval for the Population Mean If n est sufficiently large has approximately a standard normal distribution. This implies that is a large-sample confidence interval for  with confidence level approximately 100(1-  )%.

COMP 5340/6340 Statistical Inference33 Confidence Interval for the Population Proportion A large-sample 100(1-  )% confidence interval for a population proportion is where, n is the sample size, and x is the number of successes. This interval is valid whenever

COMP 5340/6340 Statistical Inference34 Exercise 1) What should be the sample size to achieve 100(1-  )% confidence interval over an interval of length L? Assume p known 2) What should be the sample size to achieve 100(1-  )% confidence interval over an interval of length L? Assume p UNknown