Econ 3790: Business and Economics Statistics

Slides:



Advertisements
Similar presentations
University of Minnesota-Duluth, Econ-2030 (Dr. Tadesse) Inferential Statistics.
Advertisements

Chapter 6 Sampling and Sampling Distributions
Chapter 8 Interval Estimation Population Mean:  Known Population Mean:  Known Population Mean:  Unknown Population Mean:  Unknown n Determining the.
Chapter 7 Introduction to Sampling Distributions
Sampling Distributions
Chapter 7 Sampling and Sampling Distributions
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 6 Introduction to Sampling Distributions.
Chapter 8 Estimation: Single Population
OMS 201 Review. Range The range of a data set is the difference between the largest and smallest data values. It is the simplest measure of dispersion.
Sampling and Sampling Distributions Simple Random Sampling Point Estimation Sampling Distribution.
Chapter 7 Estimation: Single Population
QMS 6351 Statistics and Research Methods Chapter 7 Sampling and Sampling Distributions Prof. Vera Adamchik.
Chapter 7 Sampling and Sampling Distributions Sampling Distribution of Sampling Distribution of Introduction to Sampling Distributions Introduction to.
1 1 Slide Interval Estimation Chapter 8 BA Slide A point estimator cannot be expected to provide the exact value of the population parameter.
Chapter 7 Sampling and Sampling Distributions n Simple Random Sampling n Point Estimation n Introduction to Sampling Distributions n Sampling Distribution.
1 1 Slide © 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Econ 3790: Business and Economics Statistics Instructor: Yogesh Uppal
1 1 Slide Slides Prepared by JOHN S. LOUCKS St. Edward’s University © 2002 South-Western/Thomson Learning.
1 1 Slide © 2004 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Chapter 7 Sampling and Sampling Distributions Sampling Distribution of Sampling Distribution of Introduction to Sampling Distributions Introduction to.
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 1 Slide © 2005 Thomson/South-Western Chapter 7, Part A Sampling and Sampling Distributions Sampling Distribution of Sampling Distribution of Introduction.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 Introduction to Estimation Chapter Concepts of Estimation The objective of estimation is to determine the value of a population parameter on the.
PROBABILITY (6MTCOAE205) Chapter 6 Estimation. Confidence Intervals Contents of this chapter: Confidence Intervals for the Population Mean, μ when Population.
1 1 Slide © 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Chap 6-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 6 Introduction to Sampling.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 6-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved OPIM 303-Lecture #5 Jose M. Cruz Assistant Professor.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved Chapter 7 Sampling and Sampling Distributions Sampling Distribution of Sampling Distribution.
1 Chapter 7 Sampling and Sampling Distributions Simple Random Sampling Point Estimation Introduction to Sampling Distributions Sampling Distribution of.
Econ 3790: Business and Economics Statistics Instructor: Yogesh Uppal
1 Estimation From Sample Data Chapter 08. Chapter 8 - Learning Objectives Explain the difference between a point and an interval estimate. Construct and.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
1 Chapter 7 Sampling Distributions. 2 Chapter Outline  Selecting A Sample  Point Estimation  Introduction to Sampling Distributions  Sampling Distribution.
Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved. Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved Chapter 8 Interval Estimation Population Mean:  Known Population Mean:  Known Population.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Example: In a recent poll, 70% of 1501 randomly selected adults said they believed.
Statistics for Business and Economics 8 th Edition Chapter 7 Estimation: Single Population Copyright © 2013 Pearson Education, Inc. Publishing as Prentice.
Econ 3790: Business and Economics Statistics Instructor: Yogesh Uppal
1 Chapter 8 Interval Estimation. 2 Chapter Outline  Population Mean: Known  Population Mean: Unknown  Population Proportion.
Statistics for Business and Economics 8 th Edition Chapter 7 Estimation: Single Population Copyright © 2013 Pearson Education, Inc. Publishing as Prentice.
Fundamentals of Business Statistics chapter7 Sampling and Sampling Distributions 上海金融学院统计系 Statistics Dept., Shanghai Finance University.
Chapter 6 Sampling and Sampling Distributions
Sampling Distributions
Confidence Intervals and Sample Size
Sampling Distributions
Chapter 6 Inferences Based on a Single Sample: Estimation with Confidence Intervals Slides for Optional Sections Section 7.5 Finite Population Correction.
ESTIMATION.
Normal Distribution and Parameter Estimation
Chapter 7 (b) – Point Estimation and Sampling Distributions
Chapter 6 Confidence Intervals.
St. Edward’s University
Chapter 5, Part A [SBE 7/e, Ch. 7] Sampling and Sampling Distributions
St. Edward’s University
Chapter 7 Sampling Distributions.
Problems: Q&A chapter 6, problems Chapter 6:
Chapter 8 Interval Estimation
Chapter 6 Confidence Intervals.
Estimation Goal: Use sample data to make predictions regarding unknown population parameters Point Estimate - Single value that is best guess of true parameter.
Chapter 6 Confidence Intervals.
Chapter 7 Sampling Distributions.
Lecture 7 Sampling and Sampling Distributions
Chapter 7 Sampling Distributions.
Chapter 7 Sampling and Sampling Distributions
Chapter 6 Confidence Intervals.
Chapter 7 Sampling Distributions.
Presentation transcript:

Econ 3790: Business and Economics Statistics Instructor: Yogesh Uppal Email: yuppal@ysu.edu

Normal Probability Distribution Characteristics Probabilities for the normal random variable are given by areas under the curve. The total area under the curve is 1 (.5 to the left of the mean and .5 to the right). .5 .5 x Mean m

How to find probabilities of a random variable (x) which has a normal distribution. Convert the x values into the z scores or more formally, standardize x. After the conversion, we can use the z-scores to find probabilities from a table (called table of standard normal probabilities).

Standardizing the Normal Values or the z-scores Z-scores can be calculated as follows: We can think of z as a measure of the number of standard deviations x is from .

Standard Normal Probability Distribution A standard normal distribution is a normal distribution with mean of 0 and variance of 1. If x has a normal distribution with mean (μ) and Variance (σ), then z is said to have a standard normal distribution. s = 1 z

Characteristics of Standard Normal Distribution It is a type of the normal distribution. Its mean is zero and variance is one. Z-values on the left side of the mean are negative and right side of the mean are positive. Important point is what symmetry means in this kind of distribution? How do you interpret the values in the Standard Normal Table?

Example: Air Quality I collected this data on the air quality of various cities as measured by particulate matter index (PMI). A PMI of less than 50 is said to represent good air quality. The data is available on the class website. Suppose the distribution of PMI is approximately normal.

Example: Air Quality Suppose I want to find out the probability of air quality being good? What is the probability that PMI is greater than 80? What is the probability that PMI is with 2 standard deviations from the mean?

Computing x from a given z-score: Suppose I tell you that in our air quality example, the probability is 40% that standardized value of the PMI is between -z and +z. What are the corresponding x values?

Chapter 7, Part A Sampling and Sampling Distributions Simple Random Sampling Point Estimation Introduction to Sampling Distributions Sampling Distribution of

Statistical Inference The sample results provide only estimates of the values of the population characteristics. With proper sampling methods, the sample results can provide “good” estimates of the population characteristics. A parameter is a numerical characteristic of a population.

Statistical Inference The purpose of statistical inference is to obtain information about a population from information contained in a sample. A population is the set of all the elements of interest. A sample is a subset of the population.

Simple Random Sampling: Finite Population A simple random sample of size n from a finite population of size N is a sample selected such that each possible sample of size n has the same probability of being selected.

Simple Random Sampling: Finite Population Replacing each sampled element before selecting subsequent elements is called sampling with replacement. Sampling without replacement is the procedure used most often. In large sampling projects, computer-generated random numbers are often used to automate the sample selection process.

Point Estimation In point estimation we use the data from the sample to compute a value of a sample statistic that serves as an estimate of a population parameter. We refer to as the point estimator of the population mean . s is the point estimator of the population standard deviation . is the point estimator of the population proportion p.

Sampling Error When the expected value of a point estimator is equal to the population parameter, the point estimator is said to be unbiased. The absolute value of the difference between an unbiased point estimate and the corresponding population parameter is called the sampling error. Sampling error is the result of using a subset of the population (the sample), and not the entire population. Statistical methods can be used to make probability statements about the size of the sampling error.

Sampling Error The sampling errors are: for sample mean for sample standard deviation for sample proportion

Air Quality Example Let us suppose that the population of air quality data consists of 191 observations. How would you determine the following population parameters: mean, standard deviation, proportion of cities with good air quality.

Air Quality Example How about picking a random sample from this population representing the air quality? We shall use SPSS to do this random sampling for us. How would you use this sample to provide point estimates of the population parameters?

Summary of Point Estimates Obtained from a Simple Random Sample Population Parameter Parameter Value Point Estimator Point Estimate m = Population mean SAT score 40.9 = Sample mean SAT score …. s = Population std. deviation for SAT score 20.5 s = Sample std. deviation for SAT score ….. p = Population pro- portion .62 = Sample pro- portion wanting campus housing ….

of n elements is selected Process of Statistical Inference Population with mean m = ? A simple random sample of n elements is selected from the population. The value of is used to make inferences about the value of m. The sample data provide a value for the sample mean .

Sampling Distribution of The sampling distribution of is the probability distribution of all possible values of the sample mean . Expected Value of E( ) =  where:  = the population mean

Sampling Distribution of Standard Deviation of Finite Population Infinite Population A finite population is treated as being infinite if n/N < .05. is the finite correction factor. is also referred to as the standard error of the mean.

The Shape of Sampling Distribution of If the shape of the distribution of x in the population is normal, the shape of the sampling distribution of is normal as well. If the shape of the distribution of x in the population is approximately normal, the shape of the sampling distribution of is approximately normal as well. If the shape of the population is not approximately normal then If n is small, the shape of the sampling distribution of is unpredictable. If n is large (n≥ 30), the shape of the sampling distribution of can be assumed to be approximately normal.

Sampling Distribution of for the air quality example when the population is (almost) infinite

Sampling Distribution of for the air quality example when the population is finite

Relationship Between the Sample Size and the Sampling Distribution of E( ) = m regardless of the sample size. In our example, E( ) remains at 40.9. Whenever the sample size is increased, the standard error of the mean is decreased. With the increase in the sample size to n = 100, the standard error of the mean decreases.

Relationship Between the Sample Size and the Sampling Distribution of

Sampling Distribution of If we use a large random sample (n>30), then the sampling distribution of can be approximated by the normal distribution. If the sample is small, then the sampling distribution of can be normal only if we assume that our population has a normal distribution.

Sampling Distribution of for the air quality Index when n = 5. What is the probability that a simple random sample of 5 applicants will provide an estimate of the population mean air quality index that is within +/-2 of the actual population mean, μ? In other words, what is the probability that will be between 38.9 and 42.9?

Sampling Distribution of for the air quality Index when n = 100. What is the probability that a simple random sample of 100 applicants will provide an estimate of the population mean air quality index that is within +/-2 of the actual population mean, μ?

Relationship Between the Sample Size and the Sampling Distribution of Because the sampling distribution with n = 100 has a smaller standard error, the values of have less variability and tend to be closer to the population mean than the values of with n = 5. Basically, a given interval with smaller standard error (larger n) will cover more area under the normal curve than the same interval with larger standard error (smaller n).

Chapter 7, Part B Sampling and Sampling Distributions Sampling Distribution of

of n elements is selected Sampling Distribution of Making Inferences about a Population Proportion Population with proportion p = ? A simple random sample of n elements is selected from the population. The value of is used to make inferences about the value of p. The sample data provide a value for the sample proportion .

Sampling Distribution of The sampling distribution of is the probability distribution of all possible values of the sample proportion . Expected Value of where: p = the population proportion

Sampling Distribution of Standard Deviation of Finite Population Infinite Population is referred to as the standard error of the proportion.

Form of Sampling Distribution of The sampling distribution of can be approximated by a normal distribution whenever the sample size is large: Central Limit Theorem (CLT). The sample size is considered large whenever these conditions are satisfied: np > 5 and n(1 – p) > 5

Chapter 8: Interval Estimation Population Mean: s Known Population Mean: s Unknown

Margin of Error and the Interval Estimate A point estimator cannot be expected to provide the exact value of the population parameter. An interval estimate can be computed by adding and subtracting a margin of error to the point estimate. Point Estimate +/- Margin of Error The purpose of an interval estimate is to provide information about how close the point estimate is to the value of the parameter.

Margin of Error and the Interval Estimate The general form of an interval estimate of a population mean is In order to develop an interval estimate of a population mean, the margin of error must be computed using either: the population standard deviation s , or the sample standard deviation s These are also Confidence Interval.

Interval Estimate of a Population Mean: s Known Interval Estimate of m where: is the sample mean 1 - is the confidence coefficient z/2 is the z value providing an area of /2 in the upper tail of the standard normal probability distribution s is the population standard deviation n is the sample size

Interval Estimation of a Population Mean: s Known There is a 1 -  probability that the value of a sample mean will provide a margin of error of or less. Sampling distribution of 1 -  of all values /2 /2 

Summary of Point Estimates Obtained from a Simple Random Sample Population Parameter Parameter Value Point Estimator Point Estimate m = Population mean 40.9 = Sample mean s = Population std. deviation 20.5 s = Sample std. deviation ……. p = Population pro- portion .62 = Sample pro- portion

Example: Air Quality Consider our air quality example. Suppose the population is approximately normal with μ = 40.9 and σ = 20.5. This is σ known case. If you guys remember, we picked a sample of size 5 (n =5). Given all this information, What is the margin of error at 95% confidence level?

Example: Air Quality What is the margin of error at 95% confidence level. We can say with 95% confidence that population mean (μ) is between ± 18 of the sample mean. With 95% confidence, μ is between …. and …...

Interval Estimation of a Population Mean:s Unknown If an estimate of the population standard deviation s cannot be developed prior to sampling, we use the sample standard deviation s to estimate s . This is the s unknown case. In this case, the interval estimate for m is based on the t distribution. (We’ll assume for now that the population is normally distributed.)

t Distribution The t distribution is a family of similar probability distributions. A specific t distribution depends on a parameter known as the degrees of freedom. Degrees of freedom refer to the number of independent pieces of information that go into the computation of s.

t Distribution A t distribution with more degrees of freedom has less dispersion. As the number of degrees of freedom increases, the difference between the t distribution and the standard normal probability distribution becomes smaller and smaller.

t Distribution t distribution (20 degrees Standard of freedom) normal z, t

t Distribution For more than 100 degrees of freedom, the standard normal z value provides a good approximation to the t value. The standard normal z values can be found in the infinite degrees ( ) row of the t distribution table.

t Distribution Standard normal z values

Interval Estimation of a Population Mean: s Unknown Interval Estimate where: 1 - = the confidence coefficient t/2 = the t value providing an area of /2 in the upper tail of a t distribution with n - 1 degrees of freedom s = the sample standard deviation

Example: Air quality when σ is unknown Now suppose that you did not know what σ is. You can estimate using the sample and then use t-distribution to find the margin of error. What is 95% confidence interval in this case? The sample size n =5. So, the degrees of freedom for the t-distribution is 4. The level of significance ( ) is 0.05. s = ……

Summary of Interval Estimation Procedures for a Population Mean Can the population standard deviation s be assumed known ? Yes No Use the sample standard deviation s to estimate σ s Known Case Use Use s Unknown Case

Interval Estimation of a Population Proportion The general form of an interval estimate of a population proportion is

Interval Estimation of a Population Proportion Interval Estimate where: 1 - is the confidence coefficient z/2 is the z value providing an area of /2 in the upper tail of the standard normal probability distribution is the sample proportion