INFERENTIAL STATISTICS. By using the sample statistics, researchers are usually interested in making inferences about the population. INFERENTIAL STATISICS.

Slides:



Advertisements
Similar presentations
Estimation of Means and Proportions
Advertisements

Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 9 Inferences Based on Two Samples.
Chapter 6 Confidence Intervals.
Previous Lecture: Distributions. Introduction to Biostatistics and Bioinformatics Estimation I This Lecture By Judy Zhong Assistant Professor Division.
Sampling: Final and Initial Sample Size Determination
Biostatistics Practice2. Example: A study shows that 70% of all patients coming to a certain medical clinic have to wait at least fifteen minutes to see.
Chapter 11- Confidence Intervals for Univariate Data Math 22 Introductory Statistics.
STATISTICAL INFERENCE PART V
ESTIMATION AND HYPOTHESIS TESTING
Chapter 7 Introduction to Sampling Distributions
Math 161 Spring 2008 What Is a Confidence Interval?
Ch 6 Introduction to Formal Statistical Inference.
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
BCOR 1020 Business Statistics Lecture 17 – March 18, 2008.
Estimation Procedures Point Estimation Confidence Interval Estimation.
Chapter 6 Introduction to Sampling Distributions
Chapter 7 Sampling and Sampling Distributions
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 6 Introduction to Sampling Distributions.
Need to know in order to do the normal dist problems How to calculate Z How to read a probability from the table, knowing Z **** how to convert table values.
5-3 Inference on the Means of Two Populations, Variances Unknown
SAMPLING METHODS. Reasons for Sampling Samples can be studied more quickly than populations. A study of a sample is less expensive than studying an entire.
Inferential Statistics
5.4 The Central Limit Theorem Statistics Mrs. Spitz Fall 2008.
Chapter 6: Sampling Distributions
Review of normal distribution. Exercise Solution.
Estimation Goal: Use sample data to make predictions regarding unknown population parameters Point Estimate - Single value that is best guess of true parameter.
Statistical inference: confidence intervals and hypothesis testing.
Chapter 6 Confidence Intervals.
STATISTICAL INFERENCE PART VII
Dan Piett STAT West Virginia University
Understanding Inferential Statistics—Estimation
Estimation of Statistical Parameters
Lecture 14 Sections 7.1 – 7.2 Objectives:
ESTIMATION. STATISTICAL INFERENCE It is the procedure where inference about a population is made on the basis of the results obtained from a sample drawn.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Estimation Bias, Standard Error and Sampling Distribution Estimation Bias, Standard Error and Sampling Distribution Topic 9.
Estimates and Sample Sizes Lecture – 7.4
PARAMETRIC STATISTICAL INFERENCE
University of Ottawa - Bio 4118 – Applied Biostatistics © Antoine Morin and Scott Findlay 08/10/ :23 PM 1 Some basic statistical concepts, statistics.
Ch 6 Introduction to Formal Statistical Inference
Statistical estimation, confidence intervals
Sampling Error.  When we take a sample, our results will not exactly equal the correct results for the whole population. That is, our results will be.
Biostatistics Unit 5 – Samples. Sampling distributions Sampling distributions are important in the understanding of statistical inference. Probability.
Chapter Outline 2.1 Estimation Confidence Interval Estimates for Population Mean Confidence Interval Estimates for the Difference Between Two Population.
BPS - 3rd Ed. Chapter 131 Confidence Intervals: The Basics.
© Copyright McGraw-Hill 2000
CHAPTER SEVEN ESTIMATION. 7.1 A Point Estimate: A point estimate of some population parameter is a single value of a statistic (parameter space). For.
§ 5.3 Normal Distributions: Finding Values. Probability and Normal Distributions If a random variable, x, is normally distributed, you can find the probability.
Sampling distributions rule of thumb…. Some important points about sample distributions… If we obtain a sample that meets the rules of thumb, then…
Chapter 8: Confidence Intervals based on a Single Sample
Confidence Interval Estimation For statistical inference in decision making:
Statistics and Quantitative Analysis U4320 Segment 5: Sampling and inference Prof. Sharyn O’Halloran.
Chapter 8: Confidence Intervals based on a Single Sample
Chapter 7 Estimates, Confidence Intervals, and Sample Sizes
Confidence Intervals for a Population Mean, Standard Deviation Unknown.
8.1 Estimating µ with large samples Large sample: n > 30 Error of estimate – the magnitude of the difference between the point estimate and the true parameter.
Section 6.2 Confidence Intervals for the Mean (Small Samples) Larson/Farber 4th ed.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 3 – Slide 1 of 27 Chapter 11 Section 3 Inference about Two Population Proportions.
6-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Confidence Intervals. Point Estimate u A specific numerical value estimate of a parameter. u The best point estimate for the population mean is the sample.
Confidence Intervals Dr. Amjad El-Shanti MD, PMH,Dr PH University of Palestine 2016.
Probability & Statistics Review I 1. Normal Distribution 2. Sampling Distribution 3. Inference - Confidence Interval.
Class Six Turn In: Chapter 15: 30, 32, 38, 44, 48, 50 Chapter 17: 28, 38, 44 For Class Seven: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 Read.
Chapter 6: Sampling Distributions
Chapter 6 Inferences Based on a Single Sample: Estimation with Confidence Intervals Slides for Optional Sections Section 7.5 Finite Population Correction.
ESTIMATION.
Chapter 6: Sampling Distributions
Statistics in Applied Science and Technology
CONCEPTS OF ESTIMATION
Determining Which Method to use
Presentation transcript:

INFERENTIAL STATISTICS

By using the sample statistics, researchers are usually interested in making inferences about the population. INFERENTIAL STATISICS. This branch of statistics is referred to as INFERENTIAL STATISICS. INFERENTIAL STATISICS i.Estimation ii.Hypothesis testing

The distribution of values for that sample statistic obtained from all possible samples of a population. The samples must all be the same size, and the sample statistic could be any descriptive sample statistic. Sampling Distribution The distribution is used for judging the quality (precision) of that estimate.

Example As a hypothetical example, suppose that in a population of 5 hospitalized patients, the length of hospitalization (in days) are given x 1 =5 x 2 =7 x 3 =4 x 4 =5 x 5 =3 Assume that a sample of size 2 will be selected. The list of all possible samples of size2, and the estimated avarege length of hospitalization per subject will be as follows

x 4,x 5 x 3,x 5 x 3,x 4 x 2,x 5 x 2,x 4 x 2,x 3 x 1,x 5 x 1,x 4 x 1,x 3 x 1,x 2 Estimated Average Possible Samples The distribution of sample means is called the Sampling Distribution of mean. This distribution is a Normal Distribution.

x 4,x 5 x 3,x 5 x 3,x 4 x 2,x 5 x 2,x 4 x 2,x 3 x 1,x 5 x 1,x 4 x 1,x 3 x 1,x 2 Estimated AveragePossible Samples If say x 4 were 15 instead of 5. What would change?

x 3,x 4,x 5 x 2,x 4,x 5 x 2,x 3,x 5 x 2,x 3,x 4 x 1,x 4,x 5 x 1,x 3,x 5 x 1,x 3,x 4 x 1,x 2,x 5 x 1,x 2, x 4 x 1,x 2,x 3 Estimated AveragePossible Samples Now, suppose that a sample of size 3 (n=3) is going to be selected from this population. List of all posible samples of size 3, and estimated average length of hospitalization will be:

x 3,x 4,x 5 x 2,x 4,x 5 x 2,x 3,x 5 x 2,x 3,x 4 x 1,x 4,x 5 x 1,x 3,x 5 x 1,x 3,x 4 x 1,x 2,x 5 x 1,x 2, x 4 x 1,x 2,x 3 Possible Samples Estimated Average Again assuming that x 4 =15, sample means are

A Normal Distribution is characterized by its mean and its standard deviation (or its variance). The mean of the sampling distribution of the sample mean is always equal to the population mean.

The population mean,  (which is unknown) in our hypothetical example is When n=2 mean of the sample means  =( )/5=4.8 days  =( )/5=6.8 days =( )/10=48/10=4.8 =( )/10=68/10=6.8 When n=3 mean of the sample means =( )/10=48/10=4.8 =( )/10=68/10=6.8

The scatter of all possible sample means, around the unknown population mean is measured by the variance (or standard deviation) of the sampling distribution of the sample means. The variance of all possible sample means is directly proportional with the population variance (  2 ) and inversly proportional with the sample size (n). Thus the variance of the sample means: the standard error of the mean: The standard deviation of sample means is also known as

When  (the population standard deviation) is unknown, it can be estimated from the sample. Then, the estimated standard error

The distribution of sample mean will possess the following properties: 1.The distribution of will be normal. 2.The mean of the distribution of will be equal to the mean of the population from which the samples were drawn. 3.The variance of the distribution of will be equal to the variance of the population divided by the sample size, Its square root, the standard deviation is referred to as the standard error

n=4n=3

How we can use these properties: Remember x i ~ N( ,  )  P(X  x 0 ) can be found by transforming x to z, where Instead of individual x i values, if our concern is the probability that the mean of a sample of size n, is greater than (or smaler than) x 0, that is P(  x 0 ) can be calculated similar to the previous transformation:

Individual x i ’sSample Mean

Example (Regarding X i ’s) If the mean and standard deviation of serum iron values for healty men are 120 and 15  gr/100 ml., respectively, what is the probability that a random selected healty man will have a serum iron value less than 115  gr/100 ml.? P(X i  115)=? P(Z i  -0.33)=0.37

(Regarding means, ‘s) If the mean and standard deviation of serum iron values for healty men are 120 and 15  gr/100 ml., respectively, what is the probability that a random sample of 50 men will yield a mean less than 115  gr/100 ml.? P(  115)=? P(Z i  -2.36)=0.009

The above properties can also be used to make INTERVAL ESTIMATION. A point estimation is a single numerical value used to estimate the corresponding population parameter  estimates An interval estimate consists of two numerical values defining an interval which, with a specified degree of confidence, we feel includes the parameter being estimated. This interval is also named as “The Confidence Interval” for a parameter.

Example The distribution of ‘s are normal. The shaded area covers 95% of the total area. In other words, 95% of the sample means, of the same size, fall within this interval

Example Suppose we wish to estimate the average score on the 2nd committee exam. To do this, a random sample of 10 students is selected with the following scores: 64, 37, 78, 73, 44, 59, 81, 51, 33, 46. From previous years experience, I can assume that the standad deviation of scores is approximately equal to 17. Point estimate of the population mean:

95% CI for the population mean: 90% CI for the population mean:

STANDARD NORMAL DISTRIBUTION TABLE P(0<Z<z i )

The t-distribution In the previous sections procedures were outlined for constructing CI for a population mean. These procedures require a knowledge about the population standard deviation, , from which the sample is drawn. This condition presents a problem when constructing confidence intervals and when finding probabilities related with the sample mean.

is normally distributed When  is unknown, use sample standard deviation to replace  The quantityfollows a t-distribution. Like the standard normal distribution, the t-distribution is also tabulated

When  is unknown, its estimate, the sample standard deviation is used to construct CI’s. The inequality becomes:

Example A researcher wishes to estimate the average age of the mother at first birth. He selects 10 mothers at random, and gathers the following data: Mother No Age at first birth

Point estimate of the population mean: Sample standard deviation: Estimated standard eror of the mean:

If the researcher wishes to be 95% confident in his estimate:

DISTRIBUTION OF THE SAMPLE PROPORTION From the population, take all possible samples of a given size and for each sample compute the sample proportion, p. The frequency distribution of p is normal. The mean of the distribution,  p, is equal to the population proportion, P. The standard deviation of the distribution (standad error of proportion) is equal to

How we can use this property: Example: To find some probabilities related to the sample proportion If x i ’s are normally distributed: Since ’s are normally distributed: Since p’s are normally distributed:

Example: Suppose we know that in a certain human population 8% are colorblind. If we randomly select 50 individuals from this population what is the probability that the proportion in the sample who are colorblind will be more than 10%?

CONFIDENCE INTERVAL FOR A POPULATION PROPORTION When P, population proportion is unknown, its estimate, the sample proportion, p can be used.

A researcher wishes to estimate, with 95% confidence, the proportion of woman who are at or below 20 years of age at first birth. Example

Point estimate of the population proportion: p=a/n=4/10=0.4 Estimated standard error of the mean:

In the above example if the sample size were 100 instead of 10, then the 95% confidence interval would be:

Among 250 students of Hacettepe University interwieved 185 responded that they reqularly read a daily newspaper. With 95% confidence, find an interval within which the proportion of students who regularly read a newspaper in Hacettepe University lie. Point estimate of the proportion of students who read a newspaper. The standard error of the estimate is

In oder words, the standard deviation of the proportions that can be computed from all possible samples of size 250 is The 95% Confidence Interval is: