Review of Basic Statistical Concepts

Slides:



Advertisements
Similar presentations
1.  Why understanding probability is important?  What is normal curve  How to compute and interpret z scores. 2.
Advertisements

BPS - 3rd Ed. Chapter 131 Confidence intervals: the basics.
1 (Student’s) T Distribution. 2 Z vs. T Many applications involve making conclusions about an unknown mean . Because a second unknown, , is present,
GS/PPAL Section N Research Methods and Information Systems A QUANTITATIVE RESEARCH PROJECT - (1)DATA COLLECTION (2)DATA DESCRIPTION (3)DATA ANALYSIS.
Standard error of estimate & Confidence interval.
Confidence Interval A confidence interval (or interval estimate) is a range (or an interval) of values used to estimate the true value of a population.
Review of normal distribution. Exercise Solution.
Chapter 11: Estimation Estimation Defined Confidence Levels
Estimation of Statistical Parameters
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Confidence Interval Estimation.
1 Chapter 6. Section 6-1 and 6-2. Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Statistical estimation, confidence intervals
Sampling distributions rule of thumb…. Some important points about sample distributions… If we obtain a sample that meets the rules of thumb, then…
+ DO NOW. + Chapter 8 Estimating with Confidence 8.1Confidence Intervals: The Basics 8.2Estimating a Population Proportion 8.3Estimating a Population.
1 Chapter 6. Section 6-1 and 6-2. Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
Review of Basic Statistical Concepts. Normal distribution Population Mean: μ and Standard Deviation: σ.
Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved. Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and.
Copyright © 1998, Triola, Elementary Statistics Addison Wesley Longman 1 Estimates and Sample Sizes Chapter 6 M A R I O F. T R I O L A Copyright © 1998,
1 Estimation of Population Mean Dr. T. T. Kachwala.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
POLS 7000X STATISTICS IN POLITICAL SCIENCE CLASS 5 BROOKLYN COLLEGE-CUNY SHANG E. HA Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for.
Ch 8 Estimating with Confidence 8.1: Confidence Intervals.
ESTIMATION OF THE MEAN. 2 INTRO :: ESTIMATION Definition The assignment of plausible value(s) to a population parameter based on a value of a sample statistic.
10.1 Estimating with Confidence Chapter 10 Introduction to Inference.
Dr.Theingi Community Medicine
Chapter 8 Confidence Interval Estimation Statistics For Managers 5 th Edition.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
CHAPTER 6: SAMPLING, SAMPLING DISTRIBUTIONS, AND ESTIMATION Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for a Diverse Society.
Confidence Intervals.
GS/PPAL Research Methods and Information Systems
Inference: Conclusion with Confidence
Chapter 8: Estimating with Confidence
Confidence Intervals for Proportions
Lecture 9-I Data Analysis: Bivariate Analysis and Hypothesis Testing
CHAPTER 8 Estimating with Confidence
Chapter 8: Estimating with Confidence
This will help you understand the limitations of the data and the uses to which it can be put (and the confidence with which you can put it to those.
ESTIMATION.
Inference: Conclusion with Confidence
Confidence Intervals for Proportions
Estimating the Population Mean Income of Lexus Owners
CHAPTER 8 Estimating with Confidence
Week 10 Chapter 16. Confidence Intervals for Proportions
Statistics in Applied Science and Technology
CHAPTER 8 Estimating with Confidence
CONCEPTS OF ESTIMATION
CHAPTER 8 Estimating with Confidence
Arithmetic Mean This represents the most probable value of the measured variable. The more readings you take, the more accurate result you will get.
Essential Statistics Introduction to Inference
Chapter 10: Estimating with Confidence
Chapter 8: Estimating with Confidence
CHAPTER 8 Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
CHAPTER 8 Estimating with Confidence
CHAPTER 8 Estimating with Confidence
CHAPTER 8 Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
CHAPTER 16: Inference in Practice
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
CHAPTER 8 Estimating with Confidence
CHAPTER 8 Estimating with Confidence
Chapter 8: Estimating with Confidence
Presentation transcript:

Review of Basic Statistical Concepts

Statistical Literacy means knowing… …how to read rates and percentages: e.g., percentage of MPPAL students who are full-time employees versus the percentage of full-time employees who are MPPAL students …how to interpret different definitions of a group e.g., which rate is bigger? Child birth rate among women or child birth rate among women ages 20-44? …the difference between (1) deterministic causes and (2) probabilistic causes: e.g., (1) gravity causes the pen to fall and (2) drunk driving causes automobile accidents

Probability The likelihood or chance an event will occur Expressed as a number between 0 (impossibility) and 1 (certainty) or as a percentage between 0 (impossibility) and 100% (certainty) Objectivist Frame: when repeating an experiment, how often does the event occur? Subjectivist Frame: what is the degree of belief in the likelihood of an event occurring (e.g., Bayesian probability)

Normal distribution Population Mean: μ and Standard Deviation: σ

Describing the Normal Distribution If the mean μ = 0 and σ2 = 1 (so σ = 1) and μ is normally distributed then then 95.4% of the values will fall between: μ ± 2*σ = μ ± 2 95% will fall between a slightly smaller interval: μ ± 1.96 90% will fall between μ ± 1.645 99% will fall between μ ± 2.576 99.7% will fall between μ ± 3 What percentage of all conceivable means will lie between -1.96 and +1.96?  95% 95% “Confidence Interval” is the interval one has the confidence will contain the population mean (μ) 95% of the time

95% Confidence Interval and Confidence Level What is the probability that we will observe a mean value for μ that lies outside of our 95% Confidence Interval? 5% or 0.05 Confidence Level is noted as α = 0.05 for a 95% Confidence Interval For α = 0.05, σ = 1 the Confidence Level = 1.96 When μ = 0; 95% of the population means will fall within (-1.96 and +1.96) For μ = 6.5 and Confidence Level (95%) = 0.47, what is the 95% Confidence Interval?

The Challenge We can never know if we are observing the “true” population mean since any observed population mean will deviate plus or minus σ (= a “standard deviation”) Any census of a graduating class will only be a sample of the “true” population of all graduating classes Uncertainty Source #1: If we are seeking to explain the key factors determining the CGPA of graduates, we have to account for the fact that the observed CGPA might deviate from the true mean by σ Uncertainty Source #2: If it is infeasible to conduct a census of all graduating students in even one year, and all we can do is sample the sample, then we have additional uncertainty related to the size of the sample

Uncertainty from Sampling We know there is one inescapable source of uncertainty (Uncertainty Source #1) The sampling error (Uncertainty Source #2) complicates this uncertainty … but in a predictable way. We know as the sample size increases the standard deviation in the observed mean values (the “standard error of the mean” denoted “s”) will approach the true standard deviation (σ) of the theoretical population: s = σ/n1/2

Margin of Error Whenever a sample is less than the census, there is a chance that s ≠ σ and the sample mean (avgX) ≠ μ If the sample observations X follow a “Student t” distribution, the logic of the Confidence Interval follows but some of the vocabulary changes We are interested in the probability that avgX from our sample will equal μ The probability that avgX ≠ μ is the Margin of Error If the Margin of Error = 5%, then the sample interval about the mean would include μ in 95% of similar samples

Confidence Interval, Margin of Error and Sample Size For a Margin of Error = 5%, we obtain a 95% Confidence Interval around the sample mean For an approximate normal distribution, the standard error is = s = σ/n1/2 Then the 95% Confidence Interval = avgX ± 1.96*s or the interval (-1.96*s and + 1.96*s) is 2*1.96*s = 3.92*s units wide (= 3.92*σ/n1/2 ) If σ is known and the Confidence Interval specified, one could precisely calculate the sample size needed from the above information Then the 95% Confidence Interval = avgX ± 1.96*s or the interval (-1.96*s and + 1.96*s) is 2*1.96*s = 3.92*s = 3.92*σ/n1/2 where 1.96 is the critical Z value for 95% If we know the population standard deviation, Generally, n = 1/B2 where B is the margin of error. To limit the Margin of Error to 5%, we need a sample size n = 400 [= 1/(.05)2] For a larger Margin of Error at say 10%, we need n = 100 [= 1/(.01)2] If n=50, we have a Margin of Error of about 50%

Sample Size when σ is unknown When the true standard deviation of the population is unknown, an indirect method of determining the sample size yields: For a 95% confidence interval and Margin of Error of … … 5%, a sample of 400 (n = 400) is needed … 10%, n = 100 … 3%, n = 1000 … 1%, n = 10,000