Introduction to Inferential Statistics. Introduction  Researchers most often have a population that is too large to test, so have to draw a sample from.

Slides:



Advertisements
Similar presentations
Chapter 6 Introduction to Inferential Statistics
Advertisements

A Sampling Distribution
Estimation in Sampling
Sampling Distributions and Sample Proportions
Psych 5500/6500 The Sampling Distribution of the Mean Fall, 2008.
Terminology A statistic is a number calculated from a sample of data. For each different sample, the value of the statistic is a uniquely determined number.
Chapter 10: Sampling and Sampling Distributions
Topics: Inferential Statistics
PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.
1 The Basics of Regression Regression is a statistical technique that can ultimately be used for forecasting.
Sampling Distributions
Part III: Inference Topic 6 Sampling and Sampling Distributions
1.  Why understanding probability is important?  What is normal curve  How to compute and interpret z scores. 2.
Sampling and Sampling Distributions Aims of Sampling Probability Distributions Sampling Distributions The Central Limit Theorem Types of Samples.
Chapter 7 Probability and Samples: The Distribution of Sample Means
Chapter 5 DESCRIBING DATA WITH Z-SCORES AND THE NORMAL CURVE.
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
© Copyright McGraw-Hill CHAPTER 6 The Normal Distribution.
F OUNDATIONS OF S TATISTICAL I NFERENCE. D EFINITIONS Statistical inference is the process of reaching conclusions about characteristics of an entire.
A Sampling Distribution
Dan Piett STAT West Virginia University
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Review: Two Main Uses of Statistics 1)Descriptive : To describe or summarize a collection of data points The data set in hand = all the data points of.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Chapter 6 Introduction to Inferential Statistics: Sampling and the Sampling Distribution.
Chapter 11 – 1 Chapter 7: Sampling and Sampling Distributions Aims of Sampling Basic Principles of Probability Types of Random Samples Sampling Distributions.
Chapter 7: Sampling and Sampling Distributions
Chapter 7: Sample Variability Empirical Distribution of Sample Means.
Sampling Neuman and Robson Ch. 7 Qualitative and Quantitative Sampling.
Sampling Distribution and the Central Limit Theorem.
The Central Limit Theorem and the Normal Distribution.
Determination of Sample Size: A Review of Statistical Theory
Chapter 7 Probability and Samples: The Distribution of Sample Means.
Distributions of the Sample Mean
BUS216 Spring  Simple Random Sample  Systematic Random Sampling  Stratified Random Sampling  Cluster Sampling.
Chapter Thirteen Copyright © 2004 John Wiley & Sons, Inc. Sample Size Determination.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 7-1 Chapter 7 Sampling Distributions Basic Business Statistics.
Chapter 10: Introduction to Statistical Inference.
Chapter 6 Introduction to Inferential Statistics: Sampling and the Sampling Distribution.
Confidence Interval Estimation For statistical inference in decision making:
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a measure of the population. This value is typically unknown. (µ, σ, and now.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 7-1 Chapter 7 Sampling and Sampling Distributions Basic Business Statistics 11 th Edition.
Chapter 5 Sampling Distributions. The Concept of Sampling Distributions Parameter – numerical descriptive measure of a population. It is usually unknown.
Chapter 7: The Distribution of Sample Means. Frequency of Scores Scores Frequency.
Basic Business Statistics
Copyright © 2012 by Nelson Education Limited. Chapter 5 Introduction to inferential Statistics: Sampling and the Sampling Distribution 5-1.
POLS 7000X STATISTICS IN POLITICAL SCIENCE CLASS 5 BROOKLYN COLLEGE-CUNY SHANG E. HA Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for.
Warsaw Summer School 2015, OSU Study Abroad Program Normal Distribution.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
Chapter 7 Introduction to Sampling Distributions Business Statistics: QMIS 220, by Dr. M. Zainal.
Chapter 6: Descriptive Statistics. Learning Objectives Describe statistical measures used in descriptive statistics Compute measures of central tendency.
Warsaw Summer School 2014, OSU Study Abroad Program Sampling Distribution.
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a number that describes the population. In statistical practice, the value of.
Chapter 8 Sampling Methods and the Central Limit Theorem.
CHAPTER 6: SAMPLING, SAMPLING DISTRIBUTIONS, AND ESTIMATION Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for a Diverse Society.
Chapter 7 Probability and Samples
Introduction to Inference
Sampling Why use sampling? Terms and definitions
Statistical Analysis Urmia University
Sample Size Determination
Chapter 6, Introduction to Inferential Statistics
Statistics in Applied Science and Technology
The normal distribution
Chapter 7 Sampling Distributions
Sampling Distributions
Warsaw Summer School 2017, OSU Study Abroad Program
Chapter 7: The Distribution of Sample Means
Chapter 4 (cont.) The Sampling Distribution
Presentation transcript:

Introduction to Inferential Statistics

Introduction  Researchers most often have a population that is too large to test, so have to draw a sample from the population  Researchers collect a random sample from the population and generalize from the known characteristics of the sample to the unknown population

Random Sampling  To use inferential statistical techniques, it is required that samples be randomly drawn from the population of interest Non-random samples can be used for exploratory research  Meaning that you are going to explore a topic to see what variables are important to the topic  But, the conclusions cannot be generalized to the population  Random sampling requires a precise process of selection  Need to remember that randomness is not representativeness Because a sample is random does not guarantee that it is an exact representation of the population

Random Sampling  The probability is high that a randomly selected sample will be representative  The good thing about inferential statistics is that they allow you to state the probability of this type of error very precisely

Ways in Which Random Samples are Gathered

Simple Random Sample  For a simple random sample, each case and each combination of cases in the population must have an equal probability of being chosen for the sample  This kind of sample is used when you have a complete list of all cases in the population  Most researchers use tables of random numbers to select cases  This would be extremely time consuming for a large sample, so another technique is used

Systematic Sampling  Needed for large populations  Only the first case is randomly selected  After that, every kth case is selected Will choose the first case from the table of random numbers, then choose every 10 th case, or however many you need to reach a sample of the size you want Will divide your population size by the sample size to find the distance to the next score for your sample  This is not a random sample Need to make sure the list of the population is random

Stratified Sample  Proportional Stratified Sample This is used if you want to guarantee a representation of certain categories of cases (the same percentage as in the population) If you want to compare chemistry majors to criminology majors  You first ask students their major  Then you put them all in the same list sorted by major, then begin your random sample from the list  All majors will be included if your sample is large enough, but each would be included in the proportion they are in the original population Will have more criminology majors

Stratified Sample  Disproportionate Stratified Sampling If you need exactly the same number of students from each major  You separate the students by major  Then have to change your sampling fraction to account for differences in number of cases  If you need a sample of 50, and there are 100 zoology majors, you would choose every other one to get exactly 50 cases Problem with this is you cannot generalize directly to the population, since your sample will never be representative  The biggest problem in sampling is that there is no complete list of most populations

Cluster Sample  Used when there is no list of the members of the population  It involves random selection of geographical units (states, neighborhoods, or blocks)  And will test every case within the last geographical units  Not a random sample, so not as trustworthy, but cheaper to do

The Sampling Distribution

Introduction  Researchers have a great deal of information about the sample distribution, but they know nothing about the population  It is the population that is of interest We do not want to know what 2,000 people think out of the 100,000,000 or so adults in the U.S.  What you would want to know about the distribution of the population The shape of the distribution Some measure of central tendency Some measure of dispersion

The Normal Curve  You need to know the properties of the normal curve, which are based on the laws of probability, to find out information about the population from the sample  To do this, you use a device known as the sampling distribution It bridges the gap between the sample and the population The sampling distribution is the central concept in inferential statistics, so you need to understand the concept

Three Distributions  The sample distribution This is empirical (observed) and known It is collected by researchers and used to learn about the population  The population distribution It exists in reality, so is empirical, but it is unknown to the researcher The sole purpose of inferential statistics is to make inferences (meaning draw conclusions) about the population distribution

The Sampling Distribution  This is nonempirical (theoretical) Theoretical, since you only do one sample, and the sampling distribution is based on an infinite number of samples taken from that population  Laws of probability tell us much about this distribution  Theoretically, if you drew an infinite number of samples from a population Then you only computed the mean of each sample And you put the means on a graph to form a frequency polygon

The Sampling Distribution  We know that each sample mean will be slightly different Since each sample is not an exact representation of the population We know that most of the sample means will cluster around the true population value

Two Theorems About the Sampling Distribution  If repeated random samples of size N are drawn from a normal population with mean µ and standard deviation σ, then the sampling distribution of sample means will be normal with a mean µ and a standard deviation of σ /the square root of N So the mean of the sampling distribution will be the same as the mean of the population

Theorems  Since the samples are random, the means should miss an equal number of times on either side of the population value Making the distribution symmetrical A normal curve with a bell shape So, we know about the shape of the sampling distribution

Dispersion of the Sampling Distribution  We can also tell something about the dispersion (specifically the standard deviation) of the sampling distribution  The formula for the standard deviation of the sampling distribution is represented by the symbol σ/the square root of N Which is the standard deviation of the population divided by the square root of N

Dispersion of the Sampling Distribution  What this tells you is that in comparing a sampling distribution with a population distribution, there will always be more variance in the population distribution  As the sample size gets larger, the variance of the sampling distribution will get smaller (N = the number in the sample)  The above theorem applies to populations that are normally distributed on a particular variable

Central Limit Theorem  This second theorem is needed if the population distribution is not normal  If repeated random samples of size N are drawn from any population, with mean µ and standard deviation σ, then as N becomes large, the sampling distribution of sample means will approach normality, with mean µ and standard deviation σ/the square root of N

Large Samples  What, exactly, is meant by large A good rule of thumb is that if N is 100 or more, the Central Limit Theorem applies, and you can assume that the sampling distribution is normal in shape