Sampling distributions:

Slides:



Advertisements
Similar presentations
Chapter 7: The Distribution of Sample Means
Advertisements

Psychology: A Modular Approach to Mind and Behavior, Tenth Edition, Dennis Coon Appendix Appendix: Behavioral Statistics.
Objectives Look at Central Limit Theorem Sampling distribution of the mean.
Review: What influences confidence intervals?
1 Trust and divorce Separated or Divorced trust | No Yes | Total Low | | 247 | |
What z-scores represent
Sampling distributions:. In Psychology we generally make inferences about populations on the basis of samples. We therefore need to know what relationship.
Cal State Northridge  320 Ainsworth Sampling Distributions and Hypothesis Testing.
Sampling distributions:
Chapter Sampling Distributions and Hypothesis Testing.
PROBABILITY AND SAMPLES: THE DISTRIBUTION OF SAMPLE MEANS.
PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 5 Chicago School of Professional Psychology.
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Overview Central Limit Theorem The Normal Distribution The Standardised Normal.
Chapter 11: Random Sampling and Sampling Distributions
Chapter 5 DESCRIBING DATA WITH Z-SCORES AND THE NORMAL CURVE.
Probability and the Sampling Distribution Quantitative Methods in HPELS 440:210.
Hypothesis Testing:.
From Last week.
1 Today Null and alternative hypotheses 1- and 2-tailed tests Regions of rejection Sampling distributions The Central Limit Theorem Standard errors z-tests.
AP Statistics Chapter 9 Notes.
From last lecture (Sampling Distribution): –The first important bit we need to know about sampling distribution is…? –What is the mean of the sampling.
Inferential statistics.  Summary from last week  The normal distribution  Hypothesis testing  Type I and II errors  Statistical power  Exercises.
Probability and Samples
University of Ottawa - Bio 4118 – Applied Biostatistics © Antoine Morin and Scott Findlay 08/10/ :23 PM 1 Some basic statistical concepts, statistics.
Probability.  Provides a basis for thinking about the probability of possible outcomes  & can be used to determine how confident we can be in an effect.
Chapter 7 Probability and Samples: The Distribution of Sample Means.
Lecture 2 Review Probabilities Probability Distributions Normal probability distributions Sampling distributions and estimation.
Chapter 7 Sampling Distributions Statistics for Business (Env) 1.
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
Higher National Certificate in Engineering Unit 36 –Lesson 4 – Parameters used to Describe the Normal Distribution.
Chapter 7: The Distribution of Sample Means. Frequency of Scores Scores Frequency.
From the population to the sample The sampling distribution FETP India.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
m/sampling_dist/index.html.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
Experimental Research
And distribution of sample means
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Normal Distribution.
Distribution of the Sample Means

Statistical Reasoning in Everyday Life
Summary descriptive statistics: means and standard deviations:
Week 10 Chapter 16. Confidence Intervals for Proportions
Higher National Certificate in Engineering
Probability and the Sampling Distribution
Review: What influences confidence intervals?
Summary descriptive statistics: means and standard deviations:
Sampling Distributions
Chapter 10: Estimating with Confidence
Sampling Distributions
Chapter 8: Estimating with Confidence
Chapter 7: The Distribution of Sample Means
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Estimating Population Parameters Based on a Sample
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Normal Distribution and z-scores
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Objectives 6.1 Estimating with confidence Statistical confidence
MGS 3100 Business Analysis Regression Feb 18, 2016
Presentation transcript:

Sampling distributions:

In Psychology we generally make inferences about populations on the basis of limited samples. We therefore need to know what relationship exists between samples and populations. A population of scores has a mean (μ), a standard deviation (δ) and a shape (e.g., normal distribution).

If we take repeated samples of scores, each sample has a mean (X), a standard deviation (s) and a shape. Due to random fluctuations, each sample is different - from other samples and from the parent population. Fortunately, these differences are predictable - and hence we can still use samples in order to make inferences about their parent populations.

What are the properties of the distribution of Sample Means? (a) The mean of the sample means is the same as the mean of the population of raw scores: (b) The standard deviation of the sample means is the "standard error". It is NOT the same as the standard deviation of the population of raw scores.

The standard error underestimates the population s.d.: The bigger the sample size, the smaller the standard error. i.e., variation between samples decreases as sample size increases. (This is because producing a sample mean reduces the influence of any extreme raw scores in a sample).

(c) The distribution of sample means tends to be normally distributed, no matter what the shape of the original distribution of raw scores in the population. This is called the Central Limit Theorem. This effect is more marked the bigger the sample size. This means that we can use the normal distribution as a simple "model" for many different populations - even though we don't know their true characteristics.

The Central Limit Theorem in action: (a) Frequency distribution of raw scores: high frequency of raw scores low population mean

(b) Frequency distribution of means of samples taken from the previous population: high frequency of sample means low population mean

Relationship of a raw score to the mean of the group of scores: Mean of all scores A particular raw score (a) Various proportions of scores fall within certain limits of the mean (i.e. 68% fall within the range of the mean +/- 1 s.d.; 95% within +/- 2 s.d., etc.). (b) Using z-scores, we can represent a given score in terms of how different it is from the mean of the group of scores.

Relationship of a sample mean to the population mean: A particular sample mean We can do the same with sample means: (a) we obtain a particular sample mean; (b) we can represent this in terms of how different it is from the mean of its parent population. Sample means vary randomly, but most will be reasonably close to the population mean. (e.g., 68% of sample means will fall within +/- 1 s.d. of the population mean, etc.).

If we obtain a sample mean that is much higher or lower than the population mean, there are two possible reasons: (a) our sample mean is a rare "fluke" (a quirk of sampling variation); (b) our sample has not come from the population we thought it did, but from some other, different, population. The greater the difference between the sample and population means, the more plausible (b) becomes.

A concrete example: The human population I.Q. is 100. A random sample of people has a mean I.Q. of 170. There are two explanations: (a) the sample is a fluke: by chance our random sample contained a large number of highly intelligent people. (b) the sample does not come from the population we thought they did: our sample was actually from a different population - e.g., aliens masquerading as humans.

Relationship between population mean and sample mean: high frequency of sample means low population mean I.Q. (100) sample mean I.Q. (170)

This logic can be extended to the difference between two samples from the same population: A common experimental design: We compare two groups of people: an experimental group and a control group. Experimental group get a "wolfman" drug. Control group get a harmless placebo. D.V.: number of dog-biscuits consumed. At the start of the experiment, they are two samples from the same population ("humans").

At the end of the experiment, are they: (a) still two samples from the same population (i.e., still two samples of "humans" - our experimental treatment has left them unchanged); OR (b) now samples from two different populations - one from the "population of humans" and one from the "population of wolfmen"?

We can decide between these alternatives as follows: The differences between any two sample means from the same population are normally distributed, around a mean difference of zero. Most differences will be relatively small, since the Central Limit Theorem tells us that most samples will have similar means to the population mean (and hence similar means to each other). If we obtain a very large difference between our sample means, it could have occurred by chance, but this is very unlikely - it is more likely that the two samples come from different populations.

Possible differences between two sample means: a big difference between mean of sample A and mean of sample B: high frequency of raw scores low mean of sample A mean of sample B low high sample means

Possible differences between two sample means (cont.): a small difference between mean of sample A and mean of sample B: high frequency of raw scores low mean of sample A mean of sample B low high sample means

Frequency distribution of differences between sample means: high frequency of differences between sample means low mean A bigger than mean B mean A smaller than mean B no difference size of difference between sample means ( "sample mean A" minus "sample mean B")

Frequency distribution of differences between sample means: A small difference between sample means (quite likely to occur by chance) high frequency of differences between sample means low mean A bigger than mean B mean A smaller than mean B no difference size of difference between sample means

Frequency distribution of differences between sample means: high A large difference between sample means (unlikely to occur by chance) frequency of differences between sample means low mean A bigger than mean B mean A smaller than mean B no difference size of difference between sample means