SJS SDI_151 Design of Statistical Investigations Stephen Senn Introduction to Sampling.

Slides:



Advertisements
Similar presentations
SJS SDI_161 Design of Statistical Investigations Stephen Senn Random Sampling I.
Advertisements

SJS SDI_171 Design of Statistical Investigations Stephen Senn Random Sampling 2.
Overview of Sampling Methods II
Introduction Simple Random Sampling Stratified Random Sampling
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 18 Sampling Distribution Models.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 18 Sampling Distribution Models.
Estimation in Sampling
Sampling Distributions
Sampling Distributions (§ )
Chapter 10: Sampling and Sampling Distributions
Dr. Chris L. S. Coryn Spring 2012
Topics: Inferential Statistics
Evaluating Hypotheses
Why sample? Diversity in populations Practicality and cost.
Statistical Inference and Sampling Introduction to Business Statistics, 5e Kvanli/Guynes/Pavur (c)2000 South-Western College Publishing.
Copyright (c) Bani Mallick1 Lecture 4 Stat 651. Copyright (c) Bani Mallick2 Topics in Lecture #4 Probability The bell-shaped (normal) curve Normal probability.
7-1 Chapter Seven SAMPLING DESIGN. 7-2 Sampling What is it? –Drawing a conclusion about the entire population from selection of limited elements in a.
Sampling Distributions
QMS 6351 Statistics and Research Methods Chapter 7 Sampling and Sampling Distributions Prof. Vera Adamchik.
Lehrstuhl für Informatik 2 Gabriella Kókai: Maschine Learning 1 Evaluating Hypotheses.
Statistical inference Population - collection of all subjects or objects of interest (not necessarily people) Sample - subset of the population used to.
Sampling Methods.
How confident are we that our sample means make sense? Confidence intervals.
Chapter 7 ~ Sample Variability
Determining the Size of
Sampling Moazzam Ali.
1. Homework #2 2. Inferential Statistics 3. Review for Exam.
Introduction Parameters are numerical descriptive measures for populations. For the normal distribution, the location and shape are described by  and.
 The situation in a statistical problem is that there is a population of interest, and a quantity or aspect of that population that is of interest. This.
Virtual COMSATS Inferential Statistics Lecture-6
Random Sampling, Point Estimation and Maximum Likelihood.
Introduction to Inferential Statistics. Introduction  Researchers most often have a population that is too large to test, so have to draw a sample from.
1 Sampling Distributions Lecture 9. 2 Background  We want to learn about the feature of a population (parameter)  In many situations, it is impossible.
Standard Error and Confidence Intervals Martin Bland Professor of Health Statistics University of York
Chapter 11 – 1 Chapter 7: Sampling and Sampling Distributions Aims of Sampling Basic Principles of Probability Types of Random Samples Sampling Distributions.
Copyright © 2009 Pearson Education, Inc. Chapter 18 Sampling Distribution Models.
Sampling Design and Analysis MTH 494 Lecture-30 Ossam Chohan Assistant Professor CIIT Abbottabad.
Chapter 7: Sampling and Sampling Distributions
STANDARD ERROR Standard error is the standard deviation of the means of different samples of population. Standard error of the mean S.E. is a measure.
Chapter 7: Sample Variability Empirical Distribution of Sample Means.
Chapter 7 Sampling and Sampling Distributions ©. Simple Random Sample simple random sample Suppose that we want to select a sample of n objects from a.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Chapter 7 Sampling and Sampling Distributions.
Confidence Intervals (Dr. Monticino). Assignment Sheet  Read Chapter 21  Assignment # 14 (Due Monday May 2 nd )  Chapter 21 Exercise Set A: 1,2,3,7.
Stats 845 Applied Statistics. This Course will cover: 1.Regression –Non Linear Regression –Multiple Regression 2.Analysis of Variance and Experimental.
Bangor Transfer Abroad Programme Marketing Research SAMPLING (Zikmund, Chapter 12)
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. PPSS The situation in a statistical problem is that there is a population of interest, and a quantity or.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 6-4 Sampling Distributions and Estimators.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Lecture VI Statistics. Lecture questions Mathematical statistics Sampling Statistical population and sample Descriptive statistics.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Introduction to Inference Sampling Distributions.
C1, L1, S1 Chapter 1 What is Statistics ?. C1, L1, S2 Chapter 1 - What is Statistics? A couple of definitions: Statistics is the science of data. Statistics.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
Topics Semester I Descriptive statistics Time series Semester II Sampling Statistical Inference: Estimation, Hypothesis testing Relationships, casual models.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 18 Sampling Distribution Models.
The inference and accuracy We learned how to estimate the probability that the percentage of some subjects in the sample would be in a given interval by.
The accuracy of averages We learned how to make inference from the sample to the population: Counting the percentages. Here we begin to learn how to make.
Sampling Design and Procedure
Stats 242.3(02) Statistical Theory and Methodology.
Virtual University of Pakistan
Chapter 6: Sampling Distributions
Advanced Quantitative Techniques
Virtual University of Pakistan
Inference: Conclusion with Confidence
Graduate School of Business Leadership
4 Sampling.
Combining Random Variables
Sampling.
1. Homework #2 (not on posted slides) 2. Inferential Statistics 3
Sampling Distributions (§ )
Presentation transcript:

SJS SDI_151 Design of Statistical Investigations Stephen Senn Introduction to Sampling

SJS SDI_152 Representative Inference So far in the course we have been interested in comparisons –with some sort of causal investigation We now look at the case where we are interested in collecting representative material –samples to describe populations First we consider some possible applications

SJS SDI_153 Applications of Sampling Methods Quality control of manufacturing processes Financial audit Opinion polls Clinical audit Anthropology Social surveys Ecological surveys –capture/recapture

SJS SDI_154 An Important Practical Distinction All of these application areas require sampling theory and careful consideration as to how samples are drawn. However some of them have a further difficulty, which is that the opinions of human beings have to be ascertained. In what follows we shall often take opinion polls/social surveys as typical examples of sampling problems. (But our first example is not of this sort.) This will enable us to discuss also the further problems that arise in these contexts. However, first we shall review some very elementary statistical concepts

SJS SDI_155 Standard Deviation/ Standard Error There is common confusion between standard deviation and standard error The standard deviation describes the spread of original values The standard error is a measure of reliability of some statistic based on the original values

SJS SDI_156 An Illustration of This Difference This will now be illustrated using a simple example This example is again a medical one –My apologies! –I need a large data set –This one will have to do

SJS SDI_157 Example Surv_2 Cross-over trial in asthma 790 baseline FEV 1 readings –Since baselines unaffected by treatment –Regard as homogenous sample –Ignore fact that they are repeated measures The following slide shows distribution of readings

SJS SDI_158

9 Distribution Curve skewed to the right –Clearly not Normal Statistics –Mean –Median –Variance 0.462

SJS SDI_1510 Sampling Suppose that we take simple random samples of size 10 –Take these at random from original distribution With replacment Calculate mean of these Study distribution of these means –This is what is called a sampling distribution Illustrated on next slide

SJS SDI_1511

SJS SDI_1512 Distribution Curve less obviously skewed to the right –Approximation to Normal is closer Distribution is narrower Statistics –Mean (very similar to previously) –Median (now much closer to mean) –Variance (approximately 1/10 of previous value)

SJS SDI_1513

SJS SDI_1514 The Different Variances Case 1 –Variance of original values –The square root of this is the standard deviation Case 2 –Variance of means –Square root of these is standard error of the mean (SEM) In general –Square root of the variance of a statistic (e.g. a mean) is a standard error

SJS SDI_1515 Standard Deviation v Standard Error Standard deviation –Used to describe variation of original values Can be population Can be sample Standard error –Used to describe reliability of a statistic. For example SE of mean SE of treatment differences

SJS SDI_1516 Estimating the Standard Error The standard error of a simple random sample of size n drawn from a population with variance 2 is / n. In practice 2, being a population parameter, is unknown so we estimate it using the sample variance, s 2. Hence we estimate the standard error of the mean by s/ n

SJS SDI_1517 Transformations Can be very valuable –Improve accuracy of analysis Under-utilised Previous FEV 1 example follows –log-transformation –data more nearly Normal But will not deal with all problems –Outliers ( in particular bad values)

SJS SDI_1518

SJS SDI_1519 Normal Distribution Ideal mathematical representation Rarely applies in practice to original data However, many sampling distributions have approximately Normal form This increases its utility considerably A combination of transformation of original data plus averaging can frequently make it applicable

SJS SDI_1520 Technical Terms (Schaeffer, Mendenhall and Ott) Element –Object on which a measurement is taken Population –A collection of elements about which we wish to make an inference Sampling units –Nonoverlapping collection of elements from the population that cover the entire population Sampling frame –A list of sampling units Sample –Collection of sampling units drawn from a frame

SJS SDI_1521 Probability Sampling Well-defined sampling frame Probabilistic rule for drawing sample Knowledge of rule and sampling frame enables probabilistic statements about the population There are various types of such sample –simple, cluster, stratified

SJS SDI_1522 Simple Random Sample We shall encounter this in more detail in the next lecture. For the moment we note a definition Sampling in which every member of the population has an equal chance of being chosen and successive drawings are independent Mariott, A Dictionary of Statistical Terms Only for simple random sampling is the standard error of the mean equal to / n

SJS SDI_1523 Quota Sampling Sampling frame not used May have rough idea of population composition Sampling carries on until various quotas are fulfilled –e.g 100 males, 100 females Difficult to make probabilistic statements about population