What is a sample? Epidemiology matters: a new introduction to methodological foundations Chapter 4.

Slides:



Advertisements
Similar presentations
External validity: to what populations do our study results apply?
Advertisements

Conclusion Epidemiology and what matters most
Previous Lecture: Distributions. Introduction to Biostatistics and Bioinformatics Estimation I This Lecture By Judy Zhong Assistant Professor Division.
Chapter 6 Sampling and Sampling Distributions
What is an exposure? What is a disease? How do we measure them? Epidemiology matters: a new introduction to methodological foundations Chapter 3.
An introduction Epidemiology matters: a new introduction to methodological foundations Chapter 1.
What is a population? What is population health?
Measures of disease occurrence and frequency
Chapter 10: Estimating with Confidence
Inference Sampling distributions Hypothesis testing.
Objectives (BPS chapter 18) Inference about a Population Mean  Conditions for inference  The t distribution  The one-sample t confidence interval 
Central Limit Theorem.
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
Chapter 7 Sampling Distributions
Sampling Distributions
Topic 7 Sampling And Sampling Distributions. The term Population represents everything we want to study, bearing in mind that the population is ever changing.
Chapter 7 Sampling and Sampling Distributions
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 6 Introduction to Sampling Distributions.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 9-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
QM Spring 2002 Business Statistics Sampling Concepts.
Chapter Sampling Distributions and Hypothesis Testing.
Part III: Inference Topic 6 Sampling and Sampling Distributions
11 Populations and Samples.
© 2013 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Introductory Statistics: Exploring the World through.
PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 5 Chicago School of Professional Psychology.
Chapter 10: Estimating with Confidence
Chapter 7 Probability and Samples: The Distribution of Sample Means
How do cancer rates in your area compare to those in other areas?
Probability and the Sampling Distribution Quantitative Methods in HPELS 440:210.
Are exposures associated with disease?
Chapter 10 Hypothesis Testing
Epidemiologic Study Designs Nancy D. Barker, MS. Epidemiologic Study Design The plan of an empirical investigation to assess an E – D relationship. Exposure.
Sampling Theory Determining the distribution of Sample statistics.
Epidemiology The Basics Only… Adapted with permission from a class presentation developed by Dr. Charles Lynch – University of Iowa, Iowa City.
Chapter Nine Copyright © 2006 McGraw-Hill/Irwin Sampling: Theory, Designs and Issues in Marketing Research.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
Study Designs Afshin Ostovar Bushehr University of Medical Sciences Bushehr, /4/20151.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
A short introduction to epidemiology Chapter 2b: Conducting a case- control study Neil Pearce Centre for Public Health Research Massey University Wellington,
DTC Quantitative Methods Survey Research Design/Sampling (Mostly a hangover from Week 1…) Thursday 17 th January 2013.
Is the association causal, or are there alternative explanations? Epidemiology matters: a new introduction to methodological foundations Chapter 8.
Chapter 2 Nature of the evidence. Chapter overview Introduction What is epidemiology? Measuring physical activity and fitness in population studies Laboratory-based.
A short introduction to epidemiology Chapter 10: Interpretation Neil Pearce Centre for Public Health Research Massey University, Wellington, New Zealand.
Chapter 7 Sampling and Sampling Distributions ©. Simple Random Sample simple random sample Suppose that we want to select a sample of n objects from a.
1 Chapter 7 Sampling Distributions. 2 Chapter Outline  Selecting A Sample  Point Estimation  Introduction to Sampling Distributions  Sampling Distribution.
Sampling Methods and Sampling Distributions
8 Sampling Distribution of the Mean Chapter8 p Sampling Distributions Population mean and standard deviation,  and   unknown Maximal Likelihood.
Sampling Error SAMPLING ERROR-SINGLE MEAN The difference between a value (a statistic) computed from a sample and the corresponding value (a parameter)
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests Statistics.
Inferential Statistics Part 1 Chapter 8 P
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Chapter 7 Sampling and Sampling Distributions.
Chapter 6: 1 Sampling. Introduction Sampling - the process of selecting observations Often not possible to collect information from all persons or other.
Africa Program for Education Impact Evaluation Dakar, Senegal December 15-19, 2008 Experimental Methods Muna Meky Economist Africa Impact Evaluation Initiative.
T tests comparing two means t tests comparing two means.
Chapter 5 Sampling Distributions. The Concept of Sampling Distributions Parameter – numerical descriptive measure of a population. It is usually unknown.
Sampling Theory and Some Important Sampling Distributions.
Education 793 Class Notes Inference and Hypothesis Testing Using the Normal Distribution 8 October 2003.
Design of Clinical Research Studies ASAP Session by: Robert McCarter, ScD Dir. Biostatistics and Informatics, CNMC
Chapter 7 Introduction to Sampling Distributions Business Statistics: QMIS 220, by Dr. M. Zainal.
A short introduction to epidemiology Chapter 6: Precision Neil Pearce Centre for Public Health Research Massey University Wellington, New Zealand.
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
Sampling Theory Determining the distribution of Sample statistics.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Chapter 6 Sampling and Sampling Distributions
Chapter 9 Introduction to the t Statistic
Research Design
Sampling and Sampling Distributions
Introduction to Sampling Distributions
Sampling Distribution of a Sample Proportion
Presentation transcript:

What is a sample? Epidemiology matters: a new introduction to methodological foundations Chapter 4

Seven steps 1.Define the population of interest 2.Conceptualize and create measures of exposures and health indicators 3.Take a sample of the population 4.Estimate measures of association between exposures and health indicators of interest 5.Rigorously evaluate whether the association observed suggests a causal association 6.Assess the evidence for causes working together 7.Assess the extent to which the result matters, is externally valid, to other populations Epidemiology Matters – Chapter 12

1.Why take a sample? 2.How to take a representative sample 3.Quantifying sampling variability 4.How to take a purposive sample 5.Study design 6.Summary Epidemiology Matters – Chapter 43

1.Why take a sample? 2.How to take a representative sample 3.Quantifying sampling variability 4.How to take a purposive sample 5.Study design Epidemiology Matters – Chapter 44

Why take a sample?  Epidemiologists take samples to answer health- related research questions efficiently  A full census is the epidemiologic ideal  Reasons not to take a census all the time include lack of time, lack of money, and waste of resources Epidemiology Matters – Chapter 45

To take a sample 1.Specify population of interest 2.Specify a research question of interest Epidemiology Matters – Chapter 46

Specify population of interest  What are the characteristics of the population in which we would like to understand health?  Example: Do we want to know what the prevalence of diabetes is within New York City? New York State? The United States? Do we want to know the causes of diabetes?  The population of interest has to be specified before the sampling strategy defined Epidemiology Matters – Chapter 47

Specifying a question  Question of interest can help clarify appropriate way to sample population of interest  Questions asked can include estimating population parameters, or estimating causal effects of exposures on outcomes Epidemiology Matters – Chapter 48

Example, estimating population parameters Questions concerned with population parameters include  What proportion of individuals in the population of interest has breast cancer?  What is the mean blood pressure in the population?  How many new cases of HIV are diagnosed in the population over three years? Population parameters include estimates of  Proportions  Means  Standard deviations Sample required  Representative sample 9

Example, estimating causal effects of exposures on outcomes Questions for which these measures are needed are  Does exposure to pollution cause lung cancer?  Does suffering abuse in childhood cause depression in adulthood?  Does a specific genetic marker cause Alzheimer’s disease? Parameter of interest  Causal effect of an exposure on a health outcome Sampling concerns  Not representativeness (as in population parameters)  Whether individuals exposed to hypothesized cause of interest are comparable to individuals not exposed  Purposive sample sufficient 10

Representative and purposive  A representative sample is one where the sample that is taken has characteristics similar to the overall population  A purposive sample selects from the population base on some criterion  A representative sample may or may not include individuals who are comparable with respect to causal identification  A purposive sample may or may not be representative of a particular population of interest Epidemiology Matters – Chapter 411

1.Why take a sample? 2.How to take a representative sample 3.Quantifying sampling variability 4.How to take a purposive sample 5.Study design 6.Summary Epidemiology Matters – Chapter 412

How to take a representative sample  The simplest approach: a simple random sample  Each member of the population has an equal probability of being selected into the sample  A successful simple random sample should have the same basic characteristics as the original population Epidemiology Matters – Chapter 413

Taking a simple random sample 1.Enumerate all potential members of population of interest 2.Assign each member a probability of selection 3.Ensure selection of members are independent Epidemiology Matters – Chapter 414

Example: Sampling Farrlandia Epidemiology Matters – Chapter residents in Farrlandia Options for random selection: --Every 4th home, dice roll for selection within home Challenges include (a) clustered exposures, (b) unequal ‘home’ size Selected for sample

Example: Sampling Farrlandia Epidemiology Matters – Chapter residents in Farrlandia Select every Nth person in phone book Challenges include that not everyone is in phone book Selected for sample

Epidemiology Matters – Chapter 417  There is no perfect sample  The goal in epidemiology is to understand limitations of sampling methods and account for them The perfect sample?

Sampling Farrlandia Epidemiology Matters – Chapter 418

Sampling Farrlandia We want to collect our sample in such a way that the sample also has 50% exposed and 30% dotted. Epidemiology Matters – Chapter 419

Sampling Farrlandia We can use a simple random sample  ½ the population (25)  Probability of selection 1/50 or 2%  Random number generator Epidemiology Matters – Chapter 420

Sampling Farrlandia 21 Original Population Black solid1530% Black dots1020% Total black2550% Gray solid2040% Gray dots510% Total gray2550% Epidemiology Matters – Chapter 4

Sampling Farrlandia 22 Original Population Sample Black solid1530% Black dots1020% Total black2550% Gray solid2040% Gray dots510% Total gray2550% Black solid832% Black dots520% Total black1352% Gray solid1040% Gray dots28% Total gray1248% Epidemiology Matters – Chapter 4

1.Why take a sample? 2.How to take a representative sample 3.Quantifying sampling variability 4.How to take a purposive sample 5.Study design 6.Summary Epidemiology Matters – Chapter 423

Quantifying sampling variability  Sampled population will not have the exact same population parameters as complete population census  The ‘truth’, i.e., the population parameter of original population is called the true population parameter Epidemiology Matters – Chapter 424

Epidemiology Matters – Chapter 425 Variations in possible samples

Epidemiology Matters – Chapter 426

Epidemiology Matters – Chapter 427 Variations in possible samples

Epidemiology Matters – Chapter ,760 different possible samples of 5 Variations in possible samples

Quantifying uncertainty, Central Limit Theorem (CLT) 1.Average proportion across all possible samples = true population proportion  Example:  50% of true population has diabetes  Sample 1 has 100% diabetes  Sample 2 has 0% diabetes  Average of all samples will have 50% diabetes Epidemiology Matters – Chapter 429

Quantifying uncertainty, CLT 2.Variance around average sample proportions (standard error) p = sample proportion n = sample size Epidemiology Matters – Chapter 430

Quantifying uncertainty, CLT 3.Large samples will have normally distributed samples  > 30 people  No group < 5 people Epidemiology Matters – Chapter 431

Quantifying uncertainty, CLT Therefore the principal drivers of uncertainty are 1.Prevalence in the sample 2.Sample size The larger the sample size, the smaller the amount of uncertainty in the sample estimate Epidemiology Matters – Chapter 432

1.Why take a sample? 2.How to take a representative sample 3.Quantifying sampling variability 4.How to take a purposive sample 5.Study design 6.Summary Epidemiology Matters – Chapter 433

Purposive sample  Eligibility criteria for study is the central design element; entry is based on exposure status, or sometimes on health outcome status Epidemiology Matters – Chapter 434

1.Why take a sample? 2.How to take a representative sample 3.Quantifying sampling variability 4.How to take a purposive sample 5.Study design 6.Summary Epidemiology Matters – Chapter 435

Study design  Study design considerations are similar for representative or purposive sample  Study design reflects decisions made at one time point or over time  Timing of disease process can inform the study design Epidemiology Matters – Chapter 436

Study design options 1.Sample one moment in time, irrespective of disease status, measure disease and potential cause simultaneously 2.Sample over time, start with disease free individuals only, measure disease over time 3.Sample one moment in time, based on disease status Epidemiology Matters – Chapter 437

Farrlandia population Epidemiology Matters – Chapter 438

Farrlandia population Epidemiology Matters – Chapter 439

Farrlandia population Epidemiology Matters – Chapter 440

Farrlandia population Epidemiology Matters – Chapter 441

Farrlandia population Epidemiology Matters – Chapter 442

Farrlandia population Epidemiology Matters – Chapter 443

Farrlandia population Epidemiology Matters – Chapter 444

Farrlandia population Epidemiology Matters – Chapter 445

Farrlandia population Epidemiology Matters – Chapter 446

Option 1, Cross-sectional Epidemiology Matters – Chapter 447

Option 2, Cohort Epidemiology Matters – Chapter 448

Option 3, Case-control Epidemiology Matters – Chapter 449

1.Why take a sample? 2.How to take a representative sample 3.Quantifying sampling variability 4.How to take a purposive sample 5.Study design 6.Summary Epidemiology Matters – Chapter 450

Summary 1.Samples are efficient, representative or purposive 2.Representative sample; e.g., simple random sample 3.Sampling variability, standard error 4.Purposive sample, selection on exposure or disease status 5.Study designs can be cross-sectional, cohort, case- control Epidemiology Matters – Chapter 451

Seven steps 1.Define the population of interest 2.Conceptualize and create measures of exposures and health indicators 3.Take a sample of the population 4.Estimate measures of association between exposures and health indicators of interest 5.Rigorously evaluate whether the association observed suggests a causal association 6.Assess the evidence for causes working together 7.Assess the extent to which the result matters, is externally valid, to other populations Epidemiology Matters – Chapter 152

epidemiologymatters.org 53Epidemiology Matters – Chapter 1