Lecture 2 Cluster Investigation Dr. Bartlett and Dr. Geary Olsen


Environmental Health, Lecture 2: Cluster Investigation. Dr. Bartlett and Dr. Geary Olsen

Elizabeth Lyons – MSU Graduate

Environmental Health: Cluster Investigation A lot of what health departments do is respond to citizen complaints regarding clusters. Usually cancer clusters. They also participate in long-term studies.

Environmental Health: Cluster Investigation. Somebody calls the health department: “Three kids at our school have childhood leukemia. Your vet clinic is right next to the playground, and we can smell pesticide coming from your clinic. We think the pesticide fumes are causing our kids to get leukemia.”

Cluster Investigation Epidemiology
Cluster: a number of persons, animals, or things gathered or situated close together. In epidemiology, a cluster is a closely grouped series of events or cases of a disease, with well-defined distribution patterns, in relation to time or place (or both).
Types: time cluster, space cluster, time-space cluster.
Random = happening by chance. P = probability.
A common misreading: “A P-value of .01 means that the probability of an event occurring by chance is 1 in 100.” (NO!)

Cluster Investigation Epidemiology: if the null hypothesis is true, then P = .01 represents the probability that a difference as extreme as that observed (or more extreme) would occur just by chance. With regard to clusters of disease, a P-value or significance level is meaningful and interpretable as a probability statement only if the observations were drawn at random from a defined population.

Time-Space Cluster Investigation: why it is hard to use P values to determine whether you have a cluster. The problem is that clusters sometimes happen “naturally,” just by chance. Is your cluster due to something causing an increased rate of disease? Or is it one of these “chance” clusters?

What can cause a cluster? “Statistical” clusters: given enough time and enough potential groupings, eventually there will be subsets of the data (a particular town, month, farm, sector, county, etc.) that may, by chance alone, have a higher rate of a disease than the entire population. Illinois Subdivision
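The idea of a "statistical" cluster can be illustrated with a small simulation. The sketch below is not from the lecture: the number of towns, the population size, and the disease rate are made-up values chosen only for illustration. Every simulated town shares the same true rate, yet the town with the most cases still looks like a hot spot.

```python
import random

# Hypothetical parameters, chosen only for illustration.
N_TOWNS = 500        # number of potential groupings (towns)
POPULATION = 1000    # people per town
TRUE_RATE = 0.005    # identical true disease rate in every town


def simulate_town_counts(n_towns, population, rate, seed=0):
    """Draw a case count for each town; all towns share the same true rate."""
    rng = random.Random(seed)
    return [
        sum(rng.random() < rate for _ in range(population))
        for _ in range(n_towns)
    ]


counts = simulate_town_counts(N_TOWNS, POPULATION, TRUE_RATE)
expected_cases = POPULATION * TRUE_RATE  # 5 cases expected per town
worst_town = max(counts)

print(f"Expected cases per town: {expected_cases:.0f}")
print(f"Most cases seen in any single town: {worst_town}")
```

Nothing distinguishes the "worst" town from the others except chance, which is why picking it out after the fact and then testing it is misleading.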

A posteriori vs. a priori: “after the fact” vs. “before the fact.” What is the probability that, through the eons of time, I would be standing here before you with two arms, two eyes, 10 fingers and only one nose? The probability is 1.000…. (It has already happened!) But that’s not the point. The question was posed after observing the data. If I had 8 fingers, I would have asked a different question. When the question is posed based on what you observe in the data, then P values from statistical tests of association are no longer valid.

Random is a process, not a result. Consider the tables of random numbers. Look at about 100 of them until you see one that somehow doesn’t look random.


Statistical test of association: chi-square goodness-of-fit test. Expect 10% of the numbers to be “7.” There are 110 numbers in the cluster, so we would expect 11 to be “7.” We observed 25 to be “7.” P = .01, but is it valid?
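For readers who want to see the arithmetic behind a goodness-of-fit test, here is a minimal sketch. It assumes the digits are collapsed into two categories ("7" vs. "not 7"), which gives one degree of freedom; the slide does not say how its P value was obtained, so this sketch does not try to reproduce that figure. The function name is hypothetical.

```python
import math


def chi_square_two_category(observed_hits, n, expected_prop):
    """Chi-square goodness-of-fit statistic and P value for hit vs. miss (1 df)."""
    expected_hits = n * expected_prop
    expected_miss = n - expected_hits
    observed_miss = n - observed_hits
    chi2 = ((observed_hits - expected_hits) ** 2 / expected_hits
            + (observed_miss - expected_miss) ** 2 / expected_miss)
    # With 1 degree of freedom, the upper-tail probability of the
    # chi-square distribution equals erfc(sqrt(chi2 / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value


# Numbers from the slide: 110 digits, 10% expected to be "7", 25 observed.
chi2, p_value = chi_square_two_category(observed_hits=25, n=110, expected_prop=0.10)
print(f"chi-square = {chi2:.2f}, P = {p_value:.2g}")
```

However small the computed P value is, the slide's question still stands: the test assumes the 110 numbers were chosen before looking at the data, not because they looked unusual.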

Texas sharpshooter: the Texas sharpshooter shoots at the side of a barn and then draws a bull’s eye around the bullet hole. If you define a cluster (draw the bull’s eye) based on what you observe in the data (the bullet hole), then statistical tests cannot be used to confirm the existence of the cluster (time-space association with a particular risk factor). Why are you studying “7” disease? Why are you studying it here?
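The sharpshooter effect can be sketched in a few lines. The simulation below is an illustration, not part of the lecture: it generates purely random digits (the length of 10,000 is an arbitrary assumption), then scans every run of 110 consecutive digits and reports the run containing the most 7s. That is the equivalent of drawing the bull's eye after the shot.

```python
import random


def max_sevens_in_any_window(n_digits=10_000, window=110, seed=1):
    """Return the largest count of 7s found in any run of `window` random digits."""
    rng = random.Random(seed)
    is_seven = [rng.randrange(10) == 7 for _ in range(n_digits)]

    # Sliding-window count: add the digit entering, drop the digit leaving.
    current = sum(is_seven[:window])
    best = current
    for i in range(window, n_digits):
        current += is_seven[i] - is_seven[i - window]
        best = max(best, current)
    return best


expected = 110 * 0.10  # about 11 sevens expected in a window fixed in advance
best = max_sevens_in_any_window()
print(f"Expected 7s in a pre-specified window: {expected:.0f}")
print(f"Most 7s in the best window found after looking: {best}")
```

The best window always beats the expected count, even though the digits are random by construction. A test applied to that hand-picked window treats it as if it had been specified in advance, which it was not.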

“Statistical” clusters: given the large number of diseases and risk factors (employment, organizations, housing location, etc.), some risk factors will appear to be associated with disease by chance alone. “Given enough time, it is probable for the improbable to happen.” Albert Einstein
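A rough calculation shows how quickly chance associations accumulate. The sketch below assumes the tests are independent and that each uses the same significance level; both are simplifying assumptions, and the numbers of tests shown are arbitrary examples.

```python
def prob_at_least_one_false_positive(n_tests, alpha=0.05):
    """Chance that at least one of n independent tests is 'significant' by chance."""
    return 1 - (1 - alpha) ** n_tests


for n_tests in (1, 10, 100):
    prob = prob_at_least_one_false_positive(n_tests)
    print(f"{n_tests:>3} disease/risk-factor pairs tested: "
          f"P(at least one chance finding) = {prob:.3f}")
```

With enough disease and risk-factor pairings, at least one apparent association becomes close to certain even when none is real.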

Other causes for clusters:
- Biological clusters are clusters of disease that have a biological basis. This is what we are looking for!
- Confounding (and time-space clusters). Example: Legionnaires’ disease in Michigan.
- Reporting bias (and time-space clusters). Example: rabies hysteria or apathy.

Example: www.RUSick2.msu.edu
For foodborne outbreaks:
- Statistical clusters: given enough food items and enough time, clusters of foodborne disease will occur that do not have a biological basis and do not represent common-source foodborne outbreaks.
- Pranksters or malicious intent.
- Confounding: strawberries every spring; a restaurant next to “an event.”

Cluster Investigation: cancer clusters cannot be investigated like acute infectious disease clusters.
1. The disease has a long and indefinite incubation/induction period.
2. Exposure to cancer-causing agents is usually through the environment, not through personal contact or consumption of food or beverages.
“Hot pursuit” case-control studies, usually done in acute infectious disease outbreaks, are not useful in cancer cluster analysis.

Cluster Investigation: the following characteristics can help identify a situation where a case-control study or a multicommunity investigation might be useful.
1. There must be at least five cases in a cluster, and they must have a high relative risk (RR). What is high? 2.0? 10.0?
2. A unique and well-known etiological agent is known to be the cause, and the pathophysiologic mechanism for that agent is known.
3. The agent is in the environment and can be measured there.
4. The agent is persistent in the infected/exposed people but rare in normal populations, and its unique physiologic response in the exposed can be measured.
5. There is a heterogeneity of exposure (range from high to low) within the neighborhood so effects can be easily measured.
6. The route of exposure can be easily recalled.
7. Multi-community studies can be done by looking at (otherwise) similar exposed and unexposed communities.
8. Endemic space cluster, not a space-time cluster that exists for a while then vanishes.
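The relative risk mentioned in the first criterion is the risk of disease among the exposed divided by the risk among the unexposed. The sketch below uses invented counts purely to show the calculation; the function name and the numbers are hypothetical and do not come from the lecture.

```python
def relative_risk(exposed_cases, exposed_total, unexposed_cases, unexposed_total):
    """Risk of disease in the exposed group divided by risk in the unexposed group."""
    risk_exposed = exposed_cases / exposed_total
    risk_unexposed = unexposed_cases / unexposed_total
    return risk_exposed / risk_unexposed


# Hypothetical counts: 10 cases among 1,000 exposed, 5 among 1,000 unexposed.
rr = relative_risk(exposed_cases=10, exposed_total=1000,
                   unexposed_cases=5, unexposed_total=1000)
print(f"Relative risk = {rr:.1f}")
```

With so few cases, a single additional case in either group changes the ratio substantially, which is one reason the slide asks what should count as "high."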

Cluster Investigation: Minnesota Cluster Analysis Track Record
1. Information and education: 95%
2. Public-initiated surveys: 4%
3. Validation, evaluation, feasibility and education: 1%
4. In-depth study: <1%

Cluster Investigation: evaluation of false-positive reports is the price we must pay in order to identify the true biological cluster. Examples:
- The first few AIDS cases
- Foodborne outbreaks of E. coli O157:H7
- Most foodborne outbreaks
(Note: these are all infectious agents!)
- Bitter Harvest (PBB)