Basic epidemiologic analysis with Stata

Slides:



Advertisements
Similar presentations
Analytical epidemiology
Advertisements

Dr Eva Batistatou. Outline of this presentation… What is epidemiology? The Fundamentals of Epidemiology course What is biostatistics? The Biostatistics.
M2 Medical Epidemiology
Using Excel Biostatistics 212 Lecture 4. Housekeeping Questions about Lab 3? –replace vs. recode Final Project Dataset! –“Housekeeping” commands vs. data.
Using Excel Biostatistics 212 Lecture 4. Housekeeping Finish Lab 2 today and/or start Lab 3 Mac Addendum Copying and pasting from Stata.
Basic epidemiologic analysis with Stata Biostatistics 212 Lecture 5.
EPID Introduction to Analysis and Interpretation of HIV/STD Data Confounding Manya Magnus, Ph.D. Summer 2001 adapted from M. O’Brien and P. Kissinger.
What is Interaction for A Binary Outcome? Chun Li Department of Biostatistics Center for Human Genetics Research September 19, 2007.
HSRP 734: Advanced Statistical Methods July 24, 2008.
Chapter 19 Stratified 2-by-2 Tables
Chance, bias and confounding
Introduction to Statistical Computing in Clinical Research Biostatistics 212 Course director: Mark Pletcher Teaching Assistant: Lee Zane.
1June In Chapter 19: 19.1 Preventing Confounding 19.2 Simpson’s Paradox (Severe Confounding) 19.3 Mantel-Haenszel Methods 19.4 Interaction.
BIOST 536 Lecture 12 1 Lecture 12 – Introduction to Matching.
Analysis of Complex Survey Data Day 3: Regression.
Epidemiology Kept Simple
Study Design and Measures of Disease Frequency Intermediate Epidemiology.
Basic epidemiologic analysis with Stata Biostatistics 212 Lecture 5.
Assessing Survival: Cox Proportional Hazards Model Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Confounding, Effect Modification, and Stratification.
EPIDEMIOLOGY Why is it so damn confusing?. Disease or Outcome Exposure ab cd n.
Biostat Didactic Seminar Series Analyzing Binary Outcomes: Analyzing Binary Outcomes: An Introduction to Logistic Regression Robert Boudreau, PhD Co-Director.
Stratification and Adjustment
INTRODUCTION TO EPIDEMIOLO FOR POME 105. Lesson 3: R H THEKISO:SENIOR PAT TIME LECTURER INE OF PRESENTATION 1.Epidemiologic measures of association 2.Study.
Analysis of Categorical Data
Measuring Associations Between Exposure and Outcomes.
Making a figure, dates, and other advanced topics Biostatistics 212 Lecture 6.
Biostatistics Case Studies 2005 Peter D. Christenson Biostatistician Session 4: Taking Risks and Playing the Odds: OR vs.
Basic epidemiologic analysis with Stata Biostatistics 212 Lecture 5.
Evidence-Based Medicine 3 More Knowledge and Skills for Critical Reading Karen E. Schetzina, MD, MPH.
Making a figure with Stata or Excel Biostatistics 212 Lecture 7.
EPI 811 – Work Group Exercise #2 Team Honey Badgers Alex Montoye Kellie Mayfield Michele Fritz Anton Frattaroli.
 Is there a comparison? ◦ Are the groups really comparable?  Are the differences being reported real? ◦ Are they worth reporting? ◦ How much confidence.
Making Tables and Figures with Stata Biostatistics 212 Lecture 6.
Organizing a project, making a table Biostatistics 212 Lecture 7.
Organizing a project, making a table Biostatistics 212 Session 5.
Basic epidemiologic analysis with Stata Part II Biostatistics 212 Lecture 6.
Basic epidemiologic analysis with Stata Biostatistics 212 Lecture 5.
October 15. In Chapter 19: 19.1 Preventing Confounding 19.2 Simpson’s Paradox 19.3 Mantel-Haenszel Methods 19.4 Interaction.
Confounding, Effect Modification, and Stratification HRP 261 1/26/04.
Organizing a project, making a table Biostatistics 212 Lecture 7.
Using Excel Biostatistics 212 Lecture 4. Housekeeping Questions about Lab 3? –replace vs. recode –Cross-checking/recoding missing values –Analysis of.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
Stata 9, Epi tables, Survey, Factor Tables for epidemiologists Survey commands Factor analysis H.S.1.
Introduction to Statistical Computing in Clinical Research Biostatistics 212.
Assessing Binary Outcomes: Logistic Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
What is “collapsing”? (for epidemiologists) Picture a 2x2 tables from Intro Epi: (This is a collapsed table; there are no strata) DiseasedUndiseasedTotal.
Analytical epidemiology Disease frequency Study design: cohorts & case control Choice of a reference group Biases Alain Moren, 2006 Impact Causality Effect.
Research Techniques Made Simple: Multivariable Analysis Marlies Wakkee Loes Hollestein Tamar Nijsten Department of Dermatology, Erasmus University Medical.
11/20091 EPI 5240: Introduction to Epidemiology Confounding: concepts and general approaches November 9, 2009 Dr. N. Birkett, Department of Epidemiology.
Basics of Biostatistics for Health Research Session 3 – February 21, 2013 Dr. Scott Patten, Professor of Epidemiology Department of Community Health Sciences.
Basics of Biostatistics for Health Research Session 1 – February 7 th, 2013 Dr. Scott Patten, Professor of Epidemiology Department of Community Health.
1 Multivariable Modeling. 2 nAdjustment by statistical model for the relationships of predictors to the outcome. nRepresents the frequency or magnitude.
Instructor Resource Chapter 15 Copyright © Scott B. Patten, Permission granted for classroom use with Epidemiology for Canadian Students: Principles,
01/20151 EPI 5344: Survival Analysis in Epidemiology Confounding and Effect Modification March 24, 2015 Dr. N. Birkett, School of Epidemiology, Public.
Confounding and effect modification Epidemiology 511 W. A. Kukull November
Handout Eight: Two-Way Between- Subjects Design with Interaction- Assumptions, & Analyses EPSE 592 Experimental Designs and Analysis in Educational Research.
Basics of Biostatistics for Health Research Session 4 – February 28, 2013 Dr. Scott Patten, Professor of Epidemiology Department of Community Health Sciences.
Matched Case-Control Study Duanping Liao, MD, Ph.D Phone:
Confounding Biost/Stat 579 David Yanez Department of Biostatistics University of Washington July 7, 2005.
Introduction to Biostatistics, Harvard Extension School, Fall, 2005 © Scott Evans, Ph.D.1 Contingency Tables.
Measures of disease frequency Simon Thornley. Measures of Effect and Disease Frequency Aims – To define and describe the uses of common epidemiological.
Epidemiology 503 Confounding.
BMTRY 747: Introduction Jeffrey E. Korte, PhD
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Evaluating Effect Measure Modification
Discussion Week 1 (4/1/13 – 4/5/13)
Dr Luis E Cuevas – LSTM Julia Critchley
Presentation transcript:

Basic epidemiologic analysis with Stata Biostatistics 212 Session 4 Welcome to class! This is the first session ever of Biostatistics 212 – introduction to statistical computing in clinical research (or ISCCR, as Lee dubbed it).

Today... What’s the difference between epidemiologic and statistical analysis? 2 x 2 tables, OR’s and RR’s Interaction and confounding with 2 x 2’s Stata’s “Epitab” commands An introduction to logistic regression

Epi vs. Biostats Epidemiologic analysis – Interpreting clinical research data in the context of scientific knowledge Biostatistical analysis – Evaluating the role of chance

Epi vs. Biostats Epi –Confounding, interaction, and causal diagrams. What to adjust for? What do the adjusted estimates mean? C A B A C B

2 x 2 Tables “Contingency tables” are the traditional analytic tool of the epidemiologist Outcome + - + - a b OR = (a/b) /(c/d) = ad/bc RR = a/(a+b) / c/(c+d) Exposure c d

2 x 2 Tables Example Coronary calcium + - + OR = 2.1 (1.6 – 2.7) + - + - 106 585 691 OR = 2.1 (1.6 – 2.7) RR = 1.9 (1.6 – 2.4) Binge drinking 186 2165 2351 292 2750 3042

2 x 2 Tables There is a statistically significant association, but is it causal? Does male gender confound the association? Male Binge drinking Coronary calcium

2 x 2 Tables First, stratify… 106 585 186 2165 89 374 118 801 17 211 CAC + - 106 585 186 2165 + - RR = 1.94 (1.55-2.42) Binge In men In women CAC CAC + - + - 89 374 118 801 17 211 68 1364 (34%) (14%) + - + - Binge Binge (15%) (7%) RR = 1.50 (1.16-1.93) RR = 1.57 (0.94-2.62)

2 x 2 Tables …compare strata-specific estimates… (they’re about the same) In men In women CAC CAC + - + - 89 374 118 801 17 211 68 1364 (34%) (14%) + - + - Binge Binge (15%) (7%) RR = 1.50 (1.16-1.93) RR = 1.57 (0.94-2.62)

2 x 2 Tables …compare to the crude estimate 106 585 186 2165 89 374 CAC + - 106 585 186 2165 + - RR = 1.94 (1.55-2.42) Binge In men In women CAC CAC + - + - 89 374 118 801 17 211 68 1364 (34%) (14%) + - + - Binge Binge (15%) (7%) RR = 1.50 (1.16-1.93) RR = 1.57 (0.94-2.62)

2 x 2 Tables …and then adjust the summary estimate. 89 374 118 801 17 In men In women CAC CAC + - + - 89 374 118 801 17 211 68 1364 + - + - Binge Binge RR = 1.50 (1.16-1.93) RR = 1.57 (0.94-2.62) RRadj = 1.51 (1.21-1.89)

+ - 106 585 186 2165 + - RR = 1.94 (1.55-2.42) Binge In men In women CAC CAC + - + - 89 374 118 801 17 211 68 1364 (34%) (14%) + - + - Binge Binge (15%) (7%) RR = 1.50 (1.16-1.93) RR = 1.57 (0.94-2.62) RRadj = 1.51 (1.21-1.89)

2 x 2 Tables Tabulate – output not exactly what we want. The “epitab” commands Stata’s answer to stratified analyses cs, cc, ir csi, cci, iri tabodds, mhodds

2 x 2 Tables Example – demo using Stata cs cac binge cs cac binge, by(male) cs cac modalc cs cac modalc, by(racegender)

2 x 2 Tables Example – demo using Stata cc cac binge

2 x 2 Tables Epitab subtleties ir command Rate ratios, adjusted etc Related to poisson regression Intermediate commands – csi, cci, iri No dataset required – just 2x2 cell frequencies csi a b c d csi 106 186 585 2165 (for cac binge)

2 x 2 Tables Adjustment vs. stratification cs command does both But can’t adjust for other stuff simultaneously Binge drinking and CAC, by male, adjusted for age and race? mhodds cac binge age black, by(male)

2 x 2 Tables Testing for trend tabodds tabodds cac alccat tabodds cac alccat, adjust(age male black)

2 x 2 Tables A modern approach – logistic regression logistic cac binge logistic cac binge male xi: logistic cac modalc i.racegender (xi: allows you to use create “dummy” variables on the fly…) Provides all OR’s in the model, but interactions more cumbersome xi: logistic cac i.racegender*modalc mhodds cac modalc, by(racegender)

Summary Epitab commands are a great way to explore your data Emphasis on interaction Logistic regression is a more general approach, ubiquitous, but testing for interactions is more difficult…

Summary Immediate commands (e.g. csi) are very useful – just watch out for the b  c switch! You’ll get more practice with this is Epi Methods.

Lab this week Epidemiologic analysis of the coronary calcium – death dataset from Lab 1 Moderately long

To come… Lecture 5 – Tables with Excel, Word Lecture 6 – Figures with Stata, Excel And time to work on your final project.

See you on Thursday! Lab 4 due 11/16 Bring a floppy disc to all labs!