Aarhus and the A-Bomb Survivor Studies Donald A. Pierce RERF Hiroshima.

Slides:



Advertisements
Similar presentations
Dr Eva Batistatou. Outline of this presentation… What is epidemiology? The Fundamentals of Epidemiology course What is biostatistics? The Biostatistics.
Advertisements

1 General Iteration Algorithms by Luyang Fu, Ph. D., State Auto Insurance Company Cheng-sheng Peter Wu, FCAS, ASA, MAAA, Deloitte Consulting LLP 2007 CAS.
CONCEPTS UNDERLYING STUDY DESIGN
Analysis of variance (ANOVA)-the General Linear Model (GLM)
Departments of Medicine and Biostatistics
Case-Control Studies (Retrospective Studies). What is a cohort?
Cox Model With Intermitten and Error-Prone Covariate Observation Yury Gubman PhD thesis in Statistics Supervisors: Prof. David Zucker, Prof. Orly Manor.
Basic Elements of Testing Hypothesis Dr. M. H. Rahbar Professor of Biostatistics Department of Epidemiology Director, Data Coordinating Center College.
Donald A. Pierce Radiation Effects Research Foundation, Hiroshima (retired) Radiation-related cancer incidence and non-cancer mortality among A-bomb survivors.
BIOST 536 Lecture 3 1 Lecture 3 – Overview of study designs Prospective/retrospective  Prospective cohort study: Subjects followed; data collection in.
Log-linear and logistic models Generalised linear model ANOVA revisited Log-linear model: Poisson distribution logistic model: Binomial distribution Deviances.
Log-linear and logistic models
Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides
Linear and generalised linear models
BIOST 536 Lecture 2 1 Lecture 2 - Modeling Need to find a model that relates the outcome to the covariates in a meaningful way  Simplification of the.
Sample Size Determination Ziad Taib March 7, 2014.
Trieschmann, Hoyt & Sommer Risk Identification and Evaluation Chapter 2 ©2005, Thomson/South-Western.
1 CSI5388 Data Sets: Running Proper Comparative Studies with Large Data Repositories [Based on Salzberg, S.L., 1997 “On Comparing Classifiers: Pitfalls.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 12: Multiple and Logistic Regression Marshall University.
Study Design and Analysis in Epidemiology: Where does modeling fit? Meaningful Modeling of Epidemiologic Data, 2010 AIMS, Muizenberg, South Africa Steve.
Thyroid Disease among A-bomb Survivors Exposed in Childhood Roy Shore, Kyoji Furukawa, Misa Imaizumi Radiation Effects Research Foundation
Analytic Epidemiology
Medical Statistics (full English class) Ji-Qian Fang School of Public Health Sun Yat-Sen University.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Multiple Choice Questions for discussion
Simple Linear Regression
Essentials of survival analysis How to practice evidence based oncology European School of Oncology July 2004 Antwerp, Belgium Dr. Iztok Hozo Professor.
Biostatistics Case Studies 2005 Peter D. Christenson Biostatistician Session 4: Taking Risks and Playing the Odds: OR vs.
Epidemiology The Basics Only… Adapted with permission from a class presentation developed by Dr. Charles Lynch – University of Iowa, Iowa City.
TWO-STAGE CASE-CONTROL STUDIES USING EXPOSURE ESTIMATES FROM A GEOGRAPHICAL INFORMATION SYSTEM Jonas Björk 1 & Ulf Strömberg 2 1 Competence Center for.
Statistical approaches to analyse interval-censored data in a confirmatory trial Margareta Puu, AstraZeneca Mölndal 26 April 2006.
CJT 765: Structural Equation Modeling Class 7: fitting a model, fit indices, comparingmodels, statistical power.
ANALYTICAL STUDIES Prospective Studies COHORT Prepared by: Dr. Sahar Sabbour Community Medicine Department.
1 Using martingale residuals to assess goodness of fit for sampled risk set data Ørnulf Borgan Department of Mathematics University of Oslo Based on joint.
Joint Effects of Radiation and Smoking on Lung Cancer Risk among Atomic Bomb Survivors Donald A. Pierce, RERF Gerald B. Sharp, RERF & NIAID Kiyohiko Mabuchi,
Statistics for clinicians Biostatistics course by Kevin E. Kip, Ph.D., FAHA Professor and Executive Director, Research Center University of South Florida,
Role of Statisticians in Follow-Up of A-Bomb Survivors Donald A. Pierce Oregon Health & Sciences Univ. Retired from Radiation Effects Res. Fndn. Slides.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Estimating Causal Effects from Large Data Sets Using Propensity Scores Hal V. Barron, MD TICR 5/06.
Sub-regional Workshop on Census Data Evaluation, Phnom Penh, Cambodia, November 2011 Evaluation of Census Data using Consecutive Censuses United.
HSRP 734: Advanced Statistical Methods July 31, 2008.
Epidemiologic design from a sampling perspective Epidemiology II Lecture April 14, 2005 David Jacobs.
CREATE Biostatistics Core THRio Statistical Considerations Analysis Plan.
Describing the risk of an event and identifying risk factors Caroline Sabin Professor of Medical Statistics and Epidemiology, Research Department of Infection.
A short introduction to epidemiology Chapter 9: Data analysis Neil Pearce Centre for Public Health Research Massey University Wellington, New Zealand.
1 Chihiro HIROTSU Meisei (明星) University Estimating the dose response pattern via multiple decision processes.
BC Jung A Brief Introduction to Epidemiology - XIII (Critiquing the Research: Statistical Considerations) Betty C. Jung, RN, MPH, CHES.
Organization of statistical research. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 11: Models Marshall University Genomics Core Facility.
Effect of the Reference Set on Frequency Inference Donald A. Pierce Radiation Effects Research Foundation, Japan Ruggero Bellio Udine University, Italy.
Instructor Resource Chapter 13 Copyright © Scott B. Patten, Permission granted for classroom use with Epidemiology for Canadian Students: Principles,
BIOSTATISTICS Lecture 2. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and creating methods.
University of Warwick, Department of Sociology, 2012/13 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard) Logistic Regression II/ (Hierarchical)
How to compute CI’s – standard approach vs floating absolute risk GCRC Breakfast Workshop Series January 16, 2004 Patrick G. Arbogast Dept. of Biostatistics.
Approaches to quantitative data analysis Lara Traeger, PhD Methods in Supportive Oncology Research.
Building Valid, Credible & Appropriately Detailed Simulation Models
1 Study Design Imre Janszky Faculty of Medicine, ISM NTNU.
Date of download: 5/31/2016 From: Tipping the Balance of Benefits and Harms to Favor Screening Mammography Starting at Age 40 Years: A Comparative Modeling.
Case control & cohort studies
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
1 Borgan and Henderson: Event History Methodology Lancaster, September 2006 Session 8.1: Cohort sampling for the Cox model.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 13: Multiple, Logistic and Proportional Hazards Regression.
Exposure Prediction and Measurement Error in Air Pollution and Health Studies Lianne Sheppard Adam A. Szpiro, Sun-Young Kim University of Washington CMAS.
Estimating standard error using bootstrap
Ch3: Model Building through Regression
Age-Time Patterns of Radiation-Related Cancer Risk
CJT 765: Structural Equation Modeling
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Induction and latency (J-F Boivin, March 2006)
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Presentation transcript:

Aarhus and the A-Bomb Survivor Studies Donald A. Pierce RERF Hiroshima

2 Connections: Aarhus Biostats and RERF M. Vaeth has been instrumental at both places Both places owe much to a “Golden Age” in biostatistics starting in about 1975 My “sideline” theoretical work: likelihood asymptotics due to Barndorff-Nielsen, Aarhus Theoretical Statistics

3 Golden Age for Biostatistics: 1975 to ???? Generalized linear models: regression using likelihood rather than least squares, frequency data Developments in survival analysis: relative risk (Cox) regression Case control methods: relation to the above, improved understanding, extensions Developments for clinical trials: use of survival analysis methods Analysis of longitudinal data: sequential/clustered observations Advances in computing: individual computers and statistical software

4 Relevance to RERF and Aarhus Biostatistics This Golden Age had much to do with successes at RERF since 1980 and no doubt a lot to do with formation/success of Aarhus Biostatistics Intriguing and rich connections between analysis of survival times and regression analysis of Poisson data ---- cross-tabulations of cases and person-years These connections are central to effectively utilizing the RERF data: cohort of 100,000, followed up for 50 years, with a wide range of radiation doses Radiation risk for cancer at a given dose is not a “number” but a pattern depending on exposure age, time since exposure, attained age, and sex

5 Overview of RERF Radiation and Cancer Study By 1980 RERF methods were oriented to testing for radiation effects rather than description of them Effects for leukemia were fairly clear, but what to expect for other cancers was almost totally unknown Would there be much effect at all? How long would it last? It turns out that a brief radiation exposure increases cancer risk for all remaining lifetime Has mechanistic implications regarding radiation and cancer; and probably for carcinogenesis in general Cancer arises largely from somatic mutations and radiation adds to these ---- effect of exposures at every age is highly informative regarding this

6 Survival analysis and case-PY tabulations Data are fundamentally times of cancer incidence, for individuals exposed at various ages and to a wide range of doses Analysis best aims at directly estimating cancer rates, rather than alternative focus on response times Data are used in terms of detailed cross-tabulations of cases and PY, according to categories of dose, exposure age, sex, time since exposure, and attained age Imprecise rate estimates {cases/PY} for thousands of tabular cells, are then smoothed through regression models using the above covariables

7 Advantages of cross-tabulation approach Dealing with 100,000 persons and 10,000 cancer cases, while giving careful attention to age-time variations in risk Dealing with departures from proportional hazards formulation: infeasible to use standard Cox regression with age-dependent covariables The initial cross-tabulation effectively carries out the heavy calculations once and for all, reducing inhibitions to extensive exploratory analysis Direct analysis from the outset of rates, rather than response times, has conceptual advantages and allows for focus on excess risk

8 After considerable development, late 1980’s view of solid cancer relative risk ERR, excess relative risk, is the % increase in age-specific cancer rate

9 Current improved understanding Much of “exposure-age” effect was attained age variation. Related to stochastics of accumulation of mutations.

10 Age-time patterns of excess rates Novel methods, clarifying the age increase and modest sex effect (factor of 2 in ERR)

11 Description of dose response: sex-avg ERR at age 60 Interesting methodological issues

12 Overall Statistical Developments at RERF The above involved major developments in approach and software Epicure: Poisson and binomial regression, direct analysis of survival times, case-control studies Rich general class of risk models, fitting either RR or absolute rates, profile likelihood calculation, etc. Includes capabilities for making complex {case, PY} tables Methodology and software essential to RERF study and used elsewhere for radiation epidemiology and other purposes Will now consider some more specific developments central to the needs

13 Some more specific developments Allowing for dose-estimation errors (with Vaeth) –Involves neglected issues regarding covariate errors Simultaneous analysis for specific types of cancer –Allowing for parameters in common and distinctive Issues specific to low-dose risks –Bias related to comparison group Theoretical stochastic analysis of accumulation of mutations (with Vaeth) –Clarifies age patterns in relative risk Joint effect of smoking and radiation –Additive vs multiplicative effects Effects of selection of cohort by survival (with Vaeth) –“Radiation-resistant survivors”: controversial issue

14 Dose estimation errors Elaborate Dosimetry System estimates based on survivor location and shielding Reasonable to assume unbiased estimation, i.e. E(estimated dose | true dose) = true dose Surprisingly to many, this does not mean that E( true | estimated) = estimated Partly because estimation does not aim to use information provided by survival of the person Need to adjust dose estimates to allow for the apparent cohort distribution of true doses

15 RERF “survivor” dose estimates We explicitly estimate distribution of true doses, then the joint distribution of (true, estimated), finally arriving at a representation E(true | estimate) = g(estimate) X estimate These are called “survivor” dose estimates, used for all purposes at RERF The “cohort-specific” issues in this are general (not restricted to where survival is an issue), and are widely neglected in dealing with covariate errors

16 Adjustment factor for typical assumptions: 35% CV for each of “estimative” and “grouping” errors

17 Idealized stochastic modeling of radiation and cancer Cancer is largely due to accumulation of somatic mutations in stem cells Radiation can cause such mutations This allows for idealized stochastic modeling of the age-time patterns of radiation risk Results agree remarkably well with what is seen in the data In particular, this can explain the decrease in RR with increasing age Aim is to provide guidance for challenging descriptive analyses

18.. tumor age of person   (history-1)  (history-2)  (history-3)   (history-4) Exposure rate d(a) During exposure period, whatever are currently in effect in the various cells are increased to Action in a given cell without exposure Mathematical model for cancer and mutagenic exposure Without exposure, very general age-homogeneous Markov process (no assumption about number of mutations)

19 Consequences of idealized model Markov states may depend arbitrarily on mutational status of the cell: allows, e.g. for “mutator phenotypes” and selective growth advantage Cancer rates following dose D should have form Since fairly generally it is found that this suggests that one might expect essentially as seen in our data

20 Risks during exposure (for other applications) Main result becomes Final term is a Jacobian in the mathematics, but also corresponds to effect of exposure causing the final- required mutation Approximating as before, leading term in RR is The extra term is important in applications: miners exposed to radon, and effect of smoking on lung cancer

21 Effect of stopping smoking: Data from (another) major epidemiological study

22 Conclusions Considerable statistical development at RERF in past 20 years Methods and pressing issues should be of interest to many biostatisticians Close connections in two respects to Aarhus Biostatistics –Personal interactions –Golden age for biostatistics