PHOEBE UK Biobank: how big is “big”? Paul Burton Dept of Health Sciences Dept of Genetics University of Leicester.

Slides:



Advertisements
Similar presentations
Study Designs in Epidemiologic
Advertisements

Summary points with interesting prospects IRB – handling samples in different areas/countries Opportunities for young researchers Who reviews grants and.
METHODS FOR HAPLOTYPE RECONSTRUCTION
Epidemiologic study designs
Estimation and Reporting of Heterogeneity of Treatment Effects in Observational Comparative Effectiveness Research Prepared for: Agency for Healthcare.
MALD Mapping by Admixture Linkage Disequilibrium.
Descriptive Research in Physical Activity Epidemiology Descriptive Research in Physical Activity Epidemiology CHAPTER 1 CHAPTER 11.
 What Problems Cause Distress and Impair Functioning?  What Problems Cause Distress and Impair Functioning?  Why do People Behave in Unusual Ways?
COHORT AND CASE-CONTROL DESIGNS Dr. N. Birkett, Department of Epidemiology & Community Medicine, University of Ottawa SUMMER COURSE: INTRODUCTION TO EPIDEMIOLOGY.
Manish Chaudhary BPH, MPH
Introduction to Molecular Epidemiology Jan Dorman, PhD University of Pittsburgh School of Nursing
COHORT STUDY DR. A.A.TRIVEDI (M.D., D.I.H.) ASSISTANT PROFESSOR
Study Design and Analysis in Epidemiology: Where does modeling fit? Meaningful Modeling of Epidemiologic Data, 2010 AIMS, Muizenberg, South Africa Steve.
U.S. Department of Health and Human Services National Institutes of Health National Heart, Lung, and Blood Institute THE NIH INITIATIVES AND PLANS FOR.
Data quality/usability and population -based biobanks Paul Burton Dept of Health Sciences Dept of Genetics University of Leicester.
Understanding Society: the UK Household Longitudinal Study Professor Vernon Gayle ISER, Essex University
Epidemiologic Study Designs Nancy D. Barker, MS. Epidemiologic Study Design The plan of an empirical investigation to assess an E – D relationship. Exposure.
Outlining a CONSORT statement Methods – Extension to biobank studies.
1 Chapter 8 Case-Control Studies. 2 Chapter Outline 8.1 Introduction 8.2 Identification of cases and controls 8.3 Obtaining information on exposure 8.4.
Research Study Design and Analysis for Cardiologists Nathan D. Wong, PhD, FACC.
Systematic reviews of genetic association studies Robert Walton Fiona Fong 15 March 2013.
P3G: an international consortium in Human Genome Epidemiology
Overview of EU medical genetics and population genomics research Milan Macek Prof. MD, DSc Charles University Prague Czech Republic St. Petersburg October.
Techniques for Data Linkage and Anonymisation – A Funders View Turing Gateway Meeting 23 rd October 2014 Dr Mark Pitman.
Study Designs Afshin Ostovar Bushehr University of Medical Sciences Bushehr, /4/20151.
 Is there a comparison? ◦ Are the groups really comparable?  Are the differences being reported real? ◦ Are they worth reporting? ◦ How much confidence.
Genetic-Environmental Interaction: Implications for Osteoporosis Prevention Strategies.
Study Designs in Epidemiologic
Early Genetics and Lifecourse Epidemiology (EAGLE) consortium CM van Duijn.
The Complexities of Data Analysis in Human Genetics Marylyn DeRiggi Ritchie, Ph.D. Center for Human Genetics Research Vanderbilt University Nashville,
Genetic Databases International Collaboration and Secondary Uses Pr Bartha Maria Knoppers Canada Research Chair in Law and Medicine Genetics and Society.
Discussion for a statement for biobank and cohort studies in human genome epidemiology John P.A. Ioannidis, MD International Biobank and Cohort Studies.
Barriers and Tools to the Present and Future of Population Genetics Pr Bartha Maria Knoppers Canada Research Chair in Law and Medicine HGM 2006.
Wisconsin Genomics Initiative W isconsin M edical R esearch T riangle Wisconsin Medical Discovery Triangle.
Jianfeng Xu, M.D., Dr.PH Professor of Public Health and Cancer Biology Director, Program for Genetic and Molecular Epidemiology of Cancer Associate Director,
Basic concept of clinical study
Public Population Projects in Genomics International Working Groups Working Meeting September th, 2005, Hinxton, UK.
Overview of Study Designs. Study Designs Experimental Randomized Controlled Trial Group Randomized Trial Observational Descriptive Analytical Cross-sectional.
Study designs. Kate O’Donnell General Practice & Primary Care.
Professor Bill Ollier Combining the strengths of UMIST and The Victoria University of Manchester.
Organization of statistical research. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and.
C2BAT: Using the same data set for screening and testing. A testing strategy for genome-wide association studies in case/control design Matt McQueen, Jessica.
Descriptive study design
BIOSTATISTICS Lecture 2. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and creating methods.
Leicester Warwick Medical School Health and Disease in Populations Revision Paul Burton.
Realising MRC’s Vision in Health and Bioinformatics MRC Open Council Meeting July 2014 Janet Valentine Head of Population Health and Informatics.
Power and Meta-Analysis Dr Geraldine M. Clarke Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015 Africa Centre for.
Measures of disease frequency Simon Thornley. Measures of Effect and Disease Frequency Aims – To define and describe the uses of common epidemiological.
Evaluating the UK Breast Screening Programme Study design and practicalities Louise Johns, Sue Moss, Tony Swerdlow Department of Health Cancer Screening.
Genetics as a determinant of health: new challenges for epidemiology
Some epidemiological principles and methods
Born In Bradford Neil Small Professor of Health Research School of Health Studies University of Bradford 25 Trinity Road, Bradford, BD5 OBB Tel:
METHODS SECTION OF A RESEARCH PROPOSAL
Epidemiology and Genomics Research Program
Descriptive study design
Statistical modelling of complex biological mechanisms by combining multiple data sources: an example on gene environment interactions Prof. Jeanine. J.
Epidemiologic Measures of Association
Is the freedom from Cognitive Impairment really at hand?
Biostatistics Case Studies 2016
Design of Clinical Trials
Longitudinal Designs.
Marker heritability Biases, confounding factors, current methods, and best practices Luke Evans, Matthew Keller.
Copyright © 2003 Delmar Learning, a Thomson Learning company
Lecture 1: Fundamentals of epidemiologic study design and analysis
11/20/2018 Study Types.
Wisconsin Genomics Initiative
Interpreting Epidemiologic Results.
EPIDEMIOLOGY AS A TOOL TO EVALUATE QUALITY OF CARE
Generating reliable evidence on the determinants of NCDs
Presentation transcript:

PHOEBE UK Biobank: how big is “big”? Paul Burton Dept of Health Sciences Dept of Genetics University of Leicester

PHOEBE Structure of talk  What is UK Biobank?  How big is “big”?  Expected event rates in UK Biobank  Biobank harmonisation

PHOEBE What is UK Biobank?

PHOEBE  A prospective cohort study  500,000 adults across UK  Middle aged (40-69 years)  A population-based biobank Not disease or exposure based  “Broad spectrum” not “fully representative”  Individuals not families  MRC, Wellcome Trust, DH, Scottish Executive £61M Basic design features

PHOEBE  Longitudinal health tracking  Nested case-control studies  Long time-horizon  Owned by the Nation  Central Administration – Manchester PI: Prof Rory Collins - Oxford  6 collaborating groups (RCCs) of university scientists Basic design features

PHOEBE  Primary justifications Roles that can best be fulfilled by a new large cohort study of the type represented by UK Biobank  Secondary justifications Roles that could be provided by other types of study, but given that UK Biobank is to go ahead anyway these additional roles can be taken on at relatively low marginal cost Justification for UK Biobank

PHOEBE A platform for research in biomedical science  Studies of the joint effects of genes and environment/life-style  Genotype-based studies***  The genetics of disease progression  Direct association of genes with disease***  Universal controls  Family-based studies

PHOEBE How big is “big”? With Anna Hansell, Imperial College

PHOEBE  Contemporary pre-eminence of genetic association studies rather than genetic linkage studies  Covers both stand-alone case-control studies, and nested case-control studies in large cohorts  Sample size determining in both settings The statistical power of case-control studies

PHOEBE Power calculations  Work with the least powerful setting Binary disease, binary genotype, binary environmental exposure  Logistic regression; interactions = departure from a multiplicative model  Complexity (arbitrary but realistic)

PHOEBE Summarise power using MDORs calculated by ‘iterative simulation’  Estimate minimum ORs detectable with 80% power at stated level of statistical significance under specified scenario

PHOEBE Summary of results  80% power for genotype frequency = 0.1 Genetic main effect  1.5, p=10 -4  5,000 cases Genetic main effect  1.3, p=10 -4  10,000 cases Genetic main effect  1.2, p=10 -4  20,000 cases Genetic main effect  1.4, p=10 -7  10,000 cases Genetic main effect  1.3, p=10 -7  20,000 cases G:E interaction with environmental exposure prevalance = 0.2  2.0, p=10 -4  20,000 cases

PHOEBE Expected event rates in UK Biobank With Anna Hansell, Imperial College

PHOEBE Taking account of  Age range at recruitment years  Recruitment over 5 years  All cause mortality  Disease incidence (“healthy cohort effect”)  Migration overseas  Withdrawal from the study

PHOEBE

Smaller sample sizes

PHOEBE Conclusions  Having taken account of realistic bioclinical complexity, a cohort-based biobank such as UK Biobank needs to be very large if it is to provide a stand-alone infrastructure  Anything much less than 500,000 recruits will severely curtail the number of diseases that will be able to be studied based on that biobank alone  The value of any biobank will be greatly augmented if it proves possible to set up a coherent and scientifically harmonised international network of biobanks and large cohort studies

PHOEBE Harmonising biobanks internationally

PHOEBE Why harmonise?  Investigate less common (but not rare!!!) conditions UKBB: Ca stomach 2,500 cases in 29 years  6 UKBB equivalents:  10,000 cases in 20 years  Investigate smaller ORs GME 1.5  1.2 requires 5,000  20,000  4 UKBB equivalents  Analysis based on subsets – homogeneous classes of phenotype, or e.g. by sex

PHOEBE Why harmonise?  Earlier analyses UKBB: Alzheimers disease, 10,000 cases in 18 yrs  5 UKBB equivalents  9 years  Events at younger ages  Broad range of environmental exposures  Aim for 4-6 UKBB equivalents 2M – 3M recruits

PHOEBE International biobank harmonisation programs  Public Population Program in Genomics (P 3 G) Tom Hudson, Bartha Knoppers, Leena Peltonen …  Population Biobanks FP6 Co-ordination Action (PHOEBE – Promoting Harmonization Of Epidemiological Biobanks in Europe) Camilla Stoltenberg, Leena Peltonen, Paul Burton …  Human Genome Epidemiology Network (HuGENet) Muin Khoury, Julian Little …

PHOEBE Extra slides

PHOEBE Hattersley AT, McCarthy MI. Lancet 2005;366: Examples of some polymorphisms or haplotypes that have shown consistent association with complex disease

PHOEBE Genetic main effects

PHOEBE Whole genome scan  Genetic main effect, p<10 -7

PHOEBE Gene-environment interaction  20,000 cases

PHOEBE Gene-environment interaction  10,000 cases

PHOEBE Rarer genotypes  Genetic main effects

PHOEBE Proposed assessment visit model Welcome5 Consent5 Blood/Urine10 Touchscreen Questionnaire25 Interviewer Questionnaire5 Physical Measures15 Exit5 TOTAL70

PHOEBE Taking account of  Age range at recruitment years  Recruitment over 5 years  All cause mortality  Disease incidence (“healthy cohort effect”)  Migration overseas  Comprehensive withdrawal (max 1/500 p.a.)  Partial withdrawal ( c.f Birth Cohort)

PHOEBE

Necessary to contact subjects

PHOEBE Issues that are often ignored in standard power calculations  Multiple testing/low prior probability of association*  Interactions*  Unobserved frailty  Misclassification* Genotype Environmental determinant Case-control status  Subgroup analyses*  Population substructure

PHOEBE Harmonisation  Prospective  Retrospective Description Comparison Harmonised synthesis

PHOEBE