Complex Adaptive Systems and Human Health: Statistical Approaches in Pharmacogenomics Kim E. Zerba, Ph.D. Bristol-Myers Squibb FDA/Industry Statistics.

Slides:



Advertisements
Similar presentations
Statistical methods for genetic association studies
Advertisements

Genetic Approaches to Thinking, Moving and Feeling
Gene-by-Environment and Meta-Analysis Eleazar Eskin University of California, Los Angeles.
CZ5225 Methods in Computational Biology Lecture 9: Pharmacogenetics and individual variation of drug response CZ5225 Methods in Computational Biology.
Single Nucleotide Polymorphism And Association Studies Stat 115 Dec 12, 2006.
Genetic research designs in the real world Vishwajit L Nimgaonkar MD, PhD University of Pittsburgh
Multiple Comparisons Measures of LD Jess Paulus, ScD January 29, 2013.
Objectives Cover some of the essential concepts for GWAS that have not yet been covered Hardy-Weinberg equilibrium Meta-analysis SNP Imputation Review.
Discovery of a rare arboreal forest-dwelling flying reptile (Pterosauria, Pterodactyloidea) from China Wang et al. PNAS Feb. 11, 2008.
MALD Mapping by Admixture Linkage Disequilibrium.
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL FastANOVA: an Efficient Algorithm for Genome-Wide Association Study Xiang Zhang Fei Zou Wei Wang University.
Dr. Almut Nebel Dept. of Human Genetics University of the Witwatersrand Johannesburg South Africa Significance of SNPs for human disease.
Estimating “Heritability” using Genetic Data David Evans University of Queensland.
Biology and Bioinformatics Gabor T. Marth Department of Biology, Boston College BI820 – Seminar in Quantitative and Computational Problems.
Computational Tools for Finding and Interpreting Genetic Variations Gabor T. Marth Department of Biology, Boston College
Human Migrations Saeed Hassanpour Spring Introduction Population Genetics Co-evolution of genes with language and cultural. Human evolution: genetics,
Genome-Wide Association Studies
Give me your DNA and I tell you where you come from - and maybe more! Lausanne, Genopode 21 April 2010 Sven Bergmann University of Lausanne & Swiss Institute.
Human Genomic Variation The Story of SNPs. Single Nucleotide Polymorphisms (SNPs)  In addition to variation in microsatellites (VNTRs), genetic variation.
Haplotype Discovery and Modeling. Identification of genes Identify the Phenotype MapClone.
Introduction Basic Genetic Mechanisms Eukaryotic Gene Regulation The Human Genome Project Test 1 Genome I - Genes Genome II – Repetitive DNA Genome III.
Introduction to BST775: Statistical Methods for Genetic Analysis I Course master: Degui Zhi, Ph.D. Assistant professor Section on Statistical Genetics.
Broad-Sense Heritability Index
Computational research for medical discovery at Boston College Biology Gabor T. Marth Boston College Department of Biology
Case(Control)-Free Multi-SNP Combinations in Case-Control Studies Dumitru Brinza and Alexander Zelikovsky Combinatorial Search (CS) for Disease-Association:
Population Genetics: Chapter 3 Epidemiology 217 January 16, 2011.
Pharmacogenetics and Pharmacogenomics Eric Jorgenson 2/24/9.
The Complexities of Data Analysis in Human Genetics Marylyn DeRiggi Ritchie, Ph.D. Center for Human Genetics Research Vanderbilt University Nashville,
The medical relevance of genome variability Gabor T. Marth, D.Sc. Department of Biology, Boston College Medical Genomics Course – Debrecen,
©Edited by Mingrui Zhang, CS Department, Winona State University, 2008 Identifying Lung Cancer Risks.
CS177 Lecture 10 SNPs and Human Genetic Variation
Gene Hunting: Linkage and Association
Lecture 19: Association Studies II Date: 10/29/02  Finish case-control  TDT  Relative Risk.
Experimental Design and Data Structure Supplement to Lecture 8 Fall
Quantitative Genetics
Jianfeng Xu, M.D., Dr.PH Professor of Public Health and Cancer Biology Director, Program for Genetic and Molecular Epidemiology of Cancer Associate Director,
INTRODUCTION TO ASSOCIATION MAPPING
Discovery of a rare arboreal forest-dwelling flying reptile (Pterosauria, Pterodactyloidea) from China Wang et al. PNAS Feb. 11, 2008.
Lab 13: Association Genetics December 5, Goals Use Mixed Models and General Linear Models to determine genetic associations. Understand the effect.
Lecture 6. Functional Genomics: DNA microarrays and re-sequencing individual genomes by hybridization.
Allele Frequencies: Staying Constant Chapter 14. What is Allele Frequency? How frequent any allele is in a given population: –Within one race –Within.
Remainder of Chapter 23 Read the remaining materials; they address information specific to understanding evolution (e.g., variation and nature of changes)
Lecture 24: Quantitative Traits IV Date: 11/14/02  Sources of genetic variation additive dominance epistatic.
An quick overview of human genetic linkage analysis
Association analysis Genetics for Computer Scientists Biomedicum & Department of Computer Science, Helsinki Päivi Onkamo.
The International Consortium. The International HapMap Project.
The Interactions of Selection With Genetic Drift Can Be Complicated Because the Changes in p Induced By Drift are Random and Ever-Changing Three Important.
In The Name of GOD Genetic Polymorphism M.Dianatpour MLD,PHD.
An quick overview of human genetic linkage analysis Terry Speed Genetics & Bioinformatics, WEHI Statistics, UCB NWO/IOP Genomics Winterschool Mathematics.
Computational Biology and Genomics at Boston College Biology Gabor T. Marth Department of Biology, Boston College
Admixture Mapping Controlled Crosses Are Often Used to Determine the Genetic Basis of Differences Between Populations. When controlled crosses are not.
Types of genome maps Physical – based on bp Genetic/ linkage – based on recombination from Thomas Hunt Morgan's 1916 ''A Critique of the Theory of Evolution'',
Genetics of Gene Expression BIOS Statistics for Systems Biology Spring 2008.
Statistical Analysis of Candidate Gene Association Studies (Categorical Traits) of Biallelic Single Nucleotide Polymorphisms Maani Beigy MD-MPH Student.
An atlas of genetic influences on human blood metabolites Nature Genetics 2014 Jun;46(6)
Evolution of Populations. Individual organisms do not evolve. This is a misconception. While natural selection acts on individuals, evolution is only.
Evolution of Populations
약물유전체학 Pharmacogenomics Kangwon National Univ School of Medicine Hee Jae Lee PhD.
8 and 11 April, 2005 Chapter 17 Population Genetics Genes in natural populations.
Inferences on human demographic history using computational Population Genetic models Gabor T. Marth Department of Biology Boston College Chestnut Hill,
Population Genetics Measuring Evolutionary Change Over Time.
Press report 13/10/ publications selected.
MULTIPLE GENES AND QUANTITATIVE TRAITS
Consideration for Planning a Candidate Gene Association Study With TagSNPs Shehnaz K. Hussain, PhD, ScM Epidemiology 243: Molecular.
Diversity of Individuals and Evolution of Populations
MULTIPLE GENES AND QUANTITATIVE TRAITS
Genome-wide Associations
Exercise: Effect of the IL6R gene on IL-6R concentration
Medical genomics BI420 Department of Biology, Boston College
Medical genomics BI420 Department of Biology, Boston College
Presentation transcript:

Complex Adaptive Systems and Human Health: Statistical Approaches in Pharmacogenomics Kim E. Zerba, Ph.D. Bristol-Myers Squibb FDA/Industry Statistics Workshop Statistics: From Theory to Regulatory Acceptance September 2003 Bethesda, Maryland Disclaimer The views presented are my own and do not necessarily represent those of Bristol-Myers Squibb

Outline l Complex Adaptive Systems and Human Health l Approach and Some Key Statistical Issues with Genetic Polymorphisms in Pharmacogenomics l Where Do We Go from Here?

The Genetic Paradigm DNA RNA Protein Gene Phenotype Gene Disease

Simple, Monogenic < 2% Complex, Multifactorial > 98% Non-Infectious Human Disease Load

GENOME TYPE UNIQUE (Initial Conditions) UNIQUE ENVIRONMENTAL HISTORY HISTORY FUTURE NORM OF NORM OF REACTION PHYSIOLOGICAL FITNESS, HEALTH - + TIME-SPACE CONTINUUM INDIVIDUAL NOW BLOOD PRESSURE PRESSURE REGULATION REGULATION LIPID METABOLISM CARBOHYDRATE METABOLISM HAEMOSTASIS Risk of Disease Agents participate in dynamic Agents participate in dynamic network and are not direct causes network and are not direct causes Each individual is a complex Each individual is a complex adaptive system and the adaptive system and the fundamental unit of organization fundamental unit of organization See : Zerba and Sing, 1993, Current Opinion in Lipidology 4: , Zerba et al. 2000, Human Genetics 107: for more detail Network organized hierarchically Network organized hierarchically and heterarchically into fields and heterarchically into fields Health or disease is Health or disease is an emergent feature based on an emergent feature based on interactions among many interactions among many agents, including genes and agents, including genes and environments environments Fields are domains of relational Fields are domains of relational order among agents order among agents Stronger relationships within fields, Stronger relationships within fields, weaker relationships among fields weaker relationships among fields Unique genome type provides initial Unique genome type provides initial conditions and capacity for change conditions and capacity for change Context and time are key to understanding influence of genetic variation Context and time are key to understanding influence of genetic variation Complex Adaptive Systems and Human Health

Genes Biomarkers Endpoints ? ? ? Complex Adaptive Systems Approach to PGx

Some Key Statistical Issues for Pharmacogenomics Studies Using Genetic Polymorphisms  Gene/Polymorphism Selection  Linkage Disequilibrium  Admixture and Population Stratification  Invariance  Context Dependence  Time

Gene/Polymorphism Selection  Genome Scan – Genes not identified a priori – Genotyping – 25K - 500K polymorphisms genotyped for each subject (not practical yet) for each subject (not practical yet) – DNA Pooling – 25K million polymorphisms – Case-control allele frequency differences for each polymorphism  Candidate Genes

F SiSiSiSi Candidate Genes Unknown and unmeasured functionalpolymorphism One of numerous non-functionalpolymorphisms  Assume that any association of S i with phenotype, P, is because of linkage disequilibrium between F and S i is because of linkage disequilibrium between F and S i P FS = p F p S + D FS FPCandidateGeneRegion

Admixture SNP+- p + = 0.8 p + = 0.2 p + = 0.5 Population IPopulation II II AdmixedPopulation  = proportion of population I = 0.5

Consider two subpopulations, I and II: For each subpopulation, there is linkage equilibrium between a disease allele, F, and a marker allele, S, P F I S I = p F I p S I ; P F II S II = p F II p S II ; D F I S I = D F II S II = 0. P F I S I = p F I p S I ; P F II S II = p F II p S II ; D F I S I = D F II S II = 0. In the admixed population (I + II), there is linkage disequilibrium between F and S, P FS = p F p S +  p F I - p F II )(p S I - p S II ) P FS = p F p S +  p F I - p F II )(p S I - p S II ) SubpopulationsProportions Disease Allele FrequencyDifference Marker Allele FrequencyDifference AdmixtureLinkageDisequilibrium

Admixture and Population Stratification  Admixture linkage disequilibrium dissipates quickly in a randomly mating population quickly in a randomly mating population  Common clinical trial feature: > 1 ethnic group – Population stratification  Ethnicity is a confounder – Population stratification can create linkage disequilibrium just like admixture only spurious – Type I or Type II error inflation

False-Positive Endpoint Association Example  Unbalanced design – Unequal numbers of each group:  I = 0.67 – Marker allele: p + = 0.8 in ethnic group I p + = 0.2 in ethnic group II p + = 0.2 in ethnic group II – Disease risk: p F = 0.8 for ethnic group I p F = 0.2 for ethnic group II p F = 0.2 for ethnic group II Not considered in analysis

Population Genetic Structure and the Search for Functional Mutations: Quantitative Traits FREQUENCY and SCALE contribute to inferences about SNP- phenotype associations: SCALE Genotype Phenotype(Biomarker) ? SNP FREQUENCY AA Aa aa FunctionalMutation? SSR =  f i (Y i - Y) 2 Analysis of Variance Approach ?

papapapa pApApApA AA Aa aa P Aa Aa SNP Population Stratification and Genotype Frequencies  Stratification can result in decreased heterozygote decreased heterozygote frequencies relative to frequencies relative to expectation: expectation: Ethnic Group II Ethnic Group I AverageGenotypeFrequencies P aa P AA P Aa = 2p A p a - 2D A (D A positive in example)

DADADADA Sum of Squares Bias + -  Population stratification can result in overestimation of quantitative phenotypic overestimation of quantitative phenotypic variation associated with genetic variation relative variation associated with genetic variation relative to Hardy-Weinberg equilibrium expectation to Hardy-Weinberg equilibrium expectation D A --> +

An example from Apolipoprotein E Biology   Molecular weight: 34 kD   Synthesized in most organs – – liver, brain, gonads, kidney, spleen, muscle   Key physiological role in lipid transport – – ligand for the LDL (ApoB-E) receptor   Structural gene on chromosome 19 – – polymorphic with three common alleles  2  3  4 AA 112 Cys Cys Arg AA 158 Cys Arg ArgSNP SNP Invariance, Context and Time 5’ 3’ Note: combination of SNPs involved

Quebec, Canada N = 201 Nancy, France N = 223 Munster, Germany N = 1000 Helsinki, Finland N=207 Rochester, MN, USA N=226 Cholesterol (mg/dL) 22 33 44 Invariance Alleles From Sing et al. (1996) Genetic architecture of common multifactorial diseases, pp In:Chadwick and Cardew (eds.) Variation in the human genome, Ciba Foundation Symposium 197, John Wiley & Sons, New York

Variance x Age Window Midpoint (years) Age Window Midpoint (years) Bootstrap Significance Tests 2222A > P P < 0.10  P < 0.05 Changes in ApoE Additive Genetic Variance with Age Context and Time Rochester, MN Males, N=1035 From Zerba et al. 1996, Genetics 143:

Where Do We Go From Here?  Study design in genetic setting  Genetic stratification  Genomic control  Ascertainment bias correction in choice of which polymorphisms to study  Contexts/Interactions-- which ones are important?  New analytical methods needed – Combinations of SNPs within and among genes and environments may be involved may be involved – Haplotype Reconstruction – Combinatorial Partitioning  Missing genotypes for individual polymorphisms  Sampling vs technical variability in DNA pooling studies  Multiplicity-- p-value adjustment not a trivial problem Some Additional Statistical Challenges