Bernard Keavney Institute of Human Genetics University of Newcastle, UK. Recent developments in genetic epidemiology relevant to PURE.

Slides:



Advertisements
Similar presentations
Single Nucleotide Polymorphism And Association Studies Stat 115 Dec 12, 2006.
Advertisements

Julia Krushkal 4/11/2017 The International HapMap Project: A Rich Resource of Genetic Information Julia Krushkal Lecture in Bioinformatics 04/15/2010.
Genetic Analysis in Human Disease
Genome-wide Association Study Focus on association between SNPs and traits Tendency – Larger and larger sample size – Use of more narrowly defined phenotypes(blood.
Polymorphisms: Clinical Implications By Amr S. Moustafa, M.D.; Ph.D. Assistant Prof. & Consultant, Medical Biochemistry Dept. College of Medicine, KSU.
Understanding GWAS Chip Design – Linkage Disequilibrium and HapMap Peter Castaldi January 29, 2013.
Association Mapping David Evans. Outline Definitions / Terminology What is (genetic) association? How do we test for association? When to use association.
Signatures of Selection
Outline to SNP bioinformatics lecture
Genomics An introduction. Aims of genomics I Establishing integrated databases – being far from merely a storage Linking genomic and expressed gene sequences.
CS177 Lecture 9 SNPs and Human Genetic Variation Tom Madej
Dr. Almut Nebel Dept. of Human Genetics University of the Witwatersrand Johannesburg South Africa Significance of SNPs for human disease.
Predicting the Function of Single Nucleotide Polymorphisms Corey Harada Advisor: Eleazar Eskin.
Something related to genetics? Dr. Lars Eijssen. Bioinformatics to understand studies in genomics – São Paulo – June Image:
Genomewide Association Studies.  1. History –Linkage vs. Association –Power/Sample Size  2. Human Genetic Variation: SNPs  3. Direct vs. Indirect Association.
Polymorphisms – SNP, InDel, Transposon BMI/IBGP 730 Victor Jin, Ph.D. (Slides from Dr. Kun Huang) Department of Biomedical Informatics Ohio State University.
SNPs DNA differs between humans by 0.1%, (1 in 1300 bases) This means that you can map DNA variation to around 10,000,000 sites in the genome Almost all.
Restriction Fragment Length Polymorphisms (RFLPs) By Amr S. Moustafa, M.D.; Ph.D. Assistant Prof. & Consultant, Medical Biochemistry Dept. College of.
Course Overview Personalized Medicine: Understanding Your Own Genome Fall 2014.
Selecting TagSNPs in Candidate Genes for Genetic Association Studies Shehnaz K. Hussain, PhD, ScM Assistant Professor Department of Epidemiology, UCLA.
Haplotype Discovery and Modeling. Identification of genes Identify the Phenotype MapClone.
Genome Variations & GWAS
Introduction Basic Genetic Mechanisms Eukaryotic Gene Regulation The Human Genome Project Test 1 Genome I - Genes Genome II – Repetitive DNA Genome III.
Genetic Analysis in Human Disease. Learning Objectives Describe the differences between a linkage analysis and an association analysis Identify potentially.
Epigenome 1. 2 Background: GWAS Genome-Wide Association Studies 3.
Introduction to BST775: Statistical Methods for Genetic Analysis I Course master: Degui Zhi, Ph.D. Assistant professor Section on Statistical Genetics.
HapMap: application in the design and interpretation of association studies Mark J. Daly, PhD on behalf of The International HapMap Consortium.
The medical relevance of genome variability Gabor T. Marth, D.Sc. Department of Biology, Boston College
Computational research for medical discovery at Boston College Biology Gabor T. Marth Boston College Department of Biology
Doug Brutlag 2011 Genomics & Medicine Doug Brutlag Professor Emeritus of Biochemistry &
National Taiwan University Department of Computer Science and Information Engineering Haplotype Inference Yao-Ting Huang Kun-Mao Chao.
The medical relevance of genome variability Gabor T. Marth, D.Sc. Department of Biology, Boston College Medical Genomics Course – Debrecen,
©Edited by Mingrui Zhang, CS Department, Winona State University, 2008 Identifying Lung Cancer Risks.
Biology 101 DNA: elegant simplicity A molecule consisting of two strands that wrap around each other to form a “twisted ladder” shape, with the.
Molecular & Genetic Epi 217 Association Studies
CS177 Lecture 10 SNPs and Human Genetic Variation
SNP Haplotypes as Diagnostic Markers Shrish Tiwari CCMB, Hyderabad.
Gene Hunting: Linkage and Association
From Genome-Wide Association Studies to Medicine Florian Schmitzberger - CS 374 – 4/28/2009 Stanford University Biomedical Informatics
Genome-Wide Association Study (GWAS)
National Taiwan University Department of Computer Science and Information Engineering Pattern Identification in a Haplotype Block * Kun-Mao Chao Department.
Complement Factor H Polymorphism in Age- Related Macular Degeneration* *Klein RJ, et al. Science. 2005; 308:
Genome Biology and Biotechnology 4. The variable human genome Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute for.
10cM - Linkage Mapping Set v2 ABI Median intermarker distance: 4.7 Mb Mean intermarker distance: 5.6 Mb Mean genetic gap distance: 8.9 cM Average Heterozygosity.
Finnish Genome Center Monday, 16 November Genotyping & Haplotyping.
Polymorphism Haixu Tang School of Informatics. Genome variations underlie phenotypic differences cause inherited diseases.
Julia N. Chapman, Alia Kamal, Archith Ramkumar, Owen L. Astrachan Duke University, Genome Revolution Focus, Department of Computer Science Sources
ABC for the AEA Basic biological concepts for genetic epidemiology Martin Kennedy Department of Pathology Christchurch School of Medicine.
Lecture 6. Functional Genomics: DNA microarrays and re-sequencing individual genomes by hybridization.
MEME homework: probability of finding GAGTCA at a given position in the yeast genome, based on a background model of A = 0.3, T = 0.3, G = 0.2, C = 0.2.
An quick overview of human genetic linkage analysis
February 20, 2002 UD, Newark, DE SNPs, Haplotypes, Alleles.
The International Consortium. The International HapMap Project.
In The Name of GOD Genetic Polymorphism M.Dianatpour MLD,PHD.
An quick overview of human genetic linkage analysis Terry Speed Genetics & Bioinformatics, WEHI Statistics, UCB NWO/IOP Genomics Winterschool Mathematics.
Motivations to study human genetic variation
Admixture Mapping Controlled Crosses Are Often Used to Determine the Genetic Basis of Differences Between Populations. When controlled crosses are not.
Analyzing DNA using Microarray and Next Generation Sequencing (1) Background SNP Array Basic design Applications: CNV, LOH, GWAS Deep sequencing Alignment.
Signals of natural selection in the HapMap project data The International HapMap Consortium Gil McVean Department of Statistics, Oxford University.
1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.
Inferences on human demographic history using computational Population Genetic models Gabor T. Marth Department of Biology Boston College Chestnut Hill,
SNP Detection Congtam Pham 2/24/04 Dr. Marth’s Class.
Nucleotide variation in the human genome
Gene Hunting: Design and statistics
Itsik Pe’er, Yves R. Chretien, Paul I. W. de Bakker, Jeffrey C
Medical genomics BI420 Department of Biology, Boston College
Medical genomics BI420 Department of Biology, Boston College
Haplotypes When the presence of two or more polymorphisms on a single chromosome is statistically correlated in a population, this is a haplotype Example.
Selecting a Maximally Informative Set of Single-Nucleotide Polymorphisms for Association Analyses Using Linkage Disequilibrium  Christopher S. Carlson,
Presentation transcript:

Bernard Keavney Institute of Human Genetics University of Newcastle, UK. Recent developments in genetic epidemiology relevant to PURE

Objectives Brief revision of some genetic “basics” Developments in genetic markers and genotyping technology Ethnicity, genetic variation and disease The potential impact of rare variants on common diseases: epidemiological and technological challenges.

Monogenic HCM, LQTS (disease genes) Genetic contribution to cardiovascular diseases genes environment (large-effect susceptibility genes) oligogenic Non-genetic Congenital HD Hypertension T II DM Atherosclerosis (small-effect susceptibility genes) polygenic

Common variants which affect human diseases HLA: Autoimmunity and infection APOE4: Alzheimer’s, CHD, lipids FV Leiden: Venous thrombosis PPARG: Type II Diabetes KCJN11: Type II Diabetes PTPN22: RhA, Type 1 Diabetes Insulin: Type I Diabetes NOD2: Crohn’s disease CF-H: Age-related MD RET: Hirschprung disease

Candidate gene association studies: a uniquely non-replicable area of science Six of 166 replicated in >75% of studies (4%)* Study sizes too small Statistical significance levels not stringent enough Meta-analyses: problem of publication bias Most conducted in urban Western Caucasian populations Minimal environmental heterogeneity within individual studies Minimal amount of “gene space” tested *Hirschhorn et al. Genet. Med. 2002

Genome figures The human genome: 3,200,000,000 base pairs 5% gene coding regions (1% expressed sequence) Noncoding regulatory elements are situated near genes 20,000 genes Any two genomes: 99.9% identical 3.2M differences between any two individuals 11,000,000 sites vary in at least 1% of the world’s population (Polymorphisms) Every site compatible with life has been mutated several times in this generation alone

Single nucleotide polymorphisms (SNPs): the mapping tool for association studies CAACTGTGTAGGTTGAG CAACTGTGTTGGTTGAG Between 2000 and million SNPs have been identified. For mapping, focus hitherto on common SNPs (MAF > 0.05): ancient power to detect given effect greater 90% of human variation is due to common alleles Most common variants are found in all world populations Technology to find rare variants has not been available thus far Expect one common SNP every ~600 bp Total of 7M genomewide……Which ones to type? And how many? Coding (amino acid change) Minority Noncoding Some regulatory

SNPs in dbSNP

The degree of association between a disease allele and a marker allele determines power Disease Causal SNP Marker SNP Testing two associations in one. DHDHABBADHDHABBA The arrangement of two or more alleles on a chromosome is called a haplotype Locus 1 Locus 2

The degree of association between a disease allele and a marker allele determines power Disease Causal SNP Marker SNP Testing two associations in one. DHDHABBADHDHABBA The arrangement of two or more alleles on a chromosome is called a haplotype Locus 1 Locus 2

Chromosomes are mosaics reflecting ancestral haplotypes

ACE gene diagram Position of 10 polymorphisms typed at the ACE locus 2 10 haplotypes could be generated from these genotypes

T A T A T C G I A 3 T A T A T T G I A 3 T A T A T C A I A 3 C C C T C C A D G 2 C C C T C C G D G 2 C A D G 2T A C A T C A D G 2T A T A T. Clade A Clade B Clade C X Keavney et al 1998

Oct 2005: Characterisation of most of the common genetic variation present genomewide in four world populations

HapMap project Phase I: 1 common SNP (MAF>0.05) every 5 Kb in 269 DNA samples (1 million SNPs) Yoruba from Ibadan, Nigeria European ancestry from Utah, US Han Chinese from Beijing Japanese from Tokyo 10 x 500Kb regions Resequenced in 48 individuals All SNPs genotyped in 269 samples Phase II : 4 million common SNPs Goal: to assess feasibility of whole-genome association studies and provide the “road map”of SNPs to type

HapMap phase I data

Recombination rates, haplotype lengths and gene location Chromosome 9q13

The POMC gene Intron 1 (3709bp) Exon 1 (85bp) Exon 2 (151bp) Intron 2 (2887bp) Exon 3 (833bp) RsaIC1032GC8246T There are no common polymorphisms in the translated sequence 5’ Baker et al Diabetes 2005

WHR adjusted for age, sex, smoking, alcohol, exercise, with or without BMI Difference 0.2 SD per allele. P=0.003 for C1032G; p=NS for RsaI N=1426 P< Means (95% CIs) Baker et al. Diabetes 2005

Genome-wide association studies are feasible: HapMap data

Chip-based genotyping provides the possibility to type 500,000 SNPs in a single individual today. Chip-based WGA study using 116,204 SNPs identified the role of Factor H in AMD (Klein et al. April 2005)

The within-population component of genetic variation accounts for most of human genetic diversity Rosenberg et al. Science individuals from 52 populations; 377 autosomal microsatellites 47% of 4199 alleles present in all regions 7% alleles region-specific; median q=0.01

Few SNPs rare in one panel are common in another HapMap 2005

Ioannidis et al. Nat Genet Heterogeneity of allele frequencies and disease O.R.s in meta-analyses of 43 gene-disease associations I 2 =75% shown by red line

Disease-causing variants: common or rare alleles? With a few exceptions (e.g. ACE I/D and plasma ACE) this is empirically confirmed

20Kb shown All common haplotypes at LEP are captured by these markers C538T is a rare allele (q<0.01) Leptin gene polymorphisms and cardiovascular risk Gaukrodger et al. 2005

LEP C538T polymorphism, arterial stiffness and carotid IMT TraitEstimate (SE)95% CI Pulse pressureDisplacement*1.00 (0.31)0.39 – 1.61 Polygenic h2 $ 0.24 (0.06)0.12 – 0.36 Mean IMTDisplacement0.90 (0.36)0.19 – 1.61 Polygenic h20.20 (0.07)0.06 – 0.34 Residual correlation 0.13 (0.04)0.04 – 0.21 Gaukrodger et al. JMG 2005

Rare alleles with large effect contribute to HDL cholesterol variation in the “normal range” APOA1 ABCA1 LCAT Sequenced Coding Region 128 High HDLC (>95%) 128 Low HDLC (<5%) Low HDLC High HDLC Var +213 Var Variants affected function Replicated in 2 nd population No association between HDLC and common variants in these genes 1/6 of those with HDLC <5% had a mutation These would be missed by a “common variant only” strategy Cohen et al. Science 2004

High-throughput sequencing technologies from September 2005 issues of “Science” and “Nature”

Conclusions Technological progress is very rapid: prospect of WGA scans on large numbers of samples in near future Many studies (eg UK Biobank) focus on gene- environment interaction but often environmental heterogeneity is minimal There remains a pressing need to describe and validate genetic associations with CVD in populations other than US and Western European Caucasians