A single-nucleotide polymorphism tagging set for human drug metabolism and transport Kourosh R Ahmadi, Mike E Weale, Zhengyu Y Xue, Nicole Soranzo, David.


A single-nucleotide polymorphism tagging set for human drug metabolism and transport
Kourosh R Ahmadi, Mike E Weale, Zhengyu Y Xue, Nicole Soranzo, David P Yarnall, James D Briley, Yuka Maruyama, Mikiro Kobayashi, Nicholas W Wood, Nigel K Spurr, Daniel K Burns, Allen D Roses, Ann M Saunders & David B Goldstein
Nature Genetics 37 (January 2005)
Presented by Navdeep

Outline
– Background
– Aims
– Methods
– Results
– Conclusions
– References

Tagging SNPs
a, SNPs. Four versions of the same chromosome region in different people, showing three bases where variation occurs. Each SNP has two possible alleles; the first SNP in panel a has the alleles C and T.
b, Haplotypes. A haplotype is a particular combination of alleles at nearby SNPs. For this region, most of the chromosomes in a population survey turn out to have haplotypes 1–4.
c, Tag SNPs. Genotyping just the three tag SNPs out of the 20 SNPs is sufficient to identify these four haplotypes uniquely.
Tag SNP: a SNP, or a set of SNPs, selected on the basis of linkage disequilibrium (LD) patterns to represent other SNPs.
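The idea in panel c, that a few SNPs can be enough to tell the common haplotypes apart, can be illustrated with a small sketch. Everything here is an assumption made for illustration: the four toy haplotypes, the function names, and the exhaustive search itself, which looks for the smallest set of positions giving each haplotype a unique signature. It is not the haplotype r² criterion used in the paper.

```python
from itertools import combinations

def distinguishes(haplotypes, positions):
    """True if the alleles at `positions` give every distinct haplotype a unique signature."""
    signatures = {tuple(h[p] for p in positions) for h in haplotypes}
    return len(signatures) == len(set(haplotypes))

def smallest_tag_set(haplotypes):
    """Exhaustively search for the smallest set of SNP positions that separates all haplotypes."""
    n_snps = len(haplotypes[0])
    for size in range(1, n_snps + 1):
        for positions in combinations(range(n_snps), size):
            if distinguishes(haplotypes, positions):
                return positions
    return tuple(range(n_snps))

# Hypothetical haplotypes over six biallelic SNPs (made-up data, not from the slide).
toy_haplotypes = ["ACGTAC", "GTGTAC", "ACACAC", "GTACGT"]

print(smallest_tag_set(toy_haplotypes))  # -> (0, 2)
```

In this toy data SNPs 0 and 1 always travel together, as do SNPs 2 and 3, so typing one SNP from each pair already identifies all four haplotypes. The exhaustive search is only practical for a handful of SNPs.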

Tagging SNPs
Advantages of tagging SNPs
– Fewer SNPs can be used to construct a genome-wide linkage disequilibrium map.
Potential problems
– How well do the selected tags represent undetected variation in the original sample?
– How well will the tags represent variation (both detected and undetected) in a new sample from the same population?
– How well do they represent different populations having different LD patterns?

The ability of tSNP sets to tag dropped SNPs

Aims Identify and evaluate tSNPs for genes involved in the absorption, distribution, metabolism and excretion of drugs (ADME genes)

Materials and Methods
– Selection of tSNPs: haplotype r² criterion
– Evaluation of tSNPs: dropped-SNP-plus-resampling approach
– Comparison of SNPs with different MAF: two-tailed Wilcoxon paired-sample rank test
– Genes, SNP selection and choice of populations: a target density of 1 SNP of MAF > 10% per 2 kb of genomic DNA
– Gene clusters: no two genes from a cluster to be separated by more than 50 kb
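A rough sketch of the dropped-SNP part of the evaluation, under stated assumptions: each SNP is removed in turn, tags are chosen from the remaining SNPs, and the tags are scored on how well they predict the removed SNP. The greedy selection rule, the 0.8 threshold, the toy chromosomes and all function names are hypothetical. The resampling step is omitted, and the paper's actual selection procedure is not reproduced here.

```python
from collections import defaultdict

def r2(chromosomes, tags, target):
    """Share of variance in the target SNP (coded 0/1) explained by tag-defined groups.
    Assumes the target SNP is polymorphic in the sample."""
    groups = defaultdict(list)
    for chrom in chromosomes:
        groups[tuple(chrom[t] for t in tags)].append(chrom[target])
    values = [chrom[target] for chrom in chromosomes]
    mean = sum(values) / len(values)
    ss_total = sum((v - mean) ** 2 for v in values)
    ss_within = sum((v - sum(g) / len(g)) ** 2 for g in groups.values() for v in g)
    return 1.0 - ss_within / ss_total

def greedy_tags(chromosomes, candidates, threshold):
    """Hypothetical greedy rule: add the SNP that most raises the summed r2
    until every candidate SNP reaches the threshold."""
    tags = []
    def worst(current):
        return min(r2(chromosomes, current, s) for s in candidates)
    while worst(tags) < threshold and len(tags) < len(candidates):
        unused = [s for s in candidates if s not in tags]
        best = max(unused, key=lambda s: sum(r2(chromosomes, tags + [s], c) for c in candidates))
        tags.append(best)
    return tags

def dropped_snp_scores(chromosomes, threshold=0.8):
    """Drop each SNP in turn, select tags without it, then score the tags on the dropped SNP."""
    n_snps = len(chromosomes[0])
    scores = {}
    for dropped in range(n_snps):
        remaining = [s for s in range(n_snps) if s != dropped]
        tags = greedy_tags(chromosomes, remaining, threshold)
        scores[dropped] = r2(chromosomes, tags, dropped)
    return scores

# Made-up sample of eight chromosomes typed at three SNPs (alleles coded 0/1).
toy_chromosomes = [(0, 0, 0), (0, 0, 0), (0, 0, 1), (0, 0, 1),
                   (1, 1, 1), (1, 1, 1), (1, 1, 1), (1, 1, 1)]

for snp, score in dropped_snp_scores(toy_chromosomes).items():
    print(f"dropped SNP {snp}: r2 = {score:.3f}")
```

In the toy sample SNPs 0 and 1 are perfectly correlated, so either one is fully recovered when dropped. SNP 2 is only partly predicted by the others, which is the kind of undetected variation the dropped-SNP test is meant to expose.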

Haplotype r²
Haplotype r² is the coefficient of determination (i.e., the proportion of explained variation) obtained from a standard linear regression of the allelic state (coded 0/1) of the SNP in question against the haplotypes determined by the tSNP set. This regression is equivalent to a one-way analysis of variance with each tSNP-defined haplotype as a separate group. It allows assessment of the loss of power that results from typing a tSNP rather than the causal variant with which it is associated.
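The definition above can be turned into a short calculation. The sketch below uses the one-way analysis-of-variance form: chromosomes are grouped by their alleles at the tag SNPs, and r² is one minus the within-group sum of squares divided by the total sum of squares. It assumes phased haplotypes are already available as tuples of 0/1 alleles; the function name and the eight-chromosome sample are made up for illustration.

```python
from collections import defaultdict

def haplotype_r2(chromosomes, tag_positions, target):
    """Coefficient of determination of the target SNP (coded 0/1) on tag-defined haplotype groups."""
    groups = defaultdict(list)
    for chrom in chromosomes:
        key = tuple(chrom[p] for p in tag_positions)  # haplotype over the tag SNPs
        groups[key].append(chrom[target])
    values = [chrom[target] for chrom in chromosomes]
    mean = sum(values) / len(values)
    ss_total = sum((v - mean) ** 2 for v in values)
    if ss_total == 0:
        raise ValueError("target SNP is monomorphic; r2 is undefined")
    # Variation left unexplained once each tag-defined haplotype has its own mean.
    ss_within = sum((v - sum(g) / len(g)) ** 2 for g in groups.values() for v in g)
    return 1.0 - ss_within / ss_total

# Made-up sample: eight chromosomes typed at three SNPs.
sample = [(0, 0, 0), (0, 0, 0), (0, 0, 1), (0, 0, 1),
          (1, 1, 1), (1, 1, 1), (1, 1, 1), (1, 1, 1)]

print(haplotype_r2(sample, [0], 1))  # SNP 1 always matches SNP 0 -> 1.0
print(round(haplotype_r2(sample, [0], 2), 3))  # SNP 2 is only partly explained -> 0.333
```

For SNP 2 the total sum of squares is 1.5 and the within-group sum of squares is 1.0, so tagging with SNP 0 alone explains one third of its variation.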

Long range Linkage Disequilibrium

Minor allele frequency (MAF)
The frequency of the less common allele at a SNP site: the number of occurrences of that allele divided by the total number of chromosomes typed at the site.
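As a small worked example, the frequency can be computed by counting alleles across the chromosomes typed at one site. The function name and the ten-chromosome sample are hypothetical, and the sketch assumes a biallelic SNP.

```python
from collections import Counter

def minor_allele_frequency(alleles):
    """Frequency of the less common allele among the chromosomes typed at one SNP site."""
    counts = Counter(alleles)
    if not alleles or len(counts) > 2:
        raise ValueError("expected a non-empty list of alleles at a biallelic site")
    if len(counts) == 1:
        return 0.0  # only one allele observed in this sample
    return min(counts.values()) / len(alleles)

# Ten chromosomes typed at one site: eight carry C, two carry T.
typed = list("CCCCTCCCTC")
print(minor_allele_frequency(typed))  # -> 0.2
```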

Performance of tags selected from the full data set

Performance of tags selected from the reduced data set (SNPs with MAF < 5% excluded)

The effect of initial genotyping density on tag performance

Cosmopolitan tSNP set suitable for both European and Japanese populations

Performance of selected tSNPs in representing candidate functional variation

Performance of selected tSNPs in a different population sample

Conclusions
– The effect of MAF on tSNP performance depends heavily on the size of the LD sample.
– Comprehensive tagging will require a high genotyping density (one SNP of MAF ≥ 5% per 2.5 kb).
– The performance of population-specific tSNP sets in predicting functional variants is similar to that of random SNPs.
– Haplotype r²-based tSNP selection is highly effective even when applied to a population different from the LD sample.
– Rare variants are not well tagged.

References
Goldstein, D.B., Ahmadi, K.R., Weale, M.E. & Wood, N.W. Genome scans and candidate gene approaches in the study of common diseases and variable drug responses. Trends Genet. 19, 615-622 (2003).
Goldstein, D.B., Tate, S.K. & Sisodiya, S.M. Pharmacogenetics goes genomic. Nat. Rev. Genet. 4, 937-947 (2003).
Pritchard, J.K. & Przeworski, M. Linkage disequilibrium in humans: models and data. Am. J. Hum. Genet. 69, 1-14 (2001).
Carlson, C.S. et al. Selecting a maximally informative set of single-nucleotide polymorphisms for association analyses using linkage disequilibrium. Am. J. Hum. Genet. 74, 106-120 (2004).