The medical relevance of genome variability Gabor T. Marth, D.Sc. Department of Biology, Boston College

Slides:



Advertisements
Similar presentations
Linkage and Genetic Mapping
Advertisements

Lecture 2 Strachan and Read Chapter 13
SNP Applications statwww.epfl.ch/davison/teaching/Microarrays/snp.ppt.
Genetic Analysis in Human Disease
Understanding GWAS Chip Design – Linkage Disequilibrium and HapMap Peter Castaldi January 29, 2013.
Association Mapping David Evans. Outline Definitions / Terminology What is (genetic) association? How do we test for association? When to use association.
MALD Mapping by Admixture Linkage Disequilibrium.
CS177 Lecture 9 SNPs and Human Genetic Variation Tom Madej
Dr. Almut Nebel Dept. of Human Genetics University of the Witwatersrand Johannesburg South Africa Significance of SNPs for human disease.
Biology and Bioinformatics Gabor T. Marth Department of Biology, Boston College BI820 – Seminar in Quantitative and Computational Problems.
A coalescent computational platform for tagging marker selection for clinical studies Gabor T. Marth Department of Biology, Boston College
Computational Tools for Finding and Interpreting Genetic Variations Gabor T. Marth Department of Biology, Boston College
A coalescent computational platform to predict strength of association for clinical samples Gabor T. Marth Department of Biology, Boston College
Positional Cloning LOD Sib pairs Chromosome Region Association Study Genetics Genomics Physical Mapping/ Sequencing Candidate Gene Selection/ Polymorphism.
Evolutionary Genome Biology Gabor T. Marth, D.Sc. Department of Biology, Boston College Medical Genomics Course – Debrecen, Hungary, May 2006.
Lecture X.X1. 2 The informatics of SNPs and Haplotypes Gabor T. Marth Department of Biology, Boston College
Human Genome Sequence and Variability Gabor T. Marth, D.Sc. Department of Biology, Boston College Medical Genomics Course – Debrecen, Hungary,
Polymorphisms – SNP, InDel, Transposon BMI/IBGP 730 Victor Jin, Ph.D. (Slides from Dr. Kun Huang) Department of Biomedical Informatics Ohio State University.
Selecting TagSNPs in Candidate Genes for Genetic Association Studies Shehnaz K. Hussain, PhD, ScM Assistant Professor Department of Epidemiology, UCLA.
Haplotype Discovery and Modeling. Identification of genes Identify the Phenotype MapClone.
Computational Molecular Biology Biochem 218 – BioMedical Informatics Simple Nucleotide.
Introduction Basic Genetic Mechanisms Eukaryotic Gene Regulation The Human Genome Project Test 1 Genome I - Genes Genome II – Repetitive DNA Genome III.
Genetic Analysis in Human Disease. Learning Objectives Describe the differences between a linkage analysis and an association analysis Identify potentially.
Epigenome 1. 2 Background: GWAS Genome-Wide Association Studies 3.
Introduction to BST775: Statistical Methods for Genetic Analysis I Course master: Degui Zhi, Ph.D. Assistant professor Section on Statistical Genetics.
Computational research for medical discovery at Boston College Biology Gabor T. Marth Boston College Department of Biology
Software tools for the analysis of medically important sequence variations Gabor T. Marth, D.Sc. Boston College Department of Biology
Medical variations Gabor T. Marth Boston College Biology Department BI543 Fall 2013 February 5, 2013.
Doug Brutlag 2011 Genomics & Medicine Doug Brutlag Professor Emeritus of Biochemistry &
SNPs Daniel Fernandez Alejandro Quiroz Zárate. A SNP is defined as a single base change in a DNA sequence that occurs in a significant proportion (more.
National Taiwan University Department of Computer Science and Information Engineering Haplotype Inference Yao-Ting Huang Kun-Mao Chao.
The Complexities of Data Analysis in Human Genetics Marylyn DeRiggi Ritchie, Ph.D. Center for Human Genetics Research Vanderbilt University Nashville,
A single-nucleotide polymorphism tagging set for human drug metabolism and transport Kourosh R Ahmadi, Mike E Weale, Zhengyu Y Xue, Nicole Soranzo, David.
The medical relevance of genome variability Gabor T. Marth, D.Sc. Department of Biology, Boston College Medical Genomics Course – Debrecen,
Conservation of genomic segments (haplotypes): The “HapMap” n In populations, it appears the the linear order of alleles (“haplotype”) is conserved in.
Biology 101 DNA: elegant simplicity A molecule consisting of two strands that wrap around each other to form a “twisted ladder” shape, with the.
CS177 Lecture 10 SNPs and Human Genetic Variation
Gene Hunting: Linkage and Association
Genome-Wide Association Study (GWAS)
A coalescent computational platform to predict strength of association for clinical samples Gabor T. Marth Department of Biology, Boston College
National Taiwan University Department of Computer Science and Information Engineering Pattern Identification in a Haplotype Block * Kun-Mao Chao Department.
Complement Factor H Polymorphism in Age- Related Macular Degeneration* *Klein RJ, et al. Science. 2005; 308:
Finnish Genome Center Monday, 16 November Genotyping & Haplotyping.
Polymorphism Haixu Tang School of Informatics. Genome variations underlie phenotypic differences cause inherited diseases.
Julia N. Chapman, Alia Kamal, Archith Ramkumar, Owen L. Astrachan Duke University, Genome Revolution Focus, Department of Computer Science Sources
Chapter 5 The Content of the Genome 5.1 Introduction genome – The complete set of sequences in the genetic material of an organism. –It includes the.
MEME homework: probability of finding GAGTCA at a given position in the yeast genome, based on a background model of A = 0.3, T = 0.3, G = 0.2, C = 0.2.
Linear Reduction Method for Tag SNPs Selection Jingwu He Alex Zelikovsky.
SNPs, Haplotypes, Disease Associations Algorithmic Foundations of Computational Biology II Course 1 Prof. Sorin Istrail.
February 20, 2002 UD, Newark, DE SNPs, Haplotypes, Alleles.
The International Consortium. The International HapMap Project.
Linkage Disequilibrium and Recent Studies of Haplotypes and SNPs
Computational Biology and Genomics at Boston College Biology Gabor T. Marth Department of Biology, Boston College
Evolutionary Genome Biology Gabor T. Marth, D.Sc. Department of Biology, Boston College
A coalescent computational platform to predict strength of association for clinical samples Gabor T. Marth Department of Biology, Boston College
1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.
Inferences on human demographic history using computational Population Genetic models Gabor T. Marth Department of Biology Boston College Chestnut Hill,
Single Nucleotide Polymorphisms (SNPs
Genomic Analysis: GWAS
Of Sea Urchins, Birds and Men
Introduction to bioinformatics lecture 11 SNP by Ms.Shumaila Azam
Power to detect QTL Association
Linking Genetic Variation to Important Phenotypes
BI820 – Seminar in Quantitative and Computational Problems in Genomics
Medical genomics BI420 Department of Biology, Boston College
Medical genomics BI420 Department of Biology, Boston College
Haplotypes When the presence of two or more polymorphisms on a single chromosome is statistically correlated in a population, this is a haplotype Example.
Research for medical discovery at the Computational Genomics Laboratory at Boston College Biology Gabor T. Marth Department of Biology, Boston College.
SNPs and CNPs By: David Wendel.
Presentation transcript:

The medical relevance of genome variability Gabor T. Marth, D.Sc. Department of Biology, Boston College

Lecture overview 1. Phenotypic effects caused by known genetic variants 2. Genetic mapping to find genetic variants that cause diseases – linkage analysis and association studies 3. Genome-wide association mapping resources – the HapMap 4. Structural and epigenetic variations in disease

1. Phenotypic effects caused by known genetic variants

Many SNPs do have phenotypic effects Badano and Katsanis, NRG 2002 some notable genetic diseases: cystic fibrosis cycle-cell anemia

Genetic variants in Pharmacogenetics Evans and Rellig, Science 1999

Genetic variants in Pharmacogenetics Evans and Rellig, Science 1999

Using genotype information in the drug development pipeline Roses. NRG 2004

Are all genetic variants functional? ~ 10 million known SNPs SNPs, on the scale of the genome, can be described well with the “neutral theory” of sequence variations the vast majority of SNPs likely to have no functional effects How do we find the few functional variants in the background of millions of non-functional SNPs?

2. Genetic mapping to find genetic variants that cause diseases – linkage analysis and association studies

Genetic mapping

Allelic association (linkage) allelic association is the non- random assortment between alleles i.e. it measures how well knowledge of the allele state at one site permits prediction at another marker site functional site significant allelic association between a marker and a functional site permits localization (mapping) even without having the functional site in our collection allelic association, and the use of genetic markers is the basis for mapping functional alleles

Mendelian diseases have simple inheritance genotype inheritance genotype + phenotype inheritance

Linkage analysis compares the transmission of marker genotype and phenotype in families

Complex disease – complex inheritance Badano and Katsanis, NRG 2002

Allele frequency and relative risk Brinkman et al. Nature Reviews Genetics advance online publication; published online 14 March 2006 | doi: /nrg1828

Association study strategies region(s) interrogated: single gene, list of candidate genes (“candidate gene study”), or entire genome (“genome scan”) direct or indirect: causative variant marker that is co-inherited with causative variant single-SNP marker or multi- SNP haplotype marker single-stage or multi-stage

Association study strategies 2. LD-driven – based entirely on the reduction of redundancy presented by the linkage disequilibrium (LD) between SNPs; tags represent other SNPs they are correlated with 1. hypothesis driven (i.e. based on gene function) causative variant for economy, one cannot genotype every SNP in thousands of clinical samples: marker selection is the process where a subset of all available SNPs is chosen

Marker selection depends on genome LD Daly et al. NG 2001

Case-control association testing searching for markers with “significant” marker allele frequency differences between cases and controls; these marker signify regions of possible causative alleles AF(cases) AF(controls) clinical cases clinical controls genotyping cases and controls at various polymorphisms

3. Genome-wide association mapping resources – the HapMap

The HapMap resource goal: to map out human allele and association structure of at the kilobase scale deliverables: a set of physical and informational reagents

LD structure in four human populations International HapMap Consortium, Nature 2005

LD varies across samples African reference (YRI) there are large differences in LD between different human populations… European reference (CEU) … and even between samples from the same population. Other European samples

Sample-to-sample LD differences make tagSNP selection problematic groups of SNPs that are in LD in the HapMap reference samples may not be in a future set of clinical samples… … and tags that were selected based on LD in the HapMap may no longer work (i.e. represent the SNPs they were supposed to) in the clinical samples… … possibly resulting in missed disease associations.

Marker selection with additional samples test if markers selected from the HapMap continue to “tag” other SNPs in their original LD group

Representative computational samples

Two methods of computational sample generation “HapMap” “cases” “controls” HapMap Method 1. “Data-relevant Coalescent”. This algorithm uses a population genetic model to connect mutations in the HapMap reference to mutations in future clinical samples. Full model but computationally slow. Method 2. The PAC method (product of approximate conditionals, Li & Stephens). This method constructs “new” samples as mosaics of existing haplotypes, mimicking the effects of recombination. An approximation but fast.

LD difference -- comparison to extra experimental genotypes / / / we have analyzed two extra genotype sets collected at the HapMap SNPs in three genome regions, from our clinical collaborators (Prof. Thomas Hudson, McGill; Prof. Stanley Nelson, UCLA)

Genome-wide scans for human diseases Klein et al, Science 2005 SNPs in Complement Factor H (CFH) gene are associated with Age-related Macular Degeneration (AMD)

4. Somatic, structural and epigenetic variants in disease

Somatic mutations © Brian Stavely, Memorial University of Newfoundland the detection of somatic mutations, and their distinction from inherited polymorphism, is important to separate pre-disposing variants from mutations that occur during disease progression e.g. in cancer 1. detect the mutations 2. classify whether somatic or inherited

Detecting somatic mutations with comparative data based on comparison of cancer and normal tissue from the same individual often cancer tissue is highly heterogeneous and the somatic mutant allele may represent at low allele frequency

Detecting somatic mutations with subtraction if normal tissue samples are not available, we detect SNPs in cancer tissue against e.g. the human genome reference sequence subtract apparent mutations that are present in sequence variation databases search for evidence that these mutations are genetic

Detecting somatic mutations in murine mtDNA we have applied our methods for somatic mutation detection in murine mitochondrial sequences heteroplasmyhomoplasmy we will be applying our methods for human nuclear DNA from our collaborators

Structural variants in disease Feuk et al. Nature Reviews Genetics 7, 85–97 (February 2006) | doi: /nrg1767

Structural variations and phenotype Feuk et al. Nature Reviews Genetics 7, 85–97 (February 2006) | doi: /nrg1767

Epigenetics and cancer Baylin at al. NRC 2006.

Informatics of detection / integration of varied genetic and epigenetic data chromatin structure gene expression profiles copy number changes methylation profiles chromosome rearrangements repeat expansions somatic mutations