Human non-synonymous SNP: molecular function, evolution and disease Shamil Sunyaev Genetics Division, Brigham & Women’s Hospital Harvard Medical School.

Slides:



Advertisements
Similar presentations
Lecture 41 Prof Duncan Shaw. Genetic Variation Already know that genes have different alleles - how do these arise? Process of mutation - an alteration/change.
Advertisements

CZ5225 Methods in Computational Biology Lecture 9: Pharmacogenetics and individual variation of drug response CZ5225 Methods in Computational Biology.
Genome-wide Association Study Focus on association between SNPs and traits Tendency – Larger and larger sample size – Use of more narrowly defined phenotypes(blood.
Chapter 19 Evolutionary Genetics 18 and 20 April, 2004
Discovery of a rare arboreal forest-dwelling flying reptile (Pterosauria, Pterodactyloidea) from China Wang et al. PNAS Feb. 11, 2008.
Plant of the day! Pebble plants, Lithops, dwarf xerophytes Aizoaceae
Copyright © 2013 Pearson Education, Inc. All rights reserved. Exploring Biological Anthropology: The Essentials, 3 rd Edition CRAIG STANFORD JOHN S. ALLEN.
Signatures of Selection
CS177 Lecture 9 SNPs and Human Genetic Variation Tom Madej
14 Molecular Evolution and Population Genetics
Biology and Bioinformatics Gabor T. Marth Department of Biology, Boston College BI820 – Seminar in Quantitative and Computational Problems.
Computational Tools for Finding and Interpreting Genetic Variations Gabor T. Marth Department of Biology, Boston College
Molecular Evolution with an emphasis on substitution rates Gavin JD Smith State Key Laboratory of Emerging Infectious Diseases & Department of Microbiology.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS Molecular Evolution Course #27615 Anders Gorm Pedersen Molecular Evolution Group Center for Biological Sequence.
Evolutionary Genome Biology Gabor T. Marth, D.Sc. Department of Biology, Boston College Medical Genomics Course – Debrecen, Hungary, May 2006.
Scott Williamson and Carlos Bustamante
Molecular Evolution, Part 2 Everything you didn’t want to know… and more! Everything you didn’t want to know… and more!
Something related to genetics? Dr. Lars Eijssen. Bioinformatics to understand studies in genomics – São Paulo – June Image:
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS Molecular Evolution Course #27615 Anders Gorm Pedersen Molecular Evolution Group Center for Biological Sequence.
Genetica per Scienze Naturali a.a prof S. Presciuttini Mutation Rates Ultimately, the source of genetic variation observed among individuals in.
Office hours Wednesday 3-4pm 304A Stanley Hall Review session 5pm Thursday, Dec. 11 GPB100.
Genome Variations & GWAS
Modes of selection on quantitative traits. Directional selection The population responds to selection when the mean value changes in one direction Here,
- any detectable change in DNA sequence eg. errors in DNA replication/repair - inherited ones of interest in evolutionary studies Deleterious - will be.
Introduction to BST775: Statistical Methods for Genetic Analysis I Course master: Degui Zhi, Ph.D. Assistant professor Section on Statistical Genetics.
Computational research for medical discovery at Boston College Biology Gabor T. Marth Boston College Department of Biology
14 Population Genetics and Evolution. Population Genetics Population genetics involves the application of genetic principles to entire populations of.
Quantitative genetics: traits controlled by alleles at many loci
Chapter 5 Characterizing Genetic Diversity: Quantitative Variation Quantitative (metric or polygenic) characters of Most concern to conservation biology.
Doug Brutlag 2011 Genomics & Medicine Doug Brutlag Professor Emeritus of Biochemistry &
Evolution Chapters Evolution is both Factual and the basis of broader theory What does this mean? What are some factual examples of evolution?
SNPs Daniel Fernandez Alejandro Quiroz Zárate. A SNP is defined as a single base change in a DNA sequence that occurs in a significant proportion (more.
The Complexities of Data Analysis in Human Genetics Marylyn DeRiggi Ritchie, Ph.D. Center for Human Genetics Research Vanderbilt University Nashville,
SNP molecular function, evolution and disease Md Imtiyaz Hassan, Ph.D.
Allele. Alternate form of a gene gene variant autosome.
1 Association Analysis of Rare Genetic Variants Qunyuan Zhang Division of Statistical Genomics Course M Computational Statistical Genetics.
What host factors are at play? Paul de Bakker Division of Genetics, Brigham and Women’s Hospital Broad Institute of MIT and Harvard
Construction of Substitution Matrices
Lecture 6. Functional Genomics: DNA microarrays and re-sequencing individual genomes by hybridization.
MEME homework: probability of finding GAGTCA at a given position in the yeast genome, based on a background model of A = 0.3, T = 0.3, G = 0.2, C = 0.2.
Genome wide association studies (A Brief Start)
NEW TOPIC: MOLECULAR EVOLUTION.
The Future of Genetics Research Lesson 7. Human Genome Project 13 year project to sequence human genome and other species (fruit fly, mice yeast, nematodes,
Evolution at the Molecular Level. Outline Evolution of genomes Evolution of genomes Review of various types and effects of mutations Review of various.
Computational Biology and Genomics at Boston College Biology Gabor T. Marth Department of Biology, Boston College
Chapter 23: Evaluation of the Strength of Forensic DNA Profiling Results.
Evolutionary Genome Biology Gabor T. Marth, D.Sc. Department of Biology, Boston College
Please feel free to chat amongst yourselves until we begin at the top of the hour. 1.
Single Nucleotide Polymorphisms (SNPs) By Amira Jhelum Rahul Shweta.
In populations of finite size, sampling of gametes from the gene pool can cause evolution. Incorporating Genetic Drift.
Human survivorship Developed Developing Bob May (2007), TREE 22:
Genome-Wides Association Studies (GWAS) Veryan Codd.
1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.
Inferences on human demographic history using computational Population Genetic models Gabor T. Marth Department of Biology Boston College Chestnut Hill,
Substitution Matrices and Alignment Statistics BMI/CS 776 Mark Craven February 2002.
Single Nucleotide Polymorphisms (SNPs
SNPs and complex traits: where is the hidden heritability?
Nucleotide variation in the human genome
Genetic Linkage.
upstream vs. ORF binding and gene expression?
Natural Selection Garielle Wagnac.
Gene Hunting: Design and statistics
Genetic Linkage.
Beyond GWAS Erik Fransen.
Common Single-Nucleotide Polymorphisms Act in Concert to Affect Plasma Levels of High-Density Lipoprotein Cholesterol  Victor Spirin, Steffen Schmidt,
Chapter 7 Multifactorial Traits
Genetic Drift, followed by selection can cause linkage disequilibrium
Genetic Linkage.
INTRODUCTION TO MOLECULAR GENETICS
INTRODUCTION TO MOLECULAR GENETICS
Presentation transcript:

Human non-synonymous SNP: molecular function, evolution and disease Shamil Sunyaev Genetics Division, Brigham & Women’s Hospital Harvard Medical School Harvard-M.I.T. Division of HST

Effect on molecular function Phenotype Natural selection Medical Genetics Structural Biology Biochemistry Evolutionary Genetics

Predicting the effect of mutations in proteins

Why is this useful? n Understanding variation in molecular function and structure n Evolutionary genetics: comparison of polymorphism and divergence rates between different functional categories is a robust way to detect selection

Linkage analysis Rare

Classical association studies ControlDisease Common

Why is this useful? n Rare human developmental disorders / mouse mutagenesis screens: linkage studies are impossible n Genetics of complex disease: SNP prioritization n Genetics of complex disease: Rare variants

Technically, polymorphism should not exist!

Quantitative trait Mendelists Biometricians Forces to maintain variation: Selection Mutation

Common disease / Common variant Trade off (antagonistic pleiotropy) Balancing selection Recent positive selection Reverse in direction of selection Examples APOEAlzheimer’s disease AGTHypertension CYP3AHypertension CAPN10Type 2 diabetes

Individual human genome is a target for deleterious mutations ! ~40% of human Mendelian diseases are due to hypermutable sites Frequency of deleterious variants is directly proportional to mutation rate (q=  /s)

Multiple mostly rare variants Many deleterious alleles in mutation-selection balance Examples Plasma level of HDL-C Plasma level of LDL-C Colorectal adenomas

What about late onset phenotypes?

Harmful mutations n Function: damaging n Evolution: deleterious n Phenotype: detrimental n Advantageous pseudogenization (Zhang et al. 2006) n Gain of function disease mutations n Sickle Cell Anemia

protein multiple alignment profile

PolyPhen

Prediction rate of damaging substitutions possibly probably Disease mutations Divergence 82%57% 9% 3% Polymorphism 27% 15%

10% of PolyPhen false-positives are due to compensatory substitutions

Polyphen Phylogenetic measures PAM * * BLOSUM * * BLOSUM * * BLOSUM * * Site-specific structural/phylogenetic measures * * * Estimate of selection coefficient Williamson et al., PNAS 2005

de novo mutation effect spectrum NO DELETERIOUS POLYMORPHISM LOTS OF DELETERIOUS POLYMORPHISM Effect of new mutation may range from lethal, to neutral, to slightly beneficial

Mutation effect spectrum NO DELETERIOUS POLYMORPHISM LOTS OF DELETERIOUS POLYMORPHISM ?

Neutral mutation model Human ACCTTGCAAAT Chimpanzee ACCTTACAAAT Baboon ACCTTACAAAT Prob(TAC->TGC)  Prob(TGC->TAC) Prob(XY 1 Z->XY 2 Z) 64x3 matrix

Strongly detrimental mutations

Effectively neutral mutations

Mildly deleterious mutations

54 genes, 757 individuals inflammatory response 236 genes, individuals DNA repair and cell cycle pathways 518 genes, individuals

The majority of missense mutations observed at frequency below 1% are deleterious Frequency itself is a reliable predictor of function!

Fitness and selection coefficient Wild typeNew mutation N 1 = 4 N 2 = 3 Fitness 1 N1N1 N2N2 = 1 – s Selection coefficient

Mildly deleterious mutations 54 genes, 757 individuals inflammatory response 236 genes, individuals DNA repair and cell cycle pathways 518 genes, individuals

Fraction of detectable polymorphism

Human effective population size present past Estimation of selection coefficient - simulation

Human effective population size present past -log(s) F singl (s)F MAF>25% (s) Selection coefficient SNP probability to be observed

Classical association studies ControlDisease Common

“Mutation enrichment” association studies ControlDisease Rare

ControlDisease “Mutation enrichment” association studies

Rare missense variants in NPC1L1 gene contributes to variability in cholesterol absorption and plasma levels of low-density lipoproteins (LDLs) Cohen J et al., PNAS 2006 in press Nonsynonymous sequence variants in ABCA1 gene were significantly more common in individuals with low HDL-C ( 95th percentile). Cohen J et al., Science 2004 Multiple rare variants in different genes account for multifactorial inherited susceptibility to colorectal adenomas Fearnhead NS et al., PNAS 2004 “Mutation enrichment” association studies

Cholesterol

Adopted from Brewer et al., 2003

Effect of rare nsSNPs on HDL-C

What about common alleles of smaller effect? n Population of 3500 individuals with known plasma levels of HDL-C n Population includes both genders and three ethnic groups n 839 SNPs genotyped n Independent population of 800 individuals for validation

What about common alleles of smaller effect? n Introduce a linear model (ANCOVA) n Subsequently add SNPs to the linear model n Include SNPs based on the likelihood ratio test n Prioritizing SNPs based on conservation did not help

Effect of common SNPs on HDL-C HDL

And a different population … HDL

Acknowledgements The lab: Gregory Kryukov, Steffen Schmidt, Saurabh Asthana, Victor Spirin, Ivan Adzhubey Bioinformatics:Human genetics: Vasily RamenskyJonathan Cohen