Nebojsa Mirkovic, Carles Ferrer-Costa, Marc A. Marti-Renom, Alvaro N.A. Monteiro, Andrej Sali Functional Consequences of the SNPs : BRCA1, Membrane Transporters.

Slides:



Advertisements
Similar presentations
applications of genome sequencing projects
Advertisements

LS-SNP: Large-scale annotation of coding non- synonymous SNPs based on multiple information sources -Bioinformatics April 2005.
Single Nucleotide Polymorphism And Association Studies Stat 115 Dec 12, 2006.
SNP Applications statwww.epfl.ch/davison/teaching/Microarrays/snp.ppt.
Zhen Shi June 2, 2010 Journal Club. Introduction Most disease-causing mutations are thought to confer radical changes to proteins (Wang and Moult, 2001;
Genetic Screening for Alzheimer’s Disease Thorstensen: Genetic Screening for Alzheimer's Disease 1.
1 Single Nucleotide Polymorphisms (SNP) Gary Jones SPE, Technology Center 1600 (703)
Protein Structure Database Introduction Database of Comparative Protein Structure Models ModBase 生資所 g 詹濠先.
Outline to SNP bioinformatics lecture
Dr. Almut Nebel Dept. of Human Genetics University of the Witwatersrand Johannesburg South Africa Significance of SNPs for human disease.
Predicting the Function of Single Nucleotide Polymorphisms Corey Harada Advisor: Eleazar Eskin.
Biology and Bioinformatics Gabor T. Marth Department of Biology, Boston College BI820 – Seminar in Quantitative and Computational Problems.
Human non-synonymous SNP: molecular function, evolution and disease Shamil Sunyaev Genetics Division, Brigham & Women’s Hospital Harvard Medical School.
Bioinformatics Ch1. Introduction (continue-2) 阮雪芬 Nov7, 2002 NTUST
FINAL EXAM: TAKE-HOME Assessment of Significance in Cancer Gene SNPs.
Genome Browsers Ensembl (EBI, UK) and UCSC (Santa Cruz, California)
1 Functional prediction in proteins (purifying and positive selection)
Something related to genetics? Dr. Lars Eijssen. Bioinformatics to understand studies in genomics – São Paulo – June Image:
PolyPhen and SIFT: Tools for predicting functional effects of SNPs Epi 244 Spring 2009 Sam S. Oh.
SNP Resources: Finding SNPs Databases and Data Extraction Mark J. Rieder, PhD SeattleSNPs Variation Workshop March 20-21, 2006.
Georgia Wiesner, MD CREC June 20, GATACAATGCATCATATG TATCAGATGCAATATATC ATTGTATCATGTATCATG TATCATGTATCATGTATC ATGTATCATGTCTCCAGA TGCTATGGATCTTATGTA.
Epigenome 1. 2 Background: GWAS Genome-Wide Association Studies 3.
Friday 17 rd December 2004Stuart Young Capstone Project Presentation Predicting Deleterious Mutations Young SP, Radivojac P, Mooney SD.
Doug Brutlag 2011 Genomics & Medicine Doug Brutlag Professor Emeritus of Biochemistry &
Identification and evaluation of causative genetic variants corresponding to a certain phenotype Xidan Li.
CS177 Lecture 10 SNPs and Human Genetic Variation
SNP Haplotypes as Diagnostic Markers Shrish Tiwari CCMB, Hyderabad.
Development and Application of SNP markers in Genome of shrimp (Fenneropenaeus chinensis) Jianyong Zhang Marine Biology.
Quantitative Genetics
What is a SNP?. Lecture topics What is a SNP? What use are they? SNP discovery SNP genotyping Introduction to Linkage Disequilibrium.
Supplemental Table 1. Primers used in Ion Torrent sequencing of HPV. Bar codes id_strSequence IonXpress_1CTAAGGTAAC IonXpress_2TAAGGAGAAC IonXpress_3AAGAGGATTC.
1 DNA Polymorphisms: DNA markers a useful tool in biotechnology Any section of DNA that varies among individuals in a population, “many forms”. Examples.
Lecture 6. Functional Genomics: DNA microarrays and re-sequencing individual genomes by hybridization.
MEME homework: probability of finding GAGTCA at a given position in the yeast genome, based on a background model of A = 0.3, T = 0.3, G = 0.2, C = 0.2.
Epidemiology 217 Molecular and Genetic Epidemiology Bioinformatics & Proteomics John Witte.
Introduction Hereditary predisposition (mutations in BRCA1 and BRCA2 genes) contribute to familial breast cancers. Eighty percent of the.
The International Consortium. The International HapMap Project.
In The Name of GOD Genetic Polymorphism M.Dianatpour MLD,PHD.
Single nucleotide polymorphisms and Large scale variation
Motif Search and RNA Structure Prediction Lesson 9.
Slides Presented During Q & A. Expected Events in Untreated Patients Over 5 Years a Taiwan Observational Cohort Study and Safety Cohort b Chen CJ et al.
Computational Biology and Genomics at Boston College Biology Gabor T. Marth Department of Biology, Boston College
Figure S1 Figure S1. Phylogenetic tree of LexA binding sites in cyanobacteria, B.subtilis,  - proteobacteria and E.coli. Binding sites of cyanobacteria.
Human survivorship Developed Developing Bob May (2007), TREE 22:
A high-resolution map of human evolutionary constraints using 29 mammals Kerstin Lindblad-Toh et al Presentation by Robert Lewis and Kaylee Wells.
1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.
Inferences on human demographic history using computational Population Genetic models Gabor T. Marth Department of Biology Boston College Chestnut Hill,
Human survivorship Developed Developing Bob May (2007), TREE 22:
Single Nucleotide Polymorphisms (SNPs
Genomic Analysis: GWAS
Nucleotide variation in the human genome
Neutrality Test First suggested by Kimura (1968) and King and Jukes (1969) Shift to using neutrality as a null hypothesis in positive selection and selection.
Introduction to bioinformatics lecture 11 SNP by Ms.Shumaila Azam
Gene Hunting: Design and statistics
Diverse Functional Properties of Wilson Disease ATP7B Variants
Polymorphisms GWAS traits.
By Michael Fraczek and Caden Boyer
VWF sequence variants: innocent until proven guilty
Polymorphisms GWAS traits.
By Stitziel, Tseng, Pervouchine, Goddeau, Kasif, Liang
The Genetic of Earwax Wet earwax is a dominant allele!
Medical genomics BI420 Department of Biology, Boston College
Pan-Cancer Analysis of Mutation Hotspots in Protein Domains
(A) Schematic of the TGGT1_254250/TgPRELID gene, mRNA, and predicted protein. (A) Schematic of the TGGT1_254250/TgPRELID gene, mRNA, and predicted protein.
Systems-wide Identification of cis-Regulatory Elements in Proteins
Medical genomics BI420 Department of Biology, Boston College
Structure, Exchange Determinants, and Family-Wide Rab Specificity of the Tandem Helical Bundle and Vps9 Domains of Rabex-5  Anna Delprato, Eric Merithew,
Genetics of Langerhance Cell Histiocytosis
Guilty as charged Cancer Cell
Conserved motifs in the ABC
Presentation transcript:

Nebojsa Mirkovic, Carles Ferrer-Costa, Marc A. Marti-Renom, Alvaro N.A. Monteiro, Andrej Sali Functional Consequences of the SNPs : BRCA1, Membrane Transporters and More

Single Nucleotide Polymorphisms (SNPs) single base pair replacements of appreciable allelic frequency in the population ( >1% ); predominant form of human genetic variation (90%); number predicted to range from one to several million per genome; predicted frequency: 1/1000bp; predicted number per gene: 4-12 on average (limited datasets); classification by genomic location: ncSNPs, cSNPs (synonymous, non-synonymous: 24,000-40,000 per genome).

identify gene(s) that underlie numerous genetic disorders and multifactorial traits (eg, pharmacological response); SNPs are probably the biggest class of pathogenic changes in the human genome (coding and regulatory regions); several genetic variants connected to genetic disorders ( E4 allele of APOE with Alzheimer disease, FV Leiden allele with deep-venuos thrombosis, and CCR532 with resistance to HIV infection ); numerous indirect evidence of functional impact; markers of choice in genetic studies. Significance of SNP Analysis

Br 32 Br 42 * Br Identification of Sequence Variants in Genes of Interest

Project Goal What is the likelihood that a given SNP destroys the function of a protein? Approach design a classification function that can predict functional impact of a particular SNP and relies on a combination of sequence, structure and genetic properties from a well-characterized set of nsSNPs; build ModSNP, a structural database of SNPs, containing protein structure models for all known nsSNPs; apply the classification function to a number of specific examples and to all the SNPs in ModSNP.

BRCA1 Project

Human BRCA1 Tandem BRCT Repeats 200 aa RING NLS BRCT Globular regions Nonglobular regions BRCA1 BRCT repeats Williams, Green, Glover. Nat.Struct.Biol. 8, 838, 2001

Missense Mutations in BRCT Domains by Function transcription activation cancer associated V1665M D1692N G1706A D1733G M1775V P1806A M1652K L1657P E1660G H1686Q R1699Q K1702E Y1703H F1704S L1705P S1715N S1722F F1734L G1738E G1743R A1752P F1761I F1761S M1775E M1775K L1780P I1807S V1833E A1843T M1652T V1653M L1664P T1685A T1685I M1689R D1692Y F1695L V1696L R1699L G1706E W1718C W1718S T1720A W1730S F1734S E1735K V1736A G1738R D1739E D1739G D1739Y V1741G H1746N R1751P R1751Q R1758G L1764P I1766S P1771L T1773S P1776S D1778G D1778H D1778N M1783T A1823T V1833M W1837R W1837G S1841N A1843P T1852S P1856T P1859R C1787S G1788D G1788V G1803A V1804D V1808A V1809A V1809F V1810G Q1811R P1812S N1819S C1697R R1699W A1708E S1715R P1749R M1775R M1652I A1669S no transcription activation unknown not cancer associated unknown

Mapping of the Mutations on the Surface of BRCA1 BRCT Domains R1699Q/W G1743R K1702F L1657P D1692N D1773G E1660G

Location of Putative BRCT-Protein Interaction Site RMSMVVSGLTPEEFMLVYKFARKHHITLTNLITEETTHVVMKTDAEFVCERTLKYFLGIAGGKWVVSYF WVTQSIKERKMLNEHDFEVRGDVVNGRNHQGPKRARESQDRKIFRGLEICCYGPFTNMPTDQLEWMVQL CGASVVKELSSFTLGTGVHPIVVVQPDAWTEDNGFHAIGQMCEAPVVTREWVLDSVALYQCQELDTYLI PQIP

Mutation Features MutationMutation likelihood Phylo- genetic entropy Accessi bility Residue rigidity Neighor- hood rigidity Percent relative volume change Polarity change Percent relative ASA change Charge change M1652I 10.27B M1652K 0.27B V1653M 10.00B L1657P 0.00E E1660G 0.43E V1665M 10.00B Y1703H 00.00B

YES charge change + buriedness YES NO <30A 3 ≥60A 3 <90A 3 ≥90A 3 rigid (<-0.7) non-rigid ( ≥ -0.7) exposed buried residue rigidity volume change functional site or 1 class phylogenetic entropy polarity change 0 <0 - NO non 0 ≥0≥0 YES + - “Decision” Tree for Predicting Functional Impact of Genetic Variants NO 2 class <60A 3 ≥30A 3 neighborhood rigidity buriedness residue rigidity volume change charge change polarity change phylogenetic entropy other information (helix breaker, turn breaker) other information (helix breaker, turn breaker) + mutation likelihood volume change - residue rigidity volume change polarity change phylogenetic entropy other information (helix breaker, turn breaker) + mutation likelihood functional site buriedness START neighborhood rigidity neighborhood rigidity charge change

Rationalization of Functionally Characterized Mutants M1652I ++ Likely replacement, ‘I’ allowed in multiple sequence alignment. L1657P ?- Predicted binding site, large volume change in rigid neighborhood, unlikely replacement at completely preserved position. C1697R -- Two class polarity change, unlikely replacement at completely preserved position. S1715N ?- Not explained. Likely replacement. Prediction of Function of Uncharacterized Mutants M1652T - Large volume change in rigid neighborhood. V1653M + Likely replacement. G1738R - Very large volume change at flexible position, unlikely replacement at completely preserved position. T1773S + Likely replacement, predicted binding site.

SNP Web Server

Membrane Transporter Project

Chang, Roth. Science 293, 1793, 2001 Eco-msbA dimer, 1jsq Domain Organization and Structure of E.Coli ABC Transporter MsbA 1582 crystalized not crystalized A BC A Walker motif A ( ) B Walker motif B ( ) C ABC transporter signature ( ) transmembranenucleotide binding transmembrane intracellular extracellular

PMT Database a database of in-site detected and/or verified polymorphisms in various mammalian membrane transporters with ample population genetics data (allelic frequencies, ethnic distribution); 59 proteins divided in two groups (24 and 25 proteins respectively) depending on the SNP-detection approach; 10 ABC transporters, 5 with SNPs reported so far (BSEP, MDR1, MDR3, MRP1, MRP2).

Pieper et al., Nucl. Acids Res

ModPipe Results for Transmembrane Proteins from the PMT Database 49 proteins submitted; 11 proteins could not be modeled; 15 proteins have bad fold assessment; 8 proteins have good fold assessment, but bad model score; 15 proteins have good fold assessment and good model score. ProteinBSEPMDR1MRP2MDR3MRP1 Coverage6/98/133/182/70/5 Coverage of ABC Transporters

Modeled regions of MDR1 Predicted Domain Organization and Structure Model of Human ABC Transporter MDR modeled not modeled repeat 1repeat 2 intracellular, extracellular ATP-binding loop transmembrane helix ATP-binding cassette 1jj7A ( )1g291 ( )

MDR1 (ABC-B1) Numb. of models Seq. Id. [%] E-valueCoverage (aa) Mapped SNPs ABC ~2403 ABC ~2405 SNPASA Secondary Structure HydrophilicityVolume Blosum score Energy S400NHalfCoil L662RHalfCoil R669CExposedHelix P1051AHalfStrand W1108RHalfCoil S1141TExposedCoil V1251IHalfStrand T1256KHalfCoil Sequence and structure features of SNPs

Significance a comprehensive structural study of the functional impact of mutations focused on a particular protein family; a large and informative mutation set used; rationalization of the characterized mutation set used to develop a predictive tool; possible contribution to genetic studies (eg, candidate gene approach) and medical practice (with other methods).