Challenges and accomplishments in molecular prediction Yanay Ofran.

Slides:



Advertisements
Similar presentations
Functional Site Prediction Selects Correct Protein Models Vijayalakshmi Chelliah Division of Mathematical Biology National Institute.
Advertisements

1 Introduction to Sequence Analysis Utah State University – Spring 2012 STAT 5570: Statistical Bioinformatics Notes 6.1.
Protein structure prediction.. Protein folds. Fold definition: two folds are similar if they have a similar arrangement of SSEs (architecture) and connectivity.
Protein Structure Prediction using ROSETTA
Protein Tertiary Structure Prediction
Structural bioinformatics
Protein Threading Optimization Using Consensus Homology Modeling Maliha Sarwat ( ), Tasmin Tamanna Haque ( ) Department of Computer Science.
Homology 3D modeling and effect of mutations. X-ray crystallography (70,714 in PDB) need crystals Nuclear Magnetic Resonance (NMR) (9,312) proteins in.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modeling Anne Mølgaard, CBS, BioCentrum, DTU.
Chapter 9 Structure Prediction. Motivation Given a protein, can you predict molecular structure Want to avoid repeated x-ray crystallography, but want.
Protein Structure, Databases and Structural Alignment
Protein Fold recognition Morten Nielsen, Thomas Nordahl CBS, BioCentrum, DTU.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Protein Homology Modelling Thomas Blicher Center for Biological Sequence Analysis.
Thomas Blicher Center for Biological Sequence Analysis
Protein Fold recognition
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
Protein Tertiary Structure. Primary: amino acid linear sequence. Secondary:  -helices, β-sheets and loops. Tertiary: the 3D shape of the fully folded.
Molecular modelling / structure prediction (A computational approach to protein structure) Today: Why bother about proteins/prediction Concepts of molecular.
In double vision when drunk By Thomas Huber 23 November 2001 Alexandra Headland.
Protein Tertiary Structure Prediction Structural Bioinformatics.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modelling Thomas Blicher Center for Biological Sequence Analysis.
PolyPhen and SIFT: Tools for predicting functional effects of SNPs Epi 244 Spring 2009 Sam S. Oh.
Bioinformatics (3 lectures) Why bother about proteins/prediction What is bioinformatics Protein databases Making use of database information –Predictions.
Protein Tertiary Structure Prediction Structural Bioinformatics.
Protein structure prediction 29/01/2015 Mail: Prof. Neri Niccolai Simone Gardini
Computational Structure Prediction Kevin Drew BCH364C/391L Systems Biology/Bioinformatics 2/12/15.
Homology Modeling David Shiuan Department of Life Science and Institute of Biotechnology National Dong Hwa University.
Protein Tertiary Structure Prediction
Construyendo modelos 3D de proteinas ‘fold recognition / threading’
Forces and Prediction of Protein Structure Ming-Jing Hwang ( 黃明經 ) Institute of Biomedical Sciences Academia Sinica
Practical session 2b Introduction to 3D Modelling and threading 9:30am-10:00am 3D modeling and threading 10:00am-10:30am Analysis of mutations in MYH6.
COMPARATIVE or HOMOLOGY MODELING
Rising accuracy of protein secondary structure prediction Burkhard Rost
Representations of Molecular Structure: Bonds Only.
Lecture 12 CS5661 Structural Bioinformatics Motivation Concepts Structure Prediction Summary.
1 P9 Extra Discussion Slides. Sequence-Structure-Function Relationships Proteins of similar sequences fold into similar structures and perform similar.
Sequence Alignment Goal: line up two or more sequences An alignment of two amino acid sequences: …. Seq1: HKIYHLQSKVPTFVRMLAPEGALNIHEKAWNAYPYCRTVITN-EYMKEDFLIKIETWHKP.
Protein Folding Programs By Asım OKUR CSE 549 November 14, 2002.
Protein secondary structure Prediction Why 2 nd Structure prediction? The problem Seq: RPLQGLVLDTQLYGFPGAFDDWERFMRE Pred:CCCCCHHHHHCCCCEEEECCHHHHHHCC.
Applied Bioinformatics Week 12. Bioinformatics & Functional Proteomics How to classify proteins into functional classes? How to compare one proteome with.
Protein Tertiary Structure. Protein Data Bank (PDB) Contains all known 3D structural data of large biological molecules, mostly proteins and nucleic acids:
PREDICTION OF CATALYTIC RESIDUES IN PROTEINS USING MACHINE-LEARNING TECHNIQUES Natalia V. Petrova (Ph.D. Student, Georgetown University, Biochemistry Department),
JM - 1 Introduction to Bioinformatics: Lecture XI Computational Protein Structure Prediction Jarek Meller Jarek Meller Division.
Protein Structure Prediction ● Why ? ● Type of protein structure predictions – Sec Str. Pred – Homology Modelling – Fold Recognition – Ab Initio ● Secondary.
Protein Structure Prediction: Homology Modeling & Threading/Fold Recognition D. Mohanty NII, New Delhi.
Modelling protein tertiary structure Ram Samudrala University of Washington.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Programme Last week’s quiz results + Summary Fold recognition Break Exercise: Modelling remote homologues Summary.
Classification of protein and domain families Sequence to function Protein Family Resources and Protocols for Structural and Functional Annotation of Genome.
Motif Search and RNA Structure Prediction Lesson 9.
BMC Bioinformatics 2005, 6(Suppl 4):S3 Protein Structure Prediction not a trivial matter Strict relation between protein function and structure Gap between.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Homology 3D modeling and effect of mutations. X-ray crystallography (70,714 in PDB) need crystals Nuclear Magnetic Resonance (NMR) (9,312) proteins in.
Modelling proteomes: Application to understanding HIV disease progression Ram Samudrala Department of Microbiology University of Washington How does the.
PROTEIN INTERACTION NETWORK – INFERENCE TOOL DIVYA RAO CANDIDATE FOR MASTER OF SCIENCE IN BIOINFORMATICS ADVISOR: Dr. FILIPPO MENCZER CAPSTONE PROJECT.
Protein Tertiary Structure Prediction Structural Bioinformatics.
Forces and Prediction of Protein Structure Ming-Jing Hwang ( 黃明經 ) Institute of Biomedical Sciences Academia Sinica
We propose an accurate potential which combines useful features HP, HH and PP interactions among the amino acids Sequence based accessibility obtained.
Considerations for multi-omics data integration Michael Tress CNIO,
PROTEIN MODELLING Presented by Sadhana S.
Computational Structure Prediction
Protein Structure Prediction and Protein Homology modeling
Protein dynamics Folding/unfolding dynamics
Prediction of protein function from sequence analysis
Molecular Modeling By Rashmi Shrivastava Lecturer
Homology Modeling.
Protein structure prediction.
Programme Last week’s quiz results + Summary
Protein Homology Modelling
Additional file 3 >HWI-EAS344:7:70:153:1969#0/1 Length = 75 
Presentation transcript:

Challenges and accomplishments in molecular prediction Yanay Ofran

accumulation of data not knowledge >70 million (as of ) Off chart since 1997

Central dogma – it’s all in the sequence DNARNAProtein Function Structure

- Structure - Function - Interaction

Annotation transfer: structure Rost (1999) Protein Engineering 12: PDB similar structure dissimilar structure

Score = 83.2 bits (205), Expect = 9e-17 Identities = 18/101 (X%), Positives = 36/101 (35%), Gaps = 2/101 (1%) Query: 111 AAGGIAAKYLARKNSSVFGFIGCGTQAYFQLEALRRVFDIGEVKAYDVREKAAKKF 170 AA +A + L + +G G ++L + V + + A + Sbjct: 153 AAVELAERELGSLHDKTVLVVGAGEMGKTVAKSLVD-RGVRAVLVANRTYERAVEL 211 Query: 171 EDRGISASVQPAEEASRCDVLVTTTPSRKPVVKAEWVEEGT R DV+V+ T + PV+ + V E Sbjct: 212 GGEAVRFDE-LVDHLARSDVVVSATAAPHPVIHVDDVREAL % 150aa >P1 MEDLVSVGITHKEAEVEELEKARFESDEAVRDIVESFGLSGS VLLQTSNRVEVYASGARDRAEELGDLIHDDAWVKRGSEAVRH LFRVASGLESMMVGEQEILRQVKKAYDRAARLGTLDEALKIV FRRAINLGKRAREETRISEGAVSI >P2 METLILTQEEVESLISMDEAMNAVEEAFRLYALGKAQMPPKV YLEFEKGDLRAMPAHLMGYAGLKWVNSHPGNPDKGLPTVMAL MILNSPETGFPLAVMDATYTTSLRTGAAGGIAAKYL P1P2 Structure prediction by homology

Score = 33.9 bits (77), Expect = Identities = 14/58 (y%), Positives = 28/58 (48%), Gaps = 2/58 (3%) Query: 178 SVQPAEEASRCDVLVTTTPSRKPVVKAEWVEEGTHINAIGADGPGKQELD-VEILKKA EE ++ D+LV T + +VK EW++ G + G + ++ E ++A Sbjct: 198 TAHLDEEVNKGDILVVATGQPE-MVKGEWIKPGAIVIDCGINYKVVGDVAYDEAKERA 254 >P1 MEDLVSVGITHKEAEVEELEKARFESDEAVRDIVESFGLSGS VLLQTSNRVEVYASGARDRAEELGDLIHDDAWVKRGSEAVRH LFRVASGLESMMVGEQEILRQVKKAYDRAARLGTLDEALKIV FRRAINLGKRAREETRISEGAVSI >P2 MLELLPTAVEGVSQAQITGRPEWIWLALGTALMGLGTLYFLV KGMGVSDPDAKKFYAITTLVPAIAFTMYLSMLLGYGLTMVPF GGEQNPIYWARYADWLFTTPLLLLDLALLVDADQGTILALVG ADGIMIGTGLVGALTK P1 40% 50aa P2 Structure prediction by homology

Structure prediction from sequence Liu & Rost (2002) Bioinformatics 18:

Rost et al. (2003) CMLS 60: Annotation Transfer %seq id.Hssp val E val Annotation transfer: Function

Annotation transfer: interaction Mika et al. (2006) PLoS CB Protein A and protein B bind each other. Do A’ and B’, their respective homologues, interact as well?

Interaction sites by homology

Annotation Transfer Limit of annotation transfer Seq ID 100% 0% structureFunctioninteraction Blind annotation transfer Ab initio

Annotation Transfer Template and model

>5(5; 2)(0.5;2)(0.5; 0.5-)(-2;-0.5)(2-; 5-)<5-

Combine ( I-TASSER, by Zhang )

Annotation Transfer Limit of annotation transfer

Annotation Transfer Some methods can do it

Local vs. non local interaction Levinthal “Paradox”: A protein with 100 amino acid has ~10 48 possible conformations => calculation unfeasible. Let’s assumes (generously): A protein can sample structures per second. It would take this protein about seconds ~ years to try out all the possible conformations. (Time since the big bang ~10 10 years).

Local vs. non local interaction

Annotation Transfer Low RMSD Witsow & Piatigorsky (1999) Science

Annotation Transfer High RMSD Subtilin (5sic) Chymotrypsin (5cha)

Annotation Transfer Challenges for next CASP i.modeling the structure of single-residue mutants. ii.modeling structure changes associated with specificity changes within protein families. iii.devise scoring functions that will reliably pick the most accurate models from a set of candidate structures produced by current new fold methods.

Annotation Transfer Olympic games of predictions Structure – CASP Interaction – CAPRI Function – AFP, CASP

Combine interaction + seq. analysis to predict function Li et al (2005) Nature biotech

Combine interaction + seq. analysis to predict function Li et al (2005) Nature biotech

- Structure - Function - Interaction

Predicting DNA binding sites Ofran et al (2007) in press

Predicting DNA binding sites Ofran et al (2007) in press

c-Myb + C/EBPβ bound to DNA

Identifying novel DNA binding proteins accuracy