Applications of Homology Modeling Hanka Venselaar.

Slides:



Advertisements
Similar presentations
Applications of Homology Modeling
Advertisements

1 Introduction to Sequence Analysis Utah State University – Spring 2012 STAT 5570: Statistical Bioinformatics Notes 6.1.
Protein Threading Zhanggroup Overview Background protein structure protein folding and designability Protein threading Current limitations.
Applications of Homology Modeling Hanka Venselaar.
Prediction to Protein Structure Fall 2005 CSC 487/687 Computing for Bioinformatics.
Structural bioinformatics
The CMBI: Bioinformatics Content  Bioinformatics   Bioinformatics tools & databases Hanka Venselaar CMBI UMC Radboud February 2009.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modeling Anne Mølgaard, CBS, BioCentrum, DTU.
Tertiary protein structure viewing and prediction July 1, 2009 Learning objectives- Learn how to manipulate protein structures with Deep View software.
CISC667, F05, Lec21, Liao1 CISC 467/667 Intro to Bioinformatics (Fall 2005) Protein Structure Prediction 3-Dimensional Structure.
Tertiary protein structure viewing and prediction July 5, 2006 Learning objectives- Learn how to manipulate protein structures with Deep View software.
The 7 steps of Homology modeling. 1: Template recognition and initial alignment.
Homology modelling ? X-ray ? NMR ?. Homology Modelling !
Thomas Blicher Center for Biological Sequence Analysis
©CMBI 2002 Homology modelling ? X-ray ? NMR ? Intro Proteins Modelling 8 Steps Detect Threading Alignment Template Side chain Indels Optimize Validate.
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
Tertiary protein structure modelling May 31, 2005 Graded papers will handed back Thursday Quiz#4 today Learning objectives- Continue to learn how to manipulate.
©CMBI 2005 Why align sequences? Lots of sequences with unknown structure and function. A few sequences with known structure and function If they align,
1 Protein Structure Prediction Reporter: Chia-Chang Wang Date: April 1, 2005.
Protein: Linear chain of amino acids called residues (4 in this toy protein) Ser Trp Leu O N N N N O O C C C C O O CαCα CαCα CαCα CαCα Lys H H H H H The.
Applications of Homology Modeling Hanka Venselaar.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modelling Thomas Blicher Center for Biological Sequence Analysis.
Computing for Bioinformatics Lecture 8: protein folding.
Homology modelling ? X-ray ? NMR ?. Homology Modelling !
Applications of Homology Modeling Hanka Venselaar.
Homology Modeling Seminar produced by Hanka Venselaar.
Protein Structures.
Protein Structure 101 Alexey Onufriev, Virginia Tech
Bioinformatics Ayesha M. Khan Spring 2013.
Computational Structure Prediction Kevin Drew BCH364C/391L Systems Biology/Bioinformatics 2/12/15.
Protein Tertiary Structure Prediction
Construyendo modelos 3D de proteinas ‘fold recognition / threading’
Computer-Assisted Drug Design (1) i)Random Screening ii)Lead Development and Optimization using Multivariate Statistical Analyses. iii)Lead Generation.
Tertiary Structure Prediction Methods Any given protein sequence Structure selection Compare sequence with proteins have solved structure Homology Modeling.
Practical session 2b Introduction to 3D Modelling and threading 9:30am-10:00am 3D modeling and threading 10:00am-10:30am Analysis of mutations in MYH6.
COMPARATIVE or HOMOLOGY MODELING
CRB Journal Club February 13, 2006 Jenny Gu. Selected for a Reason Residues selected by evolution for a reason, but conservation is not distinguished.
Representations of Molecular Structure: Bonds Only.
1 P9 Extra Discussion Slides. Sequence-Structure-Function Relationships Proteins of similar sequences fold into similar structures and perform similar.
© Wiley Publishing All Rights Reserved. Protein 3D Structures.
Rerun of essentials of week one From Rotamers to Models and back via the Entropy of Water.
Protein Folding Programs By Asım OKUR CSE 549 November 14, 2002.
©CMBI 2003 MUTANT DESIGN BIO- INFORMATICS QUESTION ‘MOLECULAR BIOLOGY’ BIOPHYSICS.
Protein Structure Modelling Many sequences - few structures Homology Modelling - Based on Sequence Similarity with Sequences of Known Structures.
Alexey Onufriev, Virginia Tech
Secondary structure prediction
Protein Structure & Modeling Biology 224 Instructor: Tom Peavy Nov 18 & 23, 2009
©CMBI 2009 Alignment & Secondary Structure You have learned about: Data & databases Tools Amino Acids Protein Structure Today we will discuss: Aligning.
Module 3 Protein Structure Database/Structure Analysis Learning objectives Understand how information is stored in PDB Learn how to read a PDB flat file.
Structure prediction: Homology modeling
Predicting Protein Structure: Comparative Modeling (homology modeling)
Protein Structure Prediction: Homology Modeling & Threading/Fold Recognition D. Mohanty NII, New Delhi.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Programme Last week’s quiz results + Summary Fold recognition Break Exercise: Modelling remote homologues Summary.
Alignment & Secondary Structure You have learned about: Data & databases Tools Amino Acids Protein Structure Today we will discuss: Aligning sequences.
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
PROTEIN STRUCTURE (Donaldson, March 10,2003) What are we trying to learn about genes and their proteins: Predict function for unknown protein by comparison.
Protein Tertiary Structure Prediction Structural Bioinformatics.
PROTEIN MODELLING Presented by Sadhana S.
Protein Structure Visualisation
Computational Structure Prediction
Protein Structure Prediction and Protein Homology modeling
Protein Structure Prediction
Protein Structures.
Molecular Modeling By Rashmi Shrivastava Lecturer
Homology Modeling.
Protein structure prediction.
Programme Last week’s quiz results + Summary
Homology modeling in short…
Presentation transcript:

Applications of Homology Modeling Hanka Venselaar

This seminar…. Homology Modeling… Why? What? When? How? And a few real world examples….

Hearing loss No structure: MGTPWRKRKGIAGPGLPDLSCALVLQPRAQVGTMSPAI ALAFLPLVVTLLVRYRHYFRLLVRTVLLRSLRDCLSGLRI EERAFSYVLTHALPGDPGHILTTLDHWSSRCEYLSHMG PVKGQILMRLVEEKAPACVLELGTYCGYSTLLIARALPP GGRLLTVERDPRTAAVAEKLIRLAGFDEHMVELIVGSSE DVIPCLRTQYQLSRADLVLLAHRPRCYLRDLQLLEAHAL LPAGATVLADHVLFPGAPRFLQYAKSCGRYRCRLHHTG LPDFPAIKDGIAQLTYAGPG DFNB 63 Sequence:

KKIALSDARSMKHALREIKIIRRL DHDNIVKVYEVLGPKGTDLQGELF KFSVAYIVQEYMETDLARLLEQGT LAEEHAKLFMYQLLRGLKYIHSAN VLHRDLPANIFISTEDLVLKIGDF GLARIVDQHYSHKGYLSEGLVTKW YRSPRLLLSPNNYTKAIDMWAAGC ILAEMLTGRMLFAGAHELEQMQLL ETIPVIREEDKDELLRVMPSFVSS ? Why homology modeling? Lab Translation Bioinformatics ATOM 1 N GLN A N ATOM 2 CA GLN A C ATOM 3 C GLN A C ATOM 4 O GLN A O ATOM 5 CB GLN A C ATOM 6 CG GLN A C ATOM 7 CD GLN A C ATOM 8 OE1 GLN A O ATOM 9 NE2 GLN A N ATOM 10 N SER A N ATOM 11 CA SER A C ATOM 12 C SER A C ATOM 13 O SER A O ATOM 14 CB SER A C ATOM 15 OG SER A O ATOM 16 N CYS A N ATOM 17 CA CYS A C ATOM 18 C CYS A C ATOM 19 O CYS A O ATOM 20 CB CYS A C ATOM 21 SG CYS A S ATOM 22 N LEU A N 4

Protein structures – 4 levels Primary Secondar y Tertiary Quaternary  Shape of the protein determines its function…..

Protein structures…where can we find them? Protein DataBank =

PDB-file: contains the coördinaties for every atom in a protein Visualisation with PDB-viewers -Jmol -PyMol -SwissPDB viewer -YASARA

So, 3D Protein-structures provide useful information But…… Not enough protein structures in the PDB database

Predictions/Annotations

Homology modeling in short… Prediction of structure based upon a highly similar structure 2 basic assumptions: Structure defines function During evolution structures are more conserved than sequence 2 basic assumptions: Structure defines function During evolution structures are more conserved than sequence Use one structure to predict another

Homology modeling – When? Example: by 80 residues  30% identity sufficient O

Homology modeling in short… Prediction of structure based upon a highly similar structure Add sidechains, Molecular Dynamics simulation on model Unknown structure NSDSECPLSHDG || || | || NSYPGCPSSYDG NSDSECPLSHDG || || | || NSYPGCPSSYDG Model sequence Known structure Back bone copied Copy backbone and conserved residues Model!

The 8 steps of Homology modeling

1: Template recognition and initial alignment

BLAST your sequence against PDB Initial alignment Best hit is usually your template

1: Template recognition and initial alignment 2: Alignment correction

Functional residues  conserved Use multiple sequence alignments Deletions  shift gaps CPISRTGASIFRCW CPISRTA---FRCW CPISRT---AFRCW CPISRTGASIFRCW CPISRTA---FRCW CPISRT---AFRCW CPISRTAAS-FRCW CPISRTG-SMFRCW CPISRTA--TFRCW CPISRTAASHFRCW CPISRTGASIFRCW CPISRTA---FRCW CPISRTAAS-FRCW CPISRTG-SMFRCW CPISRTA--TFRCW CPISRTAASHFRCW CPISRTGASIFRCW CPISRTA---FRCW Both are possible Multipe sequence alignment Correct alignment  Sequence with known structure  Your sequence

2: Alignment correction Core residues  conserved Use multiple sequence alignments Deletions in your sequence  shift gaps Known structure FDICRLPGSAEAV Model FNVCRMP---EAI Model FNVCR---MPEAI S G P L A E R C IV C R M P E V C R M P E  Correct alignment F-D- -A-V

1: Template recognition and initial alignment 2: Alignment correction 3: Backbone generation

Making the model…. Copy backbone of template to model Make deletions as discussed (Keep conserved residues)

1: Template recognition and initial alignment 2: Alignment correction 3: Backbone generation 4: Loop modeling

Known structure GVCMYIEA---LDKYACNC Your sequence GECFMVKDLSNPSRYLCKC Loop library, try different options

1: Template recognition and initial alignment 2: Alignment correction 3: Backbone generation 4: Loop modeling 5: Sidechain modeling

5: Side-chain modeling Several options Libraries of preferred rotamers based upon backbone conformation

1: Template recognition and initial alignment 2: Alignment correction 3: Backbone generation 4: Loop modeling 5: Sidechain modeling 6: Model optimization

Molecular dynamics simulation Remove big errors Structure moves to lowest energy conformation

1: Template recognition and initial alignment 2: Alignment correction 3: Backbone generation 4: Loop modeling 5: Sidechain modeling 6: Model optimization 7: Model validation

7: Model Validation Second opinion by PDBreport /WHATIF Errors in active site?  new alignment/ template No errors?  Model!

1: Template recognition and initial alignment 2: Alignment correction 3: Backbone generation 4: Loop modeling 5: Sidechain modeling 6: Model optimization 7: Model validation 8: Iteration

Model! 1: Template recognition and initial alignment 2: Alignment correction 3: Backbone generation 4: Loop modeling 5: Sidechain modeling 6: Model optimization 7: Model validation 8: Iteration

8 steps of homology modeling 1: Template recognition and initial alignment 2: Alignment correction 3: Backbone generation 4: Loop modeling 5: Side-chain modeling 6: Model optimization 7: Model validation 8: Iteration Alignment Modeling Correction

Hearing loss Structure! MGTPWRKRKGIAGPGLPDLSCALVLQPRAQVGTMSPAI ALAFLPLVVTLLVRYRHYFRLLVRTVLLRSLRDCLSGLRI EERAFSYVLTHALPGDPGHILTTLDHWSSRCEYLSHMG PVKGQILMRLVEEKAPACVLELGTYCGYSTLLIARALPP GGRLLTVERDPRTAAVAEKLIRLAGFDEHMVELIVGSSE DVIPCLRTQYQLSRADLVLLAHRPRCYLRDLQLLEAHAL LPAGATVLADHVLFPGAPRFLQYAKSCGRYRCRLHHTG LPDFPAIKDGIAQLTYAGPG DFNB 63 Sequence:

Mutation: Tryptophan 105 -> Arginine Hydrophobic contacts from the Tryoptohan are lost, introduction of an hydrophilic and charged residue

The three mutated residues are all important for the correct positioning of Tyrosine 111 Tyrosine 111 is important for substrate binding Published in Nature Genetics: 2008 Oct 26.

Voorbeeld: C-terminale deletie van 10 aa in Dectine Afdeling: Interne geneeskunde of Internal Medicine >Dectin_1_Isoform_a MEYHPDLENLDEDGYTQLHFDSQSNTRIAVVS EKGSCAASPPWRLIAVILGILCLVILVIAVVL GTMAIWRSNSGSNTLENGYFLSRNKENHSQPT QSSLEDSVTPTKAVKTTGVLSSPCPPNWIIYE KSCYLFSMSLNSWDGSKRQCWQLGSNLLKIDS SNELGFIVKQVSSQPDNSFWIGLSRPQTEVPW LWEDGSTFSSNLFQIRTTATQENPSPNCVWIH VSVIYDQLCSVPSYSICEKKFSM

MSQSTQTNEFLSPEVFQHIWDFLEQPICSVQPIDLNFVDEPSEDGATNKI EISMDCIRMQDSDLSDMWPQYTNLGLLNSMDQQIQNGSSSTSPYNTDHAQ NSVTAPSPYAQPSSTFDALSPSPAIPSNTDYPGPHSFDVSFQQSSTAKSA TWTYSTELKKLYCQIAKTCPIQIKVMTPPPQGAVIRAMPVYKKAEHVTEV VKRCPNHELSREFNEGQIAPPSHLIRVEGNSHAQYVEDPITGRQSVLVPY EPPQVGTEFTTVLYNFMCNSSCVGGMNRRPILIIVTLETRDGQVLGRRCF EARICACPGRDRKADEDSIRKQQVSDSTKNGDGTKRPFRQNTHGIQMTSI KKRRSPDDELLYLPVRGRETYEMLLKIKESLELMQYLPQHTIETYRQQQQ QQHQHLLQKQTSIQSPSSYGNSSPPLNKMNSMNKLPSVSQLINPQQRNAL TPTTIPDGMGANIPMMGTHMPMAGDMNGLSPTQALPPPLSMPSTSHCTPP PPYPTDCSIVSFLARLGCSSCLDYFTTQGLTTIYQIEHYSMDDLASLKIP EQFRHAIWKGILDHRQLHEFSSPSHLLRTPSSASTVSVGSSETRGERVID AVRFTLRQTISFPPRDEWNDFNFDMDARRNKQQRIKEEGE P63 sequence Structure! EEC syndrome

Arginine Serine Mutation R  S Loss of negative charge Loss of interaction with the DNA

Homology Modeling… What? Prediction of an unknown structure based on an homologous and known structure Why? To answer biological and medical questions when the “real” structure is unknown When? A template with enough identity must be available How? 8 Steps  Use the models for mutant analysis, experimental design and understanding of the protein in general To conclude….