Computational Analysis of Proteins Dr. K. Sivakumar Department of Chemistry SCSVMV University Chemistry – Our Life, Our Future National.

Slides:



Advertisements
Similar presentations
Tutorial Homology Modelling. A Brief Introduction to Homology Modeling.
Advertisements

PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D.
Prediction to Protein Structure Fall 2005 CSC 487/687 Computing for Bioinformatics.
Structural bioinformatics
Lipids, Proteins, and Carbohydrates
Structure Prediction. Tertiary protein structure: protein folding Three main approaches: [1] experimental determination (X-ray crystallography, NMR) [2]
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modeling Anne Mølgaard, CBS, BioCentrum, DTU.
What is bioinformatics? Answer: It depends who you ask.
Visualizing Protein Structures. Genetic information, stored in DNA, is conveyed as proteins.
Day 2. Genetic information, stored in DNA, is conveyed as proteins.
Tertiary protein structure viewing and prediction July 5, 2006 Learning objectives- Learn how to manipulate protein structures with Deep View software.
Introduction to Structural Bioinformatics Dong Xu Computer Science Department 271C Life Sciences Center 1201 East Rollins Road University of Missouri-Columbia.
Thomas Blicher Center for Biological Sequence Analysis
Computational Biology, Part 10 Protein Structure Prediction and Display Robert F. Murphy Copyright  1996, 1999, All rights reserved.
The Protein Data Bank (PDB)
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
Proteins and Protein Function Charles Yan Spring 2006.
CISC667, F05, Lec20, Liao1 CISC 467/667 Intro to Bioinformatics (Fall 2005) Protein Structure Prediction Protein Secondary Structure.
Protein Tertiary Structure. Primary: amino acid linear sequence. Secondary:  -helices, β-sheets and loops. Tertiary: the 3D shape of the fully folded.
Protein structure prediction May 30, 2002 Quiz#4 on June 4 Learning objectives-Understand difference between primary secondary and tertiary structure.
Protein structure determination & prediction. Tertiary protein structure: protein folding Three main approaches: [1] experimental determination (X-ray.
Protein Tertiary Structure Prediction Structural Bioinformatics.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modelling Thomas Blicher Center for Biological Sequence Analysis.
Computing for Bioinformatics Lecture 8: protein folding.
Protein Tertiary Structure Prediction Structural Bioinformatics.
Associations of amphipathic molecules in aqueous solutions.
Chemistry 121(001) Winter 2015 Introduction to Organic Chemistry and Biochemistry Instructor Dr. Upali Siriwardane (Ph.D. Ohio State)
Protein Structure Prediction and Analysis
Homology Modeling David Shiuan Department of Life Science and Institute of Biotechnology National Dong Hwa University.
Protein Tertiary Structure Prediction
Part II : Introduction To Protein Structure Kong Lesheng Victor Tong Joo Chuan National University of Singapore.
Wellcome Trust Workshop Working with Pathogen Genomes Module 3 Sequence and Protein Analysis (Using web-based tools)
Development of Bioinformatics and its application on Biotechnology
P2 Discussion 1. Revise on Central Dogma 2
Basic Introduction of BLAST Jundi Wang School of Computing CSC691 09/08/2013.
Practical session 2b Introduction to 3D Modelling and threading 9:30am-10:00am 3D modeling and threading 10:00am-10:30am Analysis of mutations in MYH6.
Genomics and Personalized Care in Health Systems Lecture 9 RNA and Protein Structure Leming Zhou, PhD School of Health and Rehabilitation Sciences Department.
Enzymes (Proteins) Standards 1b, 1h, 4e, 4f, From the largest entity in the Universe to the smallest entity that makes up all the matter in the Universe.
Fast Search Protein Structure Prediction Algorithm for Almost Perfect Matches1 By Jayakumar Rudhrasenan S Primary Supervisor: Prof. Heiko Schroder.
1 P9 Extra Discussion Slides. Sequence-Structure-Function Relationships Proteins of similar sequences fold into similar structures and perform similar.
© Wiley Publishing All Rights Reserved. Protein 3D Structures.
3D structure -Swiss Pdb Viewer
Neural Networks for Protein Structure Prediction Brown, JMB 1999 CS 466 Saurabh Sinha.
Protein Folding Programs By Asım OKUR CSE 549 November 14, 2002.
Amino Acids Colorless, crystalline, water soluble substances Distinguishing features are a -COOH group and a -NH 3 group attached to the same carbon R.
1 Enter the following Micro-RNA sequence into the box Run MFold and look at the results MFold Using MFold to predict RNA secondary structure
Protein Classification II CISC889: Bioinformatics Gang Situ 04/11/2002 Parts of this lecture borrowed from lecture given by Dr. Altman.
Protein Structure & Modeling Biology 224 Instructor: Tom Peavy Nov 18 & 23, 2009
Module 3 Protein Structure Database/Structure Analysis Learning objectives Understand how information is stored in PDB Learn how to read a PDB flat file.
Protein Tertiary Structure. Protein Data Bank (PDB) Contains all known 3D structural data of large biological molecules, mostly proteins and nucleic acids:
Protein Modeling Protein Structure Prediction. 3D Protein Structure ALA CαCα LEU CαCαCαCαCαCαCαCα PRO VALVAL ARG …… ??? backbone sidechain.
Predicting Protein Structure: Comparative Modeling (homology modeling)
Protein Structure Prediction: Homology Modeling & Threading/Fold Recognition D. Mohanty NII, New Delhi.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Example of regression by RBF-ANN Prediction of charge on peptides after electron-spray ionization in mass spectrometry What are the best attributes to.
Homology Modeling 原理、流程,還有如何用該工具去預測三級結構 Lu Chih-Hao 1 1.
1.  Introduction  STARTING a SPDB view Session  Basic SPDB view Commands  Advanced SPDB view Commands  Ending a SPDB view Session 2.
Protein Structure and Bioinformatics. Chapter 2 What is protein structure? What are proteins made of? What forces determines protein structure? What is.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Copyright OpenHelix. No use or reproduction without express written consent1.
Copyright OpenHelix. No use or reproduction without express written consent1.
Protein Tertiary Structure Prediction Structural Bioinformatics.
Proteins Structure Predictions Structural Bioinformatics.
Amino Acids, Peptides, and Proteins. Peptides and proteins are polymers of amino acids linked together by amide bonds.
Bioinformatics Computing 1 CMP 807 – Day 4 Kevin Galens.
PROTEIN MODELLING Presented by Sadhana S.
Computational Structure Prediction
Protein Structure Prediction and Protein Homology modeling
Molecular Modeling By Rashmi Shrivastava Lecturer
Homology Modeling.
Presentation transcript:

Computational Analysis of Proteins Dr. K. Sivakumar Department of Chemistry SCSVMV University Chemistry – Our Life, Our Future National Workshop on Modern Techniques in Analytical Chemistry

AMINO ACIDS: THE BUILDING BLOCKS OF PROTEINS Triple & single letter codes of amino acids General structure of an amino acid Amino Acid Triple letter code Single letter code AlanineAlaA CysteineCysC Aspartic acidAspD Glutamic acidGluE PhenylalaninePheF GlycineGlyG HistidineHisH IsoleucineIleI LysineLysK LeucineLeuL MethionineMetM AsparagineAsnN ProlineProP GlutamineGlnQ ArginineArgR SerineSerS ThreonineThrT ValineValV TryptophanTrpW TyrosineTyrY 2

PROTEIN SEQUENCING ( Order of amino acids in proteins) MALSFTVGQLIFLFWTMRITEASPD Methionine Alanine Leucine Serine Phenylalanine Protein sequence Protein sequencer Protein sequencing - determining the order of amino acid sequence Methods– Mass Spec., Edman degradation,…. Amino acids in a protein - determines the properties of proteins Proteins are sequenced - by microbiologists and biotechnologists for various purposes. 3

4 Refer “GENOME” by Sujatha, for simple explanations on sequencing process

5 Various levels of protein structure……..

Methane Primary structure Secondary structure Tertiary structure Protein Primary structure Secondary structure Tertiary structure M for Metheonine M for group of atoms C for carbon C for single atom

Protein sequences are continuously submitted by sequencing centers and updated in protein databases. Till date more than 10 Lac proteins are sequenced and publicly made available through protein databases. For example, 524,420 Protein Sequence Databases No. of Sequences 1,365,912 13,593,921 7

Sequence growth in Protein sequence databases: Ref: SwissProt – Feb’ 2011Ref: GenomeNet – Feb’ 2011

70,947 Till 01, Feb, ,420 - ~ 5 Lac Protein Sequence Databases No. of Sequences 1,365,912 - > 10 Lac 13,593,921 - ~ 1 Cr The ONLY Protein Structure DatabaseNo. of Structure Ref: K. Sivakumar, Advanced BioTech, V (9), (2007)

10 PDB contains (70,947) structures determined by X-ray, NMR & Electron microscopy EM ~350 NMR ~8,700 X-ray ~60,500

 Most of the sequenced proteins lack a descriptive, documented physico-chemical and STRUCTURAL characterization.  Because, experimental methods (X-ray, NMR, EM) are,  Trial and error based  Time consuming  Expensive 11 Computational methods are,  Minimizing the number of experimental trials.  Reduces the cost of experimental investigation.  Facilitates experimental analysis be more focused. Ref: K. Sivakumar, S. Balaji, Ganga Radhakrishnan, Journal of Theoretical and Computational Chemistry, 6 (1), (2007).

12 Need for computational analysis > 10 Lac sequences are available in public databases Sequences are highly valuable resources, because… Huge amount of structural, functional & evolutionary information are locked up in sequences By contrast, the # of unique protein structures is very less - this represents a huge information deficit So, We need to construct 3D Models by COMPUTATIONAL METHODS

13 3D Structure can be modelled by… Homology Modeling Threading Ab initio

Ref: K. Sivakumar, Advanced BioTech, IV (11), (2006) Repeated with other suitable templates 14 Homology Modeling – Principle…

? KQFTKCELSQNLYDIDGYGRIALPELICTMF HTSGYDTQAIVENDESTEYGLFQISNALWCK SSQSPQSRNICDITCDKFLDDDITDDIMCAK KILDIKGIDYWIAHKALCTEKLEQWLCEKE Predicting Protein Structure: Comparative Modeling (formerly, homology modeling) Use as template & model 8lyz 1alc KVFGRCELAAAMKRHGLDNYRGYSLGNWVCAAKFES NFNTQATNRNTDGSTDYGILQINSRWWCNDGRTPGS RNLCNIPCSALLSSDITASVNCAKKIVSDGNGMNAW VAWRNRCKGTDVQAWIRGCRL Share Similar Sequence Homologous Target sequence Template sequence Template structure

What is Homology Modeling? Predicts the three-dimensional structure of a given protein sequence (TARGET) based on an alignment to one or more known protein structures (TEMPLATES) If similarity between the TARGET sequence and the TEMPLATE sequence is detected, structural similarity can be assumed. In general, 30% sequence identity is required for generating useful models.

17 Homology Modeling Get protein sequence from sequence database

18 Click to get protein details

19 Click to get protein sequence

20 protein sequence in fasta format Save it in a notepad for further use

21 Using Protein Blast server to find similar STRUCTURE Click to search, similar structures in PDB Paste sequence in Fasta format Choose PDB

22 Graphical summary of Blastp suite Blast search of O70456 Vs PDB

23 List of similar structure - Blastp suite

24 Detailed summary of Blastp suite

25 Paste sequence only Type the PDB ID Method1: EsyPred3D server - Submit the sequence and PDB ID Click to submit

26 Get built in structure through in Inbox

27 Download the attached the *.pdb file and save it

28 Open and visualize the *.pdb file in RasMol

29 Open and visualize the *.pdb file in RasMol

30 Method2: SWISS-MODEL server Click for modeling

31 Submit sequence only in Fasta format (without PDB ID) Similarity search (BlastP) will be done by SWISS-MODEL server Paste sequence Click to submit

32 Get built in structure through in Inbox

33 The links in the will lead to Click to download 3D structure

34 Open and visualize the *.pdb file in RasMol

35 Structure retrieval from Protein 3D Structure Database – PDB……….

36 Structure retrieval from Protein 3D Structure Database – PDB………. PDB ID Click for protein details 491 sequence in SwissProt for « Keratin »

37 Structure retrieval from Protein 3D Structure Database – PDB………. Click for downloading structure

38 Structure retrieval from Protein 3D Structure Database – PDB………. Save & Know the location

39 Open and visualize the *.pdb file in RasMol Structure of 3EUU

40 MNRVDLSLFIPDSLTAETGDLKIKTYKVVLIAR AASIFGVKRIVIYHDDADGEARFIRDILTYMDT PQYLRRKVFPIMRELKHVGILPPLRTPHHPTG Sequence data Structural data (in notepad) Structural data (in RasMol)

41 Built model validation by ProQ server Click for uploading structure

42 Built model validation by ProQ server Click & upload the structure

43 Built model validation by ProQ server Submit after uploading

44 Built model validation by ProQ server result

45 Built model validation by Ramachandran Plot Click & upload the structure

46 Submit after uploading Built model validation by Ramachandran Plot….

47 Built model validation by Ramachandran Plot…. RESULTS G.N.Ramachandran

Ref: K. Sivakumar, S. Balaji, Ganga Radhakrishnan, Journal of Chemical Sciences, 119 (5), (2007) 3D structure modeling and validation 48

Disulphide bridges in 3D structure of Q01758 Backbone of Q01758 (rainbow smelt fish) 10 Cysteines - ball and stick 10 Sulphur in Cysteines and 5 SS bonds (dotted lines) 49

Disulphide bridges in 3D structure of P05140 Ribbon model of P05140 (sea raven) 10 Cysteines - ball and stick 10 Sulphur in Cysteines and 5 SS bonds (dotted lines) 50

Secondary structure prediction from modeled 3D structure Q01758 P05140 Beta strand  -helices Coil 51

52 Finding cavities in the built model using Castp server Click for calculation

53 Finding cavities in the built model using Castp server Click, upload & Submit the structure

54 Finding cavities in the built model using Castp server - RESULTS

55 For literature

56

58

59 Download sequence file for any one of the following proteins from Swissprot/Protein Information Resource/Protein Research Foundation, Antifreeze Vascular Endothelial growth factor protein Keratin Generate atleast 3 homology models using EsyPred server or SWISS- model server (i.e., using different PDB structures) Visualize the structure using RasMol tool Compare and Evaluate the modelled 3D structure using RamPage, ProQ Server and Combinatorial Extension servers. EXERCISE Target sequence codeTemplate (PDB) Codes RamPageProQ Percentage of residues in favoured region LG ScoreMaxSub

60 Generate the report in MS-Word file and submit to Repeat the exercise for other protein sequences of your choice EXERCISE……

Thank you all! 61 P05140