PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D.

Slides:



Advertisements
Similar presentations
LSM2104/CZ2251 Essential Bioinformatics and Biocomputing Essential Bioinformatics and Biocomputing Protein Structure and Visualization (2) Chen Yu Zong.
Advertisements

Determination of Protein Structure. Methods for Determining Structures X-ray crystallography – uses an X-ray diffraction pattern and electron density.
Web Resources for Bioinformatics Vadim Alexandrov and Mark Gerstein.
PROTEOMICS 3D Structure Prediction. Contents Protein 3D structure. –Basics –PDB –Prediction approaches Protein classification.
Tutorial Homology Modelling. A Brief Introduction to Homology Modeling.
1 Protein Structure, Structure Classification and Prediction Bioinformatics X3 January 2005 P. Johansson, D. Madsen Dept.of Cell & Molecular Biology, Uppsala.
Protein Tertiary Structure Prediction
Tema 14. Bases of protein structure and structural prediction. Structural data bank. Protein Data Bank. Molecular Visualization Tools for 3D. Prediction.
Structural bioinformatics
Structure Prediction. Tertiary protein structure: protein folding Three main approaches: [1] experimental determination (X-ray crystallography, NMR) [2]
Protein structure determination. Tertiary protein structure: protein folding Three main approaches: [1] experimental determination (X-ray crystallography,
Protein structure (Part 2 of 2).
Structure Prediction. Tertiary protein structure: protein folding Three main approaches: [1] experimental determination (X-ray crystallography, NMR) [2]
The Protein Data Bank (PDB)
ProteinStructuralDatabases. Proteins are built from amino-acids. Introduction H | NH2-c-CO2H | R.
CISC667, F05, Lec20, Liao1 CISC 467/667 Intro to Bioinformatics (Fall 2005) Protein Structure Prediction Protein Secondary Structure.
Protein Tertiary Structure. Primary: amino acid linear sequence. Secondary:  -helices, β-sheets and loops. Tertiary: the 3D shape of the fully folded.
Protein structure determination & prediction. Tertiary protein structure: protein folding Three main approaches: [1] experimental determination (X-ray.
IV. Protein Structure Prediction and Determination Methods of protein structure determination Critical assessment of structure prediction Homology modelling.
Protein Structure and Function Prediction. Predicting 3D Structure –Comparative modeling (homology) –Fold recognition (threading) Outstanding difficult.
Protein Tertiary Structure Prediction Structural Bioinformatics.
Protein structures in the PDB
Protein structure Classification Ole Lund, Associate professor, CBS, DTU.
BLOSUM Information Resources Algorithms in Computational Biology Spring 2006 Created by Itai Sharon.
PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D.
Protein Structure Prediction II
Introduction to Bioinformatics - Tutorial no. 8 Protein Prediction: - PROSITE - Pfam - SCOP - TOPITS - genThreader.
Protein Tertiary Structure Prediction Structural Bioinformatics.
Protein Structures.
Bioinformatics Ayesha M. Khan Spring 2013.
Protein Structure Prediction and Analysis
Homology Modeling David Shiuan Department of Life Science and Institute of Biotechnology National Dong Hwa University.
Protein Tertiary Structure Prediction
Chapter 12 Protein Structure Basics. 20 naturally occurring amino acids Free amino group (-NH2) Free carboxyl group (-COOH) Both groups linked to a central.
Part II : Introduction To Protein Structure Kong Lesheng Victor Tong Joo Chuan National University of Singapore.
Structural alignment Protein structure Every protein is defined by a unique sequence (primary structure) that folds into a unique.
Macromolecular structure
Practical session 2b Introduction to 3D Modelling and threading 9:30am-10:00am 3D modeling and threading 10:00am-10:30am Analysis of mutations in MYH6.
Genomics and Personalized Care in Health Systems Lecture 9 RNA and Protein Structure Leming Zhou, PhD School of Health and Rehabilitation Sciences Department.
COMPARATIVE or HOMOLOGY MODELING
Protein 3D-structure analysis Exercises. Practicals Find update frequency for RCSB PDB: weekly. When was the last update? How many protein structures.
SMART Teams: Students Modeling A Research Topic Jmol Training 101!
Gene Annotation and Analysis Lab Work Reference: European Multimedia Bioinformatics Educational Resource.
 Four levels of protein structure  Linear  Sub-Structure  3D Structure  Complex Structure.
Bioinformatics 2 -- Lecture 8 More TOPS diagrams Comparative modeling tutorial and strategies.
1 P9 Extra Discussion Slides. Sequence-Structure-Function Relationships Proteins of similar sequences fold into similar structures and perform similar.
CATH – a hierarchic classification of protein domain structures Rui Kuang.
Neural Networks for Protein Structure Prediction Brown, JMB 1999 CS 466 Saurabh Sinha.
PROTEIN STRUCTURE CLASSIFICATION SUMI SINGH (sxs5729)
1 Enter the following Micro-RNA sequence into the box Run MFold and look at the results MFold Using MFold to predict RNA secondary structure
Protein Classification II CISC889: Bioinformatics Gang Situ 04/11/2002 Parts of this lecture borrowed from lecture given by Dr. Altman.
Protein Structure & Modeling Biology 224 Instructor: Tom Peavy Nov 18 & 23, 2009
Protein Strucure Comparison Chapter 6,7 Orengo. Helices α-helix4-turn helix, min. 4 residues helix3-turn helix, min. 3 residues π-helix5-turn helix,
Module 3 Protein Structure Database/Structure Analysis Learning objectives Understand how information is stored in PDB Learn how to read a PDB flat file.
Protein Tertiary Structure. Protein Data Bank (PDB) Contains all known 3D structural data of large biological molecules, mostly proteins and nucleic acids:
Protein Modeling Protein Structure Prediction. 3D Protein Structure ALA CαCα LEU CαCαCαCαCαCαCαCα PRO VALVAL ARG …… ??? backbone sidechain.
Homology modeling with SWISS-MODEL
DDPIn Distance and Density Based Protein Indexing David Hoksza Charles University in Prague Department of Software Engineering Czech Republic.
1.  Introduction  STARTING a SPDB view Session  Basic SPDB view Commands  Advanced SPDB view Commands  Ending a SPDB view Session 2.
Guidelines for sequence reports. Outline Summary Results & Discussion –Sequence identification –Function assignment –Fold assignment –Identification of.
Structural alignment methods Like in sequence alignment, try to find best correspondence: –Look at atoms –A 3-dimensional problem –No a priori knowledge.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Lecture 10 CS566 Fall Structural Bioinformatics Motivation Concepts Structure Solving Structure Comparison Structure Prediction Modeling Structural.
An Efficient Index-based Protein Structure Database Searching Method 陳冠宇.
Protein Tertiary Structure Prediction Structural Bioinformatics.
Sequence: PFAM Used example: Database of protein domain families. It is based on manually curated alignments.
Homology 3D modeling Miguel Andrade Mainz, Germany Faculty of Biology,
Protein Structure Prediction and Protein Homology modeling
Protein Structures.
Protein structure prediction.
Presentation transcript:

PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D structure prediction ModBase-A database of 3D struc. Predict. Protein Structure Prediction

How are structures solved experimentally? X-Ray crystalography: Diffraction patterns are recorded from x-ray beams hitting a crystalized array of molecules. NMR: Nuclear magnetic resonance, magnetic nuclei absorb and re-emit magnetic radiation in frequencies depending on their properties. Cryo-EM: In Cryo-electron microscopy molecules are frozen in a thin tube, EM records many low resolution projections of the molecule and it is computationaly combined. Many other methods exist.

Embedding: from distances to shape חיפהת"אירושליםאילתאשדוד חיפה ת"א ירושלים אילת אשדוד

PDB: Curation of solved structures

PDB file Accession number Java based visualization tools Structural Classification

PDB provides the atomic coordinates of the structure : Which can be viewed by different visualization tools

SCOP: Structural Classification of Proteins Based on known protein structures Manually created by visual inspection Hierarchical database structure: –Class, Fold, Superfamily, Family, Protein and Species

Parents of node Children of node Node

Parents of node Children of node Node

CATH: Protein Structure Classification by Class, Architecture, Topology and Homology Class: The secondary structure composition: mainly-alpha, mainly-beta and alpha-beta. Architecture: The overall shape of the domain structure. Orientations of the secondary structures : e.g. barrel or 3- layer sandwich. Topology: Structures are grouped into fold groups at this level depending on both the overall shape and connectivity of the secondary structures. Homologous Superfamily: Evolutionary conserved structures

CATH: Protein Structure Classification by Class, Architecture, Topology and Homology

Prediction: Comparative Modeling Various methods: –Homology modeling –Protein threading –Side-chain geometry prediction Accuracy of the comparative model is related to the sequence identity on which it is based >50% sequence identity = high accuracy 30%-50% sequence identity = 90% modeled <30% sequence identity = low accuracy (many errors)

SWISS-MODEL An automated protein homology modeling server.

SWISS-MODEL The SWISS-MODEL algorithm can be divided into three steps: 1.Search for suitable templates: the server finds all similarities of a query sequence to sequences of known structure. It uses the BLASTP2 program with the ExNRL-3D database (a derivative of PDB database, specified for SWISS-MODEL). You get these partial results as a SwissModel TraceLog file. 2.Check sequence identity with target: All templates with sequence identities above 25% are selected 3.Create the model using the ProModII program. You get this as a SwissModel-Model file.

SWISS-MODEL Get PDB file by Load to J-Mol

Single Structure Homology Modeling

Swiss-Model file Structures used for the homology model query

ModBase A Homology Model Database

GenTHREADER An automated protein threading server. Input sequence Type of Analysis (PSIPRED,MEMSAT,genTHREAD)

GenTHREADER

Output The output sequences show some extent of sequence homology But high level of secondary structure conservation

Ab inito modeling Based on physical (chemical) properties of amino acids –Leading contender in the field foldit –Crowd-sourcing software –Designed as a game where the goal is to optimize a structure –Dozens of published papers referencing itpublished papers

Exercise In this exercise we will analyze two structures of the protein Lysozyme. the sequences of those proteins have small differences. 1.Download Pymol (after registering) 2. Load the two structures 1LYD.pdb,1L35.pdb 3.Use the Cartoon option for visualizing the structures. 4.Align the structures using the command: align /1lyd,/1l35 Analyze the difference in structures, what is the RMSD (Root Mean Square – represents the distance between the structures)?

Results Show Cartoon Hide lines