Corrections. SEQUENCE 4 >seq4 MSTNNYQTLSQNKADRMGPGGSRRPRNSQHATASTPSASSCKEQQKDVEH EFDIIAYKTTFWRTFFFYALSFGTCGIFRLFLHWFPKRLIQFRGKRCSVE NADLVLVVDNHNRYDICNVYYRNKSGTDHTVVANTDGNLAELDELRWFKY.

Slides:



Advertisements
Similar presentations
Assignment of PROSITE motifs to topological regions: Application to a novel database of well characterised transmembrane proteins Tim Nugent.
Advertisements

Progress in Transmembrane Protein Research 12 Month Report Tim Nugent.
Structural Classification and Prediction of Reentrant Regions in Alpha-Helical Transmembrane Proteins: Application to Complete Genomes Håkan Viklunda,
Assignment of PROSITE motifs to topological regions: Application to a novel database of well characterised transmembrane proteins Tim Nugent 6 Month.
Secondary structure prediction from amino acid sequence.
Russell Group, Protein Evolution _________ ____. Russell Group, Protein Evolution _________ ____ Rob Russell Cell Networks University of Heidelberg Putting.
Molecular Basis for Relationship between Genotype and Phenotype DNA RNA protein genotype function organism phenotype DNA sequence amino acid sequence transcription.
Javad Jamshidi Fasa University of Medical Sciences Proteins Into membranes and Organelles and Vesicular Traffic Moving.
Tools to analyze protein characteristics Protein sequence -Family member -Multiple alignments Identification of conserved regions Evolutionary relationship.
©CMBI 2005 Exploring Protein Sequences – Part 1 Part 1: Patterns and Motifs Profiles Hydropathy Plots Transmembrane helices Antigenic Prediction Signal.
© Wiley Publishing All Rights Reserved. Analyzing Protein Sequences.
Prediction of protein localization and membrane protein topology Gunnar von Heijne Department of Biochemistry and Biophysics Stockholm Bioinformatics Center.
Tools to analyze protein characteristics Protein sequence -Family member -Multiple alignments Identification of conserved regions Evolutionary relationship.
An Introduction to Bioinformatics Protein Structure Prediction.
Protein Secondary Structure : Kendrew Solves the Structure of Myoglobin “Perhaps the most remarkable features of the molecule are its complexity.
Today’s menu: -UniProt - SwissProt/TrEMBL -PROSITE -Pfam -Gene Onltology Protein and Function Databases Tutorial 7.
PREDICTION OF PROTEIN FEATURES Beyond protein structure (TM, signal/target peptides, coiled coils, conservation…)
Lecture 19-20: Protein Synthesis and the Genetic Code and Synthesis of Membrane Proteins Reading Assignment: Chapter 40 and 43, pgs and ;
Predicting Function (& location & post-tln modifications) from Protein Sequences June 15, 2015.
Lecture 3. α domain structures Coiled-coil, knobs and hole packing Four-helix bundle Donut ring large structure Globin fold Ridges and grooves model CS882,
A Genetic Approach to Analyzing Membrane Protein Topology Colin Manoil and Jon Beckwith Science, Vol. 233, , September 26, 1986.
Advanced Tools and Algorithms in Bioinformatics Chittibabu Guda Summer, 2004 UCSD Extension, Department of Biosciences.
Secondary Structure Prediction Protein Analysis Workshop 2008 Bioinformatics group Institute of Biotechnology University of helsinki Hung Ta
Proteins: Amino Acid Chains DNA Polymerase from E. coli Standard amino acid backbone: Carboxylic acid group, amino group, the alpha hydrogen and an R group.
Levels of Protein Structure
Epitope Selection Rational Vaccine design. Immune System Differential distribution of MHC molecules Cell activation affects the level of MHC expression.
Secondary Structure Prediction and Signal Peptides Protein Analysis Workshop 2012 Bioinformatics group Institute of Biotechnology University of helsinki.
Day 2: Protein Sequence Analysis 1.Physico-chemical properties. 2.Cellular localization. 3.Signal peptides. 4.Transmembrane domains. 5.Post-translational.
© Wiley Publishing All Rights Reserved. Protein 3D Structures.
You have worked for 2 years to isolate a gene involved in axon guidance. You sequence the cDNA clone that contains axon guidance activity. What do you.
AMPK and Apoptosis Emeline Van Goethem José Manuel López Apoptosis or programmed cell death is an important part of the development of all multicellular.
Mrs. Einstein Research in Molecular Biology. Importance of proteins for cell function: Proteins are the end product of the central dogma YOU are your.
Localization prediction of transmembrane proteins Stefan Maetschke, Mikael Bodén and Marcus Gallagher The University of Queensland.
es/by-sa/2.0/. From Protein Sequence to Protein Properties Prof:Rui Alves Dept Ciencies.
LECT 20: PROTEIN SYNTHESIS AND TRANSLATIONAL CONTROL High fidelity of protein synthesis from mRNA is essential. Mechanisms controling translation accuracy.
Manually Adjusting Multiple Alignments Chris Wilton.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
PROTEINS BIT 230 Biochemistry Purification Characterization.
Protein Properties Function, structure Residue features Targeting Post-trans modifications BIO520 BioinformaticsJim Lund Reading: Chapter , 11.7,
MEMBRANE STRUCTURE LECTURE 4 CHAPTER 10.
Protein Structure and Bioinformatics. Chapter 2 What is protein structure? What are proteins made of? What forces determines protein structure? What is.
Protein motif /domain Structural unit Functional unit Signature of protein family How are they defined?
1 Computational Approaches(1/7)  Computational methods can be divided into four categories: prediction methods based on  (i) The overall protein amino.
Protein Structure Prediction. Protein Sequence Analysis Molecular properties (pH, mol. wt. isoelectric point, hydrophobicity) Secondary Structure Super-secondary.
Structure and Function
Protein families, domains and motifs in functional prediction May 31, 2016.
Predicting Structural Features Chapter 12. Structural Features Phosphorylation sites Transmembrane helices Protein flexibility.
Protein families, domains and motifs in functional prediction
Prediction of protein features. Beyond protein structure
Protein Families, Motifs & Domains.
Sequence based searches:
C N TM1 (a) WARLVMCFVLVLITTSIWTLIMV SOSUI PSORT II
7.3 Translation udent_view0/chapter3/animation__how_translation_work s.html.
Introduction & overview
Sequence alignment of C-terminal phosphorylated plant aquaporins
Protein Structure Prediction
Proteins: Secondary Structure Alpha Helix
Relationship between Genotype and Phenotype
Phosphopeptides identified harboring minimal binding motifs
Import Determinants of Organelle-Specific and Dual Targeting Peptides of Mitochondria and Chloroplasts in Arabidopsis thaliana  Changrong Ge, Erika Spånning,
ExPASy (Expert Protein Analysis System)
Percentage of proteins identified in envelope membrane extracts according to the purification method and the number of transmembrane domains. Percentage.
Protein information in the Human Protein Atlas.
Correction of translational start site by identification of N-terminal peptide. Correction of translational start site by identification of N-terminal.
N-terminal extension of a gene using peptides mapping upstream to an annotated start site. N-terminal extension of a gene using peptides mapping upstream.
Volume 5, Issue 3, Pages (March 1997)
Relationship between Genotype and Phenotype
General structure of RIFINs and STEVORs
Phosphopeptides identified harboring minimal binding motifs
Looking at periodicity in protein sequence and structure
Presentation transcript:

Corrections

SEQUENCE 4 >seq4 MSTNNYQTLSQNKADRMGPGGSRRPRNSQHATASTPSASSCKEQQKDVEH EFDIIAYKTTFWRTFFFYALSFGTCGIFRLFLHWFPKRLIQFRGKRCSVE NADLVLVVDNHNRYDICNVYYRNKSGTDHTVVANTDGNLAELDELRWFKY RKLQYTWIDGEWSTPSRAYSHVTPENLASSAPTTGLKADDVALRRTYFGP NVMPVKLSPFYELVYKEVLSPFYIFQAISVTVWYIDDYVWYAALIIVMSL YSVIMTLRQTRSQQRRLQSMVVEHDEVQVIRENGRVLTLDSSEIVPGDVL VIPPQGCMMYCDAVLLNGTCIVNESMLTGESIPITKSAISDDGHEKIFSI DKHGKNIIFNGTKVLQTKYYKGQNVKALVIRTAYSTTKGQLIRAIMYPKP ADFKFFRELMKFIGVLAIVAFFGFMYTSFILFYRGSSIGKIIIRALDLVT IVVPPALPAVMGIGIFYAQRRLRQKSIYCISPTTINTCGAIDVVCFDKTG TLTEDGLDFYALRVVNDAKIGDNIVQIAANDSCQNVVRAIATCHTLSKIN NELHGDPLDVIMFEQTGYSLEEDDSESHESIESIQPILIRPPKDSSLPDC QIVKQFTFSSGLQRQSVIVTEEDSMKAYCKGSPEMIMSLCRPETVPENFH DIVEEYSQHGYRLIAVAEKELVVGSEVQKTPRQSIECDLTLIGLVALENR LKPVTTEVIQKLNEANIRSVMVTGDNLLTALSVARECGIIVPNKSAYLIE HENGVVDRRGRTVLTIREKEDHHTERQPKIVDLTKMTNKDCQFAISGSTF SVVTHEYPDLLDQLVLVCNVFARMAPEQKQLLVEHLQDVGQTVAMCGDGA NDCAALKAAHAGISLSEAEASIAAPFTSKVADIRCVITLISEGRAALVTS YSAFLCMAGYSLTQFISILLLYWIATSYSQMQFLFIDIAIVTNLAFLSSK TRAHKELASTPPPTSILSTASMVSLFGQLAIGGMAQVAVFCLITMQSWFI PFMPTHHDNDEDRKSLQGTAIFYVSLFHYIVLYFVFAAGPPYRASIASNK AFLISMIGVTVTCIAIVVFYVTPIQYFLGCLQMPQEFRFIILAVATVTAV ISIIYDRCVDWISERLREKIRQRRKGA

Compute pI/Mw tool !!! If you choose the wrong format for the sequence… With the correct format:

ProtParam

SAPS

SAPS (1)

SAPS (2)

doi: /bioinformatics/bti797

The coiled-coil domains are annotated according to 3D structure data (experimental data)

Coiled-coil prediction Coils

Coils prediction

Coiled-coil prediction PairCoil (not always working…)

Paircoil prediction

Coiled-coil prediction PairCoil2

Parcoil2 results

Coiled-coil prediction Sliding window (Protscale)

Sliding window amino acid scale- example:

Bad results---- Bad results….

Sliding windows and amino acid scales Transmembrane domain: alpha-helix of 20 amino acids (hydrophobic) -> amino acid scales: hydrophobicity and alpha helix -> sliding window size: 20 amino acids

Protscale Amino acid scale: Kyte and Doolittle (hydrophobicity) Sliding window size: 21 amino acids

Protscale Amino acid scale: Chou&Fasman (alpha helix) Sliding window size: 21

Sliding windows and amino acid scales Transmembrane domain: alpha-helix of 20 amino acids (hydrophobic) -> amino acid scales: hydrophobicity and alpha helix -> sliding window size: 20 amino acids

Method based HMM or NN

HMMTOP

Protein: seq4 Length: 1127 N-terminus: IN Number of transmembrane helices: 8 Transmembrane helices:

TMHMM (1)

TMHMM (2)

TMpred (1)

PSORT II (1)

- Look for the presence of a signal peptide.

No signal peptide Signal peptides are often predicted as ‘transmembrane’ domains (or vice versa) as they amino acids with similar biochemical properties (hydrophic and alpha helix).

Transmembrane: resume HMMTOP (8 TM) PSORT II (10 TM) Tmpred (10 TM) TMHMM (11 TM) in out Big loop

? missed TM

The protein is known to contain 12 TM: one TM is missing at the N-terminus The possible ways to find the correct protein topology is to do a multiple alignment with other family members, or to do some 3D experiment (which are difficult with proteins containing transmembrane domains) Kristian Axelsen: personnal communication SEQ4 = Q9N323Q9N323

The Aquaglyceroporin contains ½ transmembrane regions which can not be predicted by programs, because the region is too short (less than 20 amino acids). There is no way to predict such transmembrane regions, except by doing 3D experiments. 3D experiments is the only way to confirm and ‘predict’ correctly transmembrane domains. Similarity analysis could then help to predict such regions in other protein of the same family. P0AER0

M3 and M7 are ‘demi’ transmembrane: not predictable

Look for the transmembrane regions of P31243 (try the different transmembrane prediction programs): your conclusions ?

No transmembrane domains are found by any program because this protein, a porin, is anchored in the membrane by a specific 3D structure called beta barrel which does not have any alpha helix….

‘beta barrel’ Mainly composed of beta-sheets in a 16-stranded beta-barrel formation and forms a pore in the membrane nm in diameter. Note that the orientation of the strands is such that side chains alternately point into the interior and exterior of the pore; the former are strongly polar residues while the latter are very hydrophobic.

Beta barrel Porin from Rhodobacter

Alignment of the 2 isoforms The gene has two in-frame initiation codons and two different proteins are made by alternative initiation (of translation)

According to this publication (PubMed: ), there is a 'Dual targeting of spinach protoporphyrinogen oxidase II to mitochondria and chloroplasts by alternative use of two in- frame initiation codons'.

Immunoblot analysis of Protox II in spinach leaf. Watanabe N et al. J. Biol. Chem. 2001;276: ©2001 by American Society for Biochemistry and Molecular Biology chloromitoTotal leaf

Q94IG7 – Long isoform wolfPSORT: chloroplast TargetP: chloroplast CH score: MI score: ER score: Other location: SignalP-NN: not secreted score (D): SignalP-HMM: not secreted SP probability: 6.2% SA probability: 0.2% ChloroP: chloroplast prediction score: MITOPROT: mitochondria !!! exported to mitochondria with a probability of 0.71 !!!! Q94IG7 – Short isoform wolfPSORT: mitochondrial TargetP: mitochondrial CH score: MI score: ER score: Other location: SignalP-NN: not secreted score (D): SignalP-HMM: not secreted SP probability: 3.1% SA probability: 5% ChloroP: not in chloroplast prediction score: MITOPROT: other location exported to mitochondria with a probability of 0.33 !!!!!!

Cystein (61 modifications) and serine (46 modifications) are the amino acids with the highest number of known associated PTM. Beware: Resid considers the selenocystein as a PTM…this is not the case !

Phosphorylation

P03372

UniProt data: Experimentally proved P03372

The phosphorylation sites are localized on the ‘surface’ of the protein (homodimer) (where the amino acid are accessible to the kinases !)

O-glycosylation

P02724

Myristoylation

P51876

NMT

Myristoylator

Protein: secreted protein (P02751, fibronectin) Can be predicted: -Subcellular location (PSORT, TargetP) -Domains (InterPro) -Signal -Sulfation -N-glycosylation -O-glycosylation -Phosphorylation (Not predictable…) (predictable…)

THE END