Epitope prediction algorithms Urmila Kulkarni-Kale Bioinformatics Centre University of Pune.

Slides:



Advertisements
Similar presentations
Secondary structure prediction from amino acid sequence.
Advertisements

Slide 1 of 38 T-cell EPITOPES PREDICTION OF HEMAGGLUTININ, NEURAMINIDASE AND MATRIX PROTEIN OF INFLUENZA A VIRUS USING SUPPORT VECTOR MACHINE AND HIDDEN.
Prediction of B cell epitopes Pernille Haste Andersen Immunological Bioinformatics CBS, DTU
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS Department of Systems Biology Technical University of Denmark Immunological Bioinformatics Introduction to the.
Computer Aided Vaccine Design Dr G P S Raghava. Concept of Drug and Vaccine Concept of Drug Concept of Drug –Kill invaders of foreign pathogens –Inhibit.
MHC Polymorphism Ole Lund. Objectives What is HLA polymorphism? What is it good for? How does it make life difficult for vaccine design? Definition of.
Folding and flexibility. Outline What is protein folding ? How proteins fold in vivo ? What is protein flexibility ?
1 Protein Structure, Structure Classification and Prediction Bioinformatics X3 January 2005 P. Johansson, D. Madsen Dept.of Cell & Molecular Biology, Uppsala.
Computational Immunology An Introduction Rose Hoberman BioLM Seminar April 2003.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU B cell epitopes and predictions Pernille Haste Andersen, Ph.d. student Immunological.
Genome-wide prediction and characterization of interactions between transcription factors in S. cerevisiae Speaker: Chunhui Cai.
Chapter 9 Structure Prediction. Motivation Given a protein, can you predict molecular structure Want to avoid repeated x-ray crystallography, but want.
MHC Polymorphism. MHC Class I pathway Figure by Eric A.J. Reits.
Protein Structure Modeling (1). Protein Folding Problem A protein folds into a unique 3D structure under physiological conditions Lysozyme sequence: KVFGRCELAA.
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
1-month Practical Course Genome Analysis Lecture 3: Residue exchange matrices Centre for Integrative Bioinformatics VU (IBIVU) Vrije Universiteit Amsterdam.
Epitope Selection Rational Vaccine design. Why? Therapeutic vaccines Therapeutic vaccines Treatment of viral infections (e.g., HIV, HCV), and resistant.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU B cell epitopes and B cell epitope predictions Morten Nielsen, CBS, BioCentrum,
Anti-idiotypes and Immunity Dr. Ziad Jaradat. Anti-idiotypes and Immunity The immune system of an individual can make millions of different kinds of antibodies:
Antigen recognition by T cells Zheng W. Chen, M.D., Ph.D.
Comparative Genomics of Viruses: VirGen as a case study Dr. Urmila Kulkarni-Kale Bioinformatics Centre University of Pune Pune
Protein Tertiary Structure Prediction
Use of bioinformatics in drug development and diagnostics
Methods MHC class-I T cell epitope prediction for Nef Consensus and ancestral sequences of the Nef protein for the different HIV-1 subtypes were obtained.
Protein analysis and proteomics (Part 2 of 2). Many of the images in this powerpoint presentation are from Bioinformatics and Functional Genomics by Jonathan.
Epitope mapping of Gliadin - A trigger of Celiac Disease Ph.D. Student Nicole H. Petersen Bioorganic Chemistry KU LIFE.
 Four levels of protein structure  Linear  Sub-Structure  3D Structure  Complex Structure.
PROTEINS PROTEINS Levels of Protein Structure.
Multiple Alignment and Phylogenetic Trees Csc 487/687 Computing for Bioinformatics.
Genome to Vaccinome: Immunoinformatics & Vaccine design case studies Urmila Kulkarni-Kale Bioinformatics Centre University of Pune
Protein Structure Modelling Many sequences - few structures Homology Modelling - Based on Sequence Similarity with Sequences of Known Structures.
A Study of Residue Correlation within Protein Sequences and its Application to Sequence Classification Christopher Hemmerich Advisor: Dr. Sun Kim.
110/30/2015 Antigens Antigens Hugh B. Fackrell 210/30/2015 ä Assigned Reading ä Content Outline ä Performance Objectives ä Key terms ä Key Concepts ä.
Immunoinformatics & Vaccine design. Outline Immunology recap What is reverse vaccinology? Immunoinformatics –Databases (Knowledgebases) –Algorithms (B-
Telling self from non-self: Learning the language of the Immune System Rose Hoberman and Roni Rosenfeld BioLM Workshop May 2003.
HMMs for alignments & Sequence pattern discovery I519 Introduction to Bioinformatics.
Structural Bioinformatics Section Vaccine Research Center/NIAID/NIH
Protein secondary structure Prediction Why 2 nd Structure prediction? The problem Seq: RPLQGLVLDTQLYGFPGAFDDWERFMRE Pred:CCCCCHHHHHCCCCEEEECCHHHHHHCC.
1 Protein Structure Prediction (Lecture for CS397-CXZ Algorithms in Bioinformatics) April 23, 2004 ChengXiang Zhai Department of Computer Science University.
1 Web Site: Dr. G P S Raghava, Head Bioinformatics Centre Institute of Microbial Technology, Chandigarh, India Prediction.
Proteins. Protein Function  Catalysis  Structure  Movement  Defense  Regulation  Transport  Antibodies.
November 18, 2000ICTCM 2000 Introductory Biological Sequence Analysis Through Spreadsheets Stephen J. Merrill Sandra E. Merrill Marquette University Milwaukee,
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Antigen and Antigenicity Antigen and Antigenicity
Recurring conformation of the human immunodeficiency virus type 1 gp120 V3 loop Robyn L. Stanfield, Jayant B. Ghiara, Erica O. Saphire, Albert T. Profy,
ANTIGENS  Antigen: Any substance reacting with the products of any specific immune response (Ig or T cells)  Immunogen: Any substance capable to induce.
Antigens & Immunogen.
Russell Group, Protein Evolution _________ ____ Rob Russell Cell Networks University of Heidelberg Interactions and Modules: the how and why of molecular.
Protein Properties Function, structure Residue features Targeting Post-trans modifications BIO520 BioinformaticsJim Lund Reading: Chapter , 11.7,
B cell epitopes and predictions
Lecture 1: Immunogenetics Dr ; Kwanama
Specific Defenses of the Host Part 2 (acquired or adaptive immunity)
Bioinformatics in Vaccine Design
September 2K3Bioinformatics Centre, University of Pune, Pune. 1 Role of Bioinformatics in designing vaccines Urmila Kulkarni-Kale Information Scientist.
Janeway’s Immunobiology
What is phage display? An in vitro selection technique using a peptide or protein genetically fused to the coat protein of a bacteriophage.
Protein Structure Prediction. Protein Sequence Analysis Molecular properties (pH, mol. wt. isoelectric point, hydrophobicity) Secondary Structure Super-secondary.
Structure and Function
Structural Bioinformatics Elodie Laine Master BIM-BMC Semester 3, Genomics of Microorganisms, UMR 7238, CNRS-UPMC e-documents:
Multiple Sequence Alignment Dr. Urmila Kulkarni-Kale Bioinformatics Centre University of Pune
Bioinformatics Overview
Extra Tree Classifier-WS3 Bagging Classifier-WS3
Multiple Sequence Alignment
Predicting Active Site Residue Annotations in the Pfam Database
Computational Analysis
Telling self from non-self: Learning the language of the Immune System
Volume 5, Issue 2, Pages (August 1996)
Loyola Marymount University
Claude Daniel, Stephen Horvath, Paul M Allen  Immunity 
Protein structure prediction
Presentation transcript:

Epitope prediction algorithms Urmila Kulkarni-Kale Bioinformatics Centre University of Pune

October 2K5© Bioinformatics Centre, UoP2 Vaccine development In Post-genomic era: Reverse Vaccinology Approach. Rappuoli R. (2000). Reverse vaccinology. Curr Opin Microbiol. 3:

October 2K5© Bioinformatics Centre, UoP3 Genome Sequence Proteomics Technologies In silico analysis DNA microarrays High throughput Cloning and expression In vitro and in vivo assays for Vaccine candidate identification Global genomic approach to identify new vaccine candidates

October 2K5© Bioinformatics Centre, UoP4 In Silico Analysis Gene/Protein Sequence Database Disease related protein DB Candidate Epitope DB VACCINOME Peptide Multiepitope vaccines Epitope prediction

October 2K5© Bioinformatics Centre, UoP5

October 2K5© Bioinformatics Centre, UoP6 Types of Epitopes Sequential / Continuous epitopes: recognized by Th cells linear peptide fragments amphipathic helical 9-12 mer Conformational / Discontinuous epitopes: recognized by both Th & B cells non-linear discrete amino acid sequences, come together due to folding exposed mer

October 2K5© Bioinformatics Centre, UoP7 Properties of Epitopes They occur on the surface of the protein and are more flexible than the rest of the protein. They have high degree of exposure to the solvent. The amino acids making the epitope are usually charged and hydrophilic.

October 2K5© Bioinformatics Centre, UoP8 Methods to identify epitopes 1.Immunochemical methods ELISA : Enzyme linked immunosorbent assay Immunoflurorescence Radioimmunoassay 2.X-ray crystallography: Ag-Ab complex is crystallized and the structure is scanned for contact residues between Ag and Ab. The contact residues on the Ag are considered as the epitope. 3.Prediction methods: Based on the X-ray crystal data available for Ag-Ab complexes, the propensity of an amino acid to lie in an epitope is calculated.

October 2K5© Bioinformatics Centre, UoP9 Antigen-Antibody (Ag-Ab) complexes Non-obligatory heterocomplexes that are made and broken according to the environment Involve proteins (Ag & Ab) that must also exist independently Remarkable feature: –high affinity and strict specificity of antibodies for their antigens. Ab recognize the unique conformations and spatial locations on the surface of Ag Epitopes & paratopes are relational entities

October 2K5© Bioinformatics Centre, UoP10 Antigen-Antibody complex

October 2K5© Bioinformatics Centre, UoP11 Ab-binding sites: Sequential & Conformational Epitopes ! Sequential Conformational Ab-binding sites Paratope

October 2K5© Bioinformatics Centre, UoP12 B cell epitope prediction algorithms : Hopp and Woods –1981 Welling et al –1985 Parker & Hodges Kolaskar & Tongaonkar – 1990 Kolaskar & Urmila Kulkarni T cell epitope prediction algorithms : Margalit, Spouge et al Rothbard & Taylor – 1988 Stille et al –1987 Tepitope -1999

October 2K5© Bioinformatics Centre, UoP13 Hopp & Woods method Pioneering work Based on the fact that only the hydrophilic nature of amino acids is essential for an sequence to be an antigenic determinant Local hydrophilicity values are assigned to each amino acid by the method of repetitive averaging using a window of six Not very accurate

October 2K5© Bioinformatics Centre, UoP14 Welling’s method Based on the % of each aa present in known epitopes compared with the % of aa in the avg. composition of a protein. assigns an antigenicity value for each amino acid from the relative occurrence of the amino acid in an antigenic determinant site. regions of 7 aa with relatively high antigenicity are extended to aa depending on the antigenicity values of neighboring residues.

October 2K5© Bioinformatics Centre, UoP15 Parker & Hodges method Utilizes 3 parameters : –Hydrophilicity : HPLC –Accessibility : Janin’s scale –Flexibility : Karplus & Schultz Hydrophilicity parameter was calculated using HPLC from retention co-efficients of model synthetic peptides. Surface profile was determined by summing the parameters for each residue of a seven-residue segment and assigning the sum to the fourth residue. One of the most useful prediction algorithms

October 2K5© Bioinformatics Centre, UoP16 Kolaskar & Tongaonkar’s method Semi-empirical method which uses physiological properties of amino acid residues frequencies of occurrence of amino acids in experimentally known epitopes. Data of 169 epitopes from 34 different proteins was collected of which 156 which have less than 20 aa per determinant were used. Antigen: EMBOSS

October 2K5© Bioinformatics Centre, UoP17 CEP Server Predicts the conformational epitopes from X-ray crystals of Ag-Ab complexes. uses percent accessible surface area and distance as criteria

October 2K5© Bioinformatics Centre, UoP18 An algorithm to map sequential and conformational epitopes of protein antigens of known structure

October 2K5© Bioinformatics Centre, UoP19

October 2K5© Bioinformatics Centre, UoP20 CE: Beyond validation High accuracy: –Limited data set to evaluate the algorithm –Non-availability of true negative data sets Prediction of false positives? –Are they really false positives? Limitation: –Limited by the availability of 3D structure data of antigens Different Abs (HyHEL10 & D1.3) have over-lapping binding sites

October 2K5© Bioinformatics Centre, UoP21 CE: Features The first algorithm for the prediction of conformational epitopes or antibody binding sites of protein antigens Maps both: sequential & conformational epitopes Prerequisite: 3D structure of an antigen

October 2K5© Bioinformatics Centre, UoP22 CEP: Conformational Epitope Prediction Server

October 2K5© Bioinformatics Centre, UoP23 T-cell epitope prediction algorithms Considers amphipathic helix segments, tetramer and pentamer motifs (charged residues or glycine) followed by 2-3 hydrophobic residues and then a polar residue. Sequence motifs of immunodominant secondary structure capable of binding to MHC with high affinity. Virtual matrices which are used for predicting MHC polymorphism and anchor residues.

October 2K5© Bioinformatics Centre, UoP24 Case study: Design & development of peptide vaccine against Japanese encephalitis virus

October 2K5© Bioinformatics Centre, UoP25 We Have Chosen JE Virus, Because  JE virus is endemic in South-east Asia including India.  JE virus causes encephalitis in children between 5-15 years of age with fatality rates between 21-44%.  Man is a "DEAD END" host.

October 2K5© Bioinformatics Centre, UoP26 We Have Chosen JE Virus, Because Killed virus vaccine purified from mouse brain is used presently which requires storage at specific temperatures and hence not cost effective in tropical countries. Protective prophylactic immunity is induced only after administration of 2-3 doses. Cost of vaccination, storage and transportation is high.

October 2K5© Bioinformatics Centre, UoP27 Predicted structure of JEVS Mutations: JEVN/JEVS

October 2K5© Bioinformatics Centre, UoP28

October 2K5© Bioinformatics Centre, UoP29 CE of JEVN Egp

October 2K5© Bioinformatics Centre, UoP30 Loop1 in TBEV: LA EEH QGGT Loop1 in JEVN: HN EKR ADSS Loop1 in JEVS: HN KKR ADSS Species and Strain specific properties: TBEV/ JEVN/JEVS Antibodies recognising TBEV and JEVN would require exactly opposite pattern of charges in their CDR regions. Further, modification in CDR is required to recognise strain-specific region of JEVS.

October 2K5© Bioinformatics Centre, UoP31 Multiple alignment of Predicted T H -cell epitope in the JE_Egp with corresponding epitopes in Egps of other Flaviviruses JE DFGSIGGVFNSIGKAVHQVFGGAFRTLFGGMS MVE DFGSVGGVFNSIGKAVHQVFGGAFRTLFGGMS WNE DFGSVGGVFTSVGKAIHQVFGGAFRSLFGGMS KUN DFGSVGGVFTSVGKAVHQVFGGAFRSLFGGMS SLE DFGSIGGVFNSIGKAVHQVFGGAFRTLFGGMS DEN2 DFGSLGGVFTSIGKALHQVFGAIYGAAFSGVS YF DFSSAGGFFTSVGKGIHTVFGSAFQGLFGGLN TBE DFGSAGGFLSSIGKAVHTVLGGAFNSIFGGVG COMM DF S GG S GK H V G F G Multiple alignment of JE_Egp with Egps of other Flaviviruses in the YSAQVGASQ region JE SENHGNYSAQVGASQAAKFTITPNAPSITLKLG MVE STSHGNYSTQIGANQAVRFTISPNAPAITAKMG WNE VESHG ‑‑‑‑ KIGATQAGRFSITPSAPSYTLKLG KUN VESHGNYFTQTGAAQAGRFSITPAAPSYTLKLG SLE STSHGNYSEQIGKNQAARFTISPQAPSFTANMG DEN2 HAVGNDTG ‑‑‑‑‑ KHGKEIKITPQSSTTEAELT YF QENWN ‑‑‑‑‑‑‑‑ TDIKTLKFDALSGSQEVEFI TBE VAANETHS ‑‑‑‑ GRKTASFTIS ‑‑ SEKTILTMG

Peptide Modeling Initial random conformation Force field: Amber Distance dependent dielectric constant 4r ij Geometry optimization: Steepest descents & Conjugate gradients Molecular dynamics at 400 K for 1ns Peptides are: SENHGNYSAQVGASQ NHGNYSAQVGASQ YSAQVGASQ YSAQVGASQAAKFT NHGNYSAQVGASQAAKFT SENHGNYSAQVGASQAAKFT

October 2K5© Bioinformatics Centre, UoP33

October 2K5© Bioinformatics Centre, UoP34

October 2K5© Bioinformatics Centre, UoP35 Relevant Publications & Patent Urmila Kulkarni-Kale, Shriram Bhosale, G. Sunitha Manjari, Ashok Kolaskar, (2004). VirGen: A comprehensive viral genome resource. Nucleic Acids Research 32: Urmila Kulkarni-Kale & A. S. Kolaskar (2003). Prediction of 3D structure of envelope glycoprotein of Sri Lanka strain of Japanese encephalitis virus. In Yi-Ping Phoebe Chen (ed.), Conferences in research and practice in information technology. 19: A. S. Kolaskar & Urmila Kulkarni-Kale (1999) Prediction of three- dimensional structure and mapping of conformational antigenic determinants of envelope glycoprotein of Japanese encephalitis virus. Virology. 261: Patent: Chimeric T helper-B cell peptide as a vaccine for Flaviviruses. Dr. M. M. Gore, Dr. S.S. Dewasthaly, Prof. A.S. Kolaskar, Urmila Kulkarni-Kale Sangeeta Sawant WO 02/ A1

October 2K5© Bioinformatics Centre, UoP36 Important references Hopp, Woods, 1981, Prediction of protein antigenic determinants from amino acid sequences, PNAS U.S.A 78, Parker, Hodges et al, 1986, New hydrophilicity scale derived from high performance liquid chromatography peptide retention data: Correlation of predicted surface residues with antigenicity and X-ray derived accessible sites, Biochemistry:25, Kolaskar, Tongaonkar, 1990, A semi empirical method for prediction of antigenic determinants on protein antigens, FEBS 276, Men‚ndez-Arias, L. & Rodriguez, R. (1990), A BASIC microcomputer program forprediction of B and T cell epitopes in proteins, CABIOS, 6, Peter S. Stern (1991), Predicting antigenic sites on proteins, TIBTECH, 9, A.S. Kolaskar and Urmila Kulkarni-Kale, Prediction of three- dimensional structure and mapping of conformational epitopes of envelope glycoprotein of Japanese encephalitis virus,Virology, 261, 31-42