Genome to Vaccinome: Immunoinformatics & Vaccine design case studies Urmila Kulkarni-Kale Bioinformatics Centre University of Pune

Slides:



Advertisements
Similar presentations
Secondary structure prediction from amino acid sequence.
Advertisements

Prediction of B cell epitopes Pernille Haste Andersen Immunological Bioinformatics CBS, DTU
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS Department of Systems Biology Technical University of Denmark Immunological Bioinformatics Introduction to the.
Major Histocompatibility Complex and T Cell Receptor
©CMBI 2005 Exploring Protein Sequences - Part 2 Part 1: Patterns and Motifs Profiles Hydropathy Plots Transmembrane helices Antigenic Prediction Signal.
1 Protein Structure, Structure Classification and Prediction Bioinformatics X3 January 2005 P. Johansson, D. Madsen Dept.of Cell & Molecular Biology, Uppsala.
Computational Immunology An Introduction Rose Hoberman BioLM Seminar April 2003.
Systems Biology Existing and future genome sequencing projects and the follow-on structural and functional analysis of complete genomes will produce an.
Structural bioinformatics
Epitope prediction algorithms Urmila Kulkarni-Kale Bioinformatics Centre University of Pune.
The branch that breaks Is called rotten, but Wasn’t there snow on it? Bartolt Brecht Haiti after a hurricane.
Chapter 9 Structure Prediction. Motivation Given a protein, can you predict molecular structure Want to avoid repeated x-ray crystallography, but want.
MHC Polymorphism. MHC Class I pathway Figure by Eric A.J. Reits.
Structure Prediction. Tertiary protein structure: protein folding Three main approaches: [1] experimental determination (X-ray crystallography, NMR) [2]
Multiple alignment June 29, 2007 Learning objectives- Review sequence alignment answer and answer questions you may have. Understand how the E value may.
Informatics Support for Vaccine Projects Using and extending the UCSC bioinformatics infrastructure.
Computational Biology, Part 10 Protein Structure Prediction and Display Robert F. Murphy Copyright  1996, 1999, All rights reserved.
The branch that breaks Is called rotten, but Wasn’t there snow on it? Bartolt Brecht Haiti after a hurricane.
Protein Tertiary Structure Prediction Structural Bioinformatics.
Anti-idiotypes and Immunity Dr. Ziad Jaradat. Anti-idiotypes and Immunity The immune system of an individual can make millions of different kinds of antibodies:
Protein Sequence Analysis - Overview Raja Mazumder Senior Protein Scientist, PIR Assistant Professor, Department of Biochemistry and Molecular Biology.
Comparative Genomics of Viruses: VirGen as a case study Dr. Urmila Kulkarni-Kale Bioinformatics Centre University of Pune Pune
Overview of Bioinformatics A/P Shoba Ranganathan Justin Choo National University of Singapore A Tutorial on Bioinformatics.
Pattern databasesPattern databasesPattern databasesPattern databases Gopalan Vivek.
Protein Tertiary Structure Prediction
Influenza Research Database (IRD): A Web-based Resource for Influenza Virus Data and Analysis Victoria Hunt 1 *, R. Burke Squires 1, Jyothi Noronha 1,
Methods MHC class-I T cell epitope prediction for Nef Consensus and ancestral sequences of the Nef protein for the different HIV-1 subtypes were obtained.
Prediction to Protein Structure Fall 2005 CSC 487/687 Computing for Bioinformatics.
Statistical Tool for Identifying Sequence Variations That Correlate with Virus Phenotypic Characteristics in the Virus Pathogen Resource (ViPR) July 22,
© Wiley Publishing All Rights Reserved. Protein 3D Structures.
Neural Networks for Protein Structure Prediction Brown, JMB 1999 CS 466 Saurabh Sinha.
From Structure to Function. Given a protein structure can we predict the function of a protein when we do not have a known homolog in the database ?
Protein Structure Modelling Many sequences - few structures Homology Modelling - Based on Sequence Similarity with Sequences of Known Structures.
Statistical Tool for Identifying Sequence Variations that Correlate with Virus Phenotypic Characteristics in the Virus Pathogen Resource (ViPR) Brett E.
A Mutational Investigation of an HIV Patient’s GP120 Glycoprotein and it’s Implications on CD4 Binding Salita Kaistha Usrinus College, Collegeville PA.
110/30/2015 Antigens Antigens Hugh B. Fackrell 210/30/2015 ä Assigned Reading ä Content Outline ä Performance Objectives ä Key terms ä Key Concepts ä.
Immunoinformatics & Vaccine design. Outline Immunology recap What is reverse vaccinology? Immunoinformatics –Databases (Knowledgebases) –Algorithms (B-
Telling self from non-self: Learning the language of the Immune System Rose Hoberman and Roni Rosenfeld BioLM Workshop May 2003.
HMMs for alignments & Sequence pattern discovery I519 Introduction to Bioinformatics.
Protein secondary structure Prediction Why 2 nd Structure prediction? The problem Seq: RPLQGLVLDTQLYGFPGAFDDWERFMRE Pred:CCCCCHHHHHCCCCEEEECCHHHHHHCC.
1 Web Site: Dr. G P S Raghava, Head Bioinformatics Centre Institute of Microbial Technology, Chandigarh, India Prediction.
Proteins. Protein Function  Catalysis  Structure  Movement  Defense  Regulation  Transport  Antibodies.
Protein Sequence Analysis - Overview - NIH Proteomics Workshop 2007 Raja Mazumder Scientific Coordinator, PIR Research Assistant Professor, Department.
Protein Structure Prediction ● Why ? ● Type of protein structure predictions – Sec Str. Pred – Homology Modelling – Fold Recognition – Ab Initio ● Secondary.
Antigen and Antigenicity Antigen and Antigenicity
Lecture 1: Immunogenetics Dr ; Kwanama
Motif Search and RNA Structure Prediction Lesson 9.
Specific Defenses of the Host Part 2 (acquired or adaptive immunity)
Bioinformatics in Vaccine Design
September 2K3Bioinformatics Centre, University of Pune, Pune. 1 Role of Bioinformatics in designing vaccines Urmila Kulkarni-Kale Information Scientist.
AP Biology Proteins AP Biology Proteins Multipurpose molecules.
Lecture 13 Immunology and disease: parasite antigenic diversity.
Protein Tertiary Structure Prediction Structural Bioinformatics.
Janeway’s Immunobiology
Structure and Function
In Search of the Body’s Antibodies: Investigate Antibodies Using Enzyme Linked Immunosorbent Assay (ELISA) Module developed at Boston University School.
Structural Bioinformatics Elodie Laine Master BIM-BMC Semester 3, Genomics of Microorganisms, UMR 7238, CNRS-UPMC e-documents:
Multiple Sequence Alignment Dr. Urmila Kulkarni-Kale Bioinformatics Centre University of Pune
Multiple Sequence Alignment
חיזוי ואפיון אתרי קישור של חלבון לדנ"א מתוך הרצף
Predicting Active Site Residue Annotations in the Pfam Database
Protein Sequence Analysis - Overview -
Telling self from non-self: Learning the language of the Immune System
Protein Sequence Analysis - Overview -
The Rational Design of an AIDS Vaccine
Protein structure prediction.
Loyola Marymount University
Loyola Marymount University
Justin R. Bailey, Eleanor Barnes, Andrea L. Cox  Gastroenterology 
Solution Structure of the Proapoptotic Molecule BID
Presentation transcript:

Genome to Vaccinome: Immunoinformatics & Vaccine design case studies Urmila Kulkarni-Kale Bioinformatics Centre University of Pune

Outline Immunology basics What is reverse vaccinology? Immunoinformatics –Databases (Knowledgebases) –Algorithms (B- and T-cell epitope predictions) –Predictions Case studies –Mumps virus –Japanese encephalitis virus

April 5, 2K9© UKK, Bioinformatics Centre, University of Pune 3 The Immune System body's defense against infectious organisms The Innate immunity: first line of defense –rapid nonspecific responses –recognition of conserved structures present in many microorganisms lipopolysaccharides in bacterial cell walls or proteins in flagella The adaptive immune response: second line of defense –tailored to an individual threat –specific to an infectious agent –memory cells persist that enable a more rapid and potent response on ‘re-infection’

April 5, 2K9© UKK, Bioinformatics Centre, University of Pune 4 The adaptive immune response Stimulated by receptor recognition of a specific small part of an antigen known as an epitope Two major arms: –The humoral immune response of antibody-secreting B lymphocytes (B cell epitopes) –The cellular immune response of T lymphocytes (T cell Th epitopes) –Response stimulated by receptor recognition

Antigen presentation and recognition: molecular and cellular processes.

Host-Pathogen interactions: Surface proteins In case of Viruses: –Capsid –Envelope –Membrane

D. Serruto, R. Rappuoli / FEBS Letters 580 (2006) 2985–2992

January 2K7© Bioinformatics Centre, UoP8 Antigen-Antibody (Ag-Ab) complexes Non-obligatory heterocomplexes that are made and broken according to the environment Involve proteins (Ag & Ab) that must also exist independently Remarkable feature: –high affinity and strict specificity of antibodies for their antigens. Ab recognize the unique conformations and spatial locations on the surface of Ag Epitopes & paratopes are relational entities

January 2K7© Bioinformatics Centre, UoP9 Methods to identify epitopes 1.Immunochemical methods ELISA : Enzyme linked immunosorbent assay Immunoflurorescence Radioimmunoassay 2.X-ray crystallography: Ag-Ab complex is crystallized and the structure is scanned for contact residues between Ag and Ab. The contact residues on the Ag are considered as the epitope. 3.Prediction methods: Based on the X-ray crystal data available for Ag-Ab complexes, the propensity of an amino acid to lie in an epitope is calculated.

Antigen-Antibody complex Number of antibodies that could be raised against an antigen Number of Ab-binding sites on an antigen A few antibodies may have overlapping binding sites on same antigen

January 2K7© Bioinformatics Centre, UoP11 Ab-binding sites: Sequential & Conformational Epitopes ! Sequential Conformational Ab-binding sites Paratope

January 2K7© Bioinformatics Centre, UoP12 Properties of Epitopes They occur on the surface of the protein and are more flexible than the rest of the protein. They have high degree of exposure to the solvent. The amino acids making the epitope are usually charged and hydrophilic.

January 2K7© Bioinformatics Centre, UoP13 B cell epitope prediction algorithms : Hopp and Woods –1981 Welling et al –1985 Parker & Hodges Kolaskar & Tongaonkar – 1990 Kolaskar & Urmila Kulkarni – 1999, 2005 Haste et al., 2006 T cell epitope prediction algorithms : Margalit, Spouge et al Rothbard & Taylor – 1988 Stille et al –1987 Tepitope Sequence based Structure based

January 2K7© Bioinformatics Centre, UoP14 Hopp & Woods method Pioneering work Based on the fact that only the hydrophilic nature of amino acids is essential for an sequence to be an antigenic determinant Local hydrophilicity values are assigned to each amino acid by the method of repetitive averaging using a window of six Accuracy: 45-55%

January 2K7© Bioinformatics Centre, UoP15 Welling’s method Based on the % of each aa present in known epitopes compared with the % of aa in the avg. composition of a protein. assigns an antigenicity value for each amino acid from the relative occurrence of the amino acid in an antigenic determinant site. regions of 7 aa with relatively high antigenicity are extended to aa depending on the antigenicity values of neighboring residues.

January 2K7© Bioinformatics Centre, UoP16 Parker & Hodges method Utilizes 3 parameters : –Hydrophilicity : HPLC –Accessibility : Janin’s scale –Flexibility : Karplus & Schultz Hydrophilicity parameter was calculated using HPLC from retention co-efficients of model synthetic peptides. Surface profile was determined by summing the parameters for each residue of a seven-residue segment and assigning the sum to the fourth residue. One of the most useful prediction algorithms

January 2K7© Bioinformatics Centre, UoP17 Kolaskar & Tongaonkar’s method Semi-empirical method which uses physiological properties of amino acid residues frequencies of occurrence of amino acids in experimentally known epitopes. Data of 169 epitopes from 34 different proteins was collected of which 156 which have less than 20 aa per determinant were used. Antigen: EMBOSS

January 2K7© Bioinformatics Centre, UoP18 CEP Server Predicts the conformational epitopes from X-ray crystals of Ag-Ab complexes. uses percent accessible surface area and distance as criteria

January 2K7© Bioinformatics Centre, UoP19 An algorithm to map sequential and conformational epitopes of protein antigens of known structure

January 2K7© Bioinformatics Centre, UoP20

January 2K7© Bioinformatics Centre, UoP21 CE: Features The first algorithm for the prediction of conformational epitopes or antibody binding sites of protein antigens Maps both: sequential & conformational epitopes Prerequisite: 3D structure of an antigen

January 2K7© Bioinformatics Centre, UoP22 CEP: Conformational Epitope Prediction Server

January 2K7© Bioinformatics Centre, UoP23 T-cell epitope prediction algorithms Considers amphipathic helix segments, tetramer and pentamer motifs (charged residues or glycine) followed by 2-3 hydrophobic residues and then a polar residue. Sequence motifs of immunodominant secondary structure capable of binding to MHC with high affinity. Virtual matrices are used for predicting MHC polymorphism and anchor residues.

April 5, 2K9© UKK, Bioinformatics Centre, University of Pune 24 MHC-Peptide complex

January 2K7© Bioinformatics Centre, UoP25 Epitome database

January 2K7© Bioinformatics Centre, UoP26 CED database

January 2K7© Bioinformatics Centre, UoP27 BciPep Database

AgAbDB: Home page

Genomic Data of viruses Epitope Prediction software Antigen: 3D structure(s) Variations/ conservations Rational Vaccine design: Challenges & opportunities Annotations Organisations Data mining Rules for predictions Accuracy related issues Experimental validations Relatively very few Modeling is only solution

Reverse Vaccinology workbench: list of parts The components are– –A curated genomic resource (VirGen) A curated genomic resource (VirGen) –A server for prediction of epitopes (CEP) 1999; 2005 A server for prediction of epitopes (CEP) –A knowledge-base to study Ag-Ab interactions (AgAbDb) 2007 A knowledge-base to study Ag-Ab interactions (AgAbDb) –A server for variability analyses (PVIS) 2009A server for variability analyses (PVIS) –A derived database of 3D structures of viral proteins  Compilation of experimental structures of viral proteins from PDB  Predicted structures using homology modeling approach 1999; 2007  Study of sequence  structure  function (antigenicity) to identify & prioritize vaccine candidates

VirGen home Menu to browse viral families Genome analysis & Comparative genomics resources Guided tour & Help Navigation bar Search using Keywords & Motifs

Tabular display of genome annotation Retrieve sequence in FASTA format Sample genome record in VirGen ‘Alternate names’ of proteins

Graphical view of Genome Organization Viral polyprotein along with the UTRs Graphical view generated dynamically using Scalable Vector Graphics technology

Multiple Sequence Alignment Dendrogram MSA Link for batch retrieval of sequences

Browsing the module of Whole Genome Phylogenetic trees Most parsimonious tree of genus Flavivirus Input data: Whole genome Method: DNA parsimony Bootstrapping: 1000 Most parsimonious tree of genus Flavivirus Input data: Whole genome Method: DNA parsimony Bootstrapping: 1000

AgAbDB: Home page

PDB AgAbDB: summary of interacting residues

Interactions mapped on structure

Study of variations at different levels of Biocomplexity Strains/isolates of a virus Serotypes of a virus Viruses that belong to same genus Viruses that belong to same family Implications of variations in designing vaccines How similar is similar? How different is different?

Protein Variablility Index Server(PVIS) Beta test version  PVIS takes MSA as an input and calculates variability of amino acids using Wu-Kabat’s coefficient at each position of the consensus sequence  Features: Interactive, GUI based alignment output format No limit on input length of MSA At each position of alignment, user can view consensus residue and its corresponding variability Generates CSV (Comma Separated File) of Variability values against their positions in consensus sequence

Various output formats

Antigenic diversity of mumps virus: an insight from predicted 3D structure of HN protein

Mumps Virus:at a glance Source: VirGen database Order:Mononegavirales Family: Paramyxoviridae Subfamily: Paramyxovirinae Genus: Rubulavirus Species:Mumps virus Genome: -ve sense ssRNA Genotypes: 10: A  J (SH gene) Known antigenic proteins: F & HN

SBL-1 HN: Predicted structure Fold:  propeller Monomer: 6 bladed propeller with 4-stranded  sheet & 4 helices Helices: Red, Strands: yellow, Turns: blue, Coils: green

A new site for neutralisation: mapping antigenicity using parts list approach Hypervariable region of HN identified using MSA of Vaccine strains (Majority marked with yellow screen Residues 462, 464, 468, 470, 473, 474 present on surface; Known escape mutants are in proximity Total variations: 47

Colour: according to majority Mapping mutations on 3D structure of Mumps virus: a case study

January 2K7© Bioinformatics Centre, UoP47 Case study: Design & development of peptide vaccine against Japanese encephalitis virus

January 2K7© Bioinformatics Centre, UoP48 We Have Chosen JE Virus, Because  JE virus is endemic in South-east Asia including India.  JE virus causes encephalitis in children between 5-15 years of age with fatality rates between 21-44%.  Man is a "DEAD END" host.

January 2K7© Bioinformatics Centre, UoP49 We Have Chosen JE Virus, Because Killed virus vaccine purified from mouse brain is used presently which requires storage at specific temperatures and hence not cost effective in tropical countries. Protective prophylactic immunity is induced only after administration of 2-3 doses. Cost of vaccination, storage and transportation is high.

January 2K7© Bioinformatics Centre, UoP50 Predicted structure of JEVS Mutations: JEVN/JEVS

January 2K7© Bioinformatics Centre, UoP51

January 2K7© Bioinformatics Centre, UoP52 CE of JEVN Egp

January 2K7© Bioinformatics Centre, UoP53 Loop1 in TBEV: LA EEH QGGT Loop1 in JEVN: HN EKR ADSS Loop1 in JEVS: HN KKR ADSS Species and Strain specific properties: TBEV/ JEVN/JEVS Antibodies recognising TBEV and JEVN would require exactly opposite pattern of charges in their CDR regions. Further, modification in CDR is required to recognise strain-specific region of JEVS.

January 2K7© Bioinformatics Centre, UoP54 Multiple alignment of Predicted T H -cell epitope in the JE_Egp with corresponding epitopes in Egps of other Flaviviruses JE DFGSIGGVFNSIGKAVHQVFGGAFRTLFGGMS MVE DFGSVGGVFNSIGKAVHQVFGGAFRTLFGGMS WNE DFGSVGGVFTSVGKAIHQVFGGAFRSLFGGMS KUN DFGSVGGVFTSVGKAVHQVFGGAFRSLFGGMS SLE DFGSIGGVFNSIGKAVHQVFGGAFRTLFGGMS DEN2 DFGSLGGVFTSIGKALHQVFGAIYGAAFSGVS YF DFSSAGGFFTSVGKGIHTVFGSAFQGLFGGLN TBE DFGSAGGFLSSIGKAVHTVLGGAFNSIFGGVG COMM DF S GG S GK H V G F G Multiple alignment of JE_Egp with Egps of other Flaviviruses in the YSAQVGASQ region JE SENHGNYSAQVGASQAAKFTITPNAPSITLKLG MVE STSHGNYSTQIGANQAVRFTISPNAPAITAKMG WNE VESHG ‑‑‑‑ KIGATQAGRFSITPSAPSYTLKLG KUN VESHGNYFTQTGAAQAGRFSITPAAPSYTLKLG SLE STSHGNYSEQIGKNQAARFTISPQAPSFTANMG DEN2 HAVGNDTG ‑‑‑‑‑ KHGKEIKITPQSSTTEAELT YF QENWN ‑‑‑‑‑‑‑‑ TDIKTLKFDALSGSQEVEFI TBE VAANETHS ‑‑‑‑ GRKTASFTIS ‑‑ SEKTILTMG

Peptide Modeling Initial random conformation Force field: Amber Distance dependent dielectric constant 4r ij Geometry optimization: Steepest descents & Conjugate gradients Molecular dynamics at 400 K for 1ns Peptides are: SENHGNYSAQVGASQ NHGNYSAQVGASQ YSAQVGASQ YSAQVGASQAAKFT NHGNYSAQVGASQAAKFT SENHGNYSAQVGASQAAKFT

January 2K7© Bioinformatics Centre, UoP56

January 2K7© Bioinformatics Centre, UoP57

Publications Urmila Kulkarni-Kale, Janaki Ojha, G. Sunitha Manjari, Deepti D. Deobagkar, Asha D. Mallya, Rajeev M. Dhere & Subhash V. Kapre (2007). Mapping antigenic diversity & strain-specificity of mumps virus: a bioinformatics approach. Virology. A.D. Ghate, B.U. Bhagwat, S.G. Bhosle, S.M. Gadepalli and U. D. Kulkarni- Kale(2007). Characterization of Antibody-Binding Sites on Proteins: Development of a Knowledgebase and Its Applications in Improving Epitope Prediction. Protein & Peptide Letters, 14(6), Urmila Kulkarni-Kale, Shriram Bhosle and A. S. Kolaskar (2005) CEP: a conformational epitope prediction server. Nucleic Acids Research. 33,W168–W171. Urmila Kulkarni-Kale, Shriram Bhosale, G. Sunitha Manjari, Ashok Kolaskar, (2004). VirGen: A comprehensive viral genome resource. Nucleic Acids Research 32: Urmila Kulkarni-Kale & A. S. Kolaskar (2003). Prediction of 3D structure of envelope glycoprotein of Sri Lanka strain of Japanese encephalitis virus. In Yi- Ping Phoebe Chen (ed.), Conferences in research and practice in information technology. 19: A. S. Kolaskar & Urmila Kulkarni-Kale (1999) Prediction of three- dimensional structure and mapping of conformational antigenic determinants of envelope glycoprotein of Japanese encephalitis virus. Virology. 261:31-42.

Acknowledgements Prof. A. S. Kolaskar Ms. G. Sunitha Manjari, Bhakti Bhawat, Surabhi Agrawal & Shriram Bhosle M.Sc. / ADB Ms. Sangeeta Sawant, & Dr. M. M. Gore Ms. Janaki Oza, Prof. Deepti Deobagkar, Dr. Mallya, Dr. Dhere & Dr. Kapre Financial support: –Center of excellence (CoE) by both MCIT & DBT, Govt. of India –M.Sc. Bioinformatics programme from DBT, Govt. of India –Molecular modeling facility at Bioinformatics centre, University of Pune –Serum Institute of India Thank you all!

Bioinformatics University of Pune

Short Term Courses Long Term Courses HRD Activities In Bioinformatics and Biotechnology

M.Sc. Bioinformatics Advanced Diploma in Bioinformatics (On hold) CRCDM (PPP model) Credit exchange program: M.Sc. Zoology & Biotechnology Contributory teaching: M.Sc./M.Tech. Biotechnology (Integrated) M.B.A. Biotechnology Long Term Courses

Started in 2002 Masters level 2 years (4 Semesters), full time 25 credits/semester + Project (16 credits) Intake thru entrance test No. of students: M. Sc. Bioinformatics ICMSSyllabus

Integrated Course Management System (ICMS )

BINC: Bioinformatics National Certification examination HITS ~850 Registrations

Thank You