Immunological Bioinformatics: Prediction of epitopes in pathogens Ole Lund.

Slides:



Advertisements
Similar presentations
Bioinformatical design of a vaccine against influenza virus N1 subtype Bonaccorsi, Irene; Clausen, Martin Bau; Høj, Leif Howalt; Kjær, Jesper and Sayyad,
Advertisements

Hotspot Hunter: a computational system for large-scale screening and selection of candidate immunological hotspots in pathogen proteomes G.L. Zhang, A.M.
Acceptable mismatches based on structural epitopes on HLA molecules Toulouse, April 2, 2008.
HeleneAndersenMayaBondeAndersenSimonCarlsen MortenAhlgreenGronemann&MadsChristianHjortsø Figure 4, Alignment of the NS3 protein: Alignment of NS3 performed.
Office of Infectious Diseases Computational Challenges for Infectious Diseases Michael Shaw, PhD OID/Office of the Director.
Vaxil BioTherapeutics Ltd.
Introduction to Immunology
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU T cell Epitope predictions using bioinformatics (Neural Networks and hidden.
Prediction of B cell epitopes Pernille Haste Andersen Immunological Bioinformatics CBS, DTU
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS Department of Systems Biology Technical University of Denmark Immunological Bioinformatics Introduction to the.
Vaccine Design. Need for new vaccine technologies The classical way of making vaccines have in many cases been tried for the pathogens for which no vaccines.
Computer Aided Vaccine Design Dr G P S Raghava. Concept of Drug and Vaccine Concept of Drug Concept of Drug –Kill invaders of foreign pathogens –Inhibit.
MHC Polymorphism Ole Lund. Objectives What is HLA polymorphism? What is it good for? How does it make life difficult for vaccine design? Definition of.
The branch that breaks Is called rotten, but Wasn’t there snow on it? Bartolt Brecht Haiti after a hurricane.
Immunological bioinformatics Ole Lund, Center for Biological Sequence Analysis (CBS) Denmark.
“Theoretical and Experimental description of Peptide-MHC binding”
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS Department of Systems Biology Technical University of Denmark Immunological Bioinformatics Processing, combined.
HIV/AIDS as a Microcosm for the Study of Evolution.
Bioinformatics pipeline for detection of immunogenic cancer mutations by high throughput mRNA sequencing Jorge Duitama 1, Ion Mandoiu 1, and Pramod Srivastava.
MHC Polymorphism. MHC Class I pathway Figure by Eric A.J. Reits.
Class I pathway Prediction of proteasomal cleavage and TAP binidng Morten Nielsen, CBS, BioCentrum, DTU.
Class I pathway Prediction of proteasomal cleavage and TAP binidng Can Keşmir, TBB, Utrecht University, NL & CBS, BioCentrum, DTU.
Selection of T Cell Epitopes Using an Integrative Approach Mette Voldby Larsen cand. scient. in biology ph.d. student.
Influenza A Virus Pandemic Prediction and Simulation Through the Modeling of Reassortment Matthew Ingham Integrated Sciences Program University of British.
The branch that breaks Is called rotten, but Wasn’t there snow on it? Bartolt Brecht Haiti after a hurricane.
Immunological Bioinformatics Ole Lund. Challenges of the immune system Time Creation of self Creation of an immune system/ Tolerance to self Infection.
Protein Therapeutics Immunogenicity Stephen Lynn CHEM645 0.
Immunological databases on the web Ole Lund Center for Biological Sequence Analysis BioCentrum-DTU Technical University of Denmark
Epitope Selection Rational Vaccine design. Why? Therapeutic vaccines Therapeutic vaccines Treatment of viral infections (e.g., HIV, HCV), and resistant.
Prediction of CTL responses Mette Voldby Larsen cand. scient. in biology ph.d. student.
T Cell Receptor (TCR) & MHC Complexes-Antigen Presentation
The Major Histocompatibility Complex And Antigen Presentation
VACCINOMICS: CURRENT FINDINGS, CHALLENGES AND NOVEL APPROACHES FOR VACCINE DEVELOPMENT Megan Brown Dr. Inna G. Ovsyannikova and Dr. Gregory A. Poland.
Bioinformatics in transplantation immunology PhD defense Malene Erup Larsen October 27th 2010.
Methods MHC class-I T cell epitope prediction for Nef Consensus and ancestral sequences of the Nef protein for the different HIV-1 subtypes were obtained.
Selection of T Cell Epitopes Using an Integrative Approach Mette Voldby Larsen cand. scient. in Biology PhD in Immunological Bioinformatics.
HIV-1 evolution in response to immune selection pressures
1 Computer-aided Subunit Vaccine Design G.P.S. Raghava, Institute of Microbial Technology, Chandigarh  Understanding immune system  Breaking complex.
Lab of Immunoregulation Berkower Lab Weiss Lab -- Angelo Spadaccini -- Russell Vassell -- Yisheng Ni -- Yong He -- Yisheng Ni -- Yong He –Hong Chen --
Genome – and HLA-wide scanning and validation for cytotoxic CD8 T cell responses against Mycobacterium tuberculosis EU6-FRAME PROJECT Sheila Tuyet Tang,
Ad5 expressing Gag, Pol and Nef Immune responses in control of HIV replication non-human primates // SIV extrapolated to humans // HIV Prospects for.
CHAPTER 23 Molecular Immunology.
110/30/2015 Antigens Antigens Hugh B. Fackrell 210/30/2015 ä Assigned Reading ä Content Outline ä Performance Objectives ä Key terms ä Key Concepts ä.
Telling self from non-self: Learning the language of the Immune System Rose Hoberman and Roni Rosenfeld BioLM Workshop May 2003.
What is a Project Purpose –Use a method introduced in the course to describe some biological problem How –Construct a data set describing the problem –Define.
Structural Bioinformatics Section Vaccine Research Center/NIAID/NIH
1 Web Site: Dr. G P S Raghava, Head Bioinformatics Centre Institute of Microbial Technology, Chandigarh, India Prediction.
Statistical physics of T cell receptor selection and function Thesis committee meeting, 04/15/2009 Andrej Košmrlj Physics Department Massachusetts Institute.
Introduction H5N1 is an avian influenza. It was detected in humans for the first time in 1997 in Hong Kong. Since then the spread to humans has been limited.
T Cell Receptor (TCR) & MHC Complexes-Antigen Presentation Pin Ling ( 凌 斌 ), Ph.D. ext 5632; References: 1. Abbas, A, K. et.al,
Lecture 1: Immunogenetics Dr ; Kwanama
ProteinStart position HLAEpitopePositive responses (n) Env209A*0101SFEPIPSHY1 Env310A*0101/Cw*0401GPGPGRAFY1 Gag406A*0302RAPRKKGC WK 1 Nef9A*0101/A*0302SVVGWPAVR1.
HIV/AIDS.
Major Histocompatibility complex OR MHC MHC The principal function of T cells are : The principal function of T cells are : Defense against intracellular.
Specific Defenses of the Host Part 2 (acquired or adaptive immunity)
Bioinformatics in Vaccine Design
Prediction of T cell epitopes using artificial neural networks Morten Nielsen, CBS, BioCentrum, DTU.
Lecture 13 Immunology and disease: parasite antigenic diversity.
Major Histocompatibility Complex (MHC) Human Leukocyte Antigen (HLA)
T cell receptor & MHC complexes-Antigen presentation
T Cell Receptor (TCR) & MHC Complexes-Antigen Presentation
Adaptive immunity antigen recognition Y Y Y Y Y Y Y Y Y invading
Intracellular Pathogens Extracellular Pathogens
Immunology and disease: parasite antigenic diversity
Ligand Docking to MHC Class I Molecules
Experimental methods in classic epitope discovery
Telling self from non-self: Learning the language of the Immune System
Volume 18, Issue 5, Pages (November 2015)
Effects of a Single Escape Mutation on T Cell and HIV-1 Co-adaptation
Volume 20, Issue 5, Pages (August 2017)
Presentation transcript:

Immunological Bioinformatics: Prediction of epitopes in pathogens Ole Lund

Data driven predictions List of peptides that have a given biological feature Mathematical model (neural network, hidden Markov model) Search databases for other biological sequences with the same feature/property YMNGTMSQV GILGFVFTL ALWGFFPVV ILKEPVHGV ILGFVFTLT LLFGYPVYV GLSPTVWLS WLSLLVPFV FLPSDFFPS CVGGLLTMV FIAGNSAYE >polymerase“ MERIKELRDLMSQSRTREILTKTTVDHMAIIKKYTSGRQEKNPALRMKWMMAMK YPITAD KRIMEMIPERNEQGQTLWSKTNDAGSDRVMVSPLAVTWWNRNGPTTSTVHYPK VYKTYFE KVERLKHGTFGPVHFRNQVKIRRRVDINPGHADLSAKEAQDVIMEVVFPNEVGA RILTSE SQLTITKEKKEELQDCKIAPLMVAYMLERELVRKTRFLPVAGGTSSVYIEVLHLTQ GTCW EQMYTPGGEVRNDDVDQSLIIAARNIVRRATVSADPLASLLEMCHSTQIGGIRMV DILRQ NPTEEQAVDICKAAMGLRISSSFSFGGFTFKRTNGSSVKKEEEVLTGNLQTLKIKV HEGY EEFTMVGRRATAILRKATRRLIQLIVSGRDEQSIAEAIIVAMVFSQEDCMIKAVRGD LNF...

Prediction algorithms MHC binding data Prediction algorithms Genome scans

Influenza A virus (A/Goose/Guangdong/1/96(H5N1)) >polymerase“ MERIKELRDLMSQSRTREILTKTTVDHMAIIKKYTSGRQEKNPALRMKWMMAMKYPITAD KRIMEMIPERNEQGQTLWSKTNDAGSDRVMVSPLAVTWWNRNGPTTSTVHYPKVYKTYFE KVERLKHGTFGPVHFRNQVKIRRRVDINPGHADLSAKEAQDVIMEVVFPNEVGARILTSE SQLTITKEKKEELQDCKIAPLMVAYMLERELVRKTRFLPVAGGTSSVYIEVLHLTQGTCW EQMYTPGGEVRNDDVDQSLIIAARNIVRRATVSADPLASLLEMCHSTQIGGIRMVDILRQ NPTEEQAVDICKAAMGLRISSSFSFGGFTFKRTNGSSVKKEEEVLTGNLQTLKIKVHEGY EEFTMVGRRATAILRKATRRLIQLIVSGRDEQSIAEAIIVAMVFSQEDCMIKAVRGDLNF... and 9 other proteins MERIKELRD ERIKELRDL RIKELRDLM IKELRDLMS KELRDLMSQ ELRDLMSQS LRDLMSQSR RDLMSQSRT DLMSQSRTR LMSQSRTRE and 4376 other 9mers Proteins 9mer peptides >Segment 1 agcaaaagcaggtcaattatattcaatatggaaagaataaaagaactaagagatctaatg tcgcagtcccgcactcgcgagatactaacaaaaaccactgtggatcatatggccataatc aagaaatacacatcaggaagacaagagaagaaccctgctctcagaatgaaatggatgatg gcaatgaaatatccaatcacagcagacaagagaataatggagatgattcctgaaaggaat and other nucleotides on 8 segments Genome

Arms race between humans and microbes Recognize Escape Peptides from microbes HLA molecules In Humans

Figure by Anne Mølgaard, peptide (KVDDTFYYV) used as vaccine by Snyder et al. J Virol 78, (2004). Human MHC: ~1000 variants distributed over 12 types Peptide: up to 20 9 variants

HLA A and B diversity Nielsen M, Lundegaard C, Blicher T, Lamberth K, Harndahl M, Justesen S, Roder G, Peters B, Sette A, Lund O, Buus S., NetMHCpan, a method for quantitative predictions of peptide binding to any HLA-A and -B locus protein of known sequence. PLoS ONE :e796.

Binding affinity vs antigenecity A quantitative analysis of the variables affecting the repertoire of T cell specificities recognized after vaccinia virus infection. Assarsson E, Sidney J, Oseroff C, Pasquetto V, Bui HH, Frahm N, Brander C, Peters B, Grey H, Sette A. J Immunol Jun 15;178(12):

Prediction of MHC I epitopes Major histocompatibility complex class I binding predictions as a tool in epitope discovery. Lundegaard C, Lund O, Buus S, Nielsen M. Immunology Jul;130(3): Epub 2010 May 26. Review.

Recent benchmark studies Class I –Peters B, Bui HH, Frankild S et al. A community resource benchmarking predictions of peptide binding to MHC-I molecules. PLoS Comput Biol 2006; 2:e65. –Lin HH, Ray S, Tongchusak S, Reinherz EL, Brusic V. Evaluation of MHC class I peptide binding prediction servers: applications for vaccine research. BMC Immunol 2008; 9:8. Class II –Wang P, Sidney J, Dow C, Mothe B, Sette A, Peters B. A systematic assessment of MHC class II peptide binding predictions and evaluation of a consensus approach.PLoS Comput Biol 2008; 4:e –Lin HH, Zhang GL, Tongchusak S, Reinherz EL, Brusic V. Evaluation of MHC-II pep- tide binding prediction servers: applications for vaccine research. BMC Bioinformatics 2008; 9(Suppl. 12):S22. –Toward more accurate pan-specific MHC-peptide binding prediction: a review of current methods and toolsLianming Zhang, Keiko Udaka, Hiroshi Mamitsuka, Shanfeng ZhuBriefings in bioinformatics (impact factor: 7.33). 09/2011; DOI: /bib/bbr060

Validation of binding predictions

Response diversity Hoof, et al., JI, 2010

TB epitope discovery strategy Tang ST et al Submitted. Mtb H37Rv genome sequence Selection of peptides predicted to bind to HLA supertypes (NetCTL, protFun, SubCell): A2 (A*0201), A3 (A*0301), B7 (B*0702) (coverage approx. 80% of the world population) Synthesis selected peptides Measuring peptide/MHC binding affinity in vitro Screening for peptide recognition in in vitro CD8 + T cell assay in healthy PPD + donors Direct ex vivo determination of frequencies of peptide/tetramer + CD8 + T cells in TB patients (Multi) functionality of peptide responsive CD8 + T cells in TB patients Genome-Based In Silico Identification of New Mycobacterium tuberculosis Antigens Activating Polyfunctional CD8+ T Cells in Human Tuberculosis. Tang ST, van Meijgaarden KE, Caccamo N, Guggino G, Klein MR, van Weeren P, Kazi F, Stryhn A, Zaigler A, Sahin U, Buus S, Dieli F, Lund O, Ottenhoff TH. J Immunol Jan 15;186(2): Epub 2010 Dec 17.

TB Genome-Based In Silico Identification of New Mycobacterium tuberculosis Antigens Activating Polyfunctional CD8+ T Cells in Human Tuberculosis. Tang ST, van Meijgaarden KE, Caccamo N, Guggino G, Klein MR, van Weeren P, Kazi F, Stryhn A, Zaigler A, Sahin U, Buus S, Dieli F, Lund O, Ottenhoff TH. J Immunol Jan 15;186(2): Epub 2010 Dec 17.

TB Genome-Based In Silico Identification of New Mycobacterium tuberculosis Antigens Activating Polyfunctional CD8+ T Cells in Human Tuberculosis. Tang ST, van Meijgaarden KE, Caccamo N, Guggino G, Klein MR, van Weeren P, Kazi F, Stryhn A, Zaigler A, Sahin U, Buus S, Dieli F, Lund O, Ottenhoff TH. J Immunol Jan 15;186(2): Epub 2010 Dec 17.

Tetramer and cytokine staining of 10 cured TB patients and 10 healthy controls Genome-Based In Silico Identification of New Mycobacterium tuberculosis Antigens Activating Polyfunctional CD8+ T Cells in Human Tuberculosis. Tang ST, van Meijgaarden KE, Caccamo N, Guggino G, Klein MR, van Weeren P, Kazi F, Stryhn A, Zaigler A, Sahin U, Buus S, Dieli F, Lund O, Ottenhoff TH. J Immunol Jan 15;186(2): Epub 2010 Dec 17.

The challenge of rational epitope selection We have more than 2500 MHC molecules We often have more than 500 different pathogenic strains How to design a method to select a small pool of peptides that will cover both the MHC polymorphism and the pathogen diversity? –No peptide will bind to all MHC molecules and few (maybe even no) peptides will be present in all pathogenic strains

Vaccine discovery - HIV case story 10 HIV proteins –> 2,000,000 different peptides exist within the known HIV clades Patient diversity –More than 2500 different MHC molecules The challenge –Select 100 (0.005%) peptides with optimal genomic and HLA coverage

HIV Gag phylogenetic tree Clade C Clade D Clade B Clade A Clade AE Few peptides conserved between all viral strains

Dodo: Flavi viruses

Predicted West Nile virus Epitopes Mette Voldby Larsen

Sequence identity vs. serotype Solmaz Gabery

Epitope identification 56 (1.5%) 9mer are conserved among all 15 Clade A gag sequences

Polyvalent vaccines Select epitopes in a way so that they together cover all strains. Strain 1 Strain 2 Strain 1 Strain 2 Epitope Uneven coverage, Average coverage = 2 Even coverage, Average coverage = 2 X

EpiSelect. Pathogen diversity

Selected West Nile Virus Epitopes Shown relative to NC001563/M12296 Mette Voldby Larsen

Use of EpiSelect: CTL Epitopes with Maximum HIV-1 Coverage Problem: The high mutation rate of HIV-1 makes it difficult to identify CTL epitopes that are conserved among all subtypes. Possible solution: Chose a number of predicted and experimentally identified epitopes that together constitute a broad coverage of the HIV-1 strains examined. Data: 300+ fully sequenced HIV-1 strains A (A1 and A2), B, C, D, and CRF01_AE Methods: Prediction of CTL epitopes restricted by A1, A2, A3, A24, B7, B44, or B58 Select the epitopes that give the broadest coverage The algorithm chooses epitopes found in as many strains as possible, while up prioritizing epitopes from strains with few already-selected epitopes. Results: The final set consists of 180 epitopes. On average, each strain is covered by 54 epitopes (minimum 29). Ongoing work by Annika Karlsson: The ability of the chosen epitopes to elicit CTL response will be examined by using PBMCs from HIV-1 infected patients. Annika Karlsson and Carina Perez

SupertypesPhenotype frequencies CaucasianBlackJapaneseChineseHispanicAverage A2,A3, B783 %86 %88 %88 %86 %86% +A1, A24, B44100 %98 %100 %100 %99 %99 % +B27, B58, B62100 %100 %100 %100 %100 %100 % HLA polymorphism - frequencies Sette et al, Immunogenetics (1999) 50:

Perez et al., JI, 2008 Annika Karlsson Karolinska Institute Carina Perez Response of 31 HIV infected patients to 184 predicted HIV epitopes

All HIV responsive patients respond to at least one of nine peptides Perez et al., JI, 2008

PopCover – 2D searching > 2,000,000 different peptides exists within the known HIV clades peptides with prediction binding affinity stronger than 500 nM to any MHC molecule – 5608(tat), 20961(nef), 31848(gag),42748(pol), (env) No Gag peptides are found in all clades and 92% of all Gag peptides are shared only between 0-5% of all clades The challenge Select 64 (less than 0.001%) peptides with optimal genomic and HLA coverage –tat(4), nef(15), gag(15), pol(15), env(15)

EpiSelect and PoPCover EpiSelect The sum is over all genomes i. P j i is 1 if epitope j is present in genome i. C i is the number of times genome i has been targeted in the already selected set of epitopes PopCover The sum is over all genomes i and HLA alleles k. R j ki is 1 if epitope j is present in genome i and is presented by allele k, and E ki is the number of times allele k has been targeted by epitopes in genome i by the already selected set of epitopes, f k is the frequency of allele k in a given population and g i is the genomes frequency

20 juni 2015Marcus Buggert33 An average of 4,79 recognized peptides per patient Tat Nef Gag Pol Env Marcus Buggert et al., In preparation Experimental validation of HIV class II epitopes

Experimental validation

Vaccine design. Polytope construction NH2 COOH Epitope Linker M C-terminal cleavage Cleavage within epitopes New epitopes cleavage

Polytope starting configuration Immunological Bioinformatics, The MIT press.

Polytope optimal configuration Immunological Bioinformatics, The MIT press.

Prediction servers at CBS Web servers CTL epitopes MHC binding MHC Motif viewer Proteasome processing B-cell epitopes Plotting of epitopes relative to reference sequence Analysis of human immunoglobulin VDJ recombination Geno-pheno type association based mapping of binding sites PhD/master course in Immunological Bioinformatics, June,

Peters B, et al. Immunogenetics :326-36, PLoS Biol :e91. Immune Epitope Database (IEDB)

Cross-reactivity Crossreactivity is predictable (Pearsons r = ) –Rule of thumb: Each mutation halfs the response Frankild et al., PLoS ONE 3(3): e1831 Hoof, et al., JI, 2010

Pilot study of immunogenecity based on DrugBank Records corresponding to 123 FDA-approved biotech (protein/peptide) drugs were downloaded Sequences were compared to the human proteome (sequences from “Homo Sapiens” in NR (non redundant database from NCBI)) using blast. Sequences found in DrugBank and NR need to be manually validated/curated

Types of proteins Human/Human protein sequence Identical proteins Modified/allelic human proteins Non human proteins Antibodies Non human Human-murine chimaer Humanized Human Who – Allelic differences of VDJ genes How much – Break tolerance Tolerance to own B cell receptors?

Proposed application in assessment of protein drugs 1Compare amino acid sequence of drug with the human proteome 2Predict epitopes in regions that differ from the human proteome 3Select representative HLA alleles 4Verify binding experimentally 5Assess predicted immunogenecity using blood from treated patients/transgenic animals/naïve donors 6Compare with clinical findings of immunogenecity/adverse effects/lack of effect

Data acquired Data on 33 approved therapeutic proteins Julie Serritslev, Jens Vindahl Kringelum, et al., in preparation Immunogenicity Percent of recipients in a clinical study that had detectable antibodies against the therapeutic protein The primary source of immune response data was the reviewed data presented in Meyler's Side Effects of Drugs and from FDA labels.

Alleles representative of HLA-A, HLA-B and HLA-DRB, HLA-DQB1 and four HLA-DPB1 super-types Julie Serritslev, Jens Vindahl Kringelum, et al., in preparation; Nielsen et al., 2008

Prediction of epitopes MHC Class I and II binders can be predicted for all known alleles (A ROC ~ ) Binding correlates with likelihood of response No epitope give response in all individuals Cross reactivity correlates with epitope similarity B cell epitopes are hard to predict (A ROC ~ )