Immunological Bioinformatics Ole Lund
Challenges of the immune system Time Creation of self Creation of an immune system/ Tolerance to self Infection with microbe A Autoimmunity (break of tolerance to self) Allergen -> allergy Vaccine Cancer Inside Outside Peptide drugs Infection with microbe B Transplant ations
Effect of vaccines
The Immune System The innate immune system The adaptive immune system
Adaptive immune response Signal induced –Pathogens Antigens –Epitopes B Cell T Cell
Presentation of peptides on MHC I Figure by Eric A.J. Reits 1:5 1:200 1:2 Response to1:(5x20x20 0) = 1:2000 peptides
Figure by Anne Mølgaard, peptide (KVDDTFYYV) used as vaccine by Snyder et al. J Virol 78, (2004). Peptide (epitope) bound to MHC
Data driven predictions List of peptides that have a given biological feature Mathematical model (neural network, hidden Markov model) Search databases for other biological sequences with the same feature/property YMNGTMSQV GILGFVFTL ALWGFFPVV ILKEPVHGV ILGFVFTLT LLFGYPVYV GLSPTVWLS WLSLLVPFV FLPSDFFPS CVGGLLTMV FIAGNSAYE >polymerase“ MERIKELRDLMSQSRTREILTKTTVDHMAIIKKYTSGRQEKNPALRMKWMMAMK YPITAD KRIMEMIPERNEQGQTLWSKTNDAGSDRVMVSPLAVTWWNRNGPTTSTVHYPK VYKTYFE KVERLKHGTFGPVHFRNQVKIRRRVDINPGHADLSAKEAQDVIMEVVFPNEVGA RILTSE SQLTITKEKKEELQDCKIAPLMVAYMLERELVRKTRFLPVAGGTSSVYIEVLHLTQ GTCW EQMYTPGGEVRNDDVDQSLIIAARNIVRRATVSADPLASLLEMCHSTQIGGIRMV DILRQ NPTEEQAVDICKAAMGLRISSSFSFGGFTFKRTNGSSVKKEEEVLTGNLQTLKIKV HEGY EEFTMVGRRATAILRKATRRLIQLIVSGRDEQSIAEAIIVAMVFSQEDCMIKAVRGD LNF...
Prediction algorithms MHC binding data Prediction algorithms Genome scans
Antigen Discovery Lauemøller et al., 2000
Influenza A virus (A/Goose/Guangdong/1/96(H5N1)) >polymerase“ MERIKELRDLMSQSRTREILTKTTVDHMAIIKKYTSGRQEKNPALRMKWMMAMKYPITAD KRIMEMIPERNEQGQTLWSKTNDAGSDRVMVSPLAVTWWNRNGPTTSTVHYPKVYKTYFE KVERLKHGTFGPVHFRNQVKIRRRVDINPGHADLSAKEAQDVIMEVVFPNEVGARILTSE SQLTITKEKKEELQDCKIAPLMVAYMLERELVRKTRFLPVAGGTSSVYIEVLHLTQGTCW EQMYTPGGEVRNDDVDQSLIIAARNIVRRATVSADPLASLLEMCHSTQIGGIRMVDILRQ NPTEEQAVDICKAAMGLRISSSFSFGGFTFKRTNGSSVKKEEEVLTGNLQTLKIKVHEGY EEFTMVGRRATAILRKATRRLIQLIVSGRDEQSIAEAIIVAMVFSQEDCMIKAVRGDLNF... and 9 other proteins MERIKELRD ERIKELRDL RIKELRDLM IKELRDLMS KELRDLMSQ ELRDLMSQS LRDLMSQSR RDLMSQSRT DLMSQSRTR LMSQSRTRE and 4376 other 9mers Proteins 9mer peptides >Segment 1 agcaaaagcaggtcaattatattcaatatggaaagaataaaagaactaagagatctaatg tcgcagtcccgcactcgcgagatactaacaaaaaccactgtggatcatatggccataatc aagaaatacacatcaggaagacaagagaagaaccctgctctcagaatgaaatggatgatg gcaatgaaatatccaatcacagcagacaagagaataatggagatgattcctgaaaggaat and other nucleotides on 8 segments Genome
Experimental validation of Bioinformatics predictions Wang et al., 2006
Peters B, et al. Immunogenetics :326-36, PLoS Biol :e91. Results are deposited in public databases
Figure by Anne Mølgaard, peptide (KVDDTFYYV) used as vaccine by Snyder et al. J Virol 78, (2004). Human MHC: ~1000 variants distributed over 12 types Virus epitope: up to 20 9 variants
Coverage of HLA alleles Clustering in: O Lund et al., Immunogenetics : SupertypeSelected allele A1 A*0101 A2 A*0201 A3 A*1101 A24A*2401 A26 (new*)A*2601 B7 B*0702 B8 (new*)B*0801 B27 B*2705 B39(new*)B*3901 B44 B*4001 B58 B*5801 B62B*1501
Polyvalent vaccines The equivalent of this in epitope based vaccines is to select epitopes in a way that that they together cover all strains. Strain 1 Strain 2 Strain 1 Strain 2 Epitope Uneven coverage, Average coverage = 2 Even coverage, Average coverage = 2
Processing
Proteasome specificity Low polymorphism –Constitutive & Immuno- proteasome Evolutionary conserved Stochastic and low specificity –Only 70-80% of the cleavage sites are reproduced in repeated experiments
Proteasome evolution ( 1 unit) Constitutive Immuno Human (Hs) - Human Drosophila (Dm) - Fly Bos Taurus (Bota) - Cow Oncorhynchus mykiss (Om) - Fish …
Immuno- and Constitutive proteasome specificity... LVGPTPVNIIGRNMLTQL.. P1P1’ Immuno Constitutive
NetChop –Neural network based method PaProc –Weight matrix based method FragPredict –Based on a statistical analysis of cleavage- determining amino acid motifs present around the scissile bond i.e. also weight matrix like Predicting proteasomal cleavage
NetChop 3.0 Cterm (MHC ligands) LDFVRFMGVMSSCNNPA LVQEKYLEYRQVPDSDP RTQDENPVVHFFKNIVT TPLIPLTIFVGENTGVP LVPVEPDKVEEATEGEN YMLDLQPETTDLYCYEQ PVESMETTMRSPVFTDN ISEYRHYCYSLYGTTLE AAVDAGMAMAGQSPVLR QPKKVKRRLFETRELTD LGEFYNQMMVKAGLNDD GYGGRASDYKSAHKGLK KTKDIVNGLRSVQTFAD LVGFLLLKYRAREPVTK SVDPKNYPKKKMEKRFV SSSSTPLLYPSLALPAP FLYGALLLAEGFYTTGA NetChop-3.0 C-term –Trained on class I epitopes –Most epitopes are generated by the immuno proteasome –Predicts the immuno proteasome specificity
NetChop20S-3.0 In vitro digest data from the constitutive proteasome Toes et al., J.exp.med. 2001
Prediction performance TP FP AP AN A roc =0.5 A roc = spec Sens
Predicting proteasomal cleavage NetChop20S-3.0 NetChop-3.0 Relative poor predictive performance –For MHC prediction CC~0.92 and AUC~0.95
Proteasome specificity NetChop is one of the best available cleavage method –
Transporter associated with antigen processing (TAP)
What does TAP do?
TAP affinity prediction Transporter Associated with antigen Processing Binds peptides 9-18 long Binding determined mostly by N1-3 and C terminal amino acids
Integrating all three steps (protesaomal cleavage, TAP transport and MHC binding) should lead to improved identification of peptides capable of eliciting CTL responses Integration?
Identifying CTL epitopes 1 EBN3_EBV YQAYSSWMY EBN3_EBV QSDETATSH EBN3_EBV PVSPAVNQY EBN3_EBV AYSSWMYSY EBN3_EBV LAAGWPMGY EBN3_EBV IVQSCNPRY EBN3_EBV FLQRTDLSY EBN3_EBV YTDHQTTPT EBN3_EBV GTDVVQHQL HLA affinityProteasomal cleavageTAP affinity
Large scale method validation HIV A3 epitope predictions
Class II MHC binding MHC class II binds peptides in the class II antigen presentation pathway Binds peptides of length 9-18 (even whole proteins can bind!) Binding cleft is open Binding core is 9 aa Peptide: up to 20 9 variants Human MHC II: ~1000 variants
Cartoon by Eric Reits Humoral immunity
Antibody - Antigen interaction Fab Antigen Epitope Paratope Antibody The antibody recognizes structural properties of the surface of the antigen
Discontinuous B-cell epitopes An example: An epitope of the Outer Surface Protein A from Borrelia Burgdorferi (1OSP) SLDEKNSVSVDLPGEM KVLVSKEKNKDGKYDLI ATVDKLELKGTSDKNN GSGVLEGVKADKCKVK LTISDDLGQTTLEVFKE DGKTLVSKKVTSKDKS STEEKFNEKGEVSEKIIT RADGTRLEYTGIKSDGS GKAKEVLKG 1OSP, Li et al. 1997
The DiscoTope web server
Prediction servers at CBS for Immunological features BepiPred » Linear B-cell epitopesDiscoTope » Discontinuous B-cell epitopes NetChop » Proteasomal cleavages (MHC ligands)NetCTL » Integrated class I antigen presentation NetCTLpan Pan-specific integrated class I antigen presentation NetMHC » Binding of peptides to MHC class I allelesNetMHCII » Binding of peptides to MHC class II alleles NetMHCIIpan » Pan-specific binding of peptides to MHC class II HLA-DR alleles of known sequence NetMHCpan » Pan-specific binding of peptides to MHC class I alleles of known sequence VDJsolver » Analysis of human immunoglobulin VDJ recombination