IMMUNOGRID Nikolai Petrovsky and Vladimir Brusic

Slides:



Advertisements
Similar presentations
Major Histocompatibility Complex. Principles of Immune Response Highly specific recognition of foreign antigens Mechanisms for elimination of microbes.
Advertisements

T-cell epitope prediction by molecular dynamics simulations Irini Doytchinova Medical University of Sofia School of Pharmacy Medical University of Sofia.
Hotspot Hunter: a computational system for large-scale screening and selection of candidate immunological hotspots in pathogen proteomes G.L. Zhang, A.M.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU T cell Epitope predictions using bioinformatics (Neural Networks and hidden.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Introduction and Importance of Bioinformatics: Application in Drug/Vaccine Design G. P. S. Raghava Web:
Training and applying hidden Markov models and support vector machines for prediction of T-cell epitopes Van Hai Van, Cao Thi Ngoc Phuong, Tran Linh Thuoc.
Show & Tell Limsoon Wong KRDL Datamining: Turning Biological Data into Gold.
Computer Aided Vaccine Design Dr G P S Raghava. Concept of Drug and Vaccine Concept of Drug Concept of Drug –Kill invaders of foreign pathogens –Inhibit.
MHC Polymorphism Ole Lund. Objectives What is HLA polymorphism? What is it good for? How does it make life difficult for vaccine design? Definition of.
Computational Immunology An Introduction Rose Hoberman BioLM Seminar April 2003.
Bioinformatics Needs for the post-genomic era Dr. Erik Bongcam-Rudloff The Linnaeus Centre for Bioinformatics.
Managing Data Resources
Bioinformatics pipeline for detection of immunogenic cancer mutations by high throughput mRNA sequencing Jorge Duitama 1, Ion Mandoiu 1, and Pramod Srivastava.
MHC Polymorphism. MHC Class I pathway Figure by Eric A.J. Reits.
“Inverse Kinematics” The Loop Closure Problem in Biology Barak Raveh Dan Halperin Course in Structural Bioinformatics Spring 2006.
Pattern recognition in the immune system. Specific peptide recognition Antibody epitopes T-cell receptors recognizing peptides presented on MHC-molecules.
Stochastic Stage-structured Modeling of the Adaptive Immune System Dennis L. Chao 1, Miles P. Davenport 2, Stephanie Forrest 1, and Alan S. Perelson 3.
Introduction to Genomics, Bioinformatics & Proteomics Brian Rybarczyk, PhD PMABS Department of Biology University of North Carolina Chapel Hill.
Informatics Support for Vaccine Projects Using and extending the UCSC bioinformatics infrastructure.
Protein Therapeutics Immunogenicity Stephen Lynn CHEM645 0.
Epitope Selection Rational Vaccine design. Why? Therapeutic vaccines Therapeutic vaccines Treatment of viral infections (e.g., HIV, HCV), and resistant.
Comparative Genomics of Viruses: VirGen as a case study Dr. Urmila Kulkarni-Kale Bioinformatics Centre University of Pune Pune
Methods MHC class-I T cell epitope prediction for Nef Consensus and ancestral sequences of the Nef protein for the different HIV-1 subtypes were obtained.
Machine-learning in building bioinformatics databases for infectious diseases Victor Tong Institute for Infocomm Research A*STAR, Singapore ASEAN-China.
Bioinformatics and medicine: Are we meeting the challenge?
Selection of T Cell Epitopes Using an Integrative Approach Mette Voldby Larsen cand. scient. in Biology PhD in Immunological Bioinformatics.
Secondary Databases Ansuman sahoo Roll: Y Bioinformatics Class Presentation 30 Jan 2013.
Biological Databases By : Lim Yun Ping E mail :
1 Computer-aided Subunit Vaccine Design G.P.S. Raghava, Institute of Microbial Technology, Chandigarh  Understanding immune system  Breaking complex.
Finish up array applications Move on to proteomics Protein microarrays.
Function first: a powerful approach to post-genomic drug discovery Stephen F. Betz, Susan M. Baxter and Jacquelyn S. Fetrow GeneFormatics Presented by.
Telling self from non-self: Learning the language of the Immune System Rose Hoberman and Roni Rosenfeld BioLM Workshop May 2003.
What is a Project Purpose –Use a method introduced in the course to describe some biological problem How –Construct a data set describing the problem –Define.
Mining Biological Data. Protein Enzymatic ProteinsTransport ProteinsRegulatory Proteins Storage ProteinsHormonal ProteinsReceptor Proteins.
Limsoon Wong Laboratories for Information Technology Singapore From Informatics to Bioinformatics.
Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill
1 Web Site: Dr. G P S Raghava, Head Bioinformatics Centre Institute of Microbial Technology, Chandigarh, India Prediction.
Protein Sequence Analysis - Overview - NIH Proteomics Workshop 2007 Raja Mazumder Scientific Coordinator, PIR Research Assistant Professor, Department.
EBI is an Outstation of the European Molecular Biology Laboratory. UniProtKB Sandra Orchard.
Specific Defenses of the Host Part 2 (acquired or adaptive immunity)
Bioinformatics in Vaccine Design
Lecture 19 November 16 th 2010 Quiz 2 scheduled for November 23 rd not November 18th.
Limsoon Wong Laboratories for Information Technology Singapore From Datamining to Bioinformatics.
BME435 BIOINFORMATICS.
Randi Vita, M.D. Better living through ontologies at the Immune Epitope Database La Jolla Institute for Allergy & Immunology Division of Vaccine Discovery.
Chapter 43 The Immune System.
Immunoinformatics Approach for Non-Small Cell Lung Cancer
T cell receptor & MHC complexes-Antigen presentation
Immune System II Acquired Immunity.
T Cell Receptor (TCR) & MHC Complexes-Antigen Presentation
IMMUNOGLOBULIN STRUCTURE AND FUNCTION
Intracellular Pathogens Extracellular Pathogens
Introduction C.Eng 714 Spring 2010.
Major Histocompatibility Complex (MHC) and its encoding molecules
Predicting Active Site Residue Annotations in the Pfam Database
Data Warehousing and Data Mining
Ligand Docking to MHC Class I Molecules
Experimental methods in classic epitope discovery
Protein Sequence Analysis - Overview -
Bioinformatics for plant biosecurity and surveillance systems
The major histocompatibility complex (MHC) and MHC molecules
Telling self from non-self: Learning the language of the Immune System
Protein Sequence Analysis - Overview -
Web Mining Department of Computer Science and Engg.
The Rational Design of an AIDS Vaccine
The Genetic Basis for Cancer Treatment Decisions
Introduction to Bioinformatic
Alessandro Sette, Sinu Paul, Kerrie Vaughan, Bjoern Peters 
مادة المناعة-النظري\ المرحلة الثالثة
Presentation transcript:

IMMUNOGRID Nikolai Petrovsky and Vladimir Brusic Medical Informatics Centre, University of Canberra March 2003

Summary Introduction Databases Vaccine development Conclusion

The immune system is composed of many interdependent cell types, organs, and tissues that jointly protect the body from infections (bacterial, parasitic, fungal, or viral) and from the growth of tumor cells. The immune system is the second most complex body system in humans.

An enormous diversity in human immune system >1013 MHC class I haplotypes (IMGT-HLA) 107-1015 different T-cell receptors (Arstila et al., 1999) 1012 B-cell clonotypes in an individual (Jerne, 1993) 1011 linear epitopes composed of nine amino acids >>1011 conformational epitopes >109 combinatorial antibodies (Jerne, 1993)

Immunology is a combinatorial science The amount of immune data is growing exponentially GRID technology offers a unique opportunity to divide and conquer immune complexity.

IMMUNOINFORMATICS IMMUNOLOGY COMPUTER SCIENCE IMMUNOLOGY DATABASES COMPUTATIONAL MODELS IMMUNOLOGY Learning Algorithms, Pattern Recognition, Adaptive Memories, Intelligent Agents Design of Experiments, Data Interpretation COMPUTATIONAL EXPERIMENTS

IMMUNOGRID basic clinical immunology maths/stats molecular biology systems science maths/stats databases artificial intelligence algorithms physics/chemistry cell biology molecular biology IMMUNOGRID basic immunology clinical

Summary Introduction Databases Predictions of vaccine targets Functional genomics/Immunomics Conclusion

IMMUNOGRID Database technology for storage, manipulation, and modelling of immunological data Computational models to facilitate immunological research - predictive models - mathematical models

Databases General databases Specialist immunological databases Data warehouses

General databases GenBank Prosite EMBL DDBJ PIR PDB SWISS-PROT GenPept DBCAT Catalogue of databases www.infobiogen.fr/services/dbcat

General databases Advantages significant infrastructure interfaces for data extraction and analysis curation and quality assurance of data centrally accessible standardised formats facilitating automation independently maintained and funded

General databases Disadvantages quality control of content error propagation typically poor annotation of features obsolete, incomplete, or redundant entries lack of synchronisation application of standards (nomenclature etc.)

Specialist databases KABAT HIV molecular IMGT immunology FIMM MHCPEP SLAD SYFPEITHI MHCDB 15 databases described in the JIM review

Specialist databases Advantages more detailed information created and maintained by the domain experts high level of quality assurance of data better compliance to standards have specialist tools

Specialist databases Disadvantages irregular updates low level of automation less reliable for access and currency funding uncertainty

Data warehouse goals Efficient querying, reporting and complex analyses of data Flexibility in adding tools for data analyses Scalability etc. Schönbach et al. Briefings in Bioinformatics, 2000

FIMM

Summary Introduction Databases Vaccine development Conclusion

A cancer cell under attack by T cells of the immune system Cancer cell killed

V. Brusic, 2002

Modelling MHC-binding peptides

Model requirements High accuracy Generalisation Improvement over time High specificity (cheap confirmation) High sensitivity (broad coverage) Generalisation Predict well previously unseen peptides Predict well across allelic variants Improvement over time Robustness (resistance to errors and biases)

MHC-binding peptides Binding motifs Quantitative matrices Artificial neural networks Hidden Markov models Molecular modelling

ARTIFICIAL NEURAL NETWORK P U T H I D D E N A C D E F G H I K L M N P Q R S T V W Y A C D E F G H I K L M N P Q R S T V W Y Y I N P U T

Example 1 1994 - Prediction of MHC class I binding peptides Molecule: HLA-A*0201 Subset: 9-mers Data: 186 binders, 1071 non-binders

Example Experimental testing of protein thyrosine phosphatase (IA-2) in at-risk IDDM relatives Binding assays T-cell proliferation assays Honeyman et al., Nat. Biotechnol. 1998 Brusic et al., Bioinformatics 1998

HLA-DR4 T-cell epitopes from an IDDM antigen IA-2 . HLA-DR4 T-cell epitopes from an IDDM antigen IA-2 1000 T-cell resp. < 1 SD T-cell resp. 1-2 SD T-cell resp. > 2 SD 1/IC50)*100 100 Binding Index ( 10 1 -2 2 4 6 8 10 Binding Prediction

Example 2

Cyclical refinement Initial experiments refine Optimise/ clean Computer models Further experiments define

Malaria - 500 000 000 cases per annum Example 3 Malaria - 500 000 000 cases per annum Search for vaccine targets in HLA-A11 population in Vosera - Papua New Guinea Six antigens from P. falciparum LSA-1 ~1909 AA SALSA ~ 83 AA CSP ~ 432 AA GLURP ~1262 AA STARP ~ 604 AA TRAP ~ 559 AA 3127 peptides

TRAP-559AA Example 3 MNHLGNVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYSE EVCNDQVDLYLLMDCSGSIRRHNWVNHAVPLAMKLIQQLN LNDNAIHLYVNVFSNNAKEIIRLHSDASKNKEKALIIIRS LLSTNLPYGRTNLTDALLQVRKHLNDRINRENANQLVVIL TDGIPDSIQDSLKESRKLSDRGVKIAVFGIGQGINVAFNR FLVGCHPSDGKCNLYADSAWENVKNVIGPFMKAVCVEVEK TASCGVWDEWSPCSVTCGKGTRSRKREILHEGCTSEIQEQ CEEERCPPKWEPLDVPDEPEDDQPRPRGDNSSVQKPEENI IDNNPQEPSPNPEEGKDENPNGFDLDENPENPPNPDIPEQ KPNIPEDSEKEVPSDVPKNPEDDREENFDIPKKPENKHDN QNNLPNDKSDRNIPYSPLPPKVLDNERKQSDPQSQDNNGN RHVPNSEDRETRPHGRNNENRSYNRKYNDTPKHPEREEHE KPDNNKKKGESDNKYKIAGGIAGGLALLACAGLAYKFVVP GAATPYAGEPAPFDETLGEEDKDLDEPEQFRLPEENEWN

88 NVKNVSQTNFKSLLRNLGVSENIFLKEN 115 Example 3 1) Overlapping study Twenty overlapping 9-mer peptides from the known immunogenic region of LSA-1 90 94 105 88 NVKNVSQTNFKSLLRNLGVSENIFLKEN 115 2) Initial ANN model: 98 binders and 145 non-binders 34 peptides selected and tested for HLA-A*1101 binding 3) Refined ANN model: 123 (98+13+12) binders and 203 (145+41+17) non-binders twenty-nine (29) peptides were selected and tested

Correctly predicted binders 3/20 10/36 22/29 % 76 29 15 Brusic et al. Journal of Molecular Graphics and Modelling, 2001

Prediction of cancer-related T-cell epitopes Other work Identification of relationship between TAP transporter and MHC binding using KDD techniques Brusic et al. (1999). In Silico Biology 1, 109-121. Daniel et al. (1998). Journal of Immunology 161, 617-624. Prediction of cancer-related T-cell epitopes Zarour et al. (2002). Canc. Res. 62, 213-218. Kierstad et al. (2001). Br. J. Canc. 85, 1735-1745. Zarour et al. (2000). Canc. Res. 60, 4946-4952. Zarour et al. (2000). PNAS USA 97, 400-405. Prediction of peptides that bind multiple MHC molecules Brusic et al. (2002). Immunology and Cell Biology 80, 280-285. Large-scale (genome-wide) screening of MHC binders Schönbach et al. (2002). Immunology and Cell Biology 80, 300-306. Prediction of renal transplant outcomes Petrovsky et al (2002). Graft 4, 6-13.

A substantial effort is required to model a single MHC molecule There are more than 1000 different human MHC molecules and growing The number of pathogen genomes for vaccine design is increasing rapidly Thus vaccine target identification is a parallel problem ameniable to IMMUNOGRID

Summary Introduction Databases Predictions of vaccine targets Conclusion

Conclusions Bioinformatics is revolutionising immunology The scope of immunoinformatics is huge – it comprises databases, molecular-level and organism level models, genomics and proteomics of the immune system, as well as genome-to-genome studies The size and complexity of the field necessitates a distributed approach to database management, analysis and data mining GRID provides the perfect answer to the needs of Immunoinformatics

IMMUNOGRID basic clinical immunology maths/stats molecular biology systems science maths/stats databases artificial intelligence algorithms physics/chemistry cell biology molecular biology IMMUNOGRID basic immunology clinical