CISBIC Sub-project 1: Stephen Muggleton, Brendan Wren, Victor Lesk CISBIC, Imperial College London. www.imperial.ac.uk/cisbic Modeling genotype-phenotype.

Slides:



Advertisements
Similar presentations
Distinctive characteristics
Advertisements

1.1.3 MI.
Office of Infectious Diseases Computational Challenges for Infectious Diseases Michael Shaw, PhD OID/Office of the Director.
Zhen Shi June 2, 2010 Journal Club. Introduction Most disease-causing mutations are thought to confer radical changes to proteins (Wang and Moult, 2001;
Identification of Novel Virulence-Associated Genes via Genome Analysis of Hypothetical Genes Sara Garbom, Åke Forsberg, Hans Wolf- Watz, and Britt-Marie.
Essentials of Glycobiology Lecture 43 June 10, 2004 Ajit Varki
Goals of PATHOGENOMICS related research Chief Scientist Office Ministry of Health Israel.
KEGG: Kyoto Encyclopedia of Genes and Genomes Susan Seo Intro to Bioinformatics Fall 2004.
Chapter 32 Viral Pathogenesis Samar Abu Ouda
HIV/AIDS as a Microcosm for the Study of Evolution.
Rules: Cell phones off Computers only for class related work No food or drink in lab room Text Book: Hartwell et al., 2nd Edition 2004 Genetics from Genes.
Influenza A Virus Pandemic Prediction and Simulation Through the Modeling of Reassortment Matthew Ingham Integrated Sciences Program University of British.
1. How does conjugation work? Sex in Bacteria How do bacteria exchange DNA.
Identification of Genes Involved in the Synthesis of Sialic Acid from Fusobacterium Nucleatum. Hatem Abdelhadi California State University Long Beach.
In the beginning, There was Mendel…. MEDICAL GENETICS Human Genetics
CISBIC: Sub-project 1: Stephen Muggleton CISBIC, Flowers Building, Imperial College London. Modeling genotype-phenotype relations.
DEMO CSE fall. What is GeneMANIA GeneMANIA finds other genes that are related to a set of input genes, using a very large set of functional.
Epigenome 1. 2 Background: GWAS Genome-Wide Association Studies 3.
Analysis and Management of Microarray Data Dr G. P. S. Raghava.
Chapter 13. The Impact of Genomics on Antimicrobial Drug Discovery and Toxicology CBBL - Young-sik Sohn-
Genes, which are carried on chromosomes, are the basic physical and functional units of heredity. Genes are specific sequences of bases that encode instructions.
HIV-1 evolution in response to immune selection pressures
Multivalent pneumococcal vaccines can increase the transmissibility and virulence of non-vaccine strains Eleanor Watkins, Caroline Buckee, Bridget Penman,
Sub-Project 3 Progress Report March 2009 Simon Moon, Anna Rose, Maggie Dallman and Jaroslav Stark.
What’s up with the flu? Novel H1N1? SWINE FLU??? Mexican flu? swine-origin influenza A? A(H1N1)? S-OIV? North American flu? California flu? Schweingrippe.
Streptococcus pneumoniae pathogenesis
Molecular characterisation of methicillin-sensitive Staphylococcus aureus from deep surgical site infections in orthopaedic patients.
John R. LaMontagne Memorial Symposium on Pandemic Influenza Research April 4-5, 2005 Institute of Medicine Working Group One: Influenza Virulence and Antigenic.
Predicting protein degradation rates Karen Page. The central dogma DNA RNA protein Transcription Translation The expression of genetic information stored.
Patterns of divergent selection from combined DNA barcode and phenotypic data Tim Barraclough, Imperial College London.
Introduction to DNA Microarrays: Functional Mining of Array Patterns Michael F. Miles, M.D., Ph.D. Depts. of Pharmacology/Toxicology and Neurology and.
Bio-Linux 3.0 An integrated bioinformatics solution for the EG community ClustalX showing DNA polymerase alignment GeneSpring showing yeast transcriptome.
2009 IADR, MIAMI, FL, USA Hands-on Experience for using the Human Oral Microbiome Database (HOMD) 2009 IADR Workshop, Miami, FL, USA Tsute (George) Chen.
Mining publicly available microarray data Frances Turner
Organization of statistical research. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and.
BIOSTATISTICS Lecture 2. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and creating methods.
Genome sequencing and annotation Comprehensive identification of virulence gene candidates by various means Bioinformatic prioritization of virulence gene.
Post-genomic Virology The impact of bioinformatics, microarrays and proteomics on investigating host and pathogen interactions Steven Masson.
Genetics and Genomics Forward genetics Reverse genetics
Ch. 11 DNA Structure. Chromosomes Structure  Two Components:  DNA  Protein.
“Neutralizing Antibodies Derived from the B Cells of 1918 Influenza Pandemic Survivors” (Yu et. al) Daniel Greenberg.
What is a QTL? Quantitative trait locus (loci) Region of chromosome that contributes to variation in a quantitative trait Generally used to study “complex.
Immunology Ontology Workshop Buffalo, NY June 11-13, 2012.
Antimicrobial resistance and virulence genes among Streptococcus suis serotype 2 isolates from pig-derived products  Streptococcus suis is an emerging.
Phase variation in Campylobacter jejuni. Recap: genes and proteins G TC AG CT A G CT A T AT A G C DNA Proteins UCCUGAAUG RNA 1 codon = 1 amino acid.
New research areas in personalised medicines
Bacterial and Viral Pathogenicity
The Bacteria January 12th, 2010.
Comparative genotypic and phenotypic characterization of
 The human genome contains approximately genes.  At any given moment, each of our cells has some combination of these genes turned on & others.
Table 5.1.
Peter John M.Phil, PhD Atta-ur-Rahman School of Applied Biosciences (ASAB) National University of Sciences & Technology (NUST)
Control Prevent entry Reduce exposure Vaccination
Genomics of epidemic pathogens
Taxonomical classification is recognizing and registering the worlds organism diversity – continual changing knowledge about evolutionary and ecological.
Development of a real-time fluorescence resonance energy transfer PCR to identify the main pathogenic Campylobacter spp.  A. Ménard, F. Dachet, V. Prouzet-Mauleon,
S. Borrell, S. Gagneux  Clinical Microbiology and Infection 
Aim What happens when a bacteria or virus mutates?
Working in the Post-Genomic C. elegans World
-The relationship between genes and traits. -Fields of Genetics.
J.A. McCullers  Clinical Microbiology and Infection 
One SNP at a Time: Moving beyond GWAS in Psoriasis
1.1.3 MI.
Whole-genome phylogeny of meat and human clinical Escherichia coli ST131 isolates. Whole-genome phylogeny of meat and human clinical Escherichia coli ST131.
Evolution of Salmonella within Hosts
H.D. Donoghue  Clinical Microbiology and Infection 
Distribution and clonal relationship of cell surface virulence genes among Streptococcus pneumoniae isolates in Japan  S. Imai, Y. Ito, T. Ishida, T.
Fraser Fraser 2000 Metzker 2010 Metzker 2010.
F.M. Cohan  Clinical Microbiology and Infection 
AS Level Paper 1 and 2. A2 Level Paper 1 and 3 - Topics 1-4
Presentation transcript:

CISBIC Sub-project 1: Stephen Muggleton, Brendan Wren, Victor Lesk CISBIC, Imperial College London. Modeling genotype-phenotype relations in Campylobacter

Campylobacter jejuni : Unanswered questions  Are all strains equally pathogenic to humans?  Do strains from different sources share common features?  Why is C. jejuni a commensal in avians, but highly infectious in humans?  How does C. jejuni in some cases perturb the immune system to cause neurological sequelae?  Answers are likely to lie in glyco-surface structures

C. iejuni NCTC11168 from gene sequence to glycostructure Kelly J, 2006 Aim. Model variation in pathogen genome sequence of glycosylated surface structures & understand how this variation affects immune response.

Glycostructure questions How do we get from sequence to full glycostructure? How does glycan diversity affect disease potential and niche adaptation? Use combination of traditional and systems approaches 1)Mutagenesis 2)Natural strain variation 3)Pathway modelling PglF Cj1120 PglE Cj1121 PglD Cj1123 PglC Cj1124 PglA Cj1125 PglH Cj1129 PglJ Cj1127 PglI Cj1128 PglK Cj1130 N O Basic sugars UDP-  -D-GlcNAc PglB Cj1126 C L PseB Cj1293 PseC Cj1294 PseH Cj1313 PseG Cj1312 PseI Cj1317 PseF Cj1311 PseA Cj1316

Strain collection gDNA Microarray Quality Control GeneSpring GACK analysis Gene Calling Bayesian analysis Phylogenetic analysis Statistical analysis Analysis Values Ratio Binary data Evolution of virulence & pathogenesis Phylogenomics pipeline

Natural strain variation and phylogenomics A livestock and non- livestock clade, but most clinical isolates fall in the non-livestock clade “Unknown” source of C. jejuni infection 10 further sub-clades Water/wildlife sub-clade

Identification of sub-clade/niche specific genes Identified glyco-specific genes for 1) Flagellin glyco (O) 2) Capsule (C) E.g. Phosphoramidate Genes Cj present Genes Cj absent

Importance of Capsule Side branch sugars more widely conserved than core sugars? Phosphoramidate present in 86% of strains (232/270)? Link to subproject 3 ► Capsule prevents excessive cytokine production by dendritic cells ► Presence of phosphoramidate appears to reduce cytokine production

Machine learning approach Biochemical Pathways (KEGG, Biocyc) Mutant data (MAS-NMR) Glycan Structures (MAS-NMR) Prolog Database Machine learning Background rules Hypotheses

Examples of hypotheses C. jejuni glycan synthesis pathway

Integrating multi-strain genetic data with machine learning of gene functions Genetic data for 75 serotyped strains of C. jejuni “If the product of Gene1 synthesises a sugar precursor which is transformed by the product of Gene2, then Gene1 tends to be present in strains with the same serotype as those expressing Gene2”. Working assumption for relating gene function to co-presence

Integrating multi-strain genetic data with machine learning of gene functions Knowledgebase (KEGG, BioCyc) Cj1432 Cj1434 Cj1438 Cj1440 Cj1442 Cj1432 Cj1434 Cj1438 Cj1440 Cj1442 Cj1432 Cj1434 Cj1438 Cj1440 Cj1442 ? ? ? WTCj1433-Cj1432- Glycan mutant observations Hypothesized transferase- encoding genes

Integrating multi-strain genetic data with machine learning of gene functions Knowledgebase (KEGG, BioCyc) Cj1438 Cj1442 Cj1432 Cj1434 ? ? ? WTCj1433-Cj1432- Glycan mutant observations Hypothesized transferase- encoding genes Genetic data for serotyped strains

Learning from strain data (work in progress) Learning curves with and without strain data Gene functions provided to learning (%) Gene functions identified by learning (%)

Acknowledgements PI‘s and senior staff: Anne Dell, Stephen Muggleton, Jeremy Nicholson, Chris Rawlings, Mike Sternberg, Brendan Wren Postdocs: Richard Barton, Paul Hitchen, Emily Kay, Victor Lesk, Alireza Tamaddoni- Nezhad