Technology from seed Computer Assisted Analysis of Pulsed-field Gel Electrophoresis gels João André Carriço 40th ESCMID Postgraduate.

Slides:



Advertisements
Similar presentations
©2011 Elsevier, Inc. Molecular Tools and Infectious Disease Epidemiology Betsy Foxman Chapter 7 Omics Analyses in Molecular Epidemiologic Studies.
Advertisements

Different types of data e.g. Continuous data:height Categorical data ordered (nominal):growth rate very slow, slow, medium, fast, very fast not ordered:fruit.
Clustering Clustering of data is a method by which large sets of data is grouped into clusters of smaller sets of similar data. The example below demonstrates.
Basic Gene Expression Data Analysis--Clustering
HW 4 Answers.
Clustering Basic Concepts and Algorithms
Principal Component Analysis (PCA) for Clustering Gene Expression Data K. Y. Yeung and W. L. Ruzzo.
HMM-BASED PATTERN DETECTION. Outline  Markov Process  Hidden Markov Models Elements Basic Problems Evaluation Optimization Training Implementation 2-D.
Distance methods. UPGMA: similar to hierarchical clustering but not additive Neighbor-joining: more sophisticated and additive What is additivity?
Clustering Evaluation April 29, Today Cluster Evaluation – Internal We don’t know anything about the desired labels – External We have some information.
Clustering Petter Mostad. Clustering vs. class prediction Class prediction: Class prediction: A learning set of objects with known classes A learning.
Cluster Validation.
Clustering. What is clustering? Grouping similar objects together and keeping dissimilar objects apart. In Information Retrieval, the cluster hypothesis.
Principal Component Analysis (PCA) for Clustering Gene Expression Data K. Y. Yeung and W. L. Ruzzo.
Terminology of phylogenetic trees
Dimension reduction : PCA and Clustering Slides by Agnieszka Juncker and Chris Workman modified by Hanne Jarmer.
Quantitative analysis of 2D gels Generalities. Applications Mutant / wild type Physiological conditions Tissue specific expression Disease / normal state.
An Overview of Clustering Methods Michael D. Kane, Ph.D.
1 Limma homework Is it possible that some of these gene expression changes are miscalled (i.e. biologically significant but insignificant p value and vice.
Computational Biology Group. Class prediction of tumor samples Supervised Clustering Detection of Subgroups in a Class.
Distance-Based Approaches to Inferring Phylogenetic Trees BMI/CS 576 Colin Dewey Fall 2010.
Correspondence between typing methods results: A web-based quantitative analysis tool Carriço, J.A.; F. R. Pinto; J. Melo-Cristino; H. de Lencastre ; J.
Today Cluster Evaluation Internal External
Supplemental Digital Content 1:
Machine Learning for the Quantified Self
Figure 1 Principle coordinates analysis plot for 127 Escherichia coli isolates from the SENTRY and Meropenem Yearly Susceptibility Test Information Collection.
Laboratory investigation for clonality of a foodborne outbreak due to
Escherichia coli O104:H4 south-west France, June 2011
Carriço, J. A. ; F. R. Pinto; J. Melo-Cristino; H. de Lencastre ; J. S
J. Engberg, D. D. Bang, R. Aabenhus, F. M. Aarestrup, V. Fussing, P
S. Le Hello, V Falcot, F Lacassin, F Baumann, P Nordmann, T Naas 
A. N. Maniatis, S. Pournaras, S. Orkopoulou, P. T. Tassios, N. J
Salmonella genomic island 1-J variants associated with change in the antibiotic resistance gene cluster in multidrug-resistant Salmonella enterica serovar.
HSP65-PRA identification of non-tuberculosis mycobacteria from 4892 samples suspicious for mycobacterial infections  M. Saifi, E. Jabbarzadeh, A.R. Bahrmand,
K. Diestra, E. Miró, C. Martí, D. Navarro, J. Cuquet, P. Coll, F
Sporadic occurrence of CMY-2-producing multidrug-resistant Escherichia coli of ST- complexes 38 and 448, and ST131 in Norway  U. Naseer, B. Haldorsen,
D.S. Hansen, R. Skov, J.V. Benedí, V. Sperling, H.J. Kolmos 
Foodborne transmission of sorbitol-fermenting Escherichia coli O157:[H7] via ground beef: an outbreak in northern France, 2011  L.A. King, E. Loukiadis,
Displaying the diversity in the natural history of infections of Pseudomonas aeruginosa in patients with non-cystic fibrosis bronchiectasis including:
D.R. Knight, P. Putsathit, B. Elliott, T.V. Riley 
(A) Hierarchical clustering was performed to identify groups of patients with similar RNASeq expression of 20 genes associated with reduced survivability.
Emergence of resistance to fluoroquinolones and third-generation cephalosporins in Shigella flexneri subserotype 1c isolates from China  S. Qiu, X. Xu,
PCR ribotyping and arbitrarily primed PCR for the comparison of enterotoxigenic Bacteroides fragilis strains from two Polish university hospitals  Gayane.
Cluster diagram for four patient pairs with genotyped strains sharing the same sequence typing (ST). Cluster diagram for four patient pairs with genotyped.
Dissemination of multidrug-resistant, class 1 integron-carrying Acinetobacter baumannii isolates in Taiwan  L.-Y. Huang, T.-L. Chen, P.-L. Lu, C.-A. Tsai,
Phylogenetic analysis of 3,582 HAV isolates in the VP1/P2B region (315-bp fragment). Phylogenetic analysis of 3,582 HAV isolates in the VP1/P2B region.
Nearest Neighbors CSC 576: Data Mining.
Ciprofloxacin-resistant, CTX-M-15-producing Escherichia coli ST131 clone in extraintestinal infections in Italy  M. Cerquetti, M. Giufrè, A. García-Fernández,
Representative RAPD profiles from pools of H
C. Silva-Costa, F.R. Pinto, M. Ramirez, J. Melo-Cristino 
Close genetic relationship between Salmonella enterica serovar Enteritidis isolated from patients with diarrhoea and poultry in the Republic of Korea 
Emergence of extended-spectrum β-lactamase-producing Escherichia coli and Klebsiella pneumoniae during the years 2000 and 2004 in Helsinki, Finland  S.D.
Endemic meningococcal disease in Cerdanyola, Spain, 1987–93: molecular epidemiology of the isolates of Neisseria meningitidis  M. Esther Verdú, Pere Coll,
Molecular characterisation of a dominant methicillin-resistant Staphylococcus aureus (MRSA) clone in a Mexican hospital (1999–2003)  G. Echániz-Aviles,
Genetic relatedness between group B streptococci originating from bovine mastitis and a human group B streptococcus type V cluster displaying an identical.
L. R. Ásmundsdóttir, H. Erlendsdóttir, A. L. Gísladóttir, M
Clonality among ampicillin-resistant Enterococcus faecium isolates in Sweden and relationship with ciprofloxacin resistance  E. Torell, J. Kühn, B. Olsson-Liljeauist,
Dendrogram constructed on the basis of RiboPrint pattern types obtained for 13 complement-sensitive and 2 complement-resistant strains of M. catarrhalis.
Diversity of carbapenemases in clinical isolates of Enterobacteriaceae in Croatia—the results of a multicentre study  V. Zujić Atalić, B. Bedenić, E.
Comparison of macrorestriction analysis of genomic DNA by pulsed-field gel electrophoresis and ribotyping with conventional methods for differentiation.
Clonal dissemination of two clusters of Acinetobacter baumannii producing OXA-23 or OXA-58 in Rome, Italy  R.E. Mendes, T. Spanu, L. Deshpande, M. Castanheira,
Usefulness of double locus sequence typing (DLST) for regional and international epidemiological surveillance of methicilin-resistant Staphylococcus aureus 
S. aureus recovered from the airways of a cystic fibrosis patient.
F. B. Spencker, S. Haupt, M. C. Claros, S. Walter, T. Lietz, R
Deviations between de novo or insertion trees and gold standard trees.
Genetic diversity of community-associated methicillin-resistant Staphylococcus aureus in southern Stockholm,   H. Fang, G. Hedin, G. Li, C.E.
Combination of pulsed-field gel electrophoresis (PFGE) and single-enzyme amplified fragment length polymorphism (SAFLP) for differentiation of multiresistant.
DGGE analysis of tet(M) in water samples and in bacterial isolates.
Example of population analysis profile curves for vancomycin-susceptible and heterogeneous vancomycin-intermediate S. aureus strains. Example of population.
Presentation transcript:

technology from seed Computer Assisted Analysis of Pulsed-field Gel Electrophoresis gels João André Carriço 40th ESCMID Postgraduate Education Course: Bacterial Molecular Typing – A Practical and Theoretical Course

technology from seed Why “Computer assisted” Analysing 2 gels to detect outbreak situation –worst case scenario: ((2*27 (1) )*(2*27-1))/2=1431 comparisons Analysing 20 gels from an 1 year longitudinal study –worst case scenario: ((20*27 (1) )*(20*27-1))/2= comparisons! “Please have a computer do that for me!”.... But the computers won’t replace the eyes...hence the “assisted” (1) 30 lanes gels with 3 markers =27 isolates

technology from seed Band position tolerance: 1.25% Optimization: 0.5% UPGMA Dice coefficients Cut-off 80% Dendrogram construction: Image Analysis settings: Cluster Cut-off:/Type definition: Dendrograms for PFGE Type assignment What is behind the dendrogram? McDougal, L. K., et all J Clin Microbiol 41:

technology from seed Image Analysis : Normalization Molecular size markers: Lambda Reference Strains

technology from seed Image Analysis : Band matching Typical settings: Band position tolerance Optimization a b c d e f

technology from seed Cluster Analysis: Distance metrics # bands 11 1 # bands difference 4 4 Dice Jaccard nj=nj= n i,j =ni=ni= Dice = 83,3%81.8% Jaccard= 71,4%69.2% By calculating all possible pairs you build a Similarity Matrix a b c d e f

technology from seed Building the Dendrogram abcdefabcdef ab c d e f a b Sim (c, (a,b))= ( )/2 =90.15 c UPGMA: Unweighted Pair Group Method with Arithmetic mean a b c d e f ab c d e f And you procede until you have everything in one group...

technology from seed Building the Dendrogram abcdefabcdef UPGMA abcdefabcdef Single Linkage Complete Linkage abcdefabcdef

technology from seed Validating the Dendrogram: Cophenetic Correlation abcdefabcdef a b c d e f abcdefabcdef abcdefabcdef Pearson´s R

technology from seed Assigning strain to types Tenover’s Criteria: if two strains’ band patterns differ up to 6 bands, counted on both lanes, they are considered to be from the same type Tenover, F, et all J Clin Microbiol 43(11): Cluster Cut-off: Normally tries to emulate the Tenover’s criteria The dendrogram should be looked upon and analysed taking in consideration other relevant epidemiological data

technology from seed Assigning strain to types Serrano, I, et al J Clin Microbiol 43(4): S.pneumoniae

technology from seed Problems in PGFE R6 N315 T4 1:10 1:51:21:1 ????????? ??? ???????????? ?????? R6 N315 T4 1:10 1:51:21:1

technology from seed Problems in data analysis An.Epidemiol.16, (3) 2006: 157–169

technology from seed Conclusions PFGE is still the “gold standard” typing method of choice for several microorganisms The PFGE Image Analysis should always be carefully done and the program settings and parameters need to be carefully selected...but other epidemilogically relevant data should always be considered! Dendrograms aren’t the best method for analysing PFGE data, but they are “easily” understandable and provide a way to summarize data.... New methods are needed. As all gel based methods the inter-laboratory reproducibility involves a big concerted effort... Sequence-based methods are much more appealing for data exchange Computers help you in the analysis...they don’t do the analysis for you ;-)

technology from seed