Mod. Reg. Data set Correct topology location Sens- itivity Speci- ficity TMMOD 1 (a) (b) (c) S (78.3%) 51 (61.4%) 64 (77.1%) 67 (80.7%) 52 (62.7%)

Slides:



Advertisements
Similar presentations
Week 8 Video 4 Hidden Markov Models.
Advertisements

Prediction of protein localization and membrane protein topology Gunnar von Heijne Department of Biochemistry and Biophysics Stockholm Bioinformatics Center.
Protein Sequence Classification Using Neighbor-Joining Method
Bioinformatics Research Overview Li Liao Develop new algorithms and (statistical) learning methods that help solve biological problems > Capable of incorporating.
“Homology-enhanced probabilistic consistency” multiple sequence alignment : a case study on transmembrane protein Jia-Ming Chang 2013-July-09 Chang, J-M,
TMpro: Transmembrane Helix Prediction using Amino Acid Properties and Latent Semantic Analysis Madhavi Ganapathiraju, N. Balakrishnan, Raj Reddy and Judith.
Localization prediction of transmembrane proteins Stefan Maetschke, Mikael Bodén and Marcus Gallagher The University of Queensland.
Wellcome Trust graduate course. - Computational Methods series. --- Sequence-based bioinformatics. Dr. Hyunji Kim Department of Biochemistry, University.
Bioinformatics Research Overview Li Liao Develop new algorithms and (statistical) learning methods > Capable of incorporating domain knowledge > Effective,
Machine Learning & Bioinformatics 1 Tien-Hao Chang (Darby Chang)
Symbols used to construct molecular circuit diagrams in systems biology. (Top) Sample flowchart symbols. (Bottom) Graphical symbols for molecular interaction.
Hidden Markov Models (HMM)
Membrane topology prediction of the SLC45A2 protein using TMHMM (v. 2
C N TM1 (a) WARLVMCFVLVLITTSIWTLIMV SOSUI PSORT II
Using Bioinformatics: Genetic Testing
Abundance of proteins matching selected subcellular locations and functions in CaCo‐2 cells. Abundance of proteins matching selected subcellular locations.
Combining HMMs with SVMs
Immune features from Model 3 that associate with and predict the HAI response to TIV. The cross‐validated area under the ROC curve (cvAUROC) for Model.
Motif detectability corresponds to the phylogenetic profile of the cognate transcription factor. Motif detectability corresponds to the phylogenetic profile.
Exo‐metabolomics profiling of major metabolites elucidated the metabolic capabilities of monospecies Exo‐metabolomics profiling of major metabolites elucidated.
Topological Ordering Algorithm: Example
Validation of the CCE associations by mass spectrometry.
Structure of in‐frame deletions.
Model training of generalized Lotka–Volterra (gLV) to time‐resolved measurements of monospecies and pairwise assemblages Model training of generalized.
CISC 841 Bioinformatics (Fall 2007) Hybrid models
Network derived from large‐scale fractionation predicts 48 protein complexes and communities Network derived from large‐scale fractionation predicts 48.
№36 საჯარო სკოლა X კლასი ლუკა აშხატოევი
different predicted values (D)
Model of human CLC-Kb transporter topology and positions of amino acid exchanges due to mutations in the CLCNKB gene as identified in Bartter syndrome.
Heat maps showing global relative growth phenotype and comparison between measured and predicted values. Heat maps showing global relative growth phenotype.
Percentage of proteins identified in envelope membrane extracts according to the purification method and the number of transmembrane domains. Percentage.
Figure 3. Genes differentially expressed in batch cultures during adaptation to low temperature. Genes differentially expressed in batch cultures during.
Topological Ordering Algorithm: Example
Shielding of bacterial spores against galactic cosmic radiation by meteorite material. Shielding of bacterial spores against galactic cosmic radiation.
CISC 667 Intro to Bioinformatics (Fall 2005) Hidden Markov Models (IV)
Protein information in the Human Protein Atlas.
Visual validation of the computational outputs.
A connectionist model in action
Results from the Morris water task.
Phylogenetic analysis of PurR from low-GC Gram-positive bacteria.
Topological Ordering Algorithm: Example
Patient organoids respond more diverse to drugs and with lower therapeutic potential than 2D cultured patient cells Patient organoids respond more diverse.
The cell cycle influences circadian phase progression Circadian intervals with divisions (p1,d1,p2) last 21.95 ± 3.8 h (n = 1,926) and are significantly.
Profile HMMs GeneScan TMMOD
Shared components define the ‘ancient’ phagosome.
Transcriptomic and epigenetic changes associated with Factor 1 in the scMT data Transcriptomic and epigenetic changes associated with Factor 1 in the scMT.
Direct role of HIS-24 and HPL in stress response.
Simulation showing that the cell length variability of the entire population can mask abnormal cell length variability at a specific cell cycle period.
Experimental validation of predicted endocytosis functions in human.
Integration of proteomic data into the model
Mathematical modeling of affinities between measured species
Alignment of N-terminal sequences of type IV pilin proteins.
(A) Effect of increasing d-glucose concentration (2
Seasonal effect of MS transport of 25 μmol l-1 d-[3H]fructose in the presence of 100 μmol l-1 mucosal phloridizin and 100 μmol l-1 mucosal phloretin. Seasonal.
The alignment of P-type ATPase-like domains of TgATPaseP-GC, PfGCα, and PfGCβ with the members of human P4-ATPases. The alignment of P-type ATPase-like.
Summary of major events in Earth history relating to the appearance of eukaryotic groups and the appearance of O2 in the atmosphere and in marine environments.
Mean Pcrit for each species (averaged across individuals within that species), plotted versus log mean mass (mg) of individuals in that species (filled.
Mean CO2 release rates (μmol CO2 h–1; averaged across all individuals tested) at normoxia as a function of mean fresh mass (mg; log–log plot) for tenebrionid.
Mean and s.d. values of the observed vertical impulse ratio (Robs) and predicted vertical impulse ratio (Rpred). Mean and s.d. values of the observed vertical.
(A) Mean maximum single-jaw forces (± s.d.), by species.
Heatmap showing Spearman’s rho values for significant correlations between weed abundances and bacterial classes (i.e., OTU data pooled at the class level;
Topological Ordering Algorithm: Example
Mean CO2 release rates (averaged across all individuals tested) versus PO2 for each species of (A) tenebrionid and (B) scarabaeid. Mean CO2 release rates.
Effect of mianserin (MI) on the stimulation of dancing by cocaine (COC). Effect of mianserin (MI) on the stimulation of dancing by cocaine (COC). Dance.
Schematic representation of the hydrolysis of amorphous and microcrystalline cellulose by noncomplexed (A) and complexed (B) cellulase systems. Schematic.
Antibiofilm strategies arising from fundamental research.
Diagnostic plots of the TGI PKPD model fitted to the A677 TGI data.
In the signal transduction pathway finder array, fold expression differences were analyzed through the SA Biosciences Web page, transferred into MATLAB,
General overview of the bioinformatic pipelines for the 16S rRNA gene microbial profiling and shotgun metagenomics. General overview of the bioinformatic.
E2F4 program is predictive of the efficacy of intravesical BCG immunotherapy in NMIBC. A, the survival curves of intravesical therapy-treated and untreated.
Presentation transcript:

TMMOD: An improved hidden Markov model for predicting transmembrane topology (Bioinformatics 2005)

Mod. Reg. Data set Correct topology location Sens- itivity Speci- ficity TMMOD 1 (a) (b) (c) S-83 65 (78.3%) 51 (61.4%) 64 (77.1%) 67 (80.7%) 52 (62.7%) 97.4% 71.3% 97.1% TMMOD 2 61 (73.5%) 54 (65.1%) 66 (79.5%) 99.4% 93.8% 99.7% TMMOD 3 70 (84.3%) 74 (89.2%) 71 (85.5%) 98.2% 95.3% 99.1% TMHMM 69 (83.1%) 96.2% PHDtm (85.5%) (88.0%) 98.8% 95.2% S-160 117 (73.1%) 92 (57.5%) 128 (80.0%) 103 (64.4%) 126 (78.8%) 77.4% 96.1% 97.0% 80.8% 96.7% 120 (75.0%) 97 (60.6%) 118 (73.8%) 132 (82.5%) 121 (75.6%) 135 (84.4%) 98.4% 97.7% 97.2% 95.6% 110 (68.8%) 135 (84.4%) 133 (83.1%) 124 (77.5%) 143 (89.4%) 97.8% 94.5% 98.3% 97.6% 98.1% 123 (76.9%) 134 (83.8%)

Fitness value:

L. Petersen et al, (2003), J. Mol. Biol. 326:1361-1372.

BMC Bioinformatics, 2007.