Jianlin Cheng Computer Science Department & Informatics Institute

Slides:



Advertisements
Similar presentations
CWS: A Comparative Web Search System Jian-Tao Sun, Xuanhui Wang, § Dou Shen Hua-Jun Zeng, Zheng Chen Microsoft Research Asia University of Illinois at.
Advertisements

+ Multi-label Classification using Adaptive Neighborhoods Tanwistha Saha, Huzefa Rangwala and Carlotta Domeniconi Department of Computer Science George.
Iterative Optimization and Simplification of Hierarchical Clusterings Doug Fisher Department of Computer Science, Vanderbilt University Journal of Artificial.
Ke Liu1, Junqiu Wu2, Shengwen Peng1,Chengxiang Zhai3, Shanfeng Zhu1
Dukka Application of Monte Carlo Simulation: Removing averaging artifacts in protein structure prediction.
Xin Gao PhD student Outline Traditional Protein Structure Prediction  Introduction  Methods Review  Experimental Results Refinement  Motivation.
Three-Stage Prediction of Protein Beta-Sheets Using Neural Networks, Alignments, and Graph Algorithms Jianlin Cheng and Pierre Baldi Institute for Genomics.
Content Based Image Clustering and Image Retrieval Using Multiple Instance Learning Using Multiple Instance Learning Xin Chen Advisor: Chengcui Zhang Department.
Clustered alignments of gene- expression time series data Adam A. Smith, Aaron Vollrath, Cristopher A. Bradfield and Mark Craven Department of Biosatatistics.
Jianlin Cheng, PhD Informatics Institute, Computer Science Department University of Missouri, Columbia Fall, 2011.
Protein Threading Optimization Using Consensus Homology Modeling Maliha Sarwat ( ), Tasmin Tamanna Haque ( ) Department of Computer Science.
Abstracts of main servers in CASP11
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modeling Anne Mølgaard, CBS, BioCentrum, DTU.
Contact Lens: Evaluating Protein Structure by Contacts Contact Lens: Evaluating Protein Structure by Contacts RMSD vs. Contact Lens Root Mean Square Distance.
The 7 steps of Homology modeling. 1: Template recognition and initial alignment.
MULTICOM – A Combination Pipeline for Protein Structure Prediction
Hybrid Protein Model Quality Assessment Jianlin Cheng Computer Science Department & Informatics Institute University of Missouri, Columbia, MO, USA.
Modelling Workshop - Some Relevant Questions Prof. David Jones University College London Where are we now? Where are we going? Where should.
Detecting the Domain Structure of Proteins from Sequence Information Niranjan Nagarajan and Golan Yona Department of Computer Science Cornell University.
Face Alignment Using Cascaded Boosted Regression Active Shape Models
Homology Modeling David Shiuan Department of Life Science and Institute of Biotechnology National Dong Hwa University.
Calibration of Design Methods for Slope Stabilization and Earth Retention J. Erik Loehr Civil and Environmental Engineering University of Missouri - Columbia.
Modelling binding site with 3DLigandSite Mark Wass
1 Formal Models for Expert Finding on DBLP Bibliography Data Presented by: Hongbo Deng Co-worked with: Irwin King and Michael R. Lyu Department of Computer.
Representations of Molecular Structure: Bonds Only.
Learning user preferences for 2CP-regression for a recommender system Alan Eckhardt, Peter Vojtáš Department of Software Engineering, Charles University.
Personalized Search Cheng Cheng (cc2999) Department of Computer Science Columbia University A Large Scale Evaluation and Analysis of Personalized Search.
Personalized Web Search by Mapping User Queries to Categories Fang Liu Presented by Jing Zhang CS491CXZ February 26, 2004.
Lecture 12 CS5661 Structural Bioinformatics Motivation Concepts Structure Prediction Summary.
Modeling Protein Structures and Gene Regulatory Networks by Mining Protein and RNA-Seq Data Jianlin Jack Cheng, PhD Computer Science Department University.
Clustering Personalized Web Search Results Xuehua Shen and Hong Cheng.
Protein Folding Programs By Asım OKUR CSE 549 November 14, 2002.
EntityRank :Searching Entities Directly and Holistically Tao Cheng, Xifeng Yan, Kevin Chen-Chuan Chang Computer Science Department, University of Illinois.
Summarizing Conversations with Clue Words Giuseppe Carenini Raymond T. Ng Xiaodong Zhou Department of Computer Science Univ. of British Columbia.
Jianlin Jack Cheng Computer Science Department University of Missouri, Columbia, USA Mexico, 2014.
MolIDE2: Homology Modeling Of Protein Oligomers And Complexes Qiang Wang, Qifang Xu, Guoli Wang, and Roland L. Dunbrack, Jr. Fox Chase Cancer Center Philadelphia,
Protein Secondary Structure, Bioinformatics Tools, and Multiple Sequence Alignments Finding Similar Sequences Predicting Secondary Structures Predicting.
A Novel Local Patch Framework for Fixing Supervised Learning Models Yilei Wang 1, Bingzheng Wei 2, Jun Yan 2, Yang Hu 2, Zhi-Hong Deng 1, Zheng Chen 2.
Multiple Mapping Method with Multiple Templates (M4T): optimizing sequence-to-structure alignments and combining unique information from multiple templates.
Department of Computer Science, Graduate School of Information Science & Technology, Osaka University Retrieving Similar Code Fragments based on Identifier.
Domains or not domains? ShuoYong Shi, Indraneel Majumdar and Nick V. Grishin Howard Hughes Medical Institute, Department of Biochemistry, University of.
Exploit of Online Social Networks with Community-Based Graph Semi-Supervised Learning Mingzhen Mo and Irwin King Department of Computer Science and Engineering.
Structure prediction: Homology modeling
Lessons from CASP targets ShuoYong Shi, Lisa Kinch, Jimin Pei, Ruslan Sadreyev, and Nick V. Grishin Howard Hughes Medical Institute, Department of Biochemistry,
Computational engineering of bionanostructures Ram Samudrala University of Washington How can we analyse, design, & engineer peptides capable of specific.
Modelling protein tertiary structure Ram Samudrala University of Washington.
Data Mining, ICDM '08. Eighth IEEE International Conference on Duy-Dinh Le National Institute of Informatics Hitotsubashi, Chiyoda-ku Tokyo,
Rosetta Steven Bitner. Objectives Introduction How Rosetta works How to get it How to install/use it.
Active Feedback in Ad Hoc IR Xuehua Shen, ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Structural alignment methods Like in sequence alignment, try to find best correspondence: –Look at atoms –A 3-dimensional problem –No a priori knowledge.
Surflex: Fully Automatic Flexible Molecular Docking Using a Molecular Similarity-Based Search Engine Ajay N. Jain UCSF Cancer Research Institute and Comprehensive.
Iterative K-Means Algorithm Based on Fisher Discriminant UNIVERSITY OF JOENSUU DEPARTMENT OF COMPUTER SCIENCE JOENSUU, FINLAND Mantao Xu to be presented.
R ESEARCH U PDATE ON J ULY 17 TH – A MY (1) Found tools for RNA tertiary structure prediction From secondary structure to tertiary structure o NAST(2009):
CoMFA Study of Piperidine Analogues of Cocaine at the Dopamine Transporter: Exploring the Binding Mode of the 3  -Substituent of the Piperidine Ring Using.
哈工大信息检索研究室 HITIR ’ s Update Summary at TAC2008 Extractive Content Selection Using Evolutionary Manifold-ranking and Spectral Clustering Reporter: Ph.d.
Copyright © 2014 American Institutes for Research and Cleveland Metropolitan School District. All rights reserved. March 2014 Interpreting Vendor Assessment.
FM Model Assessment: Old scores and New Combinations ShuoYong Shi Nick Grishin Lab.
Zachary Starr Dept. of Computer Science, University of Missouri, Columbia, MO 65211, USA Digital Image Processing Final Project Dec 11 th /16 th, 2014.
Automated Structure Prediction using Robetta in CASP11 Baker Group David Kim, Sergey Ovchinnikov, Frank DiMaio.
Scoring the Technical Evaluation Maximum possible score
Project name and logo Workflow materials models: template 1
TEMPLATE-BASED METHODS FOR PROTEIN MODEL QA
CLSciSumm-2018 What to submit Task Framework Task 1A Task 1B
KMeans Clustering on Hadoop Fall 2013 Elke A. Rundensteiner
Movie Recommendation System
National University of Laos
Volume 20, Issue 6, Pages (June 2012)
Discussion of Protein Disorder Prediction
High-Resolution Comparative Modeling with RosettaCM
Presentation transcript:

A Multi-Template Multi-Model Combination Approach to Template-Based Modeling Jianlin Cheng Computer Science Department & Informatics Institute University of Missouri, Columbia, MO, USA

5. Combination & Refinement (2-3%) 1. Template Ranking 2. Multiple-Template Combination Alignments Combination Query-Template 1 MAR-TCRK-EGAP-WY… Y-R-MH-R-DGM-MWT… TAKMTHK-DEGFG-YW… MARTCRKEGAP-WY… Y-RMH-RDGM-MWT… Input Query . MARTCRKE… Query-Template 2 MAR-TCRK-EGAPWY… TAKMTHK-DEGFGYW… . . 4. Evaluation 5. Combination & Refinement (2-3%) 3. Model Generation Models Generator Output CASP8 Server Models

Traditional Model Selection Single-Model Evaluation Clustering / Consensus Approach

Global-Local Model Combination CASP8 Models Rank models by GDT-TS scores predicted by ModelEvaluator …… . Put relatively good, but not the best models at the top

Global-Local Model Combination Structure comparison by TM-Score . . Select top 5 models as seed models Identify similar models or fragments Retain top 50% models

Global-Local Model Combination Globally similar models Locally similar model fragments Combination and iterative modeling by Modeller Side chain rebuilt by SCWRL.

Some High-Quality Predictions GDT=0.90 T0426 GDT=0.97 T0432 GDT=0.92 T0458 GDT=0.97 Orange: structure; Green: model H-Bonds are well predicted.

Conclusions Iterative modeling and averaging improve side-chain placement, geometry, and H-Bonds Combining multiple good similar models can produce a model better than the top ranked model Combined models are at least as good as centroids and have no steric clashes

Acknowledgements CASP8 organizers and assessors CASP8 participants MU colleagues: Dong Xu, Toni Kazic My group: Zheng Wang Allison Tegge Xin Deng