Jianlin Jack Cheng Computer Science Department University of Missouri, Columbia, USA Mexico, 2014
Targeted Sampling Fold Space Alignment SpaceModel Pool Sequence Space Model Generation Template & Alignment Combination
Internal or CASP Model Pool Combination Refinement Side Chain Tuning Massive Assessment Model Ranking
Samplers BLAST CSBLAST CSIBLAST PSIBLAST SAM HMMer HHSearch HHblits HHsuite MULTICOM PRC FFAS Compass MUSTER RaptorX 1. Alignment Combination Based on E- Values 2. Alignment Combination Based on Structures 3. Multiple Sequence Alignment + Structural Features 150 – 200 Models Template Library Alignment Combination Model Generation Modeller MTMG FUSION 125,000 templates (in-house) 125,000 templates (in-house) 39,000 (in-house) 39,000 (in-house) Third- party (local) Fold Sampling
MULTICOM (Server) MULTICOM (Human) Servers (partial list) Best (domains) nns10 BAKER-ROSETTASERVER8 IntFOLD37 Zhang-Server4 TASSER-VMT3 MULTICOM Server2 QUARK2 RBO_Aleph2 HHPred-A2 FFAS-3D2 myprotein-me2 PhyreX1 SAM-T08-server1 ZHOU-SPARKS-X1 HHPred-X1
Methods (blue: in-house) TypeFeatures MULTICOM-NOVELSingleStructural, physical, chemical features OPUS-PSPSCa atom contact potentials Proq2SStructural features RWplusSSide-chain orientation dependent potential ModelEva1SStructural features, contacts ModelEva2SStructural features, contacts, disorder, conservation RS_CB_SRSSDistance dependent statistical potential SELECTproSEnergy-based (h-bond, angle, electrostatics, vdw) DopeSStatistical potential DFire2SEnergy-based potential Modfoldcluster2ClusterPairwise model similarity (geometry) APOLLOCPairwise model similarity PconsCPairwise model similarity QAproC + SWeighted pairwise model similarity MULTICOM (human)ConsensusAverage ranking
Methods (blue: in-house) TypeAverage GDT-TS # Better# Best MULTICOM-NOVELSingle OPUS-PSPS Proq2S RWplusS ModelEva1S ModelEva2S RS_CB_SRSS0.343 SELECTproS0.411 DopeS DFire2S Modfoldcluster2Cluster0.403 APOLLOC PconsC0.402 QAproC + S MULTICOM (human)Consensus
Combine similar models or fragments 3DRefine (energy, bond, angle) + FUSION to refold unaligned loops and tails + SCRWL for side chain packing (server) Automated detection and replacement of bad models (worked in all 13 server exception cases)
Templates: 4IB2, 4EF1, 4OTE, 4K3F, 3UP9, 3GXA, 4GOT The best server model designated as the first model Distribution of GDT-TS Scores of MULTICOM Server Models GDT: 0.87 GDT: 0.73 GDT: Blue: structure Gold: model GDT-TS score: 0.86
Blue: structure Gold: model GDT-TS score: 0.59 Server models: Zhang-Server_TS1 BAKER-ROSETTASERVER_TS4 myprotein-me_TS1 Human model is better than Zhang-Server_TS1 Distribution of GDT-TS Scores of CASP Server Models
Blue: structure Gold: model GDT-TS score: 0.63 Human model: The same GDT-TS score Better side-chain quality Server models: nns_TS1 nns_TS3 nns_TS2 FFAS-3D_TS1 Distribution of GDT-TS Scores of CASP Server Models
Blue: structure Gold: model GDT-TS score: ~0.22 Selected and combined models of low (average) quality Distribution of GDT-TS Scores of CASP Server Models
Large-scale independent sampling Large-scale quality assessment Exception handling Model combination Model refinement Model refolding Template recognition in thin, remote profile Alignment in thin, remote profile Quality assessment with few good models
Group Members Badri Adhikari Deb Bhattacharya Renzhi Cao Jilong Li CASP Assessors Dr. Roland Dunbrack CASP Organizers CASP Server Predictors