Current Status and Future Directions for TEXTAL March 2, 2003 The TEXTAL Group at Texas A&M: Thomas R. Ioerger James C. Sacchettini Tod Romo Kreshna Gopal.

Slides:



Advertisements
Similar presentations
Alberta Tools ESRD Hydrology Community of Practice Chiadih Chang
Advertisements

Rosetta Energy Function Glenn Butterfoss. Rosetta Energy Function Major Classes: 1. Low resolution: Reduced atom representation Simple energy function.
Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee.
A 3-D reference frame can be uniquely defined by the ordered vertices of a non- degenerate triangle p1p1 p2p2 p3p3.
Topology Generation Suat Mercan. 2 Outline Motivation Topology Characterization Levels of Topology Modeling Techniques Types of Topology Generators.
Computing Protein Structures from Electron Density Maps: The Missing Loop Problem I. Lotan, H. van den Bedem, A. Beacon and J.C. Latombe.
Prediction to Protein Structure Fall 2005 CSC 487/687 Computing for Bioinformatics.
Ioerger Lab – Bioinformatics Research
1 Chapter 12: Decision-Support Systems for Supply Chain Management CASE: Supply Chain Management Smooths Production Flow Prepared by Hoon Lee Date on 14.
The TEXTAL System: Automated Model-Building Using Pattern Recognition Techniques Dr. Thomas R. Ioerger Department of Computer Science Texas A&M University.
CAPRA: C-Alpha Pattern Recognition Algorithm Thomas R. Ioerger Department of Computer Science Texas A&M University.
Agenda Overview Why TransCAD Challenges/tips Initiatives Applications.
The TEXTAL System for Automated Model Building Thomas R. Ioerger Texas A&M University.
The Protein Data Bank (PDB)
PcaA Mycolic acid cyclopropyl synthase (Smith&Sacchettini) original structure solved at 2.0A via MAD R-value = 0.22, R-free = residues,  fold.
Clementine Server Clementine Server A data mining software for business solution.
TEXTAL - Automated Crystallographic Protein Structure Determination Using Pattern Recognition Principal Investigators: Thomas Ioerger (Dept. Computer Science)
Object Recognition. Geometric Task : find those rotations and translations of one of the point sets which produce “large” superimpositions of corresponding.
Automated Model-Building with TEXTAL Thomas R. Ioerger Department of Computer Science Texas A&M University.
Recent Developments in TEXTAL Phenix Workshop Berkeley Sept Thomas R. Ioerger Texas A&M University.
TEXTAL: A System for Automated Model Building Based on Pattern Recognition Thomas R. Ioerger Department of Computer Science Texas A&M University.
11/21/02Visualization Laboratory, Texas A&M University1 Next Generation Spatially Immersive Visualization Systems Prof. Frederic I. Parke Visualization.
Model Database. Scene Recognition Lamdan, Schwartz, Wolfson, “Geometric Hashing”,1988.
Prediction of Local Structure in Proteins Using a Library of Sequence-Structure Motifs Christopher Bystroff & David Baker Paper presented by: Tal Blum.
TEXTAL Progress Basic modeling of side-chain and backbone coordinates seems to be working well. –even for experimental MAD maps, 2.5-3A –using pattern-recognition.
Web Design Process CMPT 281. Outline How do we know good sites from bad sites? Web design process Class design exercise.
A Probabilistic Approach to Protein Backbone Tracing in Electron Density Maps Frank DiMaio, Jude Shavlik Computer Sciences Department George Phillips Biochemistry.
Multimedia. Definition What is Multimedia? Multimedia can have a many definitions these include: Multimedia means that computer information can be represented.
Business Requirements Using Unified Modeling Language Eric H. Castain, SVP Internet Services Group, Architecture Wells Fargo March 2005.
1 ISA&D7‏/8‏/ ISA&D7‏/8‏/2013 Systems Development Life Cycle Phases and Activities in the SDLC Variations of the SDLC models.
Transmembrane proteins in the Protein Data Bank: identification and classification Gabor, E. Tusnady, Zsuzanna Dosztanyi and Istvan Simon Bioinformatics,
Visualizing Information in Global Networks in Real Time Design, Implementation, Usability Study.
Community Evolution in Dynamic Multi-Mode Networks Lei Tang, Huan Liu Jianping Zhang Zohreh Nazeri Danesh Zandi & Afshin Rahmany Spring 12SRBIAU, Kurdistan.
Neural Networks for Protein Structure Prediction Brown, JMB 1999 CS 466 Saurabh Sinha.
Flexible Multi-scale Fitting of Atomic Structures into Low- resolution Electron Density Maps with Elastic Network Normal Mode Analysis Tama, Miyashita,
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence Friday, February 4, 2000 Lijun.
3D Animation 5. Character Animation Dr. Ashraf Y. Maghari Information Technology Islamic University of Gaza Ref. Book: The Art of Maya.
Use of Machine Learning in Chemoinformatics Irene Kouskoumvekaki Associate Professor December 12th, 2012 Biological Sequence Analysis course.
Modeling - Simulation and AI Software ©Ideler2002.
The Effects of Immersion and Navigation on the Acquisition of Spatial Knowledge of Abstract Data Networks James Henry, M.S. Nicholas F. Polys, Ph.D. Virginia.
Protein secondary structure Prediction Why 2 nd Structure prediction? The problem Seq: RPLQGLVLDTQLYGFPGAFDDWERFMRE Pred:CCCCCHHHHHCCCCEEEECCHHHHHHCC.
Systems Analysis and Design in a Changing World, Fourth Edition
Indirect Supervision Protocols for Learning in Natural Language Processing II. Learning by Inventing Binary Labels This work is supported by DARPA funding.
SOA-25: Data Distribution Solutions Using DataXtend ® Semantic Integrator for Sonic ™ ESB Users Jim Barton Solution Architect.
The GriPhyN Planning Process All-Hands Meeting ISI 15 October 2001.
Cmpe 589 Spring 2006 Lecture 2. Software Engineering Definition –A strategy for producing high quality software.
SIS Spatial Information Solutions April 23, 2005 MSU ERAC Presentation Spatial Information Solutions: A New Business Delivering Spatial Technology Research.
Web-based virtual reality in manufacturing applications
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
Sponsored by the National Science Foundation Measurement System Spiral 2 Year-end Project Review University of Wisconsin, Colgate University, Boston University.
Team Members Ming-Chun Chang Lungisa Matshoba Steven Preston Supervisors Dr James Gain Dr Patrick Marais.
© 2014 IBM Corporation IBM SPSS Modeler Gold on Cloud Jump Start Service.
Use of Machine Learning in Chemoinformatics
Real Time Collaboration and Sharing
Teaching Roles for Instructional Software Eric Sharp EDMS 6474.
Neural Networks Lecture 11: Learning in recurrent networks Geoffrey Hinton.
GPGPU Performance and Power Estimation Using Machine Learning Gene Wu – UT Austin Joseph Greathouse – AMD Research Alexander Lyashevsky – AMD Research.
Cluster Analysis What is Cluster Analysis? Types of Data in Cluster Analysis A Categorization of Major Clustering Methods Partitioning Methods.
Structure Visualization
Interaction and Animation on Geolocalization Based Network Topology by Engin Arslan.
INTRODUCTION TO GEOGRAPHICAL INFORMATION SYSTEM
Semantic Visualization
Modeling Unordered Point Data for 3D GIS Mapping
Reduce the need for human intervention in protein model building
Salevich Alex & Frenkel Eduard Wizard Hunting
network of simple neuron-like computing elements
Protein structure prediction.
Topological Signatures For Fast Mobility Analysis
Dr. Thomas R. Ioerger Department of Computer Science
OU BATTLECARD: Oracle WebCenter Training
Presentation transcript:

Current Status and Future Directions for TEXTAL March 2, 2003 The TEXTAL Group at Texas A&M: Thomas R. Ioerger James C. Sacchettini Tod Romo Kreshna Gopal Reetal Pai Vinod

Current model-building capabilities: targeting “medium-res” maps ( A) builds 80-90% of models in medium-quality maps typically 2-10 chains RMSD error ~1A for C-alphas and side-chains a.a. identity accuracy –near % for “good” maps recommended: make 2.8A map around 1 molecule time: several hours (depends on size of map)

Progress Update Integration with PHENIX Sequence Alignment –almost there (Tod) –80-100% acc. for good maps (IF5a, CzrA: hi-res data) –still face challenges with lower-quality density incorrect chain connections cause confusion truncated/unrepresentative side-chain density New algorithms: –connecting chain-breaks using fragment lookup (Reetal) –new density-scaling technique (experimental) –feature-weighting algorithm (Kreshna)

Current Developments Trying different databases –real vs. back-transformed maps (phase error?) –different resolutions! –clustering (rotamers, Kmeans, SVD, reduce size) –have to recalculate feature vectors and feature weights iterating between model-building and phase refinement –(not much progress since Sept...)

Graphics Desired: an interactive interface to TEXTAL –semi-automated, rather than black-box –give advice to user, but get their help –generate alternatives, but let them decide Approach: small targeted apps –special-purpose, customized graphical tools –e.g. link Capra chains, select better residues... Extend to VR and other interfaces

Textal Assistance

Virtual Benefits Vestibular cues reinforce stereo cues Immersive environment fosters rapid understanding of complex topologies Enabling technology for “low spatial” people Collaborative tool –Shared virtual environment –CAVElib support for connecting remote cave sessions

Longer-term Ideas Take structure factors as input, generate maps Extensions of pattern-recognition –apply to nucleotides, carbohydrates? –can use same features capturing shape of density? –train neural network to recognize phosphodiester bonds same as C-alpha’s –use linear/helical geometric constraints to build backbone; then fit bases like LOOKUP –how to distinguish DNA/RNA from protein?

Textal on-line