The TEXTAL System for Automated Model Building Thomas R. Ioerger Texas A&M University.

Slides:



Advertisements
Similar presentations
Applications of one-class classification
Advertisements

QR Code Recognition Based On Image Processing
Object Recognition using Invariant Local Features Applications l Mobile robots, driver assistance l Cell phone location or object recognition l Panoramas,
Unsupervised learning. Summary from last week We explained what local minima are, and described ways of escaping them. We investigated how the backpropagation.
Kiyoshi Irie, Tomoaki Yoshida, and Masahiro Tomono 2011 IEEE International Conference on Robotics and Automation Shanghai International Conference Center.
Protein Planes Bob Fraser CSCBC Overview Motivation Points to examine Results Further work.
Computing Protein Structures from Electron Density Maps: The Missing Loop Problem I. Lotan, H. van den Bedem, A. Beacon and J.C. Latombe.
Prediction to Protein Structure Fall 2005 CSC 487/687 Computing for Bioinformatics.
Ioerger Lab – Bioinformatics Research
Protein Tertiary Structure Prediction
Two Examples of Docking Algorithms With thanks to Maria Teresa Gil Lucientes.
A new face detection method based on shape information Pattern Recognition Letters, 21 (2000) Speaker: M.Q. Jing.
The TEXTAL System: Automated Model-Building Using Pattern Recognition Techniques Dr. Thomas R. Ioerger Department of Computer Science Texas A&M University.
CAPRA: C-Alpha Pattern Recognition Algorithm Thomas R. Ioerger Department of Computer Science Texas A&M University.
CONTENT BASED FACE RECOGNITION Ankur Jain 01D05007 Pranshu Sharma Prashant Baronia 01D05005 Swapnil Zarekar 01D05001 Under the guidance of Prof.
The 7 steps of Homology modeling. 1: Template recognition and initial alignment.
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John Wiley.
PcaA Mycolic acid cyclopropyl synthase (Smith&Sacchettini) original structure solved at 2.0A via MAD R-value = 0.22, R-free = residues,  fold.
Current Status and Future Directions for TEXTAL March 2, 2003 The TEXTAL Group at Texas A&M: Thomas R. Ioerger James C. Sacchettini Tod Romo Kreshna Gopal.
Unsupervised Learning and Data Mining
TEXTAL - Automated Crystallographic Protein Structure Determination Using Pattern Recognition Principal Investigators: Thomas Ioerger (Dept. Computer Science)
Protein structure prediction May 30, 2002 Quiz#4 on June 4 Learning objectives-Understand difference between primary secondary and tertiary structure.
Don't fffear the buccaneer Kevin Cowtan, York. ● Map simulation ⇨ A tool for building robust statistical methods ● 'Pirate' ⇨ A new statistical phase improvement.
Automated Model-Building with TEXTAL Thomas R. Ioerger Department of Computer Science Texas A&M University.
Recent Developments in TEXTAL Phenix Workshop Berkeley Sept Thomas R. Ioerger Texas A&M University.
TEXTAL: A System for Automated Model Building Based on Pattern Recognition Thomas R. Ioerger Department of Computer Science Texas A&M University.
Smart Traveller with Visual Translator for OCR and Face Recognition LYU0203 FYP.
Face Processing System Presented by: Harvest Jang Group meeting Fall 2002.
TEXTAL Progress Basic modeling of side-chain and backbone coordinates seems to be working well. –even for experimental MAD maps, 2.5-3A –using pattern-recognition.
Protein Structures.
Radial Basis Function (RBF) Networks
Face Recognition Using Neural Networks Presented By: Hadis Mohseni Leila Taghavi Atefeh Mirsafian.
A Probabilistic Approach to Protein Backbone Tracing in Electron Density Maps Frank DiMaio, Jude Shavlik Computer Sciences Department George Phillips Biochemistry.
The P HENIX project Crystallographic software for automated structure determination Computational Crystallography Initiative (LBNL) -Paul Adams, Ralf Grosse-Kunstleve,
 C. C. Hung, H. Ijaz, E. Jung, and B.-C. Kuo # School of Computing and Software Engineering Southern Polytechnic State University, Marietta, Georgia USA.
Homology Modeling David Shiuan Department of Life Science and Institute of Biotechnology National Dong Hwa University.
Protein Tertiary Structure Prediction
Methods in Medical Image Analysis Statistics of Pattern Recognition: Classification and Clustering Some content provided by Milos Hauskrecht, University.
ClusPro: an automated docking and discrimination method for the prediction of protein complexes Stephen R. Comeau, David W.Gatchell, Sandor Vajda, and.
Spatial Statistics Applied to point data.
CPSC 601 Lecture Week 5 Hand Geometry. Outline: 1.Hand Geometry as Biometrics 2.Methods Used for Recognition 3.Illustrations and Examples 4.Some Useful.
OBJECT RECOGNITION. The next step in Robot Vision is the Object Recognition. This problem is accomplished using the extracted feature information. The.
PDBe-fold (SSM) A web-based service for protein structure comparison and structure searches Gaurav Sahni, Ph.D.
Shape Matching for Model Alignment 3D Scan Matching and Registration, Part I ICCV 2005 Short Course Michael Kazhdan Johns Hopkins University.
Transmembrane proteins in the Protein Data Bank: identification and classification Gabor, E. Tusnady, Zsuzanna Dosztanyi and Istvan Simon Bioinformatics,
BALBES (Current working name) A. Vagin, F. Long, J. Foadi, A. Lebedev G. Murshudov Chemistry Department, University of York.
Neural Networks for Protein Structure Prediction Brown, JMB 1999 CS 466 Saurabh Sinha.
PROTEIN STRUCTURE CLASSIFICATION SUMI SINGH (sxs5729)
COMPARISON OF IMAGE ANALYSIS FOR THAI HANDWRITTEN CHARACTER RECOGNITION Olarik Surinta, chatklaw Jareanpon Department of Management Information System.
Machine Learning Neural Networks (3). Understanding Supervised and Unsupervised Learning.
Data Extraction using Image Similarity CIS 601 Image Processing Ajay Kumar Yadav.
Handwritten Recognition with Neural Network Chatklaw Jareanpon, Olarik Surinta Mahasarakham University.
What’s the Point? Working with 0-D Spatial Data in ArcGIS
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
6.S093 Visual Recognition through Machine Learning Competition Image by kirkh.deviantart.com Joseph Lim and Aditya Khosla Acknowledgment: Many slides from.
“Niche Work” Graham J Wills, Lucent Technologies (Bell Lab)
Topics in bioinformatics CS697 Spring 2011 Class 12 – Mar Molecular distance measurements Molecular transformations.
Protein Tertiary Structure Prediction Structural Bioinformatics.
Robodog Frontal Facial Recognition AUTHORS GROUP 5: Jing Hu EE ’05 Jessica Pannequin EE ‘05 Chanatip Kitwiwattanachai EE’ 05 DEMO TIMES: Thursday, April.
EBI is an Outstation of the European Molecular Biology Laboratory. PDBe-fold (SSM) A web-based service for protein structure comparison and structure searches.
Stony Brook Integrative Structural Biology Organization
Data Mining, Neural Network and Genetic Programming
Creating fuzzy rules from numerical data using a neural network
Reduce the need for human intervention in protein model building
Protein Planes Bob Fraser CSCBC 2007.
Protein Structures.
Antonio Plaza University of Extremadura. Caceres, Spain
Dr. Thomas R. Ioerger Department of Computer Science
Presentation transcript:

The TEXTAL System for Automated Model Building Thomas R. Ioerger Texas A&M University

Role of Automated Model Building input: map ==> output: model (coords) goal: automation –what level is possible? need for human judgement/correction for difficult cases? –incorporation in systems like PHENIX –use on beam-lines –detection of NCS; molecular replacement –iteration with phase improvement (Resolve)

Based on pattern recognition Consider a spherical region of 5Å radius... Have I ever seen a region of density similar to this in any previously-interpreted map? if so, use coordinates of atoms from matched region, translated and rotated metric: density correlation, but must be rotation-invariant (optimize orientation) The TEXTAL Approach

Feature Extraction faster distance metric: –weighted Euclidean distance of feature vectors examples of (rotation-invariant) features: –standard deviation, other statistics in region –distance to center of mass –moments of inertia, ratios (for symmetry) search a database of regions from solved maps, with features extracted off-line

1. sequence alignment 2. real-space refinement 3. heuristics to fix backbone Outline of the Process elec. dens. map atomic coords structure factors (with est. phases) CAPRA LOOKUP Post-Processing C-alpha chains (PDB file of predicted CA coords) initial model (complete coords) calculate features in 5A region around each C-alpha; search database for matches

CAPRA: C-Alpha Pattern Recognition Algorithm 1. Map scaling - adjust density so on average, >1.0 captures to 20% of volume, <-1.0 capture bottom 20% 2. Tracing - skeletonization - pseudo-atoms on 0.5A grid; eliminate lowest density pts first; don’t break connectivity 3. Calculate features for 5A region around each pseudo-atom 4. Use neural network to predict distance to nearest C-alpha –trained on features from random pts in 1A contour of known map 5. Select way-points: predicted closest locally, >2.5A apart 6. Link way-points together into C-alpha chains –consider quality of neural net prediction –prefer longer chains; don’t break off into side-chains –take secondary structure into account: straightness and helicity

Examples of CAPRA Steps

Example of CA-chains fit by CAPRA

Example of Models Built by Textal

Future Work correction by sequence alignment characterizing accuracy of Textal as function of: resolution, phase quality –at what point (of refinement) will it work? –how well will it work? (rmsd,  err ph ) iteration with phase improvement

Potential Points of Collaboration Tracer as a tool (and density scaling?) Using model-building for NCS detection, mask generation Interaction with solvent-flattening

Acknowledgements James C. Sacchettini –Kreshna Gopal –Reetal Pai –Tod Romo funding from National Institutes of Health