Docking Molecular Structures into EM Lecture 2. Another Rigid Body Docking Example Rossman 2000, Fitting atomic models into electron microscopy maps,

Slides:



Advertisements
Similar presentations
Applications of one-class classification
Advertisements

CSCE643: Computer Vision Bayesian Tracking & Particle Filtering Jinxiang Chai Some slides from Stephen Roth.
Not just a pretty picture Jul. 14 Hyunwook Lee.
Image Analysis Phases Image pre-processing –Noise suppression, linear and non-linear filters, deconvolution, etc. Image segmentation –Detection of objects.
Maximum Covariance Analysis Canonical Correlation Analysis.
3D Shape Histograms for Similarity Search and Classification in Spatial Databases. Mihael Ankerst,Gabi Kastenmuller, Hans-Peter-Kriegel,Thomas Seidl Univ.
Image Segmentation Image segmentation (segmentace obrazu) –division or separation of the image into segments (connected regions) of similar properties.
Learning to estimate human pose with data driven belief propagation Gang Hua, Ming-Hsuan Yang, Ying Wu CVPR 05.
Chapter 5 Raster –based algorithms in CAC. 5.1 area filling algorithm 5.2 distance transformation graph and skeleton graph generation algorithm 5.3 convolution.
Iterative Relaxation of Constraints (IRC) Can’t solve originalCan solve relaxed PRMs sample randomly but… start goal C-obst difficult to sample points.
Lecture 3 Jack Tanabe Old Dominion University Hampton, VA January 2011 Conformal Mapping.
December 5, 2013Computer Vision Lecture 20: Hidden Markov Models/Depth 1 Stereo Vision Due to the limited resolution of images, increasing the baseline.
Computing Protein Structures from Electron Density Maps: The Missing Loop Problem I. Lotan, H. van den Bedem, A. Beacon and J.C. Latombe.
ICIP 2000, Vancouver, Canada IVML, ECE, NTUA Face Detection: Is it only for Face Recognition?  A few years earlier  Face Detection Face Recognition 
Active Calibration of Cameras: Theory and Implementation Anup Basu Sung Huh CPSC 643 Individual Presentation II March 4 th,
HMM-BASED PATTERN DETECTION. Outline  Markov Process  Hidden Markov Models Elements Basic Problems Evaluation Optimization Training Implementation 2-D.
Two Examples of Docking Algorithms With thanks to Maria Teresa Gil Lucientes.
Protein Docking and Interactions Modeling CS 374 Maria Teresa Gil Lucientes November 4, 2004.
Detecting Image Region Duplication Using SIFT Features March 16, ICASSP 2010 Dallas, TX Xunyu Pan and Siwei Lyu Computer Science Department University.
FLEX* - REVIEW.
Face Recognition Based on 3D Shape Estimation
BL5203: Molecular Recognition & Interaction Lecture 5: Drug Design Methods Ligand-Protein Docking (Part I) Prof. Chen Yu Zong Tel:
1 UNIBO - Geomarketing - UNIT 3 - F. Fusco Francesco Fusco.
Protein Structure Prediction Samantha Chui Oct. 26, 2004.
Lecture II-2: Probability Review
1 Ion Optics Simulations What it is. How it’s useful. The SIMION ion optics software. –How it works. –Limitations and cautions –Demonstrations and examples.
Data Mining Techniques
Protein Tertiary Structure Prediction
1 CE 530 Molecular Simulation Lecture 7 David A. Kofke Department of Chemical Engineering SUNY Buffalo
Slides are based on Negnevitsky, Pearson Education, Lecture 12 Hybrid intelligent systems: Evolutionary neural networks and fuzzy evolutionary systems.
Topics covered Scope and applications of insilico modeling in modern biology. Comparative modeling Constructing an initial model refining the model manipulating.
OPTIMIZATION OF FUNCTIONAL BRAIN ROIS VIA MAXIMIZATION OF CONSISTENCY OF STRUCTURAL CONNECTIVITY PROFILES Dajiang Zhu Computer Science Department The University.
Texture. Texture is an innate property of all surfaces (clouds, trees, bricks, hair etc…). It refers to visual patterns of homogeneity and does not result.
Intelligent Vision Systems ENT 496 Object Shape Identification and Representation Hema C.R. Lecture 7.
Flexible Multi-scale Fitting of Atomic Structures into Low- resolution Electron Density Maps with Elastic Network Normal Mode Analysis Tama, Miyashita,
Module 1: Statistical Issues in Micro simulation Paul Sousa.
Model Construction: interpolation techniques 1392.
Evolving Virtual Creatures & Evolving 3D Morphology and Behavior by Competition Papers by Karl Sims Presented by Sarah Waziruddin.
24. Lecture WS 2008/09Bioinformatics III1 V24 Hybrid-methods for macromolecular complexes Structural Bioinformatics (a) Integration of structures of various.
CS 782 – Machine Learning Lecture 4 Linear Models for Classification  Probabilistic generative models  Probabilistic discriminative models.
November 13, 2014Computer Vision Lecture 17: Object Recognition I 1 Today we will move on to… Object Recognition.
Data Extraction using Image Similarity CIS 601 Image Processing Ajay Kumar Yadav.
Docking Molecular Structures into EM. Introduction Detailed models of proteins are required in order to elucidate the structure-function relationship.
December 9, 2014Computer Vision Lecture 23: Motion Analysis 1 Now we will talk about… Motion Analysis.
A Passive Approach to Sensor Network Localization Rahul Biswas and Sebastian Thrun International Conference on Intelligent Robots and Systems 2004 Presented.
Project 11: Determining the Intrinsic Dimensionality of a Distribution Okke Formsma, Nicolas Roussis and Per Løwenborg.
Project 11: Determining the Intrinsic Dimensionality of a Distribution Okke Formsma, Nicolas Roussis and Per Løwenborg.
CHAPTER 17 O PTIMAL D ESIGN FOR E XPERIMENTAL I NPUTS Organization of chapter in ISSO –Background Motivation Finite sample and asymptotic (continuous)
14. Lecture WS 2006/07Bioinformatics III1 V14 Hybrid-methods for macromolecular complexes Structural Bioinformatics (a) Integration of structures of various.
Clustering Instructor: Max Welling ICS 178 Machine Learning & Data Mining.
A Fast LBG Codebook Training Algorithm for Vector Quantization Presented by 蔡進義.
1 CLUSTER VALIDITY  Clustering tendency Facts  Most clustering algorithms impose a clustering structure to the data set X at hand.  However, X may not.
Vector Quantization Vector quantization is used in many applications such as image and voice compression, voice recognition (in general statistical pattern.
Atomic structure model
The KOSMOSHOWS What is it ? The statistic inside What it can do ? Future development Demonstration A. Tilquin (CPPM)
Surflex: Fully Automatic Flexible Molecular Docking Using a Molecular Similarity-Based Search Engine Ajay N. Jain UCSF Cancer Research Institute and Comprehensive.
Machine Vision Edge Detection Techniques ENT 273 Lecture 6 Hema C.R.
Digital Image Processing CSC331
Lecture 3 Patterson functions. Patterson functions The Patterson function is the auto-correlation function of the electron density ρ(x) of the structure.
MASKS © 2004 Invitation to 3D vision Lecture 3 Image Primitives andCorrespondence.
Computacion Inteligente Least-Square Methods for System Identification.
 Negnevitsky, Pearson Education, Lecture 12 Hybrid intelligent systems: Evolutionary neural networks and fuzzy evolutionary systems n Introduction.
Semi-Supervised Clustering
Real-time Wall Outline Extraction for Redirected Walking
Molecular Docking Profacgen. The interactions between proteins and other molecules play important roles in various biological processes, including gene.
Toshiro Oda, Keiichi Namba, Yuichiro Maéda  Biophysical Journal 
Object Recognition Today we will move on to… April 12, 2018
Biointelligence Laboratory, Seoul National University
SEG5010 Presentation Zhou Lanjun.
Structural Flexibility of CaV1. 2 and CaV2
Presentation transcript:

Docking Molecular Structures into EM Lecture 2

Another Rigid Body Docking Example Rossman 2000, Fitting atomic models into electron microscopy maps, Acta Cryst D56, nice description of 3 examples - nice description of 3 examples

Rossman 2000 Human rhinovirus- about 90 of 100 known serotypes use the cell surface glycoprotein ICAM-1 or CD54 as a receptor structure of ICAM-1 has been determined crystallographically HRV-ICAM complex only determined via cryo-EM. May be impossible to determine crystal structure because binding of ICAM is a recognition event that initiates virus break-up and the release of the virus RNA into host cell cytoplasm ICAM-1 usually functions to promote intercellular adhesion and signalling in response to inflammation HRV14-Fab17-IA

Rossman 2000 glycosolated structures of ICAM-1 with various elbow angles were fitted into EM map manually models were refined as rigid bodies in reciprocal space with respect to difference maps obtained by subtraction of HRV density from complex density using program X-PLOR the refinement process was reproducible to better than 1.5 A between equivalent atoms determine atomic interaction between virus and receptor

Automated Rigid-Body Docking Originally, docking was done manually However, a variety of computational docking algorithms have been developed to perform reliable and reproducible fitting of rigid structures into low-resolution maps

Correlation Coefficients The most widely used method for fitting atomic structures into low-resolution EM maps is a systematic maximization of the density cross-correlation of atomic models with electron density maps. packages used for this: COAN, DOCKEM and EMFIT The best results are obtained when the surface edges of individual components in a complex are well defined, and where there are only small regions of densities that cannot be assigned uniquely to a single component. However, at low resolution, standard correlation coefficients lie within a small range – problem of false positives in the fitting, particularly where the map includes densities not accounted for by docked compounds

Correlation Coefficients: Density Filtering Several density filtering operations have been proposed to overcome these difficulties: One approach involves the use of a mask to block the overlapping region between the densities arising from the individual docked components and the target map in calculations of the correlation coefficient ( Roseman, 2000). One approach involves the use of a mask to block the overlapping region between the densities arising from the individual docked components and the target map in calculations of the correlation coefficient ( Roseman, 2000). Roseman, 2000 Roseman, 2000

Correlation Coefficients: Density Filtering Another approach involves altering the functional form of the compared densities by applying a filter that enhances detection of contours (or surface) in the maps being compared Another approach involves altering the functional form of the compared densities by applying a filter that enhances detection of contours (or surface) in the maps being compared Filtering maximizes both density and contour overlap and therefore enhances numerical scoring contrast between potential solutions Filtering maximizes both density and contour overlap and therefore enhances numerical scoring contrast between potential solutions

Density Filtering

Correlation Coefficients: Density Filtering A drawback of density filtering is that the significant levels of noise that are present in low-resolution maps derived from electron microscopy can be amplified by certain density filtering approaches, which may increase the likelihood of "false- positive" fits of density.

Vector Quantization Automated docking is sometimes combined with vector distributions by vector quantization of the atomic model vector quantization is a data compression technique used in image and speech processing applications Vector quantization offers a flexible way to develop a reduced representation of 3D biological data from a variety of biophysical sources Situs package can be used for this

Vector Quantization data is represented by a small number of “codebook” vectors to reduce the combinatorial complexity of the structural comparison an actin monomer (backbone trace + atoms) encoded by four codebook vectors :

Vector Quantization In rigid-body docking, vector quantization is used to discretize both high- and low-resolution data sets. Pairs of corresponding codebook vectors are then identified and superimposed by a least- squares fit. The codebook vector least-squares fit results in a superposition of the corresponding high- and low-resolution data sets:

Core Weighting "core-weighting" approach - the construction of a complex structure from many components is simplified to a series of single component fitting procedures. The "core" region of a structure is defined as the part whose density distribution is unlikely to be altered by the presence of adjacent components. The "surface" region is the part that is accessible or can interact with other components. The region enclosed by the accessible surface thus belongs to the core region.

Core Weighting The single component fitting is conducted using a grid-threading Monte Carlo (GTMC) method that identifies the global maximum state (best fit) among a series of local maximum states determined by short Monte Carlo searches originating at a variety of grid points.

Flexible Docking Large macromolecular assemblies often undergo large functional rearrangements in some cases, the X-ray structure does not correspond to the EM structure This complicates the fitting, as not only orientation, but conformational rearrangements must be considered

Flexible Docking flexible fitting is often done by domain segmentation followed by fitting of each domain as a rigid body block (see earlier example with actin/myosin) However, large conformational changes often involve tightly coupled motions between domains

Flexible Docking: Vector Quantization in Situs Flexible docking combines vector quantization with molecular mechanics simulations where conformational changes require flexible fitting. Codebook vectors are used as constraints for the fitting. Flexible docking combines vector quantization with molecular mechanics simulations where conformational changes require flexible fitting. Codebook vectors are used as constraints for the fitting. The colored regions in the following diagram are the so-called Voronoi (nearest neighbor) cells whose centroids should coincide with the generating vectors:

Flexible Docking: Vector Quantization Fitting of the centroids to the low-resolution vector positions induces the desired conformational change. it is possible to constrain the distances between the vectors to reduce the effect of noise and experimental limitations on the vector positions:

Flexible Fitting: Normal Mode Analysis novel method for quantitative flexible docking of structures into EM maps in many cases, functional rearrangements of macromolecules can be described by a small number of low-frequency normal modes

Flexible Fitting: Normal Mode Analysis Normal mode refinement procedure relies on real space correlation between the calculated electron density map from atomic model and EM map The fitting is performed by deforming the structure along a set of low-frequency normal modes in order to maximize the correlation coefficient

Situs Situs is set of routines for quantitative docking of 3D data at variable resolution The software supports both rigid-body and flexible docking using a variety of fitting strategies. topology-representing neural networks are used to correlate the high- and low-resolution data sets – a form of vector quantization Situs is developed by the Laboratory for Structural Bioinformatics, biomachina.org. biomachina.org (other public software for map fitting - COAN ( Volkmann and Hanein, 1999), EMfit ( Rossmann et al., 2001), EMAN ( Jiang et al., 2001; Ludtke et al., 1999) (other public software for map fitting - COAN ( Volkmann and Hanein, 1999), EMfit ( Rossmann et al., 2001), EMAN ( Jiang et al., 2001; Ludtke et al., 1999) Volkmann and Hanein, 1999Rossmann et al., 2001Jiang et al., 2001Ludtke et al., 1999 Volkmann and Hanein, 1999Rossmann et al., 2001Jiang et al., 2001Ludtke et al., 1999

Situs Papers J Struc. Biol, 1999, 125: J Struc. Biol, 2001, 133:

Situs microtubules decorated with kinesin-related ncd proteins are used to demonstrate utility of packages Situs Situs step-by-step tutorial on this available Situs

Situs: Vector Quantization Given a certain number of vectors, need to make sure that the vector positions are statistically reliable and reproducible. Situs employs two quantization algorithms with different characteristics: the topology-representing neural network (TRN) performs a global stochastic search using random start vectors. Often a number of statistically independent TRN runs is repeated and the results are clustered and averaged. Useful if no prior information exists about the vector positions the topology-representing neural network (TRN) performs a global stochastic search using random start vectors. Often a number of statistically independent TRN runs is repeated and the results are clustered and averaged. Useful if no prior information exists about the vector positions Linde-Buzo-Gray (LBG, also called k -means) algorithms performs only a single run of local gradient descent search. It is useful if one wants to update existing vector positions, or if one wants to add distance constraints. Linde-Buzo-Gray (LBG, also called k -means) algorithms performs only a single run of local gradient descent search. It is useful if one wants to update existing vector positions, or if one wants to add distance constraints.

Situs: VIRTUAL REALITY developing a 3-dimensional graphics extension for Situs, termed SenSitus, that can support virtual-reality devices such as stereo glasses, 3- dimensional trackers, and force-feedback (haptic) devices. A force-feedback device measures a user's hand position and exerts a precisely controlled force on the hand. forces are calculated according to the correlation coefficient of density maps and crystallographic data.

Docking Issues/ Open Problems estimation of fitting quality validation of results estimation of fitting errors detection of ambiguities attempt at automated solutions

Possible Future Developments parallel computing can speedup process further development of methods to combine data from NMR, XRC and EM models derived by homology modelling are increasingly being used for docking studies - integration of docking methods with structural database searches

The “Other” docking: ligands and proteins Not this year …!