Structural Bioinformatics Yasaman Karami Master BIM-BMC Semestre 3, 2014-2015 Laboratoire de Biologie Computationnelle et Quantitative (LCQB) e-documents:

Slides:



Advertisements
Similar presentations
Population-based metaheuristics Nature-inspired Initialize a population A new population of solutions is generated Integrate the new population into the.
Advertisements

1 An Adaptive GA for Multi Objective Flexible Manufacturing Systems A. Younes, H. Ghenniwa, S. Areibi uoguelph.ca.
Tuesday, May 14 Genetic Algorithms Handouts: Lecture Notes Question: when should there be an additional review session?
Protein Threading Zhanggroup Overview Background protein structure protein folding and designability Protein threading Current limitations.
Protein Structure Prediction With Evolutionary Algorithms Natalio Krasnogor, U of the West of England William Hart, Sandia National Laboratories Jim Smith,
Spie98-1 Evolutionary Algorithms, Simulated Annealing, and Tabu Search: A Comparative Study H. Youssef, S. M. Sait, H. Adiche
EMBIO – Cambridge Particle Swarm Optimization applied to Automated Docking Automated docking of a ligand to a macromolecule Particle Swarm Optimization.
CISC667, F05, Lec21, Liao1 CISC 467/667 Intro to Bioinformatics (Fall 2005) Protein Structure Prediction 3-Dimensional Structure.
Engineering Optimization
Graphical Models for Protein Kinetics Nina Singhal CS374 Presentation Nov. 1, 2005.
Protein Tertiary Structure Prediction. Protein Structure Prediction & Alignment Protein structure Secondary structure Tertiary structure Structure prediction.
Energetics and kinetics of protein folding. Comparison to other self-assembling systems?
Mossbauer Spectroscopy in Biological Systems: Proceedings of a meeting held at Allerton House, Monticello, Illinois. Editors: J. T. P. DeBrunner and E.
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
1 Protein Structure Prediction Reporter: Chia-Chang Wang Date: April 1, 2005.
Molecular modelling / structure prediction (A computational approach to protein structure) Today: Why bother about proteins/prediction Concepts of molecular.
D Nagesh Kumar, IIScOptimization Methods: M1L4 1 Introduction and Basic Concepts Classical and Advanced Techniques for Optimization.
Genetic Threading By J.Yadgari and A.Amir Published: special issue on Bioinformatics in Journal of Constraints, June 2001 Alexandre Tchourbanov University.
Protein Side Chain Packing Problem: A Maximum Edge-Weight Clique Algorithmic Approach Dukka Bahadur K.C, Tatsuya Akutsu and Tomokazu Seki Proceedings of.
Inverse Kinematics for Molecular World Sadia Malik April 18, 2002 CS 395T U.T. Austin.
Computational Structure Prediction Kevin Drew BCH364C/391L Systems Biology/Bioinformatics 2/12/15.
1 Protein Folding Atlas F. Cook IV & Karen Tran. 2 Overview What is Protein Folding? Motivation Experimental Difficulties Simulation Models:  Configuration.
Construyendo modelos 3D de proteinas ‘fold recognition / threading’
CSM6120 Introduction to Intelligent Systems Other evolutionary algorithms.
Genetic Algorithms and Ant Colony Optimisation
JM - 1 Introduction to Bioinformatics: Lecture XVI Global Optimization and Monte Carlo Jarek Meller Jarek Meller Division of Biomedical.
Conformational Sampling
CRB Journal Club February 13, 2006 Jenny Gu. Selected for a Reason Residues selected by evolution for a reason, but conservation is not distinguished.
A new Approach to Structural Prediction of Proteins Heiko Schröder Bertil Schmidt Jiujiang Zhu School of Computer Engineering Nanyang Technological University.
Protein Folding in the 2D HP Model Alexandros Skaliotis – King’s College London Joint work with: Andreas Albrecht (University of Hertfordshire) Kathleen.
Doshisha Univ. JapanGECCO2002 Energy Minimization of Protein Tertiary Structure by Parallel Simulated Annealing using Genetic Crossover Takeshi YoshidaTomoyuki.
1 P9 Extra Discussion Slides. Sequence-Structure-Function Relationships Proteins of similar sequences fold into similar structures and perform similar.
Modern Heuristic Optimization Techniques and Potential Applications to Power System Control Mohamed A El-Sharkawi The CIA lab Department of Electrical.
(Particle Swarm Optimisation)
The Particle Swarm Optimization Algorithm Nebojša Trpković 10 th Dec 2010.
Evolving Virtual Creatures & Evolving 3D Morphology and Behavior by Competition Papers by Karl Sims Presented by Sarah Waziruddin.
Protein Folding and Modeling Carol K. Hall Chemical and Biomolecular Engineering North Carolina State University.
Iterative Improvement Algorithm 2012/03/20. Outline Local Search Algorithms Hill-Climbing Search Simulated Annealing Search Local Beam Search Genetic.
Approximation Algorithms For Protein Folding Prediction Giancarlo MAURI,Antonio PICCOLBONI and Giulio PAVESI Symposium on Discrete Algorithms, pp ,
Protein Structure Prediction
Single-solution based metaheuristics. Outline Local Search Simulated annealing Tabu search …
Probabilistic Algorithms Evolutionary Algorithms Simulated Annealing.
Biologically inspired algorithms BY: Andy Garrett YE Ziyu.
SwinTop: Optimizing Memory Efficiency of Packet Classification in Network Author: Chen, Chang; Cai, Liangwei; Xiang, Yang; Li, Jun Conference: Communication.
D Nagesh Kumar, IIScOptimization Methods: M8L5 1 Advanced Topics in Optimization Evolutionary Algorithms for Optimization and Search.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Protein backbone Biochemical view:
Mean Field Theory and Mutually Orthogonal Latin Squares in Peptide Structure Prediction N. Gautham Department of Crystallography and Biophysics University.
Protein Structure Prediction: Threading and Rosetta BMI/CS 576 Colin Dewey Fall 2008.
We propose an accurate potential which combines useful features HP, HH and PP interactions among the amino acids Sequence based accessibility obtained.
Genetic Algorithm. Outline Motivation Genetic algorithms An illustrative example Hypothesis space search.
Hirophysics.com The Genetic Algorithm vs. Simulated Annealing Charles Barnes PHY 327.
A new protein-protein docking scoring function based on interface residue properties Reporter: Yu Lun Kuo (D )
Genetic Algorithms.
Avdesh Mishra, Md Tamjidul Hoque {amishra2,
Computational Structure Prediction
Scientific Research Group in Egypt (SRGE)
Generating, Maintaining, and Exploiting Diversity in a Memetic Algorithm for Protein Structure Prediction Mario Garza-Fabre, Shaun M. Kandathil, Julia.
School of Computer Science & Engineering
Subject Name: Operation Research Subject Code: 10CS661 Prepared By:Mrs
Advanced Artificial Intelligence Evolutionary Search Algorithm
Metaheuristic methods and their applications. Optimization Problems Strategies for Solving NP-hard Optimization Problems What is a Metaheuristic Method?
Multi-Objective Optimization
3-Dimensional Structure
Yang Zhang, Andrzej Kolinski, Jeffrey Skolnick  Biophysical Journal 
Stochastic Local Search Variants Computer Science cpsc322, Lecture 16
Protein structure prediction.
Boltzmann Machine (BM) (§6.4)
Protein structure prediction
Presentation transcript:

Structural Bioinformatics Yasaman Karami Master BIM-BMC Semestre 3, Laboratoire de Biologie Computationnelle et Quantitative (LCQB) e-documents:

Protein Structure Prediction (PSP) Yasaman Karami – The goal is to predict the (complex) 3D structure (and some sub-features) of a protein from its amino acid sequence (a 1D object) Primary Sequence3D Structure

The function of a protein inside a cell depends on its 3D structure Protein folding procedure to understand their physical, chemical and biological features According to Science, the problem remains one of the top 125 outstanding issues in modern science.[1] Science[1] Some of the most successful methods have a reasonable probability of predicting the folds of small, single-domain proteins within 1.5 angstroms over the entire structure.[2][2] Drug design (using protein structures and docking methods) Elimination of some diseases (like Alzheimer) Yasaman Karami – The importance of PSP Insulin 3D structure Science 2005, 309: Dile et al.,Current Opinion in Structural Biology 2007, 17:342–346.

The huge number of possible conformations (Levinthal paradox) Physical features of proteins are still unknown. Some proteins could fold to multiple structures. The impact of nature and other macromolecules on folding procedure. Yasaman Karami – The difficulties Levinthal Paradox

Three categories: 1.Homology modeling/Comparative modeling 2.Threading 3.Ab-initio Yasaman Karami – MethodsMethods 1 23

Championed by the Nobel Prize Laureate At the environmental conditions (temperature, solvent concentration and composition, etc.) at which folding occurs, the native structure is a unique, stable and kinetically accessible minimum of the free energy. Optimization problem – Evolutionary Algorithms (Genetic Algorithm) – Bio-inspired Algorithms (Ant Colony Algorithm, etc.) Yasaman Karami – Anfinsen Dogma

1.The conformation representation The main degrees of freedom in forming the 3D trace of the polypeptide chain are the two dihedral angles on each side of the Ca atom. φ (phi, involving the backbone atoms C'-N-Cα-C') ψ (psi, involving the backbone atoms N-Cα-C'-N) ω (omega, involving the backbone atoms Cα-C'-N-Cα) φ controls the C'-C' distance, ψ controls the N-N distance and ω controls the Cα- Cαdistance. a dihedral or torsion angle is the angle between two planes Yasaman Karami – Steps of ab-initio prediction

Amino acid chains (proteins) are represented as connected beads on a 2D or 3D lattice HP: hydrophobic – hydrophilic property Hydrophobic amino acids can form a hydrophobic core energy potential Yasaman Karami – HP Protein Folding Model

Model adds energy value to each hydrophobic pair that are adjacent on lattice AND not consecutive in the sequence Goal: find low energy configurations! Yasaman Karami – HP Protein Folding Model

Proteins are represented using internal coordinates (vs. Cartesian) Absolute vs. Relative encoding Absolute Encoding: specifies an absolute direction cubic lattice: {U,D,L,R,F,B} Relative Encoding: specifies direction relative to the previous amino acid cubic lattice: {U,D,L,R,F} Yasaman Karami – Encodings for Internal Coordinates n-1

Yasaman Karami – Steps of ab-initio prediction (cont.)

3.Evaluation of predicted structure Yasaman Karami – Steps of ab-initio prediction (cont.)

Yasaman Karami – PSP using optimization algorithm

Yasaman Karami – PSP using optimization algorithm

Yasaman Karami – PSP using optimization algorithm

Yasaman Karami – PSP using optimization algorithm

Yasaman Karami – PSP using optimization algorithm

Genetic Algorithm (based on evolution process in human body): – Chromosome or conformation – Coding method – Fitness function – Selection – Cross over/Mating – Mutation Yasaman Karami – Genetic Algorithm (GA)

 stylized representation of the movement of organisms in a bird flock or fish school.  The local and global best position and velocity is updated permanently. Yasaman Karami – Particle Swarm Optimization (PSO) Hassan et al., American Institute of Aeronautics and Astronautics (2005). Reid et al., Evolutionary Computation (CEC), IEEE, 2014.

HP lattice simplification: – The number of H-H neighboring connections is computed and multiplied by -1. Yasaman Karami – HP lattice model

Yasaman Karami – combination of GA and PSO on HP lattice PSO is used to prevent the collision produced by mutation operator using hydrophobic -hydrophilic feature of amino acids. (very simple and inaccurate) PSO algorithm Lin et al., International Journal of Fuzzy Systems, 2011.

Algorithm – TS is a local neighborhood search algorithm, leads the next search based on a flexible memory function. It could search quickly but is weak to find the global optimum point. – In this work fitness function of the mutated children are calculated, then they are placed in the tabu list with their energy values. This list is sorted increasingly based on energy values and used to find the final mutated children. Crossover is implemented in TS the same as mutation with some differences; energy of offspring are compared with a threshold then placed in the tabu list. Yasaman Karami – Tabu Search

– AB off-lattice model is also based on hydrophobicity and divides the amino acids to two groups, A and B. – A stands for hydrophobic and B stands for hydrophilic amino acids. – In addition to the property of hydrophobicity, bend angles and the distance between amino acids should be considered to calculate the energy value. – This model is more accurate than HP lattice, also the calculations to find the energy value are simpler. Yasaman Karami – AB off-lattice

Yasaman Karami – combination of GA and TS on AB off-lattice Using Tabu Search method AB off-lattice Zhang et al., BMC Systems Biology, 2010.

The main idea: – finding the native structure of a given protein is not equivalent to “finding a native state needle in a conformational space haystack” but, instead, should be more like “finding a set of equivalent needles in a haystack”. CHARMM force field: Yasaman Karami – Multi-objective optimization

Two objective function: – Optimization using Pareto front Yasaman Karami – Multi-objective optimization Pareto front line Cutello et al., J. R. Soc. Interface, 2005.

Ant colony algorithm is inspired from the ants' social life and food foraging. Ants communicate with each other and find food source through pheromone trails. Yasaman Karami – Ant Colony Optimization (ACO)

It is based on marriage in honey bees and inspired from honey bee's reproduction process. Honey bee colony consists of a queen, drones, broods and workers. Workers are considered as local search algorithms that improve the results and queen as the answer. At the end, the improved and mutated brood will be considered as the queen if it has the better fitness value. Childs will be produced from crossover of the queen and drones. Yasaman Karami – Honey Bee Optimization (HBO)

SA is a generic probabilistic meta-heuristic for the global optimization problem of locating a good approximation to the global optimum of a given function in a large search space. Yasaman Karami – Simulated Annealing (SA) Simulated Annealing used for hill climbing

ConclusionConclusion Yasaman Karami – D structure gives clues to function determination 3D structure determination is difficult, slow and expensive The use of a near-optimal meta-heuristic algorithm, such as a Genetic Algorithm, is one of the most promising optimization methods as it explores minimal number of potential structures. Conformation representation and energy models are very important A powerful exploration of the conformational space can be achieved by using Optimization algorithms.