Presentation is loading. Please wait.

Presentation is loading. Please wait.

Structural Bioinformatics Workshop Max Shatsky Workshop home page:

Similar presentations


Presentation on theme: "Structural Bioinformatics Workshop Max Shatsky Workshop home page:"— Presentation transcript:

1 Structural Bioinformatics Workshop Max Shatsky Email: maxshats@post.tau.ac.ilmaxshats@post.tau.ac.il Workshop home page: http://bioinfo3d.cs.tau.ac.il/ http://bioinfo3d.cs.tau.ac.il/ (Follow “Courses” link)

2 Schedule Introduction Introduction to protein structure. Introduction to pattern matching. Protein structure alignment (comparison). Rigid/Flexible case. Protein Docking Rigid/Flexible case. GAMB++ library.

3  GAMB++ exercise  Presentation and Design Review  Final Project –Software Engineering –Efficiency of Solution –Working Examples and Test Cases –Documentation –Knowledge of all project aspects Grade Ingredients

4 Bioinformatics - Computational Genomics DNA mapping. Protein or DNA sequence comparisons. Exploration of huge textual databases. In essence one- dimensional methods and intuition.

5 Structural Bioinformatics - Structural Genomics Elucidation of the 3D structures of biomolecules. Analysis and comparison of biomolecular structures. Prediction of biomolecular recognition. Handles three-dimensional (3-D) structures. Geometric Computing. (a methodology shared by Computational Geometry, Computer Vision, Computer Graphics, Pattern Recognition etc.)

6 Protein Structural Comparison ApoAmicyanin - 1aaj Pseudoazurin - 1pmy

7 Algorithmic Solution About 1 sec. Fischer, Nussinov, Wolfson ~ 1990.

8 Multiple Structural Comparison Globins

9 FlexProt – Flexible Protein Alignment

10 http://bioinfo3d.math.tau.ac.il/FlexProt

11 Example : Trypsin/Trypsin inhibitor Figure from B. Honig’s Labs web-site at Columbia University.

12 Introduction to Protein Structure

13 The central dogma DNA ---> mRNA ---> Protein {A,C,G,T} {A,C,G,U} {A,D,..Y} Guanine-Cytosine T->U Thymine-Adenine 4 letter alphabets 20 letter alphabet Sequence of nucleic acids seq of amino acids

14 When genes are expressed, the genetic information (base sequence) on DNA is first transcribed (copied) to a molecule of messenger RNA in a process similar to DNA replication. The mRNA molecules then leave the cell nucleus and enter the cytoplasm, where triplets of bases ((codons) forming the genetic code specify the particular amino acids that make up an individual protein. This process, called translation, is accomplished by ribosomes (cellular components composed of proteins and another class of RNA) that read the genetic code from the mRNA, and transfer RNAs (tRNAs) that transport amino acids to the ribosomes for attachment to the growing protein. (From www.ornl.gov/hgmis/publicat/primer/ )www.ornl.gov/hgmis/publicat/primer/

15 Amino acids and the peptide bond C  – first side chain carbon (except for glycine ). Cα atoms

16

17 Wire-frame or ribbons display

18

19 Geometric Representation 3-D Curve {v i }, i=1…n

20

21 Secondary structure

22 Hydrogen bonds.  strands and sheets

23

24

25 The Holy Grail - Protein Folding From Sequence to Structure. Relatively primitive computational folding models have proved to be NP complete even in the 2-D case.

26 Determination of protein structures X-ray Crystallography NMR (Nuclear Magnetic Resonance) EM (Electron microscopy)

27 An NMR result is an ensemble of models Cystatin (1a67)

28 The Protein Data Bank (PDB) International repository of 3D molecular data. Contains x-y-z coordinates of all atoms of the molecule and additional data. http://pdb.tau.ac.il http://www.rcsb.org/pdb/

29

30

31 Why bother with structures when we have sequences ? In evolutionary related proteins structure is much better preserved than sequence. Structural motifs may predict similar biological function. Getting insight into protein folding. Recovering the limited (?) number of protein folds.

32 Applications Classification of protein databases by structure. Search of partial and disconnected structural patterns in large databases. Extracting Structure information is difficult, we want to extract “new” folds.

33 Applications (continued) Speed up of drug discovery. Detection of structural pharmacophores in an ensemble of drugs (similar substructures in drugs acting on a given receptor – pharmacophore). Comparison and detection of drug receptor active sites (structurally similar receptor cavities could bind similar drugs).

34 Structural Bioinformatics Lab Goals Development of state of the art algorithmic methods to tackle major computational tasks in protein structure analysis, biomolecular recognition, and Computer Assisted Drug Design. Establish truly interdisciplinary collaboration between Life and Computer Sciences.

35 Object Recognition

36 Geometric Task : find those rotations and translations of one of the point sets which produce “large” superimpositions of corresponding 3-D points. Given two configurations of points in the three dimensional space, T

37 Geometric Task (continued) Aspects: Object representation (points, vectors, segments) Object resemblance (distance function) Transformation (translations, rotations, scaling)

38 Transformations Translation Translation and Rotation Rigid Motion (Euclidian Trans.) Translation, Rotation + Scaling

39 Distance Functions Two point sets: A={a i } i=1…n B={b j } j=1…m Pairwise Correspondence: (a k 1,b t 1 ) (a k 2,b t 2 )… (a k N,b t N ) (1) Exact Matching: ||a k i – b t i ||=0 (2) RMSD (Root Mean Square Distance) Sqrt( Σ||a k i – b t i || 2 /N) < ε Hausdorff distance: h(A,B)=max aєA min bєB ||a– b|| H(A,B)=max( h(A,B), h(B,A))


Download ppt "Structural Bioinformatics Workshop Max Shatsky Workshop home page:"

Similar presentations


Ads by Google