Presentation is loading. Please wait.

Presentation is loading. Please wait.

Introduction to Structural Bioinformatics Dong Xu Computer Science Department 271C Life Sciences Center 1201 East Rollins Road University of Missouri-Columbia.

Similar presentations


Presentation on theme: "Introduction to Structural Bioinformatics Dong Xu Computer Science Department 271C Life Sciences Center 1201 East Rollins Road University of Missouri-Columbia."— Presentation transcript:

1 Introduction to Structural Bioinformatics Dong Xu Computer Science Department 271C Life Sciences Center 1201 East Rollins Road University of Missouri-Columbia Columbia, MO 65211-2060 E-mail: xudong@missouri.edu 573-882-7064 (O) http://digbio.missouri.edu

2 Structural Bioinformatics l Prediction and modeling l Protein structure l DNA structure l RNA structure l Membrane structures l Large-complex structure

3 An Overview o A protein folds into a unique 3D structure under the physiological condition Lysozyme sequence (129 amino acids): KVFGRCELAA AMKRHGLDNY RGYSLGNWVC AAKFESNFNT QATNRNTDGS TDYGILQINS RWWCNDGRTP GSRNLCNIPC SALLSSDITA SVNCAKKIVS DGNGMNAWVA WRNRCKGTDV QAWIRGCRL Protein backbones: Side chain

4 Protein Structure Representations Lysozyme structure: ball & stick strand surface

5 [ PDB: http://www.pdb.org ] Growth of Protein Data Bank (PDB)

6 Protein Structure Database: PDB (1) l PDB (Protein Data Bank) Web site: http://www.rcsb.org/pdb/http://www.rcsb.org/pdb/ l 33,252 Structures as of 25-Oct-2005 l PDB ID: 4-character identifier (1cau, 1gox, and 256b) l Search methods * search by PDB ID (e.g. 1lyz); * SearchLite: protein name, author's name, etc. (e.g., HIV protease); * SearchFields: EC Number, the name of the binding ligand (e.g., inhibitor), the range of the protein size, and the secondary structure content.

7 Protein Structure Database: PDB (2) PDB format (headers + coordinates): HEADER OXIDOREDUCTASE (OXYGEN(A)) 14-JUN-89 1GOX 1GOX COMPND GLYCOLATE OXIDASE (E.C.1.1.3.1) 1GOX... ATOM 232 N ALA 29 54.035 64.332 19.352 1.00 23.93 1GOX ATOM 233 CA ALA 29 52.992 65.356 19.569 1.00 24.74 1GOX ATOM 234 C ALA 29 53.519 66.762 19.309 1.00 25.43 1GOX ATOM 235 O ALA 29 54.648 67.179 19.655 1.00 25.66 1GOX ATOM 236 CB ALA 29 52.433 65.340 20.993 1.00 24.54 1GOX... HETATM 3165 O HOH 658 62.480 62.480 0.000 0.50 65.79 1GOX... END

8 Molecular Visualization RasMol: http://www.umass.edu/microbio/rasmol/index2.htm http://www.umass.edu/microbio/rasmol/index2.htm VMD: http://www.ks.uiuc.edu/Research/vmd

9 Relevance of Protein Structure in the Post-Genome Era sequence structure function medicine

10 Structure-Function Relationship Certain level of function can be found without structure. But a structure is a key to understand the detailed mechanism. A predicted structure is a powerful tool for function inference. Trp repressor as a function switch

11 Structure-Based Drug Design HIV protease inhibitor Structure-based rational drug design is still a major method for drug discovery.

12 Structures in Protein Language: Letters  Words  Sentences Protein: Residues  Secondary Structure  Tertiary Structure

13 Primary, Secondary and Tertiary Structures of Proteins

14  helix  Single protein chain (local)  Shape maintained by intramolecular H bonding between -C=O and H-N-

15  sheet  Several protein chains  Shape maintained by intramolecular H bonding between chains  Non-local on protein sequence

16  -sheet (parallel, anti-parallel)

17 Dihedral angles

18 Ramachandran plot (alpha)

19 Ramachandran plot (beta)

20 Protein Structure Domain (1) o Structure domain: compact, globular unit glycoprotein actin

21 Protein Structure Domain (2) o Structure domain is evolutionary, functional, and folding unit of a protein o Domain insertion: insert: zinc metalloproteinase + parent: thioredoxin (disulfide oxidoreductase)  Dsba: disulfide bond forming protein o Protein design (growth hormone) o Threading target

22 Structure Is Better Conserved during Evolution Structure can adopt a wide range of mutations. Physical forces favor certain structures. Concept of fold. Number of fold is limited. Currently ~800 Total: 1,000s ~10,000s TIM barrel

23 The number of different protein folds is limited PDB submissions per year Year Already known folds New folds

24 Protein Folding Problem A protein folds into a unique 3D structure under the physiological condition Lysozyme sequence: KVFGRCELAA AMKRHGLDNY RGYSLGNWVC AAKFESNFNT QATNRNTDGS TDYGILQINS RWWCNDGRTP GSRNLCNIPC SALLSSDITA SVNCAKKIVS DGNGMNAWVA WRNRCKGTDV QAWIRGCRL

25 Web Addresses  Resource: http://digbio.missouri.edu/resource/ http://digbio.missouri.edu/resource/  Further reading (a review on protein modeling): www.bentham.org/cpps1-1/Dong%20Xu/xu_cpps.htm


Download ppt "Introduction to Structural Bioinformatics Dong Xu Computer Science Department 271C Life Sciences Center 1201 East Rollins Road University of Missouri-Columbia."

Similar presentations


Ads by Google