#19 - Protein Structure Basics & Classification

Slides:



Advertisements
Similar presentations
Hidden Markov Model in Biological Sequence Analysis – Part 2
Advertisements

Protein Structure C483 Spring 2013.
Protein Structure Prediction
Protein Structure – Part-2 Pauling Rules The bond lengths and bond angles should be distorted as little as possible. No two atoms should approach one another.
The amino acids in their natural habitat. Topics: Hydrogen bonds Secondary Structure Alpha helix Beta strands & beta sheets Turns Loop Tertiary & Quarternary.
Bioinformatics Finding signals and motifs in DNA and proteins Expectation Maximization Algorithm MEME The Gibbs sampler Lecture 10.
The Structure and Functions of Proteins BIO271/CS399 – Bioinformatics.
Protein-a chemical view A chain of amino acids folded in 3D Picture from on-line biology bookon-line biology book Peptide Protein backbone N / C terminal.
1 Levels of Protein Structure Primary to Quaternary Structure.
Protein Functions: catalyze reactions (enzymes) receptors (eg. pain receptors) transport (ions across membranes, oxygen in blood) molecular motors recognition.
Lecture 9 Hidden Markov Models BioE 480 Sept 21, 2004.
Protein Basics Protein function Protein structure –Primary Amino acids Linkage Protein conformation framework –Dihedral angles –Ramachandran plots Sequence.
Protein Structure Elements Primary to Quaternary Structure.
(Foundation Block) Dr. Ahmed Mujamammi Dr. Sumbul Fatma
Proteins: Levels of Protein Structure Conformation of Peptide Group
Housekeeping Your performance on the exam has caused me to re-evaluate how homework will be handled I will now be picking up every problem assigned on.
Supersecondary structures. Supersecondary structures motifs motifs or folds, are particularly stable arrangements of several elements of the secondary.
What are proteins? Proteins are important; e.g. for catalyzing and regulating biochemical reactions, transporting molecules, … Linear polymer chain composed.
CRB Journal Club February 13, 2006 Jenny Gu. Selected for a Reason Residues selected by evolution for a reason, but conservation is not distinguished.
Proteins. Proteins? What is its How does it How is its How does it How is it Where is it What are its.
Proteins: Amino Acid Chains DNA Polymerase from E. coli Standard amino acid backbone: Carboxylic acid group, amino group, the alpha hydrogen and an R group.
Good solutions are advantageous Christophe Roos - MediCel ltd Similarity is a tool in understanding the information in a sequence.
Protein Structure Stryer Short Course Chapter 4. Peptide bonds Amide bond Primary structure N- and C-terminus Condensation and hydrolysis.
 It refers to the amino acid content (type and number),and sequence in the polypeptide chain and the location of the disulfide bonds if present. 
Multiple Alignment and Phylogenetic Trees Csc 487/687 Computing for Bioinformatics.
11/09/05 D Dobbs ISU - BCB 444/544X: Protein Structure Databases - cont.1 11/9/05 Protein Structure Databases (continued) Prediction & Modeling.
Protein Folding & Biospectroscopy F14PFB David Robinson Mark Searle Jon McMaster
CS790 – BioinformaticsProtein Structure and Function1 Review of fundamental concepts  Know how electron orbitals and subshells are filled Know why atoms.
Part I : Introduction to Protein Structure A/P Shoba Ranganathan Kong Lesheng National University of Singapore.
Protein Structure 1 Primary and Secondary Structure.
The α-helix forms within a continuous strech of the polypeptide chain 5.4 Å rise, 3.6 aa/turn  1.5 Å/aa N-term C-term prototypical  = -57  ψ = -47 
HMMs for alignments & Sequence pattern discovery I519 Introduction to Bioinformatics.
Bioinformatics Ayesha M. Khan 9 th April, What’s in a secondary database?  It should be noted that within multiple alignments can be found conserved.
Chap. 4. Problem 1. Part (a). Double and triple bonds are shorter and stronger than single bonds. Because the length of a peptide bond more closely resembles.
11/07/05 D Dobbs ISU - BCB 444/544X: Protein Structure: Classification, Databases, Visualization 1 11/7/05 Protein Structure: Classification, Databases,
Motif discovery and Protein Databases Tutorial 5.
PROTEIN PATTERN DATABASES. PROTEIN SEQUENCES SUPERFAMILY FAMILY DOMAIN MOTIF SITE RESIDUE.
10/8/07BCB 444/544 F07 ISU Dobbs #20 - Protein Structure Basics & Classification1 BCB 444/544 Lecture 20 Protein Structure Basics, Visualization, Classification.
Protein Structure and Bioinformatics. Chapter 2 What is protein structure? What are proteins made of? What forces determines protein structure? What is.
Proteins: 3D-Structure Chapter 6 (9 / 17/ 2009)
Protein backbone Biochemical view:
Structure and Function
Lecture 47: Structure II -- Proteins. Today’s Outline The monomers: amino acids – Side chain characteristics – Acid-base equilibria and pK a Peptide backbone.
9/24/07BCB 444/544 F07 ISU Dobbs #14 - Review: Nucleus, Chromosomes, Genes, RNA, Protein1 BCB 444/544 Lecture 14 Review: Nucleus, Chromosomes, Genes, RNA,
Structural organization of proteins
Mir Ishruna Muniyat. Primary structure (Amino acid sequence) ↓ Secondary structure ( α -helix, β -sheet ) ↓ Tertiary structure ( Three-dimensional.
Protein Structure BL
Biochemistry Free For All
Dr. Jagdish Kaur, P.G.G.C.,Sector 11 Chandigarh
Protein Structure and Properties
The heroic times of crystallography
Protein Structure September 7,
The Peptide Bond Amino acids are joined together in a condensation reaction that forms an amide known as a peptide bond.
Lecture 5 Protein Structure.
Conformationally changed Stability
The Peptide Bond Amino acids are joined together in a condensation reaction that forms an amide known as a peptide bond.
There are four levels of structure in proteins
Chapter 3 Proteins.
Protein Structure Prediction
Sequence Based Analysis Tutorial
Conformationally changed Stability
BCB 444/544 Lecture 7 #7_Sept5 Global vs Local Alignment
Levels of Protein Structure
Protein structure prediction.
Binding and Conformational Change
HIDDEN MARKOV MODELS IN COMPUTATIONAL BIOLOGY
Protein structure (Foundation Block).
Fig 3.13 Reproduced from: Biochemistry by T.A. Brown, ISBN: © Scion Publishing Ltd, 2017.
The Three-Dimensional Structure of Proteins
Presentation transcript:

#19 - Protein Structure Basics & Classification BCB 444/544 10/5/07 Lecture 19 A bit of: Protein Structure - Basics Protein Structure Visualization, Classification & Comparison #19_Oct05 BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

Required Reading (before lecture) #19 - Protein Structure Basics & Classification Required Reading (before lecture) 10/5/07 √Mon Oct 1 - Lecture 17 Protein Motifs & Domain Prediction Chp 7 - pp 85-96 √ Wed Oct 3 - Lecture 18 Protein Structure: Basics (Note chg in Lecture Schedule online ) Chp 12 - pp 173-186 √Thurs Oct 4 & Fri Oct 5 - Lab 6 & Lecture 19 Protein Structure: Basics, Databases, Visualization, Classification & Comparison Chp 13 - pp 187-199 BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

BCB 544 - Extra Required Reading #19 - Protein Structure Basics & Classification 10/5/07 BCB 544 - Extra Required Reading Assigned Mon Sept 24 BCB 544 Extra Required Reading Assignment: for 544 Extra HW#1 Task 2 Pollard KS, …., Haussler D. (2006) An RNA gene expressed during cortical development evolved rapidly in humans. Nature 443: 167-172. http://www.nature.com/nature/journal/v443/n7108/abs/nature05113.html doi:10.1038/nature05113 PDF available on class website - under Required Reading Link BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

BCB 544 Projects (Optional for BCB 444) #19 - Protein Structure Basics & Classification 10/5/07 BCB 544 Projects (Optional for BCB 444) For a better idea about what's involved in the Team Projects, please look over last year's expectations for projects: http://www.public.iastate.edu/~f2007.com_s.544/project.htm Criteria for evaluation of projects (oral presentations) are summarized here: http://www.public.iastate.edu/%7Ef2007.com_s.544/homework/HW7.pdf Please note: wrong URL (instead of that shown above) was included in originally posted 544ExtraHW#1; corrected version is posted now BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

Assignments & Announcements - #1 #19 - Protein Structure Basics & Classification 10/5/07 Assignments & Announcements - #1 Students registered for BCB 444: Two Grading Options 1) Take Final Exam per original Grading Policies 2) Instead of taking Final Exam - you may participate in a Team Research Project If you choose #2, please do 3 things: Contact Drena (in person) Send email to Michael Terribilini (terrible@iastate.edu) Complete 544 Extra HW#1 - Task 1.1 by noon on Mon Oct 1 BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

Assignments & Announcements - #2 #19 - Protein Structure Basics & Classification 10/5/07 Assignments & Announcements - #2 BCB 444s (Standard): 200 pts Midterm Exams = 100 points each 200 Homework & Laboratory assignments = 200 points 100 Final Exam 500 pts Total for BCB 444 BCB 444p (Project): 190 Team Research Project 590 pts Total for BCB 444p BCB 544: 200 pts Midterm Exams = 100 points each 200 Homework & Laboratory assignments 200 Discussion Questions & Team Research Projects 700 pts Total for BCB 544 BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

Assignments & Announcements #3 #19 - Protein Structure Basics & Classification 10/5/07 Assignments & Announcements #3 ALL: HomeWork #3 Due: Mon Oct 8 by 5 PM HW544: HW544Extra #1 √Due: Task 1.1 - Mon Oct 1 by noon Due: Task 1.2 & Task 2 - Fri Oct 12 by 5 PM (not Monday) 444 "Project-instead-of-Final" students should also submit: HW544Extra #1 Due: Task 1.1 - Mon Oct 8 by noon Due: Task 1.2 - Fri Oct 12 by 5 PM (not Monday) Task 2 NOT required! BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

QUESTIONS re: HW#3? Due Mon #19 - Protein Structure Basics & Classification 10/5/07 QUESTIONS re: HW#3? Due Mon BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

HMM example from Eddy HMM paper: Toy HMM for Splice Site Prediction #19 - Protein Structure Basics & Classification 10/5/07 This is a new slide HMM example from Eddy HMM paper: Toy HMM for Splice Site Prediction BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

An HMM for Occasionally Dishonest Casino #19 - Protein Structure Basics & Classification An HMM for Occasionally Dishonest Casino 10/5/07 Transition probabilities Prob(Fair  Loaded) = 0.01 Prob(Loaded  Fair) = 0.2 But, where do you start? "Begin" state not shown BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

Occasionally Dishonest Casino - HW#3 #19 - Protein Structure Basics & Classification Occasionally Dishonest Casino - HW#3 10/5/07 "Begin" state? 50:50 chance of starting with F vs L die BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

Calculating Different Paths to an Observed Sequence #19 - Protein Structure Basics & Classification 10/5/07 This slide has been changed Calculating Different Paths to an Observed Sequence Calculations such as those shown below are used to fill a matrix with probability values for every state at every position transition probability emission probability BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

#19 - Protein Structure Basics & Classification 10/5/07 Calculate optimal path? Construct a matrix of probability values for every state at every residue How: one way = Viterbi Algorithm Initialization (i = 0) Recursion (i = 1, . . . , L): For each state k Termination: To find *, use trace-back, as in dynamic programming BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

Viterbi for Calculating Most Probable Path* #19 - Protein Structure Basics & Classification Viterbi for Calculating Most Probable Path* 10/5/07 1  x 6 2  (1/6)(1/2) = 1/12 (1/2)(1/2) = 1/4 (1/6)max{(1/12)0.99, (1/4)0.2} = 0.01375 (1/10)max{(1/12)0.01, (1/4)0.8} = 0.02 B F L (1/6)max{0.013750.99, 0.020.2} = 0.00226875 (1/2)max{0.013750.01, 0.020.8} = 0.08 * Path within HMM that matches query sequence with highest probability BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

#19 - Protein Structure Basics & Classification Total Probability 10/5/07 Several different paths can result in observation x Probability that our model will emit x is: BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

Calculating the Total Probability: #19 - Protein Structure Basics & Classification 10/5/07 This slide has bee changed Calculating the Total Probability: Note: This not the same as matrix on previous slide! Here, last column contains sums for each row 1  x 6 2  (1/6)(1/2) = 1/12 (1/2)(1/2) = 1/4 (1/6)sum{(1/12)0.99, (1/4)0.2} = 0.022083 (1/10)sum{(1/12)0.01, (1/4)0.8} = 0.020083 B F L (1/6)sum{0.0220830.99, 0.0200830.2} = 0.004313 (1/2)sum{0.0220830.01, 0.0200830.8} = 0.008144 Total probability = = 0 + 0.004313 + 0.008144 = 0.012 BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

A few more Details re: Profiles & HMMs #19 - Protein Structure Basics & Classification 10/5/07 A few more Details re: Profiles & HMMs Smoothing or "Regularization" - method used to avoid "over-fitting" Common problem in machine learning (data-driven) approaches Limited training sample size causes over-representation of observed characters while "ignoring" unobserved characters Result? Miss members of family not yet sampled (too many false negative hits) Pseudocounts - adding artificial values for 'extra' amino acid(s) not observed in the training set Treated as a 'real' values in calculating probabilities Improve predictive power of profiles & HMMs Dirichlet mixture - commonly used mathematical model to simulate the aa distribution in a sequence alignment To "correct" problems in an observed alignment based on limited number of sequences BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

Chp 7 - Protein Motifs & Domain Prediction #19 - Protein Structure Basics & Classification 10/5/07 Chp 7 - Protein Motifs & Domain Prediction SECTION II SEQUENCE ALIGNMENT Xiong: Chp 7 Protein Motifs and Domain Prediction √Identification of Motifs & Domains in MSAs √Motif & Domain Databases Using Regular Expressions √Motif & Domain Databases Using Statistical Models Protein Family Databases Motif Discovery in Unaligned Sequences √Sequence Logos BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

#19 - Protein Structure Basics & Classification 10/5/07 Motifs & Domains Motif - short conserved sequence pattern Associated with distinct function in protein or DNA Avg = 10 residues (usually 6-20 residues) e.g., zinc finger motif - in protein e.g., TATA box - in DNA Domain - "longer" conserved sequence pattern, defined as a independent functional and/or structural unit Avg = 100 residues (range from 40-700 in proteins) e.g., kinase domain or transmembrane domain - in protein Domains may (or may not) include motifs BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

#19 - Protein Structure Basics & Classification 10/5/07 2 Approaches for Representing "Consensus" Information in Motifs & Domains Regular expression - symbolic representation of information from MSA e.g., protein phosphorylation site motif: [S,T]- X- [R,K] Symbols represent specific or unspecified residues, spaces, etc. 2 mechanisms for matching: Exact "Fuzzy" (inexact, approximate) - flexible, more permissive to detect "near matches" Statistical model - includes probability information derived from MSA e.g., PSSM, Profile, or HMM BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

Motif & Domain Databases #19 - Protein Structure Basics & Classification 10/5/07 Motif & Domain Databases Based on regular expressions: Prosite (Interpro includes Prosite, PRINTS, etc) Emofit Limitation: these don't take probability info into account Based on statistical models: PRINTS BLOCKS ProDom Pfam SMART CDART Reverse PsiBLAST READ your textbook & try some of these at home; there are distinct advantages/disadvantages associated with each TAKE HOME LESSON: Always try several methods! (not just one!) BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

Protein Family Databases #19 - Protein Structure Basics & Classification 10/5/07 Protein Family Databases In addition to databases of "related" protein sequences, based on shared motifs or domains (Pfam, BLOCKS, CDART), some databases "cluster" sequences into families based on near full-length sequence comparisons COGs - Clusters of Orthologous Groups (at NCBI) Mostly Prokaryotic sequences KOG = newer Eukaryotic version COGnitor - softwared to search database ProtoNet - also clusters of homologous protein sequences Advantages: tree-like hierarchical structure Provide GO (gene ontology) annotations Provides InterPro keywords BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

Motif Discovery in Unaligned Sequences #19 - Protein Structure Basics & Classification 10/5/07 Motif Discovery in Unaligned Sequences Expectation Maximization - generate"random" alignment of all sequences, derive PSSM, iteratively match individual sequences to PSSM to edit & improve it Problems? Can hit a local optimum (premature convergence) Sensitive to initial alignment MEME - Multiple EM for Motif Elicitation - modified EM, avoids local optimum issues; two step procedure Gibbs Sampling - generate "trial" PSSM from random alignment first, as in EM, but leave one sequence out of initial alignment, then iteratively match PSSM to left-out sequences Gibbs Sampler - web-based motif search via Gibbs sampling Not mentioned in textbook: Stochastic context-free grammers Other "state of the art"pproaches in recent literature, but not available in web-based servers (yet) BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

Chp 12 - Protein Structure Basics #19 - Protein Structure Basics & Classification 10/5/07 Chp 12 - Protein Structure Basics SECTION V STRUCTURAL BIOINFORMATICS Xiong: Chp 12 Protein Structure Basics LAB 6 Introduction to Protein DataBank - PDB PyMol Cn3D? BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

Chp 12 - Protein Structure Basics #19 - Protein Structure Basics & Classification 10/5/07 Chp 12 - Protein Structure Basics SECTION V STRUCTURAL BIOINFORMATICS Xiong: Chp 12 Protein Structure Basics Amino Acids Peptide Bond Formation Dihedral Angles Hierarchy Secondary Structures Tertiary Structures Determination of Protein 3-Dimensional Structure Protein Structure DataBank (PDB) BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

Protein Structure & Function #19 - Protein Structure Basics & Classification Protein Structure & Function 10/5/07 Protein structure - primarily determined by sequence Protein function - primarily determined by structure Globular proteins: compact hydrophobic core & hydrophilic surface Membrane proteins: special hydrophobic surfaces Folded proteins are only marginally stable Some proteins do not assume a stable "fold" until they bind to something = Intrinsically disordered Predicting protein structure and function can be very hard -- & fun! BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

4 Basic Levels of Protein Structure #19 - Protein Structure Basics & Classification 4 Basic Levels of Protein Structure 10/5/07 BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

Primary & Secondary Structure #19 - Protein Structure Basics & Classification 10/5/07 Primary & Secondary Structure Primary Linear sequence of amino acids Description of covalent bonds linking aa’s Secondary Local spatial arrangement of amino acids Description of short-range non-covalent interactions Periodic structural patterns: -helix, b-sheet BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

Tertiary & Quaternary Structure #19 - Protein Structure Basics & Classification Tertiary & Quaternary Structure 10/5/07 Tertiary Overall 3-D "fold" of a single polypeptide chain Spatial arrangement of 2’ structural elements; packing of these into compact "domains" Description of long-range non-covalent interactions (plus disulfide bonds) Quaternary In proteins with > 1 polypeptide chain, spatial arrangement of subunits BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

"Additional" Structural Levels #19 - Protein Structure Basics & Classification "Additional" Structural Levels 10/5/07 Super-secondary elements Motifs Domains Foldons BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

#19 - Protein Structure Basics & Classification Amino Acids 10/5/07 Each of 20 different amino acids has different "R-Group" or side chain attached to Ca BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

Peptide Bond is Rigid and Planar #19 - Protein Structure Basics & Classification Peptide Bond is Rigid and Planar 10/5/07 BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

Hydrophobic Amino Acids #19 - Protein Structure Basics & Classification Hydrophobic Amino Acids 10/5/07 BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

#19 - Protein Structure Basics & Classification Charged Amino Acids 10/5/07 BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

#19 - Protein Structure Basics & Classification Polar Amino Acids 10/5/07 BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

Certain Side-chain Configurations are Energetically Favored (Rotamers) #19 - Protein Structure Basics & Classification 10/5/07 Ramachandran plot: "Allowable" psi & phi angles BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

Glycine is Smallest Amino Acid R group = H atom #19 - Protein Structure Basics & Classification Glycine is Smallest Amino Acid R group = H atom 10/5/07 Glycine residues increase backbone flexibility because they have no R group BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

#19 - Protein Structure Basics & Classification Proline is Cyclic 10/5/07 Proline residues reduce flexibility of polypeptide chain Proline cis-trans isomerization is often a rate-limiting step in protein folding Recent work suggests it also may also regulate ligand binding in native proteins Andreotti (BBMB) BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

Cysteines can Form Disulfide (S-S) Bonds #19 - Protein Structure Basics & Classification Cysteines can Form Disulfide (S-S) Bonds 10/5/07 Disulfide bonds (covalent) stabilize 3-D structures In eukaryotes, disulfide bonds are often found in secreted proteins or extracellular domains BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

Globular Proteins Have a Compact Hydrophobic Core #19 - Protein Structure Basics & Classification Globular Proteins Have a Compact Hydrophobic Core 10/5/07 Packing of hydrophobic side chains into interior is main driving force for folding Problem? Polypeptide backbone is highly polar (hydrophilic) due to polar -NH and C=O in each peptide unit (which are charged at neutral pH=7, found in biological systems); these polar groups must be neutralized Solution? Form regular secondary structures, e.g., -helix, b-sheet, stabilized by H-bonds BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

Exterior Surface of Globular Proteins is Generally Hydrophilic #19 - Protein Structure Basics & Classification 10/5/07 Exterior Surface of Globular Proteins is Generally Hydrophilic Hydrophobic core formed by packed secondary structural elements provides compact, stable core "Functional groups" of protein are attached to this framework; exterior has more flexible regions (loops) and polar/charged residues Hydrophobic "patches" on protein surface are often involved in protein-protein interactions BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

Protein Secondary Structures #19 - Protein Structure Basics & Classification Protein Secondary Structures 10/5/07 Helices Sheets Loops Coils BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

Helix: Stabilized by H-bonds between every ~ 4th residue in Backbone #19 - Protein Structure Basics & Classification Helix: Stabilized by H-bonds between every ~ 4th residue in Backbone 10/5/07 C = black O = red N = blue H = white Look! - Charges on backbone are "neutralized" by hydrogen bonds (H-bonds) - red fuzzy vertical bonds BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

Certain Amino Acids are "Preferred" & Others are Rare in Helices #19 - Protein Structure Basics & Classification 10/5/07 Certain Amino Acids are "Preferred" & Others are Rare in Helices Ala, Glu, Leu, Met = good helix formers Pro, Gly Tyr, Ser = very poor Amino acid composition & distribution varies, depending on on location of helix in 3-D structure BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

-Sheets - also Stabilized by H-bonds Between Backbone Atoms #19 - Protein Structure Basics & Classification 10/5/07 -Sheets - also Stabilized by H-bonds Between Backbone Atoms Anti-parallel Parallel BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

#19 - Protein Structure Basics & Classification 10/5/07 Loops Connect helices and sheets Vary in length and 3-D configurations Are located on surface of structure Are more "tolerant" of mutations Are more flexible and can adopt multiple conformations Tend to have charged and polar amino acids Are frequently components of active sites Some fall into distinct structural families (e.g., hairpin loops, reverse turns) BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

#19 - Protein Structure Basics & Classification Coils 10/5/07 Regions of 2' structure that are not helices, sheets, or recognizable turns Intrinsically disordered regions appear to play important functional roles BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs

Chp 13 - Protein Structure Basics #19 - Protein Structure Basics & Classification 10/5/07 Chp 13 - Protein Structure Basics SECTION V STRUCTURAL BIOINFORMATICS Xiong: Chp 13 Protein Structure Visualization, Comparison & Classfication Protein Structural Visualization Protein Structure Comparison Protein Structure Classification BCB 444/544 F07 ISU Dobbs #19- Protein Structure Basics & Classification BCB 444/544 Fall 07 Dobbs