M.M. Dalkilic, PhD Monday, September 08, 2008 Class III Indiana University, Bloomington, IN Sequence Homology 1 Sequence Similiarty (Computation) M.M.

Slides:



Advertisements
Similar presentations
Proteins from Amino Acids
Advertisements

Carbohydrates, Lipids, Proteins, and Nucleic Acids
Protein Structure and Physics. What I will talk about today… -Outline protein synthesis and explain the basic steps involved. -Go over the Chemistry of.
Lecture 2 – The Kinetics of Enzyme Catalyzed Reaction Dr. AKM Shafiqul Islam University Malaysia Perlis
. Class 1: Introduction. The Tree of Life Source: Alberts et al.
Polypeptides – a quick review A protein is a polymer consisting of several amino acids (a polypeptide) Each protein has a unique 3-D shape or Conformation.
Lecture 1 BNFO 240 Usman Roshan. Course overview Perl progamming language (and some Unix basics) Sequence alignment problem –Algorithm for exact pairwise.
Protein structure Friday, 10 February 2006 Introduction to Bioinformatics Brigham Young University DA McClellan
1. Primary Structure: Polypeptide chain Polypeptide chain Amino acid monomers Peptide linkages Figure 3.6 The Four Levels of Protein Structure.
CISC667, F05, Lec27, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) Review Session.
2.7 DNA Replication, transcription and translation
Ribosome Structure 1. Outline the structure of a ribosome based on the diagram: ● A site.
Transcription Transcription is the synthesis of mRNA from a section of DNA. Transcription of a gene starts from a region of DNA known as the promoter.
Proteins. You need to know that: Proteins have a variety of functions within all living organisms. The general structure of an amino acid Condensation.
1 Chapter 3: Protein ZHOU Yong Department of Biology Xinjiang Medical University.
Supersecondary structures. Supersecondary structures motifs motifs or folds, are particularly stable arrangements of several elements of the secondary.
Genomics and Personalized Care in Health Systems Lecture 9 RNA and Protein Structure Leming Zhou, PhD School of Health and Rehabilitation Sciences Department.
Insulin: Weight = 5733, 51 amino acids Glutamine Synthetase: Weight = 600,000, 468 amino acids.
Introduction to Bioinformatics Algorithms Algorithms for Molecular Biology CSCI Elizabeth White
DNA & RNA Protein Synthesis. DNA be.com/watch?v =4PKjF7OumYo.
Sequence analysis: Macromolecular motif recognition Sylvia Nagl.
Multiple Alignment and Phylogenetic Trees Csc 487/687 Computing for Bioinformatics.
M.M. Dalkilic, PhD Monday, September 08, 2008 Class V Indiana University, Bloomington, IN Sequence Homology 1 Sequence Similiarty (Computation) M.M. Dalkilic,
CFE Higher Biology DNA and the Genome Translation.
PROTEIN SYNTHESIS THE FORMATION OF PROTEINS USING THE INFORMATION CODED IN DNA WITHIN THE NUCLEUS AND CARRIED OUT BY RNA IN THE CYTOPLASM.
 DNA must be copied (before a cell divides)  The DNA molecule produces 2 IDENTICAL new complementary strands following the rules of base pairing: A-T,
 Translation Creating Protein from mRNA Protein Structure  Proteins are made of Amino Acids.  There are 20 different Amino Acids.  The sequence of.
Genetics 314 – Spring, 2009 Lecture 7 Reading – Chapter 13 First Exam – Friday, February 6 th, 2009 Review Session – Wednesday, Feb. 4th.
Chapter Twenty Two Nucleic Acids. Copyright © Houghton Mifflin Company. All rights reserved.22 | 2 →CO 22.1 Human egg and sperm. Lipids cont’d Dr. Nikos.
1 PROTEIN SYNTHESIS: Translation. 2 Transcription Translation DNA mRNA Ribosome Protein Prokaryotic Cell DNA  RNA  Protein.
Bioinformatics The Prediction of Life Tony C Smith Department of Computer Science University of Waikato
Translation. What I need to know:- 1.What translation is 2.The role of tRNA 3.Know about anticodons and codons 4.The function of start/stop codons 5.One.
3.A.1 DNA and RNA Part IV: Translation DNA, and in some cases RNA, is the primary source of heritable information. DNA, and in some cases RNA, is the primary.
Introduction to Bioinformatics Algorithms Algorithms for Molecular Biology CSCI Elizabeth White
Genetic Engineering Genetic engineering is also referred to as recombinant DNA technology – new combinations of genetic material are produced by artificially.
Chs. 11 and 12 Clicker Quiz! Get your clickers ready! 8 questions.
Sequence Alignment.
Copyright © 2009 Pearson Education, Inc. Head Tail fiber DNA Tail.
© 2005 Jones and Bartlett Publishers Chapter 18 How Genes Work and How Genes are Controlled.
Translation and Protein Synthesis Notes
M.M. Dalkilic, PhD Monday, September 08, 2008 Class II Indiana University, Bloomington, IN Sequence Homology 1 Sequence Similiarty (Computation) M.M. Dalkilic,
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Composed of 4 nucleotides, that always pair the same.
Higher Human Biology Unit 1 Human Cells KEY AREA 3: Gene Expression.
Protein Structure and Function. Proteins are organic compounds made from amino acids held together by peptide bonds.
Lesson 4- Gene Expression PART 2 - TRANSLATION. Warm-Up Name 10 differences between DNA replication and transcription.
DNA Structure and Protein Synthesis Topic 2.4. Introduction  Cause of CF?  faulty CFTR protein  What causes faulty protein?  DNA Mutation  What is.
BIOL 2416 CH 6: Translation. What is a protein? A protein consists of 1 or more polypeptides A polypeptide is a polymer of amino acids bound together.
SC.912.L.16.3 DNA Replication. – During DNA replication, a double-stranded DNA molecule divides into two single strands. New nucleotides bond to each.
Organic Compounds: Proteins
Nucleic Acids DNA & RNA.
What is a Protein? A protein is simply a long chain of amino acids linked together by bonds. The backbone of amino acids form strong covalent bonds and.
Protein Synthesis and Protein Folding
Human Cells Gene Expression
7.3 Translation.
Protein Synthesis.
Indiana University, Bloomington, IN
Topic DNA.
Indiana University, Bloomington, IN
Translation 2.7 & 7.3.
Transcription & Translation.
7.3 Translation.
Amino Acids.
Study Question: What are enzymes?
Mr. Briner Unit 7.3 DNA Translation HL
The four primary organic macromolecules
Replication and Transcription
Proteins.
Protein Synthesis Transcription.
Chapter 18 How Genes Work and How Genes are Controlled
Presentation transcript:

M.M. Dalkilic, PhD Monday, September 08, 2008 Class III Indiana University, Bloomington, IN Sequence Homology 1 Sequence Similiarty (Computation) M.M. Dalkilic, PhD SoI Indiana University, Bloomington, IN 2008 ©

Outline New Programming and written homework Friday New Reading Posted on Website Readings [R] Chaps 5 Most Important Aspect of Bioinformatics—homology search through sequence similarity (cont’d) Some vocabulary snuck in 2 Sequence Similiarty (Computation) M.M. Dalkilic, PhD SoI Indiana University, Bloomington, IN 2008 ©

Computation (review) Sequence Similiarty (Computation) M.M. Dalkilic, PhD SoI Indiana University, Bloomington, IN 2008 © 3 Algorithm “process or rules for (esp. machine) calculations. The execution of an algorithm must not include any subjective decisions, nor must it require the use of intuition or creativity” [Brassard & Bratley]

Computation (review) Sequence Similiarty (Computation) M.M. Dalkilic, PhD SoI Indiana University, Bloomington, IN 2008 © 4 constant Upper bound starts Upper bound

General Technique of Dynamic Programming Sequence Similiarty (Computation) M.M. Dalkilic, PhD SoI Indiana University, Bloomington, IN 2008 © 5 But what if data needs to be shared or the cost of redundancy is too high? Rethink computation: Dynamic Programming or Recursive Optimization Reduce cost of sharing thereby reduce cost of recursion

General Technique of Dynamic Programming Sequence Similiarty (Computation) M.M. Dalkilic, PhD SoI Indiana University, Bloomington, IN 2008 © 6 “ Dynamic programming reduces the running time of a recursive function to be at most the time required to evaluate the function for all arguments less than or equal to the given argument, treating the cost of a recursive call as a constant” [Sedgewick] o Top-down DP o Bottom-Up DP

Vocabulary Sequence Similiarty (Computation) M.M. Dalkilic, PhD SoI Indiana University, Bloomington, IN 2008 © 7 There are about a dozen words that you will encounter when engaging in bioinformatics—or computational biology. It’s important to know what they mean. I’m not going to provide a listing of all the important words, but ones that I believe are important now. ENZYME is typically a peptide (molecule made from proteins) that enables or “catalyzes” phenomenon—this could changing one molecule to another.

Vocabulary Sequence Similiarty (Computation) M.M. Dalkilic, PhD SoI Indiana University, Bloomington, IN 2008 © 8 The IUBMB has developed six categories tying together nomenclature with function: EC1 oxidoreductase (moving around hydrogen) EC2 tranferase (move a functional unit) EC3 hydrolase (involves H 2 O) EC4 lysase “cleave” (or cut) without using EC1 or EC3 EC5 isomerase (change in conformation) EC6 ligase (join functional units with covalent bonds) ENZYME is typically a peptide (molecule made from proteins) that enables or “catalyzes” phenomenon—this could changing one molecule to another.

Vocabulary Sequence Similiarty (Computation) M.M. Dalkilic, PhD SoI Indiana University, Bloomington, IN 2008 © 9 Restriction endonuclease cleaves DNA at specific sites…

Vocabulary Sequence Similiarty (Computation) M.M. Dalkilic, PhD SoI Indiana University, Bloomington, IN 2008 © 10 1.Replication 2.Transcription 3.Reverse Transcription 4.Translation

Vocabulary Sequence Similiarty (Computation) M.M. Dalkilic, PhD SoI Indiana University, Bloomington, IN 2008 © 11 3 “letters” of DNA becomes 3 “letters” of RNA becomes 1 “letter” of protein codon Six Reading Frames

Multiple Sequence Alignment of Proteins Sequence Similiarty (Computation) M.M. Dalkilic, PhD SoI Indiana University, Bloomington, IN 2008 © 12 i f protein A Amino acid gap

Why Alignment of Proteins? Sequence Similiarty (Computation) M.M. Dalkilic, PhD SoI Indiana University, Bloomington, IN 2008 © 13 i f Conjecture: Structure imparts function and similar functions should have similar structures. Therefore, align proteins to look for regions that are “similar” in sequence, since sequence determines structure and like sequences will (likely) produce similar function.

Domains and Motifs Sequence Similiarty (Computation) M.M. Dalkilic, PhD SoI Indiana University, Bloomington, IN 2008 © 14 Collections of motifs that perform a function Structural motifs Functional motifs Principle is how percent identity (similarity) and homology play out—above 40% (25%) percent identity one may infer a homology is plausible.

Domains and Motifs Sequence Similiarty (Computation) M.M. Dalkilic, PhD SoI Indiana University, Bloomington, IN 2008 © 15 Primary structure (sequence itself) Secondary structure [most common] (alpha-helix, beta- sheet) Tertiary structure is collection of secondary structure interlaced with loops Quarternary structure is combination of tertiary structures

Recurrence of Aligning two Sequences Sequence Similiarty (Computation) M.M. Dalkilic, PhD SoI Indiana University, Bloomington, IN 2008 © 16 # elementary particles in universe

Better Recurrences Sequence Similiarty (Computation) M.M. Dalkilic, PhD SoI Indiana University, Bloomington, IN 2008 © 17 [Waterman]