Modular proteins I Level 3 Molecular Evolution and Bioinformatics Jim Provan Patthy Sections 8.1.1 – 8.1.3.

Slides:



Advertisements
Similar presentations
Five Classes of Introns
Advertisements

Chap. 6 Problem 2 Protein coding genes are grouped into the classes known as solitary (single) genes, and duplicated or diverged genes in gene families.
Duplication, rearrangement, and mutation of DNA contribute to genome evolution Chapter 21, Section 5.
13 and 15 October, 2004 Chapter 13 RNA Splicing (and some other important stuff)
The Molecular Genetics of Gene Expression
Transcription in eucaryotes The basic chemistry of RNA synthesis in eukaryotes is the same as in prokaryotes. Genes coding for proteins are coded for by.
Protein Modules An Introduction to Bioinformatics.
Chapter 17 AP Biology From Gene to Protein.
Step 1 of Protein Synthesis
RNA splicing By xiao yi 生物学基地班
Transcription: Synthesizing RNA from DNA
RNA processing Splicing and the mechanism of splicing of mRNA
CHAPTER 3 GENE EXPRESSION IN EUKARYOTES (cont.) MISS NUR SHALENA SOFIAN.
Posttranscriptional Modification of RNA
Outline  Introduction  The chemistry of RNA Splicing  The Splicing Machinery  Splicing Pathway  Alternative Splicing  Exon Shuffling  RNA Editing.
Anum kamal(BB ) Umm-e-Habiba(BB ). Gene splicing “Gene splicing is the removal of introns from the primary trascript of a discontinuous gene.
{ DNA Processes: Transcription and Translation By: Sidney London and Melissa Hampton.
CHAPTER 17 FROM GENE TO PROTEIN Copyright © 2002 Pearson Education, Inc., publishing as Benjamin Cummings Section B: The Synthesis and Processing of RNA.
Eukaryotic cells modify RNA after transcription
6 domains, “Helical Wheel” Domain I contains binding sites for the 5’ exon (keeps the 5’ exon from floating away after the first splicing step) Consensus.
RNA Splicing Prokaryotic genes: the codon is immediately adjacent to another Eukaryotic genes: consists of blocks of coding sequences separated by noncoding.
Pre-mRNA Splicing Lecture 1 Douglas Black. Fig The most complex RNA processing reaction is pre-mRNA splicing. Most genes in metazoan (multicellular)
Ultraconserved Elements in the Human Genome Bejerano, G., et.al. Katie Allen & Megan Mosher.
©2000 Timothy G. Standish Revelation 18:4 4And I heard another voice from heaven, saying, Come out of her, my people, that ye be not partakers of her sins,
A PowerPoint presentation by Gene Tempest
RNA and Protein Synthesis
RNA AND PROTEIN SYNTHESIS RNA vs DNA RNADNA 1. 5 – Carbon sugar (ribose) 5 – Carbon sugar (deoxyribose) 2. Phosphate group Phosphate group 3. Nitrogenous.
RNA Structure and Transcription Mrs. MacWilliams Academic Biology.
RNA Processing Capping Polyadenylation Introns vs exons Splicing
Chapter 14 – RNA molecules and RNA processing
Chapter 10 Transcription RNA processing Translation Jones and Bartlett Publishers © 2005.
Ch. 21 Genomes and their Evolution. New approaches have accelerated the pace of genome sequencing The human genome project began in 1990, using a three-stage.
Chapter 17 From Gene to Protein
PROTEIN SYNTHESIS. Protein Synthesis: overview  DNA is the code that controls everything in your body In order for DNA to work the code that it contains.
 The replication of DNA takes place in S phase of interphase  However, DNA is also used during G 1 to assemble proteins  This process is broken down.
LECTURE CONNECTIONS 14 | RNA Molecules and RNA Processing © 2009 W. H. Freeman and Company.
PROTEIN SYNTHESIS The Blueprint of Life: From DNA to Protein.
Chap. 8 Problem 1 Mechanisms of post-transcriptional gene control of protein coding genes are shown in Fig The most commonly used mechanism is the.
Ch. 17 From Gene to Protein. Genes specify proteins via transcription and translation DNA controls metabolism by directing cells to make specific enzymes.
Chapter 17 From Gene to Protein.
Transcription and mRNA Modification
Lecture 4 Topic 2. Gene Function & Gene Expression.
Alternative Splicing (a review by Liliana Florea, 2005) CS 498 SS Saurabh Sinha 11/30/06.
Post-transcriptional regulation
CHAPTER 13 RNA and Protein Synthesis. Differences between DNA and RNA  Sugar = Deoxyribose  Double stranded  Bases  Cytosine  Guanine  Adenine 
Chapter 3 The Interrupted Gene.
Gene Regulation In 1961, Francois Jacob and Jacques Monod proposed the operon model for the control of gene expression in bacteria. An operon consists.
Modular proteins II Level 3 Molecular Evolution and Bioinformatics Jim Provan Patthy Sections – 8.2.
Protein Synthesis RNA, Transcription, and Translation.
Eukaryotic genes are interrupted by large introns. In eukaryotes, repeated sequences characterize great amounts of noncoding DNA. Bacteria have compact.
Colinearity of Gene and Protein
Posttranscriptional Modification of DNA Primary Transcript – newly synthesized RNA Mature tRNA molecules are generated in both prokaryotes and eukaryotes.
TRANSCRIPTION (DNA → mRNA). Fig. 17-7a-2 Promoter Transcription unit DNA Start point RNA polymerase Initiation RNA transcript 5 5 Unwound.
RNA MODIFICATION Eukaryotic mRNA molecules are modified before they exit the nucleus.
Molecular Basis for Relationship between Genotype and Phenotype DNA RNA protein genotype function organism phenotype DNA sequence amino acid sequence transcription.
Fig Prokaryotes and Eukaryotes
Evolution of gene function
Transcription Translation
Relationship between Genotype and Phenotype
Relationship between Genotype and Phenotype
Peter John M.Phil, PhD Atta-ur-Rahman School of Applied Biosciences (ASAB) National University of Sciences & Technology (NUST)
Concept 17.3: Eukaryotic cells modify RNA after transcription
Fig Figure 21.1 What genomic information makes a human or chimpanzee?
Evolution of eukaryote genomes
What is RNA? Do Now: What is RNA made of?

Extra chromosomal Agents Transposable elements
Revelation 18:4 4 And I heard another voice from heaven, saying, Come out of her, my people, that ye be not partakers of her sins, and that ye receive.
Relationship between Genotype and Phenotype
Presentation transcript:

Modular proteins I Level 3 Molecular Evolution and Bioinformatics Jim Provan Patthy Sections – 8.1.3

Protein domains Folded structures of proteins that are larger than  residues generally consist of multiple structural domains: Compact, stable units with a unique three-dimensional structure Interactions within a domain are more significant than those between domains Fold independently i.e. structural domains are also folding domains If domain performs distinct function which remains intact in the isolated domain, then it is also a functional domain Many multidomain proteins are homomultimeric i.e. contain multiple copies of a single type of structural domain: Arisen through internal duplication of complete domains Fate of domains determined by similar rules to paralogous genes

Protein domains Many multidomain proteins are heteromeric: Example is plasminogen activator where a trypsin-like serine protease is joined to kringle, finger and EGF domains May occur by fusion of two or more genes (chimeric proteins) Also known as modular proteins, with domains known as modules Certain modules occur in a wide variety of hetero- and homomultimeric proteins: Suggests mechanisms to facilitate duplication and dispersal “Building blocks” of different types of multidomain proteins are known as mobile protein modules Frequency of transfer and incorporation into new protein reflects fixation probability

Modular assembly by intronic recombination Discovery of introns provided potential new mechanisms for protein evolution: Gilbert suggested that recombination within introns could assort exons independently Idea of rapid construction of novel genes from parts of old ones led to the formulation of the exon-shuffling hypothesis According to “introns early” theories, all extant genes were constructed from a limited number of exon types Under the “introns late” theory, intronic recombination and exon shuffling could not have played a major role in the assembly of the earliest genes Original theory was that exons corresponded directly to modules and/or structural motifs

Problems with the “introns early” hypothesis In the case of many genes, no obvious correspondence was observed between protein structure and intron location It is now known that introns can also be inserted into genes i.e. structure of a gene may not be its original structure Introns suitable for exon shuffling did not originate until a relatively late stage of eukaryotic evolution Exon shuffling has only been conclusively demonstrated in “young” proteins unique to higher eukaryotes Only a special group of exons, the “symmetrical” modules, are really valuable for exon shuffling. Intron phase distribution is also a crucial factor.

Self-splicing introns Group I introns: Reaction requires only a guanine nucleotide cofactor: — Provides a free 3’-OH group that attacks the 5’ splice site — 3’-OH generated at the end of the upstream exon — Second transesterification joins the two exons Crucially depends on folded structure of the intron itself Group II introns: Does not require an external cofactor: 2’-OH of an adenine within the intron cuts the 5’ splice site 2’5’ phosphodiester bond (branch site) forms the lariat structure Although folding is still crucial, chemistry, sequence of events and lariat formation are similar to nuclear spliceosomal introns

Spliceosomal intron splicing mechanism

Spliceosomal introns Spliceosomal introns are only spliced in the presence of a complex of specific proteins and RNA known as a spliceosome Majority of intron is unimportant: as long as the 5’ and 3’ splice sites and the branch site are conserved, splicing can take place: Large insertions into spliceosomal introns, or deletions do not affect splicing efficiency Chimeric introns, containing the 5’ end of one intron and the 3’ end of another, are also properly spliced Mutations (directed or otherwise) in these regions lead to aberrant splicing Spliceosomal intron plays a minor role in its own splicing: the actual spliceosome complex is more important

Evolution of spliceosomal introns Both group I and group II self-splicing mechanisms resemble spliceosome catalysed splicing: Initial step is attack by ribose hydroxyl group on 5’ splice site In each case, reactions are transesterifications where phosphate moieties are retained in products In group II and spliceosomal introns, intron is released as a lariat Accepted that spliceosomal-catalysed splicing evolved from group II self-splicing introns : Key step was transfer of catalytic role from intron to other molecules Formation of spliceosome gave spleceosomal introns structural freedom as they no longer had to fulfil the catalytic function Generally found only in nuclear genomes of higher eukaryotes (plants, animals and fungi)

Insertion and spread of spliceosomal introns

Intron loss Plays a significant role in changing exon-intron structure of genes Introns may be eliminated through mechanism that gives rise to processed genes (retroposition) Reverse transcription can also lead to loss of only some introns: Reverse transcription of perfectly spliced mRNA and recombination with the functional gene: mutates original gene Partially processed pre-mRNA could give rise to a semi-processed gene: generates a new paralogue

Gene duplication / deletion due to intronic recombination

Exon shuffling via recombination in introns Believed that insertion of exons may occur by the same mechanism as insertion of introns: Exon shuffling may be a consequence of the occasional inclusion of exon sequences in the insertion cycle of introns Alternative splicing (exon skipping during splicing) may yield exons with flanking introns If such a composite is inserted into the genome by the same mechanism that inserts single introns (reverse splicing) we have exon shuffling Key difference between intronic recombination model and retrotransposition model: In first case, insertion occurs into a pre-existing intron of same phase as introns flanking exon Retrotransposition model does not have this requirement

Evolution of urokinase PSPSGPSGK GK

Evolution of tissue plasminogen activator PSGKPSGKF F K module duplication PSGKFK

Evolution of Factor XII PSGKPSGKG F PSGKGF FN2 PSGKGF Duplication of G module