Microbial Genome/Proteome Architectures – Signatures of Environmental Adaptation CHITRA DUTTA Structural Biology & Bioinformatics Division Indian Institute.

Presentation transcript:

Microbial Genome/Proteome Architectures – Signatures of Environmental Adaptation CHITRA DUTTA Structural Biology & Bioinformatics Division Indian Institute of Chemical Biology 4, Raja S. C. Mullick Road Kolkata

Molecular Evolution – Alternate views Evolution of genes/proteins – Mutation versus Selection

Types of Mutations
• Neutral Mutation:
  – Synonymous base changes
  – Base changes in introns, pseudogenes, and other non-coding, non-regulatory regions
  – Even some non-synonymous base changes can be neutral if they do not affect protein function
• Advantageous Mutation – Positive Selection
• Deleterious Mutation – Purifying Selection

Tests for selection on sequences
• dS = number of synonymous substitutions per nucleotide site in the sequence
• dN = number of non-synonymous (replacement) substitutions per nucleotide site in the sequence

Molecular Evolution – Alternate views
Evolution of genes/proteins – Mutation versus Selection
Current view: Mutation and Selection

VARIATIONS IN GENOME COMPOSITION : UNICELLULAR ORGANISMS
(A) INTERGENOMIC VARIATIONS – Mutational Bias
  (i) Base-equifrequent genomes
  (ii) G+C-rich genomes
  (iii) A+T-rich genomes
(B) INTRAGENOMIC VARIATIONS:
  • Intergenic variation – Translational Selection & other forces
  • Interstrand variation – (i) Replicational-transcriptional Selection (ii) Thermophilic Adaptation
  • Horizontally transferred genes

Codon Bias : Mutation versus Selection
• Nc (Effective Number of Codons used by a gene) measures how small a subset of codons a gene uses.
• It ranges from 61, for a gene using all codons with equal frequency, to 20, for a gene that effectively uses only one codon for each amino acid.
• The stronger the selection pressure, the higher the codon bias and the lower the Nc value.
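The Nc measure described above can be sketched in Python. This is a minimal illustration that assumes the widely used homozygosity-based formulation attributed to Wright and the standard genetic code; the slides do not state which variant was used. The function name, the handling of sparse amino acids, and the cap at 61 are choices made for this sketch.

```python
from collections import Counter, defaultdict

# Standard genetic code, bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}


def effective_number_of_codons(cds: str) -> float:
    """Effective number of codons (Nc) of one in-frame coding sequence."""
    seq = cds.upper()
    counts = Counter(seq[i:i + 3] for i in range(0, len(seq) - 2, 3))

    # Group sense codons into synonymous families, one per amino acid.
    families = defaultdict(list)
    for codon, aa in CODON_TABLE.items():
        if aa != "*":
            families[aa].append(codon)

    # Codon homozygosity F per amino acid, grouped by family size (2, 3, 4, 6).
    f_values = defaultdict(list)
    for fam in families.values():
        n = sum(counts[c] for c in fam)
        if len(fam) < 2 or n < 2:
            continue  # Met/Trp, or too few observations to estimate F
        homozygosity = (n * sum((counts[c] / n) ** 2 for c in fam) - 1) / (n - 1)
        if homozygosity > 0:  # assumption: non-positive estimates are skipped
            f_values[len(fam)].append(homozygosity)

    mean_f = {k: sum(v) / len(v) for k, v in f_values.items()}
    # Ile is the only three-fold family; if it is unusable, average F2 and F4.
    if 3 not in mean_f and 2 in mean_f and 4 in mean_f:
        mean_f[3] = (mean_f[2] + mean_f[4]) / 2

    nc = 2.0  # Met and Trp contribute one codon each
    for degeneracy, n_amino_acids in ((2, 9), (3, 1), (4, 5), (6, 3)):
        if degeneracy not in mean_f:
            raise ValueError(f"no usable {degeneracy}-fold amino acids in sequence")
        nc += n_amino_acids / mean_f[degeneracy]
    return min(61.0, nc)  # the estimate can overshoot 61 for unbiased genes
```

A gene that uses a single codon per amino acid gives Nc = 20, and a gene that uses all 61 sense codons equally reaches the cap of 61, matching the range quoted on the slide.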

Translational Selection in Synonymous Codon Usage

Translational Selection
Microbial genes:
• In unicellular organisms, a significant correlation exists between the extent of codon bias and expression levels of genes.
• Highly expressed genes, in general, exhibit a strong preference for a subset of synonymous codons recognized by the abundant tRNAs in such species, while the lowly or moderately expressed genes have a more uniform pattern of codon usage.

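Relative Synonymous Codon Usage (RSCU) of different codons in a set of highly expressed genes of any organism: $RSCU_{ij} = X_{ij} \Big/ \left( \frac{1}{n_i} \sum_{j=1}^{n_i} X_{ij} \right)$, where $X_{ij}$ = number of occurrences of the $j$th codon for the $i$th amino acid and $n_i$ = total number of synonymous codons for the $i$th amino acid.
The Normalized RSCU or Relative Adaptiveness ($W$) for a set of genes: $W_{ij} = RSCU_{ij} / RSCU_{i,\max}$, where $RSCU_{i,\max}$ is the largest RSCU value among the codons for the $i$th amino acid.
The Codon Adaptation Index (CAI) of a particular gene ($0 \le CAI \le 1$): $CAI = \left( \prod_{k=1}^{L} W_k \right)^{1/L}$, where $W_k$ is the relative adaptiveness of the $k$th codon in the gene and $L$ is the number of codons in the gene.
The greater the value of CAI, the higher the potential level of expression.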
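The three quantities above (RSCU, relative adaptiveness, CAI) can be computed along the following lines. This is a hedged sketch in Python assuming the standard genetic code and the usual geometric-mean definition of CAI from Sharp and Li. The function names and the `floor` parameter, which replaces zero weights so that the logarithm is defined, are assumptions of this example and not something stated in the slides.

```python
from collections import Counter
from math import exp, log

# Standard genetic code, bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}


def codons(cds: str) -> list:
    """Split an in-frame coding sequence into complete codons."""
    seq = cds.upper()
    return [seq[i:i + 3] for i in range(0, len(seq) - 2, 3)]


def rscu(reference_genes: list) -> dict:
    """RSCU of each sense codon, pooled over a reference gene set."""
    counts = Counter(c for g in reference_genes for c in codons(g)
                     if c in CODON_TABLE)
    families = {}
    for codon, aa in CODON_TABLE.items():
        if aa != "*":
            families.setdefault(aa, []).append(codon)
    values = {}
    for fam in families.values():
        total = sum(counts[c] for c in fam)
        if total == 0:
            continue  # amino acid absent from the reference set
        for c in fam:
            # X_ij divided by the mean count over the n_i synonymous codons
            values[c] = counts[c] * len(fam) / total
    return values


def relative_adaptiveness(rscu_values: dict) -> dict:
    """W for each codon: RSCU divided by the family maximum."""
    best = {}
    for codon, value in rscu_values.items():
        aa = CODON_TABLE[codon]
        best[aa] = max(best.get(aa, 0.0), value)
    return {c: v / best[CODON_TABLE[c]] for c, v in rscu_values.items()}


def cai(cds: str, w: dict, floor: float = 0.01) -> float:
    """Geometric mean of W over the codons of a gene that have a weight."""
    logs = [log(max(w[c], floor)) for c in codons(cds) if c in w]
    return exp(sum(logs) / len(logs)) if logs else float("nan")
```

For example, with a toy reference set `["CTGCTGCTGCTA"]`, the codon CTG gets W = 1 and CTA gets W = 1/3, so a gene `"CTGCTA"` has CAI equal to the square root of 1/3. Note that this sketch counts every codon that has a defined weight towards L; some implementations leave out Met and Trp because their weight is always 1.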

Translational Selection – Some Observations
• Translational selection usually operates in accordance with the mutational bias of the genome.
• Most unicellular organisms exhibit translational selection. Exceptions include:
  – Some genomes with extremely high mutational bias
  – Genomes of some obligate intracellular organisms with strand-specific mutational bias
  – Species adapted to extreme environments

Amino Acid Selection in Bacterial Proteins
Multivariate analyses of various bacterial proteomes reveal the following as primary sources of intra-proteome variation:
(i) Hydrophobicity
(ii) Aromaticity
(iii) Mean molecular weight
(iv) Biosynthetic cost of production
(v) Gene expression level
In most free-living bacteria, the principle of cost minimization holds.

Asymmetric Mutational Bias AND Replicational-Transcriptional Selection

Asymmetric Mutational Bias & Replicational-Transcriptional Selection
• Strong compositional asymmetries between the genes lying on the leading versus lagging strands have been observed in many prokaryotic organisms at the level of nucleotides, codons and even amino acids.
• Bacterial chromosome replication usually starts at a single origin, and two replication forks propagate in opposite directions up to termination signals. As the replication mechanism differs for the two strands of the duplex DNA, this process often gives rise to compositional asymmetries between the leading strand and the lagging strand.
• During transcription, the non-template strand is in an open single-stranded configuration that is more prone to specific mutations such as C → T (U) deamination. The template strand is less susceptible to this process and is protected by transcription-dependent DNA repair. As a result, the leading strand contains an excess of G/T over C/A.
• Replicational selection is responsible for the higher number of genes on the leading strands, and transcriptional selection appears to be responsible for the enrichment of highly expressed genes on these strands.
• Examples: the spirochaetes Borrelia burgdorferi and Treponema pallidum, the endosymbiotic bacterium Blochmannia floridanus, the human pathogens Bartonella henselae and Bartonella quintana, etc.

Asymmetric Mutational Bias & Replicational-Transcriptional Selection
Overall GC-skew: $(G - C) / (G + C)$
(GC)3-skew: $(G_3 - C_3) / (G_3 + C_3)$
Overall AT-skew: $(A - T) / (A + T)$
(AT)3-skew: $(A_3 - T_3) / (A_3 + T_3)$
Here the subscript 3 denotes base counts at the third codon position.
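The four skew measures can be computed directly from base counts. The Python sketch below assumes an in-frame coding sequence, so that every third base is a third codon position. The function names and the convention of returning 0.0 when both counts are zero are choices made for this illustration.

```python
def skew(x: int, y: int) -> float:
    """Return (x - y) / (x + y), or 0.0 when both counts are zero."""
    total = x + y
    return (x - y) / total if total else 0.0


def strand_skews(cds: str) -> dict:
    """Overall and third-codon-position GC and AT skews of one coding sequence."""
    seq = cds.upper()
    third = seq[2::3]  # bases at codon position 3, assuming the sequence is in frame
    return {
        "GC_skew": skew(seq.count("G"), seq.count("C")),
        "AT_skew": skew(seq.count("A"), seq.count("T")),
        "GC3_skew": skew(third.count("G"), third.count("C")),
        "AT3_skew": skew(third.count("A"), third.count("T")),
    }
```

Comparing these values between genes on the leading and lagging strands is one way to look for the strand-specific asymmetry described in these slides.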

Influence of Replicational-Transcriptional Selection on Codon Usage

Influence of Replicational-Transcriptional Selection on Amino Acid Usage

Asymmetric Mutational Bias & Replicational-Transcriptional Selection
Striking features:
• Strong strand-specific skews in nucleotide composition: the leading strand in replication is richer in G and T than the lagging strand.
• Higher number of genes on the leading strands – Replicational selection
• Enrichment of highly expressed genes on leading strands – Transcriptional selection
• Distinct codon and amino acid usage patterns, depending on whether the gene is transcribed from the leading or the lagging strand of replication.
Replicational-transcriptional selection is very common in obligate intracellular bacteria, which are not much exposed to recombinational processes.

Influence of Environment / Life-style
(i) Thermophilic Adaptation
(ii) Halophilic Adaptation

Thermophilic Adaptation – A case study
Nanoarchaeum equitans:
• The only known obligately symbiotic archaeon. It must be in contact with its crenarchaeal host Ignicoccus for survival and growth.
• Genome size is only 490 kb, the smallest microbial genome known to date.
• Yet it has the highest coding capacity, with very few non-coding regions.
• Genes for several vital metabolic pathways appear to be missing. It cannot synthesize most nucleotides, amino acids, lipids, and cofactors.
• Possesses most of the DNA repair enzymes and the complete genetic machinery necessary for transcription, translation and DNA replication.
• Apparent lack of translational selection, like other strictly symbiotic/parasitic microorganisms.
Ancient species, or reduced genome?

Thermophilic Adaptation
• Purine bases are significantly overrepresented in coding regions.
• A significant positive correlation exists (r = 0.89, p < 0.0001) between the overall purine-pyrimidine ratio (R:Y) and the optimal growth temperature (OGT).
• The higher the OGT, the stronger the selection for purine nucleotides in coding sequences.
• Prevalence of purine bases in mRNAs might prevent distracting RNA-RNA interactions and formation of local double strands within the molecule.
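The purine-pyrimidine ratio (R:Y) used in this correlation is a simple count ratio. A minimal Python sketch follows, assuming R:Y is taken as (A + G) / (C + T) over the coding sequence, with U counted as a pyrimidine for mRNA input. The function name is hypothetical, and the slides do not specify how ambiguous bases were treated, so they are simply ignored here.

```python
def purine_pyrimidine_ratio(seq: str) -> float:
    """R:Y ratio, (A + G) / (C + T or U), for a DNA or RNA sequence."""
    s = seq.upper()
    purines = s.count("A") + s.count("G")
    pyrimidines = s.count("C") + s.count("T") + s.count("U")
    if pyrimidines == 0:
        raise ValueError("sequence contains no pyrimidines")
    return purines / pyrimidines
```

Values above 1 indicate a purine excess. Computing the ratio over the pooled coding sequences of each genome and plotting it against OGT would be one way to examine the trend described above.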

Multivariate Analysis of Amino Acid Usage in N. equitans

Thermophilic Adaptation of N. equitans
• Comparison of 109 common orthologs between N. equitans, S. tokodaii and M. maripaludis reveals that thermophile proteins are usually characterized by:
  – relatively high aliphatic index,
  – marked overrepresentation of positively charged residues,
  – underrepresentation of Ser, Thr and Cys,
  – fewer sulfur atoms, and
  – higher propensities of alpha-helix formation in secondary structure.
• Homology modeling reveals that surface charge distribution is significantly different in the orthologous proteins of N. equitans and M. maripaludis.
• Comparison of isoelectric points indicates that hyperthermophiles have relatively more basic proteomes than mesophiles.
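The aliphatic index mentioned above is commonly computed from the mole fractions of Ala, Val, Ile and Leu. The Python sketch below assumes the widely used definition, AI = 100 × (x_Ala + 2.9·x_Val + 3.9·(x_Ile + x_Leu)). Whether the slides used exactly this definition is not stated, and the function name is hypothetical.

```python
def aliphatic_index(protein: str) -> float:
    """Aliphatic index from mole fractions of Ala, Val, Ile and Leu."""
    seq = protein.upper()
    if not seq:
        raise ValueError("empty protein sequence")

    def fraction(residue: str) -> float:
        # Mole fraction of one residue type in the sequence
        return seq.count(residue) / len(seq)

    return 100.0 * (fraction("A")
                    + 2.9 * fraction("V")
                    + 3.9 * (fraction("I") + fraction("L")))
```

Comparing this value between orthologous proteins from a thermophile and a mesophile is one way to check the first trend in the list.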

Halophilic Adaptation
Halophilic organisms require very high concentrations of salt (at least 2 M, approximately ten times the salt level of ocean water) for optimal growth and can be found in environments such as the Dead Sea, the Great Salt Lake, or man-made salterns.
Salient features of extreme halophiles:
• The cytoplasm is nearly saturated with KCl (Lanyi 1974).
• Proteins of these organisms require high salt for activity and stability; at less than 1–2 M NaCl or KCl, most haloarchaeal proteins unfold and lose their activity (Madern et al. 2000).

Halophilic Adaptation

Halophilic Adaptation - Amino Acid Usage

Halophilic Adaptation – Codon & Amino Acid Usage

Halophilic Adaptation - Amino Acid Usage

Halophilic Adaptation - Summary
• Extreme halophilic organisms cluster according to their unique amino acid composition and synonymous codon usage, irrespective of their taxonomic position and GC content.
• A higher ratio of negatively to positively charged amino acid residues and lower hydrophobicity are the major factors contributing to halophilic adaptation of proteins.
• Negatively charged amino acid residues increase at the expense of positively charged and non-polar residues in halophilic orthologs.
• Large hydrophobic residues are replaced by small and borderline hydrophobic residues.
• There is a lack of regular secondary structure, i.e., a decrease in alpha-helical content and an increase in coil regions, in halophilic proteins.
These features may be important to prevent aggregation and, at the same time, retain structural flexibility and activity of proteins at high salt concentrations.
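The charge ratio named in this summary can be estimated from a protein sequence as follows. This Python sketch assumes that Asp and Glu count as negatively charged and that Lys and Arg count as positively charged. Leaving His out of the positive set is an assumption of this example, and the function name is hypothetical.

```python
def negative_to_positive_ratio(protein: str) -> float:
    """Ratio of acidic (D, E) to basic (K, R) residues in a protein sequence."""
    seq = protein.upper()
    negative = seq.count("D") + seq.count("E")
    positive = seq.count("K") + seq.count("R")
    if positive == 0:
        # No basic residues: report infinity if any acidic residue is present
        return float("inf") if negative else float("nan")
    return negative / positive
```

Under the pattern described above, halophilic orthologs would be expected to show a higher ratio than their non-halophilic counterparts.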

Microbial Genome/Proteome Architectures
Factors influencing amino acid usage:
• Directional mutational pressure
• Functional/structural constraints
• Gene expressivity
• Bioenergetic requirements (cost minimization)
• Environmental adaptation
Origins of codon/nucleotide biases:
• Directional mutational pressure
• Translational Selection – Gene Expressivity
• Coupled Replicational-Transcriptional Selection
• Environmental Adaptation
• Other minor factors such as context-dependence