Journal Club Jenny Gu October 24, 2006. Introduction Defining the subset of Superfamilies in LUCA Examine adaptability and expansion of particular superfamilies.

Slides:



Advertisements
Similar presentations
An Introduction to Life
Advertisements

CELLULAR RESPIRATION How Cells Release Energy Aerobic Cellular Respiration 1. Glycolysis 4. Electron Transport System 3. Krebs Cycle Anaerobic Cellular.
Introduction to molecular biology. Subjects overview Investigate how cells organize their DNA within the cell nucleus, and replicate it during cell division.
Nucleic Acids Nucleic acids are molecules that store information for cellular growth and reproduction There are two types of nucleic acids: - deoxyribonucleic.
LS Chapter 5 Biology Basics Student Learning Outcomes: 1.Explain the biological hierarchy of organization Give examples of each level 2.Explain.
Microbial Metabolism.
Energy Generation in Mitochondria and Chloroplasts
Objectives Contrast the roles of glycolysis and aerobic respiration in cellular respiration. Relate aerobic respiration to the structure of a mitochondrion.
AP Review Chapters Fast Facts Metabolic pathways that release energy are called catabolic pathways - fermentation and cellular respiration Cellular.
BackBack Next Next CLOSE WINDOW.
Chapter 23.  Agents that cause disease  Many microorganisms: bacteria, fungi, protozoa  Bacteria are prokaryotes, but only a few are pathogens; most.
Methylation, Acetylation and Epigenetics
Pfam(Protein families )
Systems Biology Existing and future genome sequencing projects and the follow-on structural and functional analysis of complete genomes will produce an.
Cell Structure and Function. Why are cells small?
MCSG Site Visit, Argonne, January 30, 2003 Genome Analysis to Select Targets which Probe Fold and Function Space  How many protein superfamilies and families.
Alternative Pathways in cell respiration
CSE182-L12 Gene Finding.
Exploiting Structural and Comparative Genomics to Reveal Protein Functions  How many domain families can we find in the genomes and can we predict the.
1P2-1 Chapter 1: Outline The Living World Bacteria, Archaea, Eukarya, (Viruses) Biomolecules Functional Groups Major Classes of Biomolecules Biochemical.
DNA Structure Replication Functions (Stores and provides copies of genetic material- genes) – Blueprint (genes) for Protein Synthesis (Enzymes and cell.
Subsystem Approach to Genome Annotation National Microbial Pathogen Data Resource Claudia Reich NCSA, University of Illinois, Urbana.
Chapter 2 – Water, Biochemistry, and Cells
Chapter 3 The Biological Basis of Life. Chapter Outline  The Cell  DNA Structure  DNA Replication  Protein Synthesis  What is a Gene?  Cell Division:
Control of Gene Expression Eukaryotes. Eukaryotic Gene Expression Some genes are expressed in all cells all the time. These so-called housekeeping genes.
Amino acids are the building blocks of what macromolecule?
DNA STRUCTURE page What are the monomers of the nucleic acids?
Characteristics of Life Growth and development Cellularity Reproduction Responsiveness Movement Require energy.
Cellular Metabolism Chapter 4. Introduction Metabolism is many chemical reactionss Metabolism breaks down nutrients and releases energy= catabolism Metabolism.
Cellular Respiration & Protein Synthesis
Regulation of Gene Expression
Today: Genetic Technology Wrap-up Exam Review Remember: Final Exam is Wednesday, 12/13 at 1 pm!
Prokaryotes and fundamentally different from eukaryotes p547-p549 (Chap28, Raven et al.,)
Genetics: Chapter 7. What is genetics? The science of heredity; includes the study of genes, how they carry information, how they are replicated, how.
Prokaryotes Lack nucleus No organelles Possess DNA, RNA, and all other machinery Possess ATP synthesis Two Domains –Bacteria –Archaea.
Chapter 3 The Biological Basis of Life. Chapter Outline The Cell DNA Structure DNA Replication Protein Synthesis Cell Division: Mitosis and Meiosis New.
AP Biology Ch. 9 – Cellular Respiration. Catabolic pathway Fermentation Aerobic respiration Anaerobic respiration Cellular respiration Redox reaction.
Prokaryotes Chapter 20. Figure 5.1 The Scale of Life.
Extras Classification Viruses Bacteria Population Genetics
BSC Developmental Biology Patterns of Inheritance EvolutionEcology.
Fea- ture Num- ber Feature NameFeature description 1 Average number of exons Average number of exons in the transcripts of a gene where indel is located.
Human Anatomy & Physiology I Chapter 4 Cell Metabolism 4-1.
Protein and RNA Families
Ch. 17 From Gene to Protein. Genes specify proteins via transcription and translation DNA controls metabolism by directing cells to make specific enzymes.
I. Prolinks: a database of protein functional linkage derived from coevolution II. STRING: known and predicted protein-protein associations, integrated.
Eukaryotic Gene Control. Gene Organization: Chromatin: Complex of DNA and Proteins Structure base on DNA packing.
1 From Mendel to Genomics Historically –Identify or create mutations, follow inheritance –Determine linkage, create maps Now: Genomics –Not just a gene,
1 Computational functional genomics Lital Haham Sivan Pearl.
Genome analysis. Genome – the sum of genes and intergenic sequences of a haploid cell.
(H)MMs in gene prediction and similarity searches.
1 Studying Life. 1 Studying Life 1.1 What Is Biology? 1.2 How Is All Life on Earth Related? 1.3 How Do Biologists Investigate Life? 1.4 How Does Biology.
Discovering functional linkages and uncharacterized cellular pathways using phylogenetic profile comparisons: a comprehensive assessment Raja Jothi, Teresa.
The Biologist’s Wishlist A complete and accurate set of all genes and their genomic positions A set of all the transcripts produced by each gene The location.
Energy yielding reactions. Oxidation – Reduction Oxidation is the removal of electrons (e - ) from an atom or molecule, often produces energy. A loses.
1 Genes and Proteins The genetic information contained in the nucleotide sequence of DNA specifies a particular type of protein Enzymes = proteins that.
Ch. 11: DNA Replication, Transcription, & Translation Mrs. Geist Biology, Fall Swansboro High School.
Chapter 7: The Blueprint of Life, from DNA to Protein.
bacteria and eukaryotes
OXYGEN REVOLUTION Eukaryotes Evolved Anaerobic World (4.6 BYA-)
Transcription & Gene Expression
The Mimivirus Giant double stranded DNA virus Discovered in amoebas
Unit 7 “DNA & RNA” 10 Words.
Cellular Respiration Stage 2 & 3: Oxidation of Pyruvate Krebs Cycle or Citric Acid Cycle
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Cellular Respiration Stage 2 & 3: Oxidation of Pyruvate Krebs Cycle
Eukaryote Regulation and Gene Expression
Classification of Organisms
A 13C Isotope Labeling Strategy Reveals the Influence of Insulin Signaling on Lipogenesis in C. elegans  Carissa L. Perez, Marc R. Van Gilst  Cell Metabolism 
Cellular Respiration Stage 2 & 3: Oxidation of Pyruvate Krebs Cycle
AS Level Paper 1 and 2. A2 Level Paper 1 and 3 - Topics 1-4
Presentation transcript:

Journal Club Jenny Gu October 24, 2006

Introduction Defining the subset of Superfamilies in LUCA Examine adaptability and expansion of particular superfamilies of LUCA related to function and genome size. Challenged Woese’s Annealing hypothesis.

3-D Structural Comparison Domain Similarity Defined by: SSAP Dynamic Programming based Structure Comparison Algorithm CORA Comparison to 3D templates for each Superfamily. Manual Inspection. Profile based approaches Detect sequence patterns between relatives Functional Information Public resources (COGs, GO, KEGG) and literature Expect Curators Methods

Genome Structural Annotation and Occurrence Profiles Dataset: 114 complete genomes. 100 Prokaryotic Genomes 85 Bacteria, 15 Archeobacteria species 14 Eukaryotic Genomes Structural Annotation CATH HMMs -> Gene3D database. Superfamily Domain Occurrence Profiles (Prokaryotes) 940/1278 CATH domain present in at least one genome. Annotation Coverage: 50% of genes. Methods

Ancestral Superfamily Set Selection Defined by: Present in at least 90% of species from all kingdoms. Present in at least 70% archaeal and eukaryotic species. Definition avoids selection of superfamilies overrepresented in Bacteria but poorly represented in smaller groups. Flexibility for considering false-negative prediction error with sequence based approach. Guarantee selection of families in LUCA. Eliminate error introduced by horizontal gene transfer. Methods

Functional Annotation Automatic Functional Annotation for 940 structural superfamilies annotated in 100 prokaryotic species with COG. Superfamily functionally classified according to statistically most represented functional COG subcategory. 726/940 superfamilies annotated in COG (5% or more of species, at least 5 genes) For ancestral superfamily, further annotation with Pfam and literature. Methods

Definition of the Superfamily Functional Groups COG has six functional groups Translation Replication Metabolism Cellular Process Transcription Poorly Characterized Not considered RNA processing and modificaton Chromatin structure and dynamics Methods

Superfamily Functional Distribution in the Ancestral Domain Set 140 superfamilies found in all organisms of the three main kingdoms (Bacteria, Archaea, and Eukaryotes) 15% of Superfamilies, 55% of all domains in bacterial genes, and 18% of all domains in eukaryotes. Results and Discussion

Superfamily Functional Distribution in the Ancestral Domain Set (cont..) Representatives in all six COG functional groups. Translation (48 superfamilies) and Metabolic (46 superfamilies) comprise majority of ancestral domains. Metabolism (385 superfamilies) has undergone a higher expansion than translation (90 superfamilies). Results and Discussion

Analysis of the Cellular Functions of Ancestral CATH Superfamilies in the LUCA Two issues in defining ancestry: Domain ubiquity through all species. Probable functions such domains could have performed in LUCA. Results and Discussion

Analysis of the Cellular Functions of Ancestral CATH Superfamilies in the LUCA Results and Discussion

Analysis of the Cellular Functions of Ancestral CATH Superfamilies in the LUCA Results and Discussion Interconversion of sugars and synthesis of polysaccharides. Synthesis of ATP and partial equilibrium of NAD/NADH Part of the Calvin Cycle Pentose phosphate pathway Acetyl-CoA for cholesterol and/or steroids and synthesis and degradation of fatty acids. Part of the Krebs Cycle

Analysis of the Cellular Functions of Ancestral CATH Superfamilies in the LUCA Results and Discussion Nucleotide metabolism incomplete. Two alternatives for LUCA Synthesized nucleotides by de novo pathways Incorporated from surrounding soup. Enzyme for interconversion of nucleoside monophosphates are present.

Analysis of the Cellular Functions of Ancestral CATH Superfamilies in the LUCA Results and Discussion DNA synthesis, repair, ligation, and modification are represented. Synthesis of RNA and DNA transcription represented. Domain related to robosomal partical and protein synthesis are abundant. Methyl Transfer Proteins

Analysis of the Cellular Functions of Ancestral CATH Superfamilies in the LUCA Results and Discussion Membrane and Cell wall biogenesis Transduction of protein-protein signals and gene regulation Protein signal recognitio for protein transport Cell division Electron transport And ATP synthase

Universal Distribution Percentage of Superfamilies Universal Distribution Percentages Superfamily occurrence profiles derived from the prokaryotic sample (Archaea and Bacteria) 100% = Superfamily present in all species. 0% = Superfamily has highly specific distribution in just a few species. Methods

Ancestry and Evolutionary Temperature Results and Discussion

Ancestry and Evolutionary Temperature Results and Discussion

Superfamily Duplication Rates and Functional Diversification Another measure to gauge evolutionary temperature. Number of homologues within a superfamily. Observed high correlation with duplication and functional diversification. Results and Discussion

Superfamily Duplication Rates and Functional Diversification High universality spans across more function subcategories. Metabolism has a higher duplication rate and functional diversification than translation. Results and Discussions

Genome Size Correlation and the Coefficient of Interspecies Gene Variation (CIGV) of Superfamilies Domain occurrence profiles from 100 prokaryotic sample. Correlation coefficients between occurrence and genome size. (compared to randomly generated null model.) CIGV calculated by dividing standard deviation over all values of occurrence profile for a given superfamily. Methods

Statistical Analysis of Superfamily Distributions Kolmogorov-Smirnov two-sample test in the two- tailed version for large samples. Compared pairs of distribution between different functional groups. Methods

Superfamily Occurrence Profiles and Genome Size Correlation Results and Discussions

Superfamily Occurrence Profiles and Genome Size Correlation Results and Discussions

Superfamily Occurrence Profiles and Genome Size Correlation Results and Discussions

Superfamily Coefficient of Interspecies Gene Variation Results and Discussions High CIGV values = more adaptable. Hotter evolutionary temperature Low CIGV values = less adaptable.

Superfamily Coefficient of Interspecies Gene Variation Results and Discussions

Rates of Superfamily Innovation in the Functional Groups Results and Discussions Poor Innovation High Innovation

Conclusions A more realistic distribution of superfamilies in distant species. Life achived modern cellular status long before separation of three kingdoms. Woese’s annealing hypothesis called into question. A function of specific features and adaptabilities versus time.