Gene models and proteomes for Saccharomyces cerevisiae (Sc), Schizosaccharomyces pombe (Sp), Arabidopsis thaliana (At), Oryza sativa (Os), Drosophila melanogaster.

Slides:



Advertisements
Similar presentations
GBrowse at TAIR Philippe Lamesch TAIR curator. Seqviewer.
Advertisements

Part I: Tips and Techniques from curators GBrowse at TAIR David Swarbreck.
DNAStructureandReplication. Transformation: Robert Griffith (1928)
Eukaryotic Intron Loss Tobias Mourier & Daniel C. Jeffares.
The chemical Basis of Inheritance. Chromatin / Chromosomes.
Introduce GeneSpring GX12 Yun Lian GeneSpring Layout.
Orthology, paralogy and GO annotation Paul D. Thomas SRI International.
Basics of Comparative Genomics Dr G. P. S. Raghava.
1/30 Comparative Genomics. 2/30 Overview of the Talk Comparing Genomes Homologies & Families Sequence Alignments.
First release of HOGENOM, a database of homologous genes from complete genome Equipe Bioinformatique et Génomique Evolutive Laboratoire de Biométrie et.
Design principle of biological networks—network motif.
Selection on codons OEB Degenerate Code.
Alternative splicing and evolution Daniel Jeffares.
An Analysis of “Gene Finding in Novel Genomes” Michael Sneddon.
Sequence-Structure-Function Sequence Structure Function Threading Ab initio BLAST Folding: impossible but for the smallest structures Function prediction.
Genomics Complete Genomes in The Public DataBases >100 Non-Eukaryotes Eukaryotes: Leishmania 257 Kb 79 orfs Plasmodium falciparum I 947 Kb 205.
“Gene Finding in Novel Genomes” by Ian Korf Presented by: Christine Lee SoCAL BSI 2004.
The phylogenetics project data revealed! October 4, 2010 BIOS E-127.
Selection upon codons BIOS E *Aside: shallow trees are strange… And ignore question 7. of assignment…
Model Organisms and Databases. Model Organisms Characteristics of model organisms in genetics studies –Genetic history well known –Short life cycle; large.
The phylogenetics project data revealed! October 4, 2010 OEB 192.
EVOLUTIONARY AND COMPUTATIONAL GENOMICS Shin-Han Shiu Plant Biology / CMB / EEBB / Genetics / QBMI.
Comparative Expression Moran Yassour +=. Goal Build a multi-species gene-coexpression network Find functions of unknown genes Discover how the genes.
Computational Structural Bioinformatics ECS129 Instructor: Patrice Koehl
Genome projects and model organisms Level 3 Molecular Evolution and Bioinformatics Jim Provan.
Daniel Rico, PhD. Daniel Rico, PhD. ::: Introduction to Functional Analysis Course on Functional Analysis Bioinformatics Unit.
Using The Gene Ontology: Gene Product Annotation.
Université Libre de Bruxelles, Belgique Laboratoire de Bioinformatique des Génomes et des Réseaux (BiGRe)
Genomes School B&I TCD Bioinformatics May Genome sizes Completed eukaryotic nuclear genomes Type of organismSpeciesGenome size (10 6 base pairs)
Ontologies, data standards and controlled vocabularies.
This presentation was originally prepared by C. William Birky, Jr. Department of Ecology and Evolutionary Biology The University of Arizona It may be used.
IGEM 101: Session 7 4/2/15Jarrod Shilts 4/5/15Ophir Ospovat.
1/29 Comparative Genomics. 2/29 Overview of the Talk Comparing Genomes Homologies & Families Sequence Alignments.
Genomfart Any general theory for genome evolution will have to account for: the unique natural history of various genetic elements, the population-genetic.
Comparative Genomics.
Central dogma: the story of life RNA DNA Protein.
Using blast to study gene evolution – an example.
Chapter 1 Introduction.
It will help in preparing for the exam to read:
Lecture 21 – Genome Annotation & Sequenced Genomes Based on Chapther 8 Genomics: The Mapping and Sequencing of Genomes Copyright © 2010 Pearson Education.
Group discussion Name this protein. Protein sequence, from Aedes aegypti automated annotation >25558.m01330 MIHVQQMQVSSPVSSADGFIGQLFRVILKRQGSPDKGLICKIPPLSAARREQFDASLMFE.
Chapter 11 Meiosis & Genetics What do you think meiosis makes?
WSSP Chapter 10 Literature Search Where do you learn about the function of your gene? atttaccgtg ttggattgaa attatcttgc atgagccagc tgatgagtat gatacagttt.
Bos taurus Olfactory Receptor Katie Davis 1,2 and Sandra Rodriguez-Zas 1 1 Department of Animal Sciences, University of Illinois Urbana-Champaign, 2 ACES.
Template provided by: “posters4research.com” CONCLUSIONS AND FUTURE WORKS REFERENCE Cytochrome P450 (CYPs) are known to play a central role in insecticide.
Sequence-Structure-Function Sequence Structure Function Threading Ab initio BLAST Folding: impossible but for the smallest structures Function prediction.
What’s new in GO?. Priorities Annotation outreach Reference genomes User advocacy Ontology development Software.
Basics of Comparative Genomics
Generating Multiple Sequence Alignments with ClustalW
EL: To find out what a genome is and how gene expression is regulated
Saccharomyces Genome Database (SGD)
Prediction of Regulatory Elements for Non-Model Organisms Rachita Sharma, Patricia.
Genome Annotation Continued
Volume 11, Issue 3, Pages (March 2003)
Evolution of Biochemical Pathways
Every living organism inherits a blueprint for life from its parents.
The Coming of Age for Piwi Proteins
Purposes: To demonstrate the tendency of proteins to become longer with increase of organism complexity To study domain architecture of proteins and to.
Argonaute proteins Current Biology
Generating Multiple Sequence Alignments with Clustal Omega
Allelic Heterogeneity in the COH1 Gene Explains Clinical Variabilityin Cohen Syndrome  Hans Christian Hennies, Anita Rauch, Wenke Seifert, Christian Schumi,
Identification of novel F-box proteins in Xenopus laevis
Basics of Comparative Genomics
CARPEL FACTORY, a Dicer Homolog, and HEN1, a Novel Protein, Act in microRNA Metabolism in Arabidopsis thaliana  Wonkeun Park, Junjie Li, Rentao Song,
Computational genomics
Homologs in 38 fully sequenced eukaryotic genomes to 534 assigned S
Rtt101 and Mms1 in budding yeast form a CUL4DDB1‐like ubiquitin ligase that promotes replication through damaged DNA Mms1 belongs to the DDB1 family of.
L. Aravind, Eugene V. Koonin  Current Biology 
Comparative Genomic Analysis Identifies an ADP-Ribosylation Factor–like Gene as the Cause of Bardet-Biedl Syndrome (BBS3)  Annie P. Chiang, Darryl Nishimura,
Correspondence Current Biology
Presentation transcript:

Gene models and proteomes for Saccharomyces cerevisiae (Sc), Schizosaccharomyces pombe (Sp), Arabidopsis thaliana (At), Oryza sativa (Os), Drosophila melanogaster (Dm), Anopheles gambiae (Ag), Caenorhabditis elegans (Ce), Mus musculus (Mm), Rattus norvegicus (Rn), and Homo sapiens (Hs) were downloaded from the NCBI website (ftp.ncbi.nlm.nih.gov).ftp.ncbi.nlm.nih.gov HMMPFAM search against several major signature databases- PFAM, TIGRFAM, SMART, and Superfamily match to one or more of the models in any one of the databases “Known (PDF)” no matches to any one of the models in any database “Unknown(POF)” Our Definition of genes with unknown function What makes species different? A study of Unique Genes

% of Genome “Unknowns” account for about 25% of each genome

% hits at each e-value “Unknowns” are not as conserved as “knowns”, even between related organisms! BLAST e-value Yeast

Outl. Sc Sp At Os Dm Ag Mm Rn Hs Ce Known Unknown Relationship tree among the 10 different genomes reveals a high degree of evolutionary divergence among “unknowns” from different species “Unknowns” have a different rate of evolution? “Unknowns” are new genes?

% Unique genes Known Unknown “Unknowns” are mainly species-specific. Representation of “unknowns” in the “unique-ome” of different species. “Unique-ome” was defined by a BLAST cut off of Between the 10 different genomes!

Known Unknown Compared to “knowns”, “Unknowns” are more disordered, less hydrophobic and shorter.

“Unknown” Conclusions Unknown genes are typically species-specific and might provide some of the keys that define species-specific differences. Unraveling the function of “unknowns” would improve our understanding of species-specific functions. Disordered protein functions are thought to include the formation and regulation of large multi-molecular assemblies that participate in important regulatory functions. Disordered regions on proteins have been reported to evolve significantly more rapidly than ordered regions. “Unknowns” are likely to be the result of greater evolutionary divergence among species leading to the establishment of new, species-specific regulatory networks.