Biological nomenclature in the postgenomic era: Biological and computational issues. George Garrity and Catherine Lyons Bergey’s Manual Trust and Explicatrix,

Slides:



Advertisements
Similar presentations
How to Use This Presentation
Advertisements

Zoology 305 Library Databases/Indexes Lab Goals for session: 1) Meet your librarian Kevin Messner 2) Understand.
LG 4 Outline Evolutionary Relationships and Classification
How to publish genomic Data papers based on BOL data - Biodiversity Data Journal Lyubomir Penev Bulgarian Academy of Sciences & Pensoft Publishers ViBRANT.
Diana Hernandez Integrating the catalogue of Mexican biota: different approaches for different client perspectives.
Biology Chapter 18 Test Review B
Alyxia Banks ex R.Br. Alyxia rubricaulis subsp poyaensis Boiteau Alyxia rubricaulis (Baill.) Guillaumin genus: species: subspecies:
Integrated Taxonomic Information System Janet Gomon, Deputy Director, ITIS Smithsonian Institution Museum of Natural History The.
Chapter 25/26 Taxonomy and Biodiversity Evolutionary biology The major goal of evolutionary biology is to reconstruct the history of life on earth ►Process:
Alberts, Bray, Hopkins, Johnson Copyright © 2004 Pearson Education, Inc., publishing as Benjamin Cummings Professor: Dr. Barjis Room: P313 Phone: (718)
SDD: Structured Descriptive Data Gregor Hagedorn (Germany) Bob Morris (USA) Kevin Thiele (Australia)
Chapter 15: Classification
Classification of Organisms. Categories of Biological Classification Scientists Assign Organisms Two-Word Names 2,000 yrs ago, Aristotle grouped plants.
Microbial Taxonomy and the Evolution of Diversity 1 19 Copyright © McGraw-Hill Global Education Holdings, LLC. Permission required for reproduction or.
Phylogeny Systematics Cladistics
Chapter 18 Classification
Scratchpads Publishing biodiversity: The interplay between Scratchpads and the Biodiversity Data Journal Dr Dimitrios Koureas Biodiversity Informatics.
Names for Life Catherine Lyons 1 and George M. Garrity 2,3 1 Explicatrix LLC, 2 Michigan State University and 3 Bergey’s Manual Trust Names for Life and.
Arthur ChapmanData Quality Training SABIF June 2012 Taxonomic and Nomenclature Data A. D. Chapman Data Quality.
BIO 244 GENERAL MICROBIOLOGY
© Tefko Saracevic, Rutgers University1 metadata considerations for digital libraries.
CLASSIFICATION & NOMENCLATURE of VIRUSES A large number of morphologically and physico ‑ chemically distinct types of viruses that infect virtually all.
Codebook Centric to Life-Cycle Centric In the beginning….
CHAPTER 25 TRACING PHYLOGENY. I. PHYLOGENY AND SYSTEMATICS A.TAXONOMY EMPLOYS A HIERARCHICAL SYSTEM OF CLASSIFICATION  SYSTEMATICS, THE STUDY OF BIOLOGICAL.
CLASSIFICATION & NOMENCLATURE of VIRUSES A large number of morphologically and physico ‑ chemically distinct types of viruses that infect virtually all.
AN INTRODUCTION TO TAXONOMY: THE BACTERIA
KIPO’s progress on ST.96. Contents II. Projects I. I. Progress on XML Standards III. Future plans.
Phylogeny and the Tree of Life
Bioinformatics Forum: March 14-15, 2005 National Institute for Environmental Studies Bioinformatics Forum: March 14-15, 2005 Names for life An Introduction.
Microbial taxonomy and phylogeny
Classification of Organisms. The study of the kinds and diversity of organisms and their evolutionary relationships is called systematics or taxonomy.
Scratchpads Publication Module - A paradigm shift in publishing RBG Kew, Seminar,
BIOLOGICAL CLASSIFICATION. Taxonomy  Biological classification, or scientific classification in biology, is a method by which biologists group and categorize.
17.1 History of Classification
Warm-Up: The shaded sequence of nucleotides is for a gene from DNA that is similar to what you might find from a living human (Living DNA). The rest are.
Systematics the study of the diversity of organisms and their evolutionary relationships Taxonomy – the science of naming, describing, and classifying.
Prokaryote Taxonomy & Diversity Classification, Nomenclature & Identification Phenetic Classification Molecular Phylogeny Approach Classification (hierarchical.
Species  OTUs  OPUs  Species  OTUs  OPUs. Rosselló-Mora & Amann 2001, FEMS Rev. 25:39-67 Taxa circumscription depends on the observable characters.
QUIZ What is the science that describes, names and classifies organisms? Linnaeus classified organisms according to their ______ & ______. (True or False)
NCBI’s Bioinformatics Resources Michele R. Tennant, Ph.D., M.L.I.S. Health Science Center Libraries U.F. Genetics Institute January 2015.
Updated: January 2015 By Jerald D. Hendrix. A. Classification Systems B. Levels of Classification C. Definition of “Species” D. Nomenclature E. Useful.
GLOBAL BIODIVERSITY INFORMATION FACILITY Cataloging and using Taxonomic Data The Global Names Architecture David Remsen Senior Programme Officer, ECAT.
This material was developed by Duke University, funded by the Department of Health and Human Services, Office of the National Coordinator for Health Information.
1 Literature review. 2 When you may write a literature review As an assignment For a report or thesis (e.g. for senior project) As a graduate student.
Joint Declaration of Data Citation Principles Notes [1] CODATA 2013: sec 3.2.1; Uhlir (ed.) 2012, ch 14; Altman &
Scratchpads The virtual research environment for biodiversity data Simon Rycroft, Dave Roberts, Vince Smith, Alice Heaton, Katherine Bouton, Laurence Livermore,
Organizing information in the post-genomic era The rise of bioinformatics.
Systematics: The Science Of Biological Diversity Chapter 12
PIRSF Classification System PIRSF: Evolutionary relationships of proteins from super- to sub-families Homeomorphic Family: Homologous proteins sharing.
Alternative Architecture for Information in Digital Libraries Onno W. Purbo
Don’t make me think Biodiversity Data Publishing Made Easy Laurence Livermore, Vince Smith, Alice Heaton, Simon Rycroft, Ed Baker, Ben Scott & Lyubomir.
Scratchpads and the new Biodiversity Data Journal Biodiversity Data Publishing made… easier Dimitris Koureas Natural History Museum London.
Phylogeny & the Tree of Life
Primary vs. Secondary Databases Primary databases are repositories of “raw” data. These are also referred to as archival databases. -This is one of the.
Acronym Soup GBIF, TDWG & GUIDs Jerry Cooper. Global Biodiversity Information Facility (GBIF) Established in 2000 through non-binding MOU (25 countries.
Classification. Cell Types Cells come in all types of shapes and sizes. Cell Membrane – cells are surrounded by a thin flexible layer Also known as a.
HISCOM An Australian Virtual Herbarium Jim Croft Australian National Herbarium.
The History of Classification Copyright © McGraw-Hill Education Early Systems of Classification Classification is the grouping of objects or organisms.
Classification.
Copyright © 2005 Brooks/Cole — Thomson Learning Biology, Seventh Edition Solomon Berg Martin Chapter 22 Understanding Diversity: Systematics.
CLASSIFICATION Why Classify?. INQUIRY ACTIVITY 1) Construct a table with six rows and six columns. Label each row with the name of a different fruit.
Bergey’s Manual Trust What is Bergey’s Manual Trust? –Non-profit, private organization –Produces updated classification and descriptive information of.
GENBANK FILE FORMAT LOCUS –LOCUS NAME Is usually the first letter of the genus and species name, followed by the accession number –SEQUENCE LENGTH Number.
Joint Declaration of Data Citation Principles (Overview) The Data Citation Synthesis Group Joint Declaration.
Classification Biology I. Lesson Objectives Compare Aristotle’s and Linnaeus’s methods of classifying organisms. Explain how to write a scientific name.
Phylogeny & the Tree of Life
CLASSIFICATION VOCABULARY
Biological Classification Honors Biology.
IEEE R Comment Resolution
SUBMITTED BY: DEEPTI SHARMA BIOLOGICAL DATABASE AND SEQUENCE ANALYSIS.
Presentation transcript:

Biological nomenclature in the postgenomic era: Biological and computational issues. George Garrity and Catherine Lyons Bergey’s Manual Trust and Explicatrix, LLC

Imagine.. A clinical microbiologist’s predicament The microbial ecologist’s dilemma The case of Francisella novicida The history of the Altermonadaceae –Genus described in emendations, 20 species –19 moved to four genera –5 synonyms, two subspecies –64 names, five genera, three families, two classes The common thread in all these stories…

Stan Falkow’s Underwear “Given a choice, most taxonomists would rather wear each other’s underwear than use each other’s names” Why is this so?

My objective Share some insights on problems in three areas –Nomenclature and taxonomy –Publishing taxonomic information –A generalized taxonomic model Finite state machine Simple grammar –Global issues Data equivalence Data provenance Data curation

Problems in nomenclature Systematic biologists –Marking territory –Personal achievement Other biologists –End-users Unfamiliar with literature –Unique aspects Unaware of Codes of Nomenclature –Legalistic framework »Formation and assignment of names »Circumscription and emendation of taxa »Priority and citation »Synonymy and homonymy »Correction of orthographic errors »Adjudication of nomenclatural disputes –But »Do not govern classification or identification

–Biological names Primary entry point into STM literature Prominent role in laws/regulations –Commerce, public safety, public health Primary entry point into scientific databases Poor identifiers –Fixed in time and scope –May not be revised –Synonymies generally not address –Persist, but »obsolesce in relation to taxon »An archival record of a taxonomic definition for a single point in time Problems in nomenclature (cont.)

The name/taxon disjunction Impact –Accumulation of dubious names in literature/databases –Effects assertions of: Identity, commonality of pathways, common ancestry, homology, parology, xenology Legal consequences

Problems in print publishing Key requirement –Proposals and emendations must appear in print Code specific –Prokaryotic Code »Effective, legitimate, and valid »Registration Taxonomies are retrospective –Can only cite earlier publications –Cannot cite future emendations –Increasingly based on molecular sequence data Deposit of sequence data in public databases –Not conveniently referenced in print

Problems with electronic publishing No formal publishing mechanisms –Does not fulfill fundamental requirement of the Code(s) –Lack bibliographic information Not citable Not persistent –Subject to uncontrolled change –May disappear Link rot –404 Link not found

A brief glimpse at where we’re headed The Bergamot/N4L model –Separates names from taxa Taxa nameless –Uniquely, persistently identified –Supports multiple, overlapping taxonomies Accumulation of new data vs. new methodologies Rank agnostic –Unique from all other approaches An identifier resolution service, not an information space in which to practice taxonomy. –Names provide an entry point into the literature Reliably Persistently A lightweight information layer

A simple grammar species -> current.name.pointer, exemplar.deposit.pointer+, sequence.deposit.pointer+ taxon -> current.name.pointer, nomos.defined.data, (taxon+|species+) nomos.defined.data -> (sequence|phenotypic.feature|text)+ name -> (citation, bibliographic.record, name.status) exemplar -> exemplar.id, source sequence -> gene, sequence.deposit source -> exemplar|exemplar.deposit|text exemplar.deposit -> brc.id.pointer, deposit.id.pointer, source sequence.deposit -> brc.id.pointer, deposit.id.pointer, source phenotypic.feature -> feature.name, feature.value, deposit.id.pointer

Exemplar+ Sequence+ Name+ Taxon Species+

Exemplar+ Sequence+ Name+ Taxon Literature Governing bodies GenBank DDBJ EMBL others Collections BRC Species+

Taxon Exemplar+ Sequence+ Name+ Species+ Literature Governing bodies GenBank DDBJ EMBL others Collections BRC Practitioner + genotypic “omics” Proposal STM Legal Databases Priority Validity Synonymy Exemplar req. phenotypic direct indirect BRC PublicPrivate General

Exemplar+ Sequence+ Name+ Species+ A properly formed species Sequence+ Name+ Species+ Candidatus or exemplar lost Sequence+ Environmental sequence Exemplar+ Name+ Species+ Old type strain, not yet sequenced Name+ Species+ Old type, exemplar based on drawing or description Sequence+ “Name”+ Misidentifed taxon Exemplar*

Exemplar+ Sequence+ Name+ Taxon N4L/Bergamot Literature Governing bodies GenBank DDBJ EMBL others Collections BRC Species+

A bit of background information Bergey’s Manual Trust –Principal information source Bergey’s Manual of Determinative Bacteriology Bergey’s Manual of Systematic Bacteriology Taxonomic Outline of the Procaryotes

A bit of background information Bergey’s Manual Trust –Principal information source Bergey’s Manual of Determinative Bacteriology Bergey’s Manual of Systematic Bacteriology Taxonomic Outline of the Procaryotes

A bit of background information Bergey’s Manual Trust –Principal information source Bergey’s Manual of Determinative Bacteriology Bergey’s Manual of Systematic Bacteriology Taxonomic Outline of the Procaryotes –Expertise in content packaging/delivery SGML/XML publishing –The Systematics »XML compliant SGML instance

A bit of background information Bergey’s Manual Trust –Principal information source Bergey’s Manual of Determinative Bacteriology Bergey’s Manual of Systematic Bacteriology Taxonomic Outline of the Procaryotes –Expertise in content packaging/delivery SGML/XML publishing –The Systematics »XML compliant SGML instance –The Outline »An experiment in SGML/XML publishing

A bit of background information Bergey’s Manual Trust –Principal information source Bergey’s Manual of Determinative Bacteriology Bergey’s Manual of Systematic Bacteriology Taxonomic Outline of the Procaryotes –Expertise in content packaging/delivery SGML/XML publishing –The Systematics »XML compliant SGML instance –The Outline »An experiment in SGML/XML publishing

A bit of background information Bergey’s Manual Trust –Principal information source Bergey’s Manual of Determinative Bacteriology Bergey’s Manual of Systematic Bacteriology Taxonomic Outline of the Procaryotes –Expertise in content packaging/delivery SGML/XML publishing –The Systematics »XML compliant SGML instance –The Outline »An experiment in SGML/XML publishing –Derivative projects »Bergamot/N4L »The Determinative