1 Barcodes and Zoocodes David J Patterson

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

28 March 2003e-MapScholar: content management system The e-MapScholar Content Management System (CMS) David Medyckyj-Scott Project Director.
Australian Faunal Directory (AFD) and Australian Plant Census (APC): Content, Architecture and Services Documenting and delivering nomenclature and taxonomy.
How to publish genomic Data papers based on BOL data - Biodiversity Data Journal Lyubomir Penev Bulgarian Academy of Sciences & Pensoft Publishers ViBRANT.
1 A Systematic Review of Cross- vs. Within-Company Cost Estimation Studies Barbara Kitchenham Emilia Mendes Guilherme Travassos.
GUID-1 Workshop Welcome and Introduction Donald Hobern GBIF Program Officer for Data Access and Database Interoperability February 2006.
Y.Pancheshnikov, ACRL, 2003 Course-Centered Collection Evaluation in the Agricultural Sciences for University Instructional Program Reviews Yelena Pancheshnikov.
SDD: Structured Descriptive Data Gregor Hagedorn (Germany) Bob Morris (USA) Kevin Thiele (Australia)
Recognition, Identification and Names Spring 2014.
Internal documentation and user documentation
Corals and sea anemones on line: a functioning biodiversity database D. G. Fautin R. W. Buddemeier University of Kansas: Department of Ecology and Evolutionary.
1 Knowledge Management Session 4. 2 Objectives 1.What is knowledge management? Why do businesses today need knowledge management programs and systems.
DNA Barcodes: Linking GenBank records to Museum Specimens David E. Schindel, Executive Secretary, CBOL Robert Hanner, University of Guelph.
Software Engineering General Project Management Software Requirements
A Methodology for Developing a Taxonomy – A Subject Oriented Approach
Developing Ideas for Research and Evaluating Theories of Behavior
©Ian Sommerville 2000 Software Engineering, 6th edition. Chapter 5 Slide 1 Requirements engineering l The process of establishing the services that the.
Dimensions of Data Quality M&E Capacity Strengthening Workshop, Addis Ababa 4 to 8 June 2012 Arif Rashid, TOPS.
BIS TDWG Conference 28 October 2013, Florence Documenting data quality in a global network: the challenge for GBIF Éamonn Ó Tuama, Andrea Hahn, Markus.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
RDF (Resource Description Framework) Why?. XML XML is a metalanguage that allows users to define markup XML separates content and structure from formatting.
Advances in Technology and CRIS Nikos Houssos National Documentation Centre / National Hellenic Research Foundation, Greece euroCRIS Task Group Leader.
Richard White Biodiversity Data. Outline Biodiversity: what is it? – Definitions: is biodiversity: A resource? Something which can be measured? How to.
Lesson 4: Using HTML5 Markup.  The distinguishing characteristics of HTML5 syntax  The new HTML5 sectioning elements  Adding support for HTML5 elements.
Classification of Organisms. The study of the kinds and diversity of organisms and their evolutionary relationships is called systematics or taxonomy.
1 Introduction to Modeling Languages Striving for Engineering Precision in Information Systems Jim Carpenter Bureau of Labor Statistics, and President,
South African Education Portal
World Bank’s Strategic Environmental Assessment Toolkit STRATEGIC ENVIRONMENTAL ASSESSMENT IN AFRICA REGIONAL WORKSHOP ADDIS ABABA, ETHIOPIA, June 16-19,2008.
Interoperable Digitised Content “Discover, search, extract, link, associate, and view digitised content” Les Carr.
Classification and Systematics Tracing phylogeny is one of the main goals of systematics, the study of biological diversity in an evolutionary context.
LIS 506 (Fall 2006) LIS 506 Information Technology Week 11: Digital Libraries & Institutional Repositories.
Biodiversity Informatics Sarah Faulwetter Hellenic Centre for Marine Research.
CHRIS NELSON METADATA TECHNOLOGY WORK SESSION ON STATISTICAL METADATA GENEVA 6-8 MAY 2013 Designing a Metadata Repository Metadata Technology Ltd.
Integrating Live Plant Images with Other Types of Biodiversity Records Steve Baskauf Vanderbilt Dept. of Biological Sciences
ICS-FORTH January 11, Thesaurus Mapping Martin Doerr Foundation for Research and Technology - Hellas Institute of Computer Science Bath, UK, January.
Search Engine By Bhupendra Ratha, Lecturer School of Library and Information Science Devi Ahilya University, Indore
Universal Biological Indexer and Organizer Research Funded by the Andrew W. Mellon Foundation MBL / WHOI LIBRARY The Tangled Tree of Life Informatics and.
LIR 10 Week 7 Boolean Searching and Online Periodical Databases.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
AUKEGGS Architecturally Significant Issues (that we need to solve)
Strategies for Conducting Research on the Internet Angela Carritt User Coordinator, Oxford University Library Services Angela Carritt User Education Coordinator,
IBISAdmin Utah’s Web-based Public Health Indicator Content Management System.
Systematics: The Science Of Biological Diversity Chapter 12
Lifecycle Metadata for Digital Objects November 1, 2004 Descriptive Metadata: “Modeling the World”
Tackling the Complexities of Source Evaluation: Active Learning Exercises That Foster Students’ Critical Thinking Juliet Rumble & Toni Carter Auburn University.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
Majid Sazvar Knowledge Engineering Research Group Ferdowsi University of Mashhad Semantic Web Reasoning.
Intellectual Works and their Manifestations Representation of Information Objects IR Systems & Information objects Spring January, 2006 Bharat.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Harvesting Social Knowledge from Folksonomies Harris Wu, Mohammad Zubair, Kurt Maly, Harvesting social knowledge from folksonomies, Proceedings of the.
Providing web services to mobile users: The architecture design of an m-service portal Minder Chen - Dongsong Zhang - Lina Zhou Presented by: Juan M. Cubillos.
Achieving Semantic Interoperability at the World Bank Designing the Information Architecture and Programmatically Processing Information Denise Bedford.
Created Exclusively for eBusiness and IT, Inc. “Information Over Technology” ™ April 28, 2004 © 2004 eBusiness and IT, Inc. 1 One.
Copyright © 2011 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 1 Research: An Overview.
FIND IT! USING LIBRARY CATALOGING CONCEPTS TO ORGANIZE AND MAKE RECORDS FINDABLE DIONNE L. MACK, INTERIM DIRECTOR OF QUALITY OF LIFE DEPARTMENTS.
Non-commercial biodiversity research and benefit sharing – general perspectives and challenges Christoph L. Häuser - Chair, GTI Coordination Mechanism,
Research Introduction to the concept of incorporating sources into your own work.
The Semantic Web By: Maulik Parikh.
Authority versus authenticity: the shift from labels to identifiers
The Systems Engineering Context
RCN Development of an Online Database to Enhance the Conservation of SGCN Invertebrates in the Northeastern Region James W. Fetzner Jr. & John.
Textbook Engineering Web Applications by Sven Casteleyn et. al. Springer Note: (Electronic version is available online) These slides are designed.
Systematics: The Science Of Biological Diversity Chapter 12
Data Management: The Data Repatriation Re-integration Step or …
Chapter 17: Organizing Life’s Diversity
Name authority control in an evolving landscape
Cynthia S. Parr, Robert Guralnick, Nico Cellinese, Roderic D.M. Page 
Subject Name: SOFTWARE ENGINEERING Subject Code:10IS51
Analyzing and Organizing Information
Big Data Needs Little CRUD:
Presentation transcript:

1 Barcodes and Zoocodes David J Patterson

2 Barcodes and Zoocodes Outline Protistan issues What ICZN says (and what it doesn’t) Changing landscape How bar codes might fit in

3 Barcodes and Zoocodes Protists and ambiregnal problems – Peranema trichophorum (Ehrenberg, 1830) Dujardin 1841 Pseudoperanema hyalinum Christen, 1962 Protists are neither plants nor animals Nomenclaturally, they can be treated as either or both Leading to interesting consequences Such as Peranema / Pseudoperanema - and its 2 type species

4 Barcodes and Zoocodes Of the 30+ genera of heterotrophic euglenids Anisonema Astasia Atractonema Calkinsia Dinema Distigma Dolium Entosiphon Heteronema Metanema Peranema Phacus Urceolus have homonyms Then there’s the dinoflagellates, the stramenopiles, the cryptomonads, the …

5 Barcodes and Zoocodes What is covered by the Zoo code A name, being a label or a flag for the concept – criteria are set by the code The species is nebulous – we do not know what species are, otherwise we would not be able to bring a meeting to its knees by asking the question, ‘So, what do YOU think a species is?’ Concept, a precise but inaccurate model of a species Type material – criteria are set by the code Description in a publication – criteria are set by the code Observations

6 Barcodes and Zoocodes Imprecise relationships between concepts and species May be separated from the species (bad descriptions) May overlap with the concept May be part of the range of the species (most common) Indeed more than one concept may be included within the species (subjective synonyms) Or a concept may be much broader than the species Or embrace more than one species The concept ≠ reality Concepts may be precise but are always inaccurate

7 Barcodes and Zoocodes Types – reference points for the concept There are many kinds of types For species, types are specimens Holotype, a singular entity Type series, comprised of syntypes, from which a lectotype may be selected Neotype – offers a device to create a new type

8 Barcodes and Zoocodes What can be a type An animal, or part of an animal, or the fossilized work of an animal or the work of an animal for names established before 1931 A colony or part of a colony (e.g. corals) A natural replacement, impression, mould or part thereof (72.5.4) in extant species of protistans, one or more preparations of directly individuals representing different stages of the life cycle (a hapantotype) A microscopic preparation in which the relevant type- material is clearly indicated (72.5.6) In the case of a nominal species-group taxon based on an illustration or description, or a bibliographic reference to an illustration or description, the name-bearing type is the specimen or specimens illustrated (and not the illustration or description itself).

9 Barcodes and Zoocodes How well typified are ‘protozoa’ Very poorly Foraminifera are the most speciose group, they form shells, and these are used as types Ciliates are reasonably speciose, and many recent descriptions have type material in the form of silver-stained preparations on glass slides – but these fade For most protists, only interpreted illustrations are available We have tended to use un-interpreted (photographic) type material, but this is not code-compliant Absence of good types creates unstable (imprecise) concepts that cannot be resolved with current approaches

10 Barcodes and Zoocodes Not included in the Code as type material are Living material (such as cultures) Sequence information such as barcodes

11 Barcodes and Zoocodes Relationship between barcodes and zoocode Nil

12 Barcodes and Zoocodes Why pluralize ‘Zoocodes’ The nature and role of systematics is changing to embrace informatics Zoology has begun a new phase, with the first version of an on-line names registry This will survive because nomenclature is fundamental to the management of biological information Informatics needs a unified nomenclatural foundation (i.e. no more of the parochiality of ‘plants’ and ‘animals’)

13 Barcodes and Zoocodes Names offer a logical way to search for and index content Names annotate data objects All names annotate all data objects A compilation of all names ever used is the foundation of a universal index for biology or for a semantic web for biology The significance of names

14 Barcodes and Zoocodes Indexes - what works in books doesn’t necessarily work on the internet Because names of organisms change over time or can be mis-spelled or have vernacular versions All of which will be embedded in on-line documents SO, which name to use in the index?

15 Barcodes and Zoocodes Reconciliation – linking alternative names for the same organism A query initiated with any name, can be expanded to all names and will unify data associated with each

16 Barcodes and Zoocodes Peranema – the fern And for us, most significantly, are problems of homonyms Peranema – the euglenid 14% plant generic names have also been used for non-plants

17 Barcodes and Zoocodes All pieces of information about organisms Other organizational systems Compile all names Fix names problems Classifications & other opinions Unified framework Workbench to engage the experts Applications – working with the complexity of biology Semantic web for biology TAXONOMIC INTELLIGENCE An architecture for managing biodiversity information on the web `

18 Barcodes and Zoocodes KNOWLEDGE On the fly pages for all species customised for every user PORTAL Knowledge and aggregation technology deliver species pages Information becomes knowledge-rich statements Biology 2.0 WORKBENCH An on-line environment that allows experts to create knowledge All information within a unified navigable framework UNION A comprehensive, flexible, authoritative framework for knowledge Species information placed in evolutionary context CLASSIFICATION BANK Incorporating opinions about relationships Compilations of facts about species NAMES MANAGEMENT BY NAMEBANK SERVICES Fixing names problems with taxonomic intelligence Compendia of facts indexed by names NAMES – INDEXING TERMS – NAMEBANK DATABASE All names for all organisms - the means to index all content All content

19 Barcodes and Zoocodes NOT compliant with the code BUT Acting as surrogates for type material – overcoming the lack of type material problem. As they have no validity under the code, they need to be chained to something that can be associated with traditional taxonomy – a kind of vouchering Acting as a ‘taxonomic concept’ – anything with 100% similarity to this barcode is the same entity (the uncertain relationships between the concept and species remain) As the flag, a replacement for a conventional name that can tie into an informatics environment Where may barcodes fit into this picture

20 Barcodes and Zoocodes Barcode-concepts are precise and low cost identifiers for taxa The relationship among barcode concepts and traditional concepts will need to be assessed The most discriminating barcode will be more helpful in this regard Would benefit from an (automated) protocol that will assess on a case by case basis the relationship with phylogenetic trees. What has to be done – concept reconciliation

21 Barcodes and Zoocodes What has to be done – informatics links Barcode Phylogenetic analysis - concept reconciliation links barcode to names (cultures have a role here) Taxonomic intelligence chains barcode to name and to local and distributed content

22 Barcodes and Zoocodes In sum The code is not relevant Favor the most discriminatory barcode Concept reconciliation is important but not an overwhelming challenge Embed the processes within the emergent informatics structure

23 Barcodes and Zoocodes Thank you

24 Barcodes and Zoocodes

25 Barcodes and Zoocodes Disambiguation – distinguishing spelled alike names for different things Clues (= rulesets) that allow automated tools to discriminate the euglenid from the fern Peranema Dons (the fern) Peranema Dujardin (the euglenid) Peranema and Pteridophyta vs Peranema and Euglenida Peranema trichophorum, or Peranema, Anisonema and Urceolus

26 Barcodes and Zoocodes This troika is potentially very powerful The use of barcodes is inevitable They offer an accelerated mechanism to catalog and identify (map to concepts) and instantly engage the informatics structure Where may barcodes fit into the picture

27 Barcodes and Zoocodes RSS feed reader Some examples of taxonomic intelligence in action

28 Barcodes and Zoocodes Libraries Publishers Museums Federal Agencies Who is affected by these problems? Search engines Federated databases Students and researchers Red spotted newt

29 Barcodes and Zoocodes Where may barcodes fit into the picture All pieces of information about organisms Other organizational systems Compile all names Fix names problems Classifications & other opinions A unified framework A 2.0 workbench to engage the experts Applications – working with the complexity of biology Semantic web for biology TAXONOMIC INTELLIGENCE