The role of persistent identifiers in tracking taxon changes Andrew C. Jones, Richard J. White, Ewen R. Orme, School of Computer Science, Cardiff University,

The role of persistent identifiers in tracking taxon changes
Andrew C. Jones, Richard J. White, Ewen R. Orme
School of Computer Science, Cardiff University, UK
{Andrew.C.Jones | R.J.White |

Jones, White & Orme. Tracking Taxon Changes (TDWG 2009)

The Catalogue of Life
[Diagram labels: GSD; CAS; Web front-end; Other software clients of Catalogue of Life (e.g. using it as their “taxonomic backbone”)]

CoL in use

CoL & LSIDs

Concepts that stay the same
[Figure: one concept shown in two panels]
- Panel 1: Sci. name 1; Synonyms: Sci. name 2, Sci. name 3, Sci. name 4
  - Dynamic checklist lsid: urn:lsid:catalogueoflife.org: taxon: :dc
  - Annual checklist lsid: urn:lsid:catalogueoflife.org: taxon: :ac2009
- Panel 2: Sci. name 1; Synonyms: Sci. name 2, Sci. name 3, Sci. name 4
  - Dynamic checklist lsid: urn:lsid:catalogueoflife.org: taxon: :dc
  - Annual checklist lsid: urn:lsid:catalogueoflife.org: taxon: :ac2010
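
The identifiers on this slide follow the general LSID layout urn:lsid:authority:namespace:object[:revision], with "dc", "ac2009" and "ac2010" in the revision position. A minimal sketch of splitting such an identifier into its parts is given below. The object identifiers are missing from the transcript, so "example-0001" is a made-up placeholder, and the names Lsid, parse_lsid and same_object are illustrative only. What a shared object identifier implies about concept identity is a matter of CoL policy and is not assumed here.

```python
from typing import NamedTuple, Optional


class Lsid(NamedTuple):
    authority: str           # e.g. "catalogueoflife.org"
    namespace: str           # e.g. "taxon"
    object_id: str           # placeholder values only in this sketch
    revision: Optional[str]  # e.g. "dc", "ac2009"; None when absent


def parse_lsid(text: str) -> Lsid:
    """Split urn:lsid:authority:namespace:object[:revision] into its parts."""
    parts = text.strip().split(":")
    if len(parts) not in (5, 6):
        raise ValueError(f"not a well-formed LSID: {text!r}")
    if [p.lower() for p in parts[:2]] != ["urn", "lsid"]:
        raise ValueError(f"not a well-formed LSID: {text!r}")
    if any(not p for p in parts[2:5]):
        raise ValueError(f"LSID has an empty component: {text!r}")
    revision = (parts[5] or None) if len(parts) == 6 else None
    return Lsid(parts[2], parts[3], parts[4], revision)


def same_object(a: str, b: str) -> bool:
    """True when two LSIDs differ at most in their revision component."""
    return parse_lsid(a)[:3] == parse_lsid(b)[:3]
```

For example, parse_lsid("urn:lsid:catalogueoflife.org:taxon:example-0001:ac2009").revision yields "ac2009".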

Evolving concepts in dynamic & annual checklist
[Figure: three successive states of the checklist]
- State 1: Sci. name 1 (Synonyms: Sci. name 2, Sci. name 3, Sci. name 4)
- State 2: Sci. name 1 (Synonyms: Sci. name 3); Sci. name 2 (Synonyms: Sci. name 4)
- State 3: Sci. name 1 (Synonyms: Sci. name 3, Sci. name 5); Sci. name 2 (Synonyms: Sci. name 4)
- Identifiers shown: five dynamic checklist lsids (urn:lsid:catalogueoflife.org: taxon: :dc) and three annual checklist lsids (one urn:lsid:catalogueoflife.org: taxon: :ac2009, two urn:lsid:catalogueoflife.org: taxon: :ac2010)

Data integration and the CoL
Two sources of information about species x:
- Do they refer to the same concept?
  - Same persistent identifier
- If not, how are the concepts related; what can we infer?
  - Different persistent identifiers
  - Needs something like TCS
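
The decision on this slide can be sketched as a small lookup. This is a hedged illustration rather than any CoL service: the Record type, the compare_records function, the identifier strings and the relationship table are all hypothetical, and the table stands in for whatever TCS-like relationship metadata would be available.

```python
from dataclasses import dataclass
from typing import Dict, Tuple


@dataclass(frozen=True)
class Record:
    source: str      # which database supplied the record
    name: str        # scientific name as given by the source
    concept_id: str  # persistent identifier of the taxon concept


def compare_records(a: Record, b: Record,
                    relationships: Dict[Tuple[str, str], str]) -> str:
    """Say how the concepts behind two records are related."""
    if a.concept_id == b.concept_id:
        # Same persistent identifier: both records refer to one concept.
        return "same concept"
    # Different identifiers: fall back on recorded concept relationships.
    return relationships.get((a.concept_id, b.concept_id), "unknown")


# Hypothetical identifiers and relationship metadata, for illustration only.
known = {("concept:A", "concept:B"): "includes"}
r1 = Record("source 1", "Sci. name 1", "concept:A")
r2 = Record("source 2", "Sci. name 1", "concept:B")
print(compare_records(r1, r2, known))
```

Note that the two records carry the same scientific name but different concept identifiers, which is the case where names alone would be misleading.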

Specimen data & changing concepts

Using data associated with changing concepts
- Pipistrellus pipistrellus sensu lato (45 & 55 kHz) (Pre-1999)
- Pipistrellus pipistrellus sensu stricto (Common Pipistrelle; 45 kHz)
- Pipistrellus pygmaeus (Soprano Pipistrelle; 55 kHz)

Don't know which new species these observations relate to ... but still applicable to genus Pipistrellus
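
The fallback to genus can be written down as a small rule: if the concept an observation was identified against has since been divided and nothing tells the parts apart, attach the observation to the genus the parts share. The sketch below assumes a hand-built SUCCESSORS table and a hypothetical function name; it is not drawn from the CoL.

```python
from typing import Dict, List

# Hypothetical table: an older concept and the concepts that replaced it.
SUCCESSORS: Dict[str, List[str]] = {
    "Pipistrellus pipistrellus sensu lato": [
        "Pipistrellus pipistrellus sensu stricto",
        "Pipistrellus pygmaeus",
    ],
}


def most_specific_taxon(concept: str) -> str:
    """Return the narrowest taxon an observation of `concept` still supports."""
    successors = SUCCESSORS.get(concept, [concept])
    if len(successors) == 1:
        return successors[0]
    # Several candidate species: keep only what they have in common.
    genera = {name.split()[0] for name in successors}
    if len(genera) == 1:
        return genera.pop()
    raise LookupError(f"successors of {concept!r} span several genera")


print(most_specific_taxon("Pipistrellus pipistrellus sensu lato"))
```

A record that is already identified against one of the newer concepts is returned unchanged.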

Worse still …
- Though CoL taxa have a precise circumscription when defined, it is difficult to know that concept precisely when applying a CoL persistent identifier
- Identification keys for CoL taxa?

Capturing taxon concept changes
- Changed persistent identifiers from source databases; or
- Detecting changes by comparison
  - Same synonyms, parent taxon, etc.?
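
Detection by comparison could look roughly like the following sketch, which checks the properties the slide names (synonyms, parent taxon) plus the accepted name. The TaxonSnapshot type, the diff_taxon function and the parent name "Genus A" are assumptions made for illustration; which differences should count as a change of concept is left open, as on the slide.

```python
from dataclasses import dataclass
from typing import FrozenSet, List


@dataclass(frozen=True)
class TaxonSnapshot:
    accepted_name: str
    synonyms: FrozenSet[str]
    parent: str


def diff_taxon(old: TaxonSnapshot, new: TaxonSnapshot) -> List[str]:
    """List the properties that differ between two versions of a taxon."""
    changes = []
    if old.accepted_name != new.accepted_name:
        changes.append("accepted name")
    if old.synonyms != new.synonyms:
        changes.append("synonyms")
    if old.parent != new.parent:
        changes.append("parent taxon")
    return changes


# Two versions of a taxon; "Genus A" is a hypothetical parent.
before = TaxonSnapshot("Sci. name 1",
                       frozenset({"Sci. name 2", "Sci. name 3", "Sci. name 4"}),
                       "Genus A")
after = TaxonSnapshot("Sci. name 1", frozenset({"Sci. name 3"}), "Genus A")
print(diff_taxon(before, after))
```

An empty list means none of the compared properties changed, which does not by itself prove the concept is unchanged.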

Representing the changes
- Persistent identifier metadata
  - Taxon concept relationships, e.g. isCongruentTo; includes; overlaps
- Granularity?
  - Many species changed due to underlying cause, e.g. splitting a genus?
  - Higher taxa need relationship metadata too
  - Additional explanatory metadata attached to species (set of relationships between relevant higher taxa)?
  - Explicit representation of the actions leading to change, e.g. “split”, “merge” & “transfer”?
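
One way to read the last point is that an action record is the primary metadata and set relationships are derived from it. The sketch below assumes a hypothetical Action type and derives "includes" relationships for "split" and "merge" only. It deliberately leaves "transfer" out, since the slide does not say which relationship a transfer should produce.

```python
from dataclasses import dataclass
from typing import List, Tuple

Relationship = Tuple[str, str, str]  # (subject concept, relationship, object concept)


@dataclass(frozen=True)
class Action:
    kind: str                # "split" or "merge" in this sketch
    before: Tuple[str, ...]  # concept identifiers before the change
    after: Tuple[str, ...]   # concept identifiers after the change


def derive_relationships(action: Action) -> List[Relationship]:
    """Derive set relationships between concepts from a recorded action."""
    if action.kind == "split":
        (old,) = action.before  # a split starts from exactly one concept
        return [(old, "includes", new) for new in action.after]
    if action.kind == "merge":
        (new,) = action.after   # a merge ends in exactly one concept
        return [(new, "includes", old) for old in action.before]
    raise ValueError(f"no derivation rule for action kind {action.kind!r}")


# Hypothetical concept identifiers, for illustration only.
split = Action("split", ("concept:old",), ("concept:a", "concept:b"))
print(derive_relationships(split))
```

Storing the action keeps the cause of the change, which the derived relationships alone would lose.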

Issues for discussion
- Differing perspectives of users, providers (and computer scientists)
- Need for conventions in describing evolving checklists
- Metadata describing actions, not just set relationships?
- Services to support data integration exploiting persistent identifiers
- When does a concept really change?
Some URLs...
4D4Life project:
4D4Life questionnaire: