Globally Unique Identifiers in Biodiversity Informatics Kevin Richards Landcare Research NZ TDWG 2008.

Slides:



Advertisements
Similar presentations
The Corporation for National Research Initiatives The Handle System Persistent, Secure, Reliable Identifier Resolution.
Advertisements

IDF open meeting 2007 doi>. Eight possible innovations doi> Innovative uses of the DOI System.
TDWG GUID-2 June 10, 2006Jessie Kennedy/Rob Gales LSID Resolution In SEEK Taxon.
GUIDs in EMu Ian Turnbull KE Software. GUID? UUID? A Globally Unique Identifier (GUID) is a persistent unique reference number used as an identifier.
DDI3 Uniform Resource Names: Locating and Providing the Related DDI3 Objects Part of Session: DDI 3 Tools: Possibilities for Implementers IASSIST Conference,
GUID-1 Workshop Welcome and Introduction Donald Hobern GBIF Program Officer for Data Access and Database Interoperability February 2006.
Persistent identifiers – an Overview Juha Hakala The National Library of Finland
Globally Unique Identifiers and Life Science Identifiers Dave Thau University of Kansas California Academy of Sciences
Web development  World Wide Web (web) is the Internet system for hypertext linking.  A hypertext document (web page) is an online document. It contains.
URI IS 373—Web Standards Todd Will. CIS Web Standards-URI 2 of 17 What’s in a name? What is a URI/URL/URN? Why are they important? What strategies.
Handle System Namespace and Service Definition Sam X. Sun
Hands-On Microsoft Windows Server 2003 Networking Chapter 7 Windows Internet Naming Service.
1 CS 502: Computing Methods for Digital Libraries Lecture 4 Identifiers and Reference Links.
GLOBAL BIODIVERSITY INFORMATION FACILITY Greg Riccardi Co-chair 9 November Outcomes of the GBIF LSID-GUID Task Group.
CORDRA Philip V.W. Dodds March The “Problem Space” The SCORM framework specifies how to develop and deploy content objects that can be shared and.
Tutorial -01. Objective In this session we will discuss about : 1.What is MVC? 2.Why MVC? 3.Advantages of MVC over ASP.NET 4.ASP.NET development models.
UNIT-V The MVC architecture and Struts Framework.
Distributed Computing COEN 317 DC2: Naming, part 1.
Chinese-European Workshop on Digital Preservation, Beijing July 14 – Network of Expertise in Digital Preservation 1 Persistent Identifiers Reinhard.
XP New Perspectives on XML Tutorial 4 1 XML Schema Tutorial – Carey ISBN Working with Namespaces and Schemas.
Locating objects identified by DDI3 Uniform Resource Names Part of Session: Concurrent B2: Reports and Updates on DDI activities 2nd Annual European DDI.
Tobias Weigel (DKRZ) Tobias Weigel Deutsches Klimarechenzentrum (DKRZ) Persistent Identifiers Solving a number of problems through a simplistic mechanism.
Presented by DOI Create: TERN as a use-case Siddeswara Guru
Resolving Unique and Persistent Identifiers for Digital Objects Why Worry About Identifiers? Individuals and organizations, including governments and businesses,
LAYING OUT THE FOUNDATIONS. OUTLINE Analyze the project from a technical point of view Analyze and choose the architecture for your application Decide.
LSIDs and RDF Kevin Richards TDWG Introduction Kevin Richards (Landcare Research NZ) –Landcare Informatics group –GUID Subgroup –LSID.NET code port.
Piero Attanasio mEDRA: the European DOI agency The DOI as a tool for interoperability between private and public sector Athens, 14 January.
Enabling Embedded Systems to access Internet Resources.
OCLC Online Computer Library Center Erpanet Symposium on Persistent Identifiers PURLs Stuart Weibel Senior Research Scientist June 17, 2004.
Globally Unique Identifiers Workshop (GUID-1) International Working Group on Taxonomic Databases - TDWG Global Biodiversity Information Facility - GBIF.
Integrating Live Plant Images with Other Types of Biodiversity Records Steve Baskauf Vanderbilt Dept. of Biological Sciences
Distributed Computing COEN 317 DC2: Naming, part 1.
XP New Perspectives on The Internet, Sixth Edition— Comprehensive Tutorial 1 1 Browser Basics Introduction to the Web and Web Browser Software Tutorial.
NOVA Networked Object-based EnVironment for Analysis P. Nevski, A. Vaniachine, T. Wenaus NOVA is a project to develop distributed object oriented physics.
IPv4/IPv6, HIP, E.164, URI/URN/URL, DOI Overviews Sep Seungjae Lee
TDWG Infrastructure Project 1. Project Status Lee Belbin & Donald Hobern.
Ricardo Pereira Software Engineer TDWG Infrastructure Project (TIP)
TDWG Life Sciences Identifiers Applicability Statement Ben Richardson Review Manager, LSID Applicability Statement Western Australian Herbarium Department.
M O S A i C S MOSAICS Brussels 5-6 October 2005 © 2005 Belgian Science Policy. I Virginie Storms Belgian Science Policy Office Laboratory for Microbiology,
Working with XML Schemas ©NIITeXtensible Markup Language/Lesson 3/Slide 1 of 36 Objectives In this lesson, you will learn to: * Declare attributes in an.
Module - Identifiers The DSpace Course. Module Overview  By the end of this module you will:  Understand what persistent identifiers are, how they work.
P088; Presented in Canberra, 27 th March, 2008 GR000: Presented in Fremantle on 20 th October, 2008 GAIA RESOURCES Experiences in mobilizing biodiversity.
LSIDs and RDF in TDWG Roger Hyam, TDWG, RBGE Donald Hobern, GBIF June 7-9, Edinburgh, UK.
LTER, PASTA, and persistent identifiers LTER IMC Water Cooler Series January 2011.
Persistent Identifiers (PIDs) & Digital Objects (DOs) Christine Staiger & Robert Verkerk SURFsara.
DSpace System Architecture 11 July 2002 DSpace System Architecture.
Digital Object Identifier doi> Norman Paskin The International DOI Foundation W3C DRM workshop January 22/
Managed by UT-Battelle for the Department of Energy Kay Kasemir ORNL/SNS 2012, January 9-12 at NSRRC, Taiwan Control System Studio Training.
Steven Perry Dave Vieglais. W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Overview WASABI is a framework for.
Converting an Existing Taxonomic Data Resource to Employ an Ontology and LSIDS Jessie Kennedy Rob Gales, Robert Kukla.
Low-Risk Persistent Identification: the “Entity” (N2T) Resolver 10 October 2006 John Kunze, California Digital Library, University of California.
TDWG Infrastructure Project (TIP) Globally Unique Identifiers (GUID) Donald Hobern - Ricardo Pereira TDWG Executive Meeting June 1-2, Madrid, Spain.
Globally Unique Identifiers: What, why, when, which and what now? Dave Thau University of Kansas
1 CS 502: Computing Methods for Digital Libraries Guest Lecture William Y. Arms Identifiers: URNs, Handles, PURLs, DOIs and more.
MSc Publishing on the Web Week 4 Image Maps. Aims and Objectives Discover what are image maps To understand the different types of image map To understand.
Web Design – Week 2 Introduction to website basics Website basics: How the Web Works Client / server architecture Packet switching URL components.
Life Science Identifiers Chris Wroe (based on material from myGrid team and IBM Life Sciences)
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Domain Name System The Technology Context Presentation.
PIDs and National PID Services
Introduction to Persistent Identifiers
Jessie Kennedy Rob Gales, Robert Kukla
Introduction to Persistent Identifiers
A step-by-step guide to DOI registration
Service Metadata Registry (COSMOS)
Chapter 27 WWW and HTTP.
PID‘s ( in theory land ) M. Dreyer.
Tech introduction.
New Perspectives on XML
Presentation transcript:

Globally Unique Identifiers in Biodiversity Informatics Kevin Richards Landcare Research NZ TDWG 2008

Introduction GUID (Globally Unique IDentifier) –What, Why, Which, How –LSIDs –Issues

What are GUIDs Globally Unique IDentifier A short name for a complex entity on the web Each name identifies only one entity Examples: –UUID eg 3E9D6B68-A08C-4F15-BC8A-1265F15D30E2 –DOI eg doi: /jmbi –Handle eg hdl: /abc –LSID eg urn:lsid:indexfungorum.org:names: –PURL eg

What is a GUID –Properties Persistent Opaque Resolvable, sometimes - useful for locating information about the entity

Why use GUIDs Data at Provider 2 BOOK : “Three little pigs” 2 copies Data Consumer Data at Provider 1 BOOK : “The three little pigs” 3 copies BOOKS: “Three little pigs” … (2) “The three little pigs” … (3)

Data at Provider 2 (ID = P2) BOOK : “Three little pigs” ID (eg ISBN) = A123 2 copies Data Consumer Data at Provider 1 (ID = P1) BOOK : “The three little pigs” ID (eg ISBN) = A123 3 copies BOOKS: ID : A123 : “The three little pigs”… (5) … but with GUIDs … BOOK Titles: ID A123 : Provider P1 : “The three little pigs” ID A123 : Provider P2 : “Three little pigs”

Example in our domain Consensus Id : urn:lsid:compositae.org:names:45240C9B-D419-4B6F-93A5-D0A6DEAB4C81 Name : Anthemis gaudium-solis Velen. ProviderIdTaxon Name IPNI urn:lsid:ipni.org:names: :1.1 Anthemis gaudium-solis Vel. Tropicos Anthemis goudium-solis Velen. Euro+Med133202Anthemis gaudium-solis Velen. Govaerts {29FFBEDC-19F BCB3-05EE2C7816C8} Anthemis gaudiumsolis Velen.

GUIDs are vital to TDWG architecture

Which GUID GUID Subgroup Recommendations: Use LSIDs for identifying biodiversity data Reuse GUIDs where they already exist –GUID type –Existing assignments See GUID Report Also Canberra LSID Workshop report:

What is an LSID? Life Science IDentifier Developed by The Object Management Group & W3C Implemented by the team at IBM Used for – data objects, datasets, images, files

LSID Format urn:lsid:bioguid.org:taxon:1122:v1 Prefix - indicates that this is a URN URN type - indicates that it’s an LSID-type urn Authority - the authority who issued the LSID Namespace - internal to that authority Object identifier - within that authority Version - optional

LSID Rules Data doesn’t change (byte identical) Always available for resolution –Hand over to another authority if necessary At least some basic metadata

Pros of LSIDs  Not tied to physical addresses (as URLs are)  Comparison can be done without resolving the ID – eg for cases like “does object a = object b”  Do not require any central registration or central service  Quick to adopt  Encourage thought and planning before they are allocated

Cons of LSIDs However …  Requires DNS SRV record  Requires specialised software to resolve an LSID (not built in to most software)  The restriction - “LSID data cannot change” can be difficult

How What data/objects to apply Ids to Decide on –Authority –Namespace –Local ids (new vs existing) Issue LSIDs Setup resolver

LSID Code Current Code Stacks –Open Source (sourceforge.net) –Java, C++, Perl (IBM) –Microsoft.NET (Myself) –TAPIR LSID configuration

LSID Tools IBM LSID Launchpad Firefox LSID Browser LSID Tester (Rod Page) Web based resolver – get LSID metadata to get summary info of LSID object Example LSID servers: –Index Fungorum - urn:lsid:indexfungorum.org:names: –IPNI – urn:lsid:ipni.org:names: : –uBio - urn:lsid:ubio.org:namebank:11815

Issues to think about Who assigns new LSIDs? Who maintains LSID resolvers? What to assign LSIDs to: –Physical or Digital –Granularity –Only objects that need to be resolved / identified externally –Is there any data, or only metadata?

Issues to think about When to resolve LSIDs –Every time an LSID is encountered, or only when a client requests it? TDWG standards for metadata –Which ones? –Consistent application

References LSID Source Forge - LSID.NET Source Forge - LSID Tutorial ibm.com/developerworks/opensource/library/os-lsid/ 128.ibm.com/developerworks/opensource/library/os-lsid/ LSID Specification - LSID Tester - LSID Launchpad ibm.com/developerworks/downloads/detail.php?group_id=124&what= rele&id=553http://www- 124.ibm.com/developerworks/downloads/detail.php?group_id=124&what= rele&id=553 GUID Subgroup - GUID Subgroup Reports – show_comments=1http://wiki.gbif.org/guidwiki/wikka.php?wakka=GUID2Report& show_comments=1 – Firefox LSID developer site -