Networking Biodiversity Data – Online Access to Distributed Data Sources in GBIF-D
Andrea Hahn, A. Kirchhoff & W.G. Berendsohn
Botanic Garden and Botanical Museum Berlin-Dahlem, FU Berlin, Dept. of Biodiversity Informatics and Laboratories
Nov 29, 2004
A.Hahn: Networking Distributed Data Sources in GBIF-D
The German GBIF "National Nodes"
– Plants and Protists
– Fungi and Lichens
– Prokaryotes and Viruses
– Invertebrates I
– Vertebrates
– Invertebrates II
– Invertebrates III
GBIF-D Botany
The Portal of the German botanical node combines five "Areas of Expertise":
– Genetic Resources
– Phytodiversity (in-situ)
– Botanical Gardens
– Herbaria
– Phyto-Taxonomy
GBIF.de /botanik VIRTUAL HERBARIUM
Database Access
[Diagram: distributed databases DB 1, DB 2, DB 3, DB 4, DB 5, etc.; query (?) and answer (!): "Wrapper Technology"]
Technical Approach: BioCASE
Separation of transfer protocol and content:
protocol: BioCASe
content schema: ABCD
– covers all types of biological collections
– allows for a high degree of detail
– uses variable atomization
The BioCASe Protocol (1)
XML-based specification for communication between providers and consumers
Works with any content schema
Defines three basic operations:
– capabilities
– scan
– search
The BioCASe Protocol (2)
[Example request XML; the markup was lost in extraction. Surviving values: "T09:30:47-05:", "search", "[…]/abcd/1.2", "Ast*", "false"]
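A request carrying the values that survive on this slide (a search, a request format ending in /abcd/1.2, the pattern Ast*, and the flag false) might be assembled as in the minimal sketch below. The element and attribute names (`request`, `header`, `type`, `search`, `requestFormat`, `filter`, `like`, `path`, `count`) and the concept path are illustrative assumptions, not taken from the slides; the BioCASe protocol specification defines the real layout. The elided part of the format identifier is left elided.

```python
import xml.etree.ElementTree as ET


def build_search_request(request_format, concept_path, pattern, count=False):
    """Build a simplified, hypothetical search request as an XML string."""
    request = ET.Element("request")

    # Header: states which of the three operations is being requested.
    header = ET.SubElement(request, "header")
    ET.SubElement(header, "type").text = "search"

    # Body: content schema of the request plus a wildcard filter.
    search = ET.SubElement(request, "search")
    ET.SubElement(search, "requestFormat").text = request_format
    flt = ET.SubElement(search, "filter")
    like = ET.SubElement(flt, "like", {"path": concept_path})
    like.text = pattern

    # Assumed meaning: "false" asks for records rather than a hit count.
    ET.SubElement(search, "count").text = "true" if count else "false"

    return ET.tostring(request, encoding="unicode")


# The format identifier is elided in the slide and is kept as-is here;
# the concept path is a made-up placeholder.
xml_text = build_search_request(
    "[…]/abcd/1.2",
    "/hypothetical/path/to/ScientificName",
    "Ast*",
)
print(xml_text)
```

Keeping the content schema as a plain parameter mirrors the point made on the previous slide: the protocol part of the request stays the same whichever content schema is named in it.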
Data Flow, simplified
[Diagram: a request (?) travels from the User Interface through the Client (Servlet) and the UnitLoader (Java API) in the Portal, over the Internet via http as BioCASe Protocol Request XML, to the Unit Wrapper at the Data Provider; unit data returns (!) as Response XML via http. Other labels in the diagram: Metadata, JDBC, Config. Files, SQL.]
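The provider-side step in this data flow, where a wrapper uses configuration to turn a protocol-level filter into SQL against the local database, can be sketched roughly as follows. This is not the BioCASe Unit Wrapper's code: the mapping table, concept paths, table and column names, and the sample rows are all hypothetical, and an in-memory SQLite database stands in for the provider's database.

```python
import sqlite3

# Hypothetical configuration: schema concept path -> (table, column).
# In a real setup this mapping would come from the provider's config files.
CONCEPT_MAP = {
    "/Unit/UnitID": ("specimens", "catalog_no"),
    "/Unit/ScientificName": ("specimens", "sci_name"),
}


def like_filter_to_sql(concept_path, pattern):
    """Translate a wildcard filter on a concept into a parameterized query."""
    table, column = CONCEPT_MAP[concept_path]
    # Protocol-style '*' wildcard becomes the SQL LIKE wildcard '%'.
    sql_pattern = pattern.replace("*", "%")
    # Table and column come from trusted configuration; the user-supplied
    # pattern is passed as a bound parameter.
    sql = (
        f"SELECT catalog_no, sci_name FROM {table} "
        f"WHERE {column} LIKE ? ORDER BY catalog_no"
    )
    return sql, (sql_pattern,)


def search(conn, concept_path, pattern):
    """Run the translated query and return the matching rows."""
    sql, params = like_filter_to_sql(concept_path, pattern)
    return conn.execute(sql, params).fetchall()


# Made-up sample data for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE specimens (catalog_no TEXT, sci_name TEXT)")
conn.executemany(
    "INSERT INTO specimens VALUES (?, ?)",
    [
        ("B-001", "Aster alpinus"),
        ("B-002", "Bellis perennis"),
        ("B-003", "Astragalus danicus"),
    ],
)

rows = search(conn, "/Unit/ScientificName", "Ast*")
print(rows)
```

Because the mapping lives in configuration rather than in code, each provider can keep its own database layout and still answer the same protocol-level request.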
Access to Biological Collection Data (ABCD)
an XML Schema for:
– collection information
– unit data
– scientific names from identifications
– data on the collection as such
– data origin, IPR etc.
ABCD Structure – Overview
[Diagram: Datasets contains Dataset, Dataset.....; each Dataset holds Metadata, Units (observation or specimen records), and further elements (admin. & tech. contacts, other networks, expiry date)]
v.1.49
ABCD - Collection Metadata
Metadata
– Description
– IconURI
– Scope (geographical and taxonomic keywords)
– Version
– Owners
– IPRStatements
– RevisionData
– Creators
– Contributors
– DateCreated
– DateModified
v.1.49
ABCD Structure – Overview
[Diagram: Datasets contains Dataset, Dataset.....; each Dataset holds Metadata, Units (observation or specimen records), and further elements (admin. & tech. contacts, other networks, expiry date)]
v.1.49
ABCD - Unit Data
Units/Unit
– IDs, content contact, editor, IPR, ...
– Identifications
– Gathering event and site characteristics
– Specimen unit
– Observation unit
– Unit collection domain (domain-specific subtypes)
– Unit state domain (physical state-specific subtypes)
– References, digital images, associations, assemblages, measurements and facts, sequences, notes
– Unit extension
v.1.49
Unit collection domain (domain-specific subtypes)
Identifications
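The containment described on the ABCD slides (a dataset with metadata and units, each unit carrying identifications and gathering information) can be pictured as nested records. The sketch below is only a mental model: the class and field names are simplified, hypothetical stand-ins rather than ABCD element names, and the sample values are made up.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Identification:
    scientific_name: str


@dataclass
class Unit:
    unit_id: str
    kind: str  # "specimen" or "observation"
    identifications: List[Identification] = field(default_factory=list)
    gathering_site: str = ""


@dataclass
class Metadata:
    description: str
    version: str
    owners: List[str] = field(default_factory=list)


@dataclass
class Dataset:
    metadata: Metadata
    units: List[Unit] = field(default_factory=list)


def scientific_names(dataset):
    """Collect the distinct scientific names found in unit identifications."""
    names = {
        ident.scientific_name
        for unit in dataset.units
        for ident in unit.identifications
    }
    return sorted(names)


# Made-up example content.
dataset = Dataset(
    metadata=Metadata(description="Example herbarium", version="1", owners=["Example owner"]),
    units=[
        Unit("U1", "specimen", [Identification("Bellis perennis")], "Example site A"),
        Unit("U2", "observation", [Identification("Aster alpinus")]),
        Unit("U3", "specimen", [Identification("Aster alpinus")]),
    ],
)

names = scientific_names(dataset)
print(names)
```

Pulling names out of the identifications, rather than storing one name per unit, reflects the slide's phrase "scientific names from identifications": a unit may have been identified more than once.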
Integration into GBIF
– Software components (portal / provider) freely available
– GBIF portal understands the DiGIR and BioCASe protocols
– Darwin Core and ABCD records integrated into user interface
Thank you!
– BioCASE:
– GBIF-International:
– GBIF-D:
– ABCD: default.htm
Open Review Process – Comments Welcome!
Contact: