The RESEARCH DATA ALLIANCE GEO BON Workgroup 8 WG: Brokering Governance Wim Hugo – ICSU-WDS/ SAEON / GEO BON.

Slides:



Advertisements
Similar presentations
AR – Issues for Attention Tactical and Strategic Guidance documents – what is the agreed approval/ publication process? –Strategic Guidance will.
Advertisements

Standards and Interoperability Forum: Status, Issues & Plans GEO Architecture and Data Committee Kyoto, Japan 9 February 2009 Siri-Jodha Singh Khalsa.
The Dryad Data Repository Ryan Scherle 1, Hilmar Lapp 1, Amol Bapat 2, Sarah Carrier 2, Jane Greenberg 2, Peggy Schaeffer 1, Todd Vision 1,3, Hollie White.
Objectives to improve citizens awareness and comfort industrial competitiveness efficiency of public administrations by enhancing and supporting the use,
INTERNATIONAL INSTITUTE FOR GEO-INFORMATION SCIENCE AND EARTH OBSERVATION Towards quality-aware Infrastructures for Geographic Information Services Richard.
Interoperability Principles in the Global Earth Observations System of Systems (GEOSS) Presented 13 March 2006 at eGY in Boulder, CO by: Eliot Christian,
Architecture and Data Management Strategy (Action Plan) Ivan 1 DeLoatch, USGS, ADC Co-chair Alessandro Annoni, EC, ADC Co-chair Jay Pearlman, IEEE, ADC.
Data Sources & Using VIVO Data Visualizing Scholarship VIVO provides network analysis and visualization tools to maximize the benefits afforded by the.
RDA Wheat Data Interoperability Working Group Outcomes RDA Outputs P5 9 th March 2015, San Diego.
RDA Wheat Data Interoperability Working Group Outcomes RDA Outputs P5 9 th March 2015, San Diego.
TDWG Annual Conference 2013, Florence Hannu Saarenmaa University of Eastern Finland Integrating observation and survey data for production of the Essential.
GEO Work Plan Symposium 2012 ID-05 Resource Mobilization for Capacity Building (individual, institutional & infrastructure)
Providing Access to Your Data: Access Mechanisms Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
GEOCAB portal 18 September 2014 IDIB meeting, Enschede Gregory Giuliani University of Geneva UNEP/GRID-Geneva Jean-Christophe Desconnets IRD.
1 Guidelines For The Future Sharing Best Practice For National Bibliographies In The Digital Era Neil Wilson Information Coordinator IFLA Bibliography.
Prepared for the 3rd SBB telecon 20 Mar 2012 Michele Walters, BI-01 task coordinator.
Save time. Reduce costs. Find and reuse interoperability solutions on Joinup for developing European public services Nikolaos Loutas
U.S. Department of the Interior U.S. Geological Survey CDI Webinar Sept. 5, 2012 Kevin T. Gallagher and Linda C. Gundersen September 5, 2012 CDI Science.
Providing Access to Your Data: Access Mechanisms Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
ENV proposal meeting, Geneva, Sep. 24, GCI Presentation Joost van Bemmelen, ESA
Linking Tasks, Data, and Architecture Doug Nebert AR-09-01A May 2010.
W HAT IS I NTEROPERABILITY ? ( AND HOW DO WE MEASURE IT ?) INSPIRE Conference 2011 Edinburgh, UK.
Global Biodiversity Information Facility GLOBAL BIODIVERSITY INFORMATION FACILITY Meredith A. Lane CODATA/ERPANET Workshop: Scientific Data Selection &
IODE Ocean Data Portal - ODP  The objective of the IODE Ocean Data Portal (ODP) is to facilitate and promote the exchange and dissemination of marine.
SHARE (SHared Access Research Ecosystem) Tyler Walters Co-Chair, SHARE Steering Group (a joint committee of the ARL, the AAU, and the APLU) Eric Celeste.
® GEOSS AIP 5 Water SBA Update HDWG June 2012 Matt Austin NOAA Stefan Fuest KISTERS Jochen Schmidt NIWA.
1 Using the GEOSS Common Infrastructure in the Air Quality & Health SBA: Wildfire & Smoke Assessment Prepared by the GEOSS AIP-2 Air Quality & Health Working.
Using Open Data to Create Value for Citizens. Data.gov Provides instant access to ~400,000 datasets in easy to use formats Contributions from UN, World.
Gary GELLER NASA Ecological Forecasting Program Jet Propulsion Laboratory California Institute of Technology NASA Biodiversity Meeting Alexandria, VA 6.
Hydro DWG at the RDA Plenary BoF - Improve sharing of water resource data globally 24 September BREAKOUT :30-15:00.
Report of the Architecture and Data Committee (ADC) R.Shibasaki (ADC, Japan)
11-12 June 2015, Bari-Italy Coordinating an Observation Network of Networks Encompassing satellite and In-situ to fill the Gaps in European Observations.
ISWG / SIF / GEOSS OOSSIW - November, 2008 GEOSS “Interoperability” Steven F. Browdy (ISWG, SIF, SCC)
ISWG / SIF / GEOSS OOS - August, 2008 GEOSS Interoperability Steven F. Browdy (ISWG, SIF, SCC)
Ideas on Opening Up GEOSS Architecture and Extending AIP-5 Wim Hugo SAEON.
12 th Meeting of the GBIF Participant Nodes Committee 6-7 October 2013, Berlin, Germany Data mobilization and use for international policy Olaf Bánki Senior.
The RESEARCH DATA ALLIANCE WG: Brokering Governance Wim Hugo – ICSU-WDS/ SAEON.
International Planetary Data Alliance Registry Project Update September 16, 2011.
SAEOS G Technical Background 4 November, 2009 Wim Hugo, SAEON.
Wim Hugo SAEON/ SAEOS SAEOS AND C ROSS -D OMAIN I NTEROPERABILITY : P RACTICAL A PPLICATIONS OF STANDARDS June 2012.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
Robin GOFFAUX Fondation pour la Recherche sur la Biodiversité ECOSCOPE Metadata.
Data Sources & Using VIVO Data Visualizing Science VIVO provides network analysis and visualization tools to maximize the benefits afforded by the data.
CrossCutting topic: Data Quality and European Network of EO Networks
Enhancements to Galaxy for delivering on NIH Commons
Common interoperability, best practices and strategic approach
Paul Eglitis [IEEE] and Siri Jodha S. Khalsa [IEEE]
South African Research Data Infrastructure
RDA US Science workshop Arlington VA, Aug 2014 Cees de Laat with many slides from Ed Seidel/Rob Pennington.
Metadata Catalogue and Knowledge Network
Responsible Citizenship of the World of Science
Themes in Geosciences.
GEO WP 1. INFRASTRUCTURE (Architecture and Data Management)
Trustworthiness of Preservation Systems
Brokering Agreement process Stefano Nativi and Mattia Santoro ESSI-lab of CNR-IIA San Petersburg (Russia), 07 Nov 2016.
Flanders Marine Institute (VLIZ)
knowledge organization for a food secure world
Scotland’s Environment Web Environmental Data Portal Joanna Muse Scottish Environment Protection Agency.
High-Level Overview SAEON involvement in South African and Global Research Data Infrastructure Asiphe Sahula, Wim Hugo-SAEON AfriGEOSS Symposium 28.
GEOSS Air Quality Community Infrastructure
EC FP7 - Cooperation Theme 6: Environment (incl. climate change)
Citizen Science’s contribution to GEO BON
EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal
Prepared by: Jennifer Saleem Arrigo, Program Manager
Session 2: Metadata and Catalogues
LOD reference architecture
A Case Study for Synergistically Implementing the Management of Open Data Robert R. Downs NASA Socioeconomic Data and Applications.
Bird of Feather Session
School of Information Studies, Syracuse University, Syracuse, NY, USA
Presentation transcript:

The RESEARCH DATA ALLIANCE GEO BON Workgroup 8 WG: Brokering Governance Wim Hugo – ICSU-WDS/ SAEON / GEO BON

The Group on Earth Observations Biodiversity Observation Network – GEO BON – coordinates activities relating to the Societal Benefit Area (SBA) on Biodiversity of the Global Earth Observation System of Systems (GEOSS). Some 100 governmental, inter-governmental and non-governmental organizations are collaborating through GEO BON to organize and improve terrestrial, freshwater and marine biodiversity observations globally and make their biodiversity data, information and forecasts more readily accessible to policymakers, managers, experts and other users. Moreover, GEO BON has been recognized by the Parties to the Convention on Biological Diversity. GEO BON

GEO BON has a Manifesto … It is possible, desirable, and in the public interest to:  Ensure that scientific data is described properly, preserved properly, and discoverable;  Once discovered, its utility, quality, and scope can be understood, even if the data sets are large;  Once understood; it can be accessed freely and openly;  Once accessed, it can be included into distributed processes, preferably automatically, and on large scales;  Once processed, the knowledge gathered can be re-used. … across multiple domains and dissemination channels.

Typical EBV Details

EBV ClassEBV Genetic CompositionAllelic Diversity for Selected Species Breed and Variety Diversity Species Populations and RangesAbundances for a selected set of species Distributions for a representative set of species Species traitsPhenology of selected functional groups Body Mass for Selected Species Community Composition and InteractionOverall taxonomic diversity for selected locations Species interactions Ecosystem Extent and Structure Ecosystem extent and fragmentation for a range of ecosystems Ecosystem structure Ecosystem function and processesNet primary productivity Nutrient retention doi=

Example: Simplified Objective

Generic Use Case

Main Components Data www Discovery Meta-Data www “Publish” “Find” “Bind” Visualise Process Assess Mediator/ Broker Analysis

Generic Dimensions of Data  Spatial Coverage  XYZ  Temporal Coverage: T  Topic or Semantic/ Ontological Coverage  P: Phenomenon  mostly physical, chemical, or other contextual data  B: Biological  Tx: Species and Taxonomy (with some extensions)  Al: Allele/ Genome/ Phylogenetic  Each unique combination of these, supported by a vocabularies/ ontology is a generic data family Continuous or Near-Continuous: Uppercase Discrete or dispersed: Lowercase

Some Generic Data Standards and Interoperability Requirements XYZ, t, P XY, t, P XYZ, t, P/ B NetCDF S-DB O&M MetaCat NetCDF WxS SOS CSV XYZ, t, P/ B Multi-dimensional Traditional Spatial Signals Ecosystem GBIF Index DwC XYZ, T, Tx Occurrence GenBank FTP/ ASN.1 XYZ, T, Al Genome

Status: Working Demonstrator  Extending functionality as and when we have opportunity within existing projects. No dedicated funding.  SAEON is building a loosely coupled open prototype  EU BON is building a closely coupled operational system  Supported by ongoing efforts in GBIF, DataOne, and other stakeholders

 Updates to GEO BON Handbook WIKI pages on standards, software, and best practices  Identify/ Develop Content Standards and Vocabularies for EBVs and Data Families  Including name services for  Taxonomy  Traits  Location  Time  Habitats  Species Interaction  … Areas of Collaboration: GEO-BON Workgroup 8

For Each Data Family…

Typical Guidance For Each EBV …

?

… described properly, preserved properly, and discoverable  Meta-data standards implied.  Harvesters, brokers, and meta-data interoperability implied.  Persistent identifiers implied.  Protocols and standards for data exchange/ uploads implied.  Preservation standards and formats implied.  Tools and approaches to make searches more efficient (vocabularies, ontologies, dealing with massive meta-data collections, …).  Sustainable, accredited data centers and long-term archives are implied – depositor SLA and contract. How long is the ‘Long Term’? Who funds this? Distributed or Centralised Infrastructure?

… its utility, quality, and scope can be understood … Implies:  Visualisations, Collations, Data Exploration Tools,  Utility metrics (‘Like’..),  feedback on quality, quality metrics and standards,  viewing search results in relation to reference spatial, temporal, and ontological/ taxonomic coverages,  ability to dynamically extract 'thumbnail' views of large datasets, … ‘Big’ Data: Different protocol – not HTTP but maybe RPC?

… accessed freely and openly … Implies: Standardised services, licenses and policies, Standardised, generic conditions and exceptions to free and open access, Simplified, effective distribution channels, even if costs are involved, … Equal opportunity to discover and apply.

… included into distributed processes … Implies: Persistence of mash-ups, derived works, and mediations, Web context documents, Web processing services, Standards and guidelines for grid computing, Ability to construct decision support models, indicators, and standardized, interoperable final products, … What moves? Data, Processes, or Both? Concept of a ‘Distributed Indicator Standard’

… due recognition is afforded to the creators … Implies:  Data publication and citation,  Data and service citation indices,  Linking to scholarly articles, …

… the knowledge gathered can be re-used … Implies:  Defining and storing templates and examples of finished work, processes, mash-ups, …  Liberalising Meta-Data and building formal knowledge networks, … ICSU-WDS Working Group on Knowledge Networks (seeking a home in an RDA Collaboration) Collaboration with RDA on Trusted Digital Repositories

… against a backdrop of …  The push to extend formal meta-data with linked open data;  The increased availability of crowd-sourced and citizen contributions;  A proliferation of devices and sensors;  And the construction of knowledge networks.

Building Infrastructure  Is NOT a Research Task – it is an Engineering Task  We can realise large parts of the GEO BON infrastructure already  Issues are not so much technological as institutional  Our first principle should be to engage and amend existing infrastructure components  Infrastructure cannot be funded through projects or through voluntary contributions.

GEOSS Broker Data Source: SOS Data Source: WxS Data Source: MetaCAT Data Source: NetCDF MetaData: SOS MetaData: WxS MetaData: MetaCAT MetaData: NetCDF Other Sources CS/W Endpoint Search Component Shared Platform – Meta-Data Repository GEOSS Meta-Data Resources Variety of Standards and Protocols Map ComponentChart Component Web Context Document Indicator Component

Some Generic Data Standards and Interoperability Requirements XYZ, t, P XY, t, P XYZ, t, P/ B NetCDF S-DB O&M MetaCat NetCDF WxS SOS CSV XYZ, t, P/ B Multi-dimensional Traditional Spatial Signals Ecosystem GBIF Index DwC XYZ, T, Tx Occurrence GenBank FTP/ ASN.1 XYZ, T, Al Genome

Use Case to be achieved User discovers a standardised data source Online Resource(s) forwarded to Broker Broker sends request for mediation to Registry Registry sends a list of compliant ‘Mediations’ Broker confirms user choice Render/ Preview/ Download/ Model Persist as a Web Context Document User saves Choice(s) User saves Choice(s) Mediation Saved? Do for more than one discovery action No Yes