Data Fabric IG From Testing to Recommendations Beth Plale.

Slides:



Advertisements
Similar presentations
University of St Andrews School of Computer Science Experiences with a Private Cloud St Andrews Cloud Computing co-laboratory James W. Smith Ali Khajeh-Hosseini.
Advertisements

A Unified Approach to Combat Counterfeiting: Use of the Digital Object Architecture and ITU-T Recommendation X.1255 Robert E. Kahn President & CEO CNRI,
Architecture is More Than Just Meeting Requirements Ron Olaski SE510 Fall 2003.
The NSDL Registry Diane Hillmann  Jon Phipps. What We’re Doing Received an NSF grant in Oct. 2006, to: Register metadata schemas, vocabularies, application.
15th January, NGS for e-Social Science Stephen Pickles Technical Director, NGS Workshop on Missing e-Infrastructure Manchester, 15 th January, 2007.
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
RDA Wheat Data Interoperability Working Group Outcomes RDA Outputs P5 9 th March 2015, San Diego.
RDA Wheat Data Interoperability Working Group Outcomes RDA Outputs P5 9 th March 2015, San Diego.
Near East Rural & Agricultural Knowledge and Information Network - NERAKIN Food and Agriculture Organization of the United Nations Near East and North.
DATA FOUNDATION TERMINOLOGY WG 4 th Plenary Update THE PLUM GOALS This model together with the derived terminology can be used Across communities and stakeholders.
Kuali Rice at Indiana University Rice Setup Options July 29-30, 2008 Eric Westfall.
EGI-Engage EGI-Engage Engaging the EGI Community towards an Open Science Commons Project Overview 9/14/2015 EGI-Engage: a project.
Profiling Metadata Specifications David Massart, EUN Budapest, Hungary – Nov. 2, 2009.
RDA Data Foundation and Terminology (DFT) IG: Introduction Prepared for RDA 6 th Plenary Paris, Sept. 25, 2015 Gary Berg-Cross, Raphael Ritz Co-Chairs.
Data Fabric IG Introduction. 2  about 50 interviews & about 75 community interactions  Data Management and Processing is too time consuming and costly.
What is Cyberinfrastructure? Russ Hobby, Internet2 Clemson University CI Days 20 May 2008.
AUKEGGS Architecturally Significant Issues (that we need to solve)
Web: Minimal Metadata for Data Services Through DIALOGUE Neil Chue Hong AHM2007.
DRAFT CAMP Platform Component Canonical Types, etc. Copyright © 2012 OASIS Open.
Problems/Disc. Adoption of standards Should there be standards? (not a big problem – responsibility lies with data centre – onus not on scientist) (peer.
Adoption of RDA-DFT Terminology and Data Model to the Description and Structuring of Atmospheric Data Aaron Addison, Rudolf Husar, Cynthia Hudson-Vitale.
TWC Adoption of RDA DTR and PID in Deep Carbon Observatory Data Portal Stephan Zednik, Xiaogang Ma, John Erickson, Patrick West, Peter Fox, & DCO-Data.
Hydro DWG at the RDA Plenary: BoF and Aligning HDWG work with WMO expectations and timeline Sylvain, Tony, Silvano, Ilya.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No The pan-European.
Biodiversity Data Exchange Using PRAGMA Cloud Umashanthi Pavalanathan, Aimee Stewart, Reed Beaman, Shahir Shamsir C. J. Grady, Beth Plale Mount Kinabalu.
Connecting People With Information Transforming the Way the DoD Manages Data M. David Allen OASD(NII)/DoD CIO May 23, 2006 “The.
Hydro DWG at the RDA Plenary BoF - Improve sharing of water resource data globally 24 September BREAKOUT :30-15:00.
An adoption phase for RDA WGs?. Background WGs end after 18 months WGs (and some IGs) produce outputs, but adoption of these outputs often only takes.
Current Middleware Picture Tom Barton University of Chicago Tom Barton University of Chicago.
Why RDA? A domain repository perspective George Alter ICPSR University of Michigan.
RDA End to End RDA Global Tested, Hardened, Integrated Council TAB OAB Sec Tech Transfer Outreach Mtgs Publication Testing & Eval RDA Coord Groups Third.
Repository Registries Agenda 11.30Welcome & State of the Discussion Is it all one – is it all different? Peter & Herman and commenters 12.10Actions to.
Data Foundation IG DF Organizing Chairs: Gary Berg-Cross & Peter Wittenburg.
NFFA-EUROPE: Information and Data Management Repository Platform for nanoscience in Europe LOGO of your Pilot – organisation / initiative Stefano Cozzini.
Adoption of RDA-DFT Terminology and Data Model to the Description and Structuring of Atmospheric Data Aaron Addison, Rudolf Husar, Cynthia Hudson-Vitale.
RDA/US Adoption Seed Projects RDA/US is partnering with four groups as part of the MacArthur 2016 Adoption Seeds program Bringing visibility to food security.
1 The Metadata Groups - Keith G Jeffery. 2 Positioning  Raise profile of metadata  Data first  Also software, resources, users  Achieve outputs/outcomes.
ICSU-WDS & RDA Data Publication Services WG. 2 Linking Research Data and the Literature: why? Why link? 1.Increase visibility & discoverability of research.
Cyberinfrastructure Overview of Demos Townsville, AU 28 – 31 March 2006 CREON/GLEON.
European Life Sciences Infrastructure for Biological Information ELIXIR Cloud Roadmap Chairs: Steven Newhouse, EMBL-EBI & Mirek Ruda,
Preservation e-Infrastructure IG Description: help ensure preservation of needed data succeeds Goals: foster worldwide collaboration; ensure consistency.
Draft Data Foundation and Terminology (DFT) Vocabulary Development Process Prepared for WG-Core meeting 24/25.2 Munich/Garching Gary Berg-Cross Co-Chair.
Global Water Information Interest Group meeting RDA 7 th Plenary, 1 st March 2016, Tokyo Global Water Information Interest Group Welcome to the inaugural.
Bringing visibility to food security data results: harvests of PRAGMA and RDA Quan (Gabriel) Zhou, Venice Juanillas Ramil Mauleon, Jason Haga, Inna Kouper,
JOINT SESSION IG Domain Repositories, IG Agriculture Data, WG BioSharing Registry, IG Materials Data, WG Wheat Data RDA P6, Paris,
Evaluating Barriers to Output Adoption in the Digital Humanities Lindsay Poirier RDA Data Share Fellow, Co-Chair Empirical Humanities Metadata WG Plenary.
1 This slide indicated the continuous cycle of creating raw data or derived data based on collections of existing data. Identify components that could.
Intentions and Goals Comparison of core documents from DFIG and Publishing Workflow IG show that there is much overlap despite different starting points.
RDA 9th Plenary Breakout 3, 5 April :00-17:30
WG/IG Collaboration Meeting 6 Dec 12-13, NIST, Gaithersburg 'Assembling the Pieces: Connecting Outputs with Each Other and with Domain Adoption‘
RDA Data Fabric (DF) Interest Group Peter Wittenburg & Gary Berg-Cross
Power of PID kernel information
Materials Resource Registries Working Group Co-chairs: Laura M
The RPID Testbed Rob Quick Manager – High Throughput Computing
Xiaogang Ma, John Erickson, Patrick West, Stephan Zednik, Peter Fox,
Data Foundation and Terminology (DFT) Vocabulary Development Session
PID centric fabric constructed piece by piece
Agenda Welcome and overview (Peter)
C2CAMP (A Working Title)
Utilize Group Policy Terminal Server Settings
Verilog to Routing CAD Tool Optimization
Brief WG/IG reporting Tobias Weigel on behalf of co-chairs
Data types and persistent identifiers in
Agenda (AM) 9:30-10:15 Introduction to RDA
Bird of Feather Session
RDA uptake activities and plans: ESGF
WG PID Kernel Information RDA P11 Berlin – March 2018
Leveraging PIDs for object management in data infrastructures RDA UK Node Workshop, July Tobias Weigel (DKRZ)
1st Call for Collaboration Projects
Presentation transcript:

Data Fabric IG From Testing to Recommendations Beth Plale

Objectives of Session Inductive examination of fabric composition A particular fabric or composition grows one RDA recommendation at a time Linked to n=1, n=2,... n=m discussion Tuesdday Design strategy to expand core compositions Identify incentives for e-infrastructure providers to provision core compositions in experimentation mode Assess will of group to take on challenge and determine next steps

Dimensions of Testing: getting to Data Fabrics RDA produces RDA Recommendations (i.e., outputs) Some technically-oriented RDA Recommendations have reference software with it, these are clearly starting point for any data fabric Many technically oriented RDA recommendations do not have reference software, yet are machine actionable (a schema for instance). These also are of immediate interest to data fabric composition. Other recommendations play important but background roles in early composition RDA recommendation : Purely human consumption and action RDA recommend- ation: Reference software Machine actionable reference Human consumption needed before being actionable reference

Compositions of components Core Components & Services Specific Components & Services Composition (or Fabric) A Composition B

Compositions of components Composition (Fabric) B Given nature of data (can’t do much without understanding it), successful data fabric will likely: 1. run on possibly distributed e-infrastructure (EUDAT, NDS, …) 2. Serve scholarly domain as domain infrastructure 3. Support multiple projects within that domain 4. And eventually result in cross-domain research For 3 and 4 to be realized, CoCo of 2 must be sharable

Start with simple composition in single fabric (inductive approach) Set of RDA recomendations pulled into single configuration for a use RDA PIT WG Recommendation and RDA Data Type Registry Recommendation are starting point because both are among few current RDA ouputs that have reference software

Objectives of Session Inductive examination of fabric composition A particular fabric or composition grows one RDA recommendation at a time Linked to n=1, n=2,... n=m discussion Tuesdday Design strategy to expand core compositions Identify incentives for e-infrastructure providers to provision core compositions in experimentation mode Assess will of group to take on challenge and determine next steps

PID minimal metadata Let’s agree on types that are used to minimally define the metadata (attributes) associated with a PID

Objectives of Session Inductive examination of fabric composition A particular fabric or composition grows one RDA recommendation at a time Linked to n=1, n=2,... n=m discussion Tuesdday Design strategy to expand core compositions Start with minimal PID attributes Identify incentives for e-infrastructure providers to provision core compositions in experimentation mode Assess will of group to take on challenge and determine next steps

Bringing visibility to food security data results: harvests of PRAGMA and RDA Beth Plale, Indiana Univ, USA; Jason Haga, AIST, Japan Launch use of two RDA products in Asia by utilizing PRAGMA community and tools to work with new rice genome group in Philippines and implement software services at AIST (Japan) using outputs of PID Information Types and Data Type Registries Working Groups Software will be installed additionally at National Data Service in US to stimulate US adoption

PRAGMA Data Service PRAGMA/Rocks compute VMs Rice genome variant discovery Bringing visibility to food security data results: harvests of PRAGMA and RDA Beth Plale, Indiana University, USA; Jason Haga, AIST, Japan Future Uses Persistent ID Types (PIT) Data Type Registry (DTR)