WG/IG Collaboration Meeting 6 Dec 12-13, NIST, Gaithersburg 'Assembling the Pieces: Connecting Outputs with Each Other and with Domain Adoption‘

Slides:



Advertisements
Similar presentations
© Keith G Jeffery, Anne G S Asserson GL New Orleans Hyperactive Grey Objects Keith G Jeffery Director, IT & International Strategy CCLRC.
Advertisements

Provenance-Aware Storage Systems Margo Seltzer April 29, 2005.
1 e-Arts and Humanities Scoping an e-Science Agenda Sheila Anderson Arts and Humanities Data Service King’s College London.
Co-funded by the European Union under FP7-ICT Co-ordinated by aparsen.eu #APARSEN Welcome to the Conference !! Juan Bicarregui Chair, APA Executive.
Cyberinfrastructure Supporting Social Science Cyberinfrastructure Workshop October Chicago Geoffrey Fox
RDA Wheat Data Interoperability Working Group Outcomes RDA Outputs P5 9 th March 2015, San Diego.
MASDIR / MIG Metadata and Standards Directory Working Group / Metadata Interest Group Prof Keith G Jeffery
1 European policies for e- Infrastructures Belarus-Poland NREN cross-border link inauguration event Minsk, 9 November 2010 Jean-Luc Dorel European Commission.
Position Paper for Data Fabric IG Interoperability, Infrastructures and Virtuality Gary Berg-Cross, Keith.
Sharing Research Data Globally Alan Blatecky National Science Foundation Board on Research Data and Information.
METADATA WORKSHOP Conclusions Keith Jeffery Peter Wittenburg.
1 Metadata Coordinating Chairs Meeting Gaithersburg November Keith Jeffery, Rebecca Koskela, Jane Greenberg, Alex Ball, Brigitte Jörg, Bridget Almas,
Children’s Health Exposure Analysis Resource (CHEAR) CHEAR Center for Data Science Susan Teitelbaum, PhD November 4, 2015.
1 Interactions between the Marine Data Harmonization IG and Data Citation WG.
An adoption phase for RDA WGs?. Background WGs end after 18 months WGs (and some IGs) produce outputs, but adoption of these outputs often only takes.
1 e-Arts and Humanities Scoping an e-Science Agenda Sheila Anderson Arts and Humanities Data Service Arts and Humanities e-Science Support Centre King’s.
Why RDA? A domain repository perspective George Alter ICPSR University of Michigan.
RDA End to End RDA Global Tested, Hardened, Integrated Council TAB OAB Sec Tech Transfer Outreach Mtgs Publication Testing & Eval RDA Coord Groups Third.
1 The Metadata Groups - Keith G Jeffery. 2 Positioning  Raise profile of metadata  Data first  Also software, resources, users  Achieve outputs/outcomes.
Data Foundation IG DF Organizing Chairs: Gary Berg-Cross & Peter Wittenburg.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI strategy and Grand Vision Ludek Matyska EGI Council Chair EGI InSPIRE.
1 The Metadata Groups - Keith G Jeffery. 2 Positioning  Raise profile of metadata  Data first  Also software, resources, users  Achieve outputs/outcomes.
Data Fabric IG From Testing to Recommendations Beth Plale.
EGI-InSPIRE RI EGI Compute and Data Services for Open Access in H2020 Tiziana Ferrari Technical Director, EGI.eu
Name - Date Technology-enhanced Learning: tomorrow’s school and beyond Pat Manson Head of Unit Technology Enhanced Learning Directorate General.
School on Grid & Cloud Computing International Collaboration for Data Preservation and Long Term Analysis in High Energy Physics.
EGI-InSPIRE RI An Introduction to European Grid Infrastructure (EGI) March An Introduction to the European Grid Infrastructure.
Bringing visibility to food security data results: harvests of PRAGMA and RDA Quan (Gabriel) Zhou, Venice Juanillas Ramil Mauleon, Jason Haga, Inna Kouper,
RDA Plenary 5 Big Data (Analytics) IG Session
Overview of WGs, IGs and BoFs
FIM4R and FIM IG coordination C.M. Zwölf, H. Short
RDA US Science workshop Arlington VA, Aug 2014 Cees de Laat with many slides from Ed Seidel/Rob Pennington.
RDA Data Fabric (DF) Interest Group Peter Wittenburg & Gary Berg-Cross
EOSC MODEL Pasquale Pagano CNR - ISTI
Paolo Budroni, University of Vienna
Jarek Nabrzyski Director, Center for Research Computing
Research Data Alliance - Research Data Sharing without barriers Terena Networking Conference 22 May 2014.
Data Ingestion in ENES and collaboration with RDA
Steven Newhouse EGI-InSPIRE Project Director, EGI.eu
Recap: introduction to e-science
BoF: VREs- Keith G Jeffery & Helen Glaves
Data Discovery Paradigms Interest Group Report on Activities and Outputs Anita de Waard, Siri Jodha Singh Khalsa Fotis Psomopoulis Mingfang Wu.
C2CAMP (A Working Title)
Connecting the European Grid Infrastructure to Research Communities
Overview of System Engineering
EOSCpilot Skills Landscape & Framework
Chapter 1 Database Systems
EGI Webinar - Introduction -
EOSCpilot All Hands Meeting 9 March 2018, Pisa
WG/IG Collaboration Meeting June Göteborg METADATA GROUPS PERSPECTIVE Keith G Jeffery & Rebecca Koskela.
From Observational Data to Information (OD2I IG )
Promoting Desktop Grids Virtual Team (Q1/2014 updates)
VRE – IG charter Helen Glaves
Archives and Records Professionals for Research Data IG
Core Data Resources and FAIRification of Data
Repository Platforms for Research Data Interest Group: Requirements, Gaps, Capabilities, and Progress Robert R. Downs1, 1 NASA.
Common Solutions to Common Problems
A Funders Perspective Maria Uhle Co-Chair, Belmont Forum Directorates for Geosciences, US National Science Foundation.
Break out group coordinator:
Integrating social science data in Europe
Brian Matthews STFC EOSCpilot Brian Matthews STFC
Geospatial IG meeting P8 Denver
The ENVRIplus approach of cooperation
Metadata Development in the Earth System Curator
Bird of Feather Session
VRE-IG European View: Sustainability Keith G Jeffery
Helena Cousijn, Claire Austin, Jonathan Petters & Michael Diepenbroek
Co-Chairs: Keith Jeffery, Rebecca Koskela, Alex Ball
Expand portfolio of EGI services
Logical Architecture & UML Package Diagrams
Presentation transcript:

WG/IG Collaboration Meeting 6 Dec 12-13, NIST, Gaithersburg 'Assembling the Pieces: Connecting Outputs with Each Other and with Domain Adoption‘

e-Research Infrastructures: A Focus for RDA?- Keith G Jeffery Capitalized the “A” before Focus

RDA Principle / Ethos Let 100 flowers blossom (Mao Zedong 1957) (usually misquoted as let 1000 flowers bloom!) Great for groups generating ideas Is it good for products … and products we wish to be ‘joined-up? … and adopted within and across domains?

Already Moving Away from the Principle Top-down views Data Fabric Some clustering TAB graphic diagrams clustering groups (Beth Plale) Grouping of ‘flowers’ into ‘bouquets’ Repositories groups Metadata groups …….. Should we do this more consistently? And if so on what basis?

e-Research Infrastructures Although individual researchers still exist Much research is done in teams using e-RIs Across all disciplines

e-Research Infrastructures Although individual researchers still exist Much research is done in teams using e-RIs Across all disciplines Do the e-RI communities Provide the requirements?

e-Research Infrastructures Assets from research using e-RIs commonly made available openly for re-use with curation and provenance e-RIs increasingly connecting together (by domain) e.g. environment, social science, humanities, materials science Exactly in line with RDA objectives of making data available within and across domains

e-RI Dimensions Topology: centralised (CERN) vs distributed (EPOS) Domains: particle physics to arts & humanities Utilisation of e-Is (grid and cloud computing, supercomputing, network, detectors) Which condition the requirements of the communities

e-RIs: Requirements for RDA activities/products? Example for discussion 1. Common requirements across all e-RIs (and hence their users) Support for interoperation Support for provenance and curation Support for workflow construction (even simple query) Support for deployment to e-Is Support for citation 2. Requirements of specific (groups of?) e-RIs Support for instrumentation/detector control and data stream validation Support for particular analytics / simulation / visualisation

Matrix of Requirements by Domains Particle Physics Arts & Humanities Support for interoperation Support for workflow construction (even simple query) Support for deployment to e-Is Support for provenance and curation Support for citation Support for instrumentation/detector control and data stream validation Support for particular analytics / simulation / visualisation

Matrix of Requirements by Domains Particle Physics Arts & Humanities Support for interoperation Support for workflow construction (even simple query) Support for deployment to e-Is Support for provenance and curation Support for citation Support for instrumentation/detector control and data stream validation Support for particular analytics / simulation / visualisation which domains require which products

Let us just unpack one of these as an example: Provenance Need to understand the relationship between datasets In temporal dimension Versions In intent dimension Purpose (why a derived dataset?) In process dimension Commands/script (reflecting intent?) Software involved Operating environment involved Note relationship to curation Involves: PIDs Metadata Repositories Data fabric/workflow Data fabric/deployment (and more)

Leads to: Weighted by domain priorities Weighted by ‘size’ of domain Noting particularly groups that are represented maximally in both requirement for product and domains Which RDA groups are concerned with : Common requirements across all e-RIs Support for interoperation Support for workflow construction (even simple query) Support for deployment to e-Is Support for provenance and curation Support for citation Requirements of specific or groups of e-RIs Support for instrumentation/detector control and data stream validation Support for particular analytics / simulation / visualisation

Cluster RDA Groups Base on the requirements /domains matrix Reduces management and coordination load Encourages joint thinking and concentration of expertise Provides centres of excellence related to: e-RIs (especially those with shared concerns working together) Domains A stronger basis for products that are (a) joined-up; (b) adopted by the communities

Move progressively to… IG for each cluster Long-lived, strategic, steering, foresight WGs ‘spun out’ for specific pieces of work of short (18 month-24 month) duration yielding products Of general (all domains) use Specific to particular domains Some WGs could be ‘owned’ by >1 IG

This suggestion should produce: Prioritised products Developed by WGs Supported longer-term by IGs Developers in new rôle Based on strength of requirements from communities Encourages adoption A transparent and reproducible decision-making process More concentration of expertise Joined-up solutions Best solutions/products Sustainability

?