Scaling the Open Science Framework: National Data Service Dashboard, Cloud Storage Add-ons, and Sharing Science Data on the Decentralized Web Natalie K.

Slides:



Advertisements
Similar presentations
GRADD: Scientific Workflows. Scientific Workflow E. Science laboris Workflows are the new rock and roll of eScience Machinery for coordinating the execution.
Advertisements

Introduction to Research Data Management Services, January 2013 Library Data Services Functions and activities.
DSpace: the MIT Libraries Institutional Repository MacKenzie Smith, MIT EDUCAUSE 2003, November 5 th Copyright MacKenzie Smith, This work is the.
Providing access to your data: Determining your audience Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
Archiving research data in the cloud or in a local repository Michele Kimpton, CEO DuraSpace CNI Dec 2014.
Improving Integrity, Transparency, and Reproducibility Through Connection of the Scholarly Workflow Andrew Sallans Partnerships Lead Center for Open Science.
Using Sakai to Support eScience Sakai Conference June 12-14, 2007 Sayeed Choudhury Tim DiLauro, Jim Martino, Elliot Metsger, Mark Patton and David Reynolds.
Designing Online Communities: If We Build it, Will They Come? Yvonne Clark Instructional Designer Penn State University.
Hydra Partners Meeting March 2012 Bill Branan DuraCloud Technical Lead.
Supporting education and research E-learning tools, standards and systems Sarah Porter Head of Development, JISC.
Sara Bowman Center for Open Science Open Science Framework: Facilitating Transparency and Reproducibility.
Software Cluster Improve Collaboration and Community Engagement Work with diverse communities that contribute to the sustainability of scientific software.
New Value from the DSpace Foundation and Fedora Commons Michele Kimpton and Sandy Payette Executive Directors DuraSpace.
Hydra: future development A Hydra roadmap… Hydra Europe Symposium – Dublin – 7/8 April 2014 Richard Green.
The Department of Energy’s Public Access Solution Giving Voice to Energy and Science R&D Results Jeffrey Salmon Deputy Director for Resource Management.
Flexibility and user-friendliness of grid portals: the PROGRESS approach Michal Kosiedowski
One Body, Many Heads for Repository-Powered Digital Content Applications Hydra Europe Symposium, Trinity College, Dublin, 7 th April 2014 Chris Awre Head.
Making Connections: SHARE and the Open Science Framework Jeffrey Open Repositories 2015.
Exploring ‘Workspaces’ Tom Visser, SARA compute and networking services, Amsterdam Garching Workshop 21 st September 2010.
G ET A HEAD ON Y OUR R EPOSITORY Tom Cramer Chief Technology Strategist Stanford University Libraries.
System Development & Operations NSF DataNet site visit to MIT February 8, /8/20101NSF Site Visit to MIT DataSpace DataSpace.
Providing access to your data: Determining your audience Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
Open Science Framework Jeffrey Spies University of Virginia.
Brian Nosek University of Virginia -- Center for Open Science -- Improving Openness.
Practical Steps for Increasing Openness and Reproducibility Courtney Soderberg Statistical and Methodological Consultant Center for Open Science.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No EUDAT Aalto Data.
Webinar on increasing openness and reproducibility April Clyburne-Sherin Reproducible Research Evangelist
Fedora Commons Overview and Background Sandy Payette, Executive Director UK Fedora Training London January 22-23, 2009.
Practical Steps for Increasing Openness and Reproducibility Courtney Soderberg Statistical and Methodological Consultant Center for Open Science.
David Mellor, PhD Project Manager at Improving Openness and Reproducibility of Scientific Research.
Breakout Groups Goal Format Demo Pitch. Overview Monday – 3-6p Breakouts Tuesday – 9-12p Pitches (10 min, 10 discussion) – 2-6p breakouts Wednesday –
Sara Bowman Center for Open Science | Promoting, Supporting, and Incentivizing Openness in Scientific Research.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No Support to scientific.
EGI-InSPIRE RI EGI Compute and Data Services for Open Access in H2020 Tiziana Ferrari Technical Director, EGI.eu
Brian Nosek University of Virginia -- Center for Open Science -- Improving Openness.
Open Science Framework Jeffrey Center for Open Science | University of Virginia.
Sara Bowman Center for Open Science | Promoting, Supporting, and Incentivizing Openness in Scientific Research.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No Aalto Data Repository.
Robert R. Downs1and Robert S. Chen2
David Mellor Building infrastructure to connect, preserve, speed up, and improve scholarship David Mellor
What is Open Science and How do I do it?
Dataverse Integration with Open Science Framework (OSF)
Tokamak data mirror for JET and MAST Moving towards an open data repository for European nuclear fusion research.
Scholarly Workflow: Federal Prototype and Preprints
Shifting the research culture toward openness and reproducibility
Center for Open Science: Practical Steps for Increasing Openness
Jarek Nabrzyski Director, Center for Research Computing
Lorne Campbell University of Western Ontario
Connection of the scholarly work flow with the open science framework
Hydra, research data and Archivematica
A different kind of Carpentry
An Overview of Data-PASS Shared Catalog
Open Science Framework
Achieving Open Science
Data Sharing Now and in the Future
Transparency increases the credibility and relevance of research
A Framework for Managing and Sharing Research Workflow
Richard Green (for Chris Awre) Open Repositories Conference, Dublin

VI-SEEM Data Repository
DATA SPHINX & EUDAT Collaboration
Hydra: a case study Chris Awre
NFFA Europe.
An ecosystem of contributions
An EUDAT-based FAIR Data Approach for Data Interoperability
Repository Platforms for Research Data Interest Group: Requirements, Gaps, Capabilities, and Progress Robert R. Downs1, 1 NASA.
Social media for global scientific community – Mendeley project
Storing and Accessing G-OnRamp’s Assembly Hubs outside of Galaxy
Jisc Research Data Shared Service (RDSS)
Dataverse for citing and sharing research data
Presentation transcript:

Scaling the Open Science Framework: National Data Service Dashboard, Cloud Storage Add-ons, and Sharing Science Data on the Decentralized Web Natalie K. Meyers Center for Open Science http://cos.io/ | http://osf.io

Improving Openness and Reproducibility of Scientific Research Mission Improving Openness and Reproducibility of Scientific Research

Technology to enable change Training to enact change Incentives to embrace change Improving scientific ecosystem

Infrastructure Metascience Community

Training to enact change Once infrastructure is in place, we need to show researchers how to use it to improve their practices.

Partner with others on training --- librarians are great partners in this ---- to teach researchers skills in how to deal with basic data management and how to improve their research workflows for personal and sharing purposes. Software Carpentry and Data Carpentry are other great examples of efforts in this area, and partnerships with those in libraries --- we’ve done some work with them(SHARE Associates) and are exploring ways to do more. Free training on how to make research more reproducible http://cos.io/stats_consulting

Community we recognize that stewardship of high-value data should not exist solely with one organization or institution and its systems. We are highly interested in encouraging and creating a distribution of trust.

Incentives to embrace change Supporting these behavioral changes requires improving the full scientific ecosystem. At a conference like IWSG 2016, there are many people in the room contributing important parts to this ecosystem. I hope you leave this talk seeing the potential for how we might be able to work together on connecting tools to provide for better transparency and reproducibility in the workflow.

752 Journals 63 Organizations http://cos.io/top

Transparency & Openness Promotion Guidelines Eight Standards Data citation Design transparency Research materials transparency Data transparency Analytic methods (code) transparency Preregistration of studies Preregistration of analysis plans Replication Three Tiers Disclose Require Verify Signatories 752 Journals 63 Organization Learn more at http://cos.io/top TOP Matrix

Metascience

A reader quick, keen, and leery Did wonder, ponder, and query When results clean and tight Fit predictions just right If the data preceded the theory Anonymous, quoted from Kerr (1998)

https://osf.io/e81xl/ Cancer Biology https://osf.io/ezcuj/ Psychology

Infrastructure The free, open source Open Science Framework (OSF; http://osf.io) stores and connects content from across the research workflow (e.g., materials, code, datasets, and publications).

Open Science Framework http://osf.io free, open source

There’s more to it than sharing of discrete objects There’s more to it than sharing of discrete objects. Think about using this as an opportunity to increase transparency by capturing the entire workflow, and to do so while connecting tools and services that make up the parts of the workflow, not requiring people to change all of their practices at once, and providing immediate efficiencies and value to the researcher AS they comply with requirements.

There’s more to it than sharing of discrete objects There’s more to it than sharing of discrete objects. Think about using this as an opportunity to increase transparency by capturing the entire workflow, and to do so while connecting tools and services that make up the parts of the workflow, not requiring people to change all of their practices at once, and providing immediate efficiencies and value to the researcher AS they comply with requirements. OpenSesame

There’s more to it than sharing of discrete objects There’s more to it than sharing of discrete objects. Think about using this as an opportunity to increase transparency by capturing the entire workflow, and to do so while connecting tools and services that make up the parts of the workflow, not requiring people to change all of their practices at once, and providing immediate efficiencies and value to the researcher AS they comply with requirements. OpenSesame

CurateND Institutional Repository OSF Integration Joint effort w/Johns Hopkins and University of Notre Dame For more information see: For more info see: Archiving Research Data into Hydra through the Open Science Framework (OSF) Given at Hydra Connect 2016 Oct 6 2016 Available: https://wiki.duraspace.org/display/hydra/Hydra+Connect+2016 Presentation - Rick Johnson, Don Brower, Sayeed Choudry, Elliot Metsger Audience: All Archiving Research Data into Hydra through the Open Science Framework (OSF) - A look at initial work of Notre Dame and Johns Hopkins to archive research projects from the OSF into Fedora and Hydra repositories, and first implementation of a Fedora Research Object Model. This plugs into a service offering of the Center for for Open Science, OSF for Institutions (OSF4I). ND/JHU version will be initial support for OSF Fedora Archiving Add-on in the OSF as part of OSF4I offering. We hope to start discussion around next steps for other Hydra institutions to use this along with OSF4I to allow them support to archive research data from the OSF into their own Hydra/Fedora repository. Slides: (Google) (PDF) https://goo.gl/WNZpjZ https://wiki.duraspace.org/download/attachments/77445482/JHU-ND%20HydraConnect%202016%20slides%20%281%29.pdf?version=1&modificationDate=1475681469806&api=v2 Contact: Rick Johnson rjohns14@nd.edu

NDS OSF Dashboard integration Contact: Ian Taylor ian.j.taylor@gmail.com Describe the past years’ efforts on dashboard bitbucket.org/nds-org/nds-dashboard http://www.nationaldataservice.org/ http://ndspilot.com

A vision for distributing the stewardship of High Value Scientific data Blockchain DBs for use cases like smart badges Connect to distributed filesystems to encourage better stewardship of scientific data Prototyped  use cases with Tahoe-LAFS and CephFS The OSF’s modular technical stack and the abstraction layers in place to integrate external services, we are well positioned to adopt decentralized systems. For example, the blockchain could be easily connected to the OSF for use cases like smart Badges   AND storage add-ons could be developed to connect to distributed filesystems for stewardship of scientific data.  

An inclusive approach to sharing & archiving BitTorrent or like protocol could allow the OSF, scientists and colleagues to transfer and host torrent files (or packages of torrents). A modified BitTorrent Tracker & client could enable users to donate storage to the system Content Addressable storage could be developed on top of such a system We have prototyped an inclusive approach to archiving where any person, organization, or institution could contribute to scientific data stewardship by storing and hosting some percentage of OSF data via BitTorrent.

We want to hear about implementation and collaboration opportunities natalie@cos.io Slides: https://osf.io/kqddj DOI 10.17605/OSF.IO/KQDDJ