Improving Integrity, Transparency, and Reproducibility Through Connection of the Scholarly Workflow. Andrew Sallans, Partnerships Lead, Center for Open Science
A talk on the Open Science Framework: 1. Free, open source platform; 2. Designed to add efficiency to workflow; 3. Connector to other tools and services
So, why is this important?
Challenges: Perceived norms. Norms vs. counternorms: Communality (open sharing) vs. Secrecy (closed); Universalism (evaluate research on own merit) vs. Particularism (evaluate research by reputation); Disinterestedness (motivated by knowledge and discovery) vs. Self-interestedness (treat science as a competition); Organized skepticism (consider all new evidence, even against one’s prior work) vs. Organized dogmatism (invest career promoting one’s own theories, findings); Quality vs. Quantity
Anderson, Martinson & DeVries, 2007
A little something about COS: Est; Non-profit tech startup; 4 leading foundation funders, > $14M; Located in Charlottesville, VA; Team: ~25 FT & 20 interns, mostly software developers and researchers; Mission: Improve openness, integrity, and reproducibility of research
Create an account: easy and free!
Modify Account Settings: Update information
Dashboard: Project Organizer
Project Overview Page Overview
Using the Wiki: In the menu bar; Wiki history; Add new or edit
Adding Components: Add a new component
Adding Contributors: Select from the results; Choose permissions
Privacy
Uploading Files: Click upload button or just drag and drop
Versioning: See version history and download
Registering Your Work: Choose the registration template; Create a frozen registered version
Sharing Your Work: Create a view-only link; Describe the link use; Option to make anonymous; Select what parts of the project to share
Unique and permanent IDs: Scientific content must be easy to cite and annotate. Approach: GUIDs for all content -> RPCB -> Tim Errington -> Coding_Study_1.xslx
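The "GUIDs for all content" approach can be illustrated with a minimal sketch. The slide does not say how the identifiers are generated, so the alphabet, the length of 5, and the names `new_guid` and `registry` below are all hypothetical. The point is only that projects, people, and files draw from one shared ID space, so any of them can be cited with the same kind of short link.

```python
import secrets

# Hypothetical alphabet and length, chosen for illustration only;
# the slides do not specify how OSF generates its identifiers.
ALPHABET = "23456789abcdefghjkmnpqrstuvwxyz"
GUID_LENGTH = 5


def new_guid(issued, length=GUID_LENGTH, max_attempts=1000):
    """Return a random short ID not already in `issued`, and record it there."""
    for _ in range(max_attempts):
        candidate = "".join(secrets.choice(ALPHABET) for _ in range(length))
        if candidate not in issued:
            issued.add(candidate)
            return candidate
    raise RuntimeError("could not find an unused ID")


# One shared registry for every kind of content: project, person, file.
issued = set()
registry = {}
for kind, label in [
    ("project", "RPCB"),
    ("user", "Tim Errington"),
    ("file", "Coding_Study_1.xslx"),
]:
    registry[new_guid(issued)] = (kind, label)

for guid, (kind, label) in registry.items():
    print(f"{guid} -> {kind}: {label}")
```

Because IDs are never reused or reassigned in this sketch, a citation that points at one keeps resolving to the same item, which is what makes the ID permanent.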
Current Add-ons: Dropbox, GitHub, Dataverse, figshare, Amazon S3
Connecting the workflow
Examples of other connections: Taking a data management plan and converting it into a living document. Providing a data repository lookup service and checklist to assist with preparation for deposit. Connecting to a sensitive video storage service.
Tools used: Qualtrics, Dropbox, SurveyMonkey, R. OSF features used: version control, collaboration, wiki for lab notebook and meetings.
Tools used: next-gen sequencers, pipeline software, custom software. OSF features used: version control, file sharing, GitHub and Dropbox integration, public sharing.
Problem to solve: teaching undergrads how to make research reproducible and verifiable. OSF features used: organization of documents and data, command files, metadata, and file sharing.
What’s underneath the hood? Python, JavaScript, Git, MongoDB / TokuMX, Ansible, Elasticsearch, OSF API, Rackspace, Linode
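The slide names the OSF API without showing a request, so here is a rough sketch of how a script might look up a public project. It assumes the public v2 REST API at `https://api.osf.io/v2` with a JSON:API-style response. The helper names and the sample payload are made up for illustration, and the network call is defined but not executed here.

```python
import json
import urllib.request

# Assumed base URL of the public OSF API (not stated on the slide).
API_BASE = "https://api.osf.io/v2"


def node_url(guid):
    """Build the URL for a project ("node") from its short ID."""
    return f"{API_BASE}/nodes/{guid}/"


def summarize_node(payload):
    """Pull a few fields out of a JSON:API-style node response."""
    data = payload["data"]
    attrs = data.get("attributes", {})
    return {
        "id": data["id"],
        "title": attrs.get("title"),
        "public": attrs.get("public"),
    }


def fetch_node(guid, timeout=10):
    """Fetch a public node over HTTP. Requires network access."""
    with urllib.request.urlopen(node_url(guid), timeout=timeout) as resp:
        return json.load(resp)


# Offline demonstration with a made-up payload in the assumed shape.
sample = {
    "data": {
        "id": "abcde",
        "type": "nodes",
        "attributes": {"title": "Example project", "public": True},
    }
}
summary = summarize_node(sample)
print(node_url("abcde"))
print(summary)
```

To try it against the live service, replace the sample with `summarize_node(fetch_node("<id of a public project>"))`.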
Want to join the effort? Contribute content to SHARE or partner on curation of content; Serve as an Ambassador; Coordinate a reproducible statistics and practices workshop; Join the team –
Open Science Framework: osf.io