Introduction to the Dryad Digital Repository A nonprofit repository for data underlying the international scientific and medical literature. April 2013.

Slides:



Advertisements
Similar presentations
Partnering with Faculty / researchers to Enhance Scholarly Communication Caroline Mutwiri.
Advertisements

Ryan Scherle and Jane Greenberg. A Repository of Data Underlying Journal Articles.
Evolutionary biology Population genetics Systematics Paleontology Botany and Zoology Genomics Ecology Medicine Agriculture Anthropology Bioinformatics.
The Dryad Data Repository Ryan Scherle 1, Hilmar Lapp 1, Amol Bapat 2, Sarah Carrier 2, Jane Greenberg 2, Peggy Schaeffer 1, Todd Vision 1,3, Hollie White.
Paul Price Dow Chemical Company
Current status Todd Vision (overview) Elena Feinstein (curation) Ryan Scherle (demo) 7/23/12Dryad Board of Directors1.
Data archiving in evolutionary biology Michael Whitlock.
Ensuring a Journal’s Economic Sustainability, While Increasing Access to Knowledge.
PubMed Central ANCHASL Spring Meeting April 1, 2005 Robert James Associate Director of Public Services Duke University.
OPEN ACCESS Your Publisher of Choice DE GRUYTER OPEN Society-Pays Publishing Program.
What is data citation & why do we care? What’s been happening here and overseas? How ready are you for data citation? 1 Welcome! Image:
NATIONAL LIBRARY OF MEDICINE PubMed Central Brooke Dine National Library of Medicine Medical Library Association Conference May 2004.
New business models for open research Todd Vision Jared Lyle Mark Hahnel 12-June-2014Open Repositories1.
Learn more about Open Access Breakfast meeting at BMC March 30th 2010 Aina Svensson and Karin Meyer Lundén Electronic Publishing Centre, Uppsala University.
Open Access: A Publisher’s Perspective Daniel Wilkinson 20 th October, 2014.
Supplementary Data and Publishers Neil Beagrie, Julia Chruszcz, and Peter Williams Charles Beagrie Ltd Dryad UK April 2010.
Guide to a successful PowerPoint design – simple is best
"Cherish old knowledge that you may acquire new" - The Analects of Confucius
Rutgers University Libraries What is RUcore? o An institutional repository, to preserve, manage and make accessible the research and publications of the.
Institutional Repositories Tools for scholarship Mary Westell University of Calgary AMTEC Conference May 26, 2005.
Making sense of doi: /01/503C303E9B551 Digital Object Identifiers DOIs.
Belinda Tiffen Director Library Open Access Publishing: What You Need to Know Research Week UTS:
ARMA 6 th June Costs and payment of open access article processing charges.
Data-PASS Shared Catalog Micah Altman & Jonathan Crabtree 1 Micah Altman Harvard University Archival Director, Henry A. Murray Research Archive Associate.
From Berlin back to Business OPEN Stellenbosch University Library and Information Service Mimi Seyffert Manager: Digitisation and Digital Services.
Open Access: An Introduction Edward Shreeves Director, Collections and Content Development University of Iowa Libraries
Managing journals: challenges and opportunities How to get started (with OJS) Jackie Proven.
Publishing in Perpetuity The importance of Digital Preservation for Publishers in Science, Medicine and Technology Drs Eefke Smit International STM Association.
Literature/data integration and Ryan Scherle Data Repository Architect Dryad Digital Repository HighWire Fall Publishers’ Meeting November 20, 2013 You.
GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT Data Citation Mechanism and.
Collaborative Approach to Open Access: Experience from Bioline International Leslie Chan Associate Director Bioline International University of Toronto.
UC3 Standards and Best Practices for Datasets and Other Supplemental Journal Article Materials UC3 Stephen Abrams Patricia Cruse John Kunze.
The Department of Energy’s Public Access Solution Giving Voice to Energy and Science R&D Results Jeffrey Salmon Deputy Director for Resource Management.
Julie Hannaford, Meryl Greene, Kristian Galberg,
SCIENCE, RESEARCH DATA, AND PUBLISHING Stewart Wills Editorial Director, Web & New Media, Science 26 February 2013.
Supporting scientific communities by publishing data Dryad Digital Repository Peggy Schaeffer OpenAIRE/LIBER Workshop May 28, 2013 Ghent, Belgium.
What can publishers do to support data? Dryad’s perspective STM Annual US Conference - April 22, 2015 Meredith Morovati Executive Director Illustration.
Scholarly Communication, Author Rights, and GT Library Services Julie G. Speer Faculty Advisory Board Meeting April 14, 2009.
Data archiving and curation Ryan Scherle Data Repository Architect Dryad Digital Repository CurateGear January 8, 2014 You may reuse any of the original.
Digital Commons & Open Access Repositories Johanna Bristow, Strategic Marketing Manager APBSLG Libraries: September 2006.
BMJ and Data Sharing Claire Bower, Digital Communications
Now launched! Visit nature.com/scientificdata Honorary Academic Editor Susanna-Assunta Sansone Advisory.
DataONE: Preserving Data and Enabling Data-Intensive Biological and Environmental Research Bob Cook Environmental Sciences Division Oak Ridge National.
Publishing & Citing Research Data Arun Prakash. Agenda  Introduction  Why is Data publishing important ?  Ongoing Work  Role of Semantics.
Evolving a Community Digital Repository: Lessons from Dryad Making data underlying scientific publications discoverable, freely reusable, and citable Bill.
DEEP BLUE University of Michigan Institutional Repository.
Grey Literature in Open Source Repositories December 2015 Dan Aitken, developer, discoverygarden Erin.
Open BU 23 October 2013 Jack Ammerman. Open AccessNational policySustainabilityArticle Processing Charges 2/18/2016Open BU 2 Gold.
Greater Visibility, Greater Access QSpace QSpace Queen’s University Research & Learning Repository.
Data Citation Implementation Pilot Workshop
Managing live digital content with DuraSpace services Bill Branan PASIG Spring 2015.
Open Science and Research – Services for Research Data Management © 2014 OKM ATT 2014–2017 initiative Licenced under.
The R EPOSITORY AS P UBLISHER OPPORTUNITIES AND CHALLENGES IN A DUAL ROLE BEN HOCKENBERRY SYSTEMS LIBRARIAN | ST. JOHN FISHER COLLEGE.
CONTENTdm A proven solution September A complete digital collection management software solution Stores, manages and provides access for all digital.
Discover ScholarSphere A repository service collaboration between the University Libraries and ITS.
The New Now: Institutional Repositories and Academia Institutional Repository USM April 17, 2015 Marilyn Billings Scholarly Communication Librarian.
NRF Open Access Statement
Ian Bruno, Suzanna Ward The Cambridge Crystallographic Data Centre
Promoting and Preserving FIU Research and Scholarship
OceanDocs Digital Repository of Marine Science Research Outputs
ACS 2016 Moving research forward with persistent identifiers
and Scholarly Communication
CNI Spring 2010 Membership Meeting
Access  Discovery  Compliance  Identification  Preservation
Open Access to your Research Papers and Data
OpenML Workshop Eindhoven TU/e,
Research Data Management
Digital Library and Plan for Institutional Repository
Research data lifecycle²
Digital Library and Plan for Institutional Repository
Presentation transcript:

Introduction to the Dryad Digital Repository A nonprofit repository for data underlying the international scientific and medical literature. April 2013 DataDryad.org 1

The End – To make data archiving and reuse standard within scientific communication. The Means – Enable low-burden data archiving at the time of manuscript submission. – Promote researcher benefits from data archiving. – Promote responsible data reuse. – Empower journals, societies & publishers in shared governance. – Ensure sustainability and long-term preservation. The Scope – Research data in science and medicine – Primarily data underlying findings in peer-reviewed articles – Also data from some non-peer reviewed publications (e.g. dissertations) – And some non-data content (e.g. software scripts, figures) DataDryad.org 2

The value proposition For authors and researchers, Dryad… – increases the impact of, and citations to, published research – preserves and makes available others’ data – frees researchers from the burden of data preservation and access For journals, publishers, and societies, Dryad… – frees journals from the burden of maintaining supplemental data – supports all varieties of data archiving policies For libraries and institutions, Dryad… – makes data available at no cost, under clear terms of use – helps fulfill their research data management mandates For funders, Dryad… – provides a cost-effective mechanism to make research more accessible DataDryad.org 3

Data archiving has many benefits Modified from Beagrie et al. (2009) Keeping Research Data Safe 2 Direct Verification of published research Preserving accessibility to data Allowing reuse and repurposing of data Discoverability of data Indirect (costs avoided) Redundant data collection Inefficient legacy data curation Burden of sharing-upon-request Opportunity cost of science not done Near term Protection against personnel turnover Availability for review and validation Long term Secure long-term stewardship Increased impact per publication Private Increased citations New collaborations New research opportunities Fulfilling funding mandates Public More efficient use of research dollars Public trust in science Educational opportunities Improved methodologies More informed policy 4 DataDryad.org

Dryad focuses on the long tail of orphan data Volume Rank frequency of datatype Specialized repositories (e.g. GenBank, GBIF) Orphan data After Heidorn (2008) Many datasets belong to the long tail. Though less standardized, they can be rich in information content and have unique value 5 DataDryad.org

Why use Dryad rather than Supplementary Online Materials? DryadSOM Discoverable: indexed and exposed to both web and bibliographic search engines ✔✗ Identifiable: DataCite DOIs within articles serve as permanent, resolvable identifiers ✔✗*✗* Permanent: processes in place to promote preservation (incl. format migration) ✔✔ / ✗ ** Curated: quality control by both automated processes and human inspection ✔✗*✗* Ease of deposit: streamlined deposit, allowance for large and complex datasets ✔✔ / ✗ ** Formatted for reuse: support for non-PDF file formats ✔✔ / ✗ ** Updatable: new versions of data files can be added, metadata can be enhanced ✔✗ Support for embargoes: can delay release of data in accordance with journal policy ✔✗ Free reuse: no paywall, clear terms of reuse (all data released under CC Zero) ✔✔ / ✗ ** Economy of scale: cost efficiency from shared infrastructure ✔✔ / ✗ ** Alignment to organizational mission: focus on archiving and reuse of scientific data ✔✗ * A few publisher SOM sites are exceptions to the general rule ** Practices differ among publishers, see Smit (2011), doi: /january2011-smit 6 DataDryad.org

Researchers and journals are using Dryad for archiving DataDryad.org 7

…and using the data for research DataDryad.org 8

9

10 JournalIntegration Date Data Packages Data Downloads Ave. downloads per package All Journals ,39646 Molecular Ecology ,60438 Evolution ,52433 American Naturalist ,19554 Journal of Evolutionary Biology ,72928 Journals benefit when data is reused A “Data Package” is all of the data files for a journal article. All Dryad data packages link to the associated journal article.

Dryad integrates article and data submission Dryad works with the manuscript workflow of journals to: – Simplify the process of data submission for authors, – Allow authors to deposit, to a single repository, gigabytes of data files in their original formats, – Ensure permanent bidirectional links between the article and the data, and increased visibility for both, – Ensure that the data is accessible once the article becomes available, – Offer the option of making data available for editorial or peer review, via secure access for editors and reviewers, – Give authors the option to embargo public access to data for a limited time after publication, if permitted by the journal's data policy. Options are customized to meet the requirements of each journal. DataDryad.org 11

Over 30 integrated partner journals The American Naturalist Biology Letters BMJ Open Biological Journal of the Linnean Society Ecological Monographs eLife Evolutionary Applications Evolution Functional Ecology gms German Medical Science Heredity Journal of Animal Ecology Journal of Evolutionary Biology Journal of Fish and Wildlife Management Journal of Heredity Journal of Open Public Health Data Journal of Paleontology Methods in Ecology and Evolution Molecular Ecology and M.E. Resources Paleobiology PLoS Biology, PLOS Genetics Systematic Biology ZooKeys & 7 other Pensoft journals.. and more being added regularly DataDryad.org 12

Trustworthy repository infrastructure Making data available is the primary mission of the organization – No pay-walls or restrictive licenses (all released under CCZero) – The same data may be hosted by other services (non-exclusivity) Built on the DSpace repository platform, an open source framework used by hundreds of institutional repositories Multiple machine and human interfaces for discovery and access – Dublin Core metadata, harvestable through OAI-PMH – DOIs registered through DataCite – Curators add metadata to enhance keyword searching Assurance of data integrity and permanent availability – Service mirroring and backup – File migration and bit-level integrity assurance – Organizational failover through DataONE and CLOCKSS 13 DataDryad.org

Dryad as an organization Governed by an interim Board Now a nonprofit organization incorporated in North Carolina, USA. Membership open to all stakeholder organizations, including scientific societies, publishers, funding agencies, universities & institutes. Governed by an elected 12-member Board of Directors – Nominated and elected by the Membership First Annual membership meeting 24 May 2013 in Oxford. DataDryad.org 14

Dryad’s business plan Deposit fees are the primary source of revenue, for several reasons: – The time of deposit is when the majority of costs are incurred – Revenue scales with costs (i.e. volume of deposits) – The costs are distributed both fairly and widely – This enables Dryad to make access to the data free in perpetuity Membership fees will cover costs of annual membership meetings Project grants will supplement the operational budget for R&D activities DataDryad.org 15

Payment plans PlanContract?Paid byCost 1 1. VouchernoAny organization, in advance $65 per data package (members) $70 per data package (non- members) 2. Deferred payment 1 yr.Any organization, in advance $70 per data package (members) $75 per data package (non- members) 3. Subscrip- tion 2 yrs.Journal or journals, fee based on total # of research articles published by the in the prior year Unlimited number of submissions for a fixed fee; base fee of $25 per research article for members, $30 for non-members Individual deposit noAuthor, at time of deposit $80/data package, with waivers for submissions from low-income economies 1 Up to a fixed deposit size (currently 10GB). Additional charges for larger deposits. DataDryad.org 16

To learn more Repository home: News: Project documentation: Code: or contact us: Todd Vision, Director, Laura Wendell, Dryad Executive Director, DataDryad.org 17