Research software best practices: Transparency, credit, and citation

Presentation transcript:

Research software best practices: Transparency, credit, and citation
Alice Allen
Astrophysics Source Code Library
ascl.net
We’ve heard about practices that can help in the software development life cycle. I will cover a few steps that you can take to strengthen research and recognize the contribution of software to the community, and tell you a bit about the Astrophysics Source Code Library and how it can help.

Research software
Integrity of research depends on transparency and reproducibility
“… anything less than release of actual source code is an indefensible approach for any scientific results that depend on computation...”
Ince, Hatton, & Graham-Cumming, The case for open computer programs, Nature, v. 482, Feb. 23, 2012
Most astronomers depend on software to do their research; it is a method, and that method should be available so that others can examine it. Or, to paraphrase a bit of that 2012 Nature article: not making the software available is indefensible.

Efforts Underway
Workshop on Sustainable Software for Science: Practice and Experiences (WSSSPE)
CodeMeta project
Force11 Software Citation Working Group
Center for Open Science's Transparency and Openness Promotion (TOP) Guidelines
Engineering Academic Software at Dagstuhl
WSSSPE: A multi-disciplinary, international, community-driven effort that has held four annual meetings, with a couple of smaller interim meetings. It advocates for best practices in software sustainability and career paths for software authors, and works to improve recognition of research software as an intellectual contribution equal to other research products. The first meeting was in 2013; reports from its meetings are available online.
CodeMeta: A relatively small group of people involved in data and software repositories and registries. It is creating a crosswalk table, a “Rosetta stone” for software metadata standards already in place. This will help facilitate the movement of metadata about software, and citations to it.
Force11: Future of Research Communications. It grew out of a workshop held in Dagstuhl in 2011 and seeks to improve knowledge creation and sharing by leveraging technology for media-rich digital publishing of scholarly outputs, including data and software. It has published software citation principles.
Center for Open Science: Founded in 2013, it seeks to remove barriers to sharing and to increase transparency and replicability of research. Its strategic plan states that “openness with data, methods, and tools makes them citable contributions.” Its Transparency and Openness Promotion (TOP) Guidelines advocate for code accessibility.
Engineering Academic Software at Dagstuhl: Held in June 2016; its manifesto addresses software citation.
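The crosswalk idea mentioned for CodeMeta can be pictured as a lookup table that renames fields from each source schema into one shared vocabulary. The following is only a minimal sketch under stated assumptions: the schema names `registry_a` and `registry_b`, their field names, and the `translate` helper are all hypothetical, and the real CodeMeta crosswalk covers many more schemas and terms.

```python
# Hypothetical crosswalk: field names from two made-up source schemas mapped
# onto a shared vocabulary. None of these mappings are taken from CodeMeta.
CROSSWALK = {
    "registry_a": {"title": "name", "creators": "author", "repo_url": "codeRepository"},
    "registry_b": {"software_name": "name", "authors": "author", "source": "codeRepository"},
}


def translate(record: dict, schema: str) -> dict:
    """Rename the keys of `record` from `schema` into the shared vocabulary.

    Fields with no mapping are collected under "unmapped" rather than dropped,
    so nothing is lost silently when metadata moves between systems.
    """
    mapping = CROSSWALK[schema]
    translated = {}
    unmapped = {}
    for key, value in record.items():
        if key in mapping:
            translated[mapping[key]] = value
        else:
            unmapped[key] = value
    if unmapped:
        translated["unmapped"] = unmapped
    return translated


if __name__ == "__main__":
    # Two differently shaped records describing the same (made-up) code.
    print(translate({"title": "ExampleCode", "creators": ["J. Doe"]}, "registry_a"))
    print(translate({"software_name": "ExampleCode", "authors": ["J. Doe"]}, "registry_b"))
```

Once both records are expressed in the shared vocabulary they can be compared or merged directly, which is the practical benefit a crosswalk is meant to provide.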

Force11 Software Citation Principles
Importance
Credit and attribution
Unique identification
Persistence
Accessibility
https://peerj.com/articles/cs-86/
Importance: Software should be considered a legitimate and citable product of research.
Credit and attribution: Software citations should facilitate giving scholarly credit.
Unique identification: A software citation should include a method for identification that is globally unique and recognized by at least a community of the corresponding domain experts.
Persistence: Unique identifiers and metadata describing the software and its disposition should persist, even beyond the lifespan of the software they describe.
Accessibility: Software citations should facilitate access to the software itself and to its associated metadata, documentation, data, and other materials necessary for both humans and machines to make informed use of the referenced software.
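As a rough illustration of how some of these principles translate into concrete metadata, the sketch below checks a citation record for the fields that credit, unique identification, and accessibility would each need. The `SoftwareCitation` class, its field names, and the `missing_for_principles` helper are hypothetical and are not part of the Force11 document; persistence is left out because it depends on whoever maintains the identifier and cannot be judged from a single record.

```python
from dataclasses import dataclass, field


@dataclass
class SoftwareCitation:
    """Hypothetical record holding the pieces of one software citation."""
    name: str
    authors: list = field(default_factory=list)
    identifier: str = ""   # e.g. a DOI or registry ID; empty means missing
    access_url: str = ""   # where the software or its landing page can be reached


def missing_for_principles(citation: SoftwareCitation) -> list:
    """Return the principles this record cannot yet satisfy."""
    problems = []
    if not citation.authors:
        problems.append("credit and attribution: no authors listed")
    if not citation.identifier:
        problems.append("unique identification: no identifier")
    if not citation.access_url:
        problems.append("accessibility: no way to reach the software")
    return problems


if __name__ == "__main__":
    # A made-up, incomplete record: it names the code and an author only.
    draft = SoftwareCitation(name="ExampleCode", authors=["J. Doe"])
    for problem in missing_for_principles(draft):
        print(problem)
```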

Dagstuhl Manifesto on Citation
I will make explicit how to cite my software.
I will cite the software I used to produce my research results.
When reviewing, I will encourage others to cite the software they have used.
https://dl.dropboxusercontent.com/u/11565521/dagstuhl-eas-manifesto-2016-12-02.pdf
The Dagstuhl Manifesto focused on steps that members of a research community can take on their own; these actions are ones that most astronomers can start doing immediately.

Astrophysics Source Code Library (ASCL, ascl.net)
An effort to encourage and support transparency of research. If you are reading a paper and come across the name of a code, you should be able to look through the source code to examine it and see what assumptions and calculations were made.
Started in 1999

And here’s a typical record. We use WordPress for content management and news (our blog), and have an online submission form. The MySQL database allows us to better integrate with ADS, the primary indexer of astro journals; we create the bibcode ADS uses for our entries, and added the link to the ADS entry for our record to the page.

Number of code entries at year end, 2010-2016
We currently have 1485 codes in the ASCL.

Barriers
No payback/benefit to sharing software
Not required to, so don’t/too busy to do more work
Fear of judgment for “messy” code
Fear of losing competitive edge
University policies
Expectation of support from community
http://www.pd4pic.com/road-works-site-street-away-barrier-warning-stop.html

No one can assume that valuable innovations will pop up magically in the public domain if their inventors received no reward for their labor and capital.
Richard Epstein
Incentives:
Citations for software/tracking citations
Greater recognition for code contribution
Complying with funding requirements
Future funding/peer review
Meet changing/new expectations

Cumulative number of citations to ASCL entries in ADS by year
Citations to ASCL entries are generally more than doubling every year, as you can see from this chart. The ASCL has made it possible to cite software that does not have a descriptive paper in the literature to use for citation. The second largest source of referral traffic to the ASCL is ADS, which means people are finding links to code entries there and following them to the ASCL; codes are more easily discoverable.

Citations by journal
ADS shows that 58 journals it indexes have citations to ASCL entries. In addition to being indexed by ADS, the ASCL is indexed by Web of Science.

Benefits of the ASCL
Improves transparency of research
Aids in software discovery
Provides a way to cite software separately from papers
Assigns DOIs for codes housed on ASCL
Reliability of data
Transparency: links software with research that uses the software
Discovery: Google/Google Scholar, ADS, WoS
Citation: Journals don’t want to link to your website! Citations are tracked by ADS and WoS, and authors accrue credit for valuable contributions to science
DOIs: for those who want them
Reliability: Actively curated

You can change the world! (Or at least a little piece of it!)
Release your code
Specify how you want your code to be cited
License your code
Register your code
Archive your code somewhere
Release your code so others can at least read through it
Specify citation method: choose a trackable way and put this information where it is easily seen
License your code so others know how they may use it
Register your code in the ASCL
Archive your code somewhere so it’s not lost to science and remains available to complete the research record
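One way to act on "specify how you want your code to be cited" is to generate a ready-to-copy reference and place it in the README or on the code's landing page. The sketch below builds a generic BibTeX `@misc` entry. It is only an illustration under stated assumptions: the `bibtex_entry` helper, the entry key, the identifier, and the URL are all hypothetical, and this is not presented as the format the ASCL or ADS recommends.

```python
def bibtex_entry(key, authors, title, year, identifier, url):
    """Build a generic BibTeX @misc entry for a piece of software.

    `identifier` is whatever trackable ID the code has (for example a DOI or
    a registry ID); it is stored in the `note` field here as an assumption.
    """
    author_field = " and ".join(authors)  # BibTeX separates authors with "and"
    lines = [
        f"@misc{{{key},",
        f"  author = {{{author_field}}},",
        f"  title = {{{title}}},",
        f"  year = {{{year}}},",
        f"  note = {{{identifier}}},",
        f"  url = {{{url}}}",
        "}",
    ]
    return "\n".join(lines)


if __name__ == "__main__":
    # Every value below is made up for illustration.
    print(bibtex_entry(
        key="example2016",
        authors=["Doe, J.", "Roe, R."],
        title="ExampleCode: a hypothetical analysis tool",
        year=2016,
        identifier="hypothetical-id:0000",
        url="https://example.org/examplecode",
    ))
```

Putting the generated entry near the top of the README keeps the preferred citation where users will see it, and using a single identifier consistently makes citations easier for indexers to track.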

Dagstuhl Manifesto on Citation
I will make explicit how to cite my software.
I will cite the software I used to produce my research results.
When reviewing, I will encourage others to cite the software they have used.
https://dl.dropboxusercontent.com/u/11565521/dagstuhl-eas-manifesto-2016-12-02.pdf