Restoring reproducibility: Making scientist software discoverable Source codes are increasingly important for the advancement of science in general and.

Slides:



Advertisements
Similar presentations
Identifiers and trust: lessons for data publishers Valued Resources: Roles and Responsibilities of Digital Curators and Publishers FOURTH BLOOMSBURY.
Advertisements

Supporting Engagement in Open Access: a Publishers Perspective
DIScovery SciEnce through Computational Thinking (DISSECT) Enrico Pontelli.
Joint CASC/CCI Workshop Report Strategic and Tactical Recommendations EDUCAUSE Campus Cyberinfrastructure Working Group Coalition for Academic Scientific.
Research CU Boulder Cyberinfrastructure & Data management Thomas Hauser Director Research Computing CU-Boulder
Data, Data Everywhere, But Not a Byte to Eat Michael F. Huerta, Ph.D. Associate Director, National Library of Medicine Director, Office of Health Information.
Symposium on Digital Curation in the Era of Big Data: Career Opportunities and Educational Requirements: A Data Scientist Perspective Dr. Vicki Lynn Ferrini.
Funding Opportunities at NSF Jane Silverthorne International Arabidopsis Consortium Workshop January 15, 2011.
Birds of a Feather: Deans, Chairs, Organization Representatives Tom Healy Lead Program Manager External Research & Programs Microsoft Research Tom Healy.
1 Cyberinfrastructure Framework for 21st Century Science & Engineering (CF21) IRNC Kick-Off Workshop July 13,
Social, Political and Financial Issues of Connecting GeoData In and Between Governmental Agencies NSF GeoData Workshop June 17-19, 2014 Boulder, CO Sara.
Software Cluster Improve Collaboration and Community Engagement Work with diverse communities that contribute to the sustainability of scientific software.
Key integrating concepts Groups Formal Community Groups Ad-hoc special purpose/ interest groups Fine-grained access control and membership Linked All content.
Issue: How can RUSA best meet member desire for online learning opportunities and balance value to members while creating a sustainable online learning.
Astrophysics Source Code Library Making scientist software discoverable Alice Allen ASCL, Editor 06/25/15 CUA.
Translating Knowledge to On-the-Ground Results Henry L. Green, Hon. AIA National Institute of Building Sciences Congressional.
CC 2007, 2011 atribution - R.B. Allen Scholarship, Science, Data, and Domain Informatics.
Overview of the HUBzero Platform
National Statistical Committee of the Kyrgyz Republic Science, technology and innovation statistics in the Kyrgyz Republic Training workshop for ECO countries.
Deep Carbon Observatory Data Science and Data Management Infrastructure Overview and Demonstration Patrick West – Tetherless World Constellation Rensselaer.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Objectives.
Copyright Basics Fundamentals you should know Slides produced by the Copyright Education & Consultation Program.
MAGIC Presentation Gabrielle Allen, NSF / Richard Carlson, DOE MAGIC CoChairs September 13, 2011 Large Scale Networking (LSN) FY12 Annual Planning Meeting.
CyberInfrastructure workshop CSG May Ann Arbor, Michigan.
Data Sharing and Archiving: A Professional Society View Clifford S. Duke Ecological Society of America September 9, 2010.
Management and organizational bodies guiding the DCO-Data Science DCO COMMITTEE STRUCTURE Deep Life Deep Life Reservoirs & Fluxes Deep Energy Physics &
Data Issues & Recommendations. Data Issues AON meeting last week—18 AON PIs, plus collaborating groups & programs –CADIS, an AON project for cooperative.
The Digital Library for Earth System Science: Contributing resources and collections Meeting with GLOBE 5/29/03 Holly Devaul.
NanoHUB.org and HUBzero™ Platform for Reproducible Computational Experiments Michael McLennan Director and Chief Architect, Hub Technology Group and George.
From Lucent, Inc. This is the Sablime® home page. It has access to all the functionality of the Sablime® Configuration Management System.
TWC Deep Earth Computer: A Platform for Linked Science of the Deep Carbon Observatory Community Xiaogang (Marshall) Ma, Yu Chen, Han Wang, Patrick West,
HEFCE/Higher Education Academy/JISC cc-by-sa (uk2.5) Image source – flickr (cc-by) OER and the Open Agenda Malcolm Read, Executive Secretary, JISC.
ClinicalTrials.gov Results Reporting, Unique Evidence, and the Role of Medical Librarians SCR CONNECTions March 19, 2014.
The Digital Library for Earth System Science: Contributing resources and collections GCCS Internship Orientation Holly Devaul 19 June 2003.
The Geosciences are a discipline that is strongly data driven, and large data sets are often developed by researchers and government agencies. The complexity.
S2I2: Enabling grand challenge data intensive problems using future computing platforms Project Manager: Shel Swenson (USC & GATech)
Cyberinfrastructure What is it? Russ Hobby Internet2 Joint Techs, 18 July 2007.
What European needs to do to take the lead in Space Astronomy ESO Astronomy Faculty May 2004.
Breakout # 1 – Data Collecting and Making It Available Data definition “ Any information that [environmental] researchers need to accomplish their tasks”
A Data Centre for Science and Industry Roadmap. INNOVATION NETWORKING DATA PROCESSING DATA REPOSITORY.
WEBSITE CRITIQUE On Scientific News Websites. IFLSCIENCE.com.
Enabling Grids for E-sciencE EGEE Applications Registry Current status & latest developments Marios Chatziangelou.
Research Information Management: Continuity, Change and Impact Michael Jubb Research Information Network UUK Workshop 5 December 2007.
Cyberinfrastructure: Many Things to Many People Russ Hobby Program Manager Internet2.
Arizona Astronomical Data Hub AAS 227: Dark/Orphaned Data P. Bryan Heidorn ORCID: University of January 2016.
The Case for a Data Decadal Survey A community-driven initiative to harmonize data practices, research priorities and infrastructure for 2025 Carol B.
Deep Carbon Observatory Data Science and Data Management Infrastructure Overview and Demonstration Patrick West – Tetherless World Constellation Rensselaer.
Preliminary Findings Baseline Assessment of Scientists’ Data Sharing Practices Carol Tenopir, University of Tennessee
Enabling Grids for E-sciencE EGEE Applications Registry Current status & latest developments Marios Chatziangelou.
Southern California Earthquake Center SI2-SSI: Community Software for Extreme-Scale Computing in Earthquake System Science (SEISM2) Wrap-up Session Thomas.
Connecting Users, Data & Data Repositories Simon J. Goring ORCID: John W. Williams doi: /m9.figshare Distinguished Lecture.
CCSM PROGRAM PLAN an update to science plan and business plan based on the proposals put forward for CSL allocations a possible framework to allocate CCSM.
RDA-WDS Publishing Data IG Data Bibliometrics Working Group.
1 Open Science Grid: Project Statement & Vision Transform compute and data intensive science through a cross- domain self-managed national distributed.
Transformative Earth Sciences through Data: Neotoma, EarthCube & Flyover Country Simon Goring Assistant Scientist University of Wisconsin - Madison S i.
Ukpmc.ac.uk As a result of the mandates Research in the open How mandates work in practice 29 th May, 2009 Paul Davey, UK PubMed Central Engagement Manager,
DATA MANAGEMENT: WHAT IT MEANS FOR YOUR RESEARCH Maggie Howell 02 June
EarthCube Sustaining the Geosciences for 21 st Century Challenges Credits: from top to bottom: NOAA Okeanos Explorer Program (CC BY-SA 2.0), NASA/Kathryn.
Restoring reproducibility: Making scientist software discoverable Alice Allen Astrophysics Source Code Library ascl.net.
Bringing visibility to food security data results: harvests of PRAGMA and RDA Quan (Gabriel) Zhou, Venice Juanillas Ramil Mauleon, Jason Haga, Inna Kouper,
Save the Code? What to do with Short research codes
Ian Bruno, Suzanna Ward The Cambridge Crystallographic Data Centre
RDA US Science workshop Arlington VA, Aug 2014 Cees de Laat with many slides from Ed Seidel/Rob Pennington.
Jarek Nabrzyski Director, Center for Research Computing
Restoring reproducibility: Making scientist software discoverable
Research software best practices: Transparency, credit, and citation
Nature and software Leslie Sage Senior editor, physical sciences
Activities and National Priorities of National Members
Bird of Feather Session
Finding information in Management and Leadership
Presentation transcript:

Restoring reproducibility: Making scientist software discoverable Source codes are increasingly important for the advancement of science in general and astrophysics in particular. Journal articles meant to detail the general logic behind new results and ideas often do not make the source codes that generated these results available, decreasing the transparency and integrity of the research. The Astrophysics Source Code Library (ASCL) is a registry of scientist-written software used in astronomy research. The challenges of creating and growing the resource will be covered by its current editor, who will also discuss specific steps the ASCL has taken to improve code discovery in astronomy and the effect this work is having within astronomy and more broadly in other research areas.

Restoring reproducibility: Making scientist software discoverable Alice Allen ASCL, Editor 02/12/15 NIST

Assumptions Software is important in research Research software should be available “… anything less than release of actual source code is an indefensible approach for any scientific results that depend on computation...” Ince, Hatton, & Graham-Cumming, The case for open computer programs, Nature, v. 482, Feb. 23, 2012

ASCL First stop,

Code entry, 1999

Home page, 1999

ASCL Second stop,

Code entry, 2010

Home page, 2010

ASCL Third stop, 2014-present

Code entry, present

Home page, present

Browse, present

alegri / 4freephotos.com

Other efforts ASDS (most robust; funded; ) AstroForge (funded, ) SkySoft (2003-present) Astro-Code Wiki ( ) Astro-Sim ( ) AstroShare ( ; last update 2012)

Lessons learned People don’t want to deposit their codes/like to keep their codes nearby ◦ Little incentive to register software ◦ Don’t want to go first ◦ Don’t want to have another site to update Funding cycle not long enough to get uptake by community Authors will not update metadata Limited marketing limited growth

To bring about change … Build it Enlist/involve others Market, market, market Engage the community ◦ Learn what barriers and incentives exist ◦ Mitigate barriers and nurture incentives Be patient

Total code entries by quarter July, September, 2011

Number of code entries at year end,

Advisory Committee

Get the word out

Community work

No one can assume that valuable innovations will pop up magically in the public domain if their inventors received no reward for their labor and capital. -Richard Epstein

Are we having any effect?

Percentage of code entries in ASCL with citations

Wider efforts Workshop on Sustainable Software for Science: Practice and Experiences (WSSSPE) Library of Congress National Academies of Science workshop NSF Cyberinfrastructure meeting

Questions and discussion