Software Sustainability Institute www.software.ac.uk The Software Sustainability Institute 20 January 2015, HEP Software Foundation workshop Neil Chue.

Slides:



Advertisements
Similar presentations
April 2010 MRC Data Sharing Policy Peter Dukes Policy Lead – Data Sharing & Preservation.
Advertisements

Building Repositories of eprints in UK Research Universities Bill Hubbard SHERPA Project Manager University of Nottingham.
ESDS Qualidata and QUADS Coordination Louise Corti Online Resources Day 15 November 2005, London.
A centre of expertise in data curation and preservation DCC/NeSC eScience Workshop, June 2008 Working in partnership with the eScience community This work.
Web: OMII-UK Delivering Software and Social Platforms for Successful Research RCUK Review of e-Science, 8 December.
Web: The Future of OMII-UK e-Science: the Changing Landscape 17 April 2009 Neil Chue Hong.
European R&D Support Programme ACCESSING EUROPEAN FUNDING FOR RESEARCH AND DEVELOPMENT.
Software Sustainability Institute “Doing Science Properly in the Digital Age” UK e-Infrastructure Academic User Community Forum 12 September.
January 3-4th 2007University of Plymouth Academic Dissemination with a focus on HELP CETL Award Holders Mark Stone, HELP CETL Director Higher Education.
Relating research to practice Heather King Department of Education King’s College London.
Writing For Publication Workshop STEM Discipline.
Software Sustainability Institute SSI in one slide.
The JISC vision of research information management Dr Malcolm Read Executive Secretary, JISC.
CAS Academy September 2012 Network of Teaching Excellence in Computer Science.
Developing and enhancing the practice and management of scholarly activity and research within Further Education Colleges, Higher Education Academy [HEA],
PERFORMANCE FOR ALL The Project & the System. A HE project co-ordinated by University of Bristol, open to HE internationally. Developing the requirements.
Data Sources & Using VIVO Data Visualizing Scholarship VIVO provides network analysis and visualization tools to maximize the benefits afforded by the.
The Tower Hotel, November 26, 2009 Research Data Management Infrastructure Programme Launch Event SUpporting Data Management Infrastructure for the Humanities.
Software Sustainability Institute Software Information and Scientific Publications doi: /m9.figshare Beyond EMI: A Roadmap.
The student experience of e-learning Dr Greg Benfield Oxford Centre for Staff & Learning Development.
August 14, 2015 Research data management – an introduction Slides provided by the DaMaRO Project, University of Oxford Research Services.
Serving MERLOT on Your Campus Gerry Hanley California State University and MERLOT Seminars on Academic Computing August 7, 2002 Snowmass CO Copyright Gerard.
Software Sustainability Institute Training in Computational Skills Scientific Meeting 2014 “NGS Data after the Gold Rush” TGAC, Norwich.
Key integrating concepts Groups Formal Community Groups Ad-hoc special purpose/ interest groups Fine-grained access control and membership Linked All content.
Developing a Strategy for Technology Enhanced Learning at UEL.
Margaret J. Cox King’s College London
Lessons from industry for science cyberinfrastructure Simplicity, scale, and sustainability via SaaS/PaaS.
Software Sustainability Institute Software Sustainability: Issues, Challenges and Initiatives Neil Chue Hong,
Defining and Measuring Impact Professor Andy Neely Deputy Director, AIM Research.
Software Sustainability Institute Linking software: Citations, roles, references,and more
A centre of expertise in digital information management UKOLN is supported by: Benefits of Research360 Catherine Pink Institutional Data.
Software Sustainability Institute Referencing software to make it sustainable: what, why and how? SciencePAD Persistent Identifiers.
Managing Research Data – The Organisational Challenge at Oxford James A J Wilson Friday 6 th December,
13 September 2012 The Libraries’ Role in Research Data Management: A Case Study from the University of Minnesota Meghan Lafferty, Chemistry, Chemical Engineering,
Software Sustainability Institute Putting the user back into software sustainability 16 December 2013, Scientific Software Days, Austin.
We are the 92% Valuing the contribution of research software Neil Chue Hong, FORCE2015 Research Communications and e-Scholarship.
The Personal Tutoring System Background for new personal tutors and other student support staff.
Caring and Sharing Collaboration in Digital Curation outside North America Ross Harvey Simmons College, Boston Curation Matters: 17 June 2010.
Software Sustainability Institute Dealing with software: the research data issues 26 August.
Building An Academic Career
Software Sustainability Institute What makes “good code” good for science? 26 th September 2013, MozFest 2013, London Neil Chue Hong.
A centre of expertise in digital information management UKOLN is supported by: University of Bath Roadmap for EPSRC Catherine Pink Institutional.
Scholarly communications Discussion group Linked Data Workshop May 2010.
An Introduction. Aspiration To begin the process of adding significant value to those emerging repositories in which.
Software Sustainability Institute Software Attribution can we improve the reusability and sustainability of scientific software?
Because good research needs good data Funded by: Digital Curation for Researchers, 28th February 2013 The Shifting Research Data Management Policy Landscape.
Evaluating E-Learning Efficacy University of York, UK Wayne Britcliffe and Simon Davis Edinburgh Napier Learning and Teaching conference 14 th June 2012.
June 3, 2016 Research data management – an introduction Slides provided by the DaMaRO Project, University of Oxford Research Services.
National Center for Supercomputing Applications Barbara S. Minsker, Ph.D. Associate Professor National Center for Supercomputing Applications and Department.
HEFCE/Higher Education Academy/JISC cc-by-sa (uk2.5) Image source – flickr (cc-by) OER and the Open Agenda Malcolm Read, Executive Secretary, JISC.
We are the 92% 16 November 2014, WSSSPE2, SC14, New Orleans, USA Neil Chue Hong Software Sustainability.
Cyberinfrastructure What is it? Russ Hobby Internet2 Joint Techs, 18 July 2007.
1 © ACADEMY OF FINLAND Academy of Finland 2012: Research knows no boundaries Tiina Kotti PhD, Programme Manager, Programme Unit.
Professional progression of a subject librarian Suzanne Rushe Subject Librarian for Engineering Limerick Institute of Technology
Software Sustainability Institute Activities and Experiences from the UK Software Sustainability Institute Neil Chue Hong,
Software Sustainability Institute Tracking Software Contributions doi: /m9.figshare Joint ORCID – DRYAD Symposium on Research.
Software Sustainability Institute Working with research software 2 nd - 4 th November.
Software Sustainability Institute Building sustainable software for science … why good code is only the beginning 10 April 2013, EGI.
What does it mean to be a RETA Instructor this project? Consortium for 21 st Century Learning C21CL
Introducing the RSP Chris Yates, University of Wales, Aberystwyth.
Software Sustainability Institute Open science is impossible without software 5 th April 2016,
Software Sustainability Institute There’s No Such Thing As Irreproducible Research 27.
Research Councils UK and the research funding landscape Name Job title Research Councils UK.
Software Sustainability Institute There’s No Such Thing As Irreproducible Research (Software Credit Edition)
Open Exeter Project Team
Ian Bruno, Suzanna Ward The Cambridge Crystallographic Data Centre
Data Management: Documentation & Metadata
Brian Matthews STFC EOSCpilot Brian Matthews STFC
Bird of Feather Session
Presentation transcript:

Software Sustainability Institute The Software Sustainability Institute 20 January 2015, HEP Software Foundation workshop Neil Chue Hong Software Sustainability Institute ORCID: | Where indicated slides licensed under Supported by Project funding from

Software Sustainability Institute Of course, we don’t deliberately set out to create code that can’t be used by others Right?

Software isn’t special, it’s mainstream Survey of researchers from 15 Russell Group unis conducted by SSI between Aug- Oct respondents covering representative range of funders, discipline and seniority. 69% 92%

This isn’t just about the “traditional” computational sciences Survey of researchers from 15 Russell Group unis conducted by SSI between Aug- Oct respondents covering representative range of funders, discipline and seniority.

And it isn’t just using software, it’s researchers developing software too Survey of researchers from 15 Russell Group unis conducted by SSI between Aug- Oct respondents covering representative range of funders, discipline and seniority. 56% 21%

So what’s the issue? Survey of researchers from 15 Russell Group unis conducted by SSI between Aug - Oct respondents covering representative range of funders, discipline and seniority. Analysis of data from 49,650 grant titles and abstracts published on Gateway to Research covering Analysis of job adverts posted to jobs.ac.uk in 1H % 4% Of UK researchers have had no formal software development training Of jobs advertised in UK universities were software related 77% Of PIs had not included costs for software development in bids 30% Of UK research investment has been spent on research which relies on software … and then there are gender related issues

Career Paths in UK Careers outside academic sector Non-university Research (industry, government etc.) ProfessorPermanent Research Staff Early Career Research PhD students Source: The Scientific Century, Royal Society, 2010 (revised to reflect first stage clarification from “What Do PhD’s Do?” study) UK STEM graduate career paths

Software Sustainability Institute Software is pervasive in research

Software Sustainability Institute Pleiotropic loci – diabetes and Parkinson’s disease Selection at pleiotropic loci underlies disease co-occurrence in human populations. Navarro, Haley, Karosas et al. Submitted to Nature Genetics

Software Sustainability Institute Behind every great piece of science… #go through each SNP of interest for(my $x = 0; $x < $x++) { #and then each downstream SNP of interest for(my $y = $x+1; $y < $y++) { #if SNPs within our chosen distance (500kb) and both present in the haplotypes file if((!($trait[$x] eq $trait[$y])) && (abs($pos[$x] - $pos[$y]) <= ) && (exists($legArrayPos{$pos[$x]})) && (exists($legArrayPos{$pos[$y]}))) { my $snp1ArrayPos = "”; my $snp2ArrayPos = "”; my $snp1All = "”; my $snp2All = "”; #create output file for this SNP pair my $filename = "ConditionedResults2/$chr[$x].$pos[$x]-$pos[$y].EHH.GBR.2.txt”; print "$filename\n”; unless (-e $filename) { open(OUT, ">$filename"); #####################CHANGE THESE IF NOT FOCUSING ON SECOND SNP######################### my $start = $pos[$y] ; if ($start < 1) { $start = 1; } my $end = $pos[$y] ; if ($end > $chrLengths{$chr[$x]}) { $end = $chrLengths{$chr[$x]}; }

Software Sustainability Institute Just the Nature of the problem? Maintenance is not fun Producing new code is fun Statistics courtesy of Jo Hannay et al, “How Do Scientists Develop and Use Scientific Software? Published online 13 October 2010 | Nature 467, (2010) doi: /467775a

Software Sustainability Institute The modern researcher… … worries about:  Data management and analysis  Reproducible research  Scalable simulations  Integration of models and workflows  Collaboration Picture of Otto Stern courtesy of Emilio Segre Visual Archives

Software Sustainability Institute reuse reproduce repeatreplicate same experiment same lab same experiment different lab same experiment different set up different experiment some of same test Figure by Carole Goble adapted from Drummond C, Replicability is not Reproducibility: Nor is it Good Science, online and Peng RD, Reproducible Research in Computational Science Science 2 Dec 2011:

Software Sustainability Institute SIGMOD Reproducibility SIGMOD conference offered to attempt to repeat/reproduce papers accepted at conference  “High burden on reviewers when setting up experiments”  Use of VMs advocated Bonnet et al, SIGMOD Record, June 2011 (Vol. 40, No. 2) doi: /

Software Sustainability Institute People UK Research Computing Ecosystem Network/Collaboration Instruments Software Computing Data Communities … Communities …

Software Sustainability Institute SSI Drivers and Themes Two key drivers which cause people to seek the SSI’s advice:  They want to be more productive in their research  They don’t want to be embarrassed by appearing worse than their peers Broadly, our work falls into a few key themes:  Developing the scientific computing / software development skill base  The role and reward of software in research  Recognition of software career paths  Re[peatable|producible|computable] research

Software Sustainability Institute The Foundations of Digital Research Re- search Careers Recognition / Reward Skills and Capability

Software Sustainability Institute The Software Sustainability Institute A national facility for cultivating world- class research through software Better software enables better research Software reaches boundaries in its development cycle that prevent improvement, growth and adoption Providing the expertise and services needed to negotiate to the next stage Developing the policy and tools to support the community developing and using research software Supported by EPSRC Grant EP/H043160/1

Software Sustainability Institute SSI Objectives 1.Getting software on the research agenda 2.Supporting communities that want change 3.Increasing skills 4.Improving software Building a platform Engaging the community Being an authoritative voice

Communication Website & blog Campaigns Advice Guides Courses Workshops Fellowship Research Software Policy Training Community Consultancy 41 projects 92 evaluations 4 surgeries 33 UK SWC workshops learners 50,000 readers 41 domain ambassadors 20+ workshops organised 740 researchers 50,000 grants analysed 150+ contributed articles 19,000 unique visitors per month 272 RSEs engaged1700 signatures 13 issues highlighted

Architectural Dominance Image courtesy PDES Inc Slide from Sean Barker, BAE SYSTEMS, DPC Designed to Last

Software Sustainability Institute SSI Organisation Community Engagement (Lead: Shoaib Sufi)  Fellowship Programme Fellowship Programme  Events and Roadshows Research Software (Lead: Steve Crouch)  Open Call for Consultancy Projects / Funded Collaborations Open Call for Consultancy Projects  Software Evaluation Software Evaluation Policy (Lead: Simon Hettrick)  Guides and Case Studies Guides and Case Studies  Best Practice and Policy Training (Lead: Aleksandra Pawlik)  Software Carpentry Software Carpentry  Software Surgeries Collaboration between universities of Edinburgh, Manchester, Oxford and Southampton.

Software Sustainability Institute Water Swap Reaction Coordinate A water-swap reaction coordinate for the calculation of absolute protein-ligand binding free energies Woods CJ, Malaisree M, Hannongbua S, Mulholland AJ J. Chem. Phys. (2011) vol. 134, pp

Software Sustainability Institute Case Study: Ligand Binding Centre for Computational Chemistry, Bristol  New methods for rapid MC sampling of biomolecular systems modelled using QM/MM  Developed two codes ProtoMS (F77) + Sire (C++)  Water-Swap Reaction Coordinate method to calculate absolute protein-ligand binding free energies SSI’s work is helping to scale development  ProtoMS and Sire both single developer codes  ASPIRE/ACQUIRE framework has multiple devs Split architecture between ASPIRE (adaptive multiresolution hybrid MD simulation) and ACQUIRE (WorkPacket scheduling system with optimisation for time to result vs “green-ness”

Software Sustainability Institute Case Study: ICAT ICAT metadata catalogue, STFC  Support experimental data linking at Large Facilities  RAL UK (ISIS, DIAMOND, CLF), SNS US, ELLETRA Italy  ICAT operationally critical at sites, other projects looking to use  Undertook interview-based organisational review 92 observations, 32 recommendations SSI’s work means the project has move forward  Taking forward: steering group, induction process, expanded roadmapping activity, technical workshops via Skype, involved with deployment  1 year post-project 25 (75%) recommendations implemented Used by other major projects (PandataODI, CRISP) behind-some-worlds-large-experimental-facilities behind-some-worlds-large-experimental-facilities

Software Sustainability Institute Case Study: Fusion Plasma Culham Centre for Fusion Energy  GS-2 used to study low-frequency turbulence in magnetized plasma  No common visualisation across different groups  Deliver mutually agreeable framework that can be extended easily and can be maintained by the small fusion community SSI’s work means the software can be used between groups  Simplified & enhanced plasma visualisation tool Based on ParaView o/s tool For simulations using GS-2 o/s package  Aim to allow CCFE to contribute back to GS-2 community  “I am very confident the tool will be invaluable” Colin Roach, CCFE

Software Sustainability Institute Preservation vs sustainability Image courtesy of RGB Kew – not for reuse Image courtesy of London Permaculture under CC-by-nc-sa license Preservation? Sustainability?

Software Sustainability Institute Sustainable Communities Cohesion and Identity: Creating a community Tolerance and Diversity: Smart growth through collaboration Efficient use of resources: Leveraging infrastructure Adaptability to change: Governing sustainably

Software Sustainability Institute Case Study: Climate Policy Modelling CIAS team at Tyndall Centre for Climate Change Research, University of East Anglia  Develop linked climate and economic models for detailed analysis  Their software was not ready to be used by other groups One researcher/developer at UEA, several users SSI’s work means the software is robust enough that it can be installed and used by others  Enabled use of the software by the WWFN’s Climascope project and James Cook University Documented software to allow extensions by contributors Made it easier to maintain and backup Added job scheduling to improve modeling throughput New modelling framework enables new models i.e. new science

Software Sustainability Institute Case Study: Tinkering with APES NSF/EPSRC Funded  Polarisable empirical force fields (AMOEBA)  AMBER, Tinker, DL_POLY, ONETEP, Q-Chem SSI’s work is helping to coordinate development across many packages and many people  Different languages, licenses, styles, teams

Software Sustainability Institute Case Study: Brain Imaging Brain Research Imaging Centre, Edinburgh  Develop PrivacyGuard software, a DICOM image deidentification toolkit  Created software to support new multispectral colouring modulation and variance identification technique (“MCMxxxVI”) to identify white matter lesions that are indicative of declining cognitive ability  BRIC are not principally software developers, but do provide software to other researchers SSI’s work means the software has been reviewed and refactored  Looked at exploitation Usability review, Naming/trademark review  Made it easier for BRIC staff to maintain and develop Move to standard repositories, testing and documentation processes Examination of licencing for MCMxxxVI Extraction and refactoring to create standalone tools

Software Sustainability Institute SSI Fellows : 16 fellows 2013: 15 fellows 2012: 10 fellows Range of subjects, career stages software.ac.uk/fellows

Software Sustainability Institute Journal of Open Research Software

Software Sustainability Institute SSI Website Disseminating best practice, gathering information 19,000+ unique visits/month

Software Sustainability Institute SSI Guides and Top Tips Guides provide in depth information  Licences  Software development  Project management  Repositories and project infrastructure  Open source  Community building  Publicising software  Policy Top Tips provide quick overviews  Software development  Repositories and project infrastructure  Software carpentry  Citing software  Data handling  Promoting and communicating your project  Community building and project management

Software Sustainability Institute SSI Briefing Papers

Software Sustainability Institute SSI Blog Articles on research software and related issues  Ask the Institute  A Day in the Software Life  Heroes of Software Engineering  Top Tips  Women in Software 150+ posts written by external contributors  10,000+ unique pageviews/month

Software Sustainability Institute SSI Training Software Carpentry   International initiative to teach basics of software engineering to computational researchers The “why” more than the “how”  Phenomenally successful – 2x oversubscription  Cheap to run but budget for 3x the coffee!  SSI are UK Coordinators for SWC  We ran 13 workshops in 2013 to 600+ learners Software Sustainability Surgeries  “Bring your own code”  “What makes Good Code good?”  Run at existing conferences, and for software funding programmes  Offering bespoke advice as well as training

Software Sustainability Institute Creating a training community Bringing together 39+ organisations with interest in e- Infrastructure training Raising issues and enablers with RCUK, BIS software.ac.uk/policy

Software Sustainability Institute Research Software Engineer Join the RSE community at

Software Sustainability Institute No-one sets out to write unsustainable software People need the skills and incentives to maintain software through its lifetime The Institute helps UK researchers benefit from more sustainable software

Work with us –