Software Sustainability Institute There’s No Such Thing As Irreproducible Research 27.

Slides:



Advertisements
Similar presentations
Grant review at NIH for statistical methodology Jeremy M G Taylor Michelle Dunn Marie Davidian.
Advertisements

Edinburgh Mouse Atlas to e-MouseAtlas Richard Baldock MRC Human Genetics Unit Institute of Genetics and Molecular Medicine MRC Human Genetics.
SoundSoftware.ac.uk Prizes for Reproducibility in Audio & Music Research Chris Cannam, Luís Figueira, Mark Plumbley Centre for Digital Music Queen Mary,
Le-Edged Sword Risks, Rewards and the Double-Edged Sword: Views of Pharmacogenetic Testing and Research in the Alaska Native/American Indian Community.
Science Gateways and their role in Reproducibility Nancy Wilkins-Diehr San Diego Supercomputer Center
Harvard School of Public Health John Godleski, MD Dept of Environmental Health.
College of Engineering, Mathematics and Physical Sciences.
Bioinformatics Training for Dental Researchers Lynn Johnson, Ph.D. University of Michigan.
Computational Challenges in Whole-Genome Association Studies Ion Mandoiu Computer Science and Engineering Department University of Connecticut.
Software Sustainability Institute Software Information and Scientific Publications doi: /m9.figshare Beyond EMI: A Roadmap.
Course & Unit of Study Portal (CUSP) An Overview.
Data Publishing Workflows: Strategies and Standards
Software Sustainability Institute The Software Sustainability Institute 20 January 2015, HEP Software Foundation workshop Neil Chue.
Software Cluster Improve Collaboration and Community Engagement Work with diverse communities that contribute to the sustainability of scientific software.
Valuing Software and Other Research Outputs Daniel S. Katz Program Director, Division of Advanced Cyberinfrastructure.
2013 Washington D.C. Advocacy Trip for Human Space Exploration February 5 & 6 22 Travelers 80 Congressional Office Visits.
Software Sustainability Institute Training in Computational Skills Scientific Meeting 2014 “NGS Data after the Gold Rush” TGAC, Norwich.
Open Software for (Open) Science doi. org/ /m9
Open Data, Open Source: preparing for Big Data in Metabolomics Rob L Davidson #MetSoc2015 This presentation DOI: /m9.figshare
Software Sustainability Institute Software Sustainability: Issues, Challenges and Initiatives Neil Chue Hong,
Software Sustainability Institute Linking software: Citations, roles, references,and more
Scott Emrich Assistant Professor, Computer Science and Engineering Scientific Manager, VectorBase University of Notre Dame A flexible, scalable genomics.
Data Infrastructures Opportunities for the European Scientific Information Space Carlos Morais Pires European Commission Paris, 5 March 2012 "The views.
Taverna and my Grid Basic overview and Introduction Tom Oinn
Open Data, Open Source: preparing for Big Data in Metabolomics Rob L Davidson #MetSoc2015 This presentation DOI: /m9.figshare
Software Sustainability Institute Putting the user back into software sustainability 16 December 2013, Scientific Software Days, Austin.
A centre of expertise in digital information management UKOLN is supported by: Monica Duke Project.
We are the 92% Valuing the contribution of research software Neil Chue Hong, FORCE2015 Research Communications and e-Scholarship.
Presented by: Prof Mark Baker ACET, University of Reading Tel: Web:
Hackathons for Scientific Software How and When do they Work? Erik H. Trainer, Chalalai Chaihirunkarn, Arun Kalyanasundaram, James D. Herbsleb.
Sage Bionetworks A non-profit organization with a vision to enable networked team approaches to building better models of disease BIOMEDICINE INFORMATION.
Cardiovascular Risk Decision Support Software for Patients and Clinicians Presenter: John Colquhoun School of Computing Science, Newcastle University Project.
Software Sustainability Institute Dealing with software: the research data issues 26 August.
Taverna and my Grid Open Workflow for Life Sciences Tom Oinn
Software Sustainability Institute What makes “good code” good for science? 26 th September 2013, MozFest 2013, London Neil Chue Hong.
School of Geography FACULTY OF ENVIRONMENT The Elements of a Computational Infrastructure for Social Simulation Mark Birkin 1, Rob Allan 2, Sean Beckhofer.
ELSI: Ethical, Legal and Social Issues surrounding availability of genomic information DOD and NIH devoted ~3-5% of annual HGP budgets to ELSI research.
Software Sustainability Institute Software Attribution can we improve the reusability and sustainability of scientific software?
Software Sustainability Institute Building a Scientific Software Accreditation Framework
Sage Bionetworks A non-profit organization with a vision to enable networked team approaches to building better models of disease BIOMEDICINE INFORMATION.
E-Labs and the Stock of Health Method for Simulating Health Policies Philip Couch, Medinfo 2013 Philip Couch, Martin O’Flaherty, Matthew Sperrin, Benjamin.
Jim Bednar November 2010 Doctoral Training Centre in Neuroinformatics and Computational Neuroscience.
We are the 92% 16 November 2014, WSSSPE2, SC14, New Orleans, USA Neil Chue Hong Software Sustainability.
Edinburgh e-Science MSc Bob Mann Institute for Astronomy & NeSC University of Edinburgh.
Use Case 5: Biomarker Potential and Limitations of Circulating miRNA Performed by the Data Management and Resource Repository (DMRR) ERCC Data Analysis.
Curtin University is a trademark of Curtin University of Technology CRICOS Provider Code 00301J The Digital Mineral Library at Curtin University Major.
Software Sustainability Institute Tracking Software Contributions doi: /m9.figshare Joint ORCID – DRYAD Symposium on Research.
A presentation about myExperiment David De Roure and Carole Goble.
Software Sustainability Institute Working with research software 2 nd - 4 th November.
Research Priorities & Trends for NIH in 2003 Claire T. Driscoll Director Technology Transfer Office National Human Genome Research Institute (NHGRI) Research.
Software Infrastructure for Sustained Innovation (SI 2 ) PI meeting Arlington, VA January 17-18, 2013 Ewa Deelman, University of Southern California Miron.
Software Sustainability Institute Building sustainable software for science … why good code is only the beginning 10 April 2013, EGI.
Software Sustainability Institute Open science is impossible without software 5 th April 2016,
Software Sustainability Institute CW2016 Hackday Technical considerations or How to score extra marks with the judges CW2016 March 22.
Software Sustainability Institute Data Carpentry Aleksandra Pawlik Software Sustainability Institute Data Science Club, 17 th March.
Cornell University June 2016 Sponsored by Cornell Statistical Consulting Unit Instructors Emily Davenport (Cornell University) Erika Mudrak (CSCU) Lynn.
Software Sustainability Institute There’s No Such Thing As Irreproducible Research (Software Credit Edition)
Scaling the Open Science Framework: National Data Service Dashboard, Cloud Storage Add-ons, and Sharing Science Data on the Decentralized Web Natalie K.
Is research software different from software
Change and continuity in the 20th Century
How an RSE can benefit your projects:
PPT1: Basics of software engineering
A Web-based Interactive Genome Library for Surveillance, Detection, Characterization and Drug-Resistance Monitoring of Influenza Virus Infection in the.
Angela L. Rasmussen, Michael G. Katze  Cell Host & Microbe 
by M. Gallego Llorente, E. R. Jones, A. Eriksson, V. Siska, K. W
Evolutionary History of the ADRB2 Gene in Humans
Micro- to Macro-system Integration
Markers for Mapping by Admixture Linkage Disequilibrium in African American and Hispanic Populations  Michael W. Smith, James A. Lautenberger, Hyoung.
Workshop Invitation, Friday August 26th, at DMS 105, 12noon – 3pm
Research Software Group
Presentation transcript:

Software Sustainability Institute There’s No Such Thing As Irreproducible Research 27 th January 2016, Digging Into Data, Glasgow Neil Chue Hong Software Sustainability Institute ORCID: | Slides licensed under CC-BY where indicated: Supported by Project funding from

Software Sustainability Institute Or… “A personal journey as a data explorer” /m9.figshare

Software Sustainability Institute /m9.figshare

Software Sustainability Institute /m9.figshare

Software Sustainability Institute /m9.figshare

Software Sustainability Institute /m9.figshare

Software Sustainability Institute /m9.figshare

Software Sustainability Institute /m9.figshare

Software Sustainability Institute T T /m9.figshare

Software Sustainability Institute T T “Virtual data warehouse” /m9.figshare

Software Sustainability Institute T T Public Health Data - Name - Address - Visits - Symptoms - Treatments /m9.figshare

Software Sustainability Institute /m9.figshare

Software Sustainability Institute /m9.figshare

Software Sustainability Institute /m9.figshare

Software Sustainability Institute /m9.figshare

Software Sustainability Institute /m9.figshare

Software Sustainability Institute /m9.figshare

Software Sustainability Institute Mapping courtesy of Robin T Wilson /m9.figshare

Software Sustainability Institute T T This taught me three things… /m9.figshare

Software Sustainability Institute T T 1. The power of data is when it’s brought together /m9.figshare

Software Sustainability Institute T T 2. Software can help solve difficult data integration problems /m9.figshare

Software Sustainability Institute T T 3. No-one can spell diarhea diereah dyereeah diarrheah dioreah diarrhoea! /m9.figshare

Software Sustainability Institute Positive selection in large genomic datasets Selection at pleiotropic loci underlies disease co-occurrence in human populations. Navarro, Haley, Karosas et al. Submitted to Nature Genetics /m9.figshare

Software Sustainability Institute Hapbin: fast haplotype based scans hapbin: An Efficient Program for Performing Haplotype-Based Scans for Positive Selection in Large Genomic Datasets. DOI: /molbev/msv /m9.figshare

Software Sustainability Institute Open Source lets others benefit Slide courtesy of Nancy Wilkins-Diehr BEAST software licensed under LGPL /m9.figshare

Software Sustainability Institute /m9.figshare

Software Sustainability Institute Errors due to bioinformatics pipeline /m9.figshare

Software Sustainability Institute Raise standards for preclinical cancer research 47 out of 53 “landmark” publications could not be replicated Begley, Ellis. Nature, 483, 2012 doi: /483531a /m9.figshare

Software Sustainability Institute it’ Victoria Stodden, AMP Special Issue Reproducible Research Computing in Science and Engineering July/August 2012, 14(4) Howison and Herbsleb (2013) "Incentives and Integration In Scientific Software Production" CSCW /m9.figshare

Software Sustainability Institute Errors due to bioinformatics pipeline The results presented in the Report “Ancient Ethiopian genome reveals extensive Eurasian admixture throughout the African continent“ were affected by a bioinformatics error Llorente et al. Science, 350, 6262 doi: /science.aad /m9.figshare

Software Sustainability Institute T T Nullius in verba “Take nobody’s word for it” /m9.figshare

Software Sustainability Institute T T There’s no such thing as irreproducible research There’s reproducible research and there’s ignorance It’s not research if it’s not transparent /m9.figshare

Software Sustainability Institute

Software Sustainability Institute ^ and Software /m9.figshare

Software Sustainability Institute T T /m9.figshare

Software Sustainability Institute T T Vandewalle (2012) DOI: /MCSE /m9.figshare

Software Sustainability Institute T T Without data it’s difficult to validate results. But without code, we waste the opportunity to advance science /m9.figshare

Software Sustainability Institute Acknowledgements The SSI team: -Aleksandra Pawlik -Carole Goble -Claire Wyatt -Clem Hadfield -Dave De Roure -Devasena Prasad -Giacomo Peru -Graeme Smith -Iain Emsley -John Robinson -Les Carr -Mario Antonioletti -Mark Parsons -Mike Jackson -Olivier Philippe -Shoaib Sufi -Simon Hettrick -Stephen Crouch The SSI Fellows and collaborators especially: -James Baker -James Hetherington -Martin Hammitzsch -Robin Wilson (for contributing examples) EPCC Industry Projects: -Mark Sawyer -Maureen Wilkinson -Paul Graham -Rob Baxter -Terry Sloan Mouse Atlas: -James Sharpe -Richard Baldock Epigenetic analysis - James Prendergast - Colin Maclean Scientific software: -Dan Katz -Heather Piowowar -James Howison -Jeff Carver -Jennifer Schopf -Kaitlin Thaney -Martin Fenner -Victoria Stodden Software/Data Carpentry -Greg Wilson -Jonah Duckles -Katy Huff -Tracy Teal Slides at: