Rewarding Reproducibility and Method Publishing the GigaScience Way Scott Edmunds
The Issue: = growing reproducibility gap Data-driven science era brings: Huge opportunities Huge challenges with: data curation, review/QA, handling, sharing
GigaSolution: deconstructing the paper Take data publication approach further and reward: Data availability Metadata/curation Interoperability Availability of workflows Transparent analyses Data Metadata Methods Analyses
GigaSolution: deconstructing the paper Worlds largest genomics organisation with: 17PB storage, 20.5K cores, 212TFlops, >1000 bioinformaticians Utilizes big-data infrastructure and expertise from: Combines and integrates: Open-access journal Data Publishing Platform Data Analysis Platform
How are we supporting data reproducibility? Data sets Analyses Linked to DOI Open-Paper Open-Review DOI: / X-1-18 >6500 accesses Open-Code 8 reviewers tested data in ftp server & named reports published DOI: / Open-Pipelines Open-Workflows DOI: / Open-Data 78GB CC0 data Code in sourceforge under GPLv3: >4000 downloads Enabled code to being picked apart by bloggers in wiki
SOAPdenovo2 workflows implemented in galaxy.cbiit.cuhk.edu.hk
SOAPdenovo2 workflows implemented in galaxy.cbiit.cuhk.edu.hk Implemented entire workflow in our Galaxy server, inc.: 3 pre-processing steps 4 SOAPdenovo modules 1 post processing steps Evaluation and visualization tools Also available to download by >25K Galaxy users in
“Deconstructed” Journal “Regular” Journal “Conscientious” Online Journal
“Deconstructed” Journal “Regular” Journal “Conscientious” Online Journal
“Deconstructed” Journal “Regular” Journal “Conscientious” Online Journal
Image Source: “Deconstructed” Journal “Regular” Journal “Conscientious” Online Journal
Ultimate Goal: Executable papers Data Papers Executable (Methods) Papers Analysis Papers
Give us your data & pipelines! * What is needed to make it happen? Contact us: * APC’s currently generously covered by BGI
Ruibang Luo (BGI/HKU) Shaoguang Liang (BGI-SZ) Tin-Lap Lee (CUHK) Huayen Gao (CUHK) Qiong Luo (HKUST) Senghong Wang (HKUST) Yan Zhou (HKUST) Thanks facebook.com/GigaScience blogs.openaccesscentral.com/blogs/gigablog/ Peter Li Chris Hunter Jesse Si Zhe Nicole Nogoy Tam Sneddon Alexandra Basford Laurie Goodman Follow us: galaxy.cbiit.cuhk.edu.hk CBIIT Funding from: Our collaborators: team:
Ruibang Luo (BGI/HKU) Shaoguang Liang (BGI-SZ) Tin-Lap Lee (CUHK) Huayen Gao (CUHK) Qiong Luo (HKUST) Senghong Wang (HKUST) Yan Zhou (HKUST) Thanks facebook.com/GigaScience blogs.openaccesscentral.com/blogs/gigablog/ Peter Li Chris Hunter Jesse Si Zhe Nicole Nogoy Tam Sneddon Alexandra Basford Laurie Goodman Follow us: galaxy.cbiit.cuhk.edu.hk CBIIT Funding from: Our collaborators: team: Happy New Year!