SiZhe Xiao GigaScience 2013 POSTER Open Access GigaDB – revolutionizing data dissemination, organization and use Xiao Si Zhe 1, Chris Hunter, Tam P. Sneddon,

Presentation transcript:

SiZhe Xiao GigaScience 2013
POSTER Open Access

GigaDB – revolutionizing data dissemination, organization and use

Xiao Si Zhe (1), Chris Hunter, Tam P. Sneddon, Scott C. Edmunds, Alexandra T. Basford, Peter Li, and Laurie Goodman.

Abstract

GigaScience, the online open-access open-data journal, has recently developed GigaDB, a new integrated database of ‘big-data’ studies from the life and biomedical sciences. The initial goals of GigaDB are to assign DOIs to datasets so that they can be tracked and cited, and to provide a user-friendly web interface that gives easy access to selected GigaDB datasets and files. We will be working with authors to make the raw data, computational tools and data processing pipelines described in GigaScience papers available and, where possible, executable on an informatics platform. We hope that by making both the data and the processes involved in their analysis freely accessible, this novel form of publication will help articles published in GigaScience to have a much higher impact in the scientific literature, and maximize their reuse within the community.

GigaDB currently accepts submissions in Excel format. Example submission and template files can be found on the website. To date, GigaDB comprises more than 56 datasets and includes Genomic, Transcriptomic, Epigenomic and Metagenomic dataset types, but we accept many other dataset types, including proteomic and neuroimaging studies. Future goals include integration with the BGI Cloud, and with the Galaxy software tools to enable users to upload files directly to Galaxy for further analysis. We are also working with ISA-Tab and other scientific standards groups to support and extend the usability and interoperability model.

Keywords: DOI, Galaxy, big-data, database, informatics platform, GigaScience

doi: /m9.figshare

Cite this poster as: GigaDB – revolutionizing data dissemination, organization and use. Xiao Si Zhe, Chris Hunter, Tam P. Sneddon, Scott C. Edmunds, Alexandra T. Basford, Peter Li, and Laurie Goodman.

© 2013 Edmunds et al. This is an Open Access poster distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Correspondence:
1. BGI HK Research Institute, 16 Dai Fu Street, Tai Po Industrial Estate, Hong Kong SAR, China.
2. BGI-Shenzhen, Beishan Industrial Zone, Yantian District, Shenzhen, China.
3. School of Biomedical Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong SAR, China.
4. CUHK-BGI Innovation Institute of Trans-omics, The Chinese University of Hong Kong, Shatin, Hong Kong SAR, China.
5. HKU-BGI Bioinformatics Algorithms and Core Technology Research Laboratory & Department of Computer Science, University of Hong Kong, Pok Fu Lam, Hong Kong.
6. Oxford e-Research Centre, University of Oxford, Oxford, UK.

Acknowledgements

Thanks to: Laurie Goodman, Chris Hunter, Scott Edmunds, Tam Sneddon (GigaScience), Shaoguang Liang (BGI-SZ), Qiong Luo, Senghong Wang, Yan Zhou (HKUST), Rob Davidson and Mark Viant (Birmingham Uni), Marco Galardini (Unifi).
Financial support from:

Background

Growing replication gap:
- 10/18 microarray papers cannot be reproduced
- Ioannidis: “Most Published Research Findings Are False”
- >15X increase in retracted papers in last decade
- Lack of incentives to make data/methods available
- Poor metadata quality and lack of interoperability

GigaSolution: deconstructing the paper

Combine and integrate (via citable DOIs):
- Open-access journal
- Data Publishing Platform: gigadb.org
- Data Analysis Platform: galaxy.cbiit.cuhk.edu.hk

Figure: Linking papers to data and analyses. Labels: Data sets; Analyses; Linked to DOI; Open-Paper; DOI: /; Open-Pipelines; Open-Workflows; DOI: /; Open-Data; 78GB CC0 data.

Submit your next manuscript containing large-scale data and workflows to GigaScience and take full advantage of:
- No space constraints, and unlimited data and workflow hosting in GigaDB and GigaGalaxy
- Article processing charges for all submissions in 2013 covered by BGI
- Open access, open data and highly visible work freely available for distribution
- Inclusion in PubMed and Google Scholar

Figure: GigaDB Home page. Labels: Aspera data transfer; Faster download speeds.

Figure: Public GigaDB dataset. DOI / / Genomic data from the crab-eating macaque/cynomolgus monkey (Macaca fascicularis) (2011).

Figure: Datasets public in GigaDB.

GigaDB Submission Workflow

- Submitter logs in to GigaDB website and uploads Excel submission (Excel submission file).
- Validation checks: Fail – submitter is provided error report. Pass – dataset is uploaded to GigaDB.
- Curator Review.
- GigaDB DOI assigned.
- Curator contacts submitter with DOI citation and to arrange file transfer (and resolve any other questions/issues).
- Files: Submitter provides files by ftp or Aspera.
- XML is generated and registered with DataCite (DataCite XML file).
- Curator makes dataset public (can be set as future date if required).
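
The validation and DataCite XML steps of the submission workflow can be illustrated with a minimal Python sketch. This is not GigaDB's actual code. The field names in `REQUIRED_FIELDS`, the functions `validate_submission`, `build_datacite_xml` and `process_submission`, the DOI `10.0000/example` and the creator name are all hypothetical. The XML is a simplified subset loosely modelled on DataCite metadata and is not validated against any schema version. The submission is represented as a dictionary rather than being read from the Excel file.

```python
import xml.etree.ElementTree as ET

# Hypothetical required fields; the real GigaDB Excel template is not reproduced here.
REQUIRED_FIELDS = ("title", "creators", "publisher", "publication_year", "dataset_type")


def validate_submission(submission):
    """Return a list of error messages; an empty list means the checks passed."""
    errors = []
    for field in REQUIRED_FIELDS:
        if not submission.get(field):
            errors.append(f"missing required field: {field}")
    year = submission.get("publication_year")
    if year and not (isinstance(year, int) and 1900 <= year <= 2100):
        errors.append("publication_year must be an integer year")
    return errors


def build_datacite_xml(submission, doi):
    """Build a simplified DataCite-style record (illustrative subset only)."""
    resource = ET.Element("resource")
    identifier = ET.SubElement(resource, "identifier", identifierType="DOI")
    identifier.text = doi
    creators = ET.SubElement(resource, "creators")
    for name in submission["creators"]:
        creator = ET.SubElement(creators, "creator")
        ET.SubElement(creator, "creatorName").text = name
    titles = ET.SubElement(resource, "titles")
    ET.SubElement(titles, "title").text = submission["title"]
    ET.SubElement(resource, "publisher").text = submission["publisher"]
    ET.SubElement(resource, "publicationYear").text = str(submission["publication_year"])
    return ET.tostring(resource, encoding="unicode")


def process_submission(submission, doi):
    """Fail: return an error report. Pass: return the generated XML record."""
    errors = validate_submission(submission)
    if errors:
        return {"status": "fail", "errors": errors}
    return {"status": "pass", "datacite_xml": build_datacite_xml(submission, doi)}


# Placeholder submission; the creator, publisher and DOI values are invented.
example = {
    "title": "Genomic data from the crab-eating macaque/cynomolgus monkey (Macaca fascicularis)",
    "creators": ["Example Author"],
    "publisher": "GigaDB",
    "publication_year": 2011,
    "dataset_type": "Genomic",
}
result = process_submission(example, "10.0000/example")
print(result["status"])
```

Returning an error list instead of raising an exception mirrors the "Fail – submitter is provided error report" branch: every problem found can be reported to the submitter in one pass.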