Data publishing wakes up the sleeping data -real practices in China Scientific Data Zhang Lili Computer Network Information Center,CAS.

Slides:



Advertisements
Similar presentations
Nottingham ePrints School of Biosciences School Board Meeting Nov 2005 Bill Hubbard SHERPA Project Manager University of Nottingham.
Advertisements

Contents Importance Knowledge for CBD Managing Knowledge 2.
OVERVIEW & LIBRARY SUPPORT FOR DATA MANAGEMENT/SHARING Jim Van Loon, MSME/MLIS Science Librarian.
THE NEED AND DRIVE FOR HIGH QUALITY DATA PUBLICATION Iain Hrynaszkiewicz Head of Data and HSS Publishing, Open Research Nature Publishing Group & Palgrave.
PhD-course Research Data Management (RDM) Expert Centre Research Data.
FROM DATA REPOSITORIES TO DATA JOURNALS – WHERE, WHEN AND HOW TO SUBMIT Andrew L. Hufton Managing Editor, Scientific Data Nature Publishing Group
INTRODUCTION TO RESEARCH DATA MANAGEMENT Robin Desmeules Janice Kung J W Scott Health Sciences Library University of Alberta Libraries.
Undertaken by the ………………………………
Social Science Data and ETDs: Issues and Challenges Joan Cheverie Georgetown University Myron Gutmann ICPSR – University of Michigan Austin McLean ProQuest.
Open for ^ Business Research Data Services & Data Management Planning Ryan Schryver Wendt Commons is our.
Challenges & opportunities in the preservation of (digital) information: the case of European research libraries Museo de las Ciencias Teatro de UNIVERSUM.
The importance of DART for funding agencies Dr. Ingrid Kissling-Näf.
ACCESS for VALIDITY ACCESS for INNOVATION. Starting January 2011 for NEW proposals Not voluntary – “integral part” of proposal and FastLane Required for.
Data Archiving and Networked Services DANS is an institute of KNAW en NWO Data Archiving and Networked Services Introduction to Data Management Planning.
Preserving the Scientific Record: Case Study 2 – Arctic Temperature Variability Matthew Mayernik National Center for Atmospheric Research Version 1.0 Review.
Data Providers Dissemination – Access, cost, formats, size, metadata, service, support, findability, Policies – Copyright, fees, confidentiality, preservation,
Data Management in Scholarly Journals and possible Roles for Libraries – Some Insights from EDaWaX Sven Vlaeminck | Leibniz-Information Centre for Economics.
Now launched! Visit nature.com/scientificdata Honorary Academic Editor Susanna-Assunta Sansone Advisory.
Preserving the Scientific Record: Case Study 2 – Arctic Temperature Variability Data Matthew Mayernik National Center for Atmospheric Research Version.
GigaScience ( is an online, open-access journal that includes, as part of its publishing activities, the database GigaDB.
Dataset citation Clickable link to Dataset in the archive Sarah Callaghan (NCAS-BADC) and the NERC Data Citation and Publication team
AACP Annual Meeting #RxOA #PharmEd14.  What is Open Access?  Spencer D. C. Keralis Research Associate  Institutional Repositories.
DOE Data Management Plan Requirements
Institutional Repositories July 2007 Intellectual property management : the DISA experience Dr D Peters DISA: Digital Innovation South Africa.
ARIADNE is funded by the European Commission's Seventh Framework Programme Archiving and Repositories Holly Wright.
Aalto Research Data Management Policy Ella Bingham 8 April 2016 This work is licensed under the Creative Commons Attribution 4.0 International License.
Publishing for early career researchers University of Glasgow, october 2015 Suzanne Mekking, sr. Publisher Brill April
| 1 Anita de Waard, VP Research Data Collaborations Elsevier RDM Services May 20, 2016 Publishing The Full Research Cycle To Support.
International Planetary Data Alliance Registry Project Update September 16, 2011.
PhD-course Research Data Management (RDM) Expert Centre Research Data.
NRF Open Access Statement
Jeff Moon Data Librarian &
EOSC Services for Scientists
Publishing DDI-Related Topics Advantages and Challenges of Creating Publications Joachim Wackerow EDDI16 - 8th Annual European DDI User Conference Cologne,
Digital Collection Development Policy
Scholarly Publishing Sally Harvey, MLS
1. Objectives of theory-mining reviews
Pasquale Pagano CNR – ISTI (Pisa, Italy)
EPSRC research data expectations and research software management
Starting from the end: what to do when restricted data is released
Research software best practices: Transparency, credit, and citation
Open access as a means to produce high quality data Anja Gassner Head Research Method Group Sentinel Landscape Coordinator FTA World Agroforestry Centre.
GFBio – Education module
Publishing software and data
SowiDataNet - A User-Driven Repository for Data Sharing and Centralizing Research Data from the Social and Economic Sciences in Germany Monika Linne, 30.
KIOS Open Knowledge: A pillar for excellence
A Journal of the Committee on Data for Science and Technology (CODATA) of the International Council for Science (ICSU)
Institutional role in supporting open access, open science, open data
User Interface HEP Summit, DESY, May 2008
General Finnish DMP Guidance
Data Management: Documentation & Metadata
Open Access to your Research Papers and Data
Building a GER Toolbox As you return from break, please reassemble into your working groups: Surveys and Instruments Analytical Tools Getting Published.
OpenML Workshop Eindhoven TU/e,
Creating a Culture of Open Data in Academia
Research Data Management
Activities and National Priorities of National Members
United Nations Statistics Division
Introduction to the MIABIS SOP Working Group
How to Implement the FAIR Data Principles? Elly Dijk
Jisc Research Data Shared Service (RDSS)
Developing Institutional Data Repositories
Dataverse for citing and sharing research data
Digital Library and Plan for Institutional Repository
Research data lifecycle²
Startup and future / Inge Rutsaert / dd
It’s all about people Data-related training experiences from EUDAT, OpenAIRE, DANS Marjan Grootveld, DANS EDISON workshop, 29 August 2017.
Data + Research Elements What Publishers Can Do (and Are Doing) to Facilitate Data Integration and Attribution David Parsons – Lawrence, KS, 13th February.
Research Data Dr Aoife Coffey, Research Data Coordinator
Digital Library and Plan for Institutional Repository
Presentation transcript:

Data publishing wakes up the sleeping data -real practices in China Scientific Data Zhang Lili Computer Network Information Center,CAS 13 Sep 2016

Data Sharing Issues Privacy Concerns Data abuse Teachprivacy.com Articulate.com 1 data are generated by scientists and funded by the public, so who possess the data? Here’s already good examples in funding agencies that take data management plan into account for research project and in China, we have directly fundings for data production and sharing for a long time in SDB. 2 Another questione is in the ivory tower of science, Privacy Concerns Data abuse

What is data publishing Data publishing (also data publication) is the act of releasing data in published form for (re)use by others (Wikipedia, 2016). Model/classification(Bryan Lawrence et al,2011) Standalone Data Publication Data Publication by Proxy Appendix Data Journal Driven Data Archival Overlay Publication

Contents Why publish data What data to publish How to publish data

Why publish data For data authors/owners For data curators Incentives Gain more reputations in scholar community by citation Enhanced exchanges and more focus as well as more suggestions Supports Save energy for long-term data preservation and ongoing services Get third party authorized the trustfulness of your data For data curators Academic recognition For data readers/users Trustful data Data quality control/clear data description Easy to use/Safe to use Less need to worry about intellectual property rights dispute by citation For journals/presses Cooperation between data journals and traditional journals(joint publishing/reduce falsification)/repositories(professional lifelong data services)

What data to publish Items Key points Notes SCOPE Popularity Scarcity Sustainability Topics, areas, fields, methods, skills FORM Easy to understand Friendly access standards CONTENTS capacity richness Quality control VALUE/UTILITY* Reusable-reliability Innovation in data process To support the validation of scientific research Consistency/stability Partly unpredictable OTHERS Background information in life-long data cycle

How to publish data Li J, Wu C, Zhang L et al. Survey and analysis of scientific data publishing. China Scientific Data 1 (2016), DOI: 10.11922/csdata.120.2015.0009

Right time to publish data Published research papers with datasets available Need help from third party for ongoing maintenances of existing datasets Existing datasets that tend to sleep

About China Scientific Data Hosted by Chinese Academy of Sciences Jointly published by CNIC,CAS & ICSU CODATA China Committee (CN11-6035/N,ISSN 2096-2223) An academic journal publishing multidisciplinary data papers Online bilingual versions To promote standardized data openness and citation and to making data findable, accessible, intelligible and reusable (FAIR) SCOPE:Data papers describing (but not limited to) the following: Datasets or data products generated from major scientific activities Derived datasets or data products refined from raw data Datasets linked to existing publications

Key points in writing Clear intellectual property rights relationship maintance Entire and accurate author name list Authorized to publish datasets and data papers by cc-by 4.0 Completion for data files and storage record Confirm the data files storage are complete and it matches with the description involved in data papers. Make sure the data has been deposited in the most suitable, accessible and reliable data repository. Rigor experiment and high data quality Whether the data are generated in a rigorous and methodological way? Data validation and quality control Sufficient background information (such as spatial and temporal information) according to specific discipline as well as data application suggestions Integrity of the description Enough specific explanations for research method and data processing steps together with data source, process, software aided and data file types allowing for easily reproducing. All necessary information for datasets reuse or integration. Conformance for metadata, datasets, other data description as well as data papers. 第一段最后一句:Some background information description can be optional, such as datasets regarding to one kind of fruit that only has rootted within a quite small certain area 总结:Nice data is the premise and delicated paper as a necessary

How to publish data template www.sciencedb.cn www.csdata.org Login CSDATA(registration/browsing) Download template and example Submit with data papers and datasets www.csdata.org template www.sciencedb.cn

If I have a dataset/datasets Kept by myself Archived by the funding agency Sharing it online by my website 2016 international training workshop for developing countries on big data for science held by codata china and CAS in july 2016 Stored in a long-term repository for re-analysis

Why not write a data paper! OPEN ACCESS open access for data papers & datasets Open for reuse Easily access,lifelong data curation Efficient processing & rapid dissemination at least 15 days for open ,within 3 month for publication High quality guarenteed strict review process(peer review/crowd rating/voting)

Thank you !

Tough points under discussion Originality Preferred creativity in data papers Derived data worthy of publication: DataA +DataB=My data? Data flow publication Similar data papers-ongoing dataflow and data papers Published data size Larger datasets with more citation as presumed; Less chance to publish similar small size datasets;