Download presentation
Presentation is loading. Please wait.
1
OpenML Workshop (III) @ Eindhoven TU/e, 22-10-2014
3TU.Datacentrum OpenML Workshop Eindhoven TU/e, Introducing myself and IEC/Library TU/e IEC/Library Available under CC BY license, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited
2
International Open Access Week
Sharing research data Why? It’s expected by research funders, journals, professional organizations and research evaluators Because of scientific integrity: reproducibility of results Because of re-using results: data-driven science You benefit from it: increases your visibility and enhances the trustworthiness of your research How? On request Personal website Publishing / archiving in a repository Open access week Because data providing the evidence for a published paper can be asked for by others in view of verificating or replicating your results (scientific integrity) Because journal, funder or code of conduct demand data to be accessible Because data are unique and / or valuable (non-repeatable observations) Because data are an asset, worth sharing in order to be reused or built on by others UPSIDE: Uniform Principle of Sharing Integral Data and Materials Expeditiously International Open Access Week
3
Re-using research data
To be re-used, data should be Findable: DOI; metadata (to allow discovery) Accessible: ≠ open access; licenses to use; to humans and machines Intelligible, assessable: metadata (to allow understandability) Interoperable: combining across multiple sources Preserved: long-term availability Source: Research Data Netherlands / Marina Noordegraaf Findable + citeable Accessibility doesn’t necessarily means open access Findable: easy to find both by humans and computers based on mandatory description of the metadata that allow researchers to track and trace interesting datasets; Accessible: stored long term such that they can be easily accessed and/or downloaded with well-defined license and access conditions (Open Access when possible), whether at the level of metadata, or at the level of the actual data content; Interoperable: ready to be combined (across multiple sources) by humans as well as computers; Re-Usable: ready to be used for future research and to be processed further using computational methods. Different levels of accessibility: not accessible, after request, made available on a personal website, published with a DOI; by machines
4
3TU.Datacentrum #1 Findability + citability: 3TU.DC assigns DOI’s; discovery metadata are mandatory; data sets are indexed by DataCite, Google, Data Citation Index Accessibility: 3TU.DC = open access; embargo’s (6 months) are allowed Intelligible, assessable, interoperable: up to the researcher Preservation: 3TU.DC has quality mark Data Seal of Approval Source: Research Data Netherlands / Marina Noordegraaf Costs: 3500 / 4500 euro per Tb per 20 year
5
3TU.Datacentrum #2 File format support levels
Self-upload of simple data sets (≤ 4 Gb) Tailor-made solutions Upload and download statistics Collections of data sets Source: Research Data Netherlands / Marina Noordegraaf Costs: 3500 / 4500 euro per Tb per 20 year
6
DOI’s and OpenML #1 DataCite Netherlands : assigns and distributes DOI’s on behalf of DataCite to research organizations and data centers in NL Organizations can register DOI’s for its objects by applying for an account at DataCite Netherlands Objects need to be persistent, long-term available Objects are preferably open access; restricted access is allowed Objects should be citable (metadata added) Objects must have a public landing page Source: Research Data Netherlands / Marina Noordegraaf
7
DOI’s and OpenML #2 Organizations must ensure maintenance and supply of metadata A contract will be signed to ensure the abovementioned points, after that the organization will receive its own DOI prefix Costs: € 1000,- (once-only, subject to changes) Creating DOIs: manually via web forms ↔ uploading xml resources files Source: Research Data Netherlands / Marina Noordegraaf
8
URL’s of mentioned webpages (in order of appearance)
OpenML Workshop Eindhoven: Website IEC/Library [TU/e]: Data on request (Reinhart-Rogoff paper): Data on personal website (Thomas Piketty): Publishing data (3TU.Datacentrum): International Open Access Week: DataCite metadata search: Data Citation Index (Thomson Reuters): Data Seal of Approval: File format support levels: DataCite Netherlands: International Open Access Week
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.