Download presentation
Presentation is loading. Please wait.
Published byLaura Johnson Modified over 9 years ago
1
CERN – IT Department CH-1211 Genève 23 Switzerland www.cern.ch/i t Data Publishing Tim Smith CERN/IT
2
Easy, in essence…
3
Challenging, in practice Bit Rot Media Verification Media Migration Technology tracking
4
Open Data as a Service REST API REST API OAI- PMH API OAI- PMH API Open Data Pilot
5
Low Barriers
6
Beware the False Summit Data Publication Science
7
Digital Dark Ages Scientific method Propose hypotheses to explain phenomena Test hypotheses predictions through repeatable experiment Share observations and conclusions for independent scrutiny, reproduction and verification Publication: Preparation (standardisation), issuing
8
Accessible Normalisation
9
Interpretable Raw Reconstructed Reduced Published Data Reduction / Analysis SW: 10M LoC
10
Zenodo – GitHub bridge.zenodo.json
11
Code ↔ Data ↔ Paper
12
Interpretable Raw Calibrate Filter Transform Reconstructed Reduced Select Published Anonymised Standardised Annotated Data Reduction / Analysis Calibration data Conditions data Formatters Filter/Selection algorithms Statistical Models
13
Repeatability Capture –Entire workflow –With data, code, statistical models, documentation –Environment, Virtual Machines
14
Verification and Reproduction Good software development practice: –Code test suite Unit & regression Publish data and analysis code together –Workflow and environment captured –Automated test of the result rerunconfirmed
15
http://zenodo.org @zenodo_org Tim.Smith@cern.ch http://www.cern.ch
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.