CERN – IT Department CH-1211 Genève 23 Switzerland www.cern.ch/i t Data Publishing Tim Smith CERN/IT.

Slides:



Advertisements
Similar presentations
What is Science?.
Advertisements

Earth Science Chapter 1-1.
Experiments in Computer Science Mark Claypool. Introduction Some claim computer science is not an experimental science –Computers are man-made, predictable.
Active Data Curation in Libraries: Issues and Challenges ASEE ELD Presentation June 27, 2011 William H. Mischo & Mary C. Schlembach.
1 CS 502: Computing Methods for Digital Libraries Lecture 27 Preservation.
Data Publishing Workflows: Strategies and Standards
Types of software. Sonam Dema..
WHAT IS SCIENCE WORD WALL PART 2. REPETITION Making multiple sets of measurements or observations in a scientific investigation. Running through the experiment.
CERN - IT Department CH-1211 Genève 23 Switzerland t Monitoring the ATLAS Distributed Data Management System Ricardo Rocha (CERN) on behalf.
CERN – IT Department CH-1211 Genève 23 Switzerland t CERN Open Source Collaborative tools: Digital Library Software Tim Smith CERN/IT.
Managing Mature White Box Clusters at CERN LCW: Practical Experience Tim Smith CERN/IT.
CERN - IT Department CH-1211 Genève 23 Switzerland t The CERN Document Server 12 th November 2010 Tim Smith.
Data Analysis using Java Mobile Agents Mark Dönszelmann, Information, Process and Technology Group, IT, CERN ATLAS Software Workshop Analysis Tools Meeting,
Preserving the Scientific Record: Preserving a Record of Environmental Change Matthew Mayernik National Center for Atmospheric Research Version 1.0 [Review.
Software Sustainability Institute Dealing with software: the research data issues 26 August.
CERN IT Department CH-1211 Genève 23 Switzerland t Windows Desktop Applications Life-cycle Management Sebastien Dellabella, Rafal Otto Internet.
CERN IT Department CH-1211 Genève 23 Switzerland t LCG Gridview / LCG SAM use cases Miguel Anjo 8 th July 2008 Database Developers’ Workshop.
European Organization for Nuclear Research Organisation Européenne pour la Recherche Nucléaire High-Energy Physics Data Delivering Data in Science ICSTI.
CERN IT Department CH-1211 Genève 23 Switzerland t MSG status update Messaging System for the Grid First experiences
Practicing Science LESSON 1 – SKILLS OF SCIENCE MS. CABRERA.
CERN – IT Department CH-1211 Genève 23 Switzerland t Open Access at CERN Tim Smith CERN/IT.
Statistical Analysis of Inlining Heuristics in Jikes RVM Jing Yang Department of Computer Science, University of Virginia.
Agriculture Biology Mr. Bushman. Science The process through which nature is: Studied Discovered Understood All areas of science involve posing inquires.
The Scientific Method. What is Science? Write 3 questions a biologist might ask about this picture.
The research process Psych 231: Research Methods in Psychology.
CERN – IT Department CH-1211 Genève 23 Switzerland t Working with Large Data Sets Tim Smith CERN/IT Open Access and Research Data Session.
CERN IT Department CH-1211 Genève 23 Switzerland t 24x7 Service Support Tony Cass LCG GDB, 24 th November 2009.
CERN IT Department CH-1211 Genève 23 Switzerland t Towards agile software development Marwan Khelif IT-CS-CT IT Technical Forum – 31th May.
Chapter 1.1 – What is Science?. State and explain the goals of science. Describe the steps used in the scientific method. Daily Objectives.
Zenodo Information Architecture and Usability CERN openlab Summer Students Lightning Talks Sessions Megan Potter › 19/08/2015.
CERN General Infrastructure Services Department CERN GS Department CH-1211 Geneva 23 Switzerland SMS CERN General Infrastructure.
Data Organization Quality Assurance and Transformations.
Dr. Fuchs. 1.1 What is Science What are the goals of Science and what procedures are at the core of scientific methodology?
CERN – IT Department CH-1211 Genève 23 Switzerland t Zenodo: Share, Publish and Preserve Multidisciplinary Research Results Tim SMITH Cloud.
© 2016 LDRA Ltd The FACE Conformance Verification Matrix in Practice.
Digital Media Lecture 0: It’s all just bits! Georgia Gwinnett College School of Science and Technology Dr. Jim Rowan.
A WEB-ENABLED APPROACH FOR GENERATING DATA PROCESSORS University of Nevada Reno Department of Computer Science & Engineering Jigar Patel Sergiu M. Dascalu.
CERN – IT Department CH-1211 Genève 23 Switzerland t Zenodo: in support of Open Science Tim SMITH At CERN-JRC meeting.
CERN IT Department CH-1211 Genève 23 Switzerland t Migration from ELFMs to Agile Infrastructure CERN, IT Department.
CERN IT Department CH-1211 Genève 23 Switzerland t COOL Performance Tests ATLAS Conditions Database example Romain Basset, IT-DM October.
CERN - IT Department CH-1211 Genève 23 Switzerland CCRC Tape Metrics Tier-0 Tim Bell January 2008.
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland t DBCF GT Grid Technology SL Section Software Lifecycle Duarte Meneses.
The Scientific Method & Scientific Inquiry. The Process of Science SCIENCE is a way of exploring the natural world Science DOES NOT attempt to answer.
CERN IT Department CH-1211 Genève 23 Switzerland t Web Content Management IT Considerations Tim Smith IT/UDS.
 Observation  Formulate a Hypothesis  Set Up a Controlled Experiment  Organize and Analyzing Data  Drawing Conclusions  Repeating Experiments /
The research process Psych 231: Research Methods in Psychology.
Dr Tim Smith CERN/IT For the visit of the Alliance of German Science Organizations.
A WEB-ENABLED APPROACH FOR GENERATING DATA PROCESSORS University of Nevada Reno Department of Computer Science & Engineering Jigar Patel Sohei Okamoto.
CERN - IT Department CH-1211 Genève 23 Switzerland t Improving CERN AV Workflow 10 th January 2011 Thomas Baron, Jacques Fichet, Tim Smith.
The Scientific Method. The scientific method is the only scientific way accepted to back up a theory or idea. This is the method on which all research.
CERN IT Department CH-1211 Genève 23 Switzerland t Bamboo users meeting IT-CS-CT.
Webinar on increasing openness and reproducibility April Clyburne-Sherin Reproducible Research Evangelist
CERN IT Department CH-1211 Genève 23 Switzerland t EIS Section input to GLM For GLM attended by Director for Computing.
CERN - IT Department CH-1211 Genève 23 Switzerland t Improving CERN AV Workflow 13 nth December 2010 Thomas Baron, Jacques Fichet, Tim Smith.
CERN – IT Department CH-1211 Genève 23 Switzerland t Copyright and Content Tim SMITH Invenio User Group Workshop, CERN, Oct 2015.
© 2016 by Pearson Education, Inc.. THINK CRITICALLY, FACIONE & GITTENS Chapter 14 Empirical Reasoning.
Scientific Literature and Communication Unit 3- Investigative Biology b) Scientific literature and communication.
We have stated that science is really just a body of knowledge.
Distinguish between an experiment and other types of scientific investigations where variables are not controlled,
Opening Big Data; in small and large chunks
Zenodo: A Research Data Repository for All
Scientific Thinking and Processes Notes
2-2 What is the Process of Science?
Chapter 1 The Scientific Method
What is Science? Review This slide show will present a question, followed by a slide with an acceptable answer. For some questions, there is a definite.
Do Now: Answer the following in your Science Notebook using complete sentences.
The Scientific Method.
Earth Science Chapter 1-1.
BES III Software: Short-term Plan ( )
INAF Long Term Preservation
Presentation transcript:

CERN – IT Department CH-1211 Genève 23 Switzerland t Data Publishing Tim Smith CERN/IT

Easy, in essence…

Challenging, in practice Bit Rot Media Verification Media Migration Technology tracking

Open Data as a Service REST API REST API OAI- PMH API OAI- PMH API Open Data Pilot

Low Barriers

Beware the False Summit Data Publication Science

Digital Dark Ages Scientific method Propose hypotheses to explain phenomena Test hypotheses predictions through repeatable experiment Share observations and conclusions for independent scrutiny, reproduction and verification Publication: Preparation (standardisation), issuing

Accessible Normalisation

Interpretable Raw Reconstructed Reduced Published Data Reduction / Analysis SW: 10M LoC

Zenodo – GitHub bridge.zenodo.json

Code ↔ Data ↔ Paper

Interpretable Raw Calibrate Filter Transform Reconstructed Reduced Select Published Anonymised Standardised Annotated Data Reduction / Analysis Calibration data Conditions data Formatters Filter/Selection algorithms Statistical Models

Repeatability Capture –Entire workflow –With data, code, statistical models, documentation –Environment, Virtual Machines

Verification and Reproduction Good software development practice: –Code test suite Unit & regression Publish data and analysis code together –Workflow and environment captured –Automated test of the result rerunconfirmed