David De Roure Repeat, Reuse, Remix, Reproduce, … Reconstructable Research.

Slides:



Advertisements
Similar presentations
Geoscience Information Network Stephen M Richard Arizona Geological Survey National Geothermal Data System.
Advertisements

CoAKTing IFD Dave in Hawaii. 2 CoAKTing IFD n Objective is to advance the state of the art in collaborative mediated spaces for distributed e- Science.
David De Roure Social Networking and Workflows in Research.
David De Roure. Between 19 th October and 23 rd November 2007 I attended six international meetings related to e-Science Grid 2007 Scientific and Scholarly.
E-Science: Understanding Research Data Malcolm Atkinson & David De Roure & 20 October 2009 RCUK fact-finding mission.
Less is More Lightweight Ontologies and User Interfaces for Smart Labs J. G. Frey, G. V. Hughes, H. R. Mills, m. c. schraefel, G. M. Smith, David De Roure.
Workflows for Digital Curation and Preservation Stacy Kowalczyk PASIG Dublin 2012 October 17, 2012.
ISWC 2005, Galway Seven Bottlenecks to Workflow Reuse and Repurposing Antoon Goderis Ulrike Sattler Phillip Lord Carole Goble University of Manchester.
European Life Sciences Infrastructure for Biological Information Rafael C Jimenez ELIXIR CTO EMBL-EBI workshop networks and pathways.
David De Roure Manchester Edition. John Taylor There are a number of grid applications being developed and there is a whole raft of computer technologies.
Designing, Executing and Reusing Scientific Workflows Katy Wolstencroft, Paul Fisher, myGrid.
Accelerating Time to Experiment – The myExperiment Approach to Open Science David De Roure Carole Goble Jiten Bhagat.
David De Roure Creating Research Objects that contain collections of data, papers and research workflows.
Microsoft Research Faculty Summit David De Roure University of Southampton, UK.
David De Roure Eindhoven Edition. Due to the complexity of the software and the backend infrastructural requirements, e-Science projects usually involve.
Toward Replayable Research in Networking and Systems Eric Eide University of Utah, School of Computing May 25, 2010.
Future Access to the Scientific and Cultural Heritage – A shared Responsibility Birte Christensen-Dalsgaard State and University Library.
Jiten Bhagat University of myExperiment A Social VRE for Research Objects JISC Roadshow | February.
Personal Data Management Why is this such an issue? Data Provenance Representing links v Representing data Identifying resources: Life Science Identifiers.
University of Illinois Role of Mashups, Cloud Computing, and Parallelism for Visual Analytics Loretta Auvil.
Sean Making Metadata Work, ISKO London, 23 rd June 2014 Metadata for Research Objects 1.
Provenance in my Grid Jun Zhao School of Computer Science The University of Manchester, U.K. 21 October, 2004.
David De Roure WSRI Summer School RPI July You will be able to answer the question “What is Web 2.0?” 2.You will have some ideas about how our.
An Introduction to Designing and Executing Workflows with Taverna Aleksandra Pawlik University of Manchester materials by Dr Katy Wolstencroft and Dr Aleksandra.
The Data Attribution Abdul Saboor PhD Research Student Model Base Development and Software Quality Assurance Research Group Freie.
1 Yolanda Gil Information Sciences InstituteJanuary 10, 2010 Requirements for caBIG Infrastructure to Support Semantic Workflows Yolanda.
Metadata: An Overview Katie Dunn Technology & Metadata Librarian
My Experiment – A Web 2.0 Virtual Research Environment David De Roure Carole Goble.
Managing the Record of Research At the Smithsonian Using SIdora SAA Research Forum August 12, 2014.
14/11/11 Taverna Roadmap Shoaib Sufi myGrid Project Manager.
Designing, Executing, Reusing and Sharing Workflows: Taverna and myExperiment Supporting the in silico Experiment Life Cycle Katy Wolstencroft Paul Fisher.
Connecting to Ensemble: AlgoViz. AlgoViz Community  Sharing educational resources Visualizations for data structure and algorithms  Sharing experience.
OHT 11.1 © Marketing Insights Limited 2004 Chapter 9 Analysis and Design EC Security.
Taverna Workflow. A suite of tools for bioinformatics Fully featured, extensible and scalable scientific workflow management system – Workbench, server,
MyExperiment Research Objects: Beyond Workflows and Packs Stian Soiland-Reyes myGrid, University of Manchester BOSC 2013, ISMB, Berlin, This.
David De Roure University of Southampton, UK Carole Goble The University of Manchester, UK A Web 2.0 Virtual Research Environment OGF Semantic Grid Research.
Wf4Ever: Preserving workflows as digital Research Objects EGI Community Forum 2012, Workflow Systems workshop Leibniz Supercomputing Centre, Münich,
E-Science for the SKA WF4Ever: Supporting Reuse and Reproducibility in Experimental Science Lourdes Verdes-Montenegro* AMIGA and Wf4Ever teams Instituto.
MyExperiment 2.0 – Preserving digital Research Objects using the Wf4Ever architecture EGI/SHIWA Workshops on e-Science Workflows Budapest, Stian.
Taverna Workflows for Systems Biology Katy Wolstencroft School of Computer Science University of Manchester.
1 Dr. Paolo Missier, Prof. Carole Goble Information Management Group School of Computer Science, University of Manchester, UK with additional material.
SCAP E SCAPE Project EU project aimed at building a scalable platform for planning and execution of computation intensive processes for ingestion or migration.
Professor Carole Goble
Scientific Data Management - From the Lab to the Web Semantic Data Management Dagstuhl Seminar April 2012 José Manuel Gómez Pérez, iSOCO
Data Attribution and Citation Practices and Standards Fifth China - U.S. Roundtable on Scientific Data Cooperation Beijing, China, October, 2011.
ICCS WSES BOF Discussion. Possible Topics Scientific workflows and Grid infrastructure Utilization of computing resources in scientific workflows; Virtual.
Infrastructures for Social Simulation Rob Procter National e-Infrastructure for Social Simulation ISGC 2010 Social Simulation Tutorial.
WHIP - Workflow Hosted in Portals Kurt Mueller and Andrew Harrison School of Computer Science, Cardiff And Ian Taylor School of Computer Science, Cardiff.
Data Management & the Library. FACT #1 Research is increasingly digital and produces digital data.
The Astronomy challenge: How can workflow preservation help? Susana Sánchez, Jose Enrique Ruíz, Lourdes Verdes-Montenegro, Julian Garrido, Juan de Dios.
A presentation about myExperiment David De Roure and Carole Goble.
David De Roure Workflows in Support of Large-Scale Science Provenance, a.
The 10 Best Practices for Workflow Design BioVeL M6 Workshop Göteborg, May 10-11, 2012 Kristina Hettne, Marco Roos (LUMC), Katy Wolstencroft, Carole Goble.
ISMB Demo, 01 July 2009 Franck Tanoh University of Manchester, UK.
RDFa Primer Bridging the Human and Data webs Presented by: Didit ( )
W ORKFLOW -C ENTRIC R ESEARCH O BJECTS : F IRST C LASS C ITIZENS IN S CHOLARLY D ISCOURSE Khalid Belhajjame, Oscar Corcho, Daniel Garijo, Jun Zhao, Paolo.
Jiro Sumitomo, James M. Hogan, Felicity Newell, Paul Roe Microsoft QUT eResearch Centre
Faculty of Education, Language and Community Services Stavroula Tsembas Marketing and Distribution: Metadata Linkages What is metadata? information about.
Co-evolution of digital technologies and research methods David De Roure.
MyExperiment Team F2F Manchester November Team Face to Face Meeting (Manchester) Thursday, 26th November myExperiment meeting. University.
Smart Labs for Smart People New ways to collect, curate and share information Jeremy Frey School of Chemistry, University of Southampton June 2010Jeremy.
The Influence and Impact of Web 2.0 on e-Research Infrastructure, Applications and Users User Day.
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
REMI Database Antall Fernandes. REMI ● A relational database to facilitate data - metadata organization of various research studies. ● Interface into.
Research Objects Preserving scientific data and methods Stian Soiland-Reyes, Khalid Belhajjame School of Computer Science, Univ of Manchester myGrid NIHBI.
myExperiment: Towards Research Objects David De Roure
Jenn Riley Metadata Librarian Digital Library Program
An ontology for e-Research
Managing Private and Public Views of DDI Metadata Repositories
Jenn Riley Metadata Librarian Digital Library Program
Presentation transcript:

David De Roure Repeat, Reuse, Remix, Reproduce, … Reconstructable Research

Expertise Community Software Digital Music Collections ground truth Evaluation Infrastructure (sociotechnical) Results Evaluations papers Papers

Assembly of Artefacts

Assembly of Apparatus

Assembly of Apparatus

NRAO/AUI/NSF telescopes for the naked mind Datascopes From Signal to Understanding

data method

Kepler Triana BPEL Taverna Trident Meandre Galaxy

 “Facebook for Scientists”...but different to Facebook!  A repository of research methods  A community social network of people and things  A Social Virtual Research Environment  A probe into researcher behaviour  Open source (BSD) Ruby on Rails app  REST and SPARQL interfaces, supports Linked Data  Influenced BioCatalogue, MethodBox and SysMO-SEEK myExperiment currently has 307 groups, 2442 workflows, 608 files and 236 packs - see wiki.myexperiment.org

Results Logs Results Metadata Paper Slides Feeds into produces Included in produces Published in produces Included in Published in Workflow 16 Workflow 13 Common pathways QTL Paul’s Pack Paul’s Research Object

SELECT?pack ?contrib WHERE { ?pack rdf:type mepack:Pack. ?pack ore:aggregates ?contrib. } SELECT?pack ?contrib WHERE { ?pack rdf:type mepack:Pack. ?pack ore:aggregates ?contrib. } SELECT?wf ?uri WHERE { ?wf mebase:has-current-version ?v. ?v mecomp:executes-dataflow ?d. ?d mecomp:has-component ?c. ?c rdf:type mecomp:WSDLProcessor. ?c mecomp:processor-uri ?uri. } SELECT?wf ?uri WHERE { ?wf mebase:has-current-version ?v. ?v mecomp:executes-dataflow ?d. ?d mecomp:has-component ?c. ?c rdf:type mecomp:WSDLProcessor. ?c mecomp:processor-uri ?uri. } Sean Bechhofer

Reusable. The key tenet of Research Objects is to support the sharing and reuse of data, methods and processes. Repurposeable. Reuse may also involve the reuse of constituent parts of the Research Object. Repeatable. There should be sufficient information in a Research Object to be able to repeat the study, perhaps years later. Reproducible. A third party can start with the same inputs and methods and see if a prior result can be confirmed. Replayable. Studies might involve single investigations that happen in milliseconds or protracted processes that take years. Referenceable. If research objects are to augment or replace traditional publication methods, then they must be referenceable or citeable. Revealable. Third parties must be able to audit the steps performed in the research in order to be convinced of the validity of results. Respectful. Explicit representations of the provenance, lineage and flow of intellectual property. The R dimensions Replacing the Paper: The Twelve Rs of the e-Research Record” on + Repair, Release, …

Machine repeat Machine repeat REPRODUCE Machine software paper Research Record software Software REPRODUCE OR REPEAT? software workflow paper Software wf Machine software workflow algorithm software

What is the future of the papers so that we can reconstruct research? “Instruments” Experiments Results Research

openresearchsoftware.metajnl.com

How do we reconstruct Citizen Scholarship?

Discussion An experiment is an assembly of artefacts Software is an assembly of artefacts What is the research record so that we can reconstruct research? - Describe or encapsulate? (Web or particle?) - Learn from software practice? Machines are users too… autonomic Mirex? Data is getting attention, remind people about software too, and experiments as reconstructable research objects… which might be executable

Credits: Ashley Burgoyne, Ichiro Fujinaga, Kevin Page, Ben Fields, Stephen Downie, Malcolm Atkinson, Iain Buchan, Carole Goble, Paul Fisher, Sean Bechhofer, Tim Crawford