Workflow Systems for Life Sciences and Social Sciences

Slides:



Advertisements
Similar presentations
Introduction to Computers Lecture By K. Ezirim. What is a Computer? An electronic device –Desktops, Notebooks, Mobile Devices, Calculators etc. Require.
Advertisements

Kensington Oracle Edition: Open Discovery Workflow Meets Oracle 10g Professor Yike Guo.
Web Accessible Virtual Research Environment for Ecosystem Science Community Presentation by Siddeswara Guru.
1 Richard White Design decisions: architecture 1 July 2005 BiodiversityWorld Grid Workshop NeSC, Edinburgh, 30 June - 1 July 2005 Design decisions: architecture.
DEV392: Extending SharePoint Products And Technologies Through Web Parts And ASP.NET Clint Covington, Program Manager Data And Developer Services - Office.
1 of 2 This document is for informational purposes only. MICROSOFT MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN THIS DOCUMENT. © 2007 Microsoft Corporation.
© , Michael Aivazis DANSE Software Issues Michael Aivazis California Institute of Technology DANSE Software Workshop September 3-8, 2003.
SESSION 9 THE INTERNET AND THE NEW INFORMATION NEW INFORMATIONTECHNOLOGYINFRASTRUCTURE.
Microsoft SharePoint 2013 SharePoint 2013 as a Developer Platform
Microsoft Office Sharepoint Server 2007 (MOSS) Overview Momentum Microsoft November 15, 2007.
INTRODUCTION TO CLOUD COMPUTING Cs 595 Lecture 5 2/11/2015.
Office 365: Efficient Cloud Solutions Wednesday March 12, 9AM Chaz Vossburg / Gabe Laushbaugh.
A Semantic Workflow Mechanism to Realise Experimental Goals and Constraints Edoardo Pignotti, Peter Edwards, Alun Preece, Nick Gotts and Gary Polhill School.
Windows.Net Programming Series Preview. Course Schedule CourseDate Microsoft.Net Fundamentals 01/13/2014 Microsoft Windows/Web Fundamentals 01/20/2014.
Moving forward our shared data agenda: a view from the publishing industry ICSTI, March 2012.
Module 3: Business Information Systems Chapter 11: Knowledge Management.
Lecture 8 – Platform as a Service. Introduction We have discussed the SPI model of Cloud Computing – IaaS – PaaS – SaaS.
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
1 Yolanda Gil Information Sciences InstituteJanuary 10, 2010 Requirements for caBIG Infrastructure to Support Semantic Workflows Yolanda.
Electronic Laboratory Notebook
Class Instructor Name Date. Classroom Tips Class Roster – Please Sign In Class Roster – Please Sign In Internet Usage Internet Usage –Breaks and Lunch.
Introduction To Computer System
Managing the Record of Research At the Smithsonian Using SIdora SAA Research Forum August 12, 2014.
© 2003 East Collaborative e ast COLLABORATIVE ® eC SoftwareProducts TrackeCHealth.
System Design: Designing the User Interface Dr. Dania Bilal IS582 Spring 2009.
Research Data Management At the Smithsonian Using SIdora Nano Tech Working Group May 15, 2014.
, Implementing GIS for Expanded Data Accessibility and Discoverability ASDC Introduction The Atmospheric Science Data Center (ASDC) at NASA Langley Research.
Introduction to Versioning
CHAPTER FOUR COMPUTER SOFTWARE.
Data Management: Documentation and Metadata for Engineering and Physical Sciences Ivey Glendon, Metadata Librarian Jeremy Bartczak, Intellectual Access.
NEPTUNE Canada Workshop Oceans 2.0 Project Environment NEPTUNE Canada DMAS Team Victoria, BC February 16, 2009.
Data Wrangling and Interoperability Andrea Denton Research and Data Services Manager Claude Moore Health Sciences Library Ricky Patterson.
Data Management: Documentation & Metadata Sherry Lake, Senior Data Consultant Bill Corey, Data Consultant Jeremy Bartczak, Intellectual Access & Metadata.
Taverna Workflow. A suite of tools for bioinformatics Fully featured, extensible and scalable scientific workflow management system – Workbench, server,
PLoS ONE Application Journal Publishing System (JPS) First application built on Topaz application framework Web 2.0 –Uses a template engine to display.
What is Cyberinfrastructure? Russ Hobby, Internet2 Clemson University CI Days 20 May 2008.
Large Scale Nuclear Physics Calculations in a Workflow Environment and Data Provenance Capturing Fang Liu and Masha Sosonkina Scalable Computing Lab, USDOE.
SEEK Welcome Malcolm Atkinson Director 12 th May 2004.
Introduction to ArcGIS for Environmental Scientists Module 3 – GIS Analysis Model Builder.
Nature Reviews/2012. Next-Generation Sequencing (NGS): Data Generation NGS will generate more broadly applicable data for various novel functional assays.
1 Limitations of BLAST Can only search for a single query (e.g. find all genes similar to TTGGACAGGATCGA) What about more complex queries? “Find all genes.
Introduction of Geoprocessing Lecture 9. Geoprocessing  Geoprocessing is any GIS operation used to manipulate data. A typical geoprocessing operation.
Operation and Maintenance of APEC Engineer Data Bank (1) Operation and Maintenance of APEC Engineer Data Bank (1) Dr. Hsieh, Shang-Hsien (Patrick) Professor.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Research Data Management At the Smithsonian Using Sidora CNI December 10, 2013.
26/05/2005 Research Infrastructures - 'eInfrastructure: Grid initiatives‘ FP INFRASTRUCTURES-71 DIMMI Project a DI gital M ulti M edia I nfrastructure.
 Programming - the process of creating computer programs.
August 2003 At A Glance The IRC is a platform independent, extensible, and adaptive framework that provides robust, interactive, and distributed control.
Pertemuan 16 Materi : Buku Wajib & Sumber Materi :
Toward a common data and command representation for quantum chemistry Malcolm Atkinson Director 5 th April 2004.
Satisfying Requirements BPF for DRA shall address: –DAQ Environment (Eclipse RCP): Gumtree ISEE workbench integration; –Design Composing and Configurability,
ACCESSING DATA IN THE NIS USING THE KEPLER WORKFLOW SYSTEM Corinna Gries.
© Geodise Project, University of Southampton, Workflow Support for Advanced Grid-Enabled Computing Fenglian Xu *, M.
SharePoint Fest 2013 Chicago What’s New and Exciting (and not so great) in SharePoint Designer 2013 Workflows Ira Fuchs – SharePoint Technical Specialist,
Project Paper Presentation Hanlin Wan March 15, 2011.
Visual Basic.NET Comprehensive Concepts and Techniques Chapter 1 An Introduction to Visual Basic.NET and Program Design.
Devices 10 billion Internet- connected devices by 2016 People 1 billion+ people use social media services today Cloud 30 % of data will live in or pass.
Your Interactive Guide to the Digital World Discovering Computers 2012 Chapter 13 Computer Programs and Programming Languages.
Introduction to Visual Basic. NET,. NET Framework and Visual Studio
Unit 3 Virtualization.
MIRACLE Cloud-based reproducible data analysis and visualization for outputs of agent-based models Xiongbing Jin, Kirsten Robinson, Allen Lee, Gary Polhill,
University of Chicago and ANL
Pipeline Execution Environment
SNOW ONLINE TRAINING IN HYDERABAD
Graduation Project Kick-off presentation - SET
An Introduction to Visual Basic .NET and Program Design
MetaShare, Powered by Azure, Gives SharePoint a User-Friendly, Intuitive User Interface and Added App Features with No Added Administrative Tasks OFFICE.
New Platform to Support Digital Humanities in the Czech Republic
Scientific Workflows Lecture 15
Presentation transcript:

Workflow Systems for Life Sciences and Social Sciences Bill Corey Data Consultant Data Management Consulting Group University of Virginia Library wtc2h@virginia.edu Andrea Horne Denton Health Sciences Data Consultant Claude Moore Health Sciences Library ash6b@virginia.edu © 2013 by the Rector and Visitors of the University of Virginia. This work is made available under the terms of the Creative Commons Attribution-ShareAlike 4.0 International license http://creativecommons.org/licenses/by-sa/4.0/

Goals for the workshop Learn about workflow and process Identify workflow tools See how workflow tools can support your data management Gain peer and expert feedback

Workflow versus Process Workflow: A sequence of connected steps where each step follows the next, or where steps occur concurrently. It is a systematic pattern of activity. Process: A workflow that has well-defined inputs, outputs and purposes. Historically, workflow management systems are business oriented. There are literally hundreds of companies providing products to fit business needs. Some examples of BI workflow products are Approval Process Control, Asset Management, Auto Notification systems, Help Desk Management, Resource Management, User Activity Monitoring, Purchasing, Operations, Personal Management and Warehouse Management. http://en.wikipedia.org/wiki/Workflow

Glossary Workflow Management System (WMS): Computer system that manages & defines a series of tasks. Scientific Workflow System (SWS): Computer system designed to compose & execute a series of computational or data manipulation steps in a scientific application. Also called a Scientific Resource Management System (SRM) or a Scientific Workflow Management System (SWMS). Electronic Lab Notebook (ELN): Computer program designed to replace paper laboratory notebooks. Semantic Workflow System (SWS): Computer system that designs computational experiments, validates workflows, and generates metadata. Laboratory Information Management System (LIMS): Computer program that serves as a laboratory and information management system. Often includes ELNs, SWSs, data management, and data analysis tools. May also be called a Laboratory Information System (LIS) or Laboratory Management System (LMS).

What is a scientific workflow? According to the Pegasus Workflow Management System Project… “allows users to easily express multi-step computational tasks, for example retrieve data from an instrument or a database, reformat the data, and run an analysis.” Scientific workflows allow scientists to model, design, execute, debug, re-configure and re-run analysis and visualization processes.

Typical features of workflows Portability / Reuse Flexible design Performance Scalability Provenance Data Management Reliability Reproducibility image source: http://pegasus.isi.edu/

Science Pipes and Kepler Based on the Kepler Scientific Workflow Management System (SWMS) Science Pipes allows anyone to access, analyze, and visualize primary biodiversity data using the data analysis tools of Kepler. Analyses and visualizations can be shared, modified, reused and enhanced. The Kepler Project Create, Execute and Share models and analyses Scientific and Engineering disciplines Data sources can be local or online Integrate different software components, such as “R” and “C” code Facilitate remote and distributed execution of models Graphical User Interface (GUI) Demo: Video: Introduction to SciencePipes http://info.sciencepipes.org/help/2012/10/video-introduction-to-sciencepipes.html Video: How to create a pipe (6 min) http://info.sciencepipes.org/help/2012/12/video-how-to-create-a-simple-pipe.html Kepler: https://kepler-project.org/ Sources: https://kepler-project.org/ & http://sciencepipes.org/beta/home Image source: http://sciencepipes.org/beta/pipes/detail?workflowID=b1e1c4c6-8931-11df-8902-cb7a96ae555e

Sources: http://irisnote.com/how-it-works/ irisnote SRM Platform Enterprise Electronic Lab Notebook (ELN) and Content Management System (CMS) irisnote Simple, powerful replacement for paper notebooks Highly scalable architecture Data & application agnostic SaaS, on-premise private cloud, or hybrid Collaboration – groups, in-system messaging, email Manage & protect institutional knowledge -- fully compliant with 21 CFR Part 11, GLP & ISO regulatory requirements, encryption Cross-platform -- Mac, Windows, iPad Irisnote announced Oct. 23, 2013 that the basic cloud service will be removed from service on Oct. 25, 2013. Demo: Log into my irisnote account to demonstrate the layout. http://irisnote.com/solutions/ http://irisnote.com/how-it-works/ Sources: http://irisnote.com/how-it-works/ Image source: http://irisnote.com/how-it-works/

Source: http://wings-workflows.org/about WINGS Workflow Instance Generation & Specialization Semantic Workflow System that assists scientists with the design of computational experiments Multi-discipline: sociology, biology, education, environmental sciences Uses Pegasus or OODT as the execution engine Can submit workflows in scripted format for local, secure execution Generates metadata attributes for all new data products Modular design Designed to interface with external data & component catalogs WINGS: http://wings-workflows.org/about http://seagull.isi.edu/marbles/ Dmo: WINGS Sandbox: http://www.wings-workflows.org/sandbox/ OODT: http://oodt.apache.org/ Pegasus: http://pegasus.isi.edu/ Source: http://wings-workflows.org/about

Open Science Framework The Open Science Framework (OSF) is part network of research materials, part version control system, and part collaboration software. The purpose of the software is to support the scientist's workflow and help increase the alignment between scientific values and scientific practices. Document and archive studies Share and find materials Detail individual contributions Increase transparency Register materials Manage scientific workflow WINGS: http://wings-workflows.org/about http://seagull.isi.edu/marbles/ Dmo: WINGS Sandbox: http://www.wings-workflows.org/sandbox/ OODT: http://oodt.apache.org/ Pegasus: http://pegasus.isi.edu/ Source: https://openscienceframework.org/explore/

What is your experience? Does anyone have a tool or workflow system that they are using now? What challenges have you encountered using a tool or workflow? What are the barriers to using these solutions? What features are you looking for in your ‘ideal’ workflow system tools?

What workflow tools are available? Scientific Workflow Management Systems -- General: Pegasus http://pegasus.isi.edu/ Taverna http://www.taverna.org.uk/ Kepler https://kepler-project.org/ accerlys http://accelrys.com/ Open Science Framework https://openscienceframework.org/ ESRI ArcGIS http://www.esri.com/software/arcgis Scientific Workflow Systems -- Domain-specific: Social Sciences: Microsoft Visio, WINGS, Argos Ontology, text editors (Emacs, notepad) Life Sciences: VisTrails, Kepler, Galaxy, Science Pipes, Scilligence Engineering: Triana, GridANT, GridFlow, Askalon, Karajan, Platform LSF Physical Sciences: ELNs, LIMS, SDMS, Pegasus Humanities: Virtual Research Environments, Omeka Text editors: http://kieranhealy.org/files/misc/workflow-apps.pdf http://kieranhealy.org/resources/

What workflow tools are available? Workflow Modules and Tools: Karma Provenance Collection http://d2i.indiana.edu/provenance_karma myexperiment: scientific workflows and research objects http://www.myexperiment.org/ Microsoft Visio: flowchart software Microsoft SharePoint: document management and scientific workflow integration ELNs, LIMs Golims http://www.golims.com/ Labarchives http://www.labarchives.com/ Labguru http://www.labguru.com/ Notebookmaker http://www.notebookmaker.com/ LabCollector http://labcollector.com/ Thermo Scientific Watson http://www.thermoscientific.com/

Demo 1: Science Pipes/Kepler Video: Introduction to SciencePipes http://info.sciencepipes.org/help/2012/10/video-introduction-to-sciencepipes.html Video: How to create a pipe http://info.sciencepipes.org/help/2012/12/video-how-to-create-a-simple-pipe.html Video: Introduction to SciencePipes http://info.sciencepipes.org/help/2012/10/video-introduction-to-sciencepipes.html Video: How to create a pipe http://info.sciencepipes.org/help/2012/12/video-how-to-create-a-simple-pipe.html

Demo 2: WINGS WINGS: http://wings-workflows.org/about WINGS example: http://seagull.isi.edu/marbles/ WINGS Sandbox: http://www.wings-workflows.org/sandbox/ OODT: http://oodt.apache.org/ Pegasus: http://pegasus.isi.edu/ WINGS: http://wings-workflows.org/about Example: http://seagull.isi.edu/marbles/ Sandbox: http://www.wings-workflows.org/sandbox/ OODT: http://oodt.apache.org/ Pegasus: http://pegasus.isi.edu/

Demo 3: Open Science Framework OSF: Scientists https://www.youtube.com/watch?feature=player_embedded&v=c6lCJFSnMcg OSF: Developers https://www.youtube.com/watch?feature=player_embedded&v=WRadGRdkAIQ For Scientists: https://www.youtube.com/watch?feature=player_embedded&v=c6lCJFSnMcg For Developers: https://www.youtube.com/watch?feature=player_embedded&v=WRadGRdkAIQ

More ideas! William J. Turkel Doing research with digital sources http://williamjturkel.net/how-to/ Bamboo DiRT Digital Research Tools http://dirt.projectbamboo.org/ Transforming Scholarly Communication http://msrworkshop.tumblr.com/tagged/resources Mindmeister Mind mapping & brainstorming http://www.mindmeister.com/ Scrivener http://www.literatureandlatte.com/scrivener.php Zotero http://www.zotero.org/

Mailing List Subscription Please check the box on our sign-in sheet to receive occasional emails to keep up with our services, training, and news. Please encourage others to subscribe: http://eepurl.com/CJwYT

Questions? Bill Corey Data Management Consulting Group University of Virginia Library wtc2h@virginia.edu Andrea Horne Denton Health Sciences Data Consultant Claude Moore Health Sciences Library ash6b@virginia.edu http://dmconsult.library.virginia.edu dmconsult@virginia.edu