Reproducible Bioinformatics


Reproducible Bioinformatics
Bioconda and BioContainers enabling sustainable bioinformatics infrastructure
Yasset Perez-Riverol, PhD
GitHub: github.com/ypriverol
Twitter: @ypriverol
Bioconda/BioContainers community

Outline
- Reproducibility in science
- What is a container, and why are containers so popular?
- BioContainers architecture
- Future directions

Bioinformatics Software: Reproducibility/Usability Challenge
[Diagram labels: Research, Software, Publication; My Data ?; Dependency Issues, Versioning, Testing/Integration]

Learning from some communities: Bioconductor
[Diagram labels: Research, Software: R Package, Publication; My Data ?]
Implementation of guidelines:
- Versioning
- Testing
- Dependency management
- Documentation

Matrix of Hell

Containers Solution.

Docker Architecture

Why are containers so popular?
- Build once, run anywhere.
- A clean, safe and portable runtime environment for your app.
- No worries about missing dependencies, packages and other pain points during subsequent deployments.
- Run each app in its own isolated container.
- Automate testing, integration, packaging: anything you can script.
- A VM without the overhead of a VM.

Current BioContainers Architecture

Dockerfile Container
    # Base Image
    FROM biocontainers/biocontainers:latest
    # Metadata
    LABEL base.image="biocontainers:latest"
    LABEL version="3"
    LABEL software="Comet"
    LABEL software.version="2016012"
    LABEL description="an open source tandem mass spectrometry sequence database search tool"
    LABEL website="http://comet-ms.sourceforge.net/"
    LABEL documentation="http://comet-ms.sourceforge.net/parameters/parameters_2016010/"
    LABEL license="http://comet-ms.sourceforge.net/"
    LABEL tags="Proteomics"
    # Maintainer
    MAINTAINER Felipe da Veiga Leprevost <felipe@leprevost.com.br>
    USER biodocker
    RUN ZIP=comet_binaries_2016012.zip && \
        wget https://github.com/BioDocker/software-archive/releases/download/Comet/$ZIP -O /tmp/$ZIP && \
        unzip /tmp/$ZIP -d /home/biodocker/bin/Comet/ && \
        chmod -R 755 /home/biodocker/bin/Comet/* && \
        rm /tmp/$ZIP
    RUN mv /home/biodocker/bin/Comet/comet_binaries_2016012/comet.2016012.linux.exe /home/biodocker/bin/Comet/comet
    ENV PATH /home/biodocker/bin/Comet:$PATH
    WORKDIR /data/
    CMD ["comet"]
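A minimal sketch of how a Dockerfile like the Comet one might be built and run. The image tag `comet-example` and the `run` helper are illustrative names, not part of the original slides. The helper only prints each command unless `DRY_RUN=0` is set, so the sketch can be tried on a machine without Docker.

```shell
# Hypothetical wrapper (not from the slides): print a command, and execute
# it only when DRY_RUN=0, so the sketch is safe to run without Docker.
run() {
  echo "+ $*"
  if [ "${DRY_RUN:-1}" = "0" ]; then
    "$@"
  fi
}

# Assumed local tag for the image built from the Comet Dockerfile.
IMAGE="comet-example"

# Build the image from the Dockerfile in the current directory.
run docker build -t "$IMAGE" .

# Start a throwaway container, mounting the current directory at /data
# (the WORKDIR set in the Dockerfile) so the default command `comet`
# can see local files.
run docker run --rm -v "$PWD:/data" "$IMAGE"
```

Setting `DRY_RUN=0` in the environment makes the same script execute the commands instead of only printing them.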

Tool deployment & sustainability in science
What is needed?
- Programming language agnostic
- OS independent
- No root privileges needed
- Management of multiple versions
- HPC and cloud compatible
- Easy to maintain

Tool deployment & sustainability in science
Conda is an open source package management system and environment management system for installing multiple versions of software packages and their dependencies and switching easily between them.
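As a sketch of what "multiple versions" can look like in practice, the snippet below prints the commands that would create one Conda environment per version of a tool. The tool (bedtools), the version numbers, and the environment naming scheme are illustrative assumptions, and `conda_env_cmd` is a hypothetical helper that only prints the command rather than running it.

```shell
# Hypothetical helper: print the conda command that would create an
# isolated environment holding one specific version of a package.
# Channel order (conda-forge before bioconda) is an assumption based on
# common Bioconda setups.
conda_env_cmd() {
  pkg="$1"
  version="$2"
  echo "conda create -y -n ${pkg}-${version} -c conda-forge -c bioconda ${pkg}=${version}"
}

# Two versions of the same tool, each in its own environment.
conda_env_cmd bedtools 2.27.1
conda_env_cmd bedtools 2.30.0

# Switching between versions is then a matter of activating an environment:
#   conda activate bedtools-2.27.1
```

Because each environment is a separate directory under the user's Conda installation, this approach typically needs no root privileges.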

BioContainers: Mulled-based containers
Meeting, August 2017

But the new cool kid is called Containers
Get all the other nice technologies for free: rkt, Singularity, modules

How to find a container
http://biocontainers.pro/registry/#/

Namespaces
Namespace for Dockerfile-based containers:
    docker pull biocontainers/blast
Namespace for Dockerfile-free (mulled) containers:
    docker pull quay.io/biocontainers/bedtools
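The two namespaces can be captured in a small helper, sketched here under stated assumptions: `biocontainer_ref` and the flavour labels `dockerfile` and `mulled` are names invented for this example, and the optional third argument is a tag whose exact value would have to be looked up in the registry.

```shell
# Hypothetical helper: map a container flavour and tool name to an image
# reference in one of the two BioContainers namespaces.
biocontainer_ref() {
  flavour="$1"   # "dockerfile" or "mulled" (labels chosen for this sketch)
  tool="$2"
  tag="${3:-}"   # optional; empty leaves the tag choice to docker
  case "$flavour" in
    dockerfile) ref="biocontainers/${tool}" ;;
    mulled)     ref="quay.io/biocontainers/${tool}" ;;
    *) echo "unknown flavour: ${flavour}" >&2; return 1 ;;
  esac
  if [ -n "$tag" ]; then
    ref="${ref}:${tag}"
  fi
  echo "$ref"
}

# Print the pull commands for the two examples from the slide.
echo "docker pull $(biocontainer_ref dockerfile blast)"
echo "docker pull $(biocontainer_ref mulled bedtools)"
```

Images in the quay.io namespace are usually published under explicit version tags, so a pull may need a tag appended; the registry listing is the place to check.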

Who is using it?

What is next: MultiContainers
http://biocontainers.pro/multi-package-containers/

Some Numbers
- More than 2000 containers.
- Used in production by: Galaxy, Phenomenal2020, CyVerse, OSG.
- 210 issues discussed.
- More than 30 contributors.
Relevant Links
- http://biocontainers.pro
- http://biocontainers.pro/registry
- http://github.com/BioContainers/