Cloud infrastructure for training in Life Sciences Manuel Corpas The Genome Analysis Centre.

Slides:



Advertisements
Similar presentations
University of St Andrews School of Computer Science Experiences with a Private Cloud St Andrews Cloud Computing co-laboratory James W. Smith Ali Khajeh-Hosseini.
Advertisements

Information on GVL - Genomics Virtual Laboratory Oct 2013 Audience: Service Desk Developed as part of the Australian.
Cloud Computing Mick Watson Director of ARK-Genomics The Roslin Institute.
The Genome Analysis Centre Building Excellence in Genomics and Computational Bioscience.
QCloud Queensland Cloud Data Storage and Services 27Mar2012 QCloud1.
IBM Watson Research © 2004 IBM Corporation BioHaystack: Gateway to the Biological Semantic Web Dennis Quan
Introduction to bioknoppix: Linux for the life sciences Carlos M Rodríguez Rivera Humberto Ortiz Zuazaga.
EMBL-EBI and Bioinformatics Steven Newhouse, Head of Technical Services, EMBL-EBI.
TGAC Training Coordination for the BBSRC Strategically-Funded Institutes Tanya Dickie: Bioinformatics & Biomathematics Training.
INTRODUCTION TO CLOUD COMPUTING Cs 595 Lecture 5 2/11/2015.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
CERN IT Department CH-1211 Genève 23 Switzerland t Next generation of virtual infrastructure with Hyper-V Michal Kwiatek, Juraj Sucik, Rafal.
Cloud Computing Why is it called the cloud?.
GMOD in the Cloud Genome Informatics November 3, 2011 Scott Cain GMOD Project Coordinator Ontario Institute for Cancer Research
IPlant Collaborative Powering a New Plant Biology iPlant Collaborative Powering a New Plant Biology.
Customized cloud platform for computing on your terms !
GridPP Tuesday, 23 September 2003 Tim Phillips. 2 Bristol e-Science Vision National scene Bristol e-Science Centre Issues & Challenges.
European Life Sciences Infrastructure for Biological Information ELIXIR
 Prototype for Course on Web Security ETEC 550.  Huge topic covering both system/network architecture and programming techniques.  Identified lack.
© What do bioinformaticians do?
Genomics Virtual Lab: analyze your data with a mouse click Igor Makunin School of Agriculture and Food Sciences, UQ, April 8, 2015.
Infrastructure clouds, microbial genomics, and the Cloud Virtual Resource project (CloVR) Sam Angiuoli
Microsoft Research Faculty Summit Paul Watson Professor of Computer Science Newcastle University, UK.
Building Biodiversity Information Education: Next Generation Bioinformaticians P. Bryan Heidorn Carole Palmer Dan Wright Graduate School of Library and.
Using Biological Cyberinfrastructure Scaling Science and People: Applications in Data Storage, HPC, Cloud Analysis, and Bioinformatics Training Scaling.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Objectives.
Presented by: Tina Chargois. Online learning is becoming more and more prevalent, and offers a host of new opportunities for today’s students.
RNA-Seq 2013, Boston MA, 6/20/2013 Optimizing the National Cyberinfrastructure for Lower Bioinformatic Costs: Making the Most of Resources for Publicly.
Introduction to network management. INTRODUCTION ● Course Overview ● Course Objectives.
Microsoft Azure Storage. Networking Compute Storage Virtual Machine Operating System Applications Data & Access Runtime Provision.
RAL PPD Computing A tier 2, a tier 3 and a load of other stuff Rob Harper, June 2011.
David R. McWilliams, Ph.D. Section of Statistical Genetics, Department of Biostatistical Sciences, Center for Public Health Genomics Bioinformatician IV.
The GOBLET Training Portal Manuel The GOBLET Training Portal Manuel
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
Bioinformatics Core Facility Guglielmo Roma January 2011.
Celine DONDEYNAZ, Joint Research Centre- Italy A. Leone, C. Carmona, P. Mainardi, M.Giacomassi and Prof. Daoyi Chen A Web knowledge Management Platform.
SEEK Welcome Malcolm Atkinson Director 12 th May 2004.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Atmosphere.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
Clouds in Bioinformatics Rob Knight HHMI and University of Colorado at Boulder.
Biomedical Big Data Training Collaborative biobigdata.ucsd.edu BBDTC UPDATES Biomedical Big Data Training Collaborative biobigdata.ucsd.edu.
Bio-Linux 3.0 An integrated bioinformatics solution for the EG community ClustalX showing DNA polymerase alignment GeneSpring showing yeast transcriptome.
Introduction: Cloud, Linux and basic skills Mick Watson Director of ARK-Genomics The Roslin Institute.
Microsoft Azure Active Directory. AD Microsoft Azure Active Directory.
| nectar.org.au NECTAR TRAINING Module 1 Overview of cloud computing and NeCTAR services.
| nectar.org.au NECTAR TRAINING Module 5 The Research Cloud Lifecycle.
Bio-IT World Conference and Expo ‘12, April 25, 2012 A Nation-Wide Area Networked File System for Very Large Scientific Data William K. Barnett, Ph.D.
Galaxy Community Conference July 27, 2012 The National Center for Genome Analysis Support and Galaxy William K. Barnett, Ph.D. (Director) Richard LeDuc,
‘BigExcel’ A Web-Based Framework for Exploring Big Data in Social Sciences Asif Saleem, Blesson Varghese and Adam Barker University of St Andrews, UK
1 The Cloud and Desktop as a Service as a teaching tool for different research communities David Wallom Oxford e-Research Centre.
TOWARDS A FRENCH -SCIENCE ? Results of the e-Biogenouest project ( ) Coordination : Olivier Collin – Yvan Le Bras (IRISA) e -Test an e-Science.
European Life Sciences Infrastructure for Biological Information EGI 2015, Lisbon, 18 May 2015 Rafael C Jimenez, ELIXIR CTO ELIXIR.
ENEA GRID & JPNM WEB PORTAL to create a collaborative development environment Dr. Simonetta Pagnutti JPNM – SP4 Meeting Edinburgh – June 3rd, 2013 Italian.
European Grid Initiative The EGI Federated Cloud as Educational and Training Infrastructure for Data Science Tiziana Ferrari/ EGI.eu.
READ ME FIRST Use this template to create your Partner datasheet for Azure Stack Foundation. The intent is that this document can be saved to PDF and provided.
Accessing the VI-SEEM infrastructure
The CLoud Infrastructure for Microbial Bioinformatics
Volunteer Computing for Science Gateways
Tools and Services Workshop
Joslynn Lee – Data Science Educator
Bridges and Clouds Sergiu Sanielevici, PSC Director of User Support for Scientific Applications October 12, 2017 © 2017 Pittsburgh Supercomputing Center.
Bioinformatics Community of CNGrid A New Approach to Utilizing Grids
NBIC Galaxy to Strengthen the Bioinformatics Community in the Netherlands Hailiang Mei David van Enckevort
FICEER 2017 Docker as a Solution for Data Confidentiality Issues in Learning Management System.
Manchester HEP group Network, Servers, Desktop, Laptops, and What Sabah Has Been Doing Sabah Salih.
Presentation During the 9th Annual Heads of Institutions Forum
The Institute of Quantitative Social Science
Storing and Accessing G-OnRamp’s Assembly Hubs outside of Galaxy
Irene-Angelica Chounta Senior Researcher
SHARCNET More than just HPC.
Presentation transcript:

Cloud infrastructure for training in Life Sciences Manuel Corpas The Genome Analysis Centre

[egi.edu] The Genome Analysis Centre The Genome Analysis

The Genome Analysis Centre The Genome Analysis

Bottleneck is NOT Production of data Technology Budget The Genome Analysis Centre The Genome Analysis

Bottleneck IS TRAINING! The Genome Analysis Centre The Genome Analysis

Bottleneck IS TRAINING! – Bioinformatics The Genome Analysis Centre The Genome Analysis

Bioinformatics Training The Genome Analysis Centre The Genome Analysis

Mick Watson Roslin Institute The Genome Analysis Centre The Genome Analysis

1.Most bioinformaticians are bad scientists The Genome Analysis Centre The Genome Analysis

1.Most bioinformaticians are bad scientists 2.Most biologists are bad bioinformaticians: poor computer skills, bad at maths/statistics The Genome Analysis Centre The Genome Analysis

1.Most bioinformaticians are bad scientists 2.Most biologists are bad bioinformaticians: poor computer skills, bad at maths/statistics 3.Short courses benefit no-one The Genome Analysis Centre The Genome Analysis

Carole Goble University of Manchester The Genome Analysis Centre The Genome Analysis

Students and trainers don’t like learning how to use new things The Genome Analysis Centre The Genome Analysis

Students and trainers don’t like learning how to use new things Trainees need to be eased in by using familiar stuff The Genome Analysis Centre The Genome Analysis

How can we bridge the gap? The Genome Analysis Centre The Genome Analysis

Bioinformatics Learning Tools iPython (analytics learning) Sanbox resources (galaxy instance with data) Repository of training machines Suite of VMs

Titus Brown Michigan State University The Genome Analysis Centre The Genome Analysis

1.Participants bring their laptops The Genome Analysis Centre The Genome Analysis

1.Participants bring their laptops 2.Pre installed machines The Genome Analysis Centre The Genome Analysis

1.Participants bring their laptops 2.Pre installed machines 3.Cloud computing The Genome Analysis Centre The Genome Analysis

Cloud + Bioinformatics + Training = The Genome Analysis Centre The Genome Analysis

Why Bioinformatics Training in the Cloud? The Genome Analysis Centre The Genome Analysis

3 Advantages The Genome Analysis Centre The Genome Analysis [Adapted from Titus Brown]

1.Participants can use own – Computers – Web browser 2.Graphical interaction via – X Windowes – IPython – Knitr 3.Compute can be scaled up/down depending on what it’s being taught The Genome Analysis Centre The Genome Analysis

1.Participants can use own – Computers – Web browser 2.Graphical interaction via – X Windows – IPython – Knitr 3.Compute can be scaled up/down depending on what it’s being taught The Genome Analysis Centre The Genome Analysis

1.Participants can use own – Computers – Web browser 2.Graphical interaction via – X Windowes – IPython – Knitr 3.Compute can be scaled up/down depending on what it’s being taught The Genome Analysis Centre The Genome Analysis

3 Challenges The Genome Analysis Centre The Genome Analysis [Adapted from Titus Brown]

1.Institutional resistance – Privacy of clinically sensitive data 2.Reliable network access and servers needed – > 30 people clicking at the same time! 3.Cost The Genome Analysis Centre The Genome Analysis

1.Institutional resistance – Privacy of clinically sensitive data 2.Reliable network access and servers needed – > 30 people clicking at the same time! 3.Cost The Genome Analysis Centre The Genome Analysis

1.Institutional resistance – Privacy of clinically sensitive data 2.Reliable network access and servers needed – > 30 people clicking at the same time! 3.Cost The Genome Analysis Centre The Genome Analysis

NM Trainee Trainer Registry The Genome Analysis Centre The Genome Analysis National eResearch Collaboration Tools and Resources (NeCTAR) Watson-Haigh et al. 2013

MRC UK Microbial Genomics Open Stack Each VM 32Gb RAM, 8 cores, 1Tb Biolinux The Genome Analysis Centre The Genome Analysis Nick Loman, University of Birmingham

Why Cloud? Very little technical knowledge required Snapshot ready for replication User can take instance home The Genome Analysis Centre The Genome Analysis

Cloud + Bioinformatics + Training = The Genome Analysis Centre The Genome Analysis

The Genome Analysis Centre The Genome Analysis Rafael Jiménez Titus Brown Mick Watson Carole Goble Nick Loman Vicky Schneider