Norman Morrison, Senior Research Fellow, The University of Manchester. Biodiversity Virtual e-Laboratory: An e-Infrastructure and e-Science environment supporting research on biodiversity.

Presentation transcript:

Norman Morrison Senior Research Fellow, The University of Manchester Biodiversity Virtual e-Laboratory An e-Infrastructure and e-Science environment supporting research on biodiversity

Your learning objectives?

What is a Virtual e-Laboratory?
Like a physical laboratory:
– A place “inside computers” where you can analyse data and do digital experiments
– It’s equipped with everything you need: workflows, services, data and hardware

Scientific challenges in Environment
Source: W. Los

Part of a workflow to study the ecological niche of the Horseshoe crab (Limulus polyphemus)
Workflows, pipelines and other applications are built from “services”
Workflows allow scientists to run studies and experiments to process vast amounts of data, repeatedly
– Select and apply successive “services” (data analysis and processing steps)
– Import data from own research and/or from existing public sources
– Choose input parameters
Access a library of workflows
– Re-using existing workflows improves efficiency by reducing research time and overhead expenses
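The idea of a workflow as a chain of successive services can be sketched in a few lines of Python. This is only an illustration, not how Taverna or BioVeL implement workflows: the function names, the parameters and the occurrence records (including the coordinates) are all invented for the example.

```python
from typing import Callable, Dict, List

# A record is one (hypothetical) species occurrence; a service is any step
# that takes a list of records and returns a new list of records.
Record = Dict[str, object]
Service = Callable[[List[Record]], List[Record]]


def filter_by_species(species: str) -> Service:
    """Build a service that keeps only records for the given species."""
    def service(records: List[Record]) -> List[Record]:
        return [r for r in records if r["species"] == species]
    return service


def drop_missing_coordinates(records: List[Record]) -> List[Record]:
    """Service that removes records without a latitude or longitude."""
    return [r for r in records
            if r.get("lat") is not None and r.get("lon") is not None]


def within_latitude(min_lat: float, max_lat: float) -> Service:
    """Build a service that keeps records inside a latitude band."""
    def service(records: List[Record]) -> List[Record]:
        return [r for r in records if min_lat <= r["lat"] <= max_lat]
    return service


def run_workflow(records: List[Record], services: List[Service]) -> List[Record]:
    """Apply each service in order, feeding its output to the next one."""
    for service in services:
        records = service(records)
    return records


# Imported data: made-up occurrence records, for illustration only.
occurrences = [
    {"species": "Limulus polyphemus", "lat": 38.9, "lon": -75.1},
    {"species": "Limulus polyphemus", "lat": None, "lon": -70.0},
    {"species": "Carcinus maenas", "lat": 41.2, "lon": -70.5},
    {"species": "Limulus polyphemus", "lat": 55.0, "lon": -60.0},
]

# The workflow: selected services plus the chosen input parameters.
workflow = [
    filter_by_species("Limulus polyphemus"),
    drop_missing_coordinates,
    within_latitude(20.0, 45.0),
]

result = run_workflow(occurrences, workflow)
print(result)
```

Because the workflow is just an ordered list of services, it can be re-run on new data or shared and re-used with different parameters, which is the efficiency gain the slide refers to.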

Biodiversity occurs on different levels of biological organisation and at different spatial and time scales
Varying spatial scale
– from a single tree stump, forest stand, etc.
– to an entire landscape, country, or region
Ecosystems; species; DNA, proteins and genes
Across time and evolution
– some processes act very fast
– others take millions of years
Source: W. Los

Workflows driven by science and policy needs
CO2 emissions continuously increasing
– 10 GtC in 2010; sequestration is the sustainable process to mitigate the effects
Over the past 50 years, humans have changed ecosystems
– resulting in a substantial and largely irreversible loss of biodiversity
Invasions of alien species
– A leading cause of biodiversity loss and related economic damages. They degrade ecosystem services, generate human health problems and impact outdoor recreation.
– “transportation with ships is a high risk to spread the species to these spots” (Stelzer et al. 2013)
Source: NOAA

Workflows must be shareable and discoverable
Public groups
– Publishing workflows and results
Private groups
– Local materials
– Intra-project work and collaborations
8700 members, 318 groups, 2625 workflows, 674 files, 276 packs

Secure, scalable, reliable, and well-documented in a geographically distributed network of services
Users’ workflows and applications
Sustained Service and Data Providers
– GBIF, CoL, ITIS, OBIS, WoRMS, EBI, BGBM, CRIA, EoL, BHL, ALA, etc. + many more
Recognised and stable Resource Providers
– National, EGI.eu, PRACE, commercial, etc.

Services must be discoverable
A fully curated, well-founded catalogue of Web services for biodiversity science

Workflow maturity process
[Diagram; labels as extracted: Use Case Development Science and Tech Team Consolidation Documentation Disseminations Showcase Release Workshops Conferences Training Tutorial Publication Demos Play Days Documentation External Users Workflow Management Plan Management Internal Service Team External reviewers Internal]

Users need to be able to build and use workflows
[Diagram labels: Technical PAL, Science PAL, Domain Scientist; Taverna Workbench, Component Builder, Taverna Lite / Server, Taverna Player / Domain-Specific Website; Workflow Visibility; Concept; Knowledge: workflow design, compute; domain science; High, Low]

[Architecture diagram; labels as extracted: Workbench Interaction Server Server Servers Catalogues Repositories Expert Tools Run time Execution Services SHIM Data & Provenance Authentication Management System Deployment Infrastructure hosting, compute, storage Biodiversity Catalogue User facing Portal Lite Taverna Player]

3 V’s of Big Data – Volume, Variety, Velocity
Variety
– Finding, retrieving and cleaning data prior to analysis is often a time-consuming task
– Data is not always neatly structured the way you want it for your analysis tools
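To make the “Variety” point concrete, here is a minimal Python sketch of the kind of cleaning that often has to happen before analysis. It uses only the standard library; the column names, the sample rows and the cleaning rules are assumptions made up for this example and do not come from any BioVeL service.

```python
import csv
import io

# Made-up raw export with typical problems: stray spaces, inconsistent
# capitalisation, missing or non-numeric coordinates, and a duplicate row.
RAW = """Species , Latitude,Longitude
limulus polyphemus ,38.9,-75.1
Limulus Polyphemus,,-70.0
 LIMULUS POLYPHEMUS,abc,-71.2
Limulus polyphemus,38.9,-75.1
"""


def normalise_name(name):
    """Return a binomial name as 'Genus species' with single spaces."""
    parts = name.strip().split()
    if not parts:
        return ""
    return " ".join([parts[0].capitalize()] + [p.lower() for p in parts[1:]])


def clean(raw_text):
    """Parse the raw CSV text and return unique, usable (name, lat, lon) tuples."""
    reader = csv.DictReader(io.StringIO(raw_text))
    seen = set()
    cleaned = []
    for row in reader:
        # Tidy the header names and values so lookups are predictable.
        row = {k.strip().lower(): (v or "").strip()
               for k, v in row.items() if k is not None}
        try:
            lat = float(row["latitude"])
            lon = float(row["longitude"])
        except ValueError:
            continue  # skip rows whose coordinates are missing or not numbers
        record = (normalise_name(row["species"]), lat, lon)
        if record not in seen:  # drop exact duplicates
            seen.add(record)
            cleaned.append(record)
    return cleaned


print(clean(RAW))
```

Of the four raw rows, only one distinct, usable record survives, which illustrates why this step can take a large share of the time in a study.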

Learning objectives for the course
– How to build a simple workflow
– How to import / export data from a workflow
– How to use the Taverna engine to run your workflows
– An introduction to the BioVeL Portal
– What is a component… and how to create and use one
– Embedding an R script in a workflow
– Discovering and sharing workflows and services via myExperiment and Biodiversity Catalogue

Your learning objectives
– How will what you’ve learned on the course impact your future research?
– What benefits will using workflows have on your day-to-day activities?

BioVeL is funded by the European Commission 7th Framework Programme (FP7). It is part of its e-Infrastructures activity. BioVeL contributes to LifeWatch and GEO BON. BioVeL products are free to access. Under FP7, the e-Infrastructures activity is part of the Research Infrastructures programme, funded under the FP7 'Capacities' Specific Programme. It focuses on the further development and evolution of the high-capacity and high-performance communication network (GÉANT), distributed computing infrastructures (grids and clouds), supercomputer infrastructures, simulation software, scientific data infrastructures, e-Science services as well as on the adoption of e-Infrastructures by user communities.