Building Scientific Workflows for the Fisheries and Aquaculture Management Community based on Virtual Research Environments Pedro Andrade (CERN)

Slides:



Advertisements
Similar presentations
Ambienti Applicativi Virtuali: La Tecnologia D4Science Pasquale Pagano -> Leonardo Candela Istituto di Scienza e Tecnologie dellInformazione A. Faedo Incontro.
Advertisements

United Nations Spatial Data Infrastructure Dr Kristin Stock Social Change Online and Centre for Geospatial Science, University of Nottingham.
Integrated Capture Information System Objectives and Context Sachiko Tsuji, Marc Taconet, Nicolas Bailly, Yves Jaques ICIS Requirements Gathering
User Communities scenarios and achievements Marc Taconet Anton Ellenbroek FAO – Fisheries and Aquaculture Department Nicolas Bailly.
D4Science: a Data Infrastructure Ecosystem for Science DL.org Autumn School – Athens, 3-8 October 2010 Leonardo Candela 6 th October 2010.
D4Science technical features and opportunities of the grid infrastructure for large scale data management Pasquale Pagano D4Science Technical.
Infrastrutture per Biblioteche e “Repositories” Digitali Donatella Castelli.
D4Science Project (DILIGENT For Science) Donatella Castelli CNR-ISTI DRIVER Summit January 2008 Gottingen (Germany)
Community fair Introduction of the D4Science communities, their challenges and the synergies M. Taconet (FAO) - chair L. Fusco (ESA), N. Bailly (WFC),
Data Sources & Using VIVO Data Visualizing Scholarship VIVO provides network analysis and visualization tools to maximize the benefits afforded by the.
A Social Networking Research Environment for Scientific Data Sharing: The D4Science Offering M. Assante, L. Candela, D. Castelli, F. Mangiacrapa, P. Pagano.
Final Review Meeting 16 th March 2010 Brussels (Belgium) D4Science Production Infrastructure Contract n°: RI Pedro Andrade (CERN)
RDA Wheat Data Interoperability Working Group Outcomes RDA Outputs P5 9 th March 2015, San Diego.
1/ 27 The Agriculture Ontology Service Initiative APAN Conference 20 July 2006 Singapore.
Massimiliano Assante – Leonardo Candela – Donatella Castelli – Pasquale Pagano Fourteenth International Conference on Grey Literature An Environment Supporting.
1 Common Challenges Across Scientific Disciplines Laurence Field CERN 18 th November 2013.
D4Science-II Preparation Meeting 28 th 2009 CERN (Geneva) D4Science Objectives, Status, and Plans Pedro Andrade
Updates from EOSDIS -- as they relate to LANCE Kevin Murphy LANCE UWG, 23rd September
Presentation Outline (hidden slide) Technical Level: 100 Intended Audience: TDMs, ITPros, ITDMs, BI specialists Objectives (what do you want the audience.
material assembled from the web pages at
Dimitris Koureas, PhD Natural History Museum London Linking layers of biodiversity data: Informatics challenges for the long tail research RDA - Long Tail.
Introduction to iMarine and it’s challenges Alexandros Antoniadis (NKUA) John Gerbesiotis (NKUA)
Making Connections: SHARE and the Open Science Framework Jeffrey Open Repositories 2015.
Virtual Research Environments: e-Infrastructures beyond Digital Libraries Pasquale Pagano CNR-ISTI RCDL08 Conference Information.
Donatella Castelli CNR-ISTI
A DΙgital Library Infrastructure on Grid EΝabled Technology ETICS Usage in DILIGENT Pedro Andrade
Issues in (Financial) High Performance Computing John Darlington Director Imperial College Internet Centre Fast Financial Algorithms and Computing 4th.
Supporting eScience Communities: The D4Science Perspective on EGI Pasquale Pagano D4Science TD User Community Transition from.
Scenarios for a Learning GRID Online Educa Nov 30 – Dec 2, 2005, Berlin, Germany Nicola Capuano, Agathe Merceron, PierLuigi Ritrovato
LifeWatch E-Science and Observatory Infrastructure for Biodiversity & Ecosystem Science Olaf Bánki.
A Practical Approach to Metadata Management Mark Jessop Prof. Jim Austin University of York.
Breakout # 1 – Data Collecting and Making It Available Data definition “ Any information that [environmental] researchers need to accomplish their tasks”
1 The Integrated fisheries Capture Information System (ICIS) Marc Taconet EGEE Istambul - 25 th September 2008 Scientific Data Infrastructure.
Pro’s and Con’s of Cloud Computing for Scientific Data – the (on-going) experience of D4Science Andrea Manieri Engineering Ingegneria Informatica s.p.a.
AquaMaps: Mapping Biodiversity Hotspots and Assessing Impacts of Climate Change K.Kaschner (FAO & Albert-Ludwigs- University of Freiburg), M. Taconet (FAO),
IMarine and our contribution 1 Presentation methodology: PechaKucha 20x20 Andrea Manzi (CERN) Nick Drakopoulos (CERN) IT GT.
7. Grid Computing Systems and Resource Management
Fire Emissions Network Sept. 4, 2002 A white paper for the development of a NSF Digital Government Program proposal Stefan Falke Washington University.
On the D4Science Approach Toward AquaMaps Richness Maps Generation Pasquale Pagano - CNR-ISTI Pedro Andrade.
Managing Virtual Research Environments in Hybrid Data Infrastructures Pasquale Pagano (CNR, Italy) iMarine Technical Director
User scenario on Marine Biodiversity AquaMaps Pasquale Pagano National Research Council (CNR) – ISTI Italy.
D4Science and ETICS Building and Testing gCube and gCore Pedro Andrade CERN EGEE’08 Conference 25 September 2008 Istanbul (Turkey)
Example projects using metadata and thesauri: the Biodiversity World Project Richard White Cardiff University, UK
Managing deployment and activation of Web Applications in a distributed e-Infrastructure EGI Technical Forum September 2011 Lyon
Cloud Computing for Ecological Modeling in the D4Science Infrastructure A. Manzi (CERN), L. Candela, D. Castelli, G. Coro, P. Pagano, F. Sinibaldi (ISTI-CNR)
The Global Scene Wouter Los University of Amsterdam The Netherlands.
Realising Virtual Research Environments by Hybrid Data Infrastructures: the D4Science Experience Andrea Manzi (CERN) Leonardo Candela, Donatella Castelli,
WP3 Biomapping status quo WP3: NRM, CDF, CEFAS, DINARA, WCS.
D4Science: Opening data Infrastructures to boost education and knowledge Pasquale Pagano D4Science Technical Director CNR - ISTI EGI Conference.
Mapping Reference Codelists Claudio Baldassarre, Yann Laurent, Aureliano Gentile.
Pedro Andrade > IT-GD > D4Science Pedro Andrade CERN European Organization for Nuclear Research GD Group Meeting 27 October 2007 CERN (Switzerland)
Project progress and outlook Donatella Castelli CNR-ISTI iMarine Board, March 2013, FAO (Rome)
Virtual Research Environments as-a-Service Donatella Castelli CNR-ISTI EGI Conference 2016, 6-8 April.
InSilicoLab – Grid Environment for Supporting Numerical Experiments in Chemistry Joanna Kocot, Daniel Harężlak, Klemens Noga, Mariusz Sterzel, Tomasz Szepieniec.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
1 Tutorial Outline 30’ From Content Management Systems to VREs 50’ Creating a VRE 80 Using a VRE 20’ Conclusions.
EGI-InSPIRE RI EGI Compute and Data Services for Open Access in H2020 Tiziana Ferrari Technical Director, EGI.eu
Special Interest Groups - a Grid Service Dr. Algimantas Juozapavicius Vilnius University EGEE’06, Geneva, September.
Enhancements to Galaxy for delivering on NIH Commons
e-Infrastructure Integration with gCube
The BlueBRIDGE project
Pasquale Pagano (CNR-ISTI) Project technical director
Virtual Research Environments as-a-Service
Pasquale Pagano CNR – ISTI (Pisa, Italy)
Pasquale Pagano CNR, Italy
Introduction to D4Science
Virtual Research Environments as-a-Service
Bird of Feather Session
A Research Data Catalogue supporting Blue Growth: the BlueBRIDGE case
Microsoft Azure Data Catalog
Presentation transcript:

Building Scientific Workflows for the Fisheries and Aquaculture Management Community based on Virtual Research Environments Pedro Andrade (CERN) Leonardo Candela (CNR) Andrea Manzi (CERN) Pasquale Pagano (CNR) 5 th EGEE User Forum 14 April 2010 Uppsala (Sweden) project |

2 Outline FARM Community  who is the community  what are their needs/requirements D4Science Infrastructure  management of heterogeneous resources  creation of virtual research environments FARM VREs  multiple environments for different applications  combining VREs as scientific workflows 14 th April 2010, Uppsala Building Scientific Workflows for the Fisheries and Aquaculture Management Community based on VREs

3 Fisheries and Aquaculture Research Management – Experts, scientists and researches from the UN FAO and other related agencies/organizations FARM Community 14 th April 2010, Uppsala Building Scientific Workflows for the Fisheries and Aquaculture Management Community based on VREs Production systems and fleets Resources of commercial interest Broader ecosystem Physical environment FAO WFC RFBs CoML / OBIS IOC IUCN Fisheries Biodiversity Environment GBIF

4 FARM Community The scientific daily activities of this community brought several challenging requirements to the D4Science project:  Collaborative working environments  Intensive on-demand data processing  Automatic reporting from templates  Combine biodiversity information with fisheries time series  Manipulation of very large time series data (harmonization)  Spatial dimension and mapping (GIS)  Consumption of data and derived products through dynamically defined application workflows  Support for annotations 14 th April 2010, Uppsala Building Scientific Workflows for the Fisheries and Aquaculture Management Community based on VREs

5 FARM Community They need an infrastructure enabler system providing advanced data management services through dynamically deployed specialized research environments  Automatically describe and promote harmonization and provenance information  Promote re-usage of data sets and derived products by different applications by offering rich publication and discovery tools  Can be easily enriched by community specific applications  Registration, publication, and discovery of applications as simple as registration, publication, and discovery of data 14 th April 2010, Uppsala Building Scientific Workflows for the Fisheries and Aquaculture Management Community based on VREs

6 D4Science Infrastructure D4Science-II provides an ecosystem of e-Infrastructure resources based on the concept of Virtual Research Environment. A VRE can be describe as: A distributed and dynamically created environment, where a subset of data, services, and hardware resources, are dynamically assigned to a subset of users, for a limited timeframe, at little or no cost for the providers of the infrastructure. VREs are enriched with facilities for communication, collaboration, and sharing among scientists and researchers. 14 th April 2010, Uppsala Building Scientific Workflows for the Fisheries and Aquaculture Management Community based on VREs

7 D4Science Infrastructure A VRE supports cooperative activities such as:  Data analysis and processing  Metadata and provenance generation  Data generation, integration, enrichment, and curation  Processes execution and optimization  Registration and execution of application specific tools 14 th April 2010, Uppsala Building Scientific Workflows for the Fisheries and Aquaculture Management Community based on VREs

8 D4Science Infrastructure 14 th April 2010, Uppsala Building Scientific Workflows for the Fisheries and Aquaculture Management Community based on VREs EGEE / EGI D4Science (gLite) (gCube) OBIS, GBIF AQUAMAPS

9 FARM VREs Three end user environments modelled as D4Science VREs  Species Prediction Modeling – AquaMaps VRE  Fishery Country Profiles – FCPPS VRE  Integrated Catch IS – ICIS VRE 14 th April 2010, Uppsala Building Scientific Workflows for the Fisheries and Aquaculture Management Community based on VREs Species Prediction Modeling Integrated Catch Information System FisheriesBiodiversity Species occurrence maps Fishing activity / Catch Fishery Country Profiles Fisheries comprehensive profiles

10 FARM VREs AquaMaps VRE Objective: Improve the production of species distribution and biodiversity maps for on-line presentation Features:  Parametric generation of species distribution and biodiversity maps  Improved generation time 14 th April 2010, Uppsala Building Scientific Workflows for the Fisheries and Aquaculture Management Community based on VREs One Multispecies map computed on 6,188 half degree cells (over 170k) and 2,540 species requires 125 millions computations One global map (extended to all species and cells around the world) requires about 400 billions computations

11 FARM VREs FCPPS VRE Objective: Increase visibility and accuracy of country profiles Features:  Improve timeliness  Life-cycle support (draft, review)  Reuse data and report-templates  Data integration across boundaries  Document production system 14 th April 2010, Uppsala Building Scientific Workflows for the Fisheries and Aquaculture Management Community based on VREs Reports include maps and graphs computed on-demand and updated directly in the report upon request Country-specific templates integrating 10 years of time series catch observations

12 FARM VREs ICIS VRE Objective: Improve the management of capture statistics data Features:  Share time series  Filter, merge, join data  Facilitate import and curation  Harmonized and reallocated catch statistics 14 th April 2010, Uppsala Building Scientific Workflows for the Fisheries and Aquaculture Management Community based on VREs 10 years catch data containing millions of observation points described by ~ 100 attributes

13 FARM VREs These VREs satisfy different end-user scenarios:  Requested by different FARM groups/users  Accessed from distributed locations at the same time However they are based on a common pool of resources:  Data collections are shared, enhanced, and re-used by multiple applications  Same distributed computing and storage facilities are dynamically allocated to different tasks  The set of base services enhance the VREs with full text search, content browsing, annotation, metadata management, etc 14 th April 2010, Uppsala Building Scientific Workflows for the Fisheries and Aquaculture Management Community based on VREs

14 D4Science Fisheries ICIS VRE FARM VREs GIS areas - species FAO RFBs Catch statistics Reference system Catch statistics Reference system Nafo Ices Fisheries FCPPS VRE Fishery ontology Reference System FAO Document Repository EEA reports OceanDocs Biodiversity AquaMaps VRE WFC Species occurence FishBase OBIS SealifeBase UBC EGEE LOCAL 14 th April 2010, Uppsala Building Scientific Workflows for the Fisheries and Aquaculture Management Community based on VREs

15 Conclusions D4Science provides VREs to the FARM community where different users integrate and process a common set of heterogeneous data collections using distinct applications D4Science fosters the collaboration among FARM community members by promoting the sharing of workflows and data FARM VREs are currently running in the D4Science infrastructure and being exploited by several FAO bodies 14 th April 2010, Uppsala Building Scientific Workflows for the Fisheries and Aquaculture Management Community based on VREs

Questions ? 14 th April 2010, Uppsala Building Scientific Workflows for the Fisheries and Aquaculture Management Community based on VREs