Introduction to D4Science

Slides:



Advertisements
Similar presentations
ASCR Data Science Centers Infrastructure Demonstration S. Canon, N. Desai, M. Ernst, K. Kleese-Van Dam, G. Shipman, B. Tierney.
Advertisements

INTRODUCTION TO CLOUD COMPUTING CS 595 LECTURE 6 2/13/2015.
Hydra Partners Meeting March 2012 Bill Branan DuraCloud Technical Lead.
Distributed Database Management Systems
A Social Networking Research Environment for Scientific Data Sharing: The D4Science Offering M. Assante, L. Candela, D. Castelli, F. Mangiacrapa, P. Pagano.
SaaS, PaaS & TaaS By: Raza Usmani
Cloud Attributes Business Challenges Influence Your IT Solutions Business to IT Conversation Microsoft is Changing too Supporting System Center In House.
Customer Sales Presentation Stoneware webNetwork Powered by ThinkServer.
© Rheinmetall Defence 2013 The Geospatial Catalogue and Database Repository (GCDR) and the Knowledge Management System (KMS) Shane Reschke – Technical.
FIORANO SERVICE BUS The Cloud Enablement Platform
Massimiliano Assante – Leonardo Candela – Donatella Castelli – Pasquale Pagano Fourteenth International Conference on Grey Literature An Environment Supporting.
Interoperability ERRA System.
Technology Overview. Agenda What’s New and Better in Windows Server 2003? Why Upgrade to Windows Server 2003 ?  From Windows NT 4.0  From Windows 2000.
A detailed look at the Microsoft Windows Infrastructure at UWE including Active Directory (AD), MIIS, Exchange, SMS, IIS, SQL Server, Terminal Services.
material assembled from the web pages at
Microsoft SharePoint Server 2010 for the Microsoft ASP.NET Developer Yaroslav Pentsarskyy
Week 5 Lecture Distributed Database Management Systems Samuel ConnSamuel Conn, Asst Professor Suggestions for using the Lecture Slides.
4 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved. Computer Software Chapter 4.
Tools for collaboration How to share your duck tales…
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Mercury – A Service Oriented Web-based system for finding and retrieving Biogeochemical, Ecological and other land- based data National Aeronautics and.
IMarine and our contribution 1 Presentation methodology: PechaKucha 20x20 Andrea Manzi (CERN) Nick Drakopoulos (CERN) IT GT.
Building Scientific Workflows for the Fisheries and Aquaculture Management Community based on Virtual Research Environments Pedro Andrade (CERN)
Managing Virtual Research Environments in Hybrid Data Infrastructures Pasquale Pagano (CNR, Italy) iMarine Technical Director
Cloud Computing for Ecological Modeling in the D4Science Infrastructure A. Manzi (CERN), L. Candela, D. Castelli, G. Coro, P. Pagano, F. Sinibaldi (ISTI-CNR)
Realising Virtual Research Environments by Hybrid Data Infrastructures: the D4Science Experience Andrea Manzi (CERN) Leonardo Candela, Donatella Castelli,
Submitted to :- Neeraj Raheja Submitted by :- Ghelib A. Shuaib (Asst. Professor) Roll No : Class :- M.Tech(CSE) 2 nd Year.
IMarine: Accessing and Managing Biodiversity Data Pasquale Pagano (CNR) iMarine Technical Director
Virtual Research Environments as-a-Service Donatella Castelli CNR-ISTI EGI Conference 2016, 6-8 April.
1 Tutorial Outline 30’ From Content Management Systems to VREs 50’ Creating a VRE 80 Using a VRE 20’ Conclusions.
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
Prof. Jong-Moon Chung’s Lecture Notes at Yonsei University
Chapter 1 Characterization of Distributed Systems
Web GIS: Architectural Patterns and Practices
Accessing the VI-SEEM infrastructure
Experience in managing service portfolio: OpenAIRE, BlueBridge
The BlueBRIDGE project
Pasquale Pagano (CNR-ISTI) Project technical director
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING CLOUD COMPUTING
By: Raza Usmani SaaS, PaaS & TaaS By: Raza Usmani
Discovering and accessing data from a distributed network of data centres S. Mazzeo (ESA)
GISELA & CHAIN Workshop Digital Cultural Heritage Network
Clouds , Grids and Clusters
Distributed Cache Technology in Cloud Computing and its Application in the GIS Software Wang Qi Zhu Yitong Peng Cheng
EOSC MODEL Pasquale Pagano CNR - ISTI
Virtual Research Environments as-a-Service
Pasquale Pagano CNR – ISTI (Pisa, Italy)
Pasquale Pagano CNR, Italy
INTAROS WP5 Data integration and management
An Overview of Data-PASS Shared Catalog
Donatella Castelli CNR-ISTI
Flanders Marine Institute (VLIZ)
Joseph JaJa, Mike Smorul, and Sangchul Song
DI4R, 30th September 2016, Krakow
Microsoft SharePoint Server 2016
Grid Computing.
Recap: introduction to e-science
GSAF Grid Storage Access Framework
Introduction to Cloud Computing
Managing Clouds with VMM
Virtual Research Environments as-a-Service
EOSCpilot All Hands Meeting 8 March 2018 Pisa
Emerging technologies-
EMBRC - European Marine Biological Resource Center K. Deneudt, I. Nardello Pilot Blue Cloud Workshop March 28th, 2017 Brussels.
Cloud Computing: Concepts
Fundamental Concepts and Models
GISELA & CHAIN Workshop Digital Cultural Heritage Network
ArcGIS Online – The Road Ahead
A Research Data Catalogue supporting Blue Growth: the BlueBRIDGE case
Microsoft Virtual Academy
Presentation transcript:

Introduction to D4Science Massimiliano Assante ISTI - CNR Massimiliano.assante@cnr.it

The D4Science infrastructure Data Infrastructure combining over 500 software components into a coherent and centrally managed system of hardware, software, and data resources 18/11/2018 Introduction to Existing Technologies and Enabling Platform - M. Assante

D4Science Identity D4Science is connecting +3900 scientists in 44 countries integrating data from +50 heterogeneous providers executing +50,000 models & algorithms/month providing access to over a billion quality records in repositories worldwide operating with 99,7% service availability D4Science hosts 95 Virtual Research Environments to serve the biological, ecological, environmental, social mining, culture heritage, and statistical communities world-wide. 18/11/2018 Introduction to Existing Technologies and Enabling Platform - M. Assante

D4Science for Geeks uses cloud-computing technologies to manage +300 servers (more than 5 TB Ram, +1700 CPUs, 400 TB storage) is monitored via Nagios and Ganglia is maintained via Ansible (Application Deployment + Configuration Management + Continuous Delivery) is governed by deployment and operation policies has established SLA is operated according to defined Terms of Use 18/11/2018 Introduction to Existing Technologies and Enabling Platform - M. Assante

to host and maintain data Storage as Service to host and maintain data Database High-availability Standard Ready-to-use Cloud Storage Scalable Reliable Secure Geographical DB Scalable OGC Standard Privacy and Attribution 18/11/2018 Introduction to Existing Technologies and Enabling Platform - M. Assante

Applications as a Service to curate and manage data Metadata Generation Geospatial Data Biodiversity Data Statistical Data Textual Data Harmonization Disambiguate Validate Integrate and Consistency Check Data Exchange OGC protocols DarwinCore SDMX DublinCore 18/11/2018 Introduction to Existing Technologies and Enabling Platform - M. Assante

to process and extract knowledge Rich and Heterogeneous Computing as Service to process and extract knowledge Scalable Easy to Manage Across Boundaries Tailored Elastic Assignment of Computing Assignment of Processors Virtual Research Environment Rich and Heterogeneous High Throughput Map-Reduce Parallel R 18/11/2018 Introduction to Existing Technologies and Enabling Platform - M. Assante

Born from the user needs to host applications in a secure and scalable environment to maintain and preserve data to securely delivery data to known users Capacities to analyze datasets to manage the full data life-cycle from import to validation, curation, harmonization and publication to reduce the costs of data maintenance Applications to access authoritative datasets to mash-up data to analyse big datasets to validate datasets and provide a standard access to them Data 18/11/2018 Introduction to Existing Technologies and Enabling Platform - M. Assante

Virtual Research Environment to access, share and collaborate Share Database Tables Workflow Files Communicate Post Favourite Connection Organize Dynamic VRE Creation Secure Policy Control 18/11/2018 Introduction to Existing Technologies and Enabling Platform - M. Assante

Virtual Research Environment a distributed and dynamically created environment where subset of resources (data, services, computational, and storage resources) regulated by tailored policies are assigned to a subset of users via interfaces for a limited timeframe at little or no cost for the providers of the participatory data e-infrastructures L. Candela, D. Castelli, P. Pagano (2013) Virtual Research Environments: An Overview and a Research Agenda. Data Science Journal, Vol. 12 18/11/2018 Introduction to Existing Technologies and Enabling Platform - M. Assante

Supported projects – full list at https://services. d4science 18/11/2018 Introduction to Existing Technologies and Enabling Platform - M. Assante

References / Links D4Science Web Site: http://www.d4science.org D4Science for developers: http://dev.d4science.org Catalogue of Applications https://www.gcube-system.org/catalogue-of-applications Software Key Features https://wiki.gcube-system.org/GCube_Features Developer Guide https://wiki.gcube-system.org/Developer%27s_Guide FeatherWeightStack https://wiki.gcube-system.org/Featherweight_Stack SmartGears https://wiki.gcube-system.org/SmartGears gCube APIs https://wiki.gcube-system.org/GCube_Application_Programming_Interface Administration Guide https://wiki.gcube-system.org/Administrator%27s_Guide 18/11/2018 Introduction to Existing Technologies and Enabling Platform - M. Assante