EGI FedCloud in Digital Humanities

Slides:



Advertisements
Similar presentations
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI AAI in EGI Status and Evolution Peter Solagna Senior Operations Manager
Advertisements

DARIAH-ERIC Towards a sustainable social and technical European eResearch Infrastructure for the Arts and Humanities DARIAH-ERICDARIAH-ERIC VCC1 e –Infrastructures.
EGI-Engage EGI-Engage Engaging the EGI Community towards an Open Science Commons Project Overview 9/14/2015 EGI-Engage: a project.
SCI-BUS is supported by the FP7 Capacities Programme under contract nr RI CloudBroker Platform integration into WS-PGRADE/gUSE Zoltán Farkas MTA.
The role of Parthenos for CLARIN ERIC Steven Krauwer CLARIN ERIC Executive Director 1.
CLARIN Infrastructure Vision (and some real needs) Daan Broeder CLARIN EU/NL Max-Planck Institute for Psycholinguistics.
European Grid Initiative Federated Cloud update Peter solagna Pre-GDB Workshop 10/11/
MTA SZTAKI Department of Distributed Systems The problems of persistent identifiers in the context of the National Digital Data Archives of Hungary András.
26/05/2005 Research Infrastructures - 'eInfrastructure: Grid initiatives‘ FP INFRASTRUCTURES-71 DIMMI Project a DI gital M ulti M edia I nfrastructure.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Evolution of AAI for e- infrastructures Peter Solagna Senior Operations Manager.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI strategy and Grand Vision Ludek Matyska EGI Council Chair EGI InSPIRE.
Storing digital assets on Grid/EGI FedCloud with gLibrary Giuseppe La Rocca, INFN DARIAH ERIC.
Overview of the global architecture Giacinto DONVITO INFN-Bari.
Storing digital assets on Grid/EGI FedCloud with gLibrary Giuseppe La Rocca, INFN DARIAH ERIC.
EGI-Engage is co-funded by the Horizon 2020 Framework Programme of the European Union under grant number DARIAH Competence Centre e-Infrastructure.
Ljubljana, 22 nd April 2015EGI DARIAH CC Kick-off meeting1 EGI DARIAH Competence Centre Project logistics and activity plan Karolj Skala, Davor Davidović.
EGI-Engage EGI Webinar - Introduction - Gergely Sipos EGI.eu / MTA SZTAKI 6/26/
European Grid Initiative The EGI Federated Cloud as Educational and Training Infrastructure for Data Science Tiziana Ferrari/ EGI.eu.
DARIAH EU AAI consideration K. Skala, D. Davidović, Z. Šojat Lisbon, 22 May 2015.
EGI-InSPIRE EGI-InSPIRE RI EGI strategy towards the Open Science Commons Tiziana Ferrari EGI-InSPIRE Director at EGI.eu.
EGI-Engage is co-funded by the Horizon 2020 Framework Programme of the European Union under grant number Federated Cloud Update.
EGI-InSPIRE RI An Introduction to European Grid Infrastructure (EGI) March An Introduction to the European Grid Infrastructure.
Networks ∙ Services ∙ People Di4R Network. Services. People. GÉANT 28 th September, Krakow.
The EGI Federated Cloud
Accessing the VI-SEEM infrastructure
The EGI Training Infrastructure
B. Piringer R. Barbera, A. Calanducci, C. Carrubba, D. Davidovic, G
Overview of the global architecture
User Engagement in EGI (With focus on the cloud)
Federated Cloud Computing
eduTEAMS platform for collaboration Niels Van Dijk
Supporting Research on Biodiversity: LifeWatch on the Cloud
IaaS Layer – Solutions for “Enablers”
Technical Meeting with CNR and INAF 7 October 2014
Defining and tracking requirements for New Communities
Donatella Castelli CNR-ISTI
KER - Open Data Platform
EGI/EUDAT/INDIGO-DataCloud Joint project proposal for EINFRA-12 A
DI4R, 30th September 2016, Krakow
WS-PGRADE for Molecular Sciences and XSEDE
Introduction to EGI; Training activities and plans
EGI-Engage Engaging the EGI Community towards an Open Science Commons
Pre-OMB meeting Preparation for the Workshop “EGI towards H2020”
An easier path? Customizing a “Global Solution”
Status report of the LToS platform
Connecting the European Grid Infrastructure to Research Communities
EGI – Organisation overview and outreach
The EGI Federated Cloud
EGI Webinar - Introduction -
DARIAH requirements and roadmap in EGI
Ruđer Bošković Institute, Croatia
DARIAH Competence Centre: architecture and activity summary
LifeWatch Cloud Computing Workshop
Conference: Data and Life Sci +DC
Ruđer Bošković Institute, Croatia
The SADE mini-project of the EGI DARIAH Competence Centre
Platform for the long tail of science
Common Solutions to Common Problems
Pre-OMB meeting Preparation for the Workshop “EGI towards H2020”
Integrating social science data in Europe
VCC 4 General VCC meeting, 2/3 April 2012, Utrecht, The Netherlands
Brian Matthews STFC EOSCpilot Brian Matthews STFC
VCC 2 General VCC meeting, 2/3 April 2012, Utrecht, The Netherlands
Virtual Competency Centre 1: e-Infrastructure General VCC meeting, 2/3 April 2012, Utrecht, The Netherlands Karlheinz Moerth (Co-head of VCC 1, Austria)
DARIAH – Competence Centre in a nutshell
User Support in EGI Reactive and proactive services
Support services for EGI portal-* communities
Expand portfolio of EGI services
EOSC-hub Contribution to the EOSC WGs
LifeWatch AARC Pilot Fernando Aguilar 13th FIM4R Workshop
Presentation transcript:

EGI FedCloud in Digital Humanities Davor Davidović Ruđer Bošković Institute DARIAH Competence Centre, EGI-Engage

Digital Arts and Humanities Search Browse Access Annotate Archive STORAGE digitization storing analysis COMPUTE DI4R conference, Krakow, 28-30.09.2016.

DI4R conference, Krakow, 28-30.09.2016. What is DARIAH-ERIC? DARIAH, the Digital Research Infrastructure for the Arts and Humanities… …aims to enhance and support digitally-enabled research and teaching across the humanities and arts. It is a connected network of tools, information, people and methodologies for investigating, exploring and supporting research across the digital arts and humanities for researchers and humanists. DI4R conference, Krakow, 28-30.09.2016.

DARIAH Organization Virtual Competency Centres 20 Working Groups: VCC e-Infrastructure VCC Research and Education VCC Scholarly Context Management VCC Advocacy 20 Working Groups: Text and Data Analytics Natural Language Processing Training and Education Digital Annotation Visual Media Guidelines and Standards … Dynamic and flexible units with specific goals and outcomes, related to one or more VCCs Cover strategic areas and topics, provide sustainability and incorporate the outcomes of working groups DI4R conference, Krakow, 28-30.09.2016.

DARIAH resources today DARIAH is not a service provider and does not provide any compute nor storage resources, so... Scattered resources: Local, institutional, national, public, different access policies,… Allocated through project : HaS, Ariadne, Cendari, NeDiMAH sustainability? Limited usage of the cloud technologies National providers (e.g. DARIAH-DE) Small number of available cloud-based services/applications (e.g. in EGI FedCloud) In-kind contributions DI4R conference, Krakow, 28-30.09.2016.

DI4R conference, Krakow, 28-30.09.2016. A&H requirements Storage and data capacities Digital repositories and archives Long-term data retention Compute resources Simple access, AAI DARIAH IdP, eduGain Training and education on using e-Infrastructure DI4R conference, Krakow, 28-30.09.2016.

DI4R conference, Krakow, 28-30.09.2016. EGI European 32 countries (National Grid Initiative) Grid Federating IT services Infrastructure compute power, storage, applications Clouds, grids, clusters,... Sustainability sustainable operations Project driven-innovation EGI-Engage, Indigo-DataCloud, AARC, etc... DI4R conference, Krakow, 28-30.09.2016.

What EGI offers to user communities? Technical support Compute resources User-specific applications Base services: AAI, monitoring, service registry Storage resources DI4R conference, Krakow, 28-30.09.2016.

EGI-Engage – DARIAH Competence Centre Widen the usage of the Federated (cloud) services for A&H research Objectives: Strengthening the collaboration between EGI and DARIAH Increasing the number of cloud-based services and applications for A&H running Raising the awareness of benefits of using e-Infrastructure in A&H Providing access to EGI FedCloud resources Technical support DI4R conference, Krakow, 28-30.09.2016.

DI4R conference, Krakow, 28-30.09.2016. DARIAH-CC workplan Development phase Dissemination phase Provide direct access to compute and storage resources (VMs, block storage,…) Establish DARIAH VO Develop selected services and applications AAI – EduGain, OpenID Build demonstrators/examples Dissemination actions workshops, presentations, events,... Training and education Technical support Engaging new use cases from A&H DI4R conference, Krakow, 28-30.09.2016.

Available (planned) FedCloud resources Virtual organization: vo.dariah.eu EGI-DARIAH SLA: 1/4/2016 – 1/9/2017 GWDG (DE) MTA SZTAKI (HU) VCPU 30 Memory 70 GB Storage 2 TB INFN-Bari (IT) SRCE (CRO) INFN-Catania (IT) DI4R conference, Krakow, 28-30.09.2016.

DARIAH-CC software stack Outreach, training, user support Community Apps. and services Optical Character Recognition system DBO@Cloud New app & services Semantic Search Engine Training platform Cloud Access DARIAH Science Gateway WS-PGRADE CDSTAR Technologies Federated DARIAH resources AAI EGI FedCloud infrastructure DARIAH Virtual Organisation e-Infrastructure DI4R conference, Krakow, 28-30.09.2016.

Services for A&H Generic services End-user oriented, non-specific applications: DH Gateway, PSSE, Cloud Access, File Transfer FedCloud services, applications and tools Developers services App developer and service provider oriented, development services: gLibrary, CDSTAR, WS-PGRADE Demonstrators End-user, specific research groups, specific use-cases: DBO@Cloud, OCR DI4R conference, Krakow, 28-30.09.2016.

DARIAH Science Gateway Central access point for FedCloud resources User login via EduGain (in progress) –> DARIAH IdP, OpenID Access to DARIAH VO resources Based on Liferay and WS-PGRADE/gUSE Generic services file transfer, Cloud Access Specific A&H services PSSE, SSE, DBO@Cloud, OCR (beta) DI4R conference, Krakow, 28-30.09.2016.

DARIAH Science Gateway Identity provider 1 Identity provider 2 DARIAH Science Gateway PSSE Portlet OCR Portlet gLibrary Portlet APP Portlet APP Portlet DCI Bridge Robot certificate EGI FedCloud DI4R conference, Krakow, 28-30.09.2016.

Parallel Semantic Search Engine (PSSE) Parallel search across Open Access repositories Search and semantically correlate contents in geographically distributed digital repositories across several different domain DI4R conference, Krakow, 28-30.09.2016.

DI4R conference, Krakow, 28-30.09.2016. Simple cloud services Simple Cloud Access IaaS – drag&drop app running on FedCloud OpenStack from INFN Bari and Catania FileTransfer – DataAvenue DI4R conference, Krakow, 28-30.09.2016.

DI4R conference, Krakow, 28-30.09.2016. Developers services Repository framework developed by INFN Access to existing and the creation of a new repositories via REST API “Tool” for creating and managing repositories Common data storage Architecture (GWDG) Provides system that can store, modify, search and access structured and unstructured data Already in-use by DARIAH-DE, but not on FedCloud! DI4R conference, Krakow, 28-30.09.2016.

DI4R conference, Krakow, 28-30.09.2016. Use case 1: DBO@Cloud The Virtual Dialect Dictionary 100+ year old collection of Bavarian dialects from Austro-Hungarian monarchy 50,000+ records (provided by Austrian Academy of Science) Organize, search, store and retrieve digital assets Traget users: lexicographers Link: https://dariah-gateway.lpds.sztaki.hu/web/guest/dbo-portlet DI4R conference, Krakow, 28-30.09.2016.

Use case 2: Optical Character Recognition Digitalization of a large collection of scanned or pictured documents with a search option Based on the CDSTAR framework for data storing and analyzing OCR  for Big Data problems MapReduce parallelization model SaaS – installation possible on any cloud site Beta version DI4R conference, Krakow, 28-30.09.2016.

DI4R conference, Krakow, 28-30.09.2016. Indigo-DataCloud INtegrating Distributed data Infrastructures for Global ExplOitation 04.2015. – 09.2017. Goal: develop a sustainable PaaS Cloud solution for e-Science 26 partners, 11 counties 11 scientific communities DI4R conference, Krakow, 28-30.09.2016.

INDIGO DARIAH repository platform Platform for easy creation of new repositories Provide simple deploying and hosting of the Open Access repository solutions in the Cloud: Invenio, ePrints, Islandora, OAR (docker images) Does not require technical knowledge, makes deploying repo in the Cloud simple Beneficiaries: small groups, individuals, A&H-related projects Under development DI4R conference, Krakow, 28-30.09.2016.

DARIAH repository - scheme DI4R conference, Krakow, 28-30.09.2016.

Who does need these services? Who are the beneficiaries of the services in DARIAH? COMMUNITIES RESEARCHERS SERVICE PROVIDERS Services and applications (OCR, DBO@Cloud, PSSE) DH Science Gateway Frameworks, engines (gLibrary, CDSTAR) Researcher -> Working groups Communities -> DARIAH-related projects Service providers -> existing DARIAH resource providers DI4R conference, Krakow, 28-30.09.2016.

Future plans for FedCoud in A&H Integration of the service into the Gateway Finish registering the Gateway as EduGain SP Support DARIAH via “Cloud infra” working group Engage new use-cases and user community to explore: Existing services: PSSE, OCR Provide new user-specific services Prepare demonstrators: DBO@Cloud, OCR Workshop (hands-on) App developers Indigo repository, gLibrary, CDSTAR End-users PSSE, DBO@Cloud, OCR, Cloud Access DI4R conference, Krakow, 28-30.09.2016.