EPOS e-Infrastructure

Slides:



Advertisements
Similar presentations
OMV Ontology Metadata Vocabulary April 10, 2008 Peter Haase.
Advertisements

© Keith G Jeffery, Anne G S Asserson GL 11 Washington Keith G Jeffery Director, IT & International Strategy, STFC
Theme 3: Architecture. Q1: Who houses stuff, both records and identifiers All useful services and repositories are centralized (latency, etc.) … but centralizing.
CNRIS CNRIS 2.0 Challenges for a new generation of Research Information Systems.
Connect. Communicate. Collaborate Click to edit Master title style MODULE 1: perfSONAR TECHNICAL OVERVIEW.
Information and Business Work
Massimo Cocco & Joern Lauterjung EPOS PP Council Rome September 19 th 2013 The Technical architecture Material prepared by WG7.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
This chapter is extracted from Sommerville’s slides. Text book chapter
The Preparatory Phase Proposal a first draft to be discussed.
Open Access to Grey Literature on e-Infrastructures: The BELIEF-II Project Digital Library Stefania Biagioni, Donatella Castelli, Franco Zoppi CNR-ISTI.
Using the Open Metadata Registry (openMDR) to create Data Sharing Interfaces October 14 th, 2010 David Ervin & Rakesh Dhaval, Center for IT Innovations.
Donatella Castelli CNR-ISTI
Interfacing Registry Systems December 2000.
19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick.
The european ITM Task Force data structure F. Imbeaux.
1 24 September BREAKOUT :30 1)Review of Metadata Standards Directory (DCC version and GitHub) 2)Introduction of Metadata Standards Catalog.
Linking Tasks, Data, and Architecture Doug Nebert AR-09-01A May 2010.
1 Metadata Coordinating Chairs Meeting Gaithersburg November Keith Jeffery, Rebecca Koskela, Jane Greenberg, Alex Ball, Brigitte Jörg, Bridget Almas,
Jamie Hall (ILL). SciencePAD Persistent Identifiers Workshop PANData Software Catalogue January 30th 2013 Jamie Hall Developer IT Services, Institut Laue-Langevin.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Presented by Scientific Annotation Middleware Software infrastructure to support rich scientific records and the processes that produce them Jens Schwidder.
Nicola Bertazzoni Marta Zaetta Integrating CRIS with other systems.
Presented by Jens Schwidder Tara D. Gibson James D. Myers Computing & Computational Sciences Directorate Oak Ridge National Laboratory Scientific Annotation.
Technical Update 2008 Sandy Payette, Executive Director Eddie Shin, Senior Developer April 3, 2008 Open Repositories 2008, Fedora User Group.
Interoperability from the e-Science Perspective Yannis Ioannidis Univ. Of Athens and ATHENA Research Center
Auditing Grey in a CRIS Environment
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Sync and Exchange Research Data b2drop.eudat.eu This work is licensed under the Creative Commons CC-BY 4.0 licence B2DROP EUDAT’s Personal.
Moodle Moot – August 2015 Nick Thompson, CCLE Coordinator CASA Community Application Sharing Architecture.
TCS-ICS interactions Kuvvet Atakan 1 and the WP6 and WP7 Teams 1 University of Bergen / Department of Earth Science.
DSpace - Digital Library Software
WP3 Harmonization & Integration J. Lauterjung & WP 3 Group.
A Resource Discovery Service for the Library of Texas Requirements, Architecture, and Interoperability Testing William E. Moen, Ph.D. Principal Investigator.
Title of the presentation | Date |1 Nikos Houssos National Documentation Centre (EKT/NHRF) CRIS for research information management.
Why are we here? projectsnational coordinationWorking groupsTCS.
Testing and Validation Kuvvet Atakan and the WP6 and WP7 Teams.
1 Chapter 12 Configuration management This chapter is extracted from Sommerville’s slides. Text book chapter 29 1.
EPOS IP Roadmap Massimo Cocco & PDB. EPOS IP project Timeline Implementation Validation Pre-operation.
EPOS IP Validation and pre-operational phases Massimo Cocco & PDB.
System Development & Operations NSF DataNet site visit to MIT February 8, /8/20101NSF Site Visit to MIT DataSpace DataSpace.
WP10 – GNSS Data & Products Rui Fernandes on behalf of WP10 members.
WP6 Technical Work J Lauterjung GFZ Potsdam. Objective The main objective is the development of a novel and efficient e- infrastructure concept addressing.
Collaborative Systems Developing Collaborative Systems with a Reuse Driven Process.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
Metadata Standards Directory Alex Ball, Jane Greenberg, Keith Jeffery, Rebecca Koskela.
Cloud-based e-science drivers for ESAs Sentinel Collaborative Ground Segment Kostas Koumandaros Greek Research & Technology Network Open Science retreat.
ISWG / SIF / GEOSS OOSSIW - November, 2008 GEOSS “Interoperability” Steven F. Browdy (ISWG, SIF, SCC)
CLARIN EUDAT2020 uptake plan Dieter Van Uytvanck CLARIN ERIC EUDAT User Forum, Rome.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No EPOS and EUDAT.
ISWG / SIF / GEOSS OOS - August, 2008 GEOSS Interoperability Steven F. Browdy (ISWG, SIF, SCC)
RI EGI-InSPIRE RI Earth science e-infrastructures workshop Diego Scardaci, EGI.eu Technical Outreach Expert.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Infrastructure and Workflow for the Formal Evaluation of Semantic Search Technologies Stuart N. Wrigley 1, Raúl García-Castro 2 and Cassia Trojahn 3 1.
The BlueBRIDGE project
Pasquale Pagano (CNR-ISTI) Project technical director
AAI for a Collaborative Data Infrastructure
INTAROS WP5 Data integration and management
The GEMBus Architecture and Core Components
Middleware independent Information Service
Steering Group Member, Link Digital
Chapter 18 MobileApp Design
BoF: VREs- Keith G Jeffery & Helen Glaves
INSPIRE Geoportal Thematic Views Application
EOSC services architecture
LOD reference architecture
Data Management Components for a Research Data Archive
SOA initiatives at Istat
A Research Data Catalogue supporting Blue Growth: the BlueBRIDGE case
EOSC-hub Contribution to the EOSC WGs
Presentation transcript:

EPOS e-Infrastructure Daniele Bailo, and the EPOS IT team EPOS Kick-off meeting – October 5, 2015 - Rome

What is an e-Infrastructure? Short (and ambiguous) answer: an infrastructure with some “electronic” e- e-road e-dam e-railway e-airport

e-Infrastructure for research Interworking of:* people and organisational structures digitally-based technology(hardware and software) resources (data, services, digital libraries,HPC) communications (protocols,access rights,network) GOAL: support modern, internationally leading collaborative research * [Research Councils UK (RCUK)]

EPOS e-Infrastructure people and organisational structures

EPOS e-Infrastructure resources (data, services, digital libraries) 244 Research Infrastructures 138 Institutions 22 countries 2272 GPS receivers 4939 seismic stations 464 TB Seismic data 1.095 PB Storage capacity (seismology) 1.240 PB Storage capacity (GNSS DC) 828 instruments in 118 Laboratories

e-Infrastructure for research What about: digitally-based technology (hardware and softw.) communications (protocols,access rights,network) ?

EPOS ARCHITECTURE ICS and TCS

Architecture Integrated Core Services (ICS) Interoperability layer Thematic Core Services (TCS)

Principles – 1. Interoperability System A  System B communication By means of common language (protocol) and interface to “translate” Protocol System A System B Interface Interface Service A Service B System + interface= service [Newman, Sam.”Building Microservices”,O'ReillyMedia,Inc.,2015]

Principles – 2. metadata Metadata describes an object without actually accessing it Discovery Contextualization Detailed processing * PROESSING INCLUDES SCHEMA INFORMATION TO CONNECT SOFTWARE TO DATA [RDA: the importance of metadata, Keith G Jeffery, Rebecca Koskela]

Integrated Core Services - ICS -

ICS - Integrated Core Services ICS-C and ICS-D ICS-Central Hub Orchestrates external resources (ICS-D) EPOS member states competition for hosting it ICS-Distributed services Provide IT resources not built by EPOS ICS-C Computational resources Visualization resources Others…

Interoperability Layer: ICS-APIs (wrappers) ICS-C EPOS GUI EPOS APIs ICS-D METADATA catalogue System Managing Software IAAA system Workflow Engine Interoperability Layer: ICS-APIs (wrappers) TCSs TCS API TCS API Data set Data products Services Software Data set Data products Services Software National RIs FR Seismic networks FR Seismic networks FR Seismic networks FR Seismic networks FR Seismic networks FR Seismic networks FR Seismic networks FR Seismic networks National RIs National RIs National RIs National RIs

ICS-C Generic Architecture Authentication and Authorization Layer (AAI) GUI Client API METADATA catalogue Accounting System System managing software Workflow Engine Resource connector ICS-D Connector TCS Connector

ICS-C Generic Architecture Authentication and Authorization Layer (AAI) GUI Client API METADATA catalogue Accounting System System managing software Workflow Engine Resource connector ICS-D Connector TCS Connector

System Managing Software It’s the software we have to develop. Bundle of software/technologies, e.g. Interfaces to other ICS modules: Integration with 3rd party tools for: caching tools (rabbitMQ) workflow tools Integrate and orchestrates all ICS components

ICS-C Generic Architecture Authentication and Authorization Layer (AAI) GUI Client API METADATA catalogue Accounting System System managing software Workflow Engine Resource connector ICS-D Connector TCS Connector

Metadata-Elements of EPOS ecosystem Researchers, research managers, innovators, media User Model User interaction with the system: AAAI, interaction workflow, multlinguality Processing Model Describes functions of processing environment etc. Data Model Describes research data and ancillary info (instituions) Resources: 1. data providers, 2. internal to ics, 3. other outside epos delivery framework. We describe: instruments, data storage resources, data processing and visualization Data model: 3 layers. Ongoing work: metadata collection, data taxonomy Processing model: 1.description of the functions available in each processing environment, 2. protocols to connect to them, 3. algorthms which deal with the composing of the processing User model: how a suer interacts with the system: iaaa, interaction workflow, multilinguality Resource Model Representing ICT resources Complete ICT environment for research

Metadata model 3-layers Manage the complexity CERIF (formal syntax, declared semantics) Functions -Discovery, selection -Impact assessment -Manage interoperability Issues (selection of) heterogeneity Many standards to be mapped from TCS Ontologies Complex work Web portal, Spatio-Temporal Search Discovery (DC) and (CKAN, eGMS) Generate Contextual (CERIF metadata model) Point to Detailed (community specific) Esempio carta identità domain specific – data (TCS or Naational Ris)

Contextual (CERIF metadata model) (http://www.eurocris.org/) Common European Research Information Format

Metadata Q&A Do TCSs need to adopt CERIF metadata standard? No, but provide “rich” metadata Do TCSs need to copy their metadata into the ICS metadata catalogue? Short answer: we need translation Long answer  next slides

ICS-C Generic Architecture Authentication and Authorization Layer (AAI) GUI Client API METADATA catalogue Accounting System System managing software Workflow Engine Resource connector ICS-D Connector TCS Connector

AAAI (Authentication, Authorisation, Accounting Infrastructure) Who (name) is accessing your data? How is your data being used? Can you measure the impact of your data on society? If yes, which is its value? What if I download your data and re-sell it? May I please download your data and replicate it on my personal commercial cloud? How do you manage privacy? Who is citing your data?

Great! Only one password to access to 7 systems! AAAI layer Purpose Authenticate users Authorize users Transparent access to TCS and ICS-D data & services Goal Only 1 login – Delegation Issues Many system, sometimes non interoperable EPOS-EGI-CC AARC, EUDAT AAAI ICS-C AAI ICS-D AAI AAI AAI TCS 1 TCS 2 …TCS n

AAAI Q&A Is my EduGAIN/X509/… AAAI system interoperable with ICS? EPOS ICS and other initiatives are working on AAAI interoperability I’m starting from scratch, can ICS help? Of course. Let’s talk with your IT crowd* Do I really need an AAAI? Do you really need a debit card? *

ICS-C Generic Architecture Authentication and Authorization Layer (AAI) GUI Client API METADATA catalogue Accounting System System managing software Workflow Engine Resource connector ICS-D Connector TCS Connector

Workflow engine Purpose Workflow represent a process (Algorithm, service, computation), a composition of atomic functions Tracks data transformation Allows versioning and reproducibility Generates provenance Comments Interoperability among various workflow engines VERCE contribution Rapid validation and monitoring

Workflow & provenance example Two years ago I analyzed 12 of those 341 “*.dat” files in my ./temp/specs/ folder… But ..hey.. I don’t remember WHICH files and HOW I PROCESSED THEM… …and made an image for my paper….

ICS-C Generic Architecture Authentication and Authorization Layer (AAI) GUI Client API METADATA catalogue Accounting System System managing software Workflow Engine Resource connector ICS-D Connector TCS Connector

EPOS Interface GUI & API Human interface (GUI) Website or portal Machine interface API or Web service which execute scripts or queries Returns data/metadata in a given standard Esempio carta identità

“EaaS” EPOS as a Service Metadata & services available: RDF export (“old” ENVRI) OAI-PMH, CKAN, open-search… (EUDAT) other standards CERIF XML APIs Other CRIS EPOS GUI EPOS APIs RDF OAI-PMH EPOS APIs

ICS-C Generic Architecture Authentication and Authorization Layer (AAI) GUI Client API METADATA catalogue Accounting System System managing software Workflow Engine Resource connector ICS-D Connector TCS Connector

Accounting system Purpose Keeps track of Used resources (data, processing) Users actions Goal Load balance Logging Statistics Impact analysis Tailor GUI to user Uses metadata catalogue May I know which resources were accessed by a user and when? Which is the most downloaded data? Most searched data? Is your TCS available or too may users connected?

ICS-C Generic Architecture Authentication and Authorization Layer (AAI) GUI Client API METADATA catalogue Accounting System System managing software Workflow Engine Resource connector ICS-D Connector TCS Connector

Connectors Software Interfaces to RESOURCES ( TCS or ICS-D) EPOS ICS Resource connector ICS-D Connector TCS Connector Service API Service API Service API Service API …….. Service (ICS-D) Service (ICS-D) Service (TCS) Service (TCS)

Thematic Core Services - TCS -

Time for QUIZ Which are the main ingredients of TCS architecture? New building and new hardware New furniture for my office Money System managing the data; metadata; web services Schema describing the data management plan

TCS generic architecture D) System managing the data; metadata; web services National Layer Research Infrastructures: these can vary in different countries within a specific scientific community. They represent the existing DDSS. TCS system: this represents the e-Infrastructure for a specific scientific community. It may include the software used to federate National RIs, or the software to present results on the web (web portal). Metadata Catalogue: this is usually a database where each data object (e.g. file, in case of non-streamed data) is referenced and described by the metadata. It can be used to drive the Web Services

TCS Checklist DDSS / Assets Metadata describing them (standard) Web services (API) exposing them ICS-TCS common work on metadata data policies implemented (AAAI)

TCS Metadata principles Metadata used for Data / Dataproducts, Software, Services, Resources, Users Approved (e.g. OSI,OGC, INSPIRE…) or de facto standard Intra community common strategy Metadata already used by other communities? Common developments? community data (e.g. seismic waveforms, GPS time-series, geological maps), software (e.g. an analysis or visualisation application), services (e.g. use of specialist equipment), resources (e.g. computers, instrumentation, detectors) and users (with their roles, responsibilities and authorities used for AAI).

TCS Web services / APIs principles Each type of Data delivered (and its associated metadata) should be accessible via web services and/or APIs Based on approved (e.g. OSI) or de facto standards Main purposes Discovery (metadata) Data access and retrieval (e.g. download) Data analysis / visualization / processing

ICS-TCS common work Metadata Definition of community, standard-based metadata APIs Definition of APIs for data discovery and for data access/retrieval Match & Map: Match/Mapping of community metadata within the Metadata Catalogue

Metadata match and map Process of mapping Metadata elements to the canonical CERIF ICS –TCS common work in selecting discovery and contextual metadata element (e.g. lat, lon, author,etc)

Metadata & interoperability Do TCSs need to copy their metadata into the ICS metadata catalogue? Metadata dump: Metadata is copied. Requires polling, many entries in ICS catalogue, TCS low “burden” Metadata Runtime access: MD accessed by querying web services. No polling, avoids errors, needs reliable TCS webservices Match / Mapping is required for both options

TCS-ICS Open issues

Computational Earth Sciences (CES) Not reinvent the wheel principle: re-use existing software / available resources TCS contribution: mapped in the assets/requirement collection phase ICS will set up a bid procedure for CES

Identifiers (PID / UID): encourage their adoption by TCS to Unique and reliable reference to Digital Object Reproducibility of data & citation Interlinked with other initiatives (EUDAT, RDA) Ongoing discussion: ICS-C will provide further recommendations

AAAI

“the ITs” We support you!

ICS STAFF - WP6&7 Needed urgently BGS, KNMI, INGV, UiB, GFZ WP Chairs: Keith keith.jeffery@keithgjefferyconsultants.co.uk Kuvvet Kuvvet.Atakan@uib.no Matt mharr@bgs.ac.uk Institutions: BGS, KNMI, INGV, UiB, GFZ Task forces: Architecture Graham agsmith@bgs.ac.uk PID / UID Martin mln@bgs.ac.uk Workflow James jpass@bgs.ac.uk AAAI Mariusz m.sterzel@cyfronet.pl Documentation daniele.bailo@ingv.it IT TCS CONTACTS Needed urgently

Thank You Epos Social www.epos-eu.org WebSite Newsletter www.epos-eu.org/newsletter www.epos-eu.org/ride R.I.D.E. Epos Social

Common questions Do ICS store the data? No but for technical purposes. Data is stored & curated by users. Do ICS assign PIDs? In principle no. Ongoing discussion I feel lost: where should I start from? Define metadata for your data