aspects of archive system design

Slides:



Advertisements
Similar presentations
Where next…. Stakeholder workshop, 29 Jan To the end of the project.
Advertisements

ESO-ESA Existing Activities Archives, Virtual Observatories and the Grid.
CHORUS Implementation Webinar May 16, 2014 Mark Martin Assistant Director, Office of Scientific and Technical Information Office of Science U.S. Department.
Enabling Access to Sound Archives through Integration, Enrichment and Retrieval WP1. Project Management.
14 October 2003ADASS 2003 – Strasbourg1 Resource Registries for the Virtual Observatory R.Plante (NCSA), G. Greene (STScI), R. Hanisch (STScI), T. McGlynn.
Solar and STP Physics with AstroGrid 1. Mullard Space Science Laboratory, University College London. 2. School of Physics and Astronomy, University of.
Data preservation & the Virtual Observatory Bob Mann Wide-Field Astronomy Unit Royal Observatory Edinburgh
The Preparatory Phase Proposal a first draft to be discussed.
EGI-Engage EGI-Engage Engaging the EGI Community towards an Open Science Commons Project Overview 9/14/2015 EGI-Engage: a project.
Enabling Access to Sound Archives through Integration, Enrichment and Retrieval WP8. Dissemination & Exploitation.
EdSkyQuery-G Overview Brian Hills, December
GENIUS kick-off - November 2013 GENIUS kick-off meeting WP400 – Tools for data exploitation X. Luri.
Science Archive for Sky Surveys Data Providers and the VO - NeSC 2003 March Wide Field Astronomy Unit Institute for Astronomy.
SIXTH FRAMEWORK PROGRAMME FP INCO-MPC-1 MEditerranean Development of Innovative Technologies for integrAted waTer managEment.
Towards a European network for digital preservation Ideas for a proposal Mariella Guercio, University of Urbino.
Markus Dolensky, ESO Technical Lead The AVO Project Overview & Context ASTRO-WISE ((G)A)VO Meeting, Groningen, 06-May-2004 A number of slides are based.
© DATAMAT S.p.A. – Giuseppe Avellino, Stefano Beco, Barbara Cantalupo, Andrea Cavallini A Semantic Workflow Authoring Tool for Programming Grids.
CLARIN work packages. Conference Place yyyy-mm-dd
Participation in 7FP Anna Pikalova National Research University “Higher School of Economics” National Contact Points “Mobility” & “INCO”
EVA Workshop, 26 March 2003, Florence, Italy1 COINE Cultural Objects In Networked Environments Anthi Baliou University of Macedonia,Library Thessaloniki,
Recent Developments in CLARIN-NL Jan Odijk P11 LREC, Istanbul, May 23,
Summary Report Project Name: Infoway Testing Environment Brief Project Description: Develop testing harness to allow for conformance testing of the applications.
EURO-VO Structure Data Centre Alliance (DCA) A collaborative and operational network of European data centres who, by the uptake of new VO technologies.
Enabling Access to Sound Archives through Integration, Enrichment and Retrieval Annual Review Meeting - Introduction.
WP2 – Strategy, co-operation and dissemination EuroVO-AIDA – First periodic review – 24 April 2009 Françoise Genova, Project Coordinator WP2 – NA2 Strategy,
Overview and status of the project EuroVO-AIDA – Final review – 5 October 2010 Françoise Genova, Project Coordinator, 1 Overview and status of the project.
WP8 – Assessment of emerging technologies EuroVO-AIDA – First periodic review – 24 April 2009 Françoise Genova, Project Coordinator WP8 Assessment of emerging.
The Large Synoptic Survey Telescope Project Bob Mann Wide-Field Astronomy Unit University of Edinburgh.
GENIUS Kick-Off meeting December 4th 2013 WP-620 Simulated catalogue data Francesc Julbe UB – IEEC, Barcelona.
Introduction to the VO ESAVO ESA/ESAC – Madrid, Spain.
© Services GmbH Project work plan 2/20/ St. Petersburg, May 18, 2011 Dr. Andrey Girenko
WP2 – Definition of European DCA strategy EuroVO-DCA Final Review, 29 April 2009 Françoise Genova, Coordinator, WP2 lead WP2 Definition of European Data.
National Geospatial Enterprise Architecture N S D I National Spatial Data Infrastructure An Architectural Process Overview Presented by Eliot Christian.
F. Genova, AstroNET meeting, Poitiers The Astrophysical Virtual Observatory.
IPDA Architecture Project International Planetary Data Alliance IPDA Architecture Project Report.
ENEA GRID & JPNM WEB PORTAL to create a collaborative development environment Dr. Simonetta Pagnutti JPNM – SP4 Meeting Edinburgh – June 3rd, 2013 Italian.
Click to edit Master title style Click to edit Master text styles Second level Third level Fourth level Fifth level 1 SI O S Svalbard Integrated Arctic.
International Planetary Data Alliance Registry Project Update September 16, 2011.
Introduction: AstroGrid increases scientific research possibilities by enabling access to distributed astronomical data and information resources. AstroGrid.
Developing our Metadata: Technical Considerations & Approach Ray Plante NIST 4/14/16 NMI Registry Workshop BIPM, Paris 1 …don’t worry ;-) or How we concentrate.
AstroGrid and Virtual Observatories for Radio Interferometry arrays/ proposals Anita Richards Paul Harrison Noel Winstanley (Jodrell Bank Centre for Astrophysics,
Bob Jones EGEE Technical Director
AIDA Fourth Technology Forum
An Approach to Software Preservation
WHY? - Found initiative while case statement preparation
Building a worldwide interoperable data infrastructure for astronomy: the International Virtual Observatory Alliance SciDataCon 2016, Denver, 13/Sept/2016.
INTAROS WP5 Data integration and management
PUBLICATION OF RCA SUCCESS STORIES
Moving towards the Virtual Observatory Paolo Padovani, ST-ECF/ESO
Universitat de Barcelona / FBG
Changing Practices… Changing Values
WP3 – SA1 Service activities in support of deployment of IVOA protocols and standards Christophe ARVISET, ESA.
VI-SEEM Data Repository
Horizon 2020: Open data pilots and lessons learnt
EOSCpilot Skills Landscape & Framework
EOSC & e-Science: enabling the digital transformation of Science
Leigh Grundhoefer Indiana University
Case Study: Algae Bloom in a Water Reservoir
DRIVER Digital Repository Infrastructure Vision for European Research
Databases, Web Pages and Archives
EOSCpilot All Hands Meeting 8 March 2018 Pisa
WIS Strategy – WIS 2.0 Submitted by: Matteo Dell’Acqua(CBS) (Doc 5b)
European Open Science Cloud All Hands Meeting Pisa 8-9 March 2018
Brian Matthews STFC EOSCpilot Brian Matthews STFC
Google Sky.
7th EU Research FP has ten themes defined in order:
New Platform to Support Digital Humanities in the Czech Republic
Australian and New Zealand Metadata Working Group
Interoperability and data for open science
Fabio Pasian, Marco Molinaro, Giuliano Taffoni
Presentation transcript:

aspects of archive system design GENIUS Final Report for WP3: aspects of archive system design Nigel Hambly Institute for Astronomy, Edinburgh University

WP3 Description From the GENIUS proposal: “The objective of this work package is to design, prototype and develop aspects of the archive infrastructure needed for the scientific exploitation of Gaia data. The design and technology choices made will be motivated by the real user requirements identified by WP2 – in particular, the massive, complex queries defined by the Grand Challenges – and by other initiatives, such as the GREAT project, and will be made with full recognition of the constraints imposed by the ESAC archive system, with which it must interface effectively. Prototypes will be prepared and tested in cooperation with the end user community and with the ESAC science archive team through the DPAC CU9. A core principle will be the adoption of Virtual Observatory standards and the development of VO infrastructure to enable ready interoperation with the other external datasets needed to release the full scientific potential of Gaia.”

GENIUS WP3 in DPAC CU9 WP930 => 1.8 (out of total of 6.1) FTE to end of March 2017

Work Breakdown Structure T3.1: Technical Co-ordination System Requirements Specification Systems Interface Control: ICDs T3.2: Aspects of Archive (end-user) Interface Design Subsystems interface infrastructure (affecting end-user experience) Enhanced features for User Interfaces T3.3: VO Infrastructure Client-side “Table Access Protocol” (TAP) tool International Virtual Observatory Alliance (IVOA) work T3.4: Data Centre Collaboration Distributed Query Processing (DQP) infrastructure T3.5: Cloud-based Research and Data Mining Environments Virtual Machines and containerisation

T3.1: Technical Co-ordination Good communication channels On-line collaborative tools (Wiki, SVN, teleconferencing) Requirements Specification SRS documented (Milestone MS6) Formal interface control established e.g. DQP ICD (Deliverable D3.1) and ensures good coordination between the 20 individuals involved in DPAC CU9 WP930 (presently a total of 6.1 FTE with 2.5 FTE at ESDC, 1.8 FTE national funding agencies and 1.8 FTE currently resourced via GENIUS in WP3)

T3.1 (cont.): Integration with DPAC GENIUS Deliverable D3.1

T3.2: Aspects of Archive Interface Design End-user experience depends on propagation of relevant information through the system Primary mechanism for interface specification within DPAC is a data model “Dictionary Tool” Some key infrastructural features missing from the GENIUS/CU9 perspective Concentrated on these before proceeding any contributions to the User Interface itself Gaia archive UI enhancements

T3.2 (cont.): Dictionary Tool enhancements New metadata fields Additional propagation features Ensure UIs contain all necessary information New version of tool released Q2 2016 In good time for DR1 developments

T3.2 (cont.): Gaia archive UI enhancements Auto-complete identified under beta-testing as a “nice-to-have” feature GENIUS Deliverable D3.4 Will appear in next release of GACS

T3.3: VO Infrastructure VO-Dance, a client-side integration tool Allowing the end-user to publish to the VO IVOA activities ADQL standards ADQL parser enhancements VO support Coordination of content descriptors (UCD1+) ADQL Cookbook in support of GDR1 VO Compliance Document

GENIUS Deliverable D3.3

GENIUS Deliverable D3.4

T3.3 (cont.): VO Infrastructure ✔ ✔ ✔ http://ia2.inaf.it/index.php/14-projects/39-eu-genius GENIUS Deliverable D3.4

e.g. TAP Schema editing via IA2TAP GUI application

T3.4: Data Centre Collaboration Archive test suite Recurring theme in requirements analysis is one of use of Gaia data in conjunction with other surveys: Multiple wavelengths Multiple epochs Combination of primarily astrometric data with other surveys/missions Distributed Query Processing is required

T3.4 (cont.): archive test suite β-testing Stress-testing GENIUS: Deliverable D3.5, but also covers in part Milestone MS9 Deliverables D3.4 and 3.6

T3.4 (cont.): Distributed Query Processing GENIUS: Deliverable D3.6

T3.4 (cont.): DQP under the hood

T3.4 (cont.): infrastructure demonstrator see http://genius.roe.ac.uk DQP prototype GENIUS Deliverable D3.2

T3.5: Data mining environments Docker Light-weight Virtual Machine DQP has been containerised using Docker for deployment at ESAC Demonstrates flexible and secure deployment of third-party code at a Data Centre Has potential as a mechanism for containerisation of user code uploads

GENIUS Deliverable D3.7

Deliverables: status summary

GENIUS legacy from WP3 We take seriously our responsibility for dissemination of results and SW coming out of the R&D programme using well-established Digital Curation mechanisms (and avoiding ad-hoc, ephemeral web resources): Software and associated documentation in GitHub Mainstream electronic publishing (e.g. article in Astronomy & Computing) Participation and integration in the international Virtual Observatory (developments of which adhere to the first two points) Re-use of knowledge in other areas and projects (e.g. taking forward FP7 GENIUS experience into H2020 ASTERICS)

Examples from WP3 GitHub resources (whole or in part): https://github.com/stvoutsin/pyrothorn https://github.com/stvoutsin/tap-autocomplete https://github.com/stvoutsin/taplib IVOA contributions: http://wiki.ivoa.net/internal/IVOA/InterOpMay2016-DAL/adql-20160509.pdf A&C paper (submitted end Jan 2017): “Use of Docker for deployment and testing of astronomy software” Data Centres using GENIUS-developed software: ESAC Science Data Centre Wide Field Astronomy Unit (University of Edinburgh, UK) In fact anywhere that uses IVOA infrastructure, e.g. ADQL parser