ADASS XXIV, Calgary, 5-9 Oct 2014Fabio Pasian – Euclid Data processing1 Organization of the Euclid Data Processing: dealing with complexity Fabio Pasian.

Slides:



Advertisements
Similar presentations
Enabling Access to Sound Archives through Integration, Enrichment and Retrieval WP1. Project Management.
Advertisements

© 2009 The MITRE Corporation. All rights Reserved. Evolutionary Strategies for the Development of a SOA-Enabled USMC Enterprise Mohamed Hussein, Ph.D.
Chapter 7: Key Process Areas for Level 2: Repeatable - Arvind Kabir Yateesh.
Alternate Software Development Methodologies
ITIL: Service Transition
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
New DFG Information Infrastructure Projects Dr. Stefan Winkler-Nees; Birmingham, 28. March 2011 New DFG Information Infrastructure Projects.
DCS Architecture Bob Krzaczek. Key Design Requirement Distilled from the DCS Mission statement and the results of the Conceptual Design Review (June 1999):
Software Engineering General Project Management Software Requirements
© , Michael Aivazis DANSE Software Issues Michael Aivazis California Institute of Technology DANSE Software Workshop September 3-8, 2003.
The Software Product Life Cycle. Views of the Software Product Life Cycle  Management  Software engineering  Engineering design  Architectural design.
SOLAR ORBITER SOC Test Plans Nana Bach SOWG#7 – 6-9 July 2015.
Development plan and quality plan for your Project
Astronomical GRID Applications at ESAC Science Archives and Computer Engineering Unit Science Operations Department ESA/ESAC.
NBTC/ITU Workshop on Cross-Border Frequency Coordination June , 2015 Bangkok, Thailand.
EOSC Generic Application Security Framework
Chapter 4 Interpreting the CMM. Group (3) Fahmi Alkhalifi Pam Page Pardha Mugunda.
Euclid Consortium OU-LE3 KOM Meeting, London 1March 22-23, OU /SDC Proposed Development Strategy Maurice Poncet (CNES, SDC-FR Lead) Andrea Zacchei.
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
COST (European Cooperation in Science & Technology) An Introduction to the COST Grant System Dr. Mafalda Quintas (TD Science Officer) Ms. Andrea Tortajada.
ELISA Data Processing Centre Volker Beckmann APC, Francois Arago Centre A. Petiteau, E. Porter, G. Auger, E. Plagnol, P. Binétruy.
GLAST Science Support CenterAugust 9, 2004 Implementation of the Standard Analysis Environment (SAE) James Peachey (HEASARC/GLAST SSC—GSFC/L3)
OU-LE3/GC-SWG Euclid Consortium Interaction between SWG and SGS-OUs Fabio Pasian & Marc Sauvage (EC SGS Project Office) The presented document.
Rational Unified Process Fundamentals Module 4: Disciplines II.
1 Chapter 2 The Process. 2 Process  What is it?  Who does it?  Why is it important?  What are the steps?  What is the work product?  How to ensure.
NIST Special Publication Revision 1
DCS Overview MCS/DCS Technical Interchange Meeting August, 2000.
1 INFRA : INFRA : Scientific Information Repository supporting FP7 “The views expressed in this presentation are those of the author.
ETICS2 All Hands Meeting VEGA GmbH INFSOM-RI Uwe Mueller-Wilm Palermo, Oct ETICS Service Management Framework Business Objectives and “Best.
EMI INFSO-RI SA2 - Quality Assurance Alberto Aimar (CERN) SA2 Leader EMI First EC Review 22 June 2011, Brussels.
WGClimate John Bates NOAA SIT Workshop Agenda Item #8 WGClimate Work Plan progress & Issues CEOS SIT Technical Workshop CNES, Montpellier, France 17 th.
All rights reserved © Altec ExoMars 2018 Rover Operations Control Centre Planned Organization of ROCC Operations I. Musso.
Georgia Institute of Technology CS 4320 Fall 2003.
Bob Jones Technical Director CERN - August 2003 EGEE is proposed as a project to be funded by the European Union under contract IST
European Commission, DG Education and Culture,
Systems Analysis and Design in a Changing World, Fourth Edition
National Center for Supercomputing Applications Barbara S. Minsker, Ph.D. Associate Professor National Center for Supercomputing Applications and Department.
Develop Project Charter
WGISS /09/2015 DATA PRESERVATION – CNES APPROACH B. Chausserie-Laprée.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Distribution and components. 2 What is the problem? Enterprise computing is Large scale & complex: It supports large scale and complex organisations Spanning.
1 ASTRONET Coordinating strategic planning for European Astronomy.
Planck Science Team – UniMi-Milano – 02 – 04 November 2005 Andrea Zacchei / Davide Maino Planck ST #25 – UniMi, November 2005 Inputs from: SISSA,
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Information Architecture WG: Report of the Spring 2004 Meeting May 13, 2004 Dan Crichton, NASA/JPL.
TCS-ICS interactions Kuvvet Atakan 1 and the WP6 and WP7 Teams 1 University of Bergen / Department of Earth Science.
Software Quality Assurance SOFTWARE DEFECT. Defect Repair Defect Repair is a process of repairing the defective part or replacing it, as needed. For example,
EC Review – 01/03/2002 – WP9 – Earth Observation Applications – n° 1 WP9 Earth Observation Applications 1st Annual Review Report to the EU ESA, KNMI, IPSL,
12 March, 2002 LCG Applications Area - Introduction slide 1 LCG Applications Session LCG Launch Workshop March 12, 2002 John Harvey, CERN LHCb Computing.
GEO Implementation Boards Considerations and Lessons Learned (Document 8) Max Craglia (EC) Co-chair of the Infrastructure Implementation Board (IIB) On.
From Use Cases to Implementation 1. Structural and Behavioral Aspects of Collaborations  Two aspects of Collaborations Structural – specifies the static.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI UMD Roadmap Steven Newhouse 14/09/2010.
12 March, 2002 LCG Applications Area - Introduction slide 1 LCG Applications Session LCG Launch Workshop March 12, 2002 John Harvey, CERN LHCb Computing.
International Workshop Jan 21– 24, 2012 Jacksonville, Fl USA INCOSE IW 2012 MBSE Requirement Flowdown Workshop - Outbrief - John C. Watson Principal Member.
Harmonised use of accreditation for assessing the competence of various Conformity Assessment Bodies Dr Andreas Steinhorst, EA ERA workshop 13 April 2016,
From Use Cases to Implementation 1. Mapping Requirements Directly to Design and Code  For many, if not most, of our requirements it is relatively easy.
CERN, 25 June 2014CERN-Euclid meeting1 Euclid Science Ground Segment Organisation Fabio Pasian Euclid Consortium SGS Manager (INAF – OATrieste) The presented.
Structured Container Delivery Oscar Renalias Accenture Container Lead (NOTE: PASTE IN PORTRAIT AND SEND BEHIND FOREGROUND GRAPHIC FOR CROP)
* Chair of the ASTERICS General Assembly
ITIL: Service Transition
ST Status : Data Model & EMA mockup
SGS for OUSIM Meeting goals:
Distribution and components
Exploitation of ISS Scientific data - sustainability
Introduction to Software Testing
DRIVER Digital Repository Infrastructure Vision for European Research
Systems Engineering for Mission-Driven Modeling
PSS verification and validation
From Use Cases to Implementation
EOSC-hub Contribution to the EOSC WGs
Presentation transcript:

ADASS XXIV, Calgary, 5-9 Oct 2014Fabio Pasian – Euclid Data processing1 Organization of the Euclid Data Processing: dealing with complexity Fabio Pasian (INAF – O.A.Trieste) and Christophe Dabin, Marc Sauvage, Oriana Mansutti, Claudio Vuerli, Anna Gregorio on behalf of the Euclid SGS development team The presented document is Proprietary information of the Euclid Consortium. This document shall be used and disclosed by the receiving Party and its related entities (e.g. contractors and subcontractors) only for the purposes of fulfilling the receiving Party's responsibilities under the Euclid Project and that identified and marked technical data shall not be disclosed or retransferred to any other entity without prior written permission of the document preparer.

ADASS XXIV, Calgary, 5-9 Oct 2014Fabio Pasian – Euclid Data processing2 The Euclid Mission M2 mission in the framework of the ESA Cosmic Vision Programme Euclid mission objective is to map the geometry and understand the nature of the dark Universe (dark energy and dark matter) Actors in the mission: ESA and the Euclid Consortium (institutes from 13 European countries and USA, funded by their own national Space Agencies) For more information see :

ADASS XXIV, Calgary, 5-9 Oct 2014Fabio Pasian – Euclid Data processing3 The Euclid Consortium The Euclid Consortium is in charge of: –building and operating the instruments (VIS and NISP) –developing and running the data processing within a unified Science Ground Segment (SGS) –performing the science analysis on the Euclid data products The Euclid Consortium is composed of members –350+ Consortium members participating in SGS (active: ~150)

ADASS XXIV, Calgary, 5-9 Oct 2014Fabio Pasian – Euclid Data processing4 Euclid at a Glance

ADASS XXIV, Calgary, 5-9 Oct 2014Fabio Pasian – Euclid Data processing5 SOC ESAC MOC ESOC scientific community EA DDS Ground Station Euclid External data (KiDS, DES,...) SDC VObs EA is built jointly by EC and SOC, and is managed by SOC. «Internal» and «public» EA functions – the latter allows access to a subset of EA data The Ground Segment at a glance SDC ECSGS Project Office SDC ESA/SOC and the EC SGS have developed, and are committed to maintain, a tight collaboration in order to design and develop a single, truly integrated SGS. This is an institutional view of the GS System Team

ADASS XXIV, Calgary, 5-9 Oct 2014Fabio Pasian – Euclid Data processing6 LE1 SOC MOC The Ground Segment as seen in the high-level Euclid documents Euclid Consortium (ECSGS) MOGSSGS Ground Station

ADASS XXIV, Calgary, 5-9 Oct 2014Fabio Pasian – Euclid Data processing7 Level 1 Level 2 Level 3 Level E Level S VIS NIRSIREXT LE1 MER SIM SPESHEPHZ LE3 VIS/NIR/SIR/EXT cross-check VIS/NIR/SIR/EXT cross-check SIR cross-check SIR cross-check MER cross-check MER cross-check OPS MOC Ground Station The Ground Segment as seen from the data processing point of view The coloured boxes correspond to the Processing Functions, which are a product of the Euclid SGS SOC MOGSSGS This is an functional view of the SGS

ADASS XXIV, Calgary, 5-9 Oct 2014Fabio Pasian – Euclid Data processing8 SWGs, OUs and SDCs Science Working Groups –external to the SGS –turning science objectives into requirements placed on the pipeline products and performances –verifying that the requirements are met (define V&V procedures) Organisation Units –providing the algorithmic definition of the processing to be implemented by the SDCs and validate the implementation Science Data Centres –implementing the data processing pipelines as specified by OUs –procuring local h/w and s/w resources –different activities: SDC-DEV (development – i.e. transforming algorithms into robust code) SDC-PROD (integration on local infrastructure, production runs of pipeline) individual Euclid scientists may belong to more than one of the above groups

ADASS XXIV, Calgary, 5-9 Oct 2014Fabio Pasian – Euclid Data processing9 Development–Verification&Validation SWGsOU SDC-DEV SDC-PROD … requirements validation (on results) code validation algorithms, test data pipeline code, test data pipelines verification 2. only for validation against high-level requirements for every Processing Function 1. in most cases, no interfaces but joint development 3. common integration platform

ADASS XXIV, Calgary, 5-9 Oct 2014Fabio Pasian – Euclid Data processing10 Development–Verification&Validation 1.Set of documents being prepared jointly between OUs and SDCs (by product – Processing Function – and not by organisation) : a.PF Requirements Specification Document b.PF Validation Plan c.Development Plans (organised by SDC) 2.Validation by SWGs of the high-level data processing requirements a.high-level data processing requirements attributed to PFs b.the SGS will be considered as validated if every high-level data processing requirement is validated c.the SGS is including in the top-level IV&V plans the inputs provided by the SWG coordinators regarding the principles of validation as well as the recommendations and typologies of Validation test – this top-level document will be co-signed by SGS and SWG coordinators Responding to recommendations from the SGS-PRR: –Simplification/reduction of interfaces –«Best Practices» document issued to help OUs/SDCs

ADASS XXIV, Calgary, 5-9 Oct 2014Fabio Pasian – Euclid Data processing11

ADASS XXIV, Calgary, 5-9 Oct 2014Fabio Pasian – Euclid Data processing12 Pillars of SGS development SGS-Level Services = shared tools and systems for SGS software development Standards and guidelines Development platform Integration platform Data model Software infrastructure The System Team provides these to make the integration and operation of the Processing Functions a simple as possible [ Current status is wrt ADASS XXIII ]

ADASS XXIV, Calgary, 5-9 Oct 2014Fabio Pasian – Euclid Data processing13 Standards and guidelines Standards and guidelines help developers take the right decisions Show how/where to improve code to meet the demanding requirements of the Euclid data processing Encourage the use of best practices Provide tools to help developers improve their code Current status: Standards being developed based on previous project experience and adapted to the Euclid context

ADASS XXIV, Calgary, 5-9 Oct 2014Fabio Pasian – Euclid Data processing14 Development and integration platform The SGS uses a single development platform specifying Operating system, Programming language, Support libraries CODEEN is the Euclid collaborative development and continuous integration platform The cost of fixing bugs increases as the system integration approaches completion Usage mandatory for main processing software Current status: Python adopted as the second language allowed for pipeline development in addition to C++  ( Linux + C++ & Python ) Drivers: More flexibility about who can contribute to development, long term direction of astronomical programming The System Team will ensure that we get all of the benefits and avoid the (known!) pitfalls

ADASS XXIV, Calgary, 5-9 Oct 2014Fabio Pasian – Euclid Data processing15 Data model Explicit data model built by OUs to describe the output of their processing functions (therefore input to other Processing Functions in most cases) Many projects have an implicit data model, using conventions and shared code data structures Change management of implicit data models is difficult, particularly for long-living projects where knowledge can be lost Current status: Data Model Workshops held with great participation from OUs and System Team First iterations of the DM very promising – real data products starting to be defined Challenge now is to increase the coverage to all products and maintain a flexible process to allow the DM to evolve in a controlled way along with the Processing Functions. CCB started, to accept new items and to evaluate change requests

ADASS XXIV, Calgary, 5-9 Oct 2014Fabio Pasian – Euclid Data processing16 Software Infrastructure Three main systems Data Management System (EAS) - Shared set of tools for managing the Euclid dataset: data discovery and exchange, data processing support, quality and lineage tracking Abstraction Layer (IAL) – Processing management at the SDC computing facilities Processing Orchestration (COORS) – Coordinating processing activities across all the SDCs Current status: Prototypes exist for the core EAS system and IAL Integration of these systems though EC SGS Challenges demonstrate real progress towards a working data processing system

ADASS XXIV, Calgary, 5-9 Oct 2014Fabio Pasian – Euclid Data processing17 ST Challenge #3 Final goal of challenges : deploying transparently pipelines on all SDCs Technical objectives : Demonstrate the capability to deploy IAL VM images into SDCs Demonstrate the capability to deploy, in the context of each SDC, the TIPS, NIP and VIS simulators as Euclid pipeline objects Demonstrate the capability of IAL, in the context of each SDC, to : fetch, on the basis of the metadata provided by EAS prototype (in SDC-NL), the pipelines input data in the local SDC storage area launch simulators jobs across clusters (when available in SDCs) or dedicated nodes, in accordance with PPOs defined remotely (through Jenkins) or locally (by each SDC leader) – orchestration mock-up produce and store output data into the local SDC storage area send the appropriate metadata to EAS prototype in SDC-NL Schedule: Baseline availability for deployment into SDCs : end of December 2013 By mid-February 2014, all SDCs had successfully fulfilled the challenge

ADASS XXIV, Calgary, 5-9 Oct 2014Fabio Pasian – Euclid Data processing18 Thank you for your attention

ADASS XXIV, Calgary, 5-9 Oct 2014Fabio Pasian – Euclid Data processing19

ADASS XXIV, Calgary, 5-9 Oct 2014Fabio Pasian – Euclid Data processing20 Acknowledgments  Thanks to ESA and to the Euclid Consortium, and in particular:  ESA: John Hoar (ESAC), Guillermo Buenadicha (ESAC), René Laureijs (ESTEC), Giuseppe Racca (ESTEC), Pedro Osuna (ESAC), Bruno Altieri (ESAC), Michael Schmidt (ESOC), Cyril Colombo (ESTEC), Ralf Kohley (ESAC),...  EC: Yannick Mellier (IAP), Andrea Zacchei (INAF), Keith Noddle (UoE), Maurice Poncet (CNES), Rees Williams (RuG), Christian Neissner (PIC), Johannes Koppenhöfer (MPG), Pierre Dubath (Unige), Elina Keihänen (UHelsinki), Marco Frailis (INAF), Jean-Marc Delouis (IAP), Jean-Jacques Metge (CNES), Christian Surace (LAM), Nikos Apostolakos (Geneva), Laurent Vibert (IAS), Martin Melchior (FHNW), Stefan Müller (FHNW), Marco Soldati (FHNW), Andrey Belikov (RuG), Edwin Valentijn (RuG), Harry Teplitz (IPAC), OUs staff, SDCs staff, …  The SGS PRR Panel and Board  And many other people involved in the project  This is a REAL team effort ... and thank you for your attention 

ADASS XXIV, Calgary, 5-9 Oct 2014Fabio Pasian – Euclid Data processing21 Organisation Group Project Office ECSGS Management Config. Lead O. Mansutti PA/QA Lead C. Vuerli IOT Coordination A. Gregorio Proj.Ctr. Support D. Fierro OUs System Team L. C. Dabin OU-NIR A. Grazian R. Bouwens OU-VIS H. Mc Cracken N. Shane OU-SIR M. Scodeggio C. Surace OU-EXT J. Mohr G.Verdoes-Kleijn OU-MER A. Fontana M.Kuemmel,M.Douspis OU-SIM S. Serrano A. Ealet OU-SHE A. Taylor F.Courbin,T.Schrabback OU-SPE O. Le Fèvre M. Mignoli OU-LE3 J-L. Starck F.Abdalla, E.Branchini OU-PHZ S. Paltani F. Castander SDCs SDC-DE J. Koppenhoefer F. Raison SDC-CH P. Dubath SDC-FI E. Keihanen H. Kurki-Suonio SDC-ES C. Neissner N. Tonello SDC-IT A. Zacchei M. Frailis SDC-FR M. Poncet J-J. Metge SDC-UK K. Noddle M. Holliman SDC-NL O. R. Williams A. Belikov SDC-US J. Rector H. Teplitz Abstraction Layer (IAL) M. Melchior Architecture Performance K. Noddle Data Modeling C. Dabin Monitoring & Control L. Vibert Archive Data A. Belikov Archive Metadata P. Osuna Common Tools M. Poncet Orchestration K. Noddle Data Quality M. Brescia ECSGS Manager F. Pasian ECSGS Scientist M. Sauvage ECSGS Deputy C. Dabin LE1 common infrastructure M.Frailis

ADASS XXIV, Calgary, 5-9 Oct 2014Fabio Pasian – Euclid Data processing22 Processing Functions –are a product of the Euclid SGS (to be eventually delivered to ESA at the end of the mission) –correspond to the processing steps which are performed within an «Euclid pipeline» –are algorithmically devised by the relevant OU and engineered by software development teams (SDC-DEV) –can in principle be run yielding the same results on any SDC site of the SGS (SDC-PROD, different HW environments) In most cases, Processing Functions are developed jointly by OU members and their local SDC-DEV teams –formal OU-SDC interfaces not needed in most cases –easier to develop directly pipeline-quality code –SGS System Team provides tools/standards/support (SDC Leads are members of the ST)

ADASS XXIV, Calgary, 5-9 Oct 2014Fabio Pasian – Euclid Data processing23 SWGsOU SDC-DEV SDC-PROD … requirements validation (on results) code validation algorithms, test data pipeline code, test data pipelines verification

ADASS XXIV, Calgary, 5-9 Oct 2014Fabio Pasian – Euclid Data processing24 OUs are transnational An organization based on the decomposition in Organization Units (OU), corresponding to a subset of overall EUCLID Data Processing. OU-VISOU-NIROU-SIROU-EXTOU-MEROU-PHZOU-SPEOU-LE3OU-SHE SOC ESAC MOC ESOC scientific community EA DDS Ground Station Euclid External data (KiDS, DES,...) SDC OU-SIM VIS ImagNir ImagNir SpectroExt DataEuclidisationSpectro MeasLevel 3Morpho & ShearPhot Red Sh. Simulation SDC OU coordinator OU Deputy Coordinator VObs EA is built jointly by EC and SOC, and is managed by SOC. «Internal» and «public» EA functions – the latter allows access to a subset of EA data The Ground Segment at a glance SDC EC-SGS Project Office SDC ESA/SOC and the EC SGS have developed, and are committed to maintain, a tight collaboration in order to design and develop a single, truly integrated SGS.