Open Science cloud access to LOFAR data and compute

Slides:



Advertisements
Similar presentations
Dominik Stoklosa Poznan Supercomputing and Networking Center, Supercomputing Department EGEE 2007 Budapest, Hungary, October 1-5 Workflow management in.
Advertisements

Dominik Stokłosa Pozna ń Supercomputing and Networking Center, Supercomputing Department INGRID 2008 Lacco Ameno, Island of Ischia, ITALY, April 9-11 Workflow.
What does LOFAR have to do with the Virtual Observatory (VO)? LOFAR Science Day 16 December 2003 Melbourne David Barnes The University of Melbourne.
Kjeld v.d. Schaaf DS3-T2 DS3 T2: Data Handling, Control and Distributed Computing Kjeld v.d. Schaaf 4 September 2006.
Jeroen Stil Department of Physics & Astronomy University of Calgary Stacking of Radio Surveys.
NAIC-NRAO School on Single-Dish Radio Astronomy. Arecibo, July 2005
23/04/2008VLVnT08, Toulon, FR, April 2008, M. Stavrianakou, NESTOR-NOA 1 First thoughts for KM3Net on-shore data storage and distribution Facilities VLV.
Radio Telescopes Large metal dish acts as a mirror for radio waves. Radio receiver at prime focus. Surface accuracy not so important, so easy to make.
Developing Health Geographic Information Systems (HGIS) for Khorasan Province in Iran (Technical Report) S.H. Sanaei-Nejad, (MSc, PhD) Ferdowsi University.
Susana Sánchez Expósito Instituto de Astrofísica de Andalucía - CSIC Pablo Martin, Jose Enrique Ruiz, Lourdes Verdes-Montenegro, Julian Garrido, Raül Sirvent,
Cloud Computing 1. Outline  Introduction  Evolution  Cloud architecture  Map reduce operation  Platform 2.
Grid Data Management A network of computers forming prototype grids currently operate across Britain and the rest of the world, working on the data challenges.
DCS Overview MCS/DCS Technical Interchange Meeting August, 2000.
National Center for Supercomputing Applications Observational Astronomy NCSA projects radio astronomy: CARMA & SKA optical astronomy: DES & LSST access:
DOE BER Climate Modeling PI Meeting, Potomac, Maryland, May 12-14, 2014 Funding for this study was provided by the US Department of Energy, BER Program.
SKA Introduction Jan Geralt Bij de Vaate Andrew Faulkner, Andre Gunst, Peter Hall.
Flexibility and user-friendliness of grid portals: the PROGRESS approach Michal Kosiedowski
The SOC Pilot and the ATOA Jessica Chapman CASS Observatory Operations Research Program Leader 28 June 2011.
Rosie Bolton1 SKADS Costing work 4 th SKADS Workshop, Lisbon, 2-3 October 2008 SKADS Costing work: Spreadsheets to scalable designs Rosie Bolton Dominic.
Paul Alexander & Jaap BregmanProcessing challenge SKADS Wide-field workshop SKA Data Flow and Processing – a key SKA design driver Paul Alexander and Jaap.
Netherlands Institute for Radio Astronomy 1 ASTRON is part of the Netherlands Organisation for Scientific Research (NWO) LOFAR Operations and Schedule.
CRISP & SKA WP19 Status. Overview Staffing SKA Preconstruction phase Tiered Data Delivery Infrastructure Prototype deployment.
Research Networks and Astronomy Richard Schilizzi Joint Institute for VLBI in Europe
1 Computing Challenges for the Square Kilometre Array Mathai Joseph & Harrick Vin Tata Research Development & Design Centre Pune, India CHEP Mumbai 16.
EVLA Software Bryan Butler. 2007May22EVLA SAGE Meeting2 Requirements and Goals of EVLA Software Maximize scientific throughput of the instrument At a.
ESFRI & e-Infrastructure Collaborations, EGEE’09 Krzysztof Wrona September 21 st, 2009 European XFEL.
Terena conference, June 2004, Rhodes, Greece Norbert Meyer The effective integration of scientific instruments in the Grid.
ASKAP: Setting the scene Max Voronkov ASKAP Computing 23 rd August 2010.
EURO-VO: GRID and VO Lofar Information System Design OmegaCEN Kapteyn Institute TARGET- Computing Center University Groningen Garching, 10 April 2008 Lofar.
Netherlands Institute for Radio Astronomy Big Data Radio Astronomy A VRC for SKA and its pathfinders Hanno Holties March 28 th 2012.
NEXPReS Period 3 Overview WP 6: High Bandwidth on Demand Paul Boven, JIVE.
Netherlands Institute for Radio Astronomy 1 ASTRON is part of the Netherlands Organisation for Scientific Research (NWO) Kurgan high-school students visit,
Supporting the “Solving Business Problems with Environmental Data” Competition 24 th October 2013 Vlad Stoiljkovic.
1 ASTRON is part of the Netherlands Organisation for Scientific Research (NWO) Netherlands Institute for Radio Astronomy Astronomy at ASTRON George Heald.
The Science Data Processor and Regional Centre Overview Paul Alexander UK Science Director the SKA Organisation Leader the Science Data Processor Consortium.
Astronomy 1020 Stellar Astronomy Spring_2016 Day-19.
1 /16 How do you make an image of an object ? Use a camera to take a picture ! But what if the object is hidden ?...or invisible to the human eye ?...or.
Andreas Horneffer for the LOFAR-CR Team
Multi-beaming & Wide Field Surveys
Detecting UHE cosmic-rays and neutrinos hitting the Moon
Chapter 6 Telescopes: Portals of Discovery
e-VLBI correlator mode: control interfaces
NRAO VLA Archive Survey
Mid Frequency Aperture Arrays
INTRODUCTION TO GEOGRAPHICAL INFORMATION SYSTEM
Pasquale Migliozzi INFN Napoli
Computing Architecture
EGEE NA4 Lofar Lofar Information System Design OmegaCEN
Telescopes and Images.
ASTERICS to support enabling the scientific synergies
Computing Infrastructure for DAQ, DM and SC
6.3 Telescopes and the Atmosphere
به نام خدا Big Data and a New Look at Communication Networks Babak Khalaj Sharif University of Technology Department of Electrical Engineering.
Diffraction and Resolution
By Peter Kettig Supervisor: Antoine Basset, CNES
Rick Perley National Radio Astronomy Observatory
Observational Astronomy
Observational Astronomy
Diffraction and Resolution
EOSCpilot All Hands Meeting 8 March 2018 Pisa
Optical Telescopes, Radio Telescopes and Other Technologies Advance Our Understanding of Space Unit E: Topic Three.
Electromagnetic Spectrum
Brian Matthews STFC EOSCpilot Brian Matthews STFC
Dtk-tools Benoit Raybaud, Research Software Manager.
Topic 5 Space Exploration
Google Sky.
Overview of Workflows: Why Use Them?
The New Internet2 Network: Expected Uses and Application Communities
French Access to Copernicus Space Data
L. Glimcher, R. Jin, G. Agrawal Presented by: Leo Glimcher
Presentation transcript:

Open Science cloud access to LOFAR data and compute EOSC Pilot Workshop, Pisa, 13 Sept 2017 Honoured to be selected as pilot in the project. Relatively new to EOSC and LOFAR details. Thanks to Hanno and others for reusing slides of earlier presentations. Rob van der Meer, ASTRON Hanno Holties, ASTRON Niels Drost, eScienceCenter Coen Schrijvers, SURFSara Michael R. Crusoe, CWL et al.

Menu Radio Astronomy & LOFAR Data storage structure Data challenges User requirements Use cases Project plan Resource requirements Interoperability requirements Success criteria of SD Summary 13 Sept 2017 EOSC-Pilot LOFAR, Pisa Rob van der Meer

Astronomy (Radio) Andromeda Galaxy (Multi-wavelength View) Astronomy stretches over many wavelengths, each addressing different physics parameters of the objects. Some researchers want to just study the physics addressable by radio astronomy, others want to have a multi wavelength or even multi-messenger view, including gamma rays, neutrinos, gravitational waves. We are “spoiled” with the high resolution of the Hubble (visible) images. Achieving the same resolution in Radio images for comparing with other wavelengths and more detailed study of the objects, radio telescopes need to be Megametres wide Electromagnetic Wavelength longer shorter 13 Sept 2017 EOSC-Pilot LOFAR, Pisa Rob van der Meer

Astronomy (Radio) Andromeda Galaxy (Multi-wavelength View) Astronomy stretches over many wavelengths, each addressing different physics parameters of the objects. Some researchers want to just study the physics addressable by radio astronomy, others want to have a multi wavelength or even multi-messenger view, including gamma rays, neutrinos, gravitational waves. We are “spoiled” with the high resolution of the Hubble (visible) images. Achieving the same resolution in Radio images for comparing with other wavelengths and more detailed study of the objects, radio telescopes need to be Megametres wide Detector size same resolution 1000 km 1 m 13 Sept 2017 EOSC-Pilot LOFAR, Pisa Rob van der Meer

Westerbork Synthesis Radio Telescope -- data production -- Due to the long wavelength nature of radio astronomy, special techniques have to be used to “image” the sky. The signal need to be continuously digitized to correlate the data  Single receiver no spatial info Combining receivers in so-called baselines representing Fourier components WSRT uses earth rotation for a full 2D scan Digitized signals mostly noise Radio Telescopes produce substantial amounts of data, with volumes of “astronomical” proportions 13 Sept 2017 EOSC-Pilot LOFAR, Pisa Rob van der Meer

Radio Astronomy at scale: International LOFAR Telescope The technical requirements of LOFAR have been driven by a broad and diverse range of scientific goals [1]. Amongst others, these goals include wide-field, high dynamic range imaging, near real-time detection of radio transients, high accuracy pulsar timing, solar monitoring, and radio pulses generated by cosmic ray air showers. To achieve such a wide spectrum of scientific requirements, the system has been designed and built to be both highly flexible as well as extensible [2]. This is achieved by using many small antennas instead of dishes, which are digitized at an early stage in the signal chain. Doing so, an operator is able to point the telescope simultaneously in multiple directions and schedule multiple observations at the same time increasing the efficiency of the telescope. Furthermore part of the flexibility is realized by using custom off the shelf computing hardware for the central systems where all data of the fields is combined. In this way imaging observations can be mixed or followed with other types of observations easily. 13 Sept 2017 EOSC-Pilot LOFAR, Pisa Rob van der Meer

Radio Astronomy at scale: International LOFAR Telescope Correlator Groningen, NL The technical requirements of LOFAR have been driven by a broad and diverse range of scientific goals [1]. Amongst others, these goals include wide-field, high dynamic range imaging, near real-time detection of radio transients, high accuracy pulsar timing, solar monitoring, and radio pulses generated by cosmic ray air showers. To achieve such a wide spectrum of scientific requirements, the system has been designed and built to be both highly flexible as well as extensible [2]. This is achieved by using many small antennas instead of dishes, which are digitized at an early stage in the signal chain. Doing so, an operator is able to point the telescope simultaneously in multiple directions and schedule multiple observations at the same time increasing the efficiency of the telescope. Furthermore part of the flexibility is realized by using custom off the shelf computing hardware for the central systems where all data of the fields is combined. In this way imaging observations can be mixed or followed with other types of observations easily. Poznan, PSCN, PL SURFSara, NL 3 Gb/s/station 7 PB/y LTA Juelich, DE 13 Sept 2017 EOSC-Pilot LOFAR, Pisa Rob van der Meer

LOFAR Long Term Archive -- Purpose & Use Cases -- High Level Use-Cases: Ingest Data Store Data Query Meta-data Retrieve Data Monitoring & Control 1. Ingest 3. Query 5. Monitoring & Control 4. Retrieve ALTA Meta-data Online storage Control 2. Store Cold storage 13 Sept 2017 EOSC-Pilot LOFAR, Pisa Rob van der Meer

EOSC Pilot tasks Raw data Data products Dedicated facilities Cloud facilities Pipeline reduction Analysis etc. Final user data, images etc. Single object image Pulsar timing spectroscopy EoR In Telescope pipeline from antenna signal to visibilities Early reduction steps are embarrassingly parallel  HTC Steps to imaging could be HPC Make this border shift left by facilitating compute Make this side better accessible 13 Sept 2017 EOSC-Pilot LOFAR, Pisa Rob van der Meer

Data challenges Where is my data? Move data to local cluster Data provenance Data access with gftp User access with Grid certificates, user database at ASTRON Data can be in multiple LTA sites 13 Sept 2017 EOSC-Pilot LOFAR, Pisa Rob van der Meer

User requirements Old/current: I want to do what I want to the data I want to have my data Move users to new requirements: What is my data? How can I analyse it? Federated identity 13 Sept 2017 EOSC-Pilot LOFAR, Pisa Rob van der Meer

Use cases Facilitate easy access for power user. Facilitate free/sandbox compute with own algorithm, parameters, etc. on small local data set. Then scale up to larger data set on remote cluster Make the Lofar Archive Accessible to non power users Provide non-expert user with standard pipeline, and gui for ~10 free parameters Use this demonstration to show both possibilities and limitations of current software and e-Infrastructure. 13 Sept 2017 EOSC-Pilot LOFAR, Pisa Rob van der Meer

Project plan I Define the “perfect” environment as far as we know for the use cases. immediately start building with existing tools and resources without optimising them first, to show not only limitations, but that a total system is possible. from there define new projects for improving the working system. In case of choice of tools, take the first available to make it work. Only if it really breaks, change to another, or as part of the improvement at the end. 13 Sept 2017 EOSC-Pilot LOFAR, Pisa Rob van der Meer

Project plan II – some elements of approach Access Lofar Archive (initially SURFsara archive) Look into connectivity to other archives Standardize the existing pipelines for a few science cases using the Common Workflow Language (CWL) Investigate notebooks Build a web frontend for running these pipelines, and allow users to change settings, input dataset, etc Use an existing viewer (eg based on rabix.io) to view the resulting workflow Look into the current downloadserver and other Lofar data retrieval elements. Perhaps put a new frontend on top of the current systems. Use Zenodo/B2Share and Research Object to export the resulting data to a persistent storages, with a DOI, etc. Disclaimer: These are the first ideas, not yet planned steps 13 Sept 2017 EOSC-Pilot LOFAR, Pisa Rob van der Meer

Project plan III – more elements of approach How this is different from HEP (via presentation @ EOSCPilot workshop): Separate containers for each standalone software We model/preserve how the tools are connected via CWL We will model tool resource usage to enable compute resource matchmaking Identifiers for each tool Detailed attribution of contributors via CWL+RO CWL (www.commonwl.org) RO (http://www.researchobject.org/) 13 Sept 2017 EOSC-Pilot LOFAR, Pisa Rob van der Meer

Resource requirements + Interoperability requirements Access to data in the LOFAR LTA @ SURFsara Acces to compute facilities @ SURFsara Access to data in the LOFAR LTA @ Juelich Transport of data between LTA sites Contact with GÉANT Access to compute facilities at other sites 13 Sept 2017 EOSC-Pilot LOFAR, Pisa Rob van der Meer

Success criteria of SD Demonstration that a complete system can be constructed from existing tools. + Power user more satisfied over the system performance + More non-expert users using the system and more satisfied over system performance 13 Sept 2017 EOSC-Pilot LOFAR, Pisa Rob van der Meer

Summary The team is defining the plan these weeks, so this presentation is work in progress. The first actions will follow soon, leading for sure to system changes in the plan: Plan and Build as we go. The team is very enthousiastic to create something and show that it works. Thank you. 13 Sept 2017 EOSC-Pilot LOFAR, Pisa Rob van der Meer

Square Kilometre Array Taking it to Exa-scale Start of construction 2018 Aperture arrays capable to produce more than 100x global internet traffic (production rate SKA > 10x) http://skatelescope.org 13 Sept 2017 EOSC-Pilot LOFAR, Pisa Rob van der Meer