SPIE Astronomical Telescopes + Instrumentation | 26 June - 1 July 2016 | Edinburgh, United Kingdom


1 Investigating interoperability of the LSST Data Management software stack with Astropy
Tim Jenness, LSST DM Deputy System Architect
26th June 2016

2 Large Synoptic Survey Telescope
− 8.4 m telescope imaging the entire Southern sky about twice a week for 10 years.
− 3.2 gigapixel camera, ugrizy filters.
− ~15 TB per night continuously transferred to NCSA.
− Alert Production Pipeline
  – Transient alerts issued within 60 seconds of the shutter closing.
  – On average about 10 million alerts per night.
− Data Release Production Pipeline
  – 11 data releases over 10 years, with a final image collection size of 0.5 exabyte and a final catalog size of 15 PB.
  – All data re-reduced for each data release.

3 LSST Data Management Science Pipeline Software
− Software development began in
− Evolved over ten years during research and development phase.
− Mix of C++ and Python
  – 105k Python, 125k C++ (as of May 2016)
− SWIG bindings (Python APIs look like C++ APIs)

4 Astropy
− Astropy grew out of a mailing list discussion in
− Becoming the shared core for Python in Astronomy.
− Focused on usability (of APIs and installation).
− Specialist software in Astropy Affiliated Packages.
− Rapid uptake in the community.
− Will set community expectations for new generations of Python users.
Should LSST science pipelines adopt Astropy core classes and contribute to Astropy?

5 Astropy Adoption by LSST
LSST are changing their approaches in the following areas:
− Packaging
− Coordinates (space and time)
− Tables
− Quantities and units
Astropy is also motivating us to look at an alternative scheme for C++ bindings.

6 Packaging and build system
− EUPS
  – Version management
  – Packaging
− SCons build system
− Investigating a setup.py option

7 Coordinates
− Astropy time and sky coordinate classes are much more complete than those in LSST software.
  – Based on ERFA/SOFA
− LSST C++ code can do coordinate conversions but does not use an external library.
− LSST C++ code does not need to do coordinate conversions and always uses TAI time and ICRS RA/Dec.
− Use Astropy in Python code and very simple implementations in C++
  – Integer nanoseconds for time.
  – Generic spherical coordinate.
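The choice of integer nanoseconds for time can be illustrated with a minimal sketch. The function names and the epoch-relative convention below are assumptions made for illustration only; they are not LSST or Astropy APIs. The point is that a single integer keeps nanosecond resolution at any magnitude, whereas a double-precision count of seconds does not.

```python
from typing import Tuple

# Hypothetical helpers for illustration; not LSST or Astropy APIs.
NANOSEC_PER_SEC = 1_000_000_000


def tai_ns_from_parts(seconds: int, nanoseconds: int) -> int:
    """Pack whole seconds plus a nanosecond remainder into one integer."""
    return seconds * NANOSEC_PER_SEC + nanoseconds


def split_tai_ns(tai_ns: int) -> Tuple[int, int]:
    """Recover (whole seconds, nanosecond remainder) from the integer form."""
    return divmod(tai_ns, NANOSEC_PER_SEC)


# Two instants one nanosecond apart, about 1.5e9 seconds after some epoch.
t0 = tai_ns_from_parts(1_500_000_000, 0)
t1 = tai_ns_from_parts(1_500_000_000, 1)

# The integer representation keeps the two instants distinct.
int_difference_ns = t1 - t0

# A float64 count of seconds cannot resolve 1 ns at this magnitude:
# adding 1e-9 leaves the value unchanged.
float_resolves_1ns = (1_500_000_000.0 + 1e-9) != 1_500_000_000.0
```

Values of this size still fit in a signed 64-bit integer, which is what a simple C++ implementation would presumably rely on.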

8 Tables
− The LSST table library is optimized for efficient append operations using fixed table schemas.
− The Astropy table class is optimized for column access and visualization.
− Astropy have extended their table constructor (in version 1.2) to accept objects that have an __astropy_table__ method providing a memory view compatible with the Astropy table data model.
− LSST tables can now make use of more visualization tools and can be exported to pandas.
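The __astropy_table__ hook described above can be sketched as follows. FixedSchemaCatalog is a hypothetical stand-in, not the LSST table class, and the (cls, copy, **kwargs) signature is written from memory of how the Astropy table constructor calls the hook, so check it against the Astropy documentation before relying on it. To keep the sketch runnable without Astropy, a plain dict is passed as the target class.

```python
class FixedSchemaCatalog:
    """Hypothetical row-oriented catalog with a fixed schema.

    A stand-in for illustration only; not the LSST table class.
    """

    def __init__(self, field_names):
        self._field_names = tuple(field_names)
        self._rows = []

    def append(self, *values):
        """Append one row; the schema is fixed, so the length must match."""
        if len(values) != len(self._field_names):
            raise ValueError("row does not match the fixed schema")
        self._rows.append(tuple(values))

    def __astropy_table__(self, cls, copy, **kwargs):
        """Return an instance of `cls` built from column-oriented data.

        `cls` is the table class being constructed. This sketch always
        builds new lists, so the `copy` flag is ignored; a real
        implementation could hand over views of existing memory.
        """
        columns = {
            name: [row[i] for row in self._rows]
            for i, name in enumerate(self._field_names)
        }
        return cls(columns, **kwargs)


catalog = FixedSchemaCatalog(["id", "ra", "dec"])
catalog.append(1, 10.5, -30.0)
catalog.append(2, 11.0, -29.5)

# Exercise the hook directly, using dict as a stand-in for the table class.
columns = catalog.__astropy_table__(dict, copy=False)
```

With Astropy 1.2 or later installed, astropy.table.Table(catalog) would be expected to invoke the same hook with cls set to the Table class.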

9 Quantity
− Very powerful Astropy facility to combine units with values.
− Provides an additional check in an API, removing an entire class of bug.
  – Cannot combine quantities with incompatible units.
− Downside is a performance penalty.
− LSST will adopt quantities on a case-by-case basis to gain experience in their use.
− Have changed all unit strings in the LSST codebase to be compatible with Astropy units (especially important for tables, to allow columns to be interpreted as quantities).
  – Table units now validated.
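The class of bug that quantities remove can be shown with a toy value-plus-unit type. ToyQuantity below is an assumption made for illustration and is deliberately much simpler than astropy.units.Quantity: it only accepts identical unit strings, whereas Astropy converts between compatible units.

```python
class ToyQuantity:
    """Toy value-with-unit pair; a stand-in, not astropy.units.Quantity."""

    def __init__(self, value, unit):
        self.value = value
        self.unit = unit

    def __add__(self, other):
        if not isinstance(other, ToyQuantity):
            return NotImplemented
        # This toy only accepts identical unit strings; it does no conversion.
        if other.unit != self.unit:
            raise TypeError(
                "cannot add quantities with units %r and %r"
                % (self.unit, other.unit)
            )
        return ToyQuantity(self.value + other.value, self.unit)

    def __repr__(self):
        return "ToyQuantity(%r, %r)" % (self.value, self.unit)


exposure_time = ToyQuantity(15.0, "s")
readout_time = ToyQuantity(2.0, "s")
total_time = exposure_time + readout_time  # same unit, so this is allowed

# Adding a time to an angle is rejected instead of silently giving a number.
try:
    exposure_time + ToyQuantity(0.2, "arcsec")
    mixed_units_rejected = False
except TypeError:
    mixed_units_rejected = True
```

The extra check on every operation is also where the performance penalty noted above comes from.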

10 C++ bindings
− LSST use SWIG, Astropy prefer Cython.
− C++ code balloons to 1.6 million lines when the 100k C++ interface is wrapped by SWIG. Compile times are dreadful.
− Cython C++ support is lacking.
− pybind11 has much more potential (header-only C++).
− Actively experimenting with switching from SWIG to pybind11
  – Astropy project open to accepting bindings using this interface.

11 Areas where Astropy and LSST can both benefit
− Generalized World Coordinate Systems
− Regions
− Measurement Algorithms
− N-Dimensional Data representations

12 World Coordinate Systems
− FITS-WCS is not capable of representing all the distortions that will be present in LSST optics and camera hardware (tree rings, brighter-fatter…).
− STScI have a similar problem with JWST instruments.
− Need a scheme that allows mappings to be chained in parallel and in series to represent these different effects.
− Starlink AST and STScI GWCS
  – Different strengths
− Serialization format is key.
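Chaining mappings in series and in parallel can be sketched with plain functions. The helpers below (series, parallel, shift, scale, linear) are hypothetical and are not the Starlink AST or GWCS APIs; they only show the composition idea, with each mapping taking coordinates as arguments and returning a tuple.

```python
def series(*mappings):
    """Chain mappings so that the output of each one feeds the next."""
    def chained(*coords):
        for mapping in mappings:
            coords = mapping(*coords)
        return tuple(coords)
    return chained


def parallel(first, n_first, second):
    """Apply `first` to the leading n_first coordinates and `second` to the rest."""
    def combined(*coords):
        return tuple(first(*coords[:n_first])) + tuple(second(*coords[n_first:]))
    return combined


def shift(dx, dy):
    """2-D translation."""
    return lambda x, y: (x + dx, y + dy)


def scale(factor):
    """2-D isotropic scaling."""
    return lambda x, y: (x * factor, y * factor)


def linear(offset, slope):
    """1-D linear mapping, returned as a 1-tuple so it composes."""
    return lambda u: (offset + slope * u,)


# Series: move the origin, then rescale.
pixel_to_focal = series(shift(-100, -200), scale(0.5))

# Parallel: treat the two axes independently with different 1-D mappings.
per_axis = parallel(linear(0.0, 2.0), 1, linear(1.0, 3.0))

# The compound mapping is itself a mapping and can be chained further.
full = series(pixel_to_focal, per_axis)
result = full(110, 220)
```

A real scheme would also need inverse transforms and a serialization format for the compound mapping, which is the point made in the last bullet above.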

13 Regions

14 Regions: Footprints

15 Measurement Algorithms
− LSST will push the state of the art for algorithms dealing with source detection, source deblending and measurement.
− Important that these algorithms are easily available to the community.
− Currently they depend on many LSST infrastructure classes and are hard to disentangle.
− Approach for now is to make them Astropy compatible but not necessarily standalone as Astropy Affiliated Packages.

18 N-Dimensional Data Representations
− LSST uses an Exposure class to represent data, variance, quality masks, WCS, PSF, and exposure metadata.
− Astropy has an NDData class that can do some of this but has evolved in a somewhat different direction.
  – A “pixel origin” needs to be added to NDData.
− A duck-typed Exposure that can look like an NDData would be very useful, allowing LSST Exposures to be used by Astropy software packages.
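The duck-typing idea can be sketched as an adapter that exposes NDData-style attribute names (data, uncertainty, mask, wcs, meta, unit) on top of an exposure-like object. SimpleExposure and NDDataLikeExposure are hypothetical stand-ins, not the LSST Exposure class or Astropy code, and the pixel_origin attribute is an assumption reflecting the addition proposed above rather than an existing NDData feature.

```python
class SimpleExposure:
    """Hypothetical stand-in for an LSST-style Exposure; not the real class."""

    def __init__(self, image, variance, mask_bits, wcs, metadata, xy0=(0, 0)):
        self.image = image          # 2-D pixel values (nested lists here)
        self.variance = variance    # per-pixel variance
        self.mask_bits = mask_bits  # integer bit flags per pixel
        self.wcs = wcs
        self.metadata = metadata
        self.xy0 = xy0              # position of pixel (0, 0) in a parent frame


class NDDataLikeExposure:
    """Adapter exposing NDData-style attribute names on top of an exposure."""

    def __init__(self, exposure):
        self._exposure = exposure

    @property
    def data(self):
        return self._exposure.image

    @property
    def uncertainty(self):
        # NDData normally wraps this in an uncertainty class; the raw
        # variance is returned here to keep the sketch dependency-free.
        return self._exposure.variance

    @property
    def mask(self):
        # Collapse bit flags to booleans, with True meaning "masked".
        return [[bits != 0 for bits in row] for row in self._exposure.mask_bits]

    @property
    def wcs(self):
        return self._exposure.wcs

    @property
    def meta(self):
        return self._exposure.metadata

    @property
    def unit(self):
        return None  # no unit information in this toy exposure

    @property
    def pixel_origin(self):
        # Assumed extension; corresponds to the "pixel origin" noted above.
        return self._exposure.xy0


exposure = SimpleExposure(
    image=[[1.0, 2.0], [3.0, 4.0]],
    variance=[[0.1, 0.1], [0.2, 0.2]],
    mask_bits=[[0, 4], [0, 0]],
    wcs=None,
    metadata={"FILTER": "r"},
    xy0=(100, 200),
)
nd_like = NDDataLikeExposure(exposure)
```

Code written against these attribute names would accept the adapter without knowing about the underlying exposure, which is the interoperability the last bullet asks for.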

19 Summary
− LSST are keen to be part of the Astropy community.
− Have started to use Astropy classes in Python code.
− Interoperability of LSST objects in Astropy is a priority.
− Considering how best to contribute LSST code to the Astropy project (as Affiliated Packages and core contributions).
− Lower the barrier to entry for people using the LSST software.