Palomar Transient Factory Data Flow Jason Surace IPAC/Caltech.

Slides:



Advertisements
Similar presentations
The Palomar Transient Factory or Adventures in High Fidelity Rapid Turnaround Data Processing at IPAC Jason Surace Russ Laher, Frank Masci, Wei Mi (did.
Advertisements

The Pan-STARRS Archive at STScI
Science Archives in the 21st Century Best Practices in Ingestion and Data Access at the NASA/IPAC Infrared Science Archive
PTFIDE and Forced Photometry
MOST - Moving Object Search Tool for NEOWISE and IRSA Kevin Yau 6/11/2010.
ADASS XVII Sep 2007The NOAO Pipeline Applications Francisco Valdes (NOAO) Robert Swaters (UMd) Derec Scott (NOAO) Mark Dickinson (NOAO)
DES PreCam A Proposal for the PreCam Calibration Strategy J. Allyn Smith David L. Burke Melissa J. Butner 15 July 2009 DRAFT.
1 Non-streaking moving objects from iPTF Frank Masci, Adam Waszczak, Russ Laher & James Bauer iPTF workshop, August 2014.
Optimal Photometry of Faint Galaxies Kenneth M. Lanzetta Stony Brook University.
Palomar Transient Factory Data Flow Jason Surace IPAC/Caltech.
VISTA/WFCAM pipelines summit pipeline: real time DQC verified raw product to Garching standard pipeline: instrumental signature removal, catalogue production,
The Need for Contiguous Fields NGAO Team Meeting, Waimea January 22, 2007 Claire Max.
Building a Framework for Data Preservation of Large-Scale Astronomical Data ADASS London, UK September 23-26, 2007 Jeffrey Kantor (LSST Corporation), Ray.
X-ray sources in NSVS Tim McKay University of Michigan 04/03/04.
Cosmos Data Products Version Peter Capak (Caltech) May, 23, 2005 Kyoto Cosmos Meeting.
VISTA pipelines summit pipeline: real time DQC verified raw product to Garching standard pipeline: instrumental signature removal, catalogue production,
Relative measurements with Synoptic surveys I.Photometry & Astrometry Eran Ofek Weizmann Institute.
1 HOT-WIRING THE TRANSIENT UNIVERSE | SANTA BARBARA, CA | MAY 12-15, 201 Name of Meeting Location Date - Change in Slide Master LSST Alert Production Pipelines.
Digitized Sky Survey Update Brian McLean : Archive Sciences Branch / Operations and Engineering Division.
Why Build Image Mosaics for Wide Area Surveys? An All-Sky 2MASS Mosaic Constructed on the TeraGrid A. C. Laity, G. B. Berriman, J. C. Good (IPAC, Caltech);
The NASA/NExScI/IPAC Star and Exoplanet Database 14 May 2009 David R. Ciardi on behalf of the NStED Team.
Purpose The purpose of this project was to analyze the data from the telescope to verify the relationship between period of rotation and diameter of the.
National Center for Supercomputing Applications Observational Astronomy NCSA projects radio astronomy: CARMA & SKA optical astronomy: DES & LSST access:
1 New Frontiers with LSST: leveraging world facilities Tony Tyson Director, LSST Project University of California, Davis Science with the 8-10 m telescopes.
Science Update: SN2011fe in M101 (Pinwheel Galaxy) Peter Nugent (LBNL/UCB)
1 Radio Astronomy in the LSST Era – NRAO, Charlottesville, VA – May 6-8 th LSST Survey Data Products Mario Juric LSST Data Management Project Scientist.
MASSACHUSETTS INSTITUTE OF TECHNOLOGY NASA GODDARD SPACE FLIGHT CENTER ORBITAL SCIENCES CORPORATION NASA AMES RESEARCH CENTER SPACE TELESCOPE SCIENCE INSTITUTE.
Photometry and Astrometry of SIM Planetquest Globular Cluster Targets T. M. Girard (Yale), A. Sarajedini (U. Florida), B. Chaboyer (Dartmouth) Table 1.
Data Management Subsystem Jeff Valenti (STScI). DMS Context PRDS - Project Reference Database PPS - Proposal and Planning OSS - Operations Scripts FOS.
SDSS-KSG 08 Workshop1 The SDSS DR7 and KIAS SDSS mirror Won-Kee Park ARCSEC, Sejong University 2008 SDSS-KSG Workshop.
Astronomical Spectroscopy and the Virtual Observatory ESAC, March 2007 VO tools and cross-calibration Pedro García-Lario European Space Astronomy.
WFCAM Science Archive Critical Design Review, April 2003 The SuperCOSMOS Science Archive (SSA) WFCAM Science Archive prototype Existing ad hoc flat file.
LSST Alert Management VOEvent Meeting Tucson, AZ December 5-6, 2005 Robyn Allsman,LSSTC/NOAO Maria Nieto-Santisteban, JHU Ray Plante, NCSA Tim Axelrod,
NHSC Workshop August 2013 NASA Herschel Science Center - page 1 PACS David R. Ardila User Support coordinator / NHSC Archive Scientist The Herschel Science.
Planetary Science Archive PSA User Group Meeting #1 PSA UG #1  July 2 - 3, 2013  ESAC PSA Archiving Standards.
Y-band Imaging of Extragalatic Fields and High redshift Quasars Changsu Choi 1, Myungshin Im 1 1 Center for the Exploration of the Origin of the Universe,
DC2 Post-Mortem/DC3 Scoping February 5 - 6, 2008 DC3 Goals and Objectives Jeff Kantor DM System Manager Tim Axelrod DM System Scientist.
EScience May 2007 From Photons to Petabytes: Astronomy in the Era of Large Scale Surveys and Virtual Observatories R. Chris Smith NOAO/CTIO, LSST.
Quality Assurance Benchmark Datasets and Processing David Nidever SQuaRE Science Lead.
Common Archive Observation Model (CAOM) What is it and why does JWST care?
Data Analysis Software Development Hisanori Furusawa ADC, NAOJ For HSC analysis software team 1.
Swift HUG April Swift data archive Lorella Angelini HEASARC.
1 CORE DATA PROCESSING SOFTWARE PLAN REVIEW | SEATTLE, WA | SEPTEMBER 19-20, 2013 Name of Meeting Location Date - Change in Slide Master Data Release Processing.
Hunting youngest Type Ia SNe in the intermediate Palomar Transient Factory Yi Cao (Caltech) On behalf of the intermediate Palomar Transient Factory collaboration.
Data Archives: Migration and Maintenance Douglas J. Mink Telescope Data Center Smithsonian Astrophysical Observatory NSF
VST ATLAS: Requirements, Operations and Products Tom Shanks, Nigel Metcalfe, Jamie McMillan (Durham Univ.) +CASU.
From photons to catalogs. Cosmological survey in visible/near IR light using 4 complementary techniques to characterize dark energy: I. Cluster Counts.
HARPS Data Flow System Christophe Lovis Geneva Observatory HARPS-N PDR, 6-7 December 2007, Cambridge MA.
LSST and VOEvent VOEvent Workshop Pasadena, CA April 13-14, 2005 Tim Axelrod University of Arizona.
Surace 2014 Jason Surace (Data Systems Lead) Zwicky Transient Facility Data System.
Distributed Pipeline Programming for Mosaics Or Mario Tips’N’Tricks.
Astrophysical Surveys: Visualization/Data Managements Peter Nugent (LBNL/UCB)
Difference Image Analysis at OAC Groningen, 1st Dec 2004 AW-OAC team.
MMT Observation Database for Light Curve Analysis Vladimir Agapov Presentation for the WG1 session 33rd IADC meeting, Houston.
IPHAS Early Data Release E. A. Gonzalez-Solares IPHAS Consortium AstroGrid National Astronomy Meeting, 2007.
The LSST Data Processing Software Stack Tim Jenness (LSST Tucson) for the LSST Data Management Team Abstract The Large Synoptic Survey Telescope (LSST)
What is Firefly (1) A web UI framework for web applications
Photometric Calibration Jorge F. García Yus GEMINI Observatory Barolo 2001.
Spotting the life of stars „Pi of the Sky” Project Katarzyna Kwiecińska UKSW-SNŚ on behalf of the Pi of the Sky collaboration.
Selection and Characterization of Interesting Grism Spectra Gerhardt R. Meurer The Johns Hopkins University Gerhardt R. Meurer The Johns Hopkins University.
26th October 2005 HST Calibration Workshop 1 The New GSC-II and it’s Use for HST Brian McLean Archive Sciences Branch.
Data Management: Data Processing Types of Data Processing at USGS There are several ways to classify Data Processing activities at USGS, and here are some.
GSPC -II Program GOAL: extend GSPC-I photometry to B = V ˜ 20 add R band to calibrate red second-epoch surveys HOW: take B,V,R CCD exposures centered at.
The First Release of AST3 point source catalog from Dome A, Antarctica in 2012 MA Bin, WEI Peng, SHANG Zhaohui, LIU Qiang, HU Yi (NAOC) 973 课题二
The GLORIA demonstrator experiment Ariel Majcher National Centre for Nuclear Research Warsaw, Poland XXXII IEEE-SPIE Joint Symposium Wilga 2013, May 29.
Pi of the Sky off-line experiment with GLORIA Ariel Majcher National Centre for Nuclear Research Warsaw, Poland 10th INTEGRAL/BART Workshop, April.
T. Axelrod, NASA Asteroid Grand Challenge, Houston, Oct 1, 2013 Improving NEO Discovery Efficiency With Citizen Science Tim Axelrod LSST EPO Scientist.
From LSE-30: Observatory System Spec.
A galaxy at redshift 10? Brigitta Eder Vera Könyves
Photometric Analysis of Asteroids
Presentation transcript:

Palomar Transient Factory Data Flow Jason Surace IPAC/Caltech

Complicated Data Path Data flows through multiple pipelines, creating a variety of science products tailored for different purposes. These pipelines operate on multiple timescales. What data you want depends in large part on what science you want to do. Realtime Data Processing – image subtraction, transient and solar system object detection. High Fidelity Daily Processing – nightly processing and recalibration for highest data quality images and source catalogs. Ensemble Processing – periodic construction of coadded images, processing of catalogs to create high precision light curves. Long-term Data Curation - storage of all raw data, processed data (images and extracted photometry), and an advanced data archive with data exploration tools, with public release.

P48 Caltech/C ahill NERSC Image Subtraction and Transient Detection/RB Pipeline Ingest Realtime Image Subtraction Pipeline Photometric Pipeline Reference Pipeline Lightcurve Pipeline Transient Candidates Lightcurves Reference Catalogs Epochal Images and Catalogs IPAC Moving Object Pipeline SSOs Reference Images

Data Transfer Data flows from the 48-inch and the PTF camera system via high-speed microwave link through a relay node at the San Diego Supercomputing Center to Cahill at Caltech. From there it forks to two places: NERSC at LBNL and IPAC at Caltech. Raw data moves as a multi-extension FITS file containing all 12 CCD images in an exposure, along with header metadata. This is the raw data product, and will not be used by many of you.

Raw Data 12 chips extracted from the MEF file and moaicked together. Dead CCD

NERSC Realtime Pipeline NERSC/LBL developed the first version of a realtime data pipeline which performs basic calibration, image subtraction against a reference image dataset, transient candidate detection, and candidate vetting via the RealBogus software. Still operating today. This is the feed-in for the existing extragalactic transient marshall. Designed around SNe detection. Most of the SNe work you have seen has come from this pipeline. In-collaboration dataset; functionality has been redeveloped, improved, and expanded by IPAC, which will be the basis for future ZTF alerts.

IPAC Data Ingest Data flows in realtime to IPAC. Upon receipt, the MEF files are broken up into individual CCD files. PTF data system processes all the CCDs wholly independently. Metadata about all the images goes into an operations database. Data receive an initial WCS. All the data are stored on spinning disk and in a deep tape archive. You are here. PTF data lives is here.

Infrared Processing and Analysis Center Multi-mission Science Center (IRAS, ISO, Spitzer WISE, Herschel, Planck, 2MASS, etc) Maintains several data rooms. ~1TB of data every 4-5 days. 24 drones with 240 cores. Roughly 0.5 PB spinning disk. Associated network equipment. Database and file servers. Archive servers. Tape backup. This will increase by a factor of 10x in the ZTF era! One shudders to imagine LSST, which will be measuring it’s computing power in megawatts. IPAC Morrisroe Computer Center

R-band Holdings 1449 nights, 3.3 million images 50 billion source apparitions (epochal detections)

g-band Holdings 406 nights, 830 thousand images

H-alpha Holdings 99 nights, 125 thousand images, plus an equal amount In at least one other H-alpha filter.

Realtime Pipeline Realtime – data is processed as received, turnaround in minutes. Needed for same-night followup. Astrometric and photometrically calibrated. Image subtraction against a reference image library constructed from all the data to-date. In-house software. “Streak detection” for fast-moving objects; moving object pipeline constructs solar system object tracklets. Transient candidate detection and extraction via psf-fitting and aperture extraction. Machine-learning “scores” candidates. Image subtractions and candidate catalogs are pushed to an external gateway where they are picked up by the solar system, ToO, and extragalactic marshalls. Not publicly available at this time.

Realtime Image Subtraction and Transient Detection Originally the community “HOTPANTS” software, now replaced with a more sophisticated in-house image subtraction package.

Realtime Pipeline This is a fast streak candidate from the Solar System Marshall.

Photometric Pipeline This pipeline processes data in the traditional manner. Starts up at the end of the night, after all the data has been received. Calibration is derived from the entire night’s worth of data. Specifically, the bias and flat-fields are derived from the data themselves. Photometric calibration is derived from extracted photometry from all sources, fitting color, extinction, time and large-scale spatial variations vs. the SDSS. Typically reach an accuracy of a few %. Astrometric calibration is done individually at the CCD level, against a combined SDSS and UCAC4 catalog. Typically good to 0.15”. Output from this pipeline are calibrated single-CCD FITS images and single-CCD catalog FITS binary tables (both aperture and psf-fit). These are archived through IRSA. Available 1-3 days after observation. These are publicly available data products.

Photometric Pipeline Output Single R-band thumbnail image of Arp 220, 8 arcminutes across. Aperture extractions catalog (sextractor-based) overlaid. All observations and detections of everything are saved in the archive. Products are a reduced image, bit-encoded data quality mask, and catalogs. All products are FITS.

Reference Image Pipeline Once enough individual observations accumulate, the “reference image” pipeline is triggered. This pipeline coadds the existing data, after selecting “best frames”, e.g. best seeing, photometric conditions, astrometry, etc. Coaddition is done based on CCD id, PTF tile, and filter. These images are the reference of the static sky, at a level deeper than the individual observations. “Reference Catalogs” are extracted from these images. This concept is important, because these are both the underlying basis of the image subtractions, and also the basis of the light-curve pipeline. Like PTF coverage, the depth of these is variable, but is current 5<n<50. Resulting products are FITS images and FITS binary tables. Publicly available.

Reference Images Single Image 60 Field 5257, Chip 7, Stack of 34

Deep Sky Coadds aka “Reference Images” * Results not typical. Near Galactic Center.

Deep Coadds

Light Curve Pipeline Each night, all detected sources from the photometric pipeline are matched against the reference catalog (better than a generic catalog-matching approach). All sources ever seen for a given CCD, PTF tile, and filter combination are loaded and analyzed. Least variable sources used as anchors for the calibration. Image-by-image correction factors computed for that image as a whole and stored as a lookup table. Application of these secondary correction factors improves overall relative calibration to near-millimag levels for bright sources (that part is important). Triggers less frequently (planned weekly updates). Highest level of our products. Not publicly available.

Example Light Curves Something a little different, these are relatively faint asteroid light curves from Chang et al

Data Products What you can publicly get today: Calibrated epochal images and catalog files at g and R-band for all data taken prior to Dec 31, Calibrated reference images and catalogs for all fields observed prior to Dec 31, In one year: Rolling release of iPTF data including light curves.

PTF Archive at IRSA Data products can be searched and retrieved via sophisticated GUI tools and also through an application program interface that allows integration of the archive into other, 3 rd party software.