Palomar Transient Factory Data Flow Jason Surace IPAC/Caltech.

Slides:



Advertisements
Similar presentations
The Palomar Transient Factory or Adventures in High Fidelity Rapid Turnaround Data Processing at IPAC Jason Surace Russ Laher, Frank Masci, Wei Mi (did.
Advertisements

Science Archives in the 21st Century Best Practices in Ingestion and Data Access at the NASA/IPAC Infrared Science Archive
PTFIDE and Forced Photometry
MOST - Moving Object Search Tool for NEOWISE and IRSA Kevin Yau 6/11/2010.
ADASS XVII Sep 2007The NOAO Pipeline Applications Francisco Valdes (NOAO) Robert Swaters (UMd) Derec Scott (NOAO) Mark Dickinson (NOAO)
1 Non-streaking moving objects from iPTF Frank Masci, Adam Waszczak, Russ Laher & James Bauer iPTF workshop, August 2014.
Optimal Photometry of Faint Galaxies Kenneth M. Lanzetta Stony Brook University.
VISTA/WFCAM pipelines summit pipeline: real time DQC verified raw product to Garching standard pipeline: instrumental signature removal, catalogue production,
ELT Stellar Populations Science Near IR photometry and spectroscopy of resolved stars in nearby galaxies provides a way to extract their entire star formation.
The Need for Contiguous Fields NGAO Team Meeting, Waimea January 22, 2007 Claire Max.
Building a Framework for Data Preservation of Large-Scale Astronomical Data ADASS London, UK September 23-26, 2007 Jeffrey Kantor (LSST Corporation), Ray.
X-ray sources in NSVS Tim McKay University of Michigan 04/03/04.
Cosmos Data Products Version Peter Capak (Caltech) May, 23, 2005 Kyoto Cosmos Meeting.
VISTA pipelines summit pipeline: real time DQC verified raw product to Garching standard pipeline: instrumental signature removal, catalogue production,
Relative measurements with Synoptic surveys I.Photometry & Astrometry Eran Ofek Weizmann Institute.
1 HOT-WIRING THE TRANSIENT UNIVERSE | SANTA BARBARA, CA | MAY 12-15, 201 Name of Meeting Location Date - Change in Slide Master LSST Alert Production Pipelines.
A Primer on Image Acquisition and Data Reduction Using TheSky6, CCDSoft V5 and Microsoft Excel Thomas C. Smith Dark Ridge Observatory (DRO)
Why Build Image Mosaics for Wide Area Surveys? An All-Sky 2MASS Mosaic Constructed on the TeraGrid A. C. Laity, G. B. Berriman, J. C. Good (IPAC, Caltech);
The NASA/NExScI/IPAC Star and Exoplanet Database 14 May 2009 David R. Ciardi on behalf of the NStED Team.
Douglas L. Tucker (FNAL) SISPI Meeting 22 February 2007 Sky Camera DB Inputs.
Purpose The purpose of this project was to analyze the data from the telescope to verify the relationship between period of rotation and diameter of the.
National Center for Supercomputing Applications Observational Astronomy NCSA projects radio astronomy: CARMA & SKA optical astronomy: DES & LSST access:
1 New Frontiers with LSST: leveraging world facilities Tony Tyson Director, LSST Project University of California, Davis Science with the 8-10 m telescopes.
Science Update: SN2011fe in M101 (Pinwheel Galaxy) Peter Nugent (LBNL/UCB)
Producing Science with the Palomar Transient Factory Branimir Sesar (MPIA, formerly Caltech)
ASTR_4170 Special Topics: Photometry & Filter Systems Day-5.
1 Radio Astronomy in the LSST Era – NRAO, Charlottesville, VA – May 6-8 th LSST Survey Data Products Mario Juric LSST Data Management Project Scientist.
MASSACHUSETTS INSTITUTE OF TECHNOLOGY NASA GODDARD SPACE FLIGHT CENTER ORBITAL SCIENCES CORPORATION NASA AMES RESEARCH CENTER SPACE TELESCOPE SCIENCE INSTITUTE.
Photometry and Astrometry of SIM Planetquest Globular Cluster Targets T. M. Girard (Yale), A. Sarajedini (U. Florida), B. Chaboyer (Dartmouth) Table 1.
Data Management Subsystem Jeff Valenti (STScI). DMS Context PRDS - Project Reference Database PPS - Proposal and Planning OSS - Operations Scripts FOS.
SDSS-KSG 08 Workshop1 The SDSS DR7 and KIAS SDSS mirror Won-Kee Park ARCSEC, Sejong University 2008 SDSS-KSG Workshop.
Astronomical Spectroscopy and the Virtual Observatory ESAC, March 2007 VO tools and cross-calibration Pedro García-Lario European Space Astronomy.
LSST Alert Management VOEvent Meeting Tucson, AZ December 5-6, 2005 Robyn Allsman,LSSTC/NOAO Maria Nieto-Santisteban, JHU Ray Plante, NCSA Tim Axelrod,
NHSC Workshop August 2013 NASA Herschel Science Center - page 1 PACS David R. Ardila User Support coordinator / NHSC Archive Scientist The Herschel Science.
Palomar Transient Factory Data Flow Jason Surace IPAC/Caltech.
Y-band Imaging of Extragalatic Fields and High redshift Quasars Changsu Choi 1, Myungshin Im 1 1 Center for the Exploration of the Origin of the Universe,
GSC-II Plate Processing Pipeline Status C. Loomis, C. Sturch, F. Guglielmetti.
DC2 Post-Mortem/DC3 Scoping February 5 - 6, 2008 DC3 Goals and Objectives Jeff Kantor DM System Manager Tim Axelrod DM System Scientist.
High Precision Astrometry and Parallax from Spatial Scanning Part 2 Adam Riess and Stefano Casertano.
EScience May 2007 From Photons to Petabytes: Astronomy in the Era of Large Scale Surveys and Virtual Observatories R. Chris Smith NOAO/CTIO, LSST.
PTF Data Caveats and Cautionary Notes Mansi M. Kasliwal Carnegie Institution for Science California Institute of Technology Thanks to Richard Walters,
Quality Assurance Benchmark Datasets and Processing David Nidever SQuaRE Science Lead.
AST3-1 photometry from Dome A Bin Ma, Peng Wei, Yi Hu, Zhaohui Shang NAOC AST3
Data Analysis Software Development Hisanori Furusawa ADC, NAOJ For HSC analysis software team 1.
Swift HUG April Swift data archive Lorella Angelini HEASARC.
1 CORE DATA PROCESSING SOFTWARE PLAN REVIEW | SEATTLE, WA | SEPTEMBER 19-20, 2013 Name of Meeting Location Date - Change in Slide Master Data Release Processing.
Counting individual galaxies from deep mid-IR Spitzer surveys Giulia Rodighiero University of Padova Carlo Lari IRA Bologna Francesca Pozzi University.
Hunting youngest Type Ia SNe in the intermediate Palomar Transient Factory Yi Cao (Caltech) On behalf of the intermediate Palomar Transient Factory collaboration.
Data Archives: Migration and Maintenance Douglas J. Mink Telescope Data Center Smithsonian Astrophysical Observatory NSF
VST ATLAS: Requirements, Operations and Products Tom Shanks, Nigel Metcalfe, Jamie McMillan (Durham Univ.) +CASU.
From photons to catalogs. Cosmological survey in visible/near IR light using 4 complementary techniques to characterize dark energy: I. Cluster Counts.
HARPS Data Flow System Christophe Lovis Geneva Observatory HARPS-N PDR, 6-7 December 2007, Cambridge MA.
LSST and VOEvent VOEvent Workshop Pasadena, CA April 13-14, 2005 Tim Axelrod University of Arizona.
Surace 2014 Jason Surace (Data Systems Lead) Zwicky Transient Facility Data System.
Pan-STARRS Seminar: IPPEugene Magnier Pan-STARRS Image Processing Pipeline An Overview IFA Pan-STARRS Seminar 735 October 6, 2004.
Distributed Pipeline Programming for Mosaics Or Mario Tips’N’Tricks.
Astrophysical Surveys: Visualization/Data Managements Peter Nugent (LBNL/UCB)
Difference Image Analysis at OAC Groningen, 1st Dec 2004 AW-OAC team.
IPHAS Early Data Release E. A. Gonzalez-Solares IPHAS Consortium AstroGrid National Astronomy Meeting, 2007.
What is Firefly (1) A web UI framework for web applications
The OmegaCAM photometric systems. OmegaCAM calibration plan (1) Basic OmegaCAM terminology : ● Key bands vs User bands ● Key band -> User band transformation.
Spotting the life of stars „Pi of the Sky” Project Katarzyna Kwiecińska UKSW-SNŚ on behalf of the Pi of the Sky collaboration.
Selection and Characterization of Interesting Grism Spectra Gerhardt R. Meurer The Johns Hopkins University Gerhardt R. Meurer The Johns Hopkins University.
26th October 2005 HST Calibration Workshop 1 The New GSC-II and it’s Use for HST Brian McLean Archive Sciences Branch.
GSPC -II Program GOAL: extend GSPC-I photometry to B = V ˜ 20 add R band to calibrate red second-epoch surveys HOW: take B,V,R CCD exposures centered at.
The First Release of AST3 point source catalog from Dome A, Antarctica in 2012 MA Bin, WEI Peng, SHANG Zhaohui, LIU Qiang, HU Yi (NAOC) 973 课题二
Pi of the Sky off-line experiment with GLORIA Ariel Majcher National Centre for Nuclear Research Warsaw, Poland 10th INTEGRAL/BART Workshop, April.
T. Axelrod, NASA Asteroid Grand Challenge, Houston, Oct 1, 2013 Improving NEO Discovery Efficiency With Citizen Science Tim Axelrod LSST EPO Scientist.
From LSE-30: Observatory System Spec.
Photometric Analysis of Asteroids
Presentation transcript:

Palomar Transient Factory Data Flow Jason Surace IPAC/Caltech

Complicated Data Path Data flows through multiple pipelines, creating a variety of science products tailored for different purposes. These pipelines operate on multiple timescales. What data you want depends in large part on what science you want to do. Realtime Data Processing – image subtraction, transient and solar system object detection. High Fidelity Daily Processing – nightly processing and recalibration for highest data quality images and source catalogs. Ensemble Processing – periodic construction of coadded images, processing of catalogs to create high precision light curves. Long-term Data Curation - storage of all raw data, processed data (images and extracted photometry), and an advanced data archive with data exploration tools, with public release.

P48 Caltech/C ahill NERSC Image Subtraction and Transient Detection/RB Pipeline Ingest Realtime Image Subtraction Pipeline Photometric Pipeline Reference Pipeline Lightcurve Pipeline Transient Candidates Lightcurves Reference Catalogs Epochal Images and Catalogs IPAC Moving Object Pipeline SSOs Reference Images

Data Transfer Data flows from the 48-inch and the PTF camera system via high-speed microwave link through a relay node at the San Diego Supercomputing Center to Cahill at Caltech. From there it forks to two places: NERSC at LBNL and IPAC at Caltech. Raw data moves as a multi-extension FITS file containing all 12 CCD images in an exposure, along with header metadata. This is the raw data product, and will not be used by many of you.

Raw Data 12 chips extracted from the MEF file and moaicked together. Dead CCD

NERSC Realtime Pipeline (see talk by Peter Nugent on day 1) NERSC operates a realtime data pipeline which performs basic calibration, image subtraction against a reference image dataset, transient candidate detection, and candidate vetting via the RealBogus software. This is the feed-in for the existing extragalactic transient marshall. Designed around SNe detection. Most of the SNe work you have seen has come from this pipeline. In-collaboration dataset; currently transferring this functionality for the IPAC realtime pipeline, which will be the basis for future ZTF alerts.

IPAC Data Ingest Data flows in realtime to IPAC. Upon receipt, the MEF files are broken up into individual CCD files. PTF data system processes all the CCDs wholly independently. Metadata about all the images goes into an operations database. Data receive an initial WCS. All the data are stored on spinning disk and in a deep tape archive. You are here. PTF data lives is here.

Infrared Processing and Analysis Center Multi-mission Science Center (IRAS, ISO, Spitzer WISE, Herschel, Planck, 2MASS, etc) Maintains several data rooms. ~1TB of data every 4-5 days. 24 drones with 240 cores. Roughly 0.5 PB spinning disk. Associated network equipment. Database and file servers. Archive servers. Tape backup. This will increase by a factor of 10x in the ZTF era! One shudders to imagine LSST, which will be measuring it’s computing power in megawatts. IPAC Morrisroe Computer Center

R-band Holdings 1275 nights, 3.1 million images

g-band Holdings 223 nights, 500 thousand images

H-alpha Holdings 99 nights, 125 thousand images

Realtime Pipeline (see talks by Adam Waszczak and Frank Masci) Pipeline executes as data is received at IPAC. Basic data reduction steps are cloned from the nightly photometric pipeline: bias correction and flat-fielding performed using stored calibration. Limited photometric calibration, but full astrometric calibration. Image subtraction against a reference image library constructed from all the data to-date. In-house software. “Streak detection” for fast-moving objects. Transient candidate detection and extraction via psf-fitting and aperture extraction. Image subtractions and candidate catalogs are pushed to an external gateway where they are picked up by the solar system, localization, and extragalactic marshalls minute phase lag. Not yet publicly available, will be in the ZTF era.

Realtime Pipeline This is a fast streak candidate from the Solar System Marshall.

Photometric Pipeline (see talk by Russ Laher) This pipeline processes data in the traditional manner. Starts up at the end of the night, after all the data has been received. Calibration is derived from the entire night’s worth of data. Specifically, the bias and flat-fields are derived from the data themselves. Photometric calibration is derived from extracted photometry from all sources, fitting color, extinction, time and large-scale spatial variations vs. the SDSS. Typically reach an accuracy of a few %. See talk by Ofek, day 1. Astrometric calibration is done individually at the CCD level, against a combined SDSS and UCAC4 catalog. Typically good to 0.15”. Output from this pipeline are calibrated single-CCD FITS images and single-CCD catalog FITS binary tables (both aperture and psf-fit). These are archived through IRSA. Available 1-3 days after observation. These are publicly available data products.

Photometric Pipeline Output Single R-band thumbnail image of Arp 220, 8 arcminutes across. Aperture extractions catalog (sextractor-based) overlaid. All observations and detections of everything are saved in the archive. Products are a reduced image, bit-encoded data quality mask, and catalogs. All products are FITS.

Reference Image Pipeline Once enough individual observations accumulate, the “reference image” pipeline is triggered. This pipeline coadds the existing data, after selecting “best frames”, e.g. best seeing, photometric conditions, astrometry, etc. Coaddition is done based on CCD id, PTF tile, and filter. These images are the reference of the static sky, at a level deeper than the individual observations. “Reference Catalogs” are extracted from these images. This concept is important, because these are both the underlying basis of the image subtractions, and also the basis of the light-curve pipeline. Like PTF coverage, the depth of these is variable, but is current 5<n<50. Resulting products are FITS images and FITS binary tables. Will be publicly released 1 year from now.

Reference Images Single Image 60 Field 5257, Chip 7, Stack of 34

Deep Sky Coadds aka “Reference Images” * Results not typical. Near Galactic Center.

Light Curve Pipeline Each night, all detected sources from the photometric pipeline are matched against the reference catalog (better than a generic catalog-matching approach). All sources ever seen for a given CCD, PTF tile, and filter combination are loaded and analyzed. Least variable sources used as anchors for the calibration. Image-by-image correction factors computed for that image as a whole and stored as a lookup table. Application of these secondary correction factors improves overall relative calibration to near-millimag levels for bright sources (that part is important). Triggers less frequently (planned weekly updates). Highest level of our products. This is an HD5 and LSD file hierarchy, currently not public. Will be in a couple years.

Example Light Curves Something a little different, these are relatively faint asteroid light curves from Chang et al

Data Products What you can get today: calibrated epochal images and catalog files for select fields. In one year: epochal images and catalog files plus reference images and catalogs for all PTF data through In two years: all PTF data and rolling release of iPTF data including light curves. An additional talk this morning will be a demo of the data archive.