Pan-STARRS Photometric Classification of Supernovae using a Hierarchical Bayesian Model George Miller, Edo Berger, Nathan Sanders Harvard-Smithsonian Center.

Slides:



Advertisements
Similar presentations
SNAP vs. Ground-based Supernova Missions Alex Kim For the SNAP collaboration Lawrence Berkeley National Laboratory January 2003.
Advertisements

Probabilistic models Jouni Tuomisto THL. Outline Deterministic models with probabilistic parameters Hierarchical Bayesian models Bayesian belief nets.
Pattern Recognition and Machine Learning
Bayesian Estimation in MARK
Flipping A Biased Coin Suppose you have a coin with an unknown bias, θ ≡ P(head). You flip the coin multiple times and observe the outcome. From observations,
Markov-Chain Monte Carlo
Chapter 4: Linear Models for Classification
Time Series Photometry; Some Musings Steve B. Howell, NASA Ames Research Center.
BAYESIAN INFERENCE Sampling techniques
Computing the Posterior Probability The posterior probability distribution contains the complete information concerning the parameters, but need often.
Star-Formation in Close Pairs Selected from the Sloan Digital Sky Survey Overview The effect of galaxy interactions on star formation has been investigated.
Bayesian Analysis of X-ray Luminosity Functions A. Ptak (JHU) Abstract Often only a relatively small number of sources of a given class are detected in.
Lecture 5: Learning models using EM
July 7, 2008SLAC Annual Program ReviewPage 1 Future Dark Energy Surveys R. Wechsler Assistant Professor KIPAC.
The Transient Universe: AY 250 Spring 2007 Existing Transient Surveys: Optical I: Big Apertures Geoff Bower.
End of Chapter 8 Neil Weisenfeld March 28, 2005.
1 SDSS-II Supernova Survey Josh Frieman Leopoldina Dark Energy Conference October 8, 2008 See also: poster by Hubert Lampeitl, talk by Bob Nichol.
The Transient Universe: AY 250 Spring 2007 Future Observations of SN IA Geoff Bower.
G. Cowan Lectures on Statistical Data Analysis Lecture 10 page 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem 2Random variables and.
Natalie RoeSNAP/SCP Journal Club “Identification of Type Ia Supernovae at Redshift 1.3 and Beyond with the Advanced Camera for Surveys on HST” Riess, Strolger,
Overview G. Jogesh Babu. Probability theory Probability is all about flip of a coin Conditional probability & Bayes theorem (Bayesian analysis) Expectation,
NAOKI YASUDA, MAMORU DOI (UTOKYO), AND TOMOKI MOROKUMA (NAOJ) SN Survey with HSC.
Applications of Bayesian sensitivity and uncertainty analysis to the statistical analysis of computer simulators for carbon dynamics Marc Kennedy Clive.
Padua: 1604 → 2004 – Supernovae as cosmological lighthouses SNLS – The SuperNova Legacy Survey Mark Sullivan (University of Toronto) on behalf of the SNLS.
SNLS: Overview and High-z Spectroscopy D. Andrew Howell (Toronto) for the SNLS Collaboration (see: for full list)
LSST AGN TODO Items Gordon Richards (Drexel University)
Mixture Models, Monte Carlo, Bayesian Updating and Dynamic Models Mike West Computing Science and Statistics, Vol. 24, pp , 1993.
1 System wide optimization for dark energy science: DESC-LSST collaborations Tony Tyson LSST Dark Energy Science Collaboration meeting June 12-13, 2012.
Center for Radiative Shock Hydrodynamics Fall 2011 Review Assessment of predictive capability Derek Bingham 1.
Constraining Cosmology with Peculiar Velocities of Type Ia Supernovae Cosmo 2007 Troels Haugbølle Institute for Physics & Astronomy,
How Standard are Cosmological Standard Candles? Mathew Smith and Collaborators (UCT, ICG, Munich, LCOGT and SDSS-II) SKA Bursary Conference 02/12/2010.
MACHINE LEARNING 8. Clustering. Motivation Based on E ALPAYDIN 2004 Introduction to Machine Learning © The MIT Press (V1.1) 2  Classification problem:
Hunting youngest Type Ia SNe in the intermediate Palomar Transient Factory Yi Cao (Caltech) On behalf of the intermediate Palomar Transient Factory collaboration.
The SNLS has been allocated large amount of spectroscopic follow-up time at the VLT, Gemini North and South, Keck and Magellan. Example of a spectrum of.
Type Ia Supernovae and the Acceleration of the Universe: Results from the ESSENCE Supernova Survey Kevin Krisciunas, 5 April 2008.
SNAP Calibration Program Steps to Spectrophotometric Calibration The SNAP (Supernova / Acceleration Probe) mission’s primary science.
BAOs SDSS, DES, WFMOS teams (Bob Nichol, ICG Portsmouth)
HST ACS data LSST: ~40 galaxies per sq.arcmin. LSST CD-1 Review SLAC, Menlo Park, CA November 1 - 3, LSST will achieve percent level statistical.
Lecture 2: Statistical learning primer for biologists
3rd International Workshop on Dark Matter, Dark Energy and Matter-Antimatter Asymmetry NTHU & NTU, Dec 27—31, 2012 Likelihood of the Matter Power Spectrum.
Unsupervised Mining of Statistical Temporal Structures in Video Liu ze yuan May 15,2011.
Probabilistic models Jouni Tuomisto THL. Outline Deterministic models with probabilistic parameters Hierarchical Bayesian models Bayesian belief nets.
J. Jasche, Bayesian LSS Inference Jens Jasche La Thuile, 11 March 2012 Bayesian Large Scale Structure inference.
Searching High-z Supernovae with HSC and WFMOS
Emission Line Galaxy Targeting for BigBOSS Nick Mostek Lawrence Berkeley National Lab BigBOSS Science Meeting Novemenber 19, 2009.
CS Statistical Machine learning Lecture 25 Yuan (Alan) Qi Purdue CS Nov
Brenna Flaugher for the DES Collaboration; DPF Meeting August 27, 2004 Riverside,CA Fermilab, U Illinois, U Chicago, LBNL, CTIO/NOAO 1 Dark Energy and.
Towards cosmology with a million supernovae: The BEAMS method Renée Hlozek Phys. Rev. D In collaboration with Bruce Bassett Martin Kunz.
1.INTRODUCTION Supernovae Type Ia (SNeIa) Good candidate for standard candle to the high-z Universe (redshift
Budapest Group Eötvös University MAGPOP kick-off meeting Cassis 2005 January
Bayesian Template-Based Approach to Classifying SDSS-II Supernovae from 3-Year Survey Brian Connolly Photometric Supernova ID Workshop 3/16/12.
Ch 1. Introduction Pattern Recognition and Machine Learning, C. M. Bishop, Updated by J.-H. Eom (2 nd round revision) Summarized by K.-I.
Gina Moraila University of Arizona April 21, 2012.
Overview G. Jogesh Babu. R Programming environment Introduction to R programming language R is an integrated suite of software facilities for data manipulation,
PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 1: INTRODUCTION.
Generalization Performance of Exchange Monte Carlo Method for Normal Mixture Models Kenji Nagata, Sumio Watanabe Tokyo Institute of Technology.
Bayesian Semi-Parametric Multiple Shrinkage
Reducing Photometric Redshift Uncertainties Through Galaxy Clustering
A photometric method to classify high-z supernovae found with HSC
Bayesian data analysis
Transfer Learning in Astronomy: A New Machine Learning Paradigm
CSCI 5822 Probabilistic Models of Human and Machine Learning
OVERVIEW OF BAYESIAN INFERENCE: PART 1
Disentangling Dust Extinction and
Disentangling Dust Extinction and
'Linear Hierarchical Models'
Multivariate Methods Berlin Chen
Internet Traffic Classification Using Bayesian Analysis Techniques
Disentangling Dust Extinction and
Yalchin Efendiev Texas A&M University
Presentation transcript:

Pan-STARRS Photometric Classification of Supernovae using a Hierarchical Bayesian Model George Miller, Edo Berger, Nathan Sanders Harvard-Smithsonian Center for Astrophysics

1.8m f/4.4 telescope with 3.2 degree FOV and 1.6 Gpix camera PS1 Sky Surveys: May 2010 – March 2014 Medium Deep 10 fields of 7 sq. deg. with a 3-day staggered cadence and grizy filters Full reprocessing MD.PV3 underway with full forced photometry Currently the best precursor and source of training data for LSST classification tools – Similar cadence (~3 days) and sensitivity – Similar grizy band-passes (+u for LSST) – Wide variety of science drivers and a large distributed collaboration PS1 Medium Deep Survey

– MISSION: To develop a joint probabilistic model for the populations of different classes of transients and subsequent observations of individual objects. Allow this model to be fit using PS1 classified (~20%) and unclassified (~80%) data and then be applied to achieve classification and parameter inference on new objects, e.g. from LSST. Implementation

– MISSION: To develop a joint probabilistic model for the populations of different classes of transients and subsequent observations of individual objects. Allow this model to be fit using PS1 classified (~20%) and unclassified (~80%) data and then be applied to achieve classification and parameter inference on new objects, e.g. from LSST. – COMPUTATIONAL METHODS: STAN Implementation Stan is a probabilistic programming language, written in C++ with python and R interfaces. Uses a “no U-turn” (NUTS) Hamiltonian Monte Carlo (HMC) sampler, which adaptively refines the Metropolis step size by the transversal of the posterior. Incorporates algorithms for automatic differentiation and adaptive tuning parameter refinement

PS1/MDS Spectroscopic Sample Sample of 514 events with spectroscopic follow up (~150 nights on MMT, Magellan, Gemini by the Harvard PS1 team) 69% Ia 22% IIP/n 4% Ib/c 4% ULSN, TDE, etc. 1% Unclassified

A hF [N CL,N F ] t 1 [N CL,N F,N SN ] β [N CL,N F,N SN ] τ rise [N CL,N F,N SN ] τ fall [N CL,N F,N SN ] M [N] t 0 [N CL,N F,N SN ] t hSNF [N CL,N F,N SN,2] t hF [N CL,N F,2] τ hSNF [N CL,N F,N SN,2] τ hF [N CL,N F,2] β hSNF [N CL,N F,N SN ] β hF [N CL,N F ] t h [N CL,2] λ [N SN,2] τ h [N CL,2] Β h [N CL ] c [N CL,N SN ] A [N CL,N F,N SN ] σ [N CL,N F,N SN ] HG [N CL,N F,N SN ] A hSNF [N CL,N F,N SN ] c hSNF [N CL,N F,N SN ] HG hSNF [N CL,N F,N SN ] σ hF [N CL,N F ] σ hSNF [N CL,N F,N SN ] chch A h [N CL ] HG h [N CL ] σ h [N CL ] Flux [N] λ [N SN ] λ [N SN,2] λ [N SN ] λ [N SN ] λ [N SN ]

Adapted from the ‘non- I a’ lightcurve model (Bazin+ 2009) Required a functional form sufficiently generic to fit a wide array of possible SN lightcurves Added second time parameter and linear decay function to allow for plateau effect near and after maximum light

Confirmed Type IIP Confirmed Type Ia Individual fits using PyMC MH

Adapted from Bazin ‘non- I a’ lightcurve model Required a functional form sufficiently generic to fit a wide array of possible SN lightcurves Added second time parameter and linear decay function to allow for plateau effect near and after maximum light Additional normal error term to encapuslate intrinsic scatter in flux and allow for overdispersion Additional terms for host galaxy information

A hF [N CL,N F ] t 1 [N CL,N F,N SN ] β [N CL,N F,N SN ] τ rise [N CL,N F,N SN ] τ fall [N CL,N F,N SN ] M [N] t 0 [N CL,N F,N SN ] t hSNF [N CL,N F,N SN,2] t hF [N CL,N F,2] τ hSNF [N CL,N F,N SN,2] τ hF [N CL,N F,2] β hSNF [N CL,N F,N SN ] β hF [N CL,N F ] t h [N CL,2] λ [N SN,2] τ h [N CL,2] Β h [N CL ] c [N CL,N SN ] A [N CL,N F,N SN ] σ [N CL,N F,N SN ] HG [N CL,N F,N SN ] A hSNF [N CL,N F,N SN ] c hSNF [N CL,N F,N SN ] HG hSNF [N CL,N F,N SN ] σ hF [N CL,N F ] σ hSNF [N CL,N F,N SN ] chch A h [N CL ] HG h [N CL ] σ h [N CL ] Flux [N] λ [N SN ] λ [N SN,2] λ [N SN ] λ [N SN ] λ [N SN ]

Hierarchical Bayes Probabilistic Connection y i [N] θ i [N] φ Model in which parameter dependences can be constructed on multiple structured levels. A common example: The one-way normal model θ = (θ i ) φ = (μ, τ)

Hierarchical Bayes Probabilistic Connection y [N] φ Model in which parameter dependences can be constructed on multiple structured levels. A common example: The one-way normal model φ = (μ, τ) Non-centered parameterizations exchanges certain dependences between hyperparameters with correlations between hyperparameters and the data ν [N] Linear Connection

Hierarchical Bayes Probabilistic Connection Deterministic Connection τ rise hF [N F ] τ rise h τ rise [N F,N SN ] M [N] Three level model, with a top level hyperparameters for each parameter, and 4 mid level hyperparameters for each filter Non-centered parameterization to remove correlations between hyperparameters

Hierarchical Bayes Probabilistic Connection Deterministic Connection τ rise hF [N F ] τ rise h τ rise [N F,N SN ] M [N] τ rise hSNF [N F,N SN ] Three level model, with a top level hyperparameters for each parameter, and 5 mid level hyperparameters for each filter Non-centered parameterization to remove correlations between hyperparameters Adopt normal hyperprior for all location (mean) hyperparameters and half-cauchy distribution for all scale (variance) hyperparameters May then feed point estimates and test quantities into other Machine Learning classification algorithms

Ia vs. non-Ia AdaBoost classification Without redshift information With redshift information

A hF [N CL,N F ] t 1 [N CL,N F,N SN ] β [N CL,N F,N SN ] τ rise [N CL,N F,N SN ] τ fall [N CL,N F,N SN ] M [N] t 0 [N CL,N F,N SN ] t hSNF [N CL,N F,N SN,2] t hF [N CL,N F,2] τ hSNF [N CL,N F,N SN,2] τ hF [N CL,N F,2] β hSNF [N CL,N F,N SN ] β hF [N CL,N F ] t h [N CL,2] λ [N SN,2] τ h [N CL,2] Β h [N CL ] c [N CL,N SN ] A [N CL,N F,N SN ] σ [N CL,N F,N SN ] HG [N CL,N F,N SN ] A hSNF [N CL,N F,N SN ] c hSNF [N CL,N F,N SN ] HG hSNF [N CL,N F,N SN ] σ hF [N CL,N F ] σ hSNF [N CL,N F,N SN ] chch A h [N CL ] HG h [N CL ] σ h [N CL ] Flux [N] λ [N SN ] λ [N SN,2] λ [N SN ] λ [N SN ] λ [N SN ]

Categorical Mixture Model Probabilistic Connection Linear Connection Top level multinomial simplex parameter controlling the classes of supernovae τ rise hF [N CL,N F ] τ rise h [N CL ] τ rise [N CL,N F,N SN ] τ rise hSNF [N CL,N F,N SN ] λ [N SN ] Produces full posterior probability of classification for each event observed. Will likely need to set more informative priors on hyperparameters to allow for convergence across the entire simplex space.

Online predictions and computational scale Probabilistic Connection Linear Connection Run model on full database and save marginal posteriors for hyperparameters Use these distributions as priors for new analysis of unclassified events Better method would use classified samples as auxiliary data and construct the hierarchical model using both tagged and untagged data This may not be computationally feasible on timescales needed for follow-up observations Can fix certain hyperparameters to reduce computational time i.e. for well understood classes such as Ia τ rise hF [N CL,N F ] τ rise h [N CL ] τ rise [N CL,N F,N SN ] τ rise hSNF [N CL,N F,N SN ] λ [N SN ]

To Do Incorporate photo-z information rather than spectroscopic redshifts Model K-corrections as a function of photo-z and SN type Include host-galaxy information (color, positional offset, etc.) Adapt functional light curve model to be more sensitive to SN rise time and shape Better constrain early online predictions Explore Riemannian manifold HMC techniques

Summary Developed joint probabilistic Bayesian Hierarchical model to be fit with PS1 MD classified SN sample Coded model using the HMC probabilistic program STAN to effectively explore the high dimensional space Classification may be accomplished using separate predictive model of fitted parameters, or by developing a full mixture model with simplex parameters Stay tuned for first results!