Rule-based Cross-matching of Very Large Catalogs Patrick Ogle and the NED Team IPAC, California Institute of Technology.

Slides:



Advertisements
Similar presentations
An Experience of Aladin Usage for the RC Catalog Radio Source Investigation Zhelenkova Olga, SAO RAS.
Advertisements

15 years of science with Chandra– Boston 20141/16 Faint z>4 AGNs in GOODS-S looking for contributors to reionization Giallongo, Grazian, Fiore et al. (Candels.
Patrick Ogle, Lee Armus, Bob Narron, Carl Grillmair, Justin Howell, IRS IST.
Markov-Chain Monte Carlo
Spike Sorting I: Bijan Pesaran New York University.
Compiled quasar catalog from LAMOST DR1
GLAST Science Support CenterAugust 9, 2004 Likelihood Analysis of LAT Data James Chiang (GLAST SSC – SLAC)
Star-Forming Galaxies in a Nearby Group: Abell Instituto de Astrofísica de Andalucía-CSIC Reverte-Payá 1, D.; Vílchez 1, J. M. & Iglesias-Páramo.
RESULTS AND ANALYSIS Mass determination Kauffmann et al. determined masses using SDSS spectra (Hdelta & D4000) Comparison with our determination: Relative.
Evolution of Luminous Galaxy Pairs out to z=1.2 in the HST/ACS COSMOS Field Jeyhan Kartaltepe, IfA, Hawaii Dave Sanders, IfA, Hawaii Nick Scoville, Caltech.
Error Propagation. Uncertainty Uncertainty reflects the knowledge that a measured value is related to the mean. Probable error is the range from the mean.
Bayesian Analysis of X-ray Luminosity Functions A. Ptak (JHU) Abstract Often only a relatively small number of sources of a given class are detected in.
Deriving and fitting LogN-LogS distributions Andreas Zezas Harvard-Smithsonian Center for Astrophysics.
C&A 10April06 1 Point Source Detection and Localization Using the UW HealPixel database Toby Burnett University of Washington.
Evolution of Luminous Galaxy Pairs out to z=1.2 in the HST/ACS COSMOS Field Jeyhan Kartaltepe, IfA, Hawaii Dave Sanders, IfA, Hawaii Nick Scoville, Caltech.
CXC CUC September SDS Page 1 CUC Sep 2007 Science Data Systems – Jonathan McDowell.
EE513 Audio Signals and Systems Statistical Pattern Classification Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
The ASDC SED Builder Milvia Capalbi (INAF-ASDC) in collaboration with Paolo Giommi (ASI-ASDC), Giulia Stratta (INAF-ASDC), Roberto Primavera (ElsagDatamat)
What can we learn from the luminosity function and color studies? THE SDSS GALAXIES AT REDSHIFT 0.1.
How to start an AGN: the role of host galaxy environment Rachel Gilmour (ESO Chile & IfA, Edinburgh) Philip Best (Edinburgh), Omar Almaini & Meghan Gray.
Luminosity and Mass functions in spectroscopically-selected groups at z~0.5 George Hau, Durham University Dave Wilman (MPE) Mike Balogh (Waterloo) Richard.
X-ray Surveys with Space Observatory Khyung Hee University Kim MinBae Park Jisook.
1 GALEX Angular Correlation Function … or about the Galactic extinction effects.
Peter Capak Associate Research Scientist IPAC/Caltech.
Scientific objectives for XEUS: Galaxies Groups and Clusters at z~2 Study of the Evolution of clusters in the mass range kT > 2 keV up to z=2. Dynamics,
PACS NHSC SPIRE Point Source Spectroscopy Webinar 21 March 2012 David Shupe, Bernhard Schulz, Kevin Xu on behalf of the SPIRE ICC Extracting Photometry.
Seminars on formation and evolution of the Galaxy Feb 12, 2002 The construction of the GSC2.2 Catalog Mario G. Lattanzi Osservatorio Astronomico di Torino.
CS5263 Bioinformatics Lecture 20 Practical issues in motif finding Final project.
Radio-optical analysis of extended radio sources in the FLS field 2009 SA SKA Postgraduate Bursary Conference 4 th Annual Postgraduate Bursary Conference.
MARK CORRELATIONS AND OPTIMAL WEIGHTS ( Cai, Bernstein & Sheth 2010 )
Martin et al. Goal-determine the evolution of the IRX and extinction and relate to evolution of star formation rate as a function of stellar mass.
High redshift cluster search (SA22 field) KIAS SSG workshop Jae-Woo Kim (SNU) M. Im, S.-K. Lee & M. Hyun (SNU)
Image Restoration.
The Evolution of AGN Obscuration
The Environmental Effect on the UV Color-Magnitude Relation of Early-type Galaxies Hwihyun Kim Journal Club 10/24/2008 Schawinski et al. 2007, ApJS 173,
PACS NHSC Data Processing Workshop – Pasadena 10 th - 14 th Sep 2012 Measuring Photometry from SPIRE Observations Presenter: David Shupe (NHSC/IPAC) on.
Source catalog generation Aim: Build the LAT source catalog (1, 3, 5 years) Jean Ballet, CEA SaclayGSFC, 29 June 2005 Four main functions: Find unknown.
Initial Results from the Chandra Shallow X-ray Survey in the NDWFS in Boötes S. Murray, C. Jones, W. Forman, A. Kenter, A. Vikhlinin, P. Green, D. Fabricant,
The Evolution of AGN Obscuration
1 Statistics, Data Analysis and Image Processing Lectures Vlad Stolojan Advanced Technology Institute University of Surrey.
Radiation Detection and Measurement, JU, First Semester, (Saed Dababneh). 1 Counting Statistics and Error Prediction Poisson Distribution ( p.
Photometric Redshifts in Astro-Wise PhotRedCatalog Astro-Wise Workshop Leiden 2008 Jan Snigula, MPE.
Revised GALEX Ultraviolet Catalog of Globular Clusters in M31 Kyungsook Lee (1), Soo-Chang Rey (1), Sangmo Tony Sohn (2), and GALEX Science Team (1) Department.
X-ray sky survey of bright, serendipitous sources with 2XMMi at the AIP Speaker: Alexander Kolodzig Origin: Humboldt-Uni Berlin, Germany Institute:AIP.
Observational Test of Halo Model: an empirical approach Mehri Torki Bob Nichol.
Difference Image Analysis at OAC Groningen, 1st Dec 2004 AW-OAC team.
LAMOST 补充星系样本和LAMOST-SDSS星系对样本
Metal abundance evolution in distant galaxy clusters observed by XMM-Newton Alessandro Baldi Astronomy Dept. - University of Bologna INAF - OABO In collaboration.
Lecture 10 The catalog of sources –Resolved sources –Selection biases –Luminosity (and mass) functions –Volume- vs flux-limited surveys. Cross-matching.
The Evolution of AGN Obscuration Ezequiel Treister (ESO) Meg Urry (Yale) Julian Krolik (JHU)
NASSP Masters 5003F - Computational Astronomy Lecture 7 Confusion Dynamic range Resolved sources Selection biases Luminosity (and mass) functions.
Selection and Characterization of Interesting Grism Spectra Gerhardt R. Meurer The Johns Hopkins University Gerhardt R. Meurer The Johns Hopkins University.
LOGO Recognition and Measuremeant for LAMOST Galaxy Spectra 张健楠 天水 2015.
Extragalactic Survey with MAXI and First MAXI/GSC Catalog Extragalactic Survey with MAXI and First MAXI/GSC Catalog Yoshihiro Ueda Kazuo Hiroi, Naoki Isobe,
1 Giuseppe Romeo Voronoi based Source Detection. 2 Voronoi cell The Voronoi tessellation is constructed as follows: for each data point  i (also called.
The STIS NUV-MAMA objective prism … … and looking beyond for HST UV slitless spectroscopy Jes ú s Ma í z Apell á niz HST Calibration worskhop 26 October.
In conclusion the intensity level of the CCD is linear up to the saturation limit, but there is a spilling of charges well before the saturation if.
Wide-field Infrared Survey Explorer (WISE) is a NASA infrared- wavelength astronomical space telescope launched on December 14, 2009 It’s an Earth-orbiting.
Color Magnitude Diagram VG. So we want a color magnitude diagram for AGN so that by looking at the color of an AGN we can get its luminosity –But AGN.
Bayesian Template-Based Approach to Classifying SDSS-II Supernovae from 3-Year Survey Brian Connolly Photometric Supernova ID Workshop 3/16/12.
YOUR LOGO The FUS(FIRST-UKIDSS-SDSS) RED Quasar Survey Dohyeong Kim 서울대학교 천문학과 김도형
All-sky source search Aim: Look for a fast method to find sources over the whole sky Selection criteria Algorithms Iteration Simulations Energy information.
JWST FGS Guide Star Studies
K(2 m) Version of JASMINE and its Science
Photometric redshift estimation.
Procrustes Shape Analysis Verification Tool
Jean Ballet, CEA Saclay GSFC, 31 May 2006 All-sky source search
Basics of Photometry.
The dust attenuation in the galaxy merger Mrk848
Galaxy Classification in the WISE Color-Luminosity Diagram
Presentation transcript:

Rule-based Cross-matching of Very Large Catalogs Patrick Ogle and the NED Team IPAC, California Institute of Technology

NASA Extragalactic Database (NED) A fusion of multi-wavelength extragalactic data from journal articles and large catalogs

NED Holdings (October 2014) 2MASS PSC And much more, including classifications, notes, images, spectra…

New Cross-matching Algorithm Very Large Catalogs (VLCs, >10 7 sources) Find candidate matches in NED Select best match – Rule-based – Statistical analysis Match data recorded in DB Reversible and iterable GALEX ASC (NUV) vs. SDSS DR6 (gri, 6’x6’)

Cross-match Inputs VLC Source and NED Object Positions (RA, Dec, ±)  Source-Object Separation (s, ±σ) Source and Object Types (galaxy, galaxy cluster, star, UV source, etc…) Background Object Density (measured for each source) Instrumental Beam Size Other: redshift, photometry, diameters

NED Pipeline for Very Large Catalogs Source Loader – Load Very Large Catalog (VLC) source names and positions into NED. CSearch (PostgreSQL) – Find match candidates with NED near position search – Count background objects – Spatial indexing will speed up search (e.g. Q3C, HTM) MatchExpert (python) – Select best match from CSearch match candidates – Object associations for no-matches – Record match statistics for each match – Match statistic distributions and integrals – Code migration to DBMS for speed Object Loader (PostgreSQL) – Create NED cross-IDs – new objects – associations CSearch MatchEx Object Loader Source Loader

MatchEx Logic Match List from Csearch S<Scut P>Pcut Type Match Name Prefix Match Error Circles Overlap S1/S2 <0.33 NED dup. No Match Single Good Match Match Create NED object and associations NED Cross-ID Thresholds

Associations Where a match is not made to a nearby object, an association record may be created. Association types: – Source and object position error circles overlap (  ) – Object is within the beam (PSF) of the source (  ) No Match Error Circles Overlap S<beam Create In Beam Association record Create Error Overlap Association record

Application to GALEX ASC Catalog GALEX ASC (NUV) vs. NED Background region NED object GALEX search region SDSS DR6 (g,r,i) SDSS DR6 (gri, 6’x6’) GALEX All-Sky Catalog of ~40 milllion unique NUV sources created by M. Seibert (2012) Matched against ~180 million NED objects (2013)

Poisson Match Probability Search radius: r s = 7.5″ for GALEX Background radius: r b = 46.5″ for GALEX Density of background NED objects: n = N/(πr b 2 ) Expected number inside s: = N(s/r b ) 2, s = separation Poisson probability of x = k objects closer than s: – P s (x=k) = k exp(- )/k! – For k=0, simplifies to: P s (x=0) = exp(- ) = exp(-N(s/r b ) 2 ) False-match probability: P f = 1-P s (0) s rsrs rbrb Example: N = 4, s/r b = 0.08 P s (0) = P f = 0.025

Optimizing Match Selection Optimize on 100K subsample in SDSS region False-positive rate decreases with increasing Poisson cutoff. False negative rate increases with Poisson cutoff. Give 10x weight to false positives--it’s worse to make an incorrect match than to miss a match. Poisson cutoff value of 90% minimizes the combined, weighted error rate.

39,570,031 input GALEX ASC UV sources NED (2013) contained ~180 million distinct objects 10,595,382 (26.8%) of the ASC sources matched NED objects  Cross-IDs 28,974,649 (73.2%) are not matched  new NED objects – 68.2% of GASC sources are in blank NED fields – 5.0% have multiple match candidates GALEX ASC Match Results: Totals Image credit : GALEX NASA/JPL-Caltech/SSC

GALEX ASC Match Results: Background Rejection and False-Negative Rate Uncorrelated background out to 15 arcsec fit by straight line: dN/ds ~ s MatchEx is successful at filtering out this background. False-negative rate f n = 2.4% estimated by comparison to background-subtracted match candidates (red line). Separation (arcsec) false negatives

GALEX ASC Results: False Positive Rate The false-positive match rate is estimated by summing the Poisson statistic (1-P) over all matches and dividing by the total number of sources : f p =0.25% Number

GALEX ASC Results: Position Error Distribution The distribution of normalized separation r=s/σ deviates from a Gaussian. The peak is at 0.9 instead of 1.0, and the tail is stronger. r=s/σ Derivative of a Gaussian Important Lessons Learned: 1.Do not assume reported catalog position errors are correct. 2.Do not assume position error distributions are Gaussian. 3.A 3.5σ threshold on match separation rejected more candidates than expected. Number

Comparison to SDSS Photometry While no color criteria were used to select matches to GALEX sources, the NUV-g colors of GALEX-SDSS matches were checked: Most matches have -7<NUV-g<7 GALEX ASC range: 14<NUV<24 Detection rate falls at NUV>21.7

Results by Object Type Object Types ordered by candidate match frequency Most GALEX sources matched to galaxies (G) and stars (*) QSO, Galactic star (!*), UV excess object (UvES), and WD* matches overrepresented, as might be expected for a UV-selected catalog. Matches to RadioS, XrayS, GGroup, and GPair candidates were disallowed.

GALEX Photometry in NED GALEX ASC photometry added to NED spectral energy distribution of 3C 382 (CGCG ) Over 145 million GALEX ASC NUV and FUV photometry records added to NED (2 extraction methods per band)

VLCs in NED, now and future GALEX ASC: ~40,000,000 UV sources loaded and matched (2013) GALEX MSC: ~22,000,000 UV sources loaded and matched (2014) Spitzer Source List: ~42,000,000 MIR sources (2014) 2MASS PSC: ~471,000,000 NIR sources loaded (2015 finish) AllWISE: ~748,000,000 MIR sources (2015 start) SDSS DR10: ~469,000,000 Vis sources (2015 start) SDSS DR6: ~154,000,000 Vis sources loaded and matched (out of 217M), excluding sources with undesirable flag values (2008) NED aims to quadruple its object holdings in the next year!