Astrophysical Surveys: Visualization/Data Managements Peter Nugent (LBNL/UCB)

Slides:



Advertisements
Similar presentations
Research School of Astronomy & AstrophysicsSlide 1 SkyMapper SkyMapper and the Stromlo Southern Sky Survey Stefan Keller, Brian Schmidt, Paul Francis and.
Advertisements

Searching for Electromagnetic Counterparts of Gravitational-Wave Transients Marica Branchesi (Università di Urbino/INFN) on behalf of LIGO Scientific Collaboration.
Vestrand Real Time Transient Detection with RAPTOR: Exploring the Path Toward a “Thinking” Telescope Tom Vestrand on behalf of the RAPTOR Team Los Alamos.
GTN The GLAST Telescope Network Involvement for students and teachers in the science of the Swift, GLAST, and XMM-Newton missions Gordon Spear and Tim.
Hubble Science Briefing Hubble Does Double-Duty Science: Finding Planets and Characterizing Stellar Flares in an Old Stellar Population Rachel Osten Space.
The LaSilla/QUEST Variability Survey Charles Baltay Yale University February 7, 2012 A Progress Report.
The Novel Method for Identifying Fast Optical Transients The Application of Multi-Exposure Plates for OT/GRBs analyses René Hudec 1 and Lukáš Hudec 2 1.
Palomar Transient Factory Data Flow Jason Surace IPAC/Caltech.
Pi of the Sky – preparation for GW Advance Detector Era Adam Zadrożny Wilga 2014.
Time Series Photometry; Some Musings Steve B. Howell, NASA Ames Research Center.
SAN DIEGO SUPERCOMPUTER CENTER at the UNIVERSITY OF CALIFORNIA; SAN DIEGO IEEE Symposium of Massive Storage Systems, May 3-5, 2010 Data-Intensive Solutions.
Observational techniques meeting #5. Future surveys Narrow (pencil beam): HDF UDF GOODs Cosmos MCT JWST.
Research Astronomy In Southern NM: Insights From the Sloan Digital Sky Survey (SDSS) Jon Holtzman NMSU Department of Astronomy.
OT and the Systematic Automated All-sky Search for Bright Optical Transients Lior Shamir & Robert J. Nemiroff Abstract Real-time detection of bright.
Tile or Stare? Optimal Search Strategies for GRBs and Afterglows Robert J. Nemiroff Michigan Technological University.
The Plan for Transient Follow-up Observations with Maidanak 1.5m Telescope Kuiyun Huang (ASIAA), Yuji Urata (NCU) Peikang Tsai (NCU), Youichi Ohyama (ASIAA),
A search for supernovae in galaxy clusters at 0.1 < z < 0.2 David J Sand Collaborators: Dennis Zaritsky, Stephane Herbert-Fort, Suresh Sivanandam, Doug.
Follow-up of Wide-Field Transient Surveys Mark Sullivan (University of Oxford)
KDUST Supernova Cosmology
The Transient Universe: AY 250 Spring 2007 Existing Transient Surveys: Optical I: Big Apertures Geoff Bower.
The Decade of Transients S. R. Kulkarni California Institute of Technology Pasadena, USA.
May stars be the actors and dark energy direct shoot a movie in the sky Chihway Chang Oct.8 ‘2008.
Pan-STARRS and TGBN Paul Price Institute for Astronomy University of Hawaii.
Astro-DISC: Astronomy and cosmology applications of distributed super computing.
Teaching Science with Sloan Digital Sky Survey Data GriPhyN/iVDGL Education and Outreach meeting March 1, 2002 Jordan Raddick The Johns Hopkins University.
Realtime follow-up analyses with IceCube / M. Kowalski / Realtime follow-up analyses with IceCube Marek Kowalski Universität Bonn High Energy.
NEO Research Project in Korea Wonyong Han 1, Yong-Ik Byun 2, Hong-Suh Yim 1, Young-Jun Choi 1, Hong-Kyu Moon 1 & NESS Team 1 Korea Astronomy and Space.
The Cosmic Simulator Daniel Kasen (UCB & LBNL) Peter Nugent, Rollin Thomas, Julian Borrill & Christina Siegerist.
National Center for Supercomputing Applications Observational Astronomy NCSA projects radio astronomy: CARMA & SKA optical astronomy: DES & LSST access:
1 New Frontiers with LSST: leveraging world facilities Tony Tyson Director, LSST Project University of California, Davis Science with the 8-10 m telescopes.
Science Update: SN2011fe in M101 (Pinwheel Galaxy) Peter Nugent (LBNL/UCB)
Producing Science with the Palomar Transient Factory Branimir Sesar (MPIA, formerly Caltech)
Padua: 1604 → 2004 – Supernovae as cosmological lighthouses SNLS – The SuperNova Legacy Survey Mark Sullivan (University of Toronto) on behalf of the SNLS.
“The Distance to the Stars: How Do Scientists Know?” Teacher Training Courses in Ohio and Michigan, June 2006.
SNLS: Overview and High-z Spectroscopy D. Andrew Howell (Toronto) for the SNLS Collaboration (see: for full list)
Exploration of the Time Domain S. G. Djorgovski VOEvent Workshop, CACR, Caltech, April 2005.
1 Hot-Wiring the Transient Universe Santa Barbara CA May 12, 2015 LSST + Tony Tyson UC Davis LSST Chief Scientist.
Stellar variability monitoring in open clusters with mini-SONG X.B. Zhang National Astronomical Observatories, Chinese Academy of Sciences.
Making the Sky Searchable: Automatically Organizing the World’s Astronomical Data Sam Roweis, Dustin Lang &
Palomar Transient Factory Data Flow Jason Surace IPAC/Caltech.
First results from The Palomar Transient Factory Eran Ofek Einstein fellow CALTECH.
LSST: Preparing for the Data Avalanche through Partitioning, Parallelization, and Provenance Kirk Borne (Perot Systems Corporation / NASA GSFC and George.
Asteroids, Meteors, Meteoroids, and Meteorites. Chris Lyles Chris Sutton Tyler Kelley By::
EScience May 2007 From Photons to Petabytes: Astronomy in the Era of Large Scale Surveys and Virtual Observatories R. Chris Smith NOAO/CTIO, LSST.
Detection and study of supernovae with the 4m International Liquid Mirror Telescope BRAJESH KUMAR University of Liège, Belgium ARIES, Nainital, India.
Chinese- international collaboration solved the central question: ”How common are planets like the Earth”
CELT Science Case. CELT Science Justification Process Put together a Science Working Group –Bolte, Chuck Steidel, Andrea Ghez, Mike Brown, Judy Cohen,
Hunting youngest Type Ia SNe in the intermediate Palomar Transient Factory Yi Cao (Caltech) On behalf of the intermediate Palomar Transient Factory collaboration.
G. Miknaitis SC2006, Tampa, FL Observational Cosmology at Fermilab: Sloan Digital Sky Survey Dark Energy Survey SNAP Gajus Miknaitis EAG, Fermilab.
The Large Synoptic Survey Telescope: The power of wide-field imaging Michael Strauss, Princeton University.
1 Imaging Surveys: Goals/Challenges May 12, 2005 Luiz da Costa European Southern Observatory.
From photons to catalogs. Cosmological survey in visible/near IR light using 4 complementary techniques to characterize dark energy: I. Cluster Counts.
Pan-STARRS and SNe Paul Price Institute for Astronomy University of Hawaii.
August 10, 2004 Apache Point Observatory, NM FINDING SUPERNOVAE IN A SLICE OF PI Dennis J. Lamenti San Francisco State University.
LSST and VOEvent VOEvent Workshop Pasadena, CA April 13-14, 2005 Tim Axelrod University of Arizona.
The Large Synoptic Survey Telescope Project Bob Mann LSST:UK Project Leader Wide-Field Astronomy Unit, Edinburgh.
1 Small Telescopes and the Photometric Calibration of the Sloan Digital Sky Survey (SDSS) Douglas Tucker.
Dynamic Andromeda: Novae and Variable Stars in M31 Yi Cao 2 nd year graduate at Caltech On Behalf of the Palomar Transient Factory collaboration.
MMT Observation Database for Light Curve Analysis Vladimir Agapov Presentation for the WG1 session 33rd IADC meeting, Houston.
Gamma-Ray Bursts with the ANTARES neutrino telescope S. Escoffier CNRS/CPPM, Marseille.
Spectroscopic Sequence Among SNe Ia Peter Nugent(LBNL/UCB)
Lecture 3 With every passing hour our solar system comes forty-three thousand miles closer to globular cluster 13 in the constellation Hercules, and still.
A Search For Untriggered GRB Afterglows with ROTSE-III Eli Rykoff ROTSE Collaboration University of Michigan Santorini GRB Meeting September 2 nd, 2005.
A Proposed Collaboration Between LIGO-Virgo and Swift to Improve the Chances to Detect Gravitational Waves from Core Collapse Supernovae Kiranjyot (Jasmine)
1 Observations of Cosmic Explosions Review, and results from the Palomar Transient Factory Avishay Gal-Yam | Weizmann Institute of Science Avishay Gal-Yam.
Wide-field Infrared Survey Explorer (WISE) is a NASA infrared- wavelength astronomical space telescope launched on December 14, 2009 It’s an Earth-orbiting.
Bayesian Template-Based Approach to Classifying SDSS-II Supernovae from 3-Year Survey Brian Connolly Photometric Supernova ID Workshop 3/16/12.
Astronomy toolkits and data structures Andrew Jenkins Durham University.
T. Axelrod, NASA Asteroid Grand Challenge, Houston, Oct 1, 2013 Improving NEO Discovery Efficiency With Citizen Science Tim Axelrod LSST EPO Scientist.
Future Trends in Nuclear Physics Computing
Presentation transcript:

Astrophysical Surveys: Visualization/Data Managements Peter Nugent (LBNL/UCB)

INT Exascale Workshop “Current” Optical Surveys Photometric: Palomar Transient Factory La Silla Supernova Search SkyMapper PanSTARRS Spectroscopic: SDSS III (BOSS, SEGUE-II, APOGEE, MARVELS) All of these surveys span astrophysics from planets to cosmology, from the static to the transient universe.

INT Exascale Workshop PTF: Competition The competition were two wide-field multi-color surveys with cadences that were either unpredictable (SkyMapper) or from days to weeks (PanSTARRS) in a given filter. How could we do something better/different? - Start quickly - P48” coupled with the CFHT12k camera - Don’t do multiple colors - Explore the temporal domains in unique ways - Take full advantage of the big-iron at Super-Computing Centers - Get all the science we possibly can out of this program Thus we need the capability of providing immediate follow-up of unique transients, using 4 to 10-m class telescopes.

INT Exascale Workshop Transient Phase-Space

INT Exascale Workshop PTF Science PTF Key Projects Various SNe Dwarf novae Transients in nearby galaxies Core collapse SNe RR Lyrae Solar system objects CVsAGN AM CVn Blazars Galactic dynamics LIGO & Neutrino transients Flare stars Hostless transients Nearby star kinematics Orphan GRB afterglows Rotation in clusters Eclipsing stars and planets Tidal events H-alpha sky-survey The power of PTF resides in its diverse science goals and follow-up.

INT Exascale Workshop PTF ( ) 92 Mpixels, 1” resolution, 60s exposures - 128MB data per minute 2 cadences SN & Dynamic with g in dark time & R in bright time

INT Exascale Workshop PTF Science The power of PTF resides in its diverse science goals and follow-up. And so does the cost… these resources are an order of magnitude more expensive than the survey itself.

INT Exascale Workshop PTF Pipeline

INT Exascale Workshop PTF Subtractions moon 4096 X 2048 CCD images - over 3000 per night NewRefSub

INT Exascale Workshop Pipeline NERSC GLOBAL FILESYSTEM 170TB Data Transfer Nodes Science Gateway Node 2 Science Gateway Node 1 Observatory PTF Collaboration via Web Processing/db Subtractions Carver 3200 core IBM 128 MB/90s 50 GB/night 512 MB/90s 200 GB/night

INT Exascale Workshop PTF Database R-bandg-bandimages1.07M166k subtractions810k50k references18.6k3.0k Candidates487M22.2M Transients * The db selection (psql) and schema was based on getting the fastest and cheapest solution to last the length of the survey and serve the science needs.

INT Exascale Workshop PTF Sky Coverage To date: 1200 Spectroscopically typed supernovae 10 5 Galactic Transients 10 4 Transients in M31

INT Exascale Workshop PTF: Real or Bogus PTF produces 1 million candidates during a typical night: Most of these are not real  Image Artifacts  Misalignment of images due to poor sky conditions  Image saturation from bright stars 50k are asteroids 1-2k are variable stars 100 supernovae 3-4 new, young supernovae or other explosions

INT Exascale Workshop Real or Bogus moon 4096 X 2048 CCD images - over 3000 per night

INT Exascale Workshop Real or Bogus 230 bogus candidates, 2 variable stars, 4 asteroids and the youngest Type Ia supernovae observed to date. PTF10ygu: Caught 2 days after explosion

INT Exascale Workshop Real or Bogus Scanners helped vet over 1000 candidates and weights/biases were determined for each scanner.

INT Exascale Workshop Real or Bogus There is a natural cut at a value of 0.1, but we go to 2 w/ 0.07 to be safe.

INT Exascale Workshop Users…

Citizen Scientists… supernova.galaxyzoo.org is now up and running! A beta version appeared last year to support the SN Ia program in PTF and a WHT spectroscopy run. I spent a week with the folks at Oxford setting up the db and giving them training sets of good and bad candidates. They did the rest… 1200 members of galaxy zoo screened all the candidates between Aug 1 and Aug 12 in 3 hrs. The top 50 hits were all SNe/variable stars and they found 3 before we did. They scanned ~25,000 objects - 3 objects/min. They now do ~200 nightly and we have 15,000 users.

INT Exascale Workshop Robot A robot (built by Josh Bloom at UCB) queries the db every 20 min and compares new transients with archival information to ascertain its likely nature and publishes them to the collaboration - classification.

INT Exascale Workshop Robot Complications to traditional methods include varying uncertainties in data, non-structured temporal sequence (bad weather, etc.), differing levels of historical information (in SDSS or not, known host in NED, etc.) And this is just for stars…we also have ones for SNe, AGN…

INT Exascale Workshop Turn-around The scanning is handled in three ways: (1)Individuals can look through anything they want and save things to the PTF database (2)SN Zoo (3)UCB machine learning algorithm is applied to all candidates and reports are generated on the best targets and what they are likely to be (SN, AGN, varstar) by comparison to extant catalogs as well as the PTF reference catalog. These come out ~15 min after a group of subtractions are loaded into the database. On June 3, 2010 we were able to photometrically screen 4 SN candidates with the Palomar 60” telescope in g, r and i-band (50% of the time on P60 is devoted to this) within 2.5 hrs of discovery on the Palomar Schmidt and take spectra of them at Keck the same night. Now a nightly occurrence.

INT Exascale Workshop Robot -10vdl Discovery and follow-up of PTF 10vdl a SN II.

INT Exascale Workshop PTF Totals Transients = 1220 Papers = 20 In addition to these we have followed 2 triggers from IceCube and one from LIGO. We estimate that at the end of the survey we will have 40B detections in the individual images and 40B detections in the deep co-additions.

INT Exascale Workshop Historical Data The DeepSky program was started in response to the needs of several astrophysics projects hosted at NERSC. The result of this project is an all-sky digital image based upon the point-and-stare observations taken via the Palomar- QUEST Consortium and the SN Factory & Near Earth Asteroid Team. This data (over 9 million ccd images) spans 7 years and almost 20,000 square degrees, with typically pointing on a particular part of the sky.

INT Exascale Workshop Historical Data (Own) Deep co-additions of many images go into making a reference image. Requires HPC to handle the sheer volume of data and the solution to large linear systems to normalize the images to each other before the co-additions. Science can be had from either the reference or the individual images that went into it.

INT Exascale Workshop SNe Ia 810 SNe Ia discovered to date. The cosmology dataset (total of 153) has been followed with a ~2-3 day cadence in gri with LT and FTN/FTS And PTF in R/g-band

INT Exascale Workshop PTF Totals

INT Exascale Workshop Near Future Next Generation Transient Survey (aka PTF-II) - Upgrade to 5X PTF: 36 sq. deg. (~ 1 billion pixels) - Would like to explore the sky on 100s timescales - Turnaround in minutes with list of new candidates - Ingest SDSS, BOSS, NED, etc. catalogs to refine our understanding of these candidates in real-time - Able to handle Advanced LIGO, neutrino detectors, etc.

INT Exascale Workshop Bottlenecks… NERSC GLOBAL FILESYSTEM 240TB Data Transfer Nodes Science Gateway Node 2 Science Gateway Node 1 Observatory PTF Classification Processing/db Carver Subtractions 2.5 MB/s 1 GB/s 12 MB/s 4 MB/s (crude) 0.5 MB/s (full)

INT Exascale Workshop Bottlenecks…crude vs. real time brightness 5-  data in db

INT Exascale Workshop Heavy Random I/O SC09 Storage challenge allowed us to couple both the SDSS db and the PTF candidate db to ask the question, which objects that we think are quasars in the static SDSS data vary like one in the PTF data. PTF db is now 250GB and growing nightly…

INT Exascale Workshop Heavy Random I/O + analytics Aster Data provides a parallel db solution that also allows us to embed many of our machine learning algorithms. Already handle PB datasets. Likely will couple both solutions (Aster + SSD).

INT Exascale Workshop Conclusions - Future LSST - 15TB data/night Only one 30-m telescope

INT Exascale Workshop Conclusions - Future Scaling issues: Much larger volume of data (100X) Much larger community (100X) Greater science scope, not just transients Much larger set of historical resources one would wish to query against Follow-up resources (All the 8-10-meter & the 30-meter telescopes) much more expensive