First Public Data Releases from the VISTA Science Archive

Slides:



Advertisements
Similar presentations
Microsoft Research Microsoft Research Jim Gray Distinguished Engineer Microsoft Research San Francisco SKYSERVER.
Advertisements

26 January 2007 CAL 07 Garching 1 The VISTA Data Flow System Jim Lewis, Mike Irwin, Peter Bunclark, Simon Hodgkin Cambridge Astronomy Survey Unit.
Nicholas Cross, Rob Blake, Ross Collins, Mark Holliman, Mike Read, Eckhard Sutorius, Nigel Hambly, Andy Lawrence, Bob Mann, Keith Noddle Wide Field Astronomy.
Mark Holliman Wide Field Astronomy Unit Institute for Astronomy University of Edinburgh.
Research School of Astronomy & AstrophysicsSlide 1 SkyMapper SkyMapper and the Stromlo Southern Sky Survey Stefan Keller, Brian Schmidt, Paul Francis and.
A W 2 ONLY SEARCH FOR LATE T AND Y DWARFS IN WISE Joana Gomes University of Hertfordshire David Pinfield, Avril Day-Jones, Ben Burningham, Maria Teresa.
Optimal Photometry of Faint Galaxies Kenneth M. Lanzetta Stony Brook University.
AstroGrid: Virtual Observatory Service Nicholas Walton AstroGrid: Virtual Observatory Service Nicholas Walton AstroGrid and Euro-VO Technology Centre Project.
VISTA/WFCAM pipelines summit pipeline: real time DQC verified raw product to Garching standard pipeline: instrumental signature removal, catalogue production,
WFAU Science Archives surveys.roe.ac.uk WFAU Demonstration Nick Cross, Rob Blake, Ross Collins, Mike Read, Eckhard Sutorius, Mark Holliman, Stellios Voutsinas,
Nicholas Cross, Rob Blake, Ross Collins, Mark Holliman, Mike Read, Eckhard Sutorius, Nigel Hambly, Andy Lawrence, Bob Mann, Keith Noddle Wide Field Astronomy.
15 December 2008Science from UKIDSS II WFCAM Science Pipeline Update WFCAM Science Pipeline Update Jim Lewis, Mike Irwin & Marco Riello Cambridge Astronomy.
VISTA pipelines summit pipeline: real time DQC verified raw product to Garching standard pipeline: instrumental signature removal, catalogue production,
ADASS, London Sep 23-26, 2007 VOExplorer : Visualising Data Discovery in the Virtual Observatory Jonathan Tedds University of Leicester/AstroGrid/ EuroVO.
Synergies with multi-wavelength surveys Matt Jarvis University of Hertfordshire.
Introduction to Spitzer and some applications Data products Pipelines Preliminary work K. Nilsson, J.M. Castro Cerón, J.P.U. Fynbo, D.J. Watson, J. Hjorth.
BinX and Astronomy Bob Mann Institute for Astronomy and National e-Science Centre.
18 April 2007 Second Generation VLT Instruments 1 VIRCAM & CPL: Lessons Learned Jim Lewis and Peter Bunclark Cambridge Astronomy Survey Unit.
Data provenance in astronomy Bob Mann Wide-Field Astronomy Unit University of Edinburgh
Commissioning the NOAO Data Management System Howard H. Lanning, Rob Seaman, Chris Smith (National Optical Astronomy Observatory, Data Products Program)
VO as a Data Grid, NeSC ‘03 WFCAM Science Archive Nigel Hambly Wide Field Astronomy Unit Institute for Astronomy, University of Edinburgh.
Cool white dwarfs in the Sloan & SuperCOSMOS Sky Surveys Nigel Hambly, Wide Field Astronomy Unit, IfA, University of Edinburgh.
1 FINAL DESIGN REVIEW | TUCSON, AZ | OCTOBER 21-25, 2013 Name of Meeting Location Date - Change in Slide Master Catalogs Framework Andrew Connolly Simulations.
1 Radio Astronomy in the LSST Era – NRAO, Charlottesville, VA – May 6-8 th LSST Survey Data Products Mario Juric LSST Data Management Project Scientist.
Functions and Demo of Astrogrid 1.1 China-VO Haijun Tian.
Science Archive for Sky Surveys Data Providers and the VO - NeSC 2003 March Wide Field Astronomy Unit Institute for Astronomy.
Astronomical data curation and the Wide-Field Astronomy Unit Bob Mann Wide-Field Astronomy Unit Institute for Astronomy School of Physics University of.
NEON Obs School 11-Aug-2005 Archival Data and Virtual Observatories 1 Virtual Observatories...or how to do your research from a beach in the Bahamas rather.
Astronomical Spectroscopy and the Virtual Observatory ESAC, March 2007 VO tools and cross-calibration Pedro García-Lario European Space Astronomy.
WFCAM Science Archive Critical Design Review, April 2003 The SuperCOSMOS Science Archive (SSA) WFCAM Science Archive prototype Existing ad hoc flat file.
Discussion - Survey Design Survey product equation: #fields = fld/nt x useable x (%xnights/yr) x years = 4 x 0.5 x (0.75 x 13 x 18) x 3 = 4 x 0.5 x 175.
IPHAS EDR: Where are we? Catalogues up to January 2006 ingested Catalogue totals: Rows: 430M Gb: 400Gb Images are available.
Common Archive Observation Model (CAOM) What is it and why does JWST care?
Data Analysis Software Development Hisanori Furusawa ADC, NAOJ For HSC analysis software team 1.
VISTA Hemisphere Survey Dark Energy Survey VHS & DES Francisco Javier Castander.
VST ATLAS: Requirements, Operations and Products Tom Shanks, Nigel Metcalfe, Jamie McMillan (Durham Univ.) +CASU.
A Search For New Planets Matthew Livas Science, Discovery, and the Universe Computer Science Introduction My capstone was to observe.
MOS Data Reduction Michael Balogh University of Durham.
C2d Data flow diagram BCD from SSC Texas SAO Quality Analysis and Improved Calibrated Data Mapping team.
Difference Image Analysis at OAC Groningen, 1st Dec 2004 AW-OAC team.
Event and Feature Catalogs in the Virtual Solar Observatory Joseph A. Hourclé and the VSO Team SP54A-07 : 2008 May 30.
IPHAS Early Data Release E. A. Gonzalez-Solares IPHAS Consortium AstroGrid National Astronomy Meeting, 2007.
The XMM Cluster Survey: Project summary and Cosmology Forecasts Kathy Romer University of Sussex.
Lecture 3 With every passing hour our solar system comes forty-three thousand miles closer to globular cluster 13 in the constellation Hercules, and still.
EXPLORE/OC: Photometry Results for the Open Cluster NGC 2660 K. von Braun (Carnegie/DTM), B. L. Lee (Toronto), S. Seager (Carnegie/DTM), H. K. C. Yee (Toronto),
26th October 2005 HST Calibration Workshop 1 The New GSC-II and it’s Use for HST Brian McLean Archive Sciences Branch.
Retele de senzori Curs 2 - 1st edition UNIVERSITATEA „ TRANSILVANIA ” DIN BRAŞOV FACULTATEA DE INGINERIE ELECTRICĂ ŞI ŞTIINŢA CALCULATOARELOR.
VISTA Visible and Infrared Survey Telescope for Astronomy Spring Semester // Giant Telescope Sciences April 12 th. Youkyung Ko.
The GLORIA demonstrator experiment Ariel Majcher National Centre for Nuclear Research Warsaw, Poland XXXII IEEE-SPIE Joint Symposium Wilga 2013, May 29.
The CORNISH data reduction pipeline. Architecture - Overview AIPS Tasks Obit Tasks ObitTalk / Python Control Scripts MySQL Database Book-keeping Raw Data.
Pi of the Sky off-line experiment with GLORIA Ariel Majcher National Centre for Nuclear Research Warsaw, Poland 10th INTEGRAL/BART Workshop, April.
Introduction to DBMS Purpose of Database Systems View of Data
Critical Design Review, April 2003
Galaxies This lesson deals with important topics relating to galaxies. Each of these topics represents a great body of knowledge and areas of interest.
The Hubble Legacy Archive (HLA) Slitless Spectroscopy Project
The INES Archive in the era of Virtual Observatories
Institute of Cosmos Sciences - University of Barcelona
Galaxies This lesson deals with important topics relating to galaxies. Each of these topics represents a great body of knowledge and areas of interest.
Future Data Architecture Cloud Hosting at USGS
Jessica L. Rosenberg George Mason University
Introduction to Database Systems
WFCAM Photometric Calibration
Databases, Web Pages and Archives
Basics of Photometry.
COMPASS Database SPACE TELESCOPE SCIENCE INSTITUTE Gretchen Greene
Introduction to DBMS Purpose of Database Systems View of Data
DATABASES WHAT IS A DATABASE?
LSST, the Spatial Cross-Match Challenge
Revealing the Two ‘Horns’ of Taurus with Gaia DR2
Best Practices in Higher Education Student Data Warehousing Forum
Presentation transcript:

First Public Data Releases from the VISTA Science Archive Nicholas Cross, Rob Blake, Ross Collins, Mark Holliman, Mike Read, Eckhard Sutorius, Nigel Hambly, Bob Mann, Keith Noddle Wide Field Astronomy Unit, Institute for Astronomy, Edinburgh, UK 28/08/2012 SpS15 Data Intensive Astronomy 北京

SpS15 Data Intensive Astronomy 北京 VISTA 4.1m NIR telescope at Cerro Paranel. 16 2k x 2k Raytheon VIRGO detectors 0.34” pixels 0.6 ° FOV Standard frame is a tile, 1.5° made of 6 pointings ~300GB of imaging data per night (~70% of Paranal data) 28/08/2012 SpS15 Data Intensive Astronomy 北京

SpS15 Data Intensive Astronomy 北京 VISTA Public Surveys. 28/08/2012 SpS15 Data Intensive Astronomy 北京

SpS15 Data Intensive Astronomy 北京 VISTA Public Surveys 3 extra-galactic in wedding cake arrangement to optimise depth/area for different redshifts: VIKING: ZYJHKs. Ks=19.3, 1500° VIDEO: ZYJHKs. Ks=21.6, 12° UltraVISTA: YJHKs+NB. Ks=23.7, 0.8° Variability survey of MW disk and bulge: VVV: ZYJHKs. Ks=18.1, 520°, 100 epochs in Ks. (109 sources 1010-1011 detections) ~106 sources / ° Survey of Magellanic Clouds: VMC: YJKs. Ks=20.3, 184°, 12 Ks epochs for Cepheid/ RR Lyrae constraints. All sky survey VHS: JKs (+ some YH). Ks=18.1, 20000° 28/08/2012 SpS15 Data Intensive Astronomy 北京

SpS15 Data Intensive Astronomy 北京 VISTA Data Flow System Processing and archiving of all WFCAM and most VISTA data Observation block processing by CASU in Cambridge + calibration Later processing and archiving by WFAU in Edinburgh. VISTA Science Archive contains a relation database that stores different types of related data. Interfaces to help users more easily access, use and evaluate the data. VSA: http://surveys.roe.ac.uk/vsa 28/08/2012 SpS15 Data Intensive Astronomy 北京

SpS15 Data Intensive Astronomy 北京 Data from CASU transferred as FITS images and tables (catalogues extracted from science frames) Image headers and tables stored in database Links to images, but image pixel data stored as flat files. Higher order products (multi-band, variability tables) generated and stored in database Linking and curation tables generated and stored in database – database is self describing. 28/08/2012 SpS15 Data Intensive Astronomy 北京

SpS15 Data Intensive Astronomy 北京 Quality Control Science images undergo QC at Garching and CASU, which is stored in image metadata Additional frame quality control done by survey teams on science frames and automatically at WFAU - deprecation codes are stored in VSA. Catalogue flagging is done for bad pixels at CASU, other issues such as saturation, deblending, near edge, and a host of other issues recorded in post-processing error bit flag in VSA. 28/08/2012 SpS15 Data Intensive Astronomy 北京

SpS15 Data Intensive Astronomy 北京 Higher Order Products Deep images and catalogues from multiple epochs. Source table: merged detections in multiple filters from deepest observations in each pointing, seamed across overlaps. SynopticSource table: merged detections in multiple filters at each epoch. Variability table: photometric statistics from all epochs. X-matched neighbour tables to external catalogues, e.g. SDSS, 2MASS, WISE, Spitzer, XMM. Links between tile and pawprint detections. Provenance and quality control information. 28/08/2012 SpS15 Data Intensive Astronomy 北京

St Andrews Seminar Talk Relational data model Entity-relation model (ERM) shows how different types of data are related to each other. Each entity (box) is a different data type (stored as a table in the database). One-to-one, one-to many, constraint on one table… Design shows how tables that are used with multiple epoch data are related. 02/12/2008 St Andrews Seminar Talk

SpS15 Data Intensive Astronomy 北京 VSA – schema browser 28/08/2012 SpS15 Data Intensive Astronomy 北京

VSA catalogues VISTA PSPI Meeting 28/1/2010 VSA catalogues VISTA PSPI Meeting

SpS15 Data Intensive Astronomy 北京 Interfaces 28/08/2012 SpS15 Data Intensive Astronomy 北京

SpS15 Data Intensive Astronomy 北京 Scientific Queries SELECT data in SQL, e.g. colour information from vhsSource and SDSS DR7 – matched through neighbour tables. Can view query launch queries into TOPCAT, or download as FITS, VOTABLE or CSV. 28/08/2012 SpS15 Data Intensive Astronomy 北京

SpS15 Data Intensive Astronomy 北京 Light-curves Z=0.06 SNe1a, SN2010gy, Discovered in PanSTARRS, Chornock et al. (2010) 28/08/2012 SpS15 Data Intensive Astronomy 北京

1st public releases from VSA Releases to the survey teams happen every 6 months. Releases to the public should happen on similar timescales from now on. Any errors or potential problems that are found are mentioned in Known Issues. All releases are mentioned in the Release History. 28/08/2012 SpS15 Data Intensive Astronomy 北京

SpS15 Data Intensive Astronomy 北京 Large colour mosaics. Allow scientists to pick out extended structures such as nebulae and clusters, which are difficult to find against a dense star field using automated software. Zoomable, with coordinates. 1 billion stars in a single high resolution image. http://djer.roe.ac.uk/vsa/vvv/iipmooviewer-2.0-beta/vvvgps5.html 28/08/2012 SpS15 Data Intensive Astronomy 北京

SpS15 Data Intensive Astronomy 北京 Summary For scientists to get the most out of large surveys, a well designed and easy to use database is essential. The VSA allows scientists to build complex queries, accessing VISTA and other data Colour, variability, size, positional selections all very straightforward. Most efficient way to discover rare objects in large surveys such as VVV. 28/08/2012 SpS15 Data Intensive Astronomy 北京

SpS15 Data Intensive Astronomy 北京 Future work Improved interfaces: MyDB like interface – programming environment Improved visualisation of large results (shading in dense regions, points in low-density regions). Improved SIAP and TOPCAT integration. List-driven photometry (possibly PSF-corrected, but not initially). Initially internally within VIKING, eventually VHS + VST-ATLAS ++ Proper motion where applicable. Bright star flagging. Cross et al. 2012, A&A, submitted. 28/08/2012 SpS15 Data Intensive Astronomy 北京

Long term difficulties. VVV detection table – currently ~15 billion rows (tile and pawprint detections) Actually stored as a view ofmonthly tables to speed up ingestion and other curation. Most tasks on the VVV are I/O limited. Takes ~4-6 months to process 1 year. Some time can be saved, but some processing will get longer – not a simple matter of just adding in new data – variability statistics, copying out release DB 28/08/2012 SpS15 Data Intensive Astronomy 北京

SpS15 Data Intensive Astronomy 北京 Solutions Optimising SQL – query execution can be improved, but is a bit of a dark art. Not clear whether this will make substantial improvements any more. Column orientated DB – beginning to experiment with this, but could require a lot of code rewriting. Solid state devices for table storage. These are expensive, but are rapidly getting cheaper. May be the best solution in a few years time. 28/08/2012 SpS15 Data Intensive Astronomy 北京