Astronomical data curation and the Wide-Field Astronomy Unit Bob Mann Wide-Field Astronomy Unit Institute for Astronomy School of Physics University of.

Slides:



Advertisements
Similar presentations
The Australian Virtual Observatory e-Science Meeting School of Physics, March 2003 David Barnes.
Advertisements

National e-Science Centre Glasgow e-Science Hub Opening: Remarks NeSCs Role Prof. Malcolm Atkinson Director 17 th September 2003.
A centre of expertise in data curation and preservation DCC/NeSC eScience Workshop, June 2008 Working in partnership with the eScience community This work.
19-20 March 2003 IVOA Registry Workgroup LeSc Astrogrid Registry: Early Designs Elizabeth Auden Astrogrid Registry Workgroup Leader IVOA Registry Workgroup.
A Very Brief Introduction to iRODS
14 October 2003ADASS 2003 – Strasbourg1 Resource Registries for the Virtual Observatory R.Plante (NCSA), G. Greene (STScI), R. Hanisch (STScI), T. McGlynn.
A PPARC funded project Tony Linde Programme Manager eScience meets eFrameworks 28 th April 2006 NeSC, Edinburgh.
Solar and STP Physics with AstroGrid 1. Mullard Space Science Laboratory, University College London. 2. School of Physics and Astronomy, University of.
Long-Term Preservation of Astronomical Research Results Robert Hanisch US National Virtual Observatory Space Telescope Science Institute Baltimore, MD.
Leicester Database & Archive Service J. D. Law-Green, J. P. Osborne, R. S. Warwick X-Ray & Observational Astronomy Group, University of Leicester What.
Data preservation & the Virtual Observatory Bob Mann Wide-Field Astronomy Unit Royal Observatory Edinburgh
BinX and Astronomy Bob Mann Institute for Astronomy and National e-Science Centre.
Open Exeter Project Team
Data provenance in astronomy Bob Mann Wide-Field Astronomy Unit University of Edinburgh
Requirements from astronomy in the Virtual Observatory era Bob Mann Institute for Astronomy & NeSC University of Edinburgh.
Aus-VO: Progress in the Australian Virtual Observatory Tara Murphy Australia Telescope National Facility.
A long tradition. e-science, Data Centres, and the Virtual Observatory why is e-science important ? what is the structure of the VO ? what then must we.
E-Science: Stuart Anderson National e-Science Centre Stuart Anderson National e-Science Centre.
National Center for Supercomputing Applications Observational Astronomy NCSA projects radio astronomy: CARMA & SKA optical astronomy: DES & LSST access:
A PPARC funded project AstroGrid: new technology for the virtual observatory SC2004 Pittsburgh, PA November 2004 Guy Rixon AstroGrid Technical Architect.
Managing Research Data – The Organisational Challenge at Oxford James A J Wilson Friday 6 th December,
EdSkyQuery-G Overview Brian Hills, December
Functions and Demo of Astrogrid 1.1 China-VO Haijun Tian.
Science Archive for Sky Surveys Data Providers and the VO - NeSC 2003 March Wide Field Astronomy Unit Institute for Astronomy.
Virtual Observatory & LIGO Roy Williams California Institute of Technology.
NEON Obs School 11-Aug-2005 Archival Data and Virtual Observatories 1 Virtual Observatories...or how to do your research from a beach in the Bahamas rather.
F. Genova, Berlin 7, Paris, 2 December 2009 The astronomical information network.
A PPARC funded project AstroGrid Intro & Demo John Taylor Institute for Astronomy, Edinburgh.
Strasbourg astronomical Data Centre (DS) Françoise GENOVA.
Because good research needs good data Funded by: Digital Curation for Researchers, 28th February 2013 The Shifting Research Data Management Policy Landscape.
AstroGrid: The UK’s Virtual Observatory Dr Dugan Witherick – Astrophysics Group, UCL Wednesday 5 th December 2007 The University of Warwick.
1 10-June-2004Andy Lawrence : PPARC data curation panel meeting AstroGrid, Data Centres, & Edinburgh What is curation ? Data Centres in the VO era Data.
Digital preservation activities at the NLW Sally McInnes 18 September 2009.
Federated Discovery and Access in Astronomy Robert Hanisch (NIST), Ray Plante (NCSA)
EScience May 2007 From Photons to Petabytes: Astronomy in the Era of Large Scale Surveys and Virtual Observatories R. Chris Smith NOAO/CTIO, LSST.
The Astrophysical Virtual Observatory to EURO-VO transition Paolo Padovani ESO Virtual Observatory Systems Department EURO-VO Facility Centre Scientist.
Astronomical Data Archiving and Curation Clive Page AstroGrid Project University of Leicester 2004 March 22.
The Virtual Observatory Europe and the VO: the Astrophysical Virtual Observatory and the EURO-VO Astrophysical Virtual Observatory and the EURO-VO Paolo.
Microsoft Research Faculty Summit Natasa Milic-Frayling & Vijay Rajagopalan Microsoft Corporation.
Recent Developments in CLARIN-NL Jan Odijk P11 LREC, Istanbul, May 23,
Solar and space physics datasets within a Virtual Observatory: the AstroGrid experience Silvia Dalla * and Nicholas A Walton  * School of Physics & Astronomy,
EURO-VO Structure Data Centre Alliance (DCA) A collaborative and operational network of European data centres who, by the uptake of new VO technologies.
Who are we ? what is a VO ? what is a Grid ? how do we get there ? Andy Lawrence S.P.I.E. Hawaii Aug 2002 AstroGrid
Sky Survey Database Design National e-Science Centre Edinburgh 8 April 2003.
Technical Update 2008 Sandy Payette, Executive Director Eddie Shin, Senior Developer April 3, 2008 Open Repositories 2008, Fedora User Group.
The International Virtual Observatory Alliance (IVOA) interoperability in action.
Data Archives: Migration and Maintenance Douglas J. Mink Telescope Data Center Smithsonian Astrophysical Observatory NSF
Ray Norris, CSIRO Australia Telescope National Facility The Astronomers’ Data Manifesto.
German Astrophysical Virtual Observatory Overview and Results So Far W. Voges, G. Lemson, H.-M. Adorf.
Edinburgh e-Science MSc Bob Mann Institute for Astronomy & NeSC University of Edinburgh.
A PPARC funded project Common Execution Architecture Paul Harrison IVOA Interoperability Meeting Cambridge MA May 2004.
AstroGrid NAM 2001 Andy Lawrence Cambridge NAM 2001 Andy Lawrence Cambridge Belfast Cambridge Edinburgh Jodrell Leicester MSSL.
F. Genova, VO as a Data Grid, 2003/06/301 Interoperability of astronomy data bases Françoise Genova, CDS.
The Large Synoptic Survey Telescope Project Bob Mann Wide-Field Astronomy Unit University of Edinburgh.
Introduction to the VO ESAVO ESA/ESAC – Madrid, Spain.
AHM04: Sep 2004 Nottingham CCLRC e-Science Centre eMinerals: Environment from the Molecular Level Managing simulation data Lisa Blanshard e- Science Data.
Data and storage services on the NGS.
Publishing Combined Image & Spectral Data Packages Introduction to MEx M. Sierra, J.-C. Malapert, B. Rino VO ESO - Garching Virtual Observatory Info-Workshop.
Because good research needs good data Funded by: C4D Workshop, 26 th July 2013 The Shifting Research Data Management Policy Landscape Joy Davidson and.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI strategy and Grand Vision Ludek Matyska EGI Council Chair EGI InSPIRE.
F. Genova, AstroNET meeting, Poitiers The Astrophysical Virtual Observatory.
Annotation of “special structures” in astronomy Bob Mann Institute for Astronomy and National e-Science Centre University of Edinburgh.
Research Data Management in the Humanities: an Introduction to the Basics Open Exeter Project Team.
Introduction: AstroGrid increases scientific research possibilities by enabling access to distributed astronomical data and information resources. AstroGrid.
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
Open Exeter Project Team
aspects of archive system design
Moving towards the Virtual Observatory Paolo Padovani, ST-ECF/ESO
The Astronomers’ Data Manifesto
Google Sky.
Presentation transcript:

Astronomical data curation and the Wide-Field Astronomy Unit Bob Mann Wide-Field Astronomy Unit Institute for Astronomy School of Physics University of Edinburgh

2/15 Outline  Who we are Introduction to the Wide-Field Astronomy Unit  What we do Sky survey data curation: past, present and future Data curation and the Virtual Observatory  What we could do with you What WFAU could do for the DCC What the DCC could do for WFAU Questions

3/15 Outline  Who we are  Introduction to the Wide-Field Astronomy Unit  What we do Sky survey data curation: past, present and future Data curation and the Virtual Observatory  What we could do with you What WFAU could do for the DCC What the DCC could do for WFAU Questions

4/15 Wide-Field Astronomy Unit  Funded to curate optical and near-infrared sky survey data for UK (and European) community  Based at Royal Observatory Edinburgh  ~35 years of sky survey data curation at ROE  Evolving data holdings:  Photographic plates  Digital scans of photographic plates  Born-digital data  WFAU formed in 1999: group moved into UoE  Currently 12 grant-funded + 2 academic staff  Mix of astronomers, IT professionals & hybrids

5/15 Outline  Who we are Introduction to the Wide-Field Astronomy Unit  What we do  Sky survey data curation: past, present and future  Data curation and the Virtual Observatory  What we could do with you What WFAU could do for the DCC What the DCC could do for WFAU Questions

6/15 Sky survey data life-cycle: e.g. WFCAM  Images taken at telescope  UKIRT, in Hawaii  Data reduction pipeline run in Cambridge  Removes instrumental signatures  Produces final, clean images  Detects and characterises sources in images  Data transferred to Edinburgh  Ingest source catalogues and image metadata into relational database, store image files on disk  Combine data from multiple nights: new images, cats.  Publish release databases via web interface On per night basis

7/15 WFAU’s main survey archives  Past: SuperCOSMOS  Based on digital scans of photographic plates  Database: ~5TB: largest tables ~10 9 rows  Images: ~35,000 user requests (10GB) per month  Present ( ): WFCAM  Near-infrared: ~700 registered users  ~500 million rows of database results per month  ~125GB of flat file image data per month  Near-future ( ): VISTA  ~3 x data rates/volume of WFCAM

8/15 WFAU’s future plans  Large Synoptic Survey Telescope  US-led public/private project  We’re trying to get UK to buy into it  Data challenges immense  WFCAM takes ~20TB of image data per year  LSST will take ~20TB of image data per night: ~60PB images, ~8PB database ( )  LSST stimulating a lot of data management R&D in the US:  Commercial: Google  Academic: “Sci-DB” (M. Stonebraker, D. DeWitt)

9/15 The Virtual Observatory  Goal: an interoperable federation of all the world’s astronomical data resources  International Virtual Observatory Alliance  Coordinates VO development worldwide  Acts as W3C-like standards body for the VO  AstroGrid:  Only project to have developed a full VO system

10/15 Virtual Observatory components  Registry  Metadata for all data published to the VO  Standard data access protocols  For tabular data, images, spectra, time series, etc  Standard web service wrappers for application code  Enabling asynchronous calls, workflow, etc  Distributed data storage system  Presenting transparent aggregated logical view to user

11/15 Curation challenges for WFAU  More data analysis services in the data centre  Data volumes too large for user download  WFAU must provide data analysis services & hardware  Integration of data and knowledge  Third-party annotations which can be used in queries  “Object X in database Y is a quasar”  “X-ray source A is the same object as radio source B”  Better linkage between archives and online literature  Keeping staff up to date on technologies/techniques  Mostly learn by doing – do we make best choices?

12/15 Outline  Who we are Introduction to the Wide-Field Astronomy Unit  What we do Sky survey data curation: past, present and future Data curation and the Virtual Observatory  What we could do with you  What WFAU could do for the DCC  What the DCC could do for WFAU  Questions

13/15 WFAU and DCC: What we can do for you  Case studies, exemplars, etc  WFAU is a well-established, competent group  Astronomy is a relatively small, cohesive community, used to interdisciplinary collaboration  Astronomers are early adopters of IT and recognise value of data curation  VO is a rich, functional e-Science infrastructure  Collaborations to date:  Raj Bose – distributed annotation service  James Cheney – paper on data centre security

14/15 WFAU and DCC: What you can do for us  Policy advice  Increasingly need to convince research councils of benefits of long term data curation – cost/benefit  Technical advice – from DCC or its Associates  Should we use iRODS for LSST?  Do any XML databases have decent performance?  Do the VO metadata standards make sense?  Curation manual  When will the rest appear?  Training  e.g. NeSC course on relational database design

15/15 WFAU and DCC: Questions  What is the DCC’s model for collaboration?  Can’t collaborate with everyone on everything  Scientists & digital librarians live in different worlds: how do you bridge that divide?  Interdisciplinary work requires sustained interaction  What do you want from scientific data curators?  What can you offer us in return?  Few of my colleagues know anything about the DCC  Does that surprise you?