Desperately Trying to Cope with the Data Explosion in Astronomical Sciences Ray Norris CSIRO Australia Telescope National Facility.

Presentation transcript:

Overview
Background: astronomical data
Good news
Bad news
Data Manifesto

Astronomical Data
Q: How did the first galaxies in the Universe form?

Need many wavelengths:

Source “c” at 3 cm wavelength

The mysterious “source c”

WFPC2 image 2 arcsec

The hard questions:
“Give me the WFPC image to normalise my spectral line cube”
– Obviously best to do computation locally
“Give me every source in NED with J-K>4”
– Obviously best to do computation at host
“Give me the radio spectral indices (using ATCA data) of all the objects in SLOAN which have J-K>4 in available ESO/STScI databases”
– Some computations local, some on hosts
– VO needs to make sensible decisions
– VO needs grid computing standards
(Diagram labels: local megabyte dataset; terabyte database in Baltimore; NASA/IPAC Extragalactic Database in Pasadena; terabyte database in Sydney; terabyte database in New Mexico; multi-terabyte databases in Europe & US)
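The "local or at host" decision above can be illustrated with a toy heuristic: run the computation wherever the fewest bytes have to be moved. This is only a sketch under stated assumptions. The function name `choose_compute_site`, the site labels, and the byte counts are all hypothetical, and a real VO planner would also have to weigh result sizes, CPU cost, and network bandwidth.

```python
def choose_compute_site(inputs):
    """Pick the site that minimises the number of input bytes transferred.

    inputs: list of (site, bytes_needed) pairs, one per input the
    computation has to read. Hypothetical helper, not part of any VO
    standard. Ties are broken alphabetically by site name.
    """
    sites = sorted({site for site, _ in inputs})

    def bytes_moved(target):
        # Every input that does not already live at the target must be shipped there.
        return sum(size for site, size in inputs if site != target)

    best = min(sites, key=bytes_moved)
    return best, bytes_moved(best)


# One small WFPC image (assumed 10 MB) against a large local cube (assumed 500 MB):
print(choose_compute_site([("Baltimore", 10_000_000), ("local", 500_000_000)]))
# → ('local', 10000000)

# A tiny query (assumed 1 kB) against a terabyte catalogue held at the host:
print(choose_compute_site([("Pasadena", 10**12), ("local", 1_000)]))
# → ('Pasadena', 1000)
```

The two calls reproduce the first two cases on the slide: fetch the image and compute locally, but send the query to the catalogue host.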

Good News
The Virtual Observatory
Astronomical Data Centres
Public-domain data

The Virtual Observatory (VO)
The FITS standard (~1980) paved the way for interoperability
The International Virtual Observatory Alliance (IVOA) involves all major astronomical observatories worldwide
– IVOA established 2002
The VO is a collection of interoperating data archives and software tools, linked to form a research environment in which astronomical research programs can be conducted.
It includes terabyte distributed databases, data dictionaries, standards, protocols, tools, algorithms, web services, etc.

Examples of VO operations
Give me a list of all the objects which satisfy:
– Criterion A in the CDS database (in Strasbourg, France)
– Criterion B in the Parkes HIPASS survey (in Australia)
– Criterion C in the Hubble archive (in Baltimore, USA)
P.S.
– Each of these databases has a different format, coordinate system, and ontology, and each is several terabytes in size.
– Metadata is of variable quality.
– The object names will be different in each database.
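Because the object names differ between databases, a query like the one above usually has to match objects by sky position rather than by name. The following is a minimal sketch of such a positional cross-match, assuming both catalogues have already been converted to the same coordinate system (RA and Dec in degrees). The catalogue entries, the names, and the 2 arcsec match radius are invented for illustration, and the brute-force loop would not scale to terabyte catalogues.

```python
import math


def angular_separation_deg(ra1, dec1, ra2, dec2):
    """Great-circle separation in degrees (haversine formula); inputs in degrees."""
    ra1, dec1, ra2, dec2 = map(math.radians, (ra1, dec1, ra2, dec2))
    a = (math.sin((dec2 - dec1) / 2) ** 2
         + math.cos(dec1) * math.cos(dec2) * math.sin((ra2 - ra1) / 2) ** 2)
    return math.degrees(2 * math.asin(min(1.0, math.sqrt(a))))


def cross_match(cat_a, cat_b, radius_arcsec=2.0):
    """For each (name, ra, dec) in cat_a, find the nearest cat_b entry within the radius.

    Returns a list of (name_a, name_b, separation_arcsec) tuples.
    """
    matches = []
    for name_a, ra_a, dec_a in cat_a:
        best = None
        for name_b, ra_b, dec_b in cat_b:
            sep = angular_separation_deg(ra_a, dec_a, ra_b, dec_b) * 3600.0
            if sep <= radius_arcsec and (best is None or sep < best[1]):
                best = (name_b, sep)
        if best is not None:
            matches.append((name_a, best[0], best[1]))
    return matches


# Hypothetical entries: the same object under two different names, plus unrelated sources.
catalogue_a = [("A-0001", 10.0, -30.0), ("A-0002", 50.0, 20.0)]
catalogue_b = [("B J0040-3000", 10.0002, -30.0001), ("B J0800+0500", 120.0, 5.0)]
print(cross_match(catalogue_a, catalogue_b))
```

Only the first pair lies within the match radius, so the result links "A-0001" to "B J0040-3000" and leaves the other sources unmatched.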

VO Status
VO is not a project-managed project: it is a collaboration of different groups, with different drivers, but united by a common goal.
Several groups worldwide are now defining standards, tools, protocols, etc.
Some prototype tools and web services are already available (e.g. )
More will become available over the next 1-2 years
See

Astronomical Data Centres
Centre de Données astronomiques de Strasbourg, France (CDS)
– attempts to hold electronic copies of all published astronomical data, surveys, etc.
NASA Astronomical Data Centre (ADC), Baltimore, USA
NASA/IPAC Extragalactic Database (NED)
– interprets and combines extragalactic data
Astrophysics Data System (ADS)
– all published astronomical literature
Others

Good News: Public-domain data
Security, confidentiality, and IP protection are not major issues in astronomy. Most data are in the public domain, hence VO is interesting to Microsoft etc.

Bad News
Intellectual Property controls
Journal data
Bad planning of new instruments
Digital Divide
Legacy data
Lack of awareness
"Why should I share my data with my competitors?"

Intellectual Property Protection
Patents
– protect inventions
Copyright
– protects written work and creative work
Proposed database protection
– protects information (about anything)
– No “fair use” provisions
– You cannot cite someone else’s data without obtaining their permission
– Each paper will need a paper trail showing rights to cite data

(Organisation chart)
ICSU (International Council for Science): IAU, IUGG, etc., and CODATA (Committee on Data for Science and Technology)
United Nations: WIPO (World Intellectual Property Organisation), National Representatives

Journal Data
Most data published in journals never make it to the data centres.
When they do appear in data centres, they rarely carry the metadata or ontology that enable machine understanding.
Journals need to impose standards (e.g. VOTable) on authors.
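To make "metadata that enable machine understanding" concrete, here is a minimal sketch of turning a published table into VOTable-style XML, where each column carries a datatype, a unit, and a UCD (semantic tag) alongside its name. It uses only Python's standard library. The function name `table_to_votable`, the column names, the UCD strings, and the data values are illustrative assumptions, and the output is a simplified skeleton (no XML namespace) that has not been validated against the official VOTable schema.

```python
import xml.etree.ElementTree as ET


def table_to_votable(fields, rows):
    """Serialise column metadata plus rows as a simplified VOTable-style document.

    fields: list of dicts of FIELD attributes (name, datatype, unit, ucd, ...).
    rows:   list of row tuples, one value per field.
    Hypothetical helper for illustration only.
    """
    votable = ET.Element("VOTABLE", version="1.3")
    resource = ET.SubElement(votable, "RESOURCE")
    table = ET.SubElement(resource, "TABLE")
    for attrs in fields:
        # The metadata lives on the FIELD element, so software can interpret the column.
        ET.SubElement(table, "FIELD", attrib=attrs)
    tabledata = ET.SubElement(ET.SubElement(table, "DATA"), "TABLEDATA")
    for row in rows:
        tr = ET.SubElement(tabledata, "TR")
        for value in row:
            ET.SubElement(tr, "TD").text = str(value)
    return ET.tostring(votable, encoding="unicode")


# Illustrative columns and values (not taken from any real paper).
fields = [
    {"name": "RA", "datatype": "double", "unit": "deg", "ucd": "pos.eq.ra"},
    {"name": "Dec", "datatype": "double", "unit": "deg", "ucd": "pos.eq.dec"},
    {"name": "S_3cm", "datatype": "double", "unit": "mJy", "ucd": "phot.flux.density;em.radio"},
]
rows = [(53.1, -27.8, 0.42)]
print(table_to_votable(fields, rows))
```

A data centre receiving such a file could read the units and UCDs directly instead of having to interpret a column heading printed in a journal.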

Bad News: Bad planning of new instruments
Many new instruments are planned without sufficient planning or funding for data management (decreasing scientific productivity)

Bad News: Digital Divide
We take for granted instant access to literature and databases. Our colleagues in developing countries still dream of it (thus disadvantaging them even further)

Bad News: Legacy data
Digitising old data competes for funding with new instruments

Bad News: Lack of awareness
BORING!

Bad News: "Why should I share my data with my competitors?"

The Data Manifesto
AstronomersManifesto
We, the global community of astronomy, aspire to the following guidelines for managing astronomical data, believing that this would maximise the rate and cost-effectiveness of scientific discovery…

1. All major tables, images, and spectra published in journals should appear in the astronomical data centres. Journals should, in collaboration with data centres, define formats, table descriptions, and metadata that are easy for authors to adhere to, and can automatically be translated into a format (e.g. VOTable, FITS, etc) that can be entered by the data centre into their database.

2. All data obtained with publicly funded observatories should, after appropriate proprietary periods, be placed in the public domain.
Consistent with ICSU and OECD recommendations, to which Australia is a signatory

3. In any new major astronomical construction project, the data processing, storage, migration, and management requirements should be built in at an early stage of the project plan, and costed along with other parts of the project.
Isn’t this obvious?
– Apparently not!

4. Astronomers in all countries should have the same access to astronomical data and information.

5. Legacy astronomical data can be valuable, and high-priority legacy data should be preserved and stored in digital form in the data centres.
How do you prioritise?

6. The IAU should work with other international organisations to achieve our common goals and learn from our colleagues in other fields.
Use bodies such as CODATA to cross-fertilise

But the major challenge to coping with the data explosion remains…

Why can’t someone else do it?