The National Virtual Observatory: Publishing Astronomy Data. Robert J. Hanisch, US National Virtual Observatory. 27 June 2005.

Presentation transcript:

Slide 1: The National Virtual Observatory: Publishing Astronomy Data
Robert J. Hanisch, US National Virtual Observatory, Space Telescope Science Institute, Baltimore, MD, USA
Reagan Moore, San Diego Supercomputer Center
27 June 2005

Slide 2: Topics
Virtual Observatory (VO) description
Discovery services
Data management services
Interactions with the GGF
  – Astrophysics Research Group

Slide 3: The Virtual Observatory
The Virtual Observatory will provide a virtual sky based on the enormous data sets being created now and the even larger ones proposed for the future. It will enable a new mode of research for professional astronomers and will provide to the public an unparalleled opportunity for education and discovery.
Source: Astronomy and Astrophysics in the New Millennium

Slide 4: Astronomy is Facing a Data Avalanche
Multi-terabyte (soon: multi-petabyte) sky surveys and archives over a broad range of wavelengths
Billions of detected sources, hundreds of measured attributes per source
Figure labels: 1 microSky (DPOSS); 1 nanoSky (HDF-S)

Slide 5: Composition of Results from Multiple Collections
…reveals a more complete physical picture
The resulting complexity of data translates into increased demands for data analysis, visualization, and understanding

Slide 6: (no text on this slide)

Slide 7: Large Synoptic Survey Telescope (LSST)
LSST will take pictures of the entire observable sky every 3 days
  – Compare images to detect changes
      – Asteroids: sizes down to 250 meters
      – Micro-lensing events: structure of dark matter
      – Supernovae
  – Expect to generate 100 PB of data
  – Expect to sustain over 50 TeraFlops of computation
Distributed architecture
  – Processing at telescope (14,000 feet, perhaps Chile)
  – Processing at base station (perhaps Chile)
  – Processing in the US

Slide 8: An Overview of the Large Synoptic Survey Telescope
Jim Brase, LLNL
8.4 meter aperture telescope surveying the full sky every 3-4 nights to visual magnitude
Primary missions are to study dark energy and dark matter, the transient universe, the outer solar system, and near-Earth objects (NEO)
> 13 TB / night
> 100 PB over its 10 year mission
Event detections on the Web in < 1 minute
Pioneering a new way of doing science: mining petabyte image databases
First light January 2012
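As a rough consistency check on the two volume figures quoted for LSST, the sketch below multiplies the nightly rate by the mission length. It assumes observing on every night of the year and decimal units (1 PB = 1000 TB); neither assumption comes from the slide. Under those assumptions the nightly rate alone accounts for a little under half of the 100 PB figure, and the slide does not say what makes up the rest.

```python
# Back-of-envelope check of the data volumes quoted on the slide.
# Assumptions (not from the slide): observing every night, 1 PB = 1000 TB.
TB_PER_NIGHT = 13        # quoted as "> 13 TB / night"
MISSION_YEARS = 10       # quoted as "10 year mission"
NIGHTS_PER_YEAR = 365    # assumed upper bound on observing nights


def mission_volume_pb(tb_per_night, years, nights_per_year=NIGHTS_PER_YEAR):
    """Return the total volume in petabytes for a constant nightly rate."""
    total_tb = tb_per_night * nights_per_year * years
    return total_tb / 1000.0


raw_pb = mission_volume_pb(TB_PER_NIGHT, MISSION_YEARS)
print(f"{raw_pb:.2f} PB from the nightly rate alone")
```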

Slide 9: Publication of Results
What does it mean to publish large scientific collections?
Requirements include:
  – Authenticity and integrity: the characterization of the source of the material and an assurance that the data is uncorrupted
  – Discovery mechanisms to identify sets of appropriate data
  – Access mechanisms to support expected usage patterns and analyses
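One common way to give consumers an assurance that data is uncorrupted is to publish a cryptographic digest alongside each file. The sketch below illustrates the idea with SHA-256 from the Python standard library; the choice of hash and the helper names `sha256_hex` and `verify` are assumptions made for illustration, not something the slides specify.

```python
import hashlib


def sha256_hex(data: bytes) -> str:
    """Return the SHA-256 digest of the data as a hexadecimal string."""
    return hashlib.sha256(data).hexdigest()


def verify(data: bytes, expected_hex: str) -> bool:
    """Return True if the data still matches the digest recorded at publication."""
    return sha256_hex(data) == expected_hex


# The publisher records a digest when the data is deposited (sample bytes only).
original = b"example image bytes"
published_digest = sha256_hex(original)

# A consumer recomputes the digest after download and compares.
corrupted = original[:-1] + b"X"
print(verify(original, published_digest), verify(corrupted, published_digest))
```

A digest covers integrity only; authenticity (who produced the data) needs separate provenance metadata or a signature.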

Slide 10: Research Problems that Drive Publication Requirements
Statistical astronomy done right
  – Precision cosmology, Galactic structure, stellar astrophysics …
  – Discovery of significant patterns and multivariate correlations
  – Access to observations from multiple collections
Systematic exploration of the observable parameter spaces
  – Searches for rare or unknown types of objects and phenomena
  – Low surface brightness universe, the time domain
  – Confronting massive numerical simulations with massive data sets
  – Access to large portions of a collection

Slide 11: Comparison of Images within Large Collections
Megaflares on normal main sequence stars (DPOSS)

Slide 12: Scientific Data Publication
Standard vocabulary
  – Unified Content Descriptors (UCDs) for all physical variables registered in astronomy catalogs
Standard data format
  – FITS encoding format for astronomy images
Standard services for accessing collections
  – Simple image access service
  – Cone search for catalog access
  – Sky query node for distributed search across catalogs
Enable large-scale applications
  – Support access to tens of terabytes of data and millions of catalog entries
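To make the role of a standard vocabulary concrete, the sketch below reads a tiny VOTable-style document and keys each value by the content descriptor attached to its column, so that a client can find "right ascension" without knowing the column name. The sample table, its values, and the descriptor strings are illustrative assumptions; real VOTable documents also declare an XML namespace, which this simplified sample omits.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document; values and descriptors are for illustration only.
SAMPLE_VOTABLE = """<VOTABLE version="1.1">
  <RESOURCE>
    <TABLE name="demo">
      <FIELD name="RA" datatype="double" unit="deg" ucd="pos.eq.ra;meta.main"/>
      <FIELD name="Dec" datatype="double" unit="deg" ucd="pos.eq.dec;meta.main"/>
      <FIELD name="Vmag" datatype="float" unit="mag" ucd="phot.mag;em.opt.V"/>
      <DATA><TABLEDATA>
        <TR><TD>10.68</TD><TD>41.27</TD><TD>3.4</TD></TR>
        <TR><TD>83.82</TD><TD>-5.39</TD><TD>4.0</TD></TR>
      </TABLEDATA></DATA>
    </TABLE>
  </RESOURCE>
</VOTABLE>"""


def read_table(xml_text):
    """Return the table rows as dicts keyed by each column's ucd attribute."""
    root = ET.fromstring(xml_text)
    table = root.find("./RESOURCE/TABLE")
    ucds = [field.get("ucd") for field in table.findall("FIELD")]
    rows = []
    for tr in table.findall("./DATA/TABLEDATA/TR"):
        values = [float(td.text) for td in tr.findall("TD")]
        rows.append(dict(zip(ucds, values)))
    return rows


rows = read_table(SAMPLE_VOTABLE)
print(rows[0]["pos.eq.ra;meta.main"])
```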

Slide 13: Data Publishing Roles (who is using the system?)

Roles        Authors         Publishers        Curators          Consumers
Traditional  Scientists      Journals          Libraries         Scientists (read -> analyze)
Emerging     Collaborations  Project www site  Massive archives  Scientists & public (query -> analyze)

Slide 14: Interactions with Publishers
Provide validation of tabular digital data submitted to astronomy journals
  – Validate semantics: Unified Content Descriptors for each table column
  – Validate coordinates for each named object
  – Check consistency of coordinates across objects
  – Aggregate data into a common catalog for future queries (CDS)
  – Provide an archive of tabular data
      – Current size is about 5 billion records
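The coordinate checks listed above can be sketched as follows. This is a minimal illustration, assuming equatorial coordinates in decimal degrees and a hypothetical row layout of `(name, ra_deg, dec_deg)`; the tolerance value and the message wording are also assumptions, and the slides do not describe the actual validation procedure.

```python
def valid_position(ra_deg, dec_deg):
    """Range check: RA in [0, 360) and Dec in [-90, 90], both in degrees."""
    return 0.0 <= ra_deg < 360.0 and -90.0 <= dec_deg <= 90.0


def check_table(rows, tol_deg=1e-3):
    """Return a list of problems found in rows of (name, ra_deg, dec_deg).

    A name that appears more than once must have coordinates that agree
    within tol_deg. The comparison is per axis and ignores the RA wrap at
    0/360 and the cos(Dec) factor, which a real check would handle.
    """
    problems = []
    first_seen = {}
    for name, ra, dec in rows:
        if not valid_position(ra, dec):
            problems.append(f"{name}: position out of range")
            continue
        if name in first_seen:
            ra0, dec0 = first_seen[name]
            if abs(ra - ra0) > tol_deg or abs(dec - dec0) > tol_deg:
                problems.append(f"{name}: inconsistent coordinates")
        else:
            first_seen[name] = (ra, dec)
    return problems


# Made-up sample rows.
sample = [("A", 10.0, 20.0), ("B", 400.0, 0.0), ("A", 10.5, 20.0), ("C", 5.0, -95.0)]
print(check_table(sample))
```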

Slide 15: Interactions with Publishers
Validate image data submitted to astronomy journals
  – Validate encoding format: FITS
  – Check semantic terms in the FITS header
      – Naming conventions for coordinates, resolution, wavelength
  – Check consistency of header variables
  – Support archiving of the original image
      – Build consistent collection of all images published
      – Cross correlate to other images of the same object
      – Current aggregate survey size is about 50 Terabytes (50,000 Gbytes)
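A first step in validating the FITS encoding is to check the mandatory keywords of the primary header. The sketch below parses 80-character header cards by hand and checks that SIMPLE, BITPIX, and NAXIS come first, that BITPIX has an allowed value, and that each NAXISn is present. It is a simplified illustration: the helper names are hypothetical, the comment stripping would mishandle string values containing "/", and the semantic checks on coordinate or wavelength keywords mentioned on the slide are not covered.

```python
VALID_BITPIX = {8, 16, 32, 64, -32, -64}


def parse_cards(header: bytes):
    """Split a FITS header into (keyword, value_text) pairs, stopping at END."""
    cards = []
    for i in range(0, len(header), 80):
        card_text = header[i:i + 80].decode("ascii")
        keyword = card_text[:8].strip()
        if keyword == "END":
            break
        if card_text[8:10] == "= ":
            # Drop any trailing comment (simplified: assumes no "/" in the value).
            value = card_text[10:].split("/", 1)[0].strip()
            cards.append((keyword, value))
    return cards


def check_primary_header(header: bytes):
    """Return a list of problems with the mandatory primary-header keywords."""
    cards = parse_cards(header)
    names = [keyword for keyword, _ in cards]
    if names[:3] != ["SIMPLE", "BITPIX", "NAXIS"]:
        return ["first three keywords must be SIMPLE, BITPIX, NAXIS"]
    values = dict(cards)
    problems = []
    if values["SIMPLE"] != "T":
        problems.append("SIMPLE is not T")
    if int(values["BITPIX"]) not in VALID_BITPIX:
        problems.append("invalid BITPIX")
    for n in range(1, int(values["NAXIS"]) + 1):
        if f"NAXIS{n}" not in values:
            problems.append(f"missing NAXIS{n}")
    return problems


def card(keyword, value):
    """Build one 80-character card with the value right-justified to column 30."""
    return f"{keyword:<8}= {value:>20}".ljust(80).encode("ascii")


END = b"END".ljust(80)
good = (card("SIMPLE", "T") + card("BITPIX", "16") + card("NAXIS", "2")
        + card("NAXIS1", "100") + card("NAXIS2", "100") + END)
print(check_primary_header(good))
```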

Slide 16: Virtual Observatory Publication Services
A suite of international standards for the discovery, exchange, intercomparison, and analysis of network-accessible astronomical data
A data access and analysis environment that exploits the emerging computation/software/data Grid
A framework for data processing that enables and encourages the re-use of algorithms
A tool for astronomy research
A catalyst for world-wide access to astronomical archives
A vehicle for education and public outreach

Slide 17: Types of Grid Services
VOTable - standard table structure for data from catalogs
Conesearch - retrieve entries from an object catalog that are spatially located within a circle mapped on the sky
Simple Image Access Protocol - retrieve an image from an image archive, cropped to the desired size
Simple Spectrum Access Protocol - retrieve a spectrum from a catalog
Skyquery - distribute queries across multiple object catalogs, join results
Mosaic service - create composite of multiple images
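The cone search operation described above amounts to keeping the catalog entries whose angular distance from a given sky position is no more than a search radius. The sketch below shows that selection over an in-memory list using the haversine formula. The catalog rows are made up, the function names are hypothetical, and the `ra`, `dec`, `sr` arguments (all in decimal degrees) mirror the RA, DEC, and SR query parameters of the IVOA Simple Cone Search protocol; a real service would run this against a database rather than a Python list.

```python
import math


def angular_separation_deg(ra1, dec1, ra2, dec2):
    """Great-circle separation in degrees (haversine formula); inputs in degrees."""
    ra1, dec1, ra2, dec2 = map(math.radians, (ra1, dec1, ra2, dec2))
    sin_ddec = math.sin((dec2 - dec1) / 2.0)
    sin_dra = math.sin((ra2 - ra1) / 2.0)
    a = sin_ddec ** 2 + math.cos(dec1) * math.cos(dec2) * sin_dra ** 2
    return math.degrees(2.0 * math.asin(min(1.0, math.sqrt(a))))


def cone_search(catalog, ra, dec, sr):
    """Return the rows within sr degrees of the position (ra, dec)."""
    return [row for row in catalog
            if angular_separation_deg(ra, dec, row["ra"], row["dec"]) <= sr]


# Made-up catalog entries for illustration.
catalog = [
    {"id": "a", "ra": 10.0, "dec": 20.0},
    {"id": "b", "ra": 10.1, "dec": 20.0},
    {"id": "c", "ra": 12.0, "dec": 20.0},
    {"id": "d", "ra": 359.95, "dec": 0.0},
]
print([row["id"] for row in cone_search(catalog, 10.0, 20.0, 0.5)])
```

Because the separation is computed on the sphere, a cone centred at RA 0 also picks up sources just below RA 360, which a naive difference of coordinates would miss.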

Slide 18: Data Management Services
VOStore - interface for simple get and put of files from an image archive
VOSpace - data management interface for assembling uniform name spaces across multiple image archives
Unified Content Descriptors - standard naming conventions for all physical quantities in catalogs
VO Ontology - relationships between the UCDs, also a time-space coordinate ontology for astronomy

Slide 19: International VO Alliance
The IVOA brings together the astronomers, developers, and managers of the VO initiatives world-wide
  – Agreements on standards for data access (VOTable, catalog queries, image retrieval, resource descriptions, etc.)
  – Coordination of development activities
  – Sharing of software and experience
  – International policies on data sharing and publication
13 participating organizations: Astrogrid, AVO, US-NVO, VO-Australia, VO-Canada, VO-China, VO-France, VO-Germany (GAVO), VO-India, VO-Italy (DRACO), VO-Japan, VO-Korea, VO-Russia

Slide 20: Data Management Approaches in Scientific Disciplines
Data Grids
  – Focus on shared collections that may be distributed across multiple sites
Digital Libraries
  – Provide discovery and display services for scientific collections
Persistent Archives
  – Assert authenticity and integrity of collection while underlying systems evolve

Slide 21: NVO Digital Library Interactions
Dublin Core metadata standard
  – Describe provenance of all objects
Open Archives Initiative - Protocol for Metadata Harvesting
  – Used to populate service registry
Carnivore v 1.0 service registry
  – Register all of NVO services
DSpace - digital library
  – Port on top of data grids for distributed data management
Fedora - digital library
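To illustrate how metadata harvesting can populate a registry, the sketch below builds an OAI-PMH `ListRecords` request URL for Dublin Core metadata and extracts identifiers and titles from a response. The base URL, the sample response, and the helper names are hypothetical, and no network request is made; the namespace URIs and the `verb` and `metadataPrefix` parameters are those defined by OAI-PMH 2.0. How the NVO registry actually processed harvested records is not described in the slides.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "oai_dc": "http://www.openarchives.org/OAI/2.0/oai_dc/",
    "dc": "http://purl.org/dc/elements/1.1/",
}


def list_records_url(base_url, metadata_prefix="oai_dc"):
    """Build a ListRecords request URL for the given repository base URL."""
    query = urlencode({"verb": "ListRecords", "metadataPrefix": metadata_prefix})
    return base_url + "?" + query


# Hypothetical response body, trimmed to the elements used below.
SAMPLE_RESPONSE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:example.org:svc1</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Example cone search service</dc:title>
          <dc:creator>Example Archive</dc:creator>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>"""


def harvest(xml_text):
    """Return one {'identifier', 'title'} entry per record in the response."""
    root = ET.fromstring(xml_text)
    entries = []
    for record in root.findall("./oai:ListRecords/oai:record", NS):
        identifier = record.findtext("./oai:header/oai:identifier", namespaces=NS)
        title = record.findtext("./oai:metadata/oai_dc:dc/dc:title", namespaces=NS)
        entries.append({"identifier": identifier, "title": title})
    return entries


print(list_records_url("http://registry.example.org/oai"))
print(harvest(SAMPLE_RESPONSE))
```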

Slide 22: Characteristics
Standard vocabularies, data formats, services
Collection management
  – Descriptive, administrative metadata
  – Access controls on creation of data, metadata, annotations
  – Audit trails, versions, locking, pinning, containers
Distributed data
  – Data created at multiple sites
  – Data used at multiple sites
  – Replicas at multiple sites
Persistence
  – All systems must manage technology evolution
Federation
  – Sharing of data between independent collections

Slide 23: Questions
Reagan W. Moore