Data Archives: Migration and Maintenance Douglas J. Mink Telescope Data Center Smithsonian Astrophysical Observatory NSF 2004-03-23.

Slides:



Advertisements
Similar presentations
The VAO is operated by the VAO, LLC. Alternative Protocols for Discovery & Access Mike Fitzpatrick NOAO.
Advertisements

3 September 2004NVO Coordination Meeting1 Grid-Technologies NVO and the Grid Reagan W. Moore George Kremenek Leesa Brieger Ewa Deelman Roy Williams John.
September 13, 2004NVO Summer School1 VO Protocols Overview Tom McGlynn NASA/GSFC T HE US N ATIONAL V IRTUAL O BSERVATORY.
September 13, 2004NVO Summer School1 VO Protocols Overview Tom McGlynn NASA/GSFC T HE US N ATIONAL V IRTUAL O BSERVATORY.
Discovery and Exploration in the VO Chris Miller NOAO/CTIO La Serena, Chile T HE US N ATIONAL V IRTUAL O BSERVATORY.
8 September 2008NVO Summer School 2008 – Santa Fe1 Publishing Data and Services to the VO Ray Plante Gretchen Greene T HE US N ATIONAL V IRTUAL O BSERVATORY.
27 June 2005 National Virtual Observatory 1 The National Virtual Observatory: Publishing Astronomy Data Robert J. Hanisch US National Virtual Observatory.
What does LOFAR have to do with the Virtual Observatory (VO)? LOFAR Science Day 16 December 2003 Melbourne David Barnes The University of Melbourne.
The Australian Virtual Observatory e-Science Meeting School of Physics, March 2003 David Barnes.
Using Sakai to Support eScience Sakai Conference June 12-14, 2007 Sayeed Choudhury Tim DiLauro, Jim Martino, Elliot Metsger, Mark Patton and David Reynolds.
Desperately Trying to Cope with the Data Explosion in Astronomical Sciences Ray Norris CSIRO Australia Telescope National Facility.
14 October 2003ADASS 2003 – Strasbourg1 Resource Registries for the Virtual Observatory R.Plante (NCSA), G. Greene (STScI), R. Hanisch (STScI), T. McGlynn.
Solar and STP Physics with AstroGrid 1. Mullard Space Science Laboratory, University College London. 2. School of Physics and Astronomy, University of.
Constructing the Memories Creating a Digital Collection Linda J. White, Digital Project Coordinator.
Long-Term Preservation of Astronomical Research Results Robert Hanisch US National Virtual Observatory Space Telescope Science Institute Baltimore, MD.
Leicester Database & Archive Service J. D. Law-Green, J. P. Osborne, R. S. Warwick X-Ray & Observational Astronomy Group, University of Leicester What.
1 CS 502: Computing Methods for Digital Libraries Lecture 27 Preservation.
Data preservation & the Virtual Observatory Bob Mann Wide-Field Astronomy Unit Royal Observatory Edinburgh
Aus-VO: Progress in the Australian Virtual Observatory Tara Murphy Australia Telescope National Facility.
Long-Term Preservation of Astronomical Research Results Robert Hanisch US National Virtual Observatory Space Telescope Science Institute Baltimore, MD.
October 16-18, Research Data Set Archives Steven Worley Scientific Computing Division Data Support Section.
Digitized Sky Survey Update Brian McLean : Archive Sciences Branch / Operations and Engineering Division.
GAUDI Ground-based Asteroseismology Uniform Database Interface E. Solano Bases de données en spectroscopie stellaire. Paris.
Why Build Image Mosaics for Wide Area Surveys? An All-Sky 2MASS Mosaic Constructed on the TeraGrid A. C. Laity, G. B. Berriman, J. C. Good (IPAC, Caltech);
S. Derriere et al., ESSW03 Budapest, 2003 May 20 UCDs - metadata for astronomy Sébastien Derriere François Ochsenbein Thomas Boch CDS, Observatoire astronomique.
Virtual Observatory --Architecture and Specifications Chenzhou Cui Chinese Virtual Observatory (China-VO) National Astronomical Observatory of China.
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
Astrogrid Resource Registry Querying the Registry 1.Mullard Space Science Laboratory, University College London, Holmbury St. Mary, Dorking, Surrey RH5.
E. Solano Centro de Astrobiología (INTA-CSIC) I.P. Observatorio Virtual Español El Observatorio Virtual: una infraestructura básica para la investigación.
VO & Astro-Wise & others A.Belikov OmegaCEN
MASSACHUSETTS INSTITUTE OF TECHNOLOGY NASA GODDARD SPACE FLIGHT CENTER ORBITAL SCIENCES CORPORATION NASA AMES RESEARCH CENTER SPACE TELESCOPE SCIENCE INSTITUTE.
Dec 2, 2014 MAST Data Discovery Portal Tom Donaldson Tony Rogers.
Functions and Demo of Astrogrid 1.1 China-VO Haijun Tian.
Virtual Observatory & LIGO Roy Williams California Institute of Technology.
25 Jan The Virtual Observatory: Core Capabilities and Support for Statistical Analyses in Astronomy T HE US N ATIONAL V IRTUAL O BSERVATORY Robert.
Astronomical data curation and the Wide-Field Astronomy Unit Bob Mann Wide-Field Astronomy Unit Institute for Astronomy School of Physics University of.
Science with the Virtual Observatory Brian R. Kent NRAO.
Chapter 9 Section 2 : Storage Networking Technologies and Virtualization.
NEON Obs School 11-Aug-2005 Archival Data and Virtual Observatories 1 Virtual Observatories...or how to do your research from a beach in the Bahamas rather.
F. Genova, Berlin 7, Paris, 2 December 2009 The astronomical information network.
Making the Sky Searchable: Automatically Organizing the World’s Astronomical Data Sam Roweis, Dustin Lang &
26 October 2005HST Calibration Workshop1 The National Virtual Observatory and HST T HE US N ATIONAL V IRTUAL O BSERVATORY Robert Hanisch US National Virtual.
LSST: Preparing for the Data Avalanche through Partitioning, Parallelization, and Provenance Kirk Borne (Perot Systems Corporation / NASA GSFC and George.
AstroGrid: The UK’s Virtual Observatory Dr Dugan Witherick – Astrophysics Group, UCL Wednesday 5 th December 2007 The University of Warwick.
1 10-June-2004Andy Lawrence : PPARC data curation panel meeting AstroGrid, Data Centres, & Edinburgh What is curation ? Data Centres in the VO era Data.
Federation and Fusion of astronomical information Daniel Egret & Françoise Genova, CDS, Strasbourg Standards and tools for the Virtual Observatories.
Federated Discovery and Access in Astronomy Robert Hanisch (NIST), Ray Plante (NCSA)
Astronomical Data Archiving and Curation Clive Page AstroGrid Project University of Leicester 2004 March 22.
VAMP will enable access to, and vastly multiply the use of, astronomy image resources by standardizing and linking resource archives worldwide.
The International Virtual Observatory Alliance (IVOA) interoperability in action.
The ATNF Pulsar Data Archive Matthew Whiting (ATNF) Albert Teoh, David Smith, Lucyna Kedziora-Chudczer, Dick Manchester, Vince McIntyre 2nd Gravitational.
The Virtual ObservatorySVO School, October 2009 E. Solano. The Virtual Observatory Enrique Solano Spanish VO Principal Investigator LAEX-CAB (INTA-CSIC)
The US Long Term Ecological Research (LTER) Network: Site and Network Level Information Management Kristin Vanderbilt Department of Biology University.
German Astrophysical Virtual Observatory Overview and Results So Far W. Voges, G. Lemson, H.-M. Adorf.
21-jun-2009 IVOA Standards Pedro Osuna ESA-VO Project Science Archives and Computer Support Engineering Unit (SRE-OE) Science Operations Department (SRE-O)
1 Preserving and Archiving Astronomical Photographic Plates M. W. Castelaz, J. D. Cline 206 th Meeting of the American Astronomical Society Session
F. Genova, VO as a Data Grid, 2003/06/301 Interoperability of astronomy data bases Françoise Genova, CDS.
12 Oct 2003VO Tutorial, ADASS Strasbourg, Data Access Layer (DAL) Tutorial Doug Tody, National Radio Astronomy Observatory T HE US N ATIONAL V IRTUAL.
Moving the pretty pictures into the 21th century Lars Lindberg Christensen (ESA/ESO)
Introduction to the VO ESAVO ESA/ESAC – Madrid, Spain.
7 Dec 2009R. J. Hanisch: Astronomy Data Standards CERN 1 Data Standards in Astronomy Dr. Robert J. Hanisch Director, US Virtual Astronomical Observatory.
5-7 May 2003 SCD Exec_Retr 1 Research Data, May Archive Content New Archive Developments Archive Access and Provision.
February 12, 2002Tom McGlynn ADEC Interoperability Technical Working Group Report.
Introduction: AstroGrid increases scientific research possibilities by enabling access to distributed astronomical data and information resources. AstroGrid.
The INES Archive in the era of Virtual Observatories
Data Centres in the Virtual Observatory Age
Planning Observations
Long-Term Preservation of Astronomical Research Results
Observing with Modern Observatories (the data flow)
Google Sky.
Presentation transcript:

Data Archives: Migration and Maintenance Douglas J. Mink Telescope Data Center Smithsonian Astrophysical Observatory NSF

Archiving Issues NSF What do we save? Reduced Data? Raw Data? Calibration data? Data Products? Publications? How do we access the data? Google Search? (ADS) Through discipline Portal(s)? (VO registries) Through Data Center? Where does the data live? One or few Data Center(s)? Many Data Centers reachable through few Portals Number of Centers is limited by long-term funding

Migration Issues NSF Why do we migrate archives? Better access, Cheaper storage, more compact storage What do we migrate from an archive? Everything? Reduced Data only? Data Products? What do we do with the old media? Paper and glass are more stable than digital media! Magnetic tapes may be more stable than optical disks! Who pays? Is migration maintenance? Is a new, more useful archive being created?

Maintenance Issues NSF What are the costs? Space, Staffing Equipment maintenance and repair Backup protection? What is the safest way to back up a multi-Terabyte archive? Cloning to other sites improves access as well as providing backup How do we maintain old media? Old media may be more stable over time than new media! Can we maintain older, less compact data?

Some US Astronomical Archives NSF Online (All NASA Funded) Hubble Space Telescope 17.6 Terabytes Two-Micron All-Sky Survey 5 Terabytes Sloane Digital Sky Survey 1 Terabyte online (50 more offline) Palomar-QUEST 6 Terabytes (1/month since 9/2003) Off-Line NOAO Save-the-Bits 44.4 Terabytes (7.6 Terabytes/year) HPSSP (Harvard Plate Stack Scanning Project) 200 Terabytes Future LSST (Large Scale Synoptic Array) 7 Terabytes per night!

Growing Astronomical Catalogs ● 1989HST Guide Star Catalog 25,541,952 sources ● 1996USNO-A1.0 Catalog 488,006,860 sources ● 1998USNO-A2.0 Catalog 526,280,881 sources ● 2001GSC II Catalog (2.2.01) 998,402,801 sources ● 2002USNO-B1.0 Catalog 1,036,366,767 sources NSF

Virtual Observatory Portals NSF US: ADS (links from publications) Goddard (Skyview vizualization) JHU,NCSA,Caltech (VO Registry modelling) IPAC (IRSA, etc.) SAO WCSTools (desktop catalog access) England: The Grid France: CDS (Aladin/Vizier/Simbad)

International Virtual Observatory Alliance (IVOA) NSF Registries: Searchable databases containing descriptions of data available in the Virtual Observatory Data Model: Standards for data format and content VOTable: XML transfer format for metadata UCD: Uniform Column Descriptors (so everyone doesn't make up their own names for the same things) Data Access Layer (DAL): User interface Protocols: Open interfaces to large archives ease multi-level links (NSF funds US participation in IVOA)

IVOA Registries NSF Full Searchable Registry Full Searchable Registry Replicate Local Publisher (harvestable registry) Local Publisher (harvestable registry) Local Searchable Registry Client Data ReplicateReplicate DAL

● 500,000 glass plates covering the entire sky from ● Basis for fundamental discoveries in astronomy, such as using Cepheid variable stars as cosmic yardsticks ● A legacy of long-term commitment to astronomical photography and research ● Astronomy will not have an equivalent time frame from digital observations until Migrating Harvard's Astronomical Plate Collection from Paper and Glass to Bits CfA/PSSG,

International Astronomical Union Resolution B3, 2000 Safeguarding the Information in Photographic Observations The International Astronomical Union, Recognising that unless urgent action is taken, this unique historical record of astronomical phenomena will be lost to future generations of astronomers, Recommends the transfer of the historic observations onto modern media by digital techniques, which will provide worldwide access to the data so as to benefit astronomical research in a way that is well matched to the tools of the researcher in the future. CfA/PSSG,

Step 0: List what is in the archive (on the web) NSF

Typical large glass plate NSF

First: Digitize Metadata From hand-written cards and logbooks NSF

Digital access to plate metadata (interactive web page) NSF

Results of metadata search NSF

Next: Digital access to image data Move the plates out of the 20th century NSF

Proposed access to digital images User Stack Catalog search FITS or Tiff Image Archive (100 Terabyte) FITS Header Archive (WCS information) FITS extractor Object or coordinates and time Plate names and object (x,y) FITS images of plate portions NSF