Final Data Archiving of the Sloan Digital Sky Survey-an Example

Slides:



Advertisements
Similar presentations
Trying to Use Databases for Science Jim Gray Microsoft Research
Advertisements

THE TREASURES OF PULKOVO OBSERVATORY
Scientific Collaborations in a Data-Centric World Alex Szalay The Johns Hopkins University.
HATHITRUST A Shared Digital Repository HathiTrust current work, challenges, and opportunities for public libraries Creating a Blueprint for a National.
CSG Survey to understand Teaching & Learning space domain Guenthar, Lakhavani, Leonhardt, Stringer, Werner CSG Discussion May 16, 2014.
Presentation at WebEx Meeting June 15,  Context  Challenge  Anticipated Outcomes  Framework  Timeline & Guidance  Comment and Questions.
RAVE status report Matthias Steinmetz (AIP) June RAVE collaboration meeting - Padova Data collection  325k spectra for 280k stars (Jan.
HATHITRUST A Shared Digital Repository HathiTrust: A Second Life for Library Collections Jeremy York Exploring Humanities Cyberinfrastructure April 30,
Daniel Eisenstein – Univ. of Arizona Dark Energy and Cosmic Sound Bob Nichol on behalf of the SDSS Collaboration Copy of presentation to be given by Daniel.
Our Mission The OPTICON Integrated Infrastructure Initiative brings together many of Europe's astronomical observatories, data centres and laboratories.
1 LSST: Dark Energy Tony Tyson Director, LSST Project University of California, Davis Tony Tyson Director, LSST Project University of California, Davis.
Sloan Digital Sky Survey Astronomy April 2006 Margaret Flynn.
Astro-DISC: Astronomy and cosmology applications of distributed super computing.
University Biomedical Informatics Research Training Programs Supported by NLM Biomedical Informatics Training (BIT) Program University of California, Irvine.
HATHITRUST A Shared Digital Repository HathiTrust Past, Present, and Future A Brief Introduction.
The evolution of ARC/3.5m Current ARC Partner Institutions University of Washington New Mexico State University Princeton University (until June 2014)
HATHITRUST A Shared Digital Repository HathiTrust: Putting Research in Context HTRC UnCamp September 10, 2012 John Wilkin, Executive Director, HathiTrust.
A Multicolor CCD Survey for Quasars z > 3 Nikhil Revankar, Dr. Julia Kennefick, Shelly Bursick University of Arkansas, Arkansas Center for Space and Planetary.
Data Management Plans Bill Michener University Libraries and Biology Dept. University of New Mexico.
Preserving the Scientific Record: Case Study 1 – National Snow & Ice Data Center (NSIDC) Glacier Photos Matthew Mayernik National Center for Atmospheric.
Association of Universities for Research in Astronomy Presentation to Subaru Users Committee.
Astronomy: from Networks to the Grid Leopoldo Benacchio INAF (National Institute for Astrophysics) Padova Astronomical Observatory Italy.
Public Access to Large Astronomical Datasets Alex Szalay, Johns Hopkins Jim Gray, Microsoft Research.
HATHITRUST A Shared Digital Repository HathiTrust and TRAC DigitalPreservation 2012 July 25, 2012 Jeremy York, Project Librarian, HathiTrust.
Large Scientific Databases. Large scientific datasets are those which are systematically collected and organized and which stretch the technical capabilites.
LSST: Preparing for the Data Avalanche through Partitioning, Parallelization, and Provenance Kirk Borne (Perot Systems Corporation / NASA GSFC and George.
HATHITRUST A Shared Digital Repository HathiTrust and the Future of Research Libraries American Antiquarian Society March 31, 2012 Jeremy York, Project.
The Virtual Observatory Europe and the VO: the Astrophysical Virtual Observatory and the EURO-VO Astrophysical Virtual Observatory and the EURO-VO Paolo.
EURO-VO Structure Data Centre Alliance (DCA) A collaborative and operational network of European data centres who, by the uptake of new VO technologies.
The Large Synoptic Survey Telescope: The power of wide-field imaging Michael Strauss, Princeton University.
Sloan Digital Sky Survey Status Brian Yanny, reporting for the Experimental Astrophysics Group Fermilab Presentation to the Physics Advisory Committee.
HATHITRUST A Shared Digital Repository Institution Uses of HathiTrust Jeremy York University of Maine May 24, 2013.
PLANS FOR ACTIVITIES IN MEXICO Silvia Torres-Peimbert Instituto de Astronomía Universidad Nacional Autónoma de México.
1 LSST Town Hall 227 th meeting of the AAS 1/7/2016 Pat Eliason, LSSTC Executive Office Pat Osmer, LSSTC Senior Advisor.
HATHITRUST A Shared Digital Repository HathiTrust Large Digital Libraries: Beyond Google Books Modern Language Association January 5, 2012 Jeremy York,
Grant Writing for Digital Projects September 2012 IODE Project Office IODE Project Office Oostende, Belgium Oostende, Belgium Sustainability and.
The National Digital Stewardship Alliance: Stewardship, Collaboration, Inclusiveness, Exchange.
HathiTrust: A valuable and visionary Partnership.
RI EGI-InSPIRE RI Astronomy and Astrophysics Dr. Giuliano Taffoni Dr. Claudio Vuerli.
PAA on Scientific Data and Information Roberta Balstad Chair, PAA Panel.
LSST SLAC Office of Communications August 8, 2016 Communications Roadmap.
Redesigning the DOE Data Explorer to embed dataset relationships at the point of search and to reflect landing page organization Sara Studwell Department.
From LSE-30: Observatory System Spec.
Fresno State Digital Repository
Community Science Updates
Personal Archives Accessible in Digital Media
NRAO VLA Archive Survey
Preserving the Scientific Record: Case Study 1 – NSIDC Glacier Photos
aspects of archive system design
Jarek Nabrzyski Director, Center for Research Computing
NSDL: A New Tool for Teaching and Learning.
Optical Survey Astronomy DATA at NCSA
Concluding Remarks Paolo Padovani Head, Virtual Observatory Project Office, ESO, Garching bei München, Germany & EURO-VO Facility Centre.
Moving towards the Virtual Observatory Paolo Padovani, ST-ECF/ESO
Education of a scientist video
Jay Bhatt Drexel University Libraries
CNI Spring 2010 Membership Meeting
Latest from the Sloan Digital Sky Surveys
For more information, visit
Research on Data Curation and Repositories
Planning Observations
Gwyn P. Williams and Kim Kindrew Pizza Seminar, September 18, 2013
Research Computing Survey Results
ESciDoc Introduction M. Dreyer.
Preservation Update.
How Does SDSS Keep Going and Going? Past, present and future plans.
MSU Research Update February 27, 2019
Maria Teresa Capria December 15, 2009 Paris – VOPlaneto 2009
Chancellor Glen D. Johnson
INAF Long Term Preservation
Presentation transcript:

Final Data Archiving of the Sloan Digital Sky Survey-an Example Denver Sci Com Don York Elisabeth introduces presentation and emphasizes desire for discussion and ideas to inform scientists as to what might be possible with what is regarded as critical library involvement 9/13/16 Denver Sci Com

The science world is changing with the advent of petabyte (and more) datasets that are available. In the case of astronomy, the telescopes that produce the data sets are of an ever-changing sky. Facilities go out of date, but the data do not (see summary, Hipparchus 140 B. C. and Halley 1718.) A major issue is archiving the data so it remains publicly available and safe after observations cease. The same issues do or will confront scientists and agencies in geology, climate studies, high energy physics, … 9/13/16 Denver Sci Com

16 years of digital data released publicly from SDSS. The Sloan Digital Sky Survey is an exemplary case for considering the solution to the problem (and many more such projects are coming). In retrospect, SDSS was the start of “BIG DATA” 16 years of digital data released publicly from SDSS. 13 Data Releases science-ready and freely accessible (including advanced products) Papers from first eight years of data (2000-2007) still heavily cited (600 papers per year). Comparable to HST at <1/10 the cost. Cumulative total: over 7200 papers, 330,000 citations. Project was planned as a focused survey, but the data are now used by astronomers in most branches of astronomy, as well as by STEM groups. Survey observations will continue for 5-10 years. Archive is expected to have a very long life (>50 years) 9/13/16 Denver Sci Com

How are we proceeding? Scope (over 10 years) Strong belief that libraries are the ultimate best respository Centuries of curation experience Exist at the center of academia Already experienced with research collections Moving to modern digital collections Working models of distributed systems within expansive networks Scope (over 10 years) Collect paper archives, drawings, “history” into federated catalogue (project involves dozens of institutions and hundreds of astronomers) Collect and archive all versions so research results can be verified later. Scientists (in close coordination with libraries) upgrade reduction software to accompany archival data (and allow modifications) 9/13/16 Denver Sci Com

Three major stakeholders: Who pays for the archive? Three major stakeholders: The international community of scientists who use the data. The governments that fund the research. The libraries that distribute and preserve the data for their campus constituents and enable new approaches to education. Discussions with long lead times and with all at the table will be necessary to divide up the costs in the way most natural to these stakeholders. Serving of data will be needed by many Federally funded projects, should be done nationally (NOT by specialized hardware, library by library). Inexpensive compared to costs of the projects that generate the data, if done this way. Prototypes underway. 9/13/16 Denver Sci Com

FIN 9/13/16 Denver Sci Com

Slides for questions http://skyserver.sdss.org/dr7/en/ http://www.sdss.org/dr12/ DR7 homepageFamous PlacesNGC450SDSSnavigate, play with image, go to blue pea, spectrum. DR12 homepageSDSS Sky ServerSearchFamous Places 9/13/16 Denver Sci Com

SDSS IV Participating Institutions (34) Brazilian Participation Group, Carnegie Institution for Science, Carnegie Mellon University, Chilean Participation Group, French Participation Group, Harvard-Smithsonian Center for Astrophysics, Instituto de Astrofísica de Canarias, The Johns Hopkins University, Kavli Institute for the Physics and Mathematics of the Universe (IPMU) / University of Tokyo, Lawrence Berkeley National Laboratory, Leibniz Institut für Astrophysik Potsdam (AIP), Max-Planck-Institut für Astronomie (MPIA Heidelberg), Max-Planck-Institut für Astrophysik (MPA Garching), Max-Planck-Institut für Extraterrestrische Physik (MPE), 9/13/16 Denver Sci Com

National Astronomical Observatory of China, New Mexico State University, New York University, University of Notre Dame, Observatório Nacional / MCTI, The Ohio State University, Pennsylvania State University, Shanghai Astronomical Observatory, United Kingdom Participation Group, Universidad Nacional Autónoma de México, University of Arizona, University of Colorado Boulder, University of Oxford, University of Portsmouth, University of Utah, University of Virginia, University of Washington, University of Wisconsin, Vanderbilt University, Yale University. 9/13/16 Denver Sci Com

What would be the motivations for libraries to be interested? Questions What would be the motivations for libraries to be interested? -Preserve usabililty of Data for their own institutional researchers. -Gain experience with a widely used system and prepare for use of additional and bigger astronomy projects (LSST, DESI, DES,….). -Maintain in a more complex media the traditional spirit of openness and careful curation that is the hallmark of the print function of libraries. 9/13/16 Denver Sci Com