The VAO is operated by the VAO, LLC. Data Discovery, Access, and Management with the Virtual Observatory Robert Hanisch Space Telescope Science Institute.

Slides:



Advertisements
Similar presentations
September 13, 2004NVO Summer School1 VO Protocols Overview Tom McGlynn NASA/GSFC T HE US N ATIONAL V IRTUAL O BSERVATORY.
Advertisements

Extremely Large Telescopes and Surveys Mark Dickinson, NOAO.
Using Sakai to Support eScience Sakai Conference June 12-14, 2007 Sayeed Choudhury Tim DiLauro, Jim Martino, Elliot Metsger, Mark Patton and David Reynolds.
Desperately Trying to Cope with the Data Explosion in Astronomical Sciences Ray Norris CSIRO Australia Telescope National Facility.
14 October 2003ADASS 2003 – Strasbourg1 Resource Registries for the Virtual Observatory R.Plante (NCSA), G. Greene (STScI), R. Hanisch (STScI), T. McGlynn.
Canadian Virtual Observatory Project David Schade Canadian Astronomy Data Centre Herzberg Institute for Astrophysics National Research Council Canada.
Long-Term Preservation of Astronomical Research Results Robert Hanisch US National Virtual Observatory Space Telescope Science Institute Baltimore, MD.
Depositing and Disseminating Digital Resources Alan Morrison Collections Manager AHDS Subject Centre for Literature, Linguistics and Languages.
Providing Access for US Astronomers to the Next Generation of Large Ground Based OIR Telescopes 1.Scientific Potential 2.Current Design Efforts 3.Complementarity.
Long-Term Preservation of Astronomical Research Results Robert Hanisch US National Virtual Observatory Space Telescope Science Institute Baltimore, MD.
A Robust Health Data Infrastructure P. Jon White, MD Director, Health IT Agency for Healthcare Research and Quality
Scientific Data Infrastructure in CAS Dr. Jianhui Scientific Data Center Computer Network Information Center Chinese Academy of Sciences.
Key integrating concepts Groups Formal Community Groups Ad-hoc special purpose/ interest groups Fine-grained access control and membership Linked All content.
CXC Implementing 2007 NRC Portals of the Universe Report Chandra X-ray Center Recommended Best Practices Roger Brissenden and Belinda Wilkes 25 April 2012.
The NASA/NExScI/IPAC Star and Exoplanet Database 14 May 2009 David R. Ciardi on behalf of the NStED Team.
Virtual Observatory --Architecture and Specifications Chenzhou Cui Chinese Virtual Observatory (China-VO) National Astronomical Observatory of China.
National Center for Supercomputing Applications Observational Astronomy NCSA projects radio astronomy: CARMA & SKA optical astronomy: DES & LSST access:
MASSACHUSETTS INSTITUTE OF TECHNOLOGY NASA GODDARD SPACE FLIGHT CENTER ORBITAL SCIENCES CORPORATION NASA AMES RESEARCH CENTER SPACE TELESCOPE SCIENCE INSTITUTE.
Dec 2, 2014 MAST Data Discovery Portal Tom Donaldson Tony Rogers.
ARGONNE  CHICAGO Ian Foster Discussion Points l Maintaining the right balance between research and development l Maintaining focus vs. accepting broader.
GENIUS kick-off - November 2013 GENIUS kick-off meeting WP400 – Tools for data exploitation X. Luri.
Science with the Virtual Observatory Brian R. Kent NRAO.
Sharing Research Data Globally Alan Blatecky National Science Foundation Board on Research Data and Information.
Summary of distributed tools of potential use for JRA3 Dugan Witherick HPC Programmer for the Miracle Consortium University College.
F. Genova, Berlin 7, Paris, 2 December 2009 The astronomical information network.
26 October 2005HST Calibration Workshop1 The National Virtual Observatory and HST T HE US N ATIONAL V IRTUAL O BSERVATORY Robert Hanisch US National Virtual.
Doug Tody E2E Perspective EVLA Advisory Committee Meeting December 14-15, 2004 EVLA Software E2E Perspective.
Instrumentation of the SAM-Grid Gabriele Garzoglio CSC 426 Research Proposal.
N. RadziwillEVLA NSF Mid-Project Report May 11-12, 2006 NRAO End to End (e2e) Operations Division Nicole M. Radziwill.
John Peoples for the DES Collaboration BIRP Review August 12, 2004 Tucson1 DES Management  Survey Organization  Survey Deliverables  Proposed funding.
Federation and Fusion of astronomical information Daniel Egret & Françoise Genova, CDS, Strasbourg Standards and tools for the Virtual Observatories.
Federated Discovery and Access in Astronomy Robert Hanisch (NIST), Ray Plante (NCSA)
EScience May 2007 From Photons to Petabytes: Astronomy in the Era of Large Scale Surveys and Virtual Observatories R. Chris Smith NOAO/CTIO, LSST.
Geosciences - Observations (Bob Wilhelmson) The geosciences in NSF’s world consists of atmospheric science, ocean science, and earth science Many of the.
1 Computing Challenges for the Square Kilometre Array Mathai Joseph & Harrick Vin Tata Research Development & Design Centre Pune, India CHEP Mumbai 16.
The VAO is operated by the VAO, LLC. LSST VAO Meeting Robert Hanisch Space Telescope Science Institute Director, Virtual Astronomical Observatory.
Common Archive Observation Model (CAOM) What is it and why does JWST care?
Nov Common Archive Observation Model What is CAOM and why should MAST use it? Brian McLean.
1 ASTRONET Coordinating strategic planning for European Astronomy.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
March 1st, 2006Prospective PNG PNG: Databases - Virtual Observatory.
The International Virtual Observatory Alliance (IVOA) interoperability in action.
Breakout # 1 – Data Collecting and Making It Available Data definition “ Any information that [environmental] researchers need to accomplish their tasks”
Data Archives: Migration and Maintenance Douglas J. Mink Telescope Data Center Smithsonian Astrophysical Observatory NSF
Ray Norris, CSIRO Australia Telescope National Facility The Astronomers’ Data Manifesto.
AstroGrid NAM 2001 Andy Lawrence Cambridge NAM 2001 Andy Lawrence Cambridge Belfast Cambridge Edinburgh Jodrell Leicester MSSL.
Applications and Requirements for Scientific Workflow May NSF Geoffrey Fox Indiana University.
12 Oct 2003VO Tutorial, ADASS Strasbourg, Data Access Layer (DAL) Tutorial Doug Tody, National Radio Astronomy Observatory T HE US N ATIONAL V IRTUAL.
Arizona Astronomical Data Hub AAS 227: Dark/Orphaned Data P. Bryan Heidorn ORCID: University of January 2016.
DOE Data Management Plan Requirements
N. RadziwillEVLA Advisory Committee Meeting May 8-9, 2006 NRAO End to End (e2e) Operations Division Nicole M. Radziwill.
What’s Happening at Internet2 Renee Woodten Frost Associate Director Middleware and Security 8 March 2005.
Introduction to the VO ESAVO ESA/ESAC – Madrid, Spain.
Nov Virtual Astronomical Observatory (VAO) Gretchen Greene.
Applications and Requirements for Scientific Workflow May NSF Geoffrey Fox Indiana University.
Infrastructure Breakout What capacities should we build now to manage data and migrate it over the future generations of technologies, standards, formats,
7 Dec 2009R. J. Hanisch: Astronomy Data Standards CERN 1 Data Standards in Astronomy Dr. Robert J. Hanisch Director, US Virtual Astronomical Observatory.
All Hands Meeting 2005 BIRN-CC: Building, Maintaining and Maturing a National Information Infrastructure to Enable and Advance Biomedical Research.
February 12, 2002Tom McGlynn ADEC Interoperability Technical Working Group Report.
A Shared Commitment to Digital Preservation and Access.
RI EGI-InSPIRE RI Astronomy and Astrophysics Dr. Giuliano Taffoni Dr. Claudio Vuerli.
PAA on Scientific Data and Information Roberta Balstad Chair, PAA Panel.
Robert Hanisch, Radio Astronomy / LSST, Charlottesville8 May 2013 The VAO is operated by the VAO, LLC, a not-for-profit 501(c)(3) organization. 8 May 2013.
Strategies for NIS Development
INTAROS WP5 Data integration and management
Exploitation of ISS Scientific data - sustainability
Long-Term Preservation of Astronomical Research Results
Grid Application Model and Design and Implementation of Grid Services
The Astronomers’ Data Manifesto
Google Sky.
Presentation transcript:

The VAO is operated by the VAO, LLC. Data Discovery, Access, and Management with the Virtual Observatory Robert Hanisch Space Telescope Science Institute Director, Virtual Astronomical Observatory

5 May 2011Robert Hanisch 222 Data in astronomy 2 1-d, 2-d, 3-d: intensity/polarization vs. energy, time, position, velocity tables: catalogs, x-ray event lists, radio visibility measurements

5 May 2011Robert Hanisch 333 Quantity and distribution ~50 major data centers and observatories with substantial on- line data holdings ~10,000 data “resources” (catalogs, surveys, archives) data centers host from a few to ~100 TB each, currently ~1+ PB total current growth rate ~0.5 PB/yr, expected to increase soon current request rate ~1 PB/yr for Hubble Space Telescope, data retrievals are 3X data ingest; papers based on archival data constitute 2/3 of refereed publications

5 May 2011Robert Hanisch 444 Astro2010 Data archives  “Central to astronomy today”  HST, 2MASS, and SDSS archival research is major contributor to scientific productivity c/o R. White (STScI) and pp. 5-11, 5-12 of NWNHAA

5 May 2011Robert Hanisch 55 Astro2010 Virtual Observatory  “The National Virtual Observatory [with international VO collaboration]…has produced widely accepted standards for data formatting, curation, and the infrastructure of a common user interface.” [Note: VO not explicitly reviewed in Astro 2010, as it was an approved program in the 2000 Decadal Survey and already being implemented as Astro 2010 was in progress.] Data preservation and curation  “It is…necessary for NSF to adopt NASA’s model of long-lived data archive centers…for long-term curation of data.” Software  “New packages capable of handling large datasets are urgently needed. These are likely to be created and employed within a common-use environment.”

5 May 2011Robert Hanisch 66 Astro2010 Facility planning and data management  “Recommendation: Proposals for new major ground-based facilities and instruments with significant federal funding should be required as a matter of agency policy to include a plan and if necessary a budget for ensuring appropriate data acquisition, processing, archiving, and public access after a suitable proprietary period.” But note CODMAC (1982, NAS) report:  “Generally, data-system and data-analysis activities are not adequately funded. Underfunding results from at least three related causes: when there is insufficient planning in the early mission phases, the required funding will often be underestimated; overruns that occur during mission system development may absorb the funds allocated for data handling and analysis; and because of imperfections in the flight and ground hardware and software, the data processing may be more extensive than originally estimated.” CODMAC = Committee on Data Management and Computing

5 May 2011Robert Hanisch 777 Data management Well-characterized archival data enormously valuable, both from dedicated surveys and heterogeneous collections Data discovery/federation enabled by the Virtual Observatory; challenges remain  Need database technology capable of managing 10 9 – rows; potentially disruptive technology change  Need increases in network bandwidth, ability to move algorithm to data  Metadata management critical  Support for long-term access to survey data, other heritage data products, unclear Plan/budget for comprehensive archiving, long-term curation, VO-compatible access

5 May 2011Robert Hanisch 888 Observation and simulation Unprecedented opportunity for bringing together simulation and data faces us now Interoperability fostered by VO protocols/standards Need to improve access, transparency, reproducibility, return on investment, efficiency, and infrastructure Visualization tools essential for understanding simulations, large datasets, and relationships Simulations and observations must be made interoperable, facilitated by VO protocols and standards

5 May 2011Robert Hanisch 99 The Virtual Observatory The VO is foremost a data discovery, access, and integration facility International collaboration on metadata standards, data models, and protocols  Image, spectrum, time series data  Catalogs, databases  Transient event notices  Software and services  Distributed computing (authentication, authorization, process management)  Application inter-communication International Virtual Observatory Alliance established in 2001, patterned on WorldWideWeb Consortium (W3C)

5 May 2011Robert Hanisch 10 VO architecture

5 May 2011Robert Hanisch 11 VO architecture

5 May 2011Robert Hanisch 12 VO architecture

5 May 2011Robert Hanisch 13 US VO efforts National Virtual Observatory (NVO) development effort,  $14M, 17 organizations  NSF Information Technology Research program Virtual Astronomical Observatory (VAO) operational facility,  Funding is $5.5M/year for five years, subject to annual performance review, 9 organizations  $4M/year from NSF/AST  $1.5M/year from NASA  Covers ~27 FTE over the nine organizations VAO is managed by the VAO,LLC (limited liability company) co- owned by AUI (operates NRAO and ALMA) and AURA (operates NOAO and STScI)  VAO has its own Board of Directors (J. Gallagher, chair)  R. Hanisch, director; B. Berriman, program manager, D. De Young, project scientist, A. Szalay, technology advisor  G. Fabbiano, chair of Science Council

5 May 2011Robert Hanisch 14

5 May 2011Robert Hanisch 15 News at usvao.org

5 May 2011Robert Hanisch 16 Seven major areas of activity  Operations: T. McGlynn, HEASARC, A. Thakar, JHU  User Support: E. Stobie, NOAO, M. Nieto-Santisteban, JHU  Product Development: R. Plante, NCSA, G. Greene, STScI  Standards and Protocols: M. Graham, Caltech, D. Tody, NRAO  Data Preservation and Curation: A. Rots, SAO, J. Mazzarella, NED (A. Accomazzi, SAO/ADS)  Technology Evaluation: A. Mahabal, Caltech  Education and Public Outreach: B. Lawton, STScI Scope and functions

5 May 2011Robert Hanisch 17 Challenges Restarting a distributed team Working in an atmosphere of intense fiscal oversight Changing the mindset from R&D to facility operations Right-sizing processes: structure vs. straitjacket Managing expectations, timing releases of new capabilities User community take-up, building trust

5 May 2011Robert Hanisch 18 Science initiatives The VAO has selected seven science initiatives that were endorsed by the Science Council as providing maximal scientific impact in the astronomy community: 1.Development of a dedicated VAO Portal 2.Scalable cross-matching between catalogs of sources 3.Building and Analyzing Spectral Energy Distributions 4.Time Domain Astronomy: (a) Periodograms and light curve analyses; (b) Transient event services 5.Data Linking and Semantic Astronomy 6.Desktop Tool Integration 7.Data Mining and Statistical Analysis

5 May 2011Robert Hanisch 19 context sensitive interpreter Portal design concept

5 May 2011Robert Hanisch 20 SED tool Sherpa fitting module Specview display IVOA SAMP communication SAMP = Simple Applications Messaging Protocol

5 May 2011Robert Hanisch 21 SED tool architecture

5 May 2011Robert Hanisch 22 Cross-matching

5 May 2011Robert Hanisch 23 Time series integration/tools

5 May 2011Robert Hanisch 24 VAO-IRAF integration 2000 registered IRAF users ~5000 total users >700 IRAF tasks will become VO-aware

5 May 2011Robert Hanisch 25 Science studies 25 Four science initiatives will undergo a study period during Year 1:  Time Domain Astronomy (Transients)  Data Linking and Semantic Astronomy  Desktop Tool Integration, phase 2  Data Mining and Statistical Analysis The goals of these studies are to make recommendations on science deliverables for Year 2+ that will be evaluated by the Science Council.

5 May 2011Robert Hanisch 26 The research record and data Journals and preprints in astronomy are themselves data Data underlying the images and graphics published in journals not systematically preserved Without full stewardship of the research record, key elements of scientific process missing: reproducibility, integrity Develop data-friendly publication policies and long-term data stewardship solutions Monitor intellectual property, copyright, and open access policies and re-examine publishing business model VAO collaborating with NSF OCI-funded project, the Data Conservancy (DataNet program) NSF policy now requires data management plans with all proposals Roles for VAO: Advise on options, provide storage through VOSpace infrastructure, layer on Data Conservancy, integrate data/metadata capture into the publication process

5 May 2011Robert Hanisch 27 Science collaborations CANDELS: Cosmic Assembly Near-infrared Deep Extragalactic Legacy Survey  HST multi-cycle (3-year) treasury program, S. Faber and H. Ferguson, CoPIs, >100 members of science team  Multi-wavelength (radio to x-ray) study of >250k galaxies with 1.5 < z < 8  Understand initial epoch of star formation, disk formation, first generation of interactions and mergers, role of AGN formation in galaxy evolution SED-informed cross-matching VOEvent notices (supernovae) Image cut-out services

5 May 2011Robert Hanisch 28 CANDELS fields

5 May 2011Robert Hanisch 29 Small Magellanic Cloud Construct 3-dim model of SMC based on period-luminosity data on 3,000+ Cepheid variables  Construct SEDs for ~100M objects in 10x10 deg FOV  Stellar population study of a dwarf galaxy  Effects of galaxy interactions in dwarf systems  B. Madore (Carnegie) PI Test of scalable cross- matching and large-scale SED construction

5 May 2011Robert Hanisch 30 Summary Advanced facilities of the coming decade will produce unprecedented volumes of data, complex data Sound data management practices must be integrated into facility / instrumentation design and implementation We will live in a world of distributed data, distributed services Data discovery, access, re-use, and comparison, is enabled by adherence to VO standards and protocols New and/or potentially disruptive technologies will be needed to manage and understand massive data sets