Semantic Grid + Data Federation US National Virtual Observatory Roy Williams California Institute of Technology NVO co-director.

Slides:



Advertisements
Similar presentations
VOEvent An Information Infrastructure for Immediate Astronomical Events Roy Williams NVO co-director.
Advertisements

3 September 2004NVO Coordination Meeting1 Grid-Technologies NVO and the Grid Reagan W. Moore George Kremenek Leesa Brieger Ewa Deelman Roy Williams John.
September 13, 2004NVO Summer School1 VO Protocols Overview Tom McGlynn NASA/GSFC T HE US N ATIONAL V IRTUAL O BSERVATORY.
September 13, 2004NVO Summer School1 VO Protocols Overview Tom McGlynn NASA/GSFC T HE US N ATIONAL V IRTUAL O BSERVATORY.
6 September 2008NVO Summer School 2008 – Santa Fe1 DAL Clients: Scripting Data Access with Python Ray Plante T HE US N ATIONAL V IRTUAL O BSERVATORY.
Sept NVO Summer School1 Cone, SIAP, and OpenSkyQuery Client Development Gretchen Greene, Maria Nieto-Santisteban T HE US N ATIONAL V IRTUAL O.
8 September 2008NVO Summer School 2008 – Santa Fe1 Publishing Data and Services to the VO Ray Plante Gretchen Greene T HE US N ATIONAL V IRTUAL O BSERVATORY.
Collection Service. 19 February 2001CYCLADES Kick-off meeting Collection A set of documents A set of services on the documents A set of polices that regulate.
Grid Astronomy with Image Federation Roy Williams Michael Feldmann California Institute of Technology.
CASDA Virtual Observatory CSIRO ASTRONOMY AND SPACE SCIENCE Arkadi Kosmynin 11 March 2014.
1 OGC Web Services Kai Lin San Diego Supercomputer Center
14 October 2003ADASS 2003 – Strasbourg1 Resource Registries for the Virtual Observatory R.Plante (NCSA), G. Greene (STScI), R. Hanisch (STScI), T. McGlynn.
October 12, 2003ADASS NVO Tutorial1 How-To Implement Cone and SIA Services Gretchen Greene Space Telescope Science.
Long-Term Preservation of Astronomical Research Results Robert Hanisch US National Virtual Observatory Space Telescope Science Institute Baltimore, MD.
Virtual Observatory Architecture Data Services Registry Services Compute Services Roy Williams Caltech US-VO co-director.
July 2004SC4DEVO -Caltech1 IVOA WebServices William O’Mullane The Johns Hopkins University T HE US N ATIONAL V IRTUAL O BSERVATORY.
An Astronomical Image Mosaic Service for the National Virtual Observatory / ESTO.
Digitized Sky Survey Update Brian McLean : Archive Sciences Branch / Operations and Engineering Division.
Why Build Image Mosaics for Wide Area Surveys? An All-Sky 2MASS Mosaic Constructed on the TeraGrid A. C. Laity, G. B. Berriman, J. C. Good (IPAC, Caltech);
Supported by the National Science Foundation’s Information Technology Research Program under Cooperative Agreement AST with The Johns Hopkins University.
11/27/2003IVOA Small Projects Meeting China-VO Data Access Service Based on OGSA Jian Sang National Astronomical Observatory of China Chinese Virtual.
Diversity of domain descriptions in natural science: virtual observatory as a case study Briukhov D.O., Kalinichenko L.A., Zakharov V.N. Institute of Informatics.
Astronomical Data Query Language Simple Query Protocol for the Virtual Observatory Naoki Yasuda 1, William O'Mullane 2, Tamas Budavari 2, Vivek Haridas.
DateADASS How to Navigate VO Datasets Using VO Protocols Ray Plante (NCSA/UIUC), Thomas McGlynn and Eric Winter NASA/GSFC T HE US N ATIONAL V IRTUAL.
Functions and Demo of Astrogrid 1.1 China-VO Haijun Tian.
29-30 April 2004NVO Team Meeting NCSA1 Data Access Layer (DAL) SSA, SIA Enhancement Doug Tody National Radio Astronomy Observatory National Virtual Observatory.
Virtual Observatory & LIGO Roy Williams California Institute of Technology.
Science with the Virtual Observatory Brian R. Kent NRAO.
NEON Obs School 11-Aug-2005 Archival Data and Virtual Observatories 1 Virtual Observatories...or how to do your research from a beach in the Bahamas rather.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
Prototype system of the Japanese Virtual Observatory The Japanese Virtual Observatory (JVO) aims at providing easy access to federated astronomical databases.
Knowledge Extraction from Scientific Data Roy Williams California Institute of Technology SDMIV 24 October 2002 Edinburgh KE ToolsS Data.
JVO JVO Portal Japanese Virtual Observatory (JVO) Prototype 2 Masahiro Tanaka, Yuji Shirasaki, Satoshi Honda, Yoshihiko Mizumoto, Masatoshi Ohishi (NAOJ),
P Structured Query Language for Virtual Observatory Yuji Shirasaki National Astronomical Observatory of Japan, and Masahiro Tanaka (NAOJ), Satoshi.
The Digital Library for Earth System Science: Contributing resources and collections Meeting with GLOBE 5/29/03 Holly Devaul.
Federation and Fusion of astronomical information Daniel Egret & Françoise Genova, CDS, Strasbourg Standards and tools for the Virtual Observatories.
Federated Discovery and Access in Astronomy Robert Hanisch (NIST), Ray Plante (NCSA)
Web Services for the National Virtual Observatory Tamás Budavári Johns Hopkins University.
Some Grid Science California Institute of Technology Roy Williams Paul Messina Grids and Virtual Observatory Grids and and LIGO.
CMU-CS lunch talk, Gerard Lemson1 Computational and statistical problems for the Virtual Observatory With contributions from/thanks to: GAVO.
Hyperatlas Coregistered Federated Imagery Roy Williams Bruce Berriman George Djorgovski John Good Reagan Moore Caltech CACR Caltech IPAC Caltech Astronomy.
OAI Overview DLESE OAI Workshop April 29-30, 2002 John Weatherley
The International Virtual Observatory Alliance (IVOA) interoperability in action.
Data Archives: Migration and Maintenance Douglas J. Mink Telescope Data Center Smithsonian Astrophysical Observatory NSF
Workshop on How to Publish Data in VO ESAC, June 25-June DAL (Data Access Layer) protocols Jesus Salgado
German Astrophysical Virtual Observatory Overview and Results So Far W. Voges, G. Lemson, H.-M. Adorf.
Virtual Observatories, Press Release Images, and Web Services Dr. Frank Summers Space Telescope Science Institute November 3, 2005.
IVOA RM, VOResources, Identifiers, Interfaces Chenzhou CUI.
12 Oct 2003VO Tutorial, ADASS Strasbourg, Data Access Layer (DAL) Tutorial Doug Tody, National Radio Astronomy Observatory T HE US N ATIONAL V IRTUAL.
Sharing Digital Scores: Will the Open Archives Initiative Protocol for Metadata Harvesting Provide the Key? Constance Mayer, Harvard University Peter Munstedt,
VOEvent and the Registry Introducing VOEventStream and VOEventService Roy Williams Caltech.
Publishing Combined Image & Spectral Data Packages Introduction to MEx M. Sierra, J.-C. Malapert, B. Rino VO ESO - Garching Virtual Observatory Info-Workshop.
7 Dec 2009R. J. Hanisch: Astronomy Data Standards CERN 1 Data Standards in Astronomy Dr. Robert J. Hanisch Director, US Virtual Astronomical Observatory.
IVOA Small Projects Meeting Application to the science S. Honda, Y. Shirasaki, M. Tanaka and JVO team National Astronomical Observatory of Japan.
VO Data Access Layer IVOA Cambridge, UK 12 May 2003 Doug Tody, NRAO.
© Roy Williams 2002 The Uphill Battle of Semantic Interoperability Roy Williams California Institute of Technology.
Describing resources II: Dublin Core CERN-UNESCO School on Digital Libraries Rabat, Nov 22-26, 2010 Annette Holtkamp CERN.
An Application Profile and Prototype Metadata Management System for Licensed Electronic Resources Adam Chandler Information Technology Librarian Central.
Union Catalog of NDAP Johnny Chang. Agenda What is Union Catalog? Background Prospectus Who is involved? The architecture of Union Catalog Protocol Metadata.
Geospatial metadata Prof. Wenwen Li School of Geographical Sciences and Urban Planning 5644 Coor Hall
Introduction: AstroGrid increases scientific research possibilities by enabling access to distributed astronomical data and information resources. AstroGrid.
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
TRIG: Truckee River Info Gateway Dave Waetjen Graduate Student in Geography Information Center for the Environement (ICE) University of California, Davis.
Enhancements to Galaxy for delivering on NIH Commons
Standard Query Language for VO
Accomplishments RSM v0.7 First draft XML Schema completed: VOResource.xsd NVO: Working prototype resource using VOResource as format for metadata exchange.
OAI and Metadata Harvesting
EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal
Google Sky.
IVOA Interoperability Meeting - Boston
Presentation transcript:

Semantic Grid + Data Federation US National Virtual Observatory Roy Williams California Institute of Technology NVO co-director

What is NVO? –Standard protocols, standard data types XML transfer protocol (VOTable) Resource description (VOResource etc) Publish/discover to federated registry (OAI) Semantic Types (UCD) Services: Cone search, Simple Image Access –Computing with big data on the Grid Database Crossmatch Image Federation: Atlases

First NVO Discovery

Database Fuzzy Join 2MASS versus SDSS cross- identification with - j_m as 2MASS magnitude and - I_mtotn as SDSS magnitude 2MASS : j_m,+ 15 SDSS: I_mtotn <= 18 Billion Source Cross-Identification: A Computational Challenge SDSS unmatched 2MASS matched SDSS matched 2MASS unmatched

Crossmatch Services SDSS database 2MASS database query Crossmatch service query scientific knowledge! NVO protocols

First NVO Discovery Database crossmatch of two massive databases creates new science “The sum is greater than the parts”

Semantic Grid

Cone Search First VO standard service Input: RA, DEC, SR must be present –decimal degrees J2000 Output: VOTable of sky-located data records –must have columns with UCDs: POS_EQ_RA_MAIN, POS_EQ_DEC_MAIN, ID_MAIN RA=300 DEC=25 SR=0.1 IDRADECxyz Request Response

Cone Search Registry POS_EQ_RA_MAIN POS_EQ_DEC_MAIN POS_EQ ID URLbase RA=200&DEC=20&SR=2 Request: HTTPget of shape: Response: VOTable of shape: A collection of services that have the same shape

Cone Search + Density Probe Cone Search Density Probe baseURL Spacing Search radius interoperating NVO-compliant services! Federation of Multiple Services

NVO Image Protocol SIAP Specify box by position and size SIAP server returns relevant images Footprint Logical Name URL Can choose: standard URL: SRB URL srb://nvo.npaci.edu/…..

Simple Image Access Service Query is sky region May query on image type, image geometry Response is VOTable of images Each has WCS (geometry) parameters Plus a URL to fetch the image Designed for Set of pointed observations (eg Hubble) Wide-area survey (eg Sloan) Image service –Mosaicking –Reprojection

Data Inventory Service What data covers a position in the sky? Registry OAIPublish Registry OAIQuery Registry OAIPublish DIS Caltech NCSA JHU/StSci Goddard

Data Inventory Service Request is a cone on the sky

Data Inventory Service Relevant Images and Catalogs NVSS Image ROSAT catalog

Image Federation

VO Registry Schemas & Service Types VOResourceID ivo://me.com/file123 Query service R R Portals Tools & Services Databases Grid Virtual Data md server for ivo:// VOView Fill-in forms Visualization Reports Publishing OAI Publish service Aladin OASIS DIS

What is in the Registry? Answer: “Entities” It has a global identifier ivo://……. –Must be resolved by authority It has “VOViews” –Queries return these …..and that’s all!

3 Views of an Entitiy Zoo-keeper metadata: carrots yes strong Transportation metadata: 4000 kg no carrots heavy Zoo-manager metadata: per day carrots “entity”

VOResource A mandatory form plus other supporting forms

Schemas and Service Types VOResource –Entity description form Organzation, project, data collection, service Has ivo:// identifier VORegion –sky coverage form (α/δ/λ) VOTable –star catalog, image list, other tables OAI –Registry harvesting –Distributed virtual registry CONE –Request-response for catalog SIAP –Request-response for images When can I publish my own schema to VO?

Dublin Core Metadata Title A name given to the resource. Creator An entity primarily responsible for making the content of the resource. Subject A topic of the content of the resource. Description An account of the content of the resource. Publisher An entity responsible for making the resource available Contributor An entity responsible for making contributions to the content of the resource. Date A date of an event in the lifecycle of the resource. Type The nature or genre of the content of the resource. Format The physical or digital manifestation of the resource. Identifier An unambiguous reference to the resource within a given context. Source A Reference to a resource from which the present resource is derived. Language A language of the intellectual content of the resource. Relation A reference to a related resource. Coverage The extent or scope of the content of the resource. Rights Information about rights held in and over the resource. Curation data for “any human creation”

Dublin Core Dublin Core is how the VO will interoperate with libraries of the world A global metadata standard

Prototype Registry Organization Data Collection Project Service SIA service

VOViews VOResource view Dublin Core view

OAI: Open Archives Initiative Harvesting Protocol OAI is popular –Ask your University librarian Distributed Comprehensive Registry –Harvesting Different views for different purposes –Six blind men and the elephant

OAI Harvesting Protocol 6 magic verbs of OAI

VO Identifiers ivo://mydomain.com / mySkySurvey # file00037.fits URI form Still in flux Authority ID Registered with IVOA Must correspond to a registry Resource ID Created by Authority Resolved by registry Record ID Not known to registry delimiter

Image Federation

Multispectral Imagery Crab Nebula. 3 channels: X-ray in blue, optical in green, and radio in red. Moffet Field California. 224 channels from 400 nm to 2500 nm

Image Federation detection Stacking allows detection of faint sources. A 1-sigma detection in each of many bands becomes a 3- sigma detection. Images of the same galaxy taken several days apart are automatically subtracted from one another, and remaining bright spots may be supernova candidates. (NEAT project) Image subtraction allows detection of narrow-line features that are not also wide-band (eg Hα but not R- band)

Principle Components SDSS (5 channel) SDSS+2MASS (8 channel)

Mosaicking and Federation Every Astronomical image has a different projection different pointing of the telescope We want to mosaic different images We want to federate different information Compute intensive: flux in each pixel is carefully distributed into a new pixel grid Mosaicking Federation Infrared map Xray map today Xray map last year

Atlasmaker Uses Montage, Yoursky Project Estimate & correct Background Co-Add Data Chart David Hockney Pearblossom Highway 1986

Images and Charts Image Big data Chart Map: sphere → plane FITS-WCS header small data An atlas is a collection of charts Hyperatlas is an attempt to standardize atlases

Hyperatlas Standard naming for atlases and vcharts TM-5-SIN-20 Vchart TM-5-SIN Standard Scales: scale s means 2 20-s arcseconds per pixel SIN projection TAN projection TM-5 layout HV-4 layout Standard Projections Standard Layout

Parallel Atlasmaker MPI Parallellism ~2% serial work (Amdahl) Projection is parallel All nodes share filespace Making a single Image  Making an Atlas of 1736 Images Teragrid Distributed Federated Scheduling wanted SRB as Virtual Data Catalog

Atlasmaker Architecture NVO/IVO NED Sloan DPOSS FIRST [2MASS] NVO Protocol making atlas pages scale reproject compress sky index Virtual Data System YourSky VirtualSky Oasis VIEW Bus federation data mining Hyperatlas service SIAP services

Atlasmaker Virtual Data System Metadata repositories Federated by OAI Data repositories Federated by SRB Compute resources Federated by TG/IPG Mosaicked data is on file 2a. Mosaicked data is not on file 2d: Store result & return result 2c: Compute on TG/IPG User request Request manager 2b. Get raw data from NVO resources

Atlasmaker stack Mosaicking (executables) Atlasmaker (script) Hyperatlas (service) NVO Image Access (service) SRB (service) web MontageYourSky Virtual Data System -- Chimera?

Charts and Pages Chart – a frame for specific data Page – an organization for data The virtual disk is 400,000 pixels wide SIN projection

Background Correction UncorrectedCorrected

Montage Background Correction Project pixels to output chart Fit ramps on overlap regions Fit ramps on projected images Subtract from Pixel values