AstroGrid Datacenters AstroGrid Consortium Review Dec 2004 Martin Hill

Slides:



Advertisements
Similar presentations
VODA - A Sampo Project Johan Lindroos – CSC Scientific Computing Ltd, Finland Pekka Järveläinen – CSC Scientific Computing Ltd, Finland Richard Hook -
Advertisements

IVOA, Kyoto May Data Access Layer Working Group Working Group Report and Summary Doug Tody National Radio Astronomy Observatory International.
September 13, 2004NVO Summer School1 VO Protocols Overview Tom McGlynn NASA/GSFC T HE US N ATIONAL V IRTUAL O BSERVATORY.
September 13, 2004NVO Summer School1 VO Protocols Overview Tom McGlynn NASA/GSFC T HE US N ATIONAL V IRTUAL O BSERVATORY.
Sept NVO Summer School1 Cone, SIAP, and OpenSkyQuery Client Development Gretchen Greene, Maria Nieto-Santisteban T HE US N ATIONAL V IRTUAL O.
8 September 2008NVO Summer School 2008 – Santa Fe1 Publishing Data and Services to the VO Ray Plante Gretchen Greene T HE US N ATIONAL V IRTUAL O BSERVATORY.
Aus-VO Workshop 2003 International Virtual Observatory Alliance effort on Virtual Observatory Query Language Naoki Yasuda (JVO), VOQL WG.
Remote Visualisation System (RVS) By: Anil Chandra.
A PPARC funded project AstroGrid Framework Consortium meeting, Dec 14-15, 2004 Edinburgh Tony Linde Programme Manager.
A PPARC funded project The Grid Data Warehouse Description of prototype work in progress by AstroGrid. Access-Grid lecture to Universities of Leeds and.
CASDA Virtual Observatory CSIRO ASTRONOMY AND SPACE SCIENCE Arkadi Kosmynin 11 March 2014.
19-20 March 2003 IVOA Registry Workgroup LeSc Astrogrid Registry: Early Designs Elizabeth Auden Astrogrid Registry Workgroup Leader IVOA Registry Workgroup.
14 October 2003ADASS 2003 – Strasbourg1 Resource Registries for the Virtual Observatory R.Plante (NCSA), G. Greene (STScI), R. Hanisch (STScI), T. McGlynn.
A PPARC funded project Tony Linde Programme Manager eScience meets eFrameworks 28 th April 2006 NeSC, Edinburgh.
Solar and STP Physics with AstroGrid 1. Mullard Space Science Laboratory, University College London. 2. School of Physics and Astronomy, University of.
A PPARC funded project AstroGrid’s Common Execution Architecture Guy Rixon, reporting on behalf of Paul Harrison and the other AstroGrid developers SC4DEVO-1,
Data Grid Web Services Chip Watson Jie Chen, Ying Chen, Bryan Hess, Walt Akers.
Professional Informatics & Quality Assurance Software Lifecycle Manager „Tools that are more a help than a hindrance”
2003 April 151 Data Centres: Connecting to the Real World Clive Page.
Astrogrid Resource Registry Querying the Registry 1.Mullard Space Science Laboratory, University College London, Holmbury St. Mary, Dorking, Surrey RH5.
A PPARC funded project AstroGrid Architecture Consortium Meeting, Leicester, 3 rd Nov 2003.
Astronomical Data Query Language Simple Query Protocol for the Virtual Observatory Naoki Yasuda 1, William O'Mullane 2, Tamas Budavari 2, Vivek Haridas.
VO & Astro-Wise & others A.Belikov OmegaCEN
VOQL WG Progress Report May 28, 2004 Masatoshi Ohishi.
AstroGrid Datacenters ESO M Hill (ROE), Aug 2004.
WSRF Supported Data Access Service (VO-DAS)‏ Chao Liu, Haijun Tian, Dan Gao, Yang Yang, Yong Lu China-VO National Astronomical Observatories, CAS, China.
EdSkyQuery-G Overview Brian Hills, December
Functions and Demo of Astrogrid 1.1 China-VO Haijun Tian.
Last News of and
A PPARC funded project AstroGrid approach to the Virtual Observatory Architecture ADASS XIV Pasadena Oct-2004 Tony Linde Andrew Lawrence Keith Noddle.
29-30 April 2004NVO Team Meeting NCSA1 Data Access Layer (DAL) SSA, SIA Enhancement Doug Tody National Radio Astronomy Observatory National Virtual Observatory.
Virtual Observatory Interfaces reused in the Virtual Atomic and Molecular Data Centre Guy Rixon Institute of Astronomy University of Cambridge September.
Summary of distributed tools of potential use for JRA3 Dugan Witherick HPC Programmer for the Miracle Consortium University College.
A PPARC funded project AstroGrid Intro & Demo John Taylor Institute for Astronomy, Edinburgh.
ILDG Middleware Status Chip Watson ILDG-6 Workshop May 12, 2005.
1 Database Management Systems: part of the solution or part of the problem? Clive Page 2004 April 28.
Federation and Fusion of astronomical information Daniel Egret & Françoise Genova, CDS, Strasbourg Standards and tools for the Virtual Observatories.
IVOA, Kyoto May Data Access Layer Thoughts on ADQL/DAL Integration Doug Tody (NRAO) International V IRTUAL O BSERVATORY.
A PPARC funded project Astronomical services: situated software vs. commodity software Guy Rixon, AstroGrid/AVO/IVOA Building Service Based Grids - GGF11.
Astronomical Data Archiving and Curation Clive Page AstroGrid Project University of Leicester 2004 March 22.
Solar and space physics datasets within a Virtual Observatory: the AstroGrid experience Silvia Dalla * and Nicholas A Walton  * School of Physics & Astronomy,
AstroGrid Solar/STP planning meeting Agenda: Helioscope Preparing for Solar-B Time-series viewing application IVOA and time series A PPARC funded project.
User Working Group 2013 Data Access Mechanisms – Status 12 March 2013
16 October 2003Registry Interface CallsIVOA Interoperability, Strasbourg IVOA Interoperability Elizabeth Auden & Registry Workgroup 16 – 17 October 2003.
The International Virtual Observatory Alliance (IVOA) interoperability in action.
A PPARC funded project Tony Linde Programme Manager AG-SAG FM5: AG2/VOTech Scope & Targets Glasgow, 7 Feb 2005 VOTech Project.
May 17, 2005Maria Nieto-Santisteban, JHU / IVOA - Kyoto1 VO JHU Open SkyQuery and more … T. Budavari, S. Carliles, L. Dobos, G. Fekete,
May 24, 2004IVOA Interop Meeting1 An AXIS-based Java SkyNode Ramon Williamson NCSA T HE US N ATIONAL V IRTUAL O BSERVATORY.
AstroGrid Datacenters IVOA Interoperability meeting M Hill (ROE), May 2004.
German Astrophysical Virtual Observatory Overview and Results So Far W. Voges, G. Lemson, H.-M. Adorf.
AstroGrid Usability & Docs, JBO, 6 th Dec 2005 Jonathan Tedds Leicester University AstroGrid Usability & Documentation Usability –Documentation –Infrastructure.
UCL DEPARTMENT OF SPACE AND CLIMATE PHYSICS MULLARD SPACE SCIENCE LABORATORY Taverna Plugin VAMDC and HELIO (part of the ‘taverna-astronomy’ edition) Kevin.
A PPARC funded project Astro-Wise meeting April 2004 OmegaCEN, Kapteyn Institute, Groningen Tony Linde AstroGrid Project Manager University of Leicester.
12 Oct 2003VO Tutorial, ADASS Strasbourg, Data Access Layer (DAL) Tutorial Doug Tody, National Radio Astronomy Observatory T HE US N ATIONAL V IRTUAL.
The Large Synoptic Survey Telescope Project Bob Mann Wide-Field Astronomy Unit University of Edinburgh.
AstroGrid consortium meeting, December 2005 Slide 1 Architecture review Guy Rixon AstroGrid consortium meeting Jodrell Bank, December 2005.
The GridPP DIRAC project DIRAC for non-LHC communities.
USGS GRID Exploratory Status Review Stuart Doescher Mike Neiers USGS/EDC May
Steven Perry Dave Vieglais. W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Overview WASABI is a framework for.
Publishing Combined Image & Spectral Data Packages Introduction to MEx M. Sierra, J.-C. Malapert, B. Rino VO ESO - Garching Virtual Observatory Info-Workshop.
Evanthia Hatziminaoglou, ESO - Garching Virtual Observatory Info-Workshop, SOFIA January 2008 VO Tools Overview.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
VO Data Access Layer IVOA Cambridge, UK 12 May 2003 Doug Tody, NRAO.
The GridPP DIRAC project DIRAC for non-LHC communities.
Sept. 2004IVOA Meeting / Pune1 Virtual Observatory Query Language (VOQL) Working Group William O’Mullane For Masatoshi Oishi T HE US N ATIONAL V IRTUAL.
International Planetary Data Alliance Registry Project Update September 16, 2011.
Introduction: AstroGrid increases scientific research possibilities by enabling access to distributed astronomical data and information resources. AstroGrid.
Google Sky.
Google Sky.
CEA Experiences Paul Harrison ESO.
Presentation transcript:

AstroGrid Datacenters AstroGrid Consortium Review Dec 2004 Martin Hill

Outline Challenge Challenge Approach Approach Developed: Developed: StorepointsStorepoints Describing dataDescribing data Query LanguageQuery Language StatusStatus VersioningVersioning Software: Publisher’s AstroGrid Library Software: Publisher’s AstroGrid Library

Problem Challenge Outline Large datasets (to Petabytes) Large datasets (to Petabytes) So? So? Distributed; Science comes from combining Distributed; Science comes from combining Bandwidth rising slower than Bandwidth rising slower than No/few established suitable standards No/few established suitable standards FITS images/‘tables’. Ambiguous headers. Ambiguous subformat, eg spectra.FITS images/‘tables’. Ambiguous headers. Ambiguous subformat, eg spectra. VOTable introduced. Ambiguous subformat eg spectra vs catalogue. Verbose.VOTable introduced. Ambiguous subformat eg spectra vs catalogue. Verbose. No/few established common terms No/few established common terms Involves Scientists… Involves Scientists…

Approach: ‘Publisher’s AstroGrid Library’ General solution to: General solution to: Discover problems faced, accumulate solutions in softwareDiscover problems faced, accumulate solutions in software Experimentally publish sets and types (not host).Experimentally publish sets and types (not host). Many smaller datasets owned by people without web skills (eg solar) so:Many smaller datasets owned by people without web skills (eg solar) so: Need 'easy‘/’unskilled’ installation Need 'easy‘/’unskilled’ installation Able to proxy; 3rd parties can publish data without requiring more work from owner (eg VizieR, Trace) Able to proxy; 3rd parties can publish data without requiring more work from owner (eg VizieR, Trace) ‘Free’ website, range of standard interfaces ‘Free’ website, range of standard interfaces Danger: too general (any query against any dataset producing any results). Danger: too general (any query against any dataset producing any results).

Existing Solutions Common task: publish RDBMs to web Common task: publish RDBMs to web Accumulated tools & skill-sets Accumulated tools & skill-sets No combined solution offering: No combined solution offering: Standard interface (eg query language)Standard interface (eg query language) Scientific values (errors, units)Scientific values (errors, units) Spatial querying (common)Spatial querying (common) VO Metadata for query and resultsVO Metadata for query and results

Developing Standards Resource metadata Resource metadata Query language (ADQL/s, ADQL/x) Query language (ADQL/s, ADQL/x) Web interfaces Web interfaces Working beyond standards Working beyond standards  Feeding research to IVOA  Feeding research to IVOA Parallel development Parallel development In the VO: eg Starlink, NVO, VizieRIn the VO: eg Starlink, NVO, VizieR External: SRB, Taverna, GridPP monitorExternal: SRB, Taverna, GridPP monitor ConvergenceConvergence

Protocols & Interfaces Human – web pages Human – web pages SOAP SOAP Toolkit IncompatibilitiesToolkit Incompatibilities Streaming awkward (via Toolkits)Streaming awkward (via Toolkits) Longer term benefits?Longer term benefits? ‘Raw Http post’ (eg servlets, CGI) ‘Raw Http post’ (eg servlets, CGI) SimplerSimpler More existing skills amongst AstronomersMore existing skills amongst Astronomers Mixed (eg SIAP, SkyNode) Mixed (eg SIAP, SkyNode)  Don’t Choose – Implement  Don’t Choose – Implement Mix & Match, Plug & Play: Mix & Match, Plug & Play:

Releasing Deploy early – if temporarily Deploy early – if temporarily Independent & Integrated Access Independent & Integrated Access Versioning: Versioning: Servers & clients, ie new clients can still use old servers, and new servers work with old clients.Servers & clients, ie new clients can still use old servers, and new servers work with old clients. Add and ‘deprecate’, don’t changeAdd and ‘deprecate’, don’t change Delete intelligentlyDelete intelligently (Remove quickly unused i/fs, eg CEA if CEA upgrades, JSPs) (Remove quickly unused i/fs, eg CEA if CEA upgrades, JSPs) Need hosts… Need hosts… Hosts need hardwareHosts need hardware Publishers need to know their dataPublishers need to know their data

Describing Data Registry ‘Resource’ documents Registry ‘Resource’ documents IVO Tabular Sky Service IVO Tabular Sky Service Units, UCDsUnits, UCDs Solar vs Sky vs… Solar vs Sky vs… Images vs Catalogues Images vs Catalogues Concept extended for ‘RdmsMetadata’ Concept extended for ‘RdmsMetadata’ UCD1+ -> Dictionaries & OntologiesUCD1+ -> Dictionaries & Ontologies Relationships (simple: errors)Relationships (simple: errors) Queryable Queryable Mirrors vs Copies Mirrors vs Copies

Query Language SQL -> ADQL/xml SQL -> ADQL/xml Defined common functions – CIRCLE & XMATCH (sky not solar) Defined common functions – CIRCLE & XMATCH (sky not solar) Working on: Working on: XQLXQL UnitsUnits Investigating: UCDs instead of columnsInvestigating: UCDs instead of columns Cross-dataset queryingCross-dataset querying

Results Query+Metadata+RawResults = VoResults Query+Metadata+RawResults = VoResults FITS vs VOTable vs HDF vs CSV vs HTML vs… FITS vs VOTable vs HDF vs CSV vs HTML vs…  All of them  All of them Results -> queryable data -> inputs Results -> queryable data -> inputs

Data Analysis Faster  feasible Faster  feasible < 10^6s OK. 10^8 not…< 10^6s OK. 10^8 not… Joins Joins Polar coordinate matches (+ HTM, HealPix).Polar coordinate matches (+ HTM, HealPix). Cross-match algorithmsCross-match algorithms Distributed queries Distributed queries Breaking down queryBreaking down query Moving the right dataMoving the right data Combining the resultsCombining the results (Clive Page)

Status Readily available Readily available Debugging; developer Debugging; developer Debugging; astronomer Debugging; astronomer Inform User Inform User

Storepoints No data persistence at PALs No data persistence at PALs Web server machines not data storage onesWeb server machines not data storage ones Large result setsLarge result sets No workspace, memory models, etcNo workspace, memory models, etc  Streaming outputs  Streaming outputs SRB, GridFTP not ready. SRB, GridFTP not ready.

Identifying Storepoints Concepts Concepts MySpace FTP SRB HTTP GridFTP Community HomeSpace VoSpace (Registered)  FTP, File, MySpace + extend.  FTP, File, MySpace + extend. 3 rd iteration; 2 nd in use 3 rd iteration; 2 nd in use SRB GridFTP MySpace SRB

Data Service Architecture Datacenter Implementation Slinger Axis Cone SIAP Plugin Manager /XML/CSV zip/plain /file/ftp /myspace AstroGrid CEA SkyNode JSP

Publishers’ AstroGrid Library ‘Easy to publish to the VO’ ‘Easy to publish to the VO’ Web Application, includes: Web Application, includes: SOAP (AstroGrid, CEA, prepped for SkyNode)SOAP (AstroGrid, CEA, prepped for SkyNode) CGI (SIAP, NVO-cone search, SSA)CGI (SIAP, NVO-cone search, SSA) HTML pages (cone search, query builder, status monitor)HTML pages (cone search, query builder, status monitor) Features Features Asynchronous (‘stateful’) & Synchronous QueriesAsynchronous (‘stateful’) & Synchronous Queries QueuesQueues Comprehensive Status (incl historical)Comprehensive Status (incl historical) Variety resultsVariety results Fully ‘Streamed’ – no curation issuesFully ‘Streamed’ – no curation issues Server ‘Plugins’, including: Server ‘Plugins’, including: RDBMS (JDBC)RDBMS (JDBC) FITS file collectionFITS file collection eXist (XML)eXist (XML) Helper Tools Helper Tools Metadata GeneratorsMetadata Generators Ready-made website accessReady-made website access

Situation Now Installed: Installed: SuperCOSMOS Science Archive (RDBMS)SuperCOSMOS Science Archive (RDBMS) astrogrid.roe.ac.uk:8080/pal-ssa/ astrogrid.roe.ac.uk:8080/pal-ssa/ astrogrid.roe.ac.uk:8080/pal-twomass/ astrogrid.roe.ac.uk:8080/pal-twomass/ astrogrid.roe.ac.uk:8080/pal-usnob/ astrogrid.roe.ac.uk:8080/pal-usnob/ 6dF – Spectra6dF – Spectra grendel12.roe.ac.uk:8080/pal-6df/ grendel12.roe.ac.uk:8080/pal-6df/ Wide Field SurveyWide Field Survey TRACE (FITS files, Solar, under test)TRACE (FITS files, Solar, under test) Proxy (bespoke special plugins) Proxy (bespoke special plugins) All NVO-cone-compatible DBs (test)All NVO-cone-compatible DBs (test) VizieRVizieR Evaluated/ing at: Evaluated/ing at: ESOESO RAL (solar)RAL (solar) JBO (Merlin)JBO (Merlin) Reviewing Query Language, metadata documents, etc Reviewing Query Language, metadata documents, etc

Future Quality… Quality… Metadata ‘wizards’ Metadata ‘wizards’ Sell to hosts; deploy to Leicester, JBO, ESO, RAL, The World.... Sell to hosts; deploy to Leicester, JBO, ESO, RAL, The World.... Explicit and Investigative Queries Explicit and Investigative Queries Distributed queries & combining results (NVO Exec plans) Distributed queries & combining results (NVO Exec plans) Full SIA, SSA interface Full SIA, SSA interface More user & admin web pages More user & admin web pages Local authorisation Local authorisation