INFSO-RI Enabling Grids for E-sciencE ESR Database Access K. Ronneberger,DKRZ, Germany H. Schwichtenberg, SCAI, Germany S. Kindermann,

Slides:



Advertisements
Similar presentations
Data Management Expert Panel. RLS Globus-EDG Replica Location Service u Joint Design in the form of the Giggle architecture u Reference Implementation.
Advertisements

Grid and CDB Janusz Martyniak, Imperial College London MICE CM37 Analysis, Software and Reconstruction.
1 Grid services based architectures Growing consensus that Grid services is the right concept for building the computing grids; Recent ARDA work has provoked.
GGF Toronto Spitfire A Relational DB Service for the Grid Peter Z. Kunszt European DataGrid Data Management CERN Database Group.
UMIACS PAWN, LPE, and GRASP data grids Mike Smorul.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Makrand Siddhabhatti Tata Institute of Fundamental Research Mumbai 17 Aug
INFSO-RI Enabling Grids for E-sciencE Intelligent Distributed Data Management in Earth system science K. Ronneberger, DKRZ, Germany.
EU 2nd Year Review – Jan – WP9 WP9 Earth Observation Applications Demonstration Pedro Goncalves :
GRACE Project IST EGAAP meeting – Den Haag, 25/11/2004 Giuseppe Sisto – Telecom Italia Lab.
Towards a Javascript CoG Kit Gregor von Laszewski Fugang Wang Marlon Pierce Gerald Guo
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Simply monitor a grid site with Nagios J.
ES Metadata Management Enabling Grids for E-sciencE ES metadata OGSA-DAI NA4 GA Meeting, D. Weissenbach, IPSL, France.
ESP workshop, Sept 2003 the Earth System Grid data portal presented by Luca Cinquini (NCAR/SCD/VETS) Acknowledgments: ESG.
EGEE-III INFSO-RI Enabling Grids for E-sciencE The Medical Data Manager : the components Johan Montagnat, Romain Texier, Tristan.
Enabling Grids for E-sciencE ENEA and the EGEE project gLite and interoperability Andrea Santoro, Carlo Sciò Enea Frascati, 22 November.
INFSO-RI Enabling Grids for E-sciencE Project Gridification: the UNOSAT experience Patricia Méndez Lorenzo CERN (IT-PSS/ED) CERN,
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks LOFAR Archive Information System Kor Begeman.
INFSO-RI Enabling Grids for E-sciencE Supporting legacy code applications on EGEE VOs by GEMLCA and the P-GRADE portal P. Kacsuk*,
WP9 – Earth Observation Applications – n° 1 WP9 report to Plenary ESA, KNMI, IPSL Presented by M. Petitdidier, IPSL DataGrid Plenary Session 5 th Project.
Enabling Grids for E-sciencE EGEE-III INFSO-RI I. AMGA Overview What is AMGA Metadata Catalogue of EGEE’s gLite 3.1 Middleware Main Feature of.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Integration of Astro-WISE with Grid storage.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE middleware: gLite Data Management EGEE Tutorial 23rd APAN Meeting, Manila Jan.
Enabling Grids for E-sciencE Introduction Data Management Jan Just Keijser Nikhef Grid Tutorial, November 2008.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks AMGA PHP API Claudio Cherubino INFN - Catania.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
INFSO-RI Enabling Grids for E-sciencE OGSA DAI Data Access and Integration Marek Ciglan Institute of Informatics, Slovac Academy.
© Geodise Project, University of Southampton, Geodise Middleware & Optimisation Graeme Pound, Hakki Eres, Gang Xue & Matthew Fairman Summer 2003.
Replica Management Services in the European DataGrid Project Work Package 2 European DataGrid.
EGEE-II INFSO-RI Enabling Grids for E-sciencE The GILDA training infrastructure.
Scientific Data Grid & China-VO Kai Nan Computer Network Information Center Chinese Academy of Sciences November 27, 2003.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Status report on Application porting at SZTAKI.
INFSO-RI Enabling Grids for E-sciencE gLite Data Management and Interoperability Peter Kunszt (JRA1 DM Cluster) 2 nd EGEE Conference,
The Global Land Cover Facility is sponsored by NASA and the University of Maryland.The GLCF is a founding member of the Federation of Earth Science Information.
INFSO-RI Enabling Grids for E-sciencE A service oriented framework to create, manage and update metadata for earth system science.
EGEE User Forum Data Management session Development of gLite Web Service Based Security Components for the ATLAS Metadata Interface Thomas Doherty GridPP.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid Web Portal for Chemists M. Sterzel,
INFSO-RI Enabling Grids for E-sciencE Intelligent Distributed Data Management in Earth System Science S. Kindermann, DKRZ, Germany.
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
INFSO-RI Enabling Grids for E-sciencE A Grid Approach to Distributed Image Analysis for Early Diagnosis of Alzheimer Disease Livia.
INFSO-RI User Forum 1-3 March 2006 Enabling Grids for E-sciencE Worldwide ozone distribution by using Grid infrastructure ESA: L.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The GILDA t-Infrastructure Roberto Barbera.
Glite. Architecture Applications have access both to Higher-level Grid Services and to Foundation Grid Middleware Higher-Level Grid Services are supposed.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE Site Architecture Resource Center Deployment Considerations MIMOS EGEE Tutorial.
H. Widmann (M&D) Data Discovery and Processing within C3Grid GO-ESSP/LLNL / June, 19 th 2006 / 1 Data Discovery and Basic Processing within the German.
INFSO-RI Enabling Grids for E-sciencE Introduction Data Management Ron Trompert SARA Grid Tutorial, September 2007.
Enabling Grids for E-sciencE EGEE-II INFSO-RI Medical Data Manager 1 Dicom retrieval : overview of the DPM One command line to retrieve a file:
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarksEGEE-III INFSO-RI Astro-Wise and EGEE.
© Geodise Project, University of Southampton, Geodise Middleware Graeme Pound, Gang Xue & Matthew Fairman Summer 2003.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Evaluating Metadata access strategies with.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Introduction to P-GRADE Portal hands-on Miklos Kozlovszky MTA SZTAKI
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Xavier Jeannin (CNRS/UREC Paris, FR) 24.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks A GRID based platform to host multiple repositories.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Practical using WMProxy advanced job submission.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks gLite – UNICORE interoperability Daniel Mallmann.
D.Spiga, L.Servoli, L.Faina INFN & University of Perugia CRAB WorkFlow : CRAB: CMS Remote Analysis Builder A CMS specific tool written in python and developed.
INFSO-RI Enabling Grids for E-sciencE File Transfer Software and Service SC3 Gavin McCance – JRA1 Data Management Cluster Service.
Breaking the frontiers of the Grid R. Graciani EGI TF 2012.
SAM architecture EGEE 07 Service Availability Monitor for the LHC experiments Simone Campana, Alessandro Di Girolamo, Nicolò Magini, Patricia Mendez Lorenzo,
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
DataGrid France 12 Feb – WP9 – n° 1 WP9 Earth Observation Applications.
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
2005 – 06 – - ESSP1 WDC Climate : Web Access to Metadata and Data Frank Toussaint World Data Center for Climate (M&D/MPI-Met, Hamburg)
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI solution for high throughput data analysis Peter Solagna EGI.eu Operations.
GSAF Grid Storage Access Framework
GSAF Grid Storage Access Framework
Leigh Grundhoefer Indiana University
4/5 May 2009 The Palazzo dei Congressi di Stresa Stresa, Italy
gLite The EGEE Middleware Distribution
Presentation transcript:

INFSO-RI Enabling Grids for E-sciencE ESR Database Access K. Ronneberger,DKRZ, Germany H. Schwichtenberg, SCAI, Germany S. Kindermann, DKRZ, Germany J. Kraus, SCAI, Germany J. Biercamp, DKRZ, Germany

Enabling Grids for E-sciencE INFSO-RI ESR Data access- Genf Structure Data Requirements of ESR Example Climate workflow: –Access via Webservice-interface/Amga –Missing pieces –Future challenge Example Satellite Data: –Access via OGSA-DAI –Implementation –Evaluation

Enabling Grids for E-sciencE INFSO-RI ESR Data access- Genf ESR Data Requirements Metadata and data bases are commonly large data sets, handled by different teams. The RDBMS generally used are MySQL, PostgreSQL or Oracle Many databases already exist the aim is the implementation of an interface with EGEE or at least to access a copy of them. If new bases are created on EGEE they need to be accessible outside Grid. Some metadata and data are only accessible to authorized persons. Others available on web site have rules for publications (acknowledgement, co-author). Many queries concern matching in time and/or space, expressed in geographical coordinates.

Enabling Grids for E-sciencE INFSO-RI ESR Data access- Genf Typical climate workflow Collect & Prepare Visualize 4 Analyse Find & Select Distributed Climate Data Model Data Observation Data Analysis Dataset Result Dataset Scenario data 3 2 Data description 1 What is needed A central metadata catalog based on common and standardized metadata schema Uniform data access interfaces with transparent AA policies

Enabling Grids for E-sciencE INFSO-RI ESR Data access- Genf Grid-enabled climate data access EGEE UI CE (1) Find & Select (Amga Java API) Data Resource Metadata C3Grid data interface Metadata Server Climate Data Workspace Webservice Interface (a) Publish (ISO 19115/19139) (c) Request (jdbc or archive) (d) Retrieve and Preprocess (2) Collect & Prepare (webservice request) (b) Harvest (OAI-PMH) AMGA Metadata Catalog SE WN (f) Register & Store data (gLite) (3) Analyse (jdl job) sh LFC Catalog (4) Visualize (grads) (g) Process (cdo-tools) (e) Transfer (gridftp)

Enabling Grids for E-sciencE INFSO-RI ESR Data access- Genf Potential Impact Offering an alternative to current solutions for the daily workflows Additionally a common platform is provided to share data, tools and resources, supporting collaboration The common metadata scheme, based on international standards can be adapted/extended – by other disciplines – by International partners (discussion with NDG (GB) and ESG (USA) are ongoing)

Enabling Grids for E-sciencE INFSO-RI ESR Data access- Genf Next steps Registering of uploaded and processed files in Amga Grid-enabling the remaining data Data Centers Current Volume Grid enabled DKRZ Archive~4 PB~3 TB WDCs (Climate/Mare) ~200 TB~5 TB IFM Geomar~1 TB~500 GB DWD~200 GB The rest is coming soon… FUB~1 TB PIK~700 GB AWI~300 GB DLR~60 GB

Enabling Grids for E-sciencE INFSO-RI ESR Data access- Genf Future challenges Feedback from EGEE to C3 (publish updated metadata of AMGA for the C3 portal) Mapping and interoperability of the AA infrastructures of EGEE, C3 and DBs Direct and transparent transfer of external files to, and registration in EGEE – That is, automatic selection of a close and free SE for storage

Enabling Grids for E-sciencE INFSO-RI ESR Data access- Genf Validation of GOME/ERS experiment with Lidar data The goal is to develop for a specific case a prototype that includes the needed tools: Example: Two different instruments : Ground-based Lidar, spectrometer aboard the satellite, ERS. The satellite data stored by orbit or pixel; different algorithms The Lidar data stored in monthly files with one profile/night

Enabling Grids for E-sciencE INFSO-RI ESR Data access- Genf OGSA-DAI Installed Environment at SCAI SL 4.1 Web-Service Container: Tomcat OGSA-DAI OGSI 6.0 with GLOBUS (TLS by Port 8443) Three different resources today - MySQL MySQL spatial extensions only support convex polygons - PostgreSQL PostGIS (production) PostGIS adds support for geographic objects to Postgres: - Oracle 10g (also for Bio Applications)

Enabling Grids for E-sciencE INFSO-RI ESR Data access- Genf ES meta data clients : query OGSA-DAI Service SE EGEE UI ES meta data client query lfns data X 509 User Proxy use ES meta data client EGEE Job on WN X 509 User Proxy submits lfns use

Enabling Grids for E-sciencE INFSO-RI ESR Data access- Genf ES meta data clients straight forward installation by SCAI no integration fat client on nodes -- only for Authorisation (Globus ) User Authentication - with grid proxy certificates - mapping to db roles for every user

Enabling Grids for E-sciencE INFSO-RI ESR Data access- Genf Evaluation Advantage: access to existing databases - nothing to convert out-of-the box installation easy to extend by own classes “quasi industrial standard” multiple resources with multiple services Disadvantage: not fast scalable over the resources ? not integrated in gLite