IST-2006-026409 www.eu-eela.org E-infrastructure shared between Europe and Latin America Climate Application Jose M. Gutierrez Valvanuz Fernandez Antonio.

Slides:



Advertisements
Similar presentations
LEAD Portal: a TeraGrid Gateway and Application Service Architecture Marcus Christie and Suresh Marru Indiana University LEAD Project (
Advertisements

1 NASA CEOP Status & Demo CEOS WGISS-25 Sanya, China February 27, 2008 Yonsook Enloe.
RAMADDA for Big Climate Data Don Murray NOAA/ESRL/PSD and CU-CIRES Boulder/Denver Big Data Meetup - June 18, 2014.
IST E-infrastructure shared between Europe and Latin America Climate Application Final Report Jose M. Gutierrez Valvanuz Fernandez.
1 NODC, Russia GISC & DCPC developers meeting Langen, 29 – 31 March E2EDM technology implementation for WIS GISC development S. Sukhonosov, S. Belov.
Development of a Community Hydrologic Information System Jeffery S. Horsburgh Utah State University David G. Tarboton Utah State University.
E-infrastructure shared between Europe and Latin America 1 EELA is a project funded by the European Union under contract E-Infraestructure.
DataGrid Kimmo Soikkeli Ilkka Sormunen. What is DataGrid? DataGrid is a project that aims to enable access to geographically distributed computing power.
NERC Data Grid Helen Snaith and the NDG consortium …
The International Surface Pressure Databank (ISPD) and Twentieth Century Reanalysis at NCAR Thomas Cram - NCAR, Boulder, CO Gilbert Compo & Chesley McColl.
Активное распределенное хранилище для многомерных массивов Дмитрий Медведев ИКИ РАН.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
TPAC Digital Library Talk Overview Presenter:Glenn Hyland Tasmanian Partnership for Advanced Computing & Australian Antarctic Division Outline: TPAC Overview.
EU 2nd Year Review – Jan – WP9 WP9 Earth Observation Applications Demonstration Pedro Goncalves :
INFSO-RI Enabling Grids for E-sciencE FloodGrid application Ladislav Hluchy, Viet D. Tran Institute of Informatics, SAS Slovakia.
EGU 2011 TIGGE, TIGGE LAM and the GIFS T. Paccagnella (1), D. Richardson (2), D. Schuster(3), R. Swinbank (4), Z. Toth (3), S.
E-science grid facility for Europe and Latin America WAM Final Report Yassine LASSOUED & Ali Al Othman Coastal and Marine Resources Centre.
EARTH SCIENCE MARKUP LANGUAGE “Define Once Use Anywhere” INFORMATION TECHNOLOGY AND SYSTEMS CENTER UNIVERSITY OF ALABAMA IN HUNTSVILLE.
RDFS Rapid Deployment Forecast System Visit at: Registration required.
A Metadata Based Approach For Supporting Subsetting Queries Over Parallel HDF5 Datasets Vignesh Santhanagopalan Graduate Student Department Of CSE.
Unidata TDS Workshop TDS Overview – Part I XX-XX October 2014.
THREDDS Data Server Ethan Davis GEOSS Climate Workshop 23 September 2011.
Ohio State University Department of Computer Science and Engineering 1 Cyberinfrastructure for Coastal Forecasting and Change Analysis Gagan Agrawal Hakan.
ESP workshop, Sept 2003 the Earth System Grid data portal presented by Luca Cinquini (NCAR/SCD/VETS) Acknowledgments: ESG.
EARTH SCIENCE MARKUP LANGUAGE Why do you need it? How can it help you? INFORMATION TECHNOLOGY AND SYSTEMS CENTER UNIVERSITY OF ALABAMA IN HUNTSVILLE.
WRF4G The Weather Research Forecasting model workflow for the GRID Department of Applied Mathematics & Computer Sciences University of.
Grid Technologies  Slide text. What is Grid?  The World Wide Web provides seamless access to information that is stored in many millions of different.
INFSO-RI Enabling Grids for E-sciencE Project Gridification: the UNOSAT experience Patricia Méndez Lorenzo CERN (IT-PSS/ED) CERN,
E-science grid facility for Europe and Latin America Marcelo Risk y Juan Francisco García Eijó Laboratorio de Sistemas Complejos Departamento.
Accomplishments and Remaining Challenges: THREDDS Data Server and Common Data Model Ethan Davis Unidata Policy Committee Meeting May 2011.
The PROGRESS Grid Service Provider Maciej Bogdański Portals & Portlets 2003 Edinburgh, July 14th-17th.
Integrated Grid workflow for mesoscale weather modeling and visualization Zhizhin, M., A. Polyakov, D. Medvedev, A. Poyda, S. Berezin Space Research Institute.
November SC06 Tampa F.Fanzago CRAB a user-friendly tool for CMS distributed analysis Federica Fanzago INFN-PADOVA for CRAB team.
ARGONNE NATIONAL LABORATORY Climate Modeling on the Jazz Linux Cluster at ANL John Taylor Mathematics and Computer Science & Environmental Research Divisions.
E-science grid facility for Europe and Latin America E2GRIS1 Claudio Baeza Retamal and Rodrigo Delgado Urzúa SAEMC Project (
E-science grid facility for Europe and Latin America E2GRIS1 Alina Roig Rassi Maikel Dominguez Garcia CUBAENERGIA Itacuruça (Brazil), 2-15.
E-science grid facility for Europe and Latin America E2GRIS1 Gustavo Miranda Teixeira Ricardo Silva Campos Laboratório de Fisiologia Computacional.
Intergrid KoM Santander 22 june, 2006 E-Infraestructure shared between Europe and Latin America José Manuel Gutiérrez
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America Applications in the EELA Project Rafael.
GO-ESSP Workshop, LLNL, Livermore, CA, Jun 19-21, 2006, Center for ATmosphere sciences and Earthquake Researches Construction of e-science Environment.
Web Portal Design Workshop, Boulder (CO), Jan 2003 Luca Cinquini (NCAR, ESG) The ESG and NCAR Web Portals Luca Cinquini NCAR, ESG Outline: 1.ESG Data Services.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
NIEeS Workshop, Cambridge (UK), Sep 2002 Luca Cinquini for the Earth System Grid METADATA DEVELOPMENT for the EARTH SYSTEM GRID Luca Cinquini (SCD/NCAR)
- Vendredi 27 mars PRODIGUER un nœud de distribution des données CMIP5 GIEC/IPCC Sébastien Denvil Pôle de Modélisation, IPSL.
GEON2 and OpenEarth Framework (OEF) Bradley Wallet School of Geology and Geophysics, University of Oklahoma
May 6, 2002Earth System Grid - Williams The Earth System Grid Presented by Dean N. Williams PI’s: Ian Foster (ANL); Don Middleton (NCAR); and Dean Williams.
From Digital Objects to Content across eInfrastructures Content and Storage Management in gCube Pasquale Pagano CNR –ISTI on behalf of Heiko Schuldt Dept.
Information Technology: GrADS INTEGRATED USER INTERFACE Maps, Charts, Animations Expressions, Functions of Original Variables General slices of { 4D Grids.
Distributed Data Analysis & Dissemination System (D-DADS ) Special Interest Group on Data Integration June 2000.
SCD Research Data Archives; Availability Through the CDP About 500 distinct datasets, 12 TB Diverse in type, size, and format Serving 900 different investigators.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America gLite Information System Claudio Cherubino.
PDAC-10 Middleware Solutions for Data- Intensive (Scientific) Computing on Clouds Gagan Agrawal Ohio State University (Joint Work with Tekin Bicer, David.
GIS for Atmospheric Sciences and Hydrology By David R. Maidment University of Texas at Austin National Center for Atmospheric Research, 6 July 2005.
AHM04: Sep 2004 Nottingham CCLRC e-Science Centre eMinerals: Environment from the Molecular Level Managing simulation data Lisa Blanshard e- Science Data.
On the D4Science Approach Toward AquaMaps Richness Maps Generation Pasquale Pagano - CNR-ISTI Pedro Andrade.
→ MIPRO Conference,Opatija, 31 May -3 June 2005 Grid-based Virtual Organization for Flood Prediction Miroslav Dobrucký Institute of Informatics, SAS Slovakia,
Climate-SDM (1) Climate analysis use case –Described by: Marcia Branstetter Use case description –Data obtained from ESG –Using a sequence steps in analysis,
Developing GRID Applications GRACE Project
5-7 May 2003 SCD Exec_Retr 1 Research Data, May Archive Content New Archive Developments Archive Access and Provision.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
E-science grid facility for Europe and Latin America gRREEMM Report-1 Nov 7, 2008 E2GRIS1 Alina Roig Rassi Maikel Dominguez Garcia CUBAENERGIA.
DataGrid France 12 Feb – WP9 – n° 1 WP9 Earth Observation Applications.
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
SEE-GRID-SCI WRF-ARW model: Grid usage The SEE-GRID-SCI initiative is co-funded by the European Commission under the FP7 Research Infrastructures.
The CUAHSI Hydrologic Information System Spatial Data Publication Platform David Tarboton, Jeff Horsburgh, David Maidment, Dan Ames, Jon Goodall, Richard.
Data Browsing/Mining/Metadata
Flanders Marine Institute (VLIZ)
(WCRP Seasonal Prediction Workshop) Applied Meteorology Group
HAO/SCD: VO, metadata, catalogs, ontologies, querying
Robert Dattore and Steven Worley
Presentation transcript:

IST E-infrastructure shared between Europe and Latin America Climate Application Jose M. Gutierrez Valvanuz Fernandez Antonio S. Cofiño Fernando García Jesús Fernandez Richard Miguel San Martín Mauricio Carillo Gabriela Rosas Amelia Diaz Delia Acuña Rodrigo Abarca Claudio Baeza UC-SpainSENAMHI-PerúUDEC-Chile

IST E-infrastructure shared between Europe and Latin America EGRIS-1, Itacuruçá (Brasil), Enabling grid computing for climate model simulation: Challenges Global circulation models provide a coarse description of the ocean and atmosphere (200km resolution) and have to be linked to regional models to obtain useful representations over areas of interest. CAM and WRF are open-source state of the art global and regional models. They need to be run in cascade: Sea surface temperature CAM WRF output converter NCAR Graphics library Regional models depend on many parameters related to sub-grid physical processes (multi-parametric jobs). CAM + WRF

IST E-infrastructure shared between Europe and Latin America EGRIS-1, Itacuruçá (Brasil), Enabling grid data access to simulations: Challenges Binary files with meteorological formats (netCDF, GRIB, BUFR, HDF5, etc.) need to be partially accessed (e.g. a certain geographical region). THREDDS (Thematic Realtime Environmental Distributed Data Services) project is developing middleware to bridge the gap between data providers and data users. A recent initiative, the Earth System Grid (ESG) project, have made an initial attempt to griddify this technology. To this aim, OpenDAP data servers are included within grid infrastructure and data enters into grid storage elements when it is first requested to OpenDAP servers.

IST E-infrastructure shared between Europe and Latin America EGRIS-1, Itacuruçá (Brasil), Enabling data mining applications on simulations: The high-dimensional character of the data involved in climate simulations requires efficient data mining techniques to extract some useful knowledge. Unsupervised clustering allows partitioning the simulation databases, producing characteristic weather or climate types (or groups) governing the global dynamics. Self-Organizing Maps (SOM) is one of the most popular clustering algorithms, which is especially suitable for high dimensional data visualization and modeling. The weather types can be locally projected to obtain statistical regional forecasts of variables of interest. (Right) Precipitation at two different stations in Peru for a El Niño period. Challenges

IST E-infrastructure shared between Europe and Latin America EGRIS-1, Itacuruçá (Brasil), Climate Cascade Demo Ensemble prediction systems comprise multiple runs of a weather model with slightly different initial conditions and/or model parameterizations. The resulting simulations contain valuable information about the sampled sources of uncertainty. Sea surface temperature CAM WRF (par 1) WRF (par 2) One El Niño year 365 simulations … WRF (par n) … SE SOM Compare the SOM distribution of each parameterization.

IST E-infrastructure shared between Europe and Latin America EGRIS-1, Itacuruçá (Brasil), CAM: Community Atmospheric Model The Community Atmosphere Model (CAM) is the latest in a series of global atmosphere models developed at NCAR for the weather and climate research communities. –grid size: 128 x 64 x 27 (XYZ) = gridpoints –6 output time steps = 197MB NetCDF -> 33MB/tstep –This includes ALL default variables (32x3D + 56x2D) –WRF only requires as input 5x3D and 9x2D (effective MB: 5/step = 620MB/month(6hly input). 720GB per 100 Years –1 day takes 8mins, then 1 Month is 4 hours. 1 Year 48 hours. 10 Years 20 days. 100 Years takes 7 months of computer simmulation. A case study simmulating the climate of the past century It will require a CAM job running 7 months. Then Checkpoints is an important feature.

IST E-infrastructure shared between Europe and Latin America EGRIS-1, Itacuruçá (Brasil), WRF: Weather Research and Forecasting Model The Weather Research and Forecasting (WRF) Model is a next-generation mesocale numerical weather prediction system designed to serve both operational forecasting and atmospheric research needs, developed at NCAR and contributed by the research community. –The current example uses grid dimensions: 74x61x28 (XxYxZ) = gridpoints –Time step: 1.5 min (40 steps/h) –Iberia Peninsula region:  Grid 63x4x31, points,  24h takes 10‘ (Multiple jobs for each CAM run)  5.2MB/tstep. 1.1GB per Month. 1.5TB per 100 years (3hly step) A CAM job will produce multiple WRF jobs during the climate simulation. How these jobs will be triggered?.

IST E-infrastructure shared between Europe and Latin America EGRIS-1, Itacuruçá (Brasil), NetCDF (network Common Data Form) NetCDF (network Common Data Form) is an interface for array-oriented data access and a library that provides an implementation of the interface. The netCDF library also defines a machine-independent format for representing scientific data. Together, the interface, library, and format support the creation, access, and sharing of scientific data. CAM netCDF datasets will be accessed from WRF simulations, but CAM data is Global and WRF will need only to access to a subregion.

IST E-infrastructure shared between Europe and Latin America EGRIS-1, Itacuruçá (Brasil), NcML: The netCDF Markup Language Metadata extraction from NetCDF datasets describing the contents of the dataset. All dataset generated will suitable for searching and retrieval, helping to the scientist querying to the GRID about past an ongoing simulations.

IST E-infrastructure shared between Europe and Latin America EGRIS-1, Itacuruçá (Brasil), CAM + WRF in the GRID Working in the grid Working in local Not working, yet Cam2wrfWRFSIWRFGraphics netCDF Catalog (LFC) MetaCatalog (AMGA) netCDF XML CAM CAM and WRF are running in the EELA testbed like separated process. Both produce and consume datasets from the file catalog in NetCDF. Metadata from datasets is generated in XML and it is processed to be inserted in AMGA.

IST E-infrastructure shared between Europe and Latin America EGRIS-1, Itacuruçá (Brasil), Structure of the current status There are a "static" repository in the file catalog: and another one with the updated modules: following structure will be created to run the CAM+WRF suite in WN: The output is the tar-ed 'output' directory stored in

IST E-infrastructure shared between Europe and Latin America EGRIS-1, Itacuruçá (Brasil), Job Description JDL file Shell Script executed by WN

IST E-infrastructure shared between Europe and Latin America EGRIS-1, Itacuruçá (Brasil), What is expected from EGRIS DAGs and Checkpointable job submission. –Restart of jobs with dependencies. Using metadata catalog from worker nodes: –Loading metadata with AMGA API from WN. –Integration of the metadata catalogs and datasets catalogue Data access protocol to datasets. –OpenDAP service in the Storage Element. Development of a portal for job submission and monitoring: –Authentication management from portal –Monitoring status of jobs. –Retrieval of information from metadata catalog