The Earth System Grid (ESG) Computer Science and Technologies DOE SciDAC ESG Project Review Argonne National Laboratory, Illinois May 8-9, 2003.


May 8, 2003 | Earth System Grid | 2
Computer Science Perspective: Why is ESG Important?
Application needs help formulate new frameworks and information technologies
  – Scientific apps are a good indicator of future trends
  – The climate community is a leading IT consumer
Experimentation is key to (computer) science
  – Needs robust instantiation of new technology
  – Needs an engaged community of consumers
Multidisciplinary (intra-CS and CS-apps) teams are key to IT advances

We’re Particularly Interested in the Following Aspects of ESG
“Enable [a community of] researchers to understand and make effective use of large, distributed climate datasets”
  – Dataset federation: physical and semantic
  – Security: who can do how much of what
  – Efficient analysis: distribution and placement of computation and data
Within the context of real data centers, real data, real analyses, and real users
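
The security aspect named on this slide, "who can do how much of what", can be pictured as a small policy check. The sketch below is only an illustration under stated assumptions: the `Grant` record, the `POLICY` table, the user and dataset names, and the gigabyte limit are all hypothetical, and this is not the interface of the Community Authorization Service or any other Globus component.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Grant:
    """One hypothetical community policy entry."""
    who: str       # who: a community member or group
    action: str    # what: the operation, e.g. "read"
    resource: str  # what: the dataset the operation applies to
    max_gb: float  # how much: upper bound on the request size


# Hypothetical policy table; all names and limits are invented for illustration.
POLICY = [
    Grant(who="analyst", action="read", resource="dataset-a", max_gb=50.0),
]


def is_allowed(who, action, resource, requested_gb, policy=POLICY):
    """Return True if some grant covers this user, action, resource, and size."""
    return any(
        g.who == who
        and g.action == action
        and g.resource == resource
        and requested_gb <= g.max_gb
        for g in policy
    )
```

A request is accepted only when all three parts match a grant: the requester, the operation on a given dataset, and the amount requested.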

The Computer Science Team
ESG engages CS people at every institution
Four groups act as Grid technology providers
  – Argonne National Laboratory (Globus Toolkit, etc.)
  – Lawrence Berkeley National Laboratory (SRM)
  – USC Information Sciences Institute (Globus Toolkit, etc.)
  – Oak Ridge National Laboratory (monitoring)
Two groups act as climate data analysis tech providers
  – NCAR (data delivery and analysis software)
  – PCMDI (data delivery and analysis software)
Integration, application, experimentation are highly collaborative activities

Our Hammers …
Storage Resource Managers, Multiple File Transfer service
Grid Security Infrastructure, Community Authorization Service
GRAM Job Management
GridFTP data movement, Reliable File Transfer service
Metadata and replica management
Monitoring technologies

ESG CS Mission
Work closely with application groups to
  – Integrate these (and other) components to provide end-to-end application solutions
  – Identify and, if possible, develop missing pieces
  – Evaluate what happens when real users apply our “solution” at scale
Iterate to improve both Grid technologies and climate solutions

Contribution to IT
New functionality: New features have been added to Globus Toolkit & SRM to meet climate community needs
Robustness: “Production” deployment exposed limitations (functional and/or scale) and bugs in software tools
Research: ESG requirements and the limitations exposed by deployment triggered new IT research directions

ESG Achievements
Real value has been delivered to users
  – Mike Wehner, LLNL: “This has changed my life”
Significant buy-in from climate scientists
Middleware is more robust & easier to use
Real interdisciplinary CS-climate scientist teams established
National and international visibility for, and interest in, our work

Observations
We are building a middleware and people infrastructure without long-term commitment
  – How do we persuade the community to engage?
Scope of the demand for ESG solutions is enormous; we can easily be overwhelmed
  – What is needed is an international environmental sciences Grid
  – How can ESG contribute to its realization, via leadership and technology development?

The Earth System Grid (ESG) Architecture DOE SciDAC ESG Project Review Argonne National Laboratory, Illinois May 8-9, 2003


ESG Architecture
[Diagram; only its labels survive] Application; Attribute Specification; Metadata Catalog; Logical Collection and Logical File Name; Replica Catalog; Multiple Locations; Replica Selection; MDS; NWS; Performance Information and Predictions; Selected Replica; gsiftp commands; Replica Location 1, Replica Location 2, Replica Location 3; Tape Library; Disk Cache; Disk Array; Disk Cache
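
The labels on this slide suggest a lookup chain: an attribute specification is resolved to a logical file name, the logical name to multiple physical locations, and one replica is selected using performance predictions. The sketch below is a toy illustration of that chain under stated assumptions. The order of the steps is inferred from the labels, and the in-memory catalogs, host names, file names, and bandwidth figures are all hypothetical; none of this is the API of the Globus Toolkit, MDS, or NWS.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Replica:
    url: str               # physical location of one copy (hypothetical gsiftp URL)
    predicted_mbps: float  # hypothetical bandwidth forecast, standing in for NWS predictions


# Hypothetical metadata catalog: attribute specification -> logical file name.
METADATA_CATALOG = {
    frozenset({"model": "example-model", "variable": "tas"}.items()): "lfn://example/tas.nc",
}

# Hypothetical replica catalog: logical file name -> multiple physical locations.
REPLICA_CATALOG = {
    "lfn://example/tas.nc": [
        Replica("gsiftp://site-a.example.org/data/tas.nc", 40.0),
        Replica("gsiftp://site-b.example.org/data/tas.nc", 95.0),
        Replica("gsiftp://site-c.example.org/data/tas.nc", 10.0),
    ],
}


def find_logical_name(**attributes):
    """Step 1: resolve an attribute specification to a logical file name."""
    return METADATA_CATALOG[frozenset(attributes.items())]


def select_replica(logical_name):
    """Steps 2 and 3: list all locations, then pick the best predicted bandwidth."""
    replicas = REPLICA_CATALOG[logical_name]
    return max(replicas, key=lambda r: r.predicted_mbps)


def resolve(**attributes):
    """Run the whole chain: attributes -> logical name -> selected replica."""
    return select_replica(find_logical_name(**attributes))
```

For example, `resolve(model="example-model", variable="tas")` returns the replica at the hypothetical `site-b` host, because it has the highest predicted bandwidth in the table. In a real deployment the selected URL would then be handed to a transfer tool; that step is left out here.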

ESG Architecture
[Diagram; only its labels survive] The replica-selection labels (Metadata Catalog, Replica Catalog, Replica Selection, MDS, NWS, Replica Locations 1 to 3, and their storage systems) appear again, together with a layered view:
Users: Generic U.S. Users; CDAT Users; Ferret Users; U.K. Users; Climate Community; Commercial Users; Community Outreach; University Users; Sponsors
Applications and portals: User Adm.; Portals; Applications; Generic Apps
Grid Application Toolkit (Middleware): Remote Data Toolkit; Remote Calc. Toolkit; Remote Viz Toolkit
Grid Infrastructure: Brokers; Info; Schedule; Data; Monitor; Security
Networks: ESG Grid; U.K. NERC DataGrid; CEOS Grid; Other Grids

The Earth System Grid
[Diagram; only its labels survive]
Sites: NCAR; LBNL; LLNL; ISI; ANL; ORNL
SECURITY services: GSI; CAS server; CAS client; MyProxy client; MyProxy server
FRAMEWORK services: TOMCAT; AXIS
METADATA services: Auth metadata; RLS; THREDDS catalogs; OGSA-DAIS; MCS; mySQL; xindice
TRANSPORT services: gridFTP server/client; TRM+DRM; DRM; openDAPg server
ANALYSIS & VIZ services: NCL openDAPg client; LAS server; CDAT openDAPg client
MONITORING services: SLAMON daemon
DATA storage: NCAR MSS; ORNL HPSS; NERSC HPSS; DISK

Distributed Data Access Protocols
[Diagram; only its labels survive]
Typical Application: Data (local); netCDF lib; Application
Distributed Application: Data (remote); OPeNDAP Server; OPeNDAP via http; OPeNDAP Client; Application
Gridded Application: Big Data (remote); ESG Server; OPeNDAP via Grid; ESG + DODS; ESG client; Application
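
The three access scenarios on this slide can be read as a choice of data path that depends on where the data lives. The sketch below illustrates that choice only. Mapping URL schemes to the three paths is an assumption made for the example (the slide does not say how a client decides), the function name is hypothetical, and no netCDF, OPeNDAP, or ESG library is actually called.

```python
from urllib.parse import urlparse


def choose_access_path(location: str) -> str:
    """Pick one of the slide's three access paths from a dataset location.

    Assumption for illustration: the URL scheme alone decides the path.
    """
    scheme = urlparse(location).scheme
    if scheme in ("", "file"):
        # Typical application: local data read through the netCDF library.
        return "netCDF library (local data)"
    if scheme in ("http", "https"):
        # Distributed application: remote data through an OPeNDAP client over HTTP.
        return "OPeNDAP client over HTTP (remote data)"
    if scheme == "gsiftp":
        # Gridded application: big remote data through the ESG client over the Grid.
        return "ESG client over Grid protocols (big remote data)"
    raise ValueError(f"no access path known for scheme {scheme!r}")
```

With this assumption, a plain path such as `/data/tas.nc` selects the local netCDF path, an `http://` URL selects OPeNDAP over HTTP, and a `gsiftp://` URL selects the Grid path.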

Additional Scenarios
[Diagram; only its labels survive]
Data Movement: Data (local); Data (remote); Big Data (remote); ESG Server
Distributed Analysis: Big Data (remote); ESG Server; OPeNDAP via Grid; ESG + DODS; ESG client; Application

ESG: Collaboration Network
[Diagram; only its labels survive]
Grid and Network Infrastructure
Grid-enabled storage systems
Computational resources
ESG services: information (?), replica (R), metadata (M), community authorization (CAS)
Data consumers
Data producers