iRODS at the ASDC – Performance Results and Lessons Learned

Slides:



Advertisements
Similar presentations
INTRODUCING OLEANDER SOFTWARE SOLUTIONS PVT. LTD.
Advertisements

BEDI -Big Earth Data Initiative
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
Western Regional Biomedical Collaboratory Creating a culture for collaboration.
Introduction and Overview “the grid” – a proposed distributed computing infrastructure for advanced science and engineering. Purpose: grid concept is motivated.
Systems Oceanography: Observing System Design. Why not hard-wire the system? Efficiency of interface management –Hard-wire when component number small,
02/07/2001 EOSDIS Core System (ECS) COTS Lessons Learned Steve Fox
Cloud based linked data platform for Structural Engineering Experiment Xiaohui Zhang
Windows XP Professional Deployment and Support Microsoft IT Shares Its Experiences Published: May 2002 (Revised October 2004)
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Hands-On Microsoft Windows Server 2008 Chapter 8 Managing Windows Server 2008 Network Services.
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
Hands-On Microsoft Windows Server 2008 Chapter 1 Introduction to Windows Server 2008.
Discussion and conclusion The OGC SOS describes a global standard for storing and recalling sensor data and the associated metadata. The standard covers.
Implementation of HUBzero as a Knowledge Management System in a Large Organization HUBBUB Conference 2012 September 24 th, 2012 Gaurav Nanda, Jonathan.
, Data for Disaster Planning, Response, Management and Awareness ASDC Introduction The Atmospheric Science Data Center (ASDC) at NASA Langley Research.
14 Publishing a Web Site Section 14.1 Identify the technical needs of a Web server Evaluate Web hosts Compare and contrast internal and external Web hosting.
Data Merge Examples, Toolsets for Airborne Data (TAD): Customized Data Merging Function ASDC Introduction The Atmospheric Science Data Center (ASDC) at.
Preparing your Fabric & Apps for Windows Server 2003 End of Support Jeff Woolsey Principal Program Manager.
, Increasing Discoverability and Accessibility of NASA Atmospheric Science Data Center (ASDC) Data Products with GIS Technology ASDC Introduction The Atmospheric.
, Implementing GIS for Expanded Data Accessibility and Discoverability ASDC Introduction The Atmospheric Science Data Center (ASDC) at NASA Langley Research.
Using the Open Metadata Registry (openMDR) to create Data Sharing Interfaces October 14 th, 2010 David Ervin & Rakesh Dhaval, Center for IT Innovations.
KM Technology Assessment “Knowledge and team collaboration servers” DSC8030/CIS8260 Dr. Samaddar Summer 2004 Jon A. Preston.
1 A National Virtual Specimen Database for Early Cancer Detection June 26, 2003 Daniel Crichton NASA Jet Propulsion Laboratory Sean Kelly NASA Jet Propulsion.
Production Data Grids SRB - iRODS Storage Resource Broker Reagan W. Moore
Hands-On Microsoft Windows Server Implementing Microsoft Internet Information Services Microsoft Internet Information Services (IIS) –Software included.
, Key Components of a Successful Earth Science Subsetter Architecture ASDC Introduction The Atmospheric Science Data Center (ASDC) at NASA Langley Research.
EDUCAUSE 2005 Annual Conference October 19, 2005.
 PBMA-KMS deployed in March of 2001 is the first fully operational NASA-wide multi-functional Knowledge Management System  Knowledgebase 200+ Best Practices.
Ames Research CenterDivision 1 Information Power Grid (IPG) Overview Anthony Lisotta Computer Sciences Corporation NASA Ames May 2,
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Framework for the Creation of Digital Knowledge Resources to meet the Challenges for Digital Future: A Librarian’s Perspective Dr. Harish Chandra Librarian.
GCRC Meeting 2004 BIRN Coordinating Center Software Development Vicky Rowley.
ASDC Data Distribution Architecture Michael M. Little Ver /26/14.
NeuroLOG ANR-06-TLOG-024 Software technologies for integration of process and data in medical imaging A transitional.
Features Of SQL Server 2000: 1. Internet Integration: SQL Server 2000 works with other products to form a stable and secure data store for internet and.
More Information Working Group Composition End Users Data Modelers Data Analysts Airborne Measurement Scientists Airborne Instrument Scientists Data Management.
The Earth Information Exchange. Portal Structure Portal Functions/Capabilities Portal Content ESIP Portal and Geospatial One-Stop ESIP Portal and NOAA.
ODISEES: A NEW PARADIGM IN DATA ACCESS Atmospheric Science Data Center NASA Langley Research Center Beth Huffer 1 Mike Little 2 John Kusterer 2 Lingua.
Cyberinfrastructure Overview of Demos Townsville, AU 28 – 31 March 2006 CREON/GLEON.
Collection-Based Persistent Archives Arcot Rajasekar, Richard Marciano, Reagan Moore San Diego Supercomputer Center Presented by: Preetham A Gowda.
CEOS Working Group on Information System and Services (WGISS) Data Access Infrastructure and Interoperability Standards Andrew Mitchell - NASA Goddard.
Name - Date Technology-enhanced Learning: tomorrow’s school and beyond Pat Manson Head of Unit Technology Enhanced Learning Directorate General.
Eric Peirano, Ph.D., TECHNOFI, COO
Popular Database Management Systems
Chapter 1 Computer Technology: Your Need to Know
Clouds , Grids and Clusters
RDA US Science workshop Arlington VA, Aug 2014 Cees de Laat with many slides from Ed Seidel/Rob Pennington.
EOSC MODEL Pasquale Pagano CNR - ISTI
Joslynn Lee – Data Science Educator
Cloud based linked data platform for Structural Engineering Experiment
Similarities between Grid-enabled Medical and Engineering Applications
Grid Portal Services IeSE (the Integrated e-Science Environment)
Introduction to Cloud Computing
DIGITAL LIBRARY.
Section 14.1 Section 14.2 Identify the technical needs of a Web server
CS 425/625 Software Engineering Architectural Design
Software Architecture
Geospatial and Problem Specific Semantics Danielle Forsyth, CEO and Co-Founder Thetus Corporation 20 June, 2006.
System Modeling Assessment & Roadmap Joint OMG/INCOSE Working Group
Unit# 5: Internet and Worldwide Web
Technical Capabilities
WGISS Connected Data Assets Oct 24, 2018 Yonsook Enloe
Web Mining Department of Computer Science and Engg.
AGMLAB Information Technologies
About Thetus Thetus develops knowledge discovery and modeling infrastructure software for customers who: Have high value data that does not neatly fit.
Creating a University IT Service Portfolio
The New Internet2 Network: Expected Uses and Application Communities
Data Management Components for a Research Data Archive
Item 2.2 of the agenda IT Working Group meeting 2016
Presentation transcript:

iRODS at the ASDC – Performance Results and Lessons Learned Federation between Atmospheric Science Data Center and NASA Center for Climate Simulation Mike Little, Andrei Vakhnin, Tiffany Mathews, Beth Huffer, & Brandi Quam, NASA Langley Research Center, Hampton, VA Dan Duffy, Scott Sinno, NASA Goddard Spaceflight Center, Greenbelt, MD M.M.Little@nasa.gov, Andrei.A.Vakhnin@nasa.gov, Tiffany.J.Mathews@nasa.gov, Elisabeth.B.Huffer@nasa.gov, Brandi.M.Quam@nasa.gov, Daniel.Q.Duffy@nasa.gov, Scott.S.Sinno@nasa.gov Abstract The Atmospheric Science Data Center (ASDC) is in the process of upgrading mechanisms by which its data can be discovered, accessed and understood. One mechanism which shows particular promise is Integrated Rule-Oriented Data System (iRODS). The ASDC, in conjunction with the NASA Center for Climate Simulation (NCCS), have conducted testing of iRODS as a data discovery and delivery mechanism and has found excellent performance. The ASDC then federated their implementation with the NCCS and established a production-level presence, making all data products available through this mechanism. We present lessons learned from its deployment, including the automation of the population of the directory system (iCAT) from ASDC's ontology. ASDC-NCCS Federation Performance Testing The ASDC, seeking more efficient high performance data delivery tools, experimented with federating iRODS with NCCS and consulted with RENCI, etc.. Testing consisted of functionality checks, and file transfer performance. Client software at the remote NCCS site, connected to their local iRODS server, was able to take advantage of the same functionality as clients directly connected to the ASDC server. Goal #1 The ASDC will strive to expand beyond its existing customer base by increasing accessibility to a broader, worldwide market; through the use of innovative technologies, the ASDC will enhance data access capabilities and develop plans to share data with new user communities. Goal #4 The ASDC will continue to foster innovation by actively assessing emerging technologies and their applicability to existing and projected customer needs and requirements in order to mitigate gaps in capability The 2013 ASDC strategic plan defines six goals that emphasize the vision and support the mission and values of the ASDC. The ASDC’s adoption of iRODS supports two goals: Characteristic Local File System ftp Data Transfer iRODS Data Transfer Latency 120ms 400ms Time to copy 9GB file 2 min 10 min Time to copy 10 9GB files 20 min 40 min iRODS Federation Between ASDC and NCCS ASDC-NCCS Federation Lessons Learned Planning Federation across computer security domains must engage a significant number of infrastructure managers, including local and Agency CIO offices, all the various computer security managers, and local and Agency network managers. Coordinating the infrastructure owners and debugging obstacles was the challenge. A precision ontology, while not essential, made information sharing across knowledge domains far easier than vague metadata that invariably means different things to different communities. A use case to help drive eradication of the obstacles is essential to creating a broadly capable information sharing capability. A local technical expert must be identified to leverage all functionality of iRODS. Implementing It is imperative to ensure infrastructure managers have clarity regarding requirements for their respective components needed to support iRODS federation. iRODS redesign between versions 2.x and 3.x preclude multi-generational federations. Operating and Maintaining Continuous monitoring/evaluating of connectivity is necessary to detect unannounced infrastructure changes. ODISEES Ontology Driven Interactive Search Environment Earth Science Semantic Web Tool iRODS Clients Assimilation & Climate Modeling (Via NCCS) Climate Modeling LIS Modeling Support NCCS NCCS File System Weather Modeling iCAT Rules Engine iRODS 3.3 Center Firewall ASDC-NCCS Federation Conclusions The use of iRODS is a highly effective way to expose information across knowledge domains. It provides a useful interface that can be used by software to access data without creating a local repository. A federation through iRODS is highly sensitive to changes in connectivity, protocol filtering, proxy filters and other interception-type computer security tools. iCAT Rules Engine iRODS 3.3 Internet Center Firewall and Computer Security Appliances ASDC-NCCS Federation Future Work Development of iRODS micro-services to interface ASDC access tools and clients to NCCS data, including the ODISEES client. Identification of other potential collaborators in sharing ASDC data via iRODS. Conversion of ODISEES ontology interface from batch upload to dynamic link to Allegrograph rdf-triple database. Testing of Registered vs. Ingested data products to determine scaling factors. ASDC Support The ASDC’s Data Products Online (DPO) GPFS File system consists of 12 x IBM DC4800 and 6 x IBM DCS3700 Storage subsystems, 144 Intel 2.4 GHz cores, 1,400 TB usable storage. . DPO (Data Products On-line) F S 1 2 3 4 ECS Data Pool Acknowledgements & Resources Thanks to John Kusterer, Phil Webster, Matthew Tisdale, and Al Settell for all their shared knowledge as well as their insights to lessons learned regarding each of the mentioned technologies. Their collaboration helped to make this poster possible. This is not an inclusive list, these are eight featured data products from a list of over forty. Remote Sensing Data Products Resources iRODS: https://www.irods.org ODISEES: Beth Huffer (Developer) ASDC: http://eosweb.larc.nasa.gov Earth’s Surface Earth’s Surface