Space Science and Engineering Center University of Wisconsin-Madison Space Science and Engineering Center University of Wisconsin-Madison 1 NPP Atmosphere.

Slides:



Advertisements
Similar presentations
Introduction to Maven 2.0 An open source build tool for Enterprise Java projects Mahen Goonewardene.
Advertisements

Testing and Quality Assurance
Software Modeling SWE5441 Lecture 3 Eng. Mohammed Timraz
Validata Release Coordinator Accelerated application delivery through automated end-to-end release management.
“GENERIC SCRIPT” Everything can be automated, even automation process itself. “GENERIC SCRIPT” Everything can be automated, even automation process itself.
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
UNIT-V The MVC architecture and Struts Framework.
Tracking Services for ANY websites and web applications Zhu Xiong CSE 403 LCO.
NASA Goddard Space Flight Center Direct Readout Laboratory NPP/JPSS HRD/LRD Status Patrick Coronado NASA Goddard Space Flight Center directreadout.sci.gsfc.nasa.gov/ipopp.
CSCI ClearQuest 1 Rational ClearQuest Michel Izygon - Jim Helm.
The SAM-Grid Fabric Services Gabriele Garzoglio (for the SAM-Grid team) Computing Division Fermilab.
The MODIS online archive and on-demand processing Edward Masuoka NASA Goddard Space Flight Center, Greenbelt, MD, USA.
QWise software engineering – refactored! Testing, testing A first-look at the new testing capabilities in Visual Studio 2010 Mathias Olausson.
Rational Unified Process Fundamentals Module 4: Disciplines II.
The Pipeline Processing Framework LSST Applications Meeting IPAC Feb. 19, 2008 Raymond Plante National Center for Supercomputing Applications.
Introduction to Apache OODT Yang Li Mar 9, What is OODT Object Oriented Data Technology Science data management Archiving Systems that span scientific.
EGEE is a project funded by the European Union under contract IST Testing processes Leanne Guy Testing activity manager JRA1 All hands meeting,
LCG Middleware Testing in 2005 and Future Plans E.Slabospitskaya, IHEP, Russia CERN-Russia Joint Working Group on LHC Computing March, 6, 2006.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,
QUALITY ASSURANCE PRACTICES. Quality Plan Prepared and approved at the beginning of project Soft filing system approach followed. Filing location – –
And Tier 3 monitoring Tier 3 Ivan Kadochnikov LIT JINR
CLASS Information Management Presented at NOAATECH Conference 2006 Presented by Pat Schafer (CLASS-WV Development Lead)
Space Science and Engineering Center University of Wisconsin-Madison Space Science and Engineering Center University of Wisconsin-Madison 1 NPP Atmosphere.
User Working Group 2013 Data Access Mechanisms – Status 12 March 2013
August 2003 At A Glance The IRC is a platform independent, extensible, and adaptive framework that provides robust, interactive, and distributed control.
System/SDWG Update Management Council Face-to-Face Flagstaff, AZ August 22-23, 2011 Sean Hardman.
Page 1 Land PEATE support for CERES processing Ed Masuoka Gang Ye August 26, 2008 CERES Delta Design Review.
Evolution of the JPSS Ground Project Calibration and Validation System Patrick Purcell, Gyanesh Chander and Peyush Jain JPSS Ground Project NASA, GSFC.
Miron Livny Computer Sciences Department University of Wisconsin-Madison Condor and (the) Grid (one of.
Selenium server By, Kartikeya Rastogi Mayur Sapre Mosheca. R
Aura Science Meeting Data Systems Working Group HIRDLS SIPS Status October 1, 2007 Vince Dean, Brendan Torpy, Greg Young Univ. of Colorado, Boulder Cheryl.
Space Science and Engineering Center University of Wisconsin-Madison Space Science and Engineering Center University of Wisconsin-Madison 1 NPP Atmosphere.
Physical Oceanography Distributed Active Archive Center THUANG June 9-13, 20089th GHRSST-PP Science Team Meeting GHRSST GDAC and EOSDIS PO.DAAC.
V7 Foundation Series Vignette Education Services.
Wednesday NI Vision Sessions
Managing multiple projects or services? Have a mix of Microsoft Project and more simple tasks? Need better visibility and control?
Review of PARK Reflectometry Group 10/31/2007. Outline Goal Hardware target Software infrastructure PARK organization Use cases Park Components. GUI /
A Solution for Maintaining File Integrity within an Online Data Archive Dan Scholes PDS Geosciences Node Washington University 1.
Applied Software Project Management SOFTWARE TESTING Applied Software Project Management 1.
Petr Škoda, Jakub Koza Astronomical Institute Academy of Sciences
Architecture Review 10/11/2004
HST and JWST Pipelines and Reference Files
Applied Software Testing
Software Configuration Management
Simulation Production System
NPP Atmosphere PEATE Development
Shared Services with Spotfire
Self Healing and Dynamic Construction Framework:
MODIS SST Processing and Support for GHRSST at OBPG
Testing for patch certification
T-StoRM: a StoRM testing framework
Deploying and Configuring SSIS Packages
Leanne Guy EGEE JRA1 Test Team Manager
Management of Virtual Execution Environments 3 June 2008
Applied Software Implementation & Testing
X in [Integration, Delivery, Deployment]
DUCKS – Distributed User-mode Chirp-Knowledgeable Server
Rui Wu, Jose Painumkal, Sergiu M. Dascalu, Frederick C. Harris, Jr
Lecture 1: Multi-tier Architecture Overview
Module 01 ETICS Overview ETICS Online Tutorials
Software models - Software Architecture Design Patterns
Course: Module: Lesson # & Name Instructional Material 1 of 32 Lesson Delivery Mode: Lesson Duration: Document Name: 1. Professional Diploma in ERP Systems.
SUSE Linux Enterprise Desktop Administration
Dtk-tools Benoit Raybaud, Research Software Manager.
Test Cases, Test Suites and Test Case management systems
Overview Activities from additional UP disciplines are needed to bring a system into being Implementation Testing Deployment Configuration and change management.
DBOS DecisionBrain Optimization Server
Software Development Process Using UML Recap
Securing HTCondor Flocking
Presentation transcript:

Space Science and Engineering Center University of Wisconsin-Madison Space Science and Engineering Center University of Wisconsin-Madison 1 NPP Atmosphere PEATE Atmosphere PEATE Team Space Science and Engineering Center University of Wisconsin-Madison 10 July 2008 Climate Data Processing Made Easy Scott Mindock

Space Science and Engineering Center University of Wisconsin-Madison Space Science and Engineering Center University of Wisconsin-Madison 2 Space Science Engineering Center (SSEC) The NPP Atmosphere PEATE is implemented within the framework and facilities of the Space Science and Engineering Center (SSEC) at the University of Wisconsin-Madison. SSEC has been successfully supporting operational, satellite-based remote- sensing missions since 1967, and its capabilities continue to evolve and expand to meet the demands and challenges of future missions. 1. Employs ~ 250 scientists, engineers, programmers, administrators and IT support staff. 2. Satellite missions currently supported: GEO: GOES 10/11/12/R; Meteosat 7/9; MTAT-1R; FY 2C/2D; Kalpana LEO: NOAA 15/16/17/18, Terra, Aqua, NPP, NPOESS, FY 3, MetOp

Space Science and Engineering Center University of Wisconsin-Madison Space Science and Engineering Center University of Wisconsin-Madison 3 Atmosphere PEATE is funded under NASA Grant NNG05GN47A Award Date: 10/07/2005 Grant Period: 08/15/2005 to 8/14/2008 (renewal in progress) Related Work at SSEC: CrIS SDR Cal/Val and Characterization (Revercomb, IPO) VIIRS SDR and Cloud Cal/Val (Menzel, IPO) VIIRS Algorithm Assessment (Heidinger, IPO) International Polar Orbiter Processing Package (Huang, IPO) VIIRS Instrument Characterization (Moeller, NASA) Funding and Related Work

Space Science and Engineering Center University of Wisconsin-Madison Space Science and Engineering Center University of Wisconsin-Madison 4 Creating Climate Data Products (CDR) is hard! Products track global trends Calibration must be accurate. (No calibration artifacts) Algorithms must be fully verified with global data (No regional artifacts) Data sets are large and hard to manage Developing the CDRs is an iterative process Large processing clusters are required Programming requires different skill set Distributed systems hard to test On going process Requirements change Technology changes Staff changes

Space Science and Engineering Center University of Wisconsin-Madison Space Science and Engineering Center University of Wisconsin-Madison 5 The process requires multiple computing systems Single machine can be used for initial development but cluster computing needed to verify performance over full globe.

Space Science and Engineering Center University of Wisconsin-Madison Space Science and Engineering Center University of Wisconsin-Madison 6 CDR development is an iterative process Initial development occurs on single machine Product verification requires data sets of increasing size Increasing data set size increase computation time

Space Science and Engineering Center University of Wisconsin-Madison Space Science and Engineering Center University of Wisconsin-Madison 7 Strategies of processing simplification Reduce or remove the “Move to Cluster” step Make executions environments similar Make data access patterns similar Results in faster iterations

Space Science and Engineering Center University of Wisconsin-Madison Space Science and Engineering Center University of Wisconsin-Madison 8 Use well defined interfaces between subsystems Decouples systems which reduces learning curve Allows evolution of subsystems Simplifies test and verification of software Create configuration driven subsystems Simplifies deployment of subsystems Allows operations to modify system behavior Leverage automated testing technologies Reduces learning curve Provides continuous test coverage Captures requirements in executable form Strategies for managing processing system

Space Science and Engineering Center University of Wisconsin-Madison Space Science and Engineering Center University of Wisconsin-Madison 9 Ingest : ING Brings data into the Atmosphere PEATE Supports FTP, HTTP and RSYNC Data Management System : DMS Stores data in the form of files. Provides a Web Service to locate, store and retrieve files. Computational Resource Grid : CRG Provides Web Service to locate, store and retrieve jobs Algorithm : ALG Consumes jobs Runs algorithms in form of binaries Algorithm Rule Manager: ARM Combines data with algorithms to produce jobs Provides Web Service interface to locate, store and retrieve rules The system: Atmosphere PEATE

Space Science and Engineering Center University of Wisconsin-Madison Space Science and Engineering Center University of Wisconsin-Madison 10 Configuration File Allows operations to add new sites Allows operations to maintain existing sites Customization allowed in form of scripts (BASH,PYTHON) QC Quick Look Metadata extraction Notices missing or late data ING: Ingest, bring data into system

Space Science and Engineering Center University of Wisconsin-Madison Space Science and Engineering Center University of Wisconsin-Madison 11 Relives Scientist of having to manage data. Simple put and get functionality Configuration file Specify fileservers and directories Operations can Add/Remove fileservers DMS: Stores Data and Products File system - hold files Database - holds file information Public Access - DMS interface Worker - manages file system

Space Science and Engineering Center University of Wisconsin-Madison Space Science and Engineering Center University of Wisconsin-Madison 12 Provides well-defined interface deployed as a web service Accepts job requests Provides Job Status Monitors Job State Allows processing nodes to be added or removed from system CRG : Provide nodes with jobs

Space Science and Engineering Center University of Wisconsin-Madison Space Science and Engineering Center University of Wisconsin-Madison 13 Recreates development environment Retrieves data from DMS Retrieves and runs software packages Saves results to DMS, includes products, stdout and stderr AlgHost: Runs software the produces products

Space Science and Engineering Center University of Wisconsin-Madison Space Science and Engineering Center University of Wisconsin-Madison 14 Cluster executes bash script Script is passed arguments Software Package Directory Working / Output directory Static Ancillary Directory Dynamic Ancillary Directory Inputs files Outputs files Software Package is called from the script Results are stored by the process that started script. Algorithm Script Structure

Space Science and Engineering Center University of Wisconsin-Madison Space Science and Engineering Center University of Wisconsin-Madison 15 Provides well-defined interface deployed as a web service. Assigns jobs to CRG Monitors data in DMS Monitors the status of jobs in CRG Production rules can be added or removed dynamically by operations Volatile logic lives here ARM: Bind data to software packages

Space Science and Engineering Center University of Wisconsin-Madison Space Science and Engineering Center University of Wisconsin-Madison 16 Use well defined interfaces between subsystems Decouples systems which reduces learning curve Allows evolution of subsystems Simplifies test and verification of software Create configuration driven subsystems Simplifies deployment of subsystems Allows operations to modify system behavior Leverage automated testing technologies Reduces learning curve Provides continuous test coverage Captures requirements in executable form Strategies for managing processing system (revisited)

Space Science and Engineering Center University of Wisconsin-Madison Space Science and Engineering Center University of Wisconsin-Madison 17 Development Process: Spiral method Design Implement Test Deploy Build = Deploy to Operations

Space Science and Engineering Center University of Wisconsin-Madison Space Science and Engineering Center University of Wisconsin-Madison 18 Employ standard software industry practices Automate with ANT, Make like, XML based Test with JUNIT, Java Unit Test Increases system quality Tests are reproducible Tests are run more often than they would be if they were manual Tests are improved over time Tests are configurable We don’t just build, the process includes testing and verification Testing Strategy

Space Science and Engineering Center University of Wisconsin-Madison Space Science and Engineering Center University of Wisconsin-Madison 19 Builds system Tests subsystems Tests scenarios Updates repositories Logs results Scenarios demonstrate requirements Nightly Build

Space Science and Engineering Center University of Wisconsin-Madison Space Science and Engineering Center University of Wisconsin-Madison 20 May use internal knowledge interfaces for testing Test and exercise public interfaces Stress test interfaces Evolve to test and verify bugs Fixed defects have specific tests added Tests run in nightly build Tests verify release Layered approach to testing Everything tested, Every Night Unit and Regression Testing

Space Science and Engineering Center University of Wisconsin-Madison Space Science and Engineering Center University of Wisconsin-Madison 21 Test ingest function Test forward and redo functions Reflect CDR development process Testing Scenarios (1 of 2)

Space Science and Engineering Center University of Wisconsin-Madison Space Science and Engineering Center University of Wisconsin-Madison 22 Documents doc - Level 4 requirements doc - Operations Concepts Test plans are implemented as scenario tests Tests correspond to Use Cases outlined in OpsCon At least one test for each requirement set Successful completion of test verifies requirements by demonstration Factors that determine success Generation of expected products Ability to track product heritage Ability to reproduce results Ability to uniquely identify products Test Scenarios (2 of 2)

Space Science and Engineering Center University of Wisconsin-Madison Space Science and Engineering Center University of Wisconsin-Madison 23 Conclusion: Climate Data Processing Is Easy Ingest system makes it easy to add and manage data sources Operators can control system Operator can monitor system The DMS makes it easy to maintain large data sets Scientists can find data Operators can add and remove servers Operators can add and remove sites The CRG and AlgHost make it easy to transfer CDR production the development to the cluster environment You still have to get the product correct!