Status Report of EDI on the CAA

Slides:



Advertisements
Similar presentations
Configuration management
Advertisements

Configuration management
Week 6: Chapter 6 Agenda Automation of SQL Server tasks using: SQL Server Agent Scheduling Scripting Technologies.
Title: ICON L0 from MOC to SOC automation Author: MOC/SOC Resides: SOC Description: This is a script that copies the ICON L0 files from the MOC to the.
Backup and Recovery Part 1.
CFT Offline Monitoring Michael Friedman. Contents Procedure  About the executable  Notes on how to run Results  What output there is and how to access.
Data Management Subsystem: Data Processing, Calibration and Archive Systems for JWST with implications for HST Gretchen Greene & Perry Greenfield.
11 14th CAA Cross-Calibration meeting, York, 5-7 Oct 2011 STAFF CAA products & Cross-Calibration activities Patrick ROBERT & STAFF Team 5) STAFF-SC CWF.
1 14th CAA Cross-Calibration meeting, York, 5-7 Oct 2011 STAFF CAA products & Cross-Calibration activities Patrick ROBERT & STAFF Team 5) STAFF-SC CWF.
CAA/CFA Meeting | CFA Team | ESAC | Octiber CFA Under Development CAA/CFA Meeting ESAC, Oct 11 th 2011 European Space AgencyCFA Team.
WHISPER action items Gábor Facskó, Jean-Gabriel Trotignon,Séna Kougblénou, Xavier Vallières, Guillaume Lointier LPC 2 E/CNRS, Orléans, France 10th CAA.
Page 1 of 13 Beginner’s Tutorial – The Monalog Sanitizer What data does Monalog collect from you?  Monalog collects what you type on the command line.
CAA Database Overview Sinéad McCaffrey. Metadata ObservatoryExperiment Instrument Mission Dataset File.
Status of EDI Archiving March, 2015 Leiden, Netherlands.
1 CAA 2009 Cross Cal 9, Jesus College, Cambridge, UK, March 2009 Caveats, Versions, Quality and Documentation Specification Chris Perry.
Cluster Active Archive Status of DWP Data Activities Simon Walker, Keith Yearby, Michael Balikhin Automatic Control and Systems Engineering, University.
FGM Report: Extended Mode Processing 22 nd Cross Calibration Workshop Chris Carr, Patrick Brown, Leah-Nani Alconcel, Tim Oddy, Peter Fox, Cary Colgan (summer.
Cluster Active Archive Status of DWP Data Activities Simon Walker, Keith Yearby, Michael Balikhin Automatic Control and Systems Engineering, University.
CIS Action Items 10 th Cross-Calibration Workshop Observatoire de Paris, Nov
1 CAA 21 th Cross Calibration Meeting, Leiden, th Mar 2015 CAA Cross Cal Meeting Mar 2015 Automated Pipeline Processing.
1 CAA 20 th Cross Calibration Meeting, MPS, Gottingen 16th Oct 2014 CAA Cross Cal Meeting Oct 2014 Pipeline Automation Chris Perry.
DAA Status/Progress B. Mihaljčić, A. Fazakerley, N. Doss, G. Watson UCL Department of Space and Climate Physics Mullard Space Science Laboratory 18 th.
8 th Cross-Calibration Workshop, Kinsale, Ireland, October 20081/14 Draft Replies and Actions to the Recommendations of the Review Panel for Final.
Emdeon Office Batch Management Services This document provides detailed information on Batch Import Services and other Batch features.
FGM Report 21 st Cross Calibration Workshop Chris Carr, Patrick Brown, Leah-Nani Alconcel, Tim Oddy, Peter Fox Imperial College London 24 March 2015.
14 th CAA Cross-calibration Workshop CIS archiving activities York October 5-7, 2011.
CAA inventory plots and graphical products overview Delphine Herment (ESTEC) ESAC - 17/03/2011.
20 th CAA Cross-Calibration Workshop MPS, Göttingen, Germany Oct CAA Dataset Inventory.
Gennia Michlin, Clinical Data Management Systems (CDMS) Project Leader Mar 2010 New RDC features training.
SOFTWARE TESTING TRAINING TOOLS SUPPORT FOR SOFTWARE TESTING Chapter 6 immaculateres 1.
Innotas Reports, Dashboards, and Filters
Product Training Program
Architecture Review 10/11/2004
CAA Operational Review 9 Double Star PEACE Team Report
Data Standards for Pharmacometric Analysis Data Sets
HORIZONT TWS/WebAdmin DS TWS/WebAdmin DS Tips & Tricks
ASPOC Presentation for the CAA Operations Review-1 Klaus Torkar and Harald Jeszenszky IWF/OAW Graz ESTEC, May 2006.
Cluster Active Archive – Wideband data BM2 mode
CSA Implementation overview since OR-8 Test campaigns Scheduled development plan Arnaud Masson CSA/CAA Operational Review 9 ESTEC, 04-June-2014.
20th CAA/DAA Cross Calibration Meeting
EDI – CAA STATUS REPORT EDI CAA Status
CAA-OR (End of Phase 1) CAA DWP Operations Review
H. Rème, I.Dandouras and A. Barthe IRAP, Toulouse, France
Configuration Management and Prince2
Annual Report of the DWP Experiment 9th CAA Operations Review
Status Report of EDI on the CAA
CAA Action Items Investigations PEACE Progress Meeting
10th CAA Operations Review Annual Report of the FGM Experiment
Software Documentation
PIC + TransNet.
Post Enumeration Survey Census
DepEd e-FORMS Automated Form Templates in Excel for Elementary and High School Alfonso C. Corpuz, Physics Teacher September 10, :00 pm.
Key points.
NIMAC for Publishers & Vendors: Delivering Files
Key points.
Document Custodian of the Drop Safe Log
Updating GML datasets S-100 WG TSM September 2017
Operational Dataset Update Functionality Included in the NCAR Research Data Archive Management System Zaihua Ji Doug Schuster Steven Worley Computational.
Data Upload & Management
Distraction Tool.
TRAINING OF FOCAL POINTS on the CountrySTAT SYSTEM based on FENIX
Why Background Processing?
Innotas Reports, Dashboards, and Filters
Problem Statement and Significance
Overview of Workflows: Why Use Them?
CHAPTER 6 ELECTRONIC DATA PROCESSING SYSTEMS
M. Kezunovic (P.I.) S. S. Luo D. Ristanovic Texas A&M University
Data Quality 2 (DQ2) & Staff Reporting Webinar
Purchase Document Management
Neonatal Workload Tool standard BOXI reports
Presentation transcript:

Status Report of EDI on the CAA

CAA Public website All pages fully accessible by anyone

CAA Test Area Currently services accessible only by ESTEC/ESAC Individual instrument team members will be allowed on a need basis Some of the CAA Command-line services are available to all

EDI Datasets for Downloading

CAA EDI Graphics

CAA EDI Graphics

Dataset Inventory Analysis There are several similar datasets where the main difference is that data vectors or matrices are converted into another system. For instance, scientific data are given in different scientific units or coordinate systems Consistence of such datasets have never been investigated although there is some evidence that some errors can occur Such similar datasets can often be expected to have the same number of records Exception: if a dataset is given in raw units, the corresponding datasets in scientific units may have less records if poor records were deleted However, if poor data have been replaced with FILLVAL, the same number of records should exist In addition such datasets should have similar values for Version numbers Generation and ingestion dates Note: these metadata can have significant differences, so there is no automatic CAA has developed a tool that collects such metadata and gives them in a text file

Inventory Output The beta version of the inventory tool is available (currently) at http://caa.estec.esa.int/caa_stage/st-tk_inventory.xml When the tool is ready, inventory is executed at TBD frequency (only for the newly ingested files)

Example: RAPID - ESPCT6 Inventory analysis for C1 "Electron, omni-directional distribution" (C1_CP_RAP_ESPCT6). Compared datasets: 1: C1_CP_RAP_ESPCT6 2: C1_CP_RAP_ESPCT6_R Generation date 2014-10-08T21:34:36Z Analysis is based on the database content at 2014-10-08T11:31:06Z Data coverage 2000-12-07T00:00:00Z/2013-12-31T23:59:59Z Columns description: Date: YYMMDD OK?: OK: comparison OK, ERR: error ERR: if ERR, the reason for error: F: not all files exist R: number of records don't match T: timestamps don't match V: versions don't match Rx: number of records in file x Vx: version of file x Gx: generation time Ix: ingestion time

Example: RAPID ESPCT6

Example of Timing errors UT time stamps can differ up to 3 milliseconds

Automation Tool Purpose of the tool: Keeping track of dataset updates that may cause re-deliveries of other products Avoid risk of having products that are out-of-date Tool consists of a number of distinct components identification of intervals in need of (re-)processing scheduling pipeline: to issue jobs across the CAA machines standard wrapper and support routines for execution of pipelines common logging, pre-validation and submission system Instrument teams may benefit of this service, particularly the first part that identifies the intervals that are in need of (re-)processing

Automation Tool: Example # Check FGM since given date 2001-01-01 yesterday #2014-05-01 C1_CP_FGM_FULL,2014-05-01 Output: C1_CP_FGM_FULL #2014-05-01 2002-05-09T18:45:42Z/2002-05-12T03:48:59Z #2014-05-01 2009-12-03T03:00:11Z/2010-01-01T12:35:48Z #2014-05-01 2011-02-08T02:40:53Z/2011-02-10T08:57:25Z #2014-05-01 2012-03-03T00:09:44Z/2012-04-01T09:47:42Z #2014-05-01 2014-03-01T04:31:36Z/2014-05-01T06:05:48Z The check is being made on all data from 2001-01-01 to the most recent day primary dataset = time specification ->ingestion date of 2014-05-01 for the whole mission dependent dataset = C1_CP_FGM_FULL (a minimum ingestion date is specified but is not really needed in this case since it is the same as the primary dataset; it was included to avoid picking up the FGM data which was re-ingested with detached headers but the data were unchanged so did not want to trigger a reprocessing of the entire mission). Result = looking for intervals where the dependent dataset has been ingested more recently than the primary, so in this case it is finding all intervals where C1_CP_FGM_FULL has been ingested since 2014-05-01

Automation Tool: Example, cont … If the prime and dependent specifications are swapped, it would then list all C1_CP_FGM_FULL intervals that had not been ingested since 2014-05-01 # Check FGM not ingested since given date 2001-01-01 yesterday C1_CP_FGM_FULL #2014-05-01 Output: C1_CP_FGM_FULL 2001-01-01T00:00:00Z/2002-05-09T18:45:42Z C1_CP_FGM_FULL 2002-05-12T03:48:59Z/2009-12-03T03:00:11Z C1_CP_FGM_FULL 2010-01-01T12:35:48Z/2011-02-08T02:40:53Z C1_CP_FGM_FULL 2011-02-10T08:57:25Z/2012-03-03T00:09:44Z C1_CP_FGM_FULL 2012-04-01T09:47:42Z/2014-03-01T04:31:36Z C1_CP_FGM_FULL 2014-05-01T06:05:48Z/2014-10-21T00:00:00Z  

RAPID Example # Check if dataset C1_CP_RAP_EPITCH needs updating 2001-01-01 yesterday C1_CP_RAP_EPITCH C1_CP_FGM_FULL,2014-05-01 C1_CP_RAP_EPITCH 2002-05-09T18:45:42Z/2002-05-12T03:48:59Z C1_CP_RAP_EPITCH 2009-12-03T03:00:11Z/2010-01-01T12:35:48Z C1_CP_RAP_EPITCH 2012-04-01T00:00:00Z/2012-04-01T09:47:42Z C1_CP_RAP_EPITCH 2013-12-31T23:59:59Z/2014-05-01T06:05:48Z

RAPID Example, cont … Output can optionally be given as interval split/aligned e.g. by day 2002-05-09T00:00:00Z/2002-05-10T00:00:00Z 2002-05-10T00:00:00Z/2002-05-11T00:00:00Z 2002-05-11T00:00:00Z/2002-05-12T00:00:00Z 2002-05-12T00:00:00Z/2002-05-13T00:00:00Z 2009-12-03T00:00:00Z/2009-12-04T00:00:00Z 2009-12-04T00:00:00Z/2009-12-05T00:00:00Z 2009-12-05T00:00:00Z/2009-12-06T00:00:00Z 2009-12-06T00:00:00Z/2009-12-07T00:00:00Z ... 2009-12-26T00:00:00Z/2009-12-27T00:00:00Z 2009-12-27T00:00:00Z/2009-12-28T00:00:00Z 2009-12-28T00:00:00Z/2009-12-29T00:00:00Z 2009-12-29T00:00:00Z/2009-12-30T00:00:00Z 2009-12-30T00:00:00Z/2009-12-31T00:00:00Z 2009-12-31T00:00:00Z/2010-01-01T00:00:00Z 2010-01-01T00:00:00Z/2010-01-02T00:00:00Z 2012-04-01T00:00:00Z/2012-04-02T00:00:00Z Option also provided to give the next available version number for each interval

Search of Missing Files # Find missing FGM_FULL files 2001-01-01 yesterday C1_CP_FGM_FULL @MISSION 2001-01-01 yesterday C3_CP_FGM_FULL @MISSION Output: C1_CP_FGM_FULL 2001-01-01T00:00:00Z/2001-01-07T00:10:02Z C1_CP_FGM_FULL 2001-07-04T12:10:14Z/2001-07-06T21:16:07Z C1_CP_FGM_FULL 2009-10-28T19:32:41Z/2009-10-31T04:39:09Z C1_CP_FGM_FULL 2014-05-01T06:05:48Z/2014-10-14T00:00:00Z C3_CP_FGM_FULL 2001-01-01T00:00:00Z/2001-01-07T00:10:02Z C3_CP_FGM_FULL 2001-12-06T05:10:27Z/2001-12-08T14:18:08Z C3_CP_FGM_FULL 2005-10-05T07:30:17Z/2005-10-07T16:35:23Z C3_CP_FGM_FULL 2006-05-02T15:34:55Z/2006-05-05T00:40:56Z C3_CP_FGM_FULL 2006-07-03T12:17:51Z/2006-07-05T21:23:44Z C3_CP_FGM_FULL 2006-07-13T00:38:55Z/2006-07-15T09:47:08Z C3_CP_FGM_FULL 2009-01-14T01:32:09Z/2009-01-16T10:38:51Z C3_CP_FGM_FULL 2014-05-01T06:05:48Z/2014-10-14T00:00:00Z

EDI Delivery/Ingestion Activity The plots are regenerated daily around mid-night Monthly and 6-month plots Top two panels are taken from database Top: Number of files ingested into the database 2nd from top: average time used for one file to validate/add into the database Bottom five shows an instantaneous situation at the time of plot production 3rd: Number of files failed validation: e.g. wrong version number 4th and 5th: number of CEF and nn-CEF files in the delivery area 6th and 7th: number of CEF and non-CEF files waiting for validation

Status of File Transfer to CSA http://caa.estec.esa.int/caa/csa_stats.xml

EDI inventory Notes: If EGD exists, there is a chance for PP/SPIN/MP If EGD does not exist, no chance for PP/SPIN/MP QZC and CRF should exist always in EF-mode, so they should have the same coverage as CLIST/EF-mode PP and SPIN should have identical coverage MP should have a wider coverage than PP/SPIN

EDI inventory Inventory plots are visible in annex 2