Send2NCEI: Fostering Producer-Archive Propinquity


Send2NCEI: Fostering Producer-Archive Propinquity. Kenneth S. Casey, PhD, Deputy Director, Data Stewardship Division. John Relph, Technical Development Team Lead, Software Engineering Branch, Data Stewardship Division. Propinquity: the state of being close to someone or something; proximity.

The NODC Mission - 2014 “… organized for the purpose of acquiring, compiling, processing and preserving oceanographic data for ready retrieval”

The NCEI Mission - 2015 “… responsible for preserving, monitoring, assessing, and providing access to the Nation’s treasure of environmental data and information.” NCEI Levels of Stewardship: 1: Long Term Preservation and Basic Access; 2: Enhanced Access and Basic Quality Assurance; 3: Scientific Improvements; 4: Derived Products; 5: Authoritative Records; 6: National Services and Intl. Leadership. A callout on the levels diagram reads “Focus here today”.

Open Archive InfoSys (OAIS) Reminder! [Diagram: the OAIS functional model, showing PRODUCERS, CONSUMERS, and MANAGEMENT around the archive functions Ingest, Archival Storage, Data Management, Access, Administration, Preservation Planning, and Common Services, with SIP, AIP, DIP, Descriptive Info, and Queries/Results as the flows between them.] DIP/SIP = Dissemination/Submission Information Package

Ingest - Two Primary Methods [Diagram: the same OAIS functional model, with “Automation” and “Send2NCEI” labeled at the Ingest side.]

Send2NCEI: A Producer-friendly interface for collecting metadata and uploading data files to the Archive as a well-formed SIP, and an internal set of services for inserting the well-formed SIP into the existing Archive workflow. Intended for “one-off” submissions. To understand where Send2NODC fits in, you need to understand the NODC archive workflow, at a high level if not necessarily at the detailed level.

Archive “Swim Lanes”: Producers; Data Content Managers and Data Officers; Consumers. Data Content Managers (DCMs) do the bulk of the archive workflow. Data Officers (DOs) assign the submissions to DCMs and provide an independent final review. This approach has great strengths: our archive process is agnostic to both the structure and the content of the SIP. It can be transmitted by any means (carrier pigeon? sure, why not?), the metadata can take any format (structured or otherwise), and the file format can vary (widely!). In other words, we have designed our system to be as easy as possible on the Producer, who has of course already done a lot of work to collect the data.

Archive “Swim Lanes”: Making it simple here … makes it harder here* (* and some things, like file integrity checks, are basically impossible). But these strengths can also be a weakness. The SIP between the Producer swim lane and the Data Content Manager (DCM) lane is currently highly heterogeneous in every aspect: the transmission method (ftp, email, etc.), the metadata "format" (readme file, email content, etc.), and the file format. This heterogeneity can create an additional burden on our DCMs, and it is harder to capture the kind of information that we really want (instead of just the minimums) in a structured way. NODC netCDF templates are already attacking the file format problem, and Send2NODC attacks the other two problems, thereby more rapidly advancing the DCM through his or her swim lane.

Send2NCEI addresses two of the three SIP heterogeneity problems*: unstructured/missing metadata and variable transmission. Enforces structure while still attempting to keep it simple and easy for the Producer. Improves overall quality and integrity. Reduces the DCM level of effort (and to a lesser extent, the DO’s). * The third, varying file formats, is being addressed by the NODC netCDF templates.
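
The slides do not say how Send2NCEI checks integrity, so the following is only a minimal sketch of one common approach: the Producer side records a checksum for each uploaded file, and the Archive side recomputes and compares. The function names (`sha256_of`, `build_manifest`, `verify_manifest`) and the choice of SHA-256 are assumptions for illustration, not part of Send2NCEI.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA-256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(files: list[Path]) -> dict[str, str]:
    """Producer side (hypothetical): map each file name to its checksum."""
    return {f.name: sha256_of(f) for f in files}


def verify_manifest(directory: Path, manifest: dict[str, str]) -> list[str]:
    """Archive side (hypothetical): names of files that are missing or altered."""
    problems = []
    for name, expected in manifest.items():
        candidate = directory / name
        if not candidate.is_file() or sha256_of(candidate) != expected:
            problems.append(name)
    return problems
```

A check like this only works when every submission carries a manifest in a known place, which is the kind of structure an email attachment or ad hoc ftp drop does not guarantee.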

Send2NCEI Producer UI Flow [Flow diagram: a new submitter creates a new account, an existing submitter logs in, and a forgot-password path retrieves the password. Login leads to Edit Account Profile, Change password, and My Submission Packages (Track Submitted, Create New, Edit Current, View Archived). A package has five steps: 1. People & Projects, 2. Dates & Locations, 3. Data Types, 4. Package Description, 5. Upload & Submit. Remaining boxes: Store Metadata; Store Data; Send Confirmation Email, Update Tables.]

Login: Login or create account (< 3 minutes). Meets numerous federal requirements (password complexity and recovery, “fine print” legalese, etc.). Introduces user interaction model.

My Submission Packages: “Control Panel”. Establishes user presence “in app”. Create new packages; edit/delete active packages; view submitted packages; link to archived packages.

1. People & Projects: Well-defined user interaction model (autocomplete, instant validation, repeating entries, etc.). Multiple persons, multiple ISO roles; Projects; Funding Agencies.

2. Dates & Locations: Dates; Locations; Platforms; Sea Areas/Regions.

3. Data Types: Variables; Units (UDUNITS); Instrument; Observation category. Can include methods and data quality info.

4. Package Description: Title (optionally auto-generated); Abstract; Authorship List for DOIs; Purpose; References.

5. Upload & Submit: Upload files. Can separately provide some or all files if needed. Comment on overall package. Validate package prior to upload/submit. Newbies averaged less than 20 minutes!
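
The “validate package prior to upload/submit” step could look roughly like the sketch below. The five section keys mirror the five form steps, but the key names, the required fields, and the date rule are all assumptions made for illustration; the slides do not list which fields Send2NCEI actually requires.

```python
from datetime import date

# Hypothetical required fields, grouped by the five form steps.
REQUIRED_BY_STEP = {
    "people_projects": ["people"],
    "dates_locations": ["start_date", "end_date"],
    "data_types": ["variables"],
    "package_description": ["title", "abstract"],
    "upload": ["files"],
}


def validate_package(package: dict) -> list[str]:
    """Return a list of human-readable problems; an empty list means valid."""
    errors = []
    for step, fields in REQUIRED_BY_STEP.items():
        section = package.get(step, {})
        for field in fields:
            if not section.get(field):
                errors.append(f"{step}: '{field}' is required")

    # Cross-field rule (assumed): the date range must not run backwards.
    dates = package.get("dates_locations", {})
    start, end = dates.get("start_date"), dates.get("end_date")
    if start and end and start > end:
        errors.append("dates_locations: start_date is after end_date")
    return errors
```

Returning every problem at once, rather than stopping at the first, suits a form where the Producer can jump back to any step and fix entries in one pass.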

Send2NCEI Internal UI [Flow diagram: the producer UI flow (Login, My Submission Packages, and the steps 1. Responsible Persons/Projects, 2. Dates & Locations, 3. Measured Parameters, 4. Package Description, 5. Upload & Submit), with “XML + data” passing on to the internal process.] Initial Appraisal by Data Officer of the Week (DOW): S2N creates Help Desk Ticket (HDT); HDT links to package metadata; DOW appraises and selects appropriate Unit/Team; Unit POC/DO assigns Data Content Manager (DCM).

Send2NCEI Internal Process. Initial Appraisal by Data Officer of the Week (DOW): S2N creates Help Desk Ticket (HDT); HDT links to package metadata; DOW appraises and selects appropriate Unit/Team; Unit POC/DO assigns Data Content Manager (DCM). S2N Stages Data and Metadata for DCM: AIP created in ingest area; Data and Producer’s ISO XML copied to accession; ATDB Brief Access Record pre-populated; HDT updated and DCM notified to begin work; Producer emailed and My Submission Packages updated. Normal Archive Procedures Followed: DCM reviews and interacts with Producer as needed; DCM requests “archive-published” status from DO; DO reviews and approves, or iterates; Producer emailed and My Submission Packages updated.
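
The internal process reads naturally as a small state machine. The sketch below is one way to model it; apart from “archive-published”, the status names are invented here to match the steps on the slide, and `advance` is a hypothetical helper, not an S2N function.

```python
from enum import Enum


class Status(Enum):
    SUBMITTED = "submitted"                      # HDT created by S2N
    APPRAISED = "appraised by DOW"               # Unit/Team selected
    ASSIGNED = "assigned to DCM"                 # by the Unit POC/DO
    STAGED = "staged for DCM"                    # AIP created, DCM notified
    IN_REVIEW = "DCM review"                     # DCM works with the Producer
    PUBLISH_REQUESTED = "archive-published requested"
    PUBLISHED = "archive-published"


# Allowed moves; the DO can approve a request or send it back to the DCM.
TRANSITIONS = {
    Status.SUBMITTED: {Status.APPRAISED},
    Status.APPRAISED: {Status.ASSIGNED},
    Status.ASSIGNED: {Status.STAGED},
    Status.STAGED: {Status.IN_REVIEW},
    Status.IN_REVIEW: {Status.PUBLISH_REQUESTED},
    Status.PUBLISH_REQUESTED: {Status.PUBLISHED, Status.IN_REVIEW},
    Status.PUBLISHED: set(),
}


def advance(current: Status, target: Status) -> Status:
    """Return the new status, or raise if the workflow does not allow the move."""
    if target not in TRANSITIONS[current]:
        raise ValueError(f"cannot move from {current.value!r} to {target.value!r}")
    return target
```

The only branch is at the DO review, which matches “DO reviews and approves, or iterates”; every other step has a single successor.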

Send2NCEI Internal UI

Magic Happens Here! [Flow diagram: the same producer UI flow and Initial Appraisal steps as on the earlier Send2NCEI Internal UI slide, with “XML + data” passing from Upload & Submit to the internal process.] I don’t like this slide, I think I’m gonna delete it, and instead just go to the next one, which needs a new title, something like Flexible and Extendable.

What Magic? Human web interface only loosely coupled to backend processing. All sorts of extensibility is possible: program-specific “tabs” could require additional metadata; other machine systems could bypass the web interface entirely; the web interface could be used to establish a new collection-level record for a future automated data stream; fields could be pre-populated from an ISO record or a netCDF file; other/additional vocabularies can be included in auto-complete fields...
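
As an illustration of the “pre-populate fields from an ISO record” idea, the sketch below pulls a title and abstract out of an ISO 19139-style XML record using only the Python standard library. The `gmd`/`gco` namespaces and element paths are the standard ISO 19139 ones; the function name and the `title`/`abstract` form-field keys are assumptions, since the slides do not describe how Send2NCEI would map an ISO record onto its form.

```python
import xml.etree.ElementTree as ET

# Standard ISO 19139 namespaces.
NS = {
    "gmd": "http://www.isotc211.org/2005/gmd",
    "gco": "http://www.isotc211.org/2005/gco",
}

_IDENT = "gmd:identificationInfo/gmd:MD_DataIdentification"


def prepopulate_from_iso(xml_text: str) -> dict[str, str]:
    """Return hypothetical form fields filled from an ISO metadata record."""
    root = ET.fromstring(xml_text)

    def text_at(path: str) -> str:
        # Missing elements become empty strings so the form field stays blank.
        node = root.find(path, NS)
        return (node.text or "").strip() if node is not None else ""

    return {
        "title": text_at(
            f"{_IDENT}/gmd:citation/gmd:CI_Citation/gmd:title/gco:CharacterString"
        ),
        "abstract": text_at(f"{_IDENT}/gmd:abstract/gco:CharacterString"),
    }
```

Because the result is a plain dictionary, the same function could feed the web form or a machine client that bypasses the web interface, which is the loose coupling the slide describes.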

Send2NCEI Status: Paperwork Reduction Act approval granted. Public comments collected and responses published. Released on April 22, 2015: https://www.nodc.noaa.gov/s2n/ Integrate into NCEI Archive workflow. Integrate into ATRAC?

Producer Confirmation Email

Help Desk Ticket