Components of the OTN Data Centre Document-oriented web-based data repository Data and metadata in secure folders per user.

Slides:



Advertisements
Similar presentations
2011 NetIS Presentation The Complete ePublishing Platform Designed for the 21 st Century.
Advertisements

28 March 2003e-MapScholar: content management system The e-MapScholar Content Management System (CMS) David Medyckyj-Scott Project Director.
Management Information Systems, Sixth Edition
Business Intelligence components Introduction. Microsoft® SQL Server™ 2005 is a complete business intelligence (BI) platform that provides the features,
Web-Enabling the Warehouse Chapter 16. Benefits of Web-Enabling a Data Warehouse Better-informed decision making Lower costs of deployment and management.
I Information Systems Technology Ross Malaga 3 "Part I Understanding Information Systems Technology" Copyright © 2005 Prentice Hall, Inc. 3-1 SOFTWARE.
Passage Three Introduction to Microsoft SQL Server 2000.
Definitions Collaboration – working together on team projects and sharing information, often through ad-hoc processes, to accomplish project goals. Document.
©2011 Quest Software, Inc. All rights reserved. Steve Walch, Senior Product Manager Blog: November, 2011 Partner Training Webcast.
OMap By: Haitham Khateeb Yamama Dagash Under Suppervision of: Benny Daon.
Overview of the ODP Data Provider Sergey Sukhonosov National Oceanographic Data Centre, Russia Expert training on the Ocean Data Portal technology, Buenos.
Web 2.0: Concepts and Applications 2 Publishing Online.
Overview of Mini-Edit and other Tools Access DB Oracle DB You Need to Send Entries From Your Std To the Registry You Need to Get Back Updated Entries From.
Digital Library Architecture and Technology
Ocean Tracking Network Bob Branton, Lenore Bajona, Susan Dufault, Brian Jones, Marta Mihoff Dalhousie University, Halifax Canada Global.OceanTrack.org.
QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.
Easy HTML DB. Michael Cunningham Developer/Database Administrator.
Mendeley Institutional Edition Hazman Aziz, eProduct Manager (APAC) University Kebangsaan Malaysia.
SharePoint 2010 Business Intelligence Module 10: Reporting Services.
1Copyright © 2012, Oracle and/or its affiliates. All rights reserved. Insert Information Protection Policy Classification from Slide 8 Reporting from Contract.
IOOS DMAC 2015 Jon “OTN is a biological observing system of the UN’s Global Ocean Observing System.” Following on from the Census.
Tutorial 10 Adding Spry Elements and Database Functionality Dreamweaver CS3 Tutorial 101.
SiS Technical Training Development Track Day 8. Agenda  Quick Overview of PeopleSoft Security  Understand Permission Lists, Roles, User and Tree Security.
MAHI Research Database Data Validation System Software Prototype Demonstration September 18, 2001
EXtensible Catalog David Lindahl University of Rochester.
A primer on version control at OTN
Uniting Cultures, Technology & Applications A Case Study University of New Hampshire.
Dream Report: Secure and Reliable Reporting Renee Sikes Applications Engineer Dream Report Brand Manager.
Tutorial 121 Creating a New Web Forms Page You will find that creating Web Forms is similar to creating traditional Windows applications in Visual Basic.
Stephen Booth EPCC Stephen Booth GridSafe Overview.
1Copyright © 2012, Oracle and/or its affiliates. All rights reserved. Insert Information Protection Policy Classification from Slide 8 Contract Management.
Esri UC2013. Technical Workshop. Technical Workshop 2013 Esri International User Conference July 8–12, 2013 | San Diego, California Migrating your Data.
Microsoft SharePoint Server 2010 for the Microsoft ASP.NET Developer Yaroslav Pentsarskyy
Use of Hierarchical Keywords for Easy Data Management on HUBzero HUBbub Conference 2013 September 6 th, 2013 Gaurav Nanda, Jonathan Tan, Peter Auyeung,
Archivists' Toolkit - CRADLE Presentation, 10 Feb The Archivists’ Toolkit CRADLE Presentation 10 Feb
Archivists' Toolkit - CDL Presentation, October 17, 2005 The Archivists’ Toolkit Lee Mandell Brad Westbrook.
GEM Portal and SERVOGrid for Earthquake Science PTLIU Laboratory for Community Grids Geoffrey Fox, Marlon Pierce Computer Science, Informatics, Physics.
Data Validation OPEN Development Conference September 19, 2008 Sushmita De Systems Analyst.
VO Sandpit, November 2009 CEDA Metadata Steve Donegan/Sam Pepler.
IODE Ocean Data Portal - ODP  The objective of the IODE Ocean Data Portal (ODP) is to facilitate and promote the exchange and dissemination of marine.
The new European Toolkit EC-CHM Miruna Bădescu EEA contractor: Eau de Web.
The Digital Library for Earth System Science: Contributing resources and collections GCCS Internship Orientation Holly Devaul 19 June 2003.
Infrastructure for QA and automatic trending F. Bellini, M. Germain ALICE Offline Week, 19 th November 2014.
MDPHnet & ESP Data Partner Participation Overview The following slides describe the necessary steps for a data partner to participate in the MDPHnet Network.
1 © Xchanging 2010 no part of this document may be circulated, quoted or reproduced without prior written approval of Xchanging. MOSS Training – UI customization.
2012 Objectives for CernVM. PH/SFT Technical Group Meeting CernVM/Subprojects The R&D phase of the project has finished and we continue to work as part.
1 The EDIT System, Overview European Commission – Eurostat.
Adrian Jackson, Stephen Booth EPCC Resource Usage Monitoring and Accounting.
Find Research Data b2find.eudat.eu B2FIND User Training How to find data objects and collections using EUDAT’s B2FIND This work is licensed.
PDS4 Demonstration Management Council Face-to-Face Flagstaff, AZ August 22-23, 2011 Sean Hardman.
What problems are we trying to solve? Hannes Tschofenig.
Introduction to Programming 1 1 2Introduction to Java.
Copyright © New Signature Who we are: Focused on consistently delivering great customer experiences. What we do: We help you transform your business.
Ontolica Fusion 4.0 The easy Automation Tool for SharePoint Steen Jakobsen Fusion Principal Architect
SG-OBIS-V, May 2016 Lenore Bajona, Director of Data Management OTN Data Centre OTN Data Portal:
Cooperation and Interoperability Lenore Bajona, Robert M. Branton Ocean Tracking Network
International Planetary Data Alliance Registry Project Update September 16, 2011.
Ocean Tracking Network I have since 2008, been data management director for the Ocean Tracking Network (OTN) at Dalhousie University.
Management Information Systems by Prof. Park Kyung-Hye Chapter 7 (8th Week) Databases and Data Warehouses 07.
SharePoint 101 – An Overview of SharePoint 2010, 2013 and Office 365
Patricia 5.7.
Marc-Elian Bégin ETICS Project, CERN
How to Submit Collection Metadata
An Overview of Data-PASS Shared Catalog
OTN Data Warehouse Overview: Data Workflow
A Guide to Shift’s Open Data ecosystem & Data workflow
Continuous Automated Chatbot Testing
Database Management Systems
Computational Environment Management
SSDT, Docker, and (Azure) DevOps
Presentation transcript:

Components of the OTN Data Centre Document-oriented web-based data repository Data and metadata in secure folders per user in their original formats Managed by the user Access and authorship permissions set by data owner Standardized metadata sheets for user projects Receivers and Sentinel Tags Tagging AUVs (Gliders) Environmental Sensors Cruise/Mission metadata Public-facing summary page describing project scopes and general location Citation text, abstract, contact information

Components of the OTN Data Centre Database Open-source, geospatially-aware database engine Project-level permissions with finer permissions available to power users (people fluent in SQL) Data and metadata verification done by data managers and by associated scripts Aggregation of all project data into queryable format

Components of the OTN Data Centre Interfaces to various endpoints for your data Web and GIS-friendly representations of data International metadata standards for acoustic detections (submitting data to OBIS) Publishable aspects of project data delivered in many formats

Data Nodes: Choice of DMS and web front left to local data managers All collection data kept private, even from the OTN parent node Project metadata can be harvested if node chooses to generate it (green area of DB) Database internal layout common to all nodes allowing for protocol and code exchange Access to community code repositories for data loading and generating data products

Puppet script repository – automated deployment of nodes Publishing OS and DB build process in Git for managing evolving data formats and structures Still in development Easy to deploy via VM tools like Vagrant Hopefully useful when deploying to cloud services

Node Toolset - Community of Acoustic Telemetry DMs Continuously evolving data parsers and insertion scripts for common data formats Data managers can author their own scripts or otherwise enhance the existing toolset

Node Toolset - Community of Acoustic Telemetry DMs Continuously evolving data parsers and insertion scripts for common data formats Data managers can author their own scripts or otherwise enhance the existing toolset

Node Toolset - Community of Acoustic Telemetry DMs Continuously evolving data parsers and insertion scripts for common data formats Data managers can author their own scripts or otherwise enhance the existing toolset

Node Toolset - Community of Acoustic Telemetry DMs A platform for code- sharing with permissions ranging from personal to completely public Mechanism for documentation, feedback, feature suggestion, and dissemination Contributions can be suggested with code or through issue-tracking

Summary OTN Database Node structure available for sharing –Built to support the OTN Data Policy –Helpful in identifying mystery / orphan tags across OTN Coding community for managing OTN DB Nodes –But also for visualization, statistical modelling, etc! –Programmers can contribute, non-programmers can define problems and OTN et al. can help solve them Equipment loans to extend existing telemetry effort

Installing a OTN db Node VM 1.Install Vagrant (vagrantup.com), VirtualBox (4.3.xx) and Git ( 2.Create a login to OTN’s GitLab – Jon will add you to OTN Partner Nodes group so you can d/l Node manifest 3.Create a folder on your computer to hold your Virtual Machines navigate to it using Git Bash or Terminal 4.Type the following commands:  git clone partner-nodes/db-node-puppet-installer.git  cd db-node-puppet-installer  vagrant up Installing the Node VM will give you a temporary copy of the OTN database to work in. It will only be accessible from your local machine, and can be deleted when you want it to be. We’ll use it to go through some data ingesting and formatting exercises over the next few days. Draft OTN Node Training Overview

What you get in a Node VM PostgreSQL PostGIS Template OTN DB Templates Tomcat GeoServer ERDDAP VM-only Miniconda Python environment JuPyTer (iPython Notebook) OTN.ipynb repo for data loading and verification

The OTN Database Node Just one component of an acoustic telemetry data management system Bridge between input data from research groups and well-formatted output data to researchers, stakeholders and the public Other crucial components: document management system for input data Web portal for output data ? ?

Tentative Schedule for DB Training Monday Projects and schemas Create project metadata Create and populate project schema Vendor-provided metadata Tagging Metadata Load raw tag metadata Verification Load cache tables + verify OBIS-like tables Wednesday Detection processing Events and Detections loading from CSV Batch processing w/ ULFX (if time) YYYY tables – sharding and inheritance Sensor-enhanced detections Generating Detection Extracts Discovery process – creating publishable metadata Tuesday Receivers Receiver metadata – Short form (OTN) – Long form (OTN) Add station records Verification Sentinel tags if there’s time

Three main components of acoustic telemetry data Detection Event Detection Data Receiver Deployment Tagging Activity Receiver deployments: generally uncontroversial and publishable data. Useful for informing potential collaborators of existing equipment deployed in their intended study areas that could detect their tags. Detection data: protected but not very informative without associated tagging activity data to add the ‘what’ to the ‘where’ and ‘when’. Tagging activity: The history of which tags are in which animals, where those animals were released, how long the tag will live, and all auxiliary measurements and observations made at tagging time by the researchers. Very sensitive, embargoed.

Jon Pye – Portal Manager Ocean Tracking Network Dalhousie University Halifax, Nova Scotia Canada 1 (902) oceantrackingnetwork.org