Data Formats & Data Structures

Slides:



Advertisements
Similar presentations
Using Multimedia on the Web Enhancing a Web Site with Sound, Video, and Applets.
Advertisements

CNIT 132 – Week 9 Multimedia. Working with Multimedia Bandwidth is a measure of the amount of data that can be sent through a communication pipeline each.
Data Management Workshop
File Management Systems
An Introduction to Earthquake Analysis Tools (Part 1) Bill Langin 10/2/02.
Systems Oceanography: Observing System Design. Why not hard-wire the system? Efficiency of interface management –Hard-wire when component number small,
Improved Quality Control for Seismic Networks ---MUSTANG.
Rebecca Boger Earth and Environmental Sciences Brooklyn College.
XP New Perspectives on Microsoft Access 2002 Tutorial 71 Microsoft Access 2002 Tutorial 7 – Integrating Access With the Web and With Other Programs.
Android 4: Creating Contents Kirk Scott 1. Outline 4.1 Planning Contents 4.2 GIMP and Free Sound Recorder 4.3 Using FlashCardMaker to Create an XML File.
August 13-19, 2010Data Management Workshop Foz do Iguassu- Brazil Seismic Quality Assurance Rick Benson IRIS DMC Rick Benson IRIS DMC.
11 Games and Content Session 4.1. Session Overview  Show how games are made up of program code and content  Find out about the content management system.
By Tim Ahern, Program Manager IRIS DMS A “Short” Introduction to the IRIS Data Management Center Data Holdings, Data Organization, and Data Access.
Data Formats: Using Self-describing Data Formats Curt Tilmes NASA Version 1.0 February 2013 Section: Local Data Management Copyright 2013 Curt Tilmes.
By Tim Ahern, Rick Benson, Rob Casey, Chad Trabant and Bruce Weertman and many more talented people at the IRIS DMC.
Array Response Functions with ArrayGUI
EARTH SCIENCE MARKUP LANGUAGE “Define Once Use Anywhere” INFORMATION TECHNOLOGY AND SYSTEMS CENTER UNIVERSITY OF ALABAMA IN HUNTSVILLE.
Mini-SEED Fundamentals
An introduction to PDCC the Portable Data Collection Center.
HDF-EOS Workshop VII, An XML Approach to HDF-EOS5 Files Jingli Yang 1, Bob Bane 1, Muhammad Rabi 1, Zhangshi Yin 1, Richard Ullman 1, Robert McGrath.
Applets & Video Games 1 Last Edited 1/10/04CPS4: Java for Video Games Applets &
London April 2005 London April 2005 Creating Eyeblaster Ads The Rich Media Platform The Rich Media Platform Eyeblaster.
Plenary meeting 2015 – Chania - Crete CASCADE Data Services Yusuf Yigini, Panos Panagos, Martha B. Dunbar Joint Research Centre - European Commission.
Accessing Data Using Web Services. IRIS Services – service.iris.edu FDSN Web services dataselect station event Documentation IRIS web services fedcatalog.
Accessing Data through Web Services. IRIS Services – service.iris.edu  FDSN Web services  dataselect  station  event  Documentation Documentation.
Web Services at IRIS Implementations, Directions, and International Coordination Tim Ahern, Director of Data Services, IRIS Web Services Team: Bruce Weertman,
COMPILATION OF STRONG MOTION DATA FROM EASTERN CANADA FOR NGA-EAST PROJECT Lan Lin Postdoctoral Fellow, Geological Survey of Canada.
Portable Data Collection Center (PDCC) and the Nominal Response Library (NRL) Tim Ahern.
Tutorial 7 Working with Multimedia
Tuesday Nov 10, 13:30 November 8-17, 2009Data Management Workshop Cairo, Egypt.
October 21-26, 2007Data Management Workshop Kuala Lumpur, Malaysia Using SEED Using: Jrdseed PQL SAC RESP JPlotResp Using: Jrdseed PQL SAC RESP JPlotResp.
Tutorial 7 Working with Multimedia. New Perspectives on HTML, XHTML, and XML, Comprehensive, 3rd Edition 2 Objectives Explore various multimedia applications.
Tutorial 7 Working with Multimedia. New Perspectives on HTML, XHTML, and XML, Comprehensive, 3rd Edition 2 Objectives Explore various multimedia applications.
Portable Broadband Seismic Stations - Checking Data Quality Cairo, Egypt November, 2009 Noel Barstow Bruce Beaudoin Marcos Alvarez.
Facilitate – Collaborate – Educate Tuesday, January 10, 11:05am Data Management Workshop Bangkok Thailand, January 2011.
25th & 26th August 2009ICAT developer workshop 1.
By Tim Ahern, Program Manager IRIS DMS A “Short” Introduction to the IRIS Data Management Center Data Holdings, Data Organization, and Data Access.
CS370 Spring 2007 CS 370 Database Systems Lecture 1 Overview of Database Systems.
Rick Benson Saturday, Aug 14, 14:00 August 13-19, 2010Data Management Workshop Foz do Iguassu- Brazil.
By Tim Ahern, Director of Data Services, IRIS A short introduction to the IRIS Data Management System Data Holdings, Data Organization, and Data Access.
Automated Data Quality Assurance: QUACK and MUSTANG Mary Templeton IRIS Data Management Center Mary Templeton IRIS Data Management Center.
Claritas – 3D Land Geometry Example Workflow for an Example 3D Land Geometry Dataset supplied by USGS.
1 Overview Finding and importing data sets –Searching for data –Importing data_.
Dataset registration process Sergey Sukhonosov, Dr. Sergey Belov National Oceanographic Data Centre, Russia Training course on establishment of the ODP.
John Porter Sheng Shan Lu M. Gastil Gastil-Buhl With special thanks to Chau-Chin Lin and Chi-Wen Hsaio.
Converting Waveform Data to Mini-SEED. August 2010 Foz do Iguaçu - Brasil IRIS Data Management Workshop Mini-SEED Fundamentals 1 Mini-SEED is a bare time.
Building a Network Dataless SEED Volume with PDCC.
Building Station Metadata with PDCC
SAC: An Overview SAC ( “Seismic Analysis Code” ) was originally developed by LLNL An interactive program for time-series data with emphasis placed on.
Understanding SEED Headers. SEED is an international standard for the exchange of digital seismological data SEED was designed for use by the earthquake.
Irakli Garibashvili Director, National Scientific Library in Georgia.
Tim Ahern IRIS Director of Data Services Active Source Data Management Workshop January 2014 – Tucson, Arizona.
Agenda FDSN WG2+3 SEED issues –Order of FIR filter coeff (bl 54) –QC flag indicating M for merged data –Est. delay and corr. Applied fields bl 57 –Minor.
SAC: An Overview SAC ( “Seismic Analysis Code” ) was originally developed by LLNL SAC ( “Seismic Analysis Code” ) was originally developed by LLNL An interactive.
® Sponsored by Improving Access to Point Cloud Data 98th OGC Technical Committee Washington DC, USA 8 March 2016 Keith Ryden Esri Software Development.
Thurs Nov 12, 12:45 Nov 8-17, 2009Data Management Workshop Cairo, Egypt.
June 12, 2004Synthetic Seismogram Exchange Standards But We Have A Standard. Can SEED Support Synthetics? Linus Kamb IRIS DMC.
Station Metadata: What do I Need?
HDF5 for Real-Time and/or Embedded Test Data
ISPAQ: IRIS System for Portable Assessment of Quality
Requesting a Standardized Data Set for the FDSN Network
Requesting a Standardized Data Set for the FDSN Network
SEG Y Rev 2.0 An update to the SEG seismic data interchange standard
Introduction to Programming the WWW I
Station Metadata: What do I Need?
Tutorial 7 Working with Multimedia
Training Module Introduction to the TB9100/P25 CG/P25 TAG Customer Service Software (CSS) Describes Release 3.95 for Trunked TB9100 and P25 TAG Release.
Tutorial 7 – Integrating Access With the Web and With Other Programs
ESRM 250/CFR 520 Autumn 2009 Phil Hurvitz
APE EAD3 introduction - DARIAH - Brussels
Presentation transcript:

Data Formats & Data Structures Crossroads of Active Source and Broadband Structures Justin Sweet USArray Advanced Short Course, August 2017

Variety of seismic deployments Passive seismic – natural sources (earthquakes) Tomography Regional and Global Scale Surface Wave Ambient Noise Tomography

Variety of seismic deployments Active seismic – artificial sources (explosives, hammer, etc.) Figure from Karplus

Variety of seismic data formats Passive source SEED SAC Active source SEG-Y, SEG-D, SEG-2 Both PH5

SEED: Standard for the Exchange of Earthquake Data International format, maintained by the FDSN Includes both time series data and metadata stored separately Comprehensive record of data recording process 200 page manual www.iris.edu/manuals/SEEDManual_V2.4.pdf Convert SEED to other formats using rdseed Three common variants: SEED Dataless Mini-SEED

SEED Structure control headers dataless SEED ASCII SEED is constructed from defined structures called blockettes waveform data raw data and embedded auxiliary information Mini-SEED binary (+ ASCII) (full) SEED volume Even though most SEED headers are in ASCII is is not readable, certainly not editable SEED response information is specified as a cascade of stages

Comprehensive Metadata Representation - The total system sensitivity is the product of all stage sensitivities and represents the scaling from ground units to digital samples in counts

SAC: Seismic Analysis Code Developed in 1980s by LLNL as a seismic data file format and processing package Currently made available by IRIS DMC Precision in timing and distance has improved beyond SAC single-precision capability For sample rates >0.01s and long record lengths, timing precision in SAC is inadequate Geographic values are marginally adequate (~1m precision) Increasing SAC precision will change SAC header and has implications on existing programs Addressing Precision Limitations in the Seismic Analysis Code (SAC) File Headers and Data Format from 2012 Eastern SSA presentation

SAC: Seismic Analysis Code Each SAC data file contains single data set – single data component recorded at a single station header record – describes the contents of the file SAC binary format (most common) Part of the motivation for developing ASDF stems from the fact that reading and writing SAC files for a large tomographic inversion practically brings a huge parallel file system to its knees due to the very large number of involved files.

SEG-Y: Society of Exploration Geophysicists for storing data Developed in 1970s for use by the active source community Variety of flavors SEG-D, developed for field data SEG-P, developed for positional data SEG-Y, intended to be format for everybody Recent release (rev 2.0, 2017) Higher capacity for trace data Microsecond time-stamp accuracy Higher-precision coordinates, depths, and elevation

SEG-Y: Society of Exploration Geophysicists for storing data What’s in a SEG-Y file? Text file header (green) Trace header (light blue) Actual data (purple) Major limitation – Data & metadata stored together Cannot update metadata (headers) without re-archiving the entire dataset

PH5: PASSCAL HDF5 format Developed at PASSCAL beginning in early 2000s Based on robust Hierarchical Data Format (HDF5) designed to store and organize large amounts of data Stores data and metadata separately Frontload waveform data into DMC Efficient metadata updates Allows access to parts of the data without needing to parse the entire file Exposes active source data to DMC request tools

PH5: PASSCAL HDF5 format Why HDF5? 20+ year history starting at NCSA Widely used & well supported Self-describing format Can create your own data organization Hierarchical Easy to edit Built-in compression Fast reading

Format Usage: Field versus Archive Different formats designed for different uses Eg. SEG-D (designed for field use) vs. SEG-Y (designed for archive and analysis) Field priorities Easy to visualize Archive priorities Highly-compressible Well-described

PH5 & Kitchen Software Suite Can we design a format the works well in both the field and in the archive? PH5 – Format definition that describes the HDF5 file organization Kitchen – Software suite that provides the tools to interface with PH5

Field data  PH5  Archive Data ingestion Waveform data ingested into PH5 using Kitchen suite tool (pforma) Metadata imported automatically for some raw data formats (e.g. SEG-D), or can be imported using Kitchen GUI tool (noven) Data QC PH5Viewer – options to select gathers, time windows, reduction velocity, simple gain and plotting options Development of tool to confirm all data described by metadata and/or missing metadata

Field data  PH5  Archive

Field data  PH5  Archive

Current DMC Interface

PH5: PASSCAL HDF5 format

User access to PH5 data at IRIS DMC Current capability – Web page interface Allows for extraction of SEG-Y shot and receiver gathers, or SAC waveforms Once requests made, data available via ftp Planned capability – webservices 2 services: station and dataselect station – access to metadata identical to standard station webservice currently in use for SEED data dataselect – access to waveform data into mseed format Once webservice is up and running, PH5 data will be accessible just like the rest of the DMC archive Exposed to DMC QC tools like MUSTANG Huge advance for active source data!

PH5: Container for both active and passive data Unlike earlier formats, PH5 is capable of holding active and passive data at the same time, as part of a single dataset. Data extraction Active source can be output at SEG-Y Passive source can be output at mseed or SAC.

Exercise: Let’s access the Wavefields dataset DMC data access tools http://ds.iris.edu/ds/nodes/dmc/tools List of experiments archived in PH5 http://ds.iris.edu/pic-ph5/experiment_list.php Presently, wavefields dataset is archived in miniseed Accessible via webservices Webservices info/syntax - http://service.iris.edu Local webservices access http://156.56.66.202:8080/fdsnws/dataselect/1

Exercise: Let’s access the Wavefields dataset Timeseries http://service.iris.edu/fdsnws/dataselect/1/ Use the URL builder to plot data from Network: YW Station: 511 Location: 01 Channel: HHZ Starttime: 2016-07-13T00:00:00 Endtime: 2016-07-13T12:00:00 Add a bandpass filter from 2-10Hz Try modifying the URL to access the local copy of the data via webservice http://156.56.66.202:8080/fdsnws/dataselect/1