Usability Issues Facing 21st Century Data Archives Joey Mukherjee and David Winningham


Current Archiving Goal
[Flow diagram. Labels, in order: Mission Team, Raw Data, Processed Data, Write Papers, Data Iteration, Quality Data, Archive, Future Scientists, Quality Data]

Current Archiving Reality
[Flow diagram. Labels, in order: Mission Team, Raw Data, Processed Data, Write Papers, Data Iteration, Data Subsets, Permanent Archive, Future Scientists, Unchecked Data, Home Institution Archive, Public Data]

New Goal
[Flow diagram. Labels, in order: Mission Team, Raw Data, Processed Data, Write Papers, Data Iteration, Processed Data, Archive, Future Scientists, Processed Data]

Standardizing HOWTO
- Make it easy
- Make it useful
- Make it extensible

Make it Easy
- Reading / writing files must be super easy (i.e. cheap!)
  – Either with tools or libraries
- Tools can be command line or GUI
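
As a rough illustration of how cheap "super easy" could be from a library user's point of view, here is a minimal sketch of a two-function read/write interface. It assumes JSON as the container (one of the candidate formats considered later in the talk); the function names `write_series` and `read_series` and the record keys are hypothetical, not part of any existing tool.

```python
import json
from pathlib import Path


def write_series(path, name, units, times, values):
    """Write one time series, together with its units, to a JSON file."""
    if len(times) != len(values):
        raise ValueError("times and values must have the same length")
    record = {
        "name": name,
        "units": units,
        "times": list(times),
        "values": list(values),
    }
    Path(path).write_text(json.dumps(record, indent=2))


def read_series(path):
    """Read a file produced by write_series back into a plain dict."""
    return json.loads(Path(path).read_text())
```

A provider would call `write_series("flux.json", "electron_flux", "counts", times, values)` once, and a user would get everything back with `read_series("flux.json")`. The same two calls could sit behind a command-line or GUI tool.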

Make it Useful
- How do I look at it?
  – Plots/Analysis
- What else can I do with it?
  – Read into IDL, Matlab, Excel, etc.
- Must have immediate benefits
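
One inexpensive way to deliver an immediate benefit is an export path into tools people already use. The sketch below renders time-tagged columns as CSV text, which Excel and most analysis packages can open directly. The function name `series_to_csv` and the column layout are assumptions made for illustration only.

```python
import csv
import io


def series_to_csv(times, columns):
    """Render time-tagged columns as CSV text.

    `columns` maps a column name to a list of values, one per time tag.
    """
    for name, values in columns.items():
        if len(values) != len(times):
            raise ValueError(
                f"column {name!r} has {len(values)} values, expected {len(times)}"
            )
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    # Header row: the time column first, then one column per variable.
    writer.writerow(["time", *columns])
    for i, t in enumerate(times):
        writer.writerow([t, *(values[i] for values in columns.values())])
    return buffer.getvalue()
```

Writing the returned string to a `.csv` file is enough for a spreadsheet user to start plotting without learning the archive format.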

Make it Extensible
- Must be possible for others to add value-added services
- Must be able to hold varieties of data
- Must agree to give up control over content

Case Studies: HTML
- Easy to create!
- Once done, look at it in a browser
- Embrace / Extend

Case Studies: SPASE
- Creation is slow and difficult
- Once created, no real benefits yet
- VxOs have embraced it; no one has extended it yet

Case Studies: IDFS
- Until recently, difficult to create and complex
- Once in, easy to look at, use, archive, etc.
- Somewhat extensible

Things right with IDFS
- Efficient
- Self-documenting
- Calibrations stored in a text file
- Science units derived instead of stored
- Little to no reprocessing ever needed

Other IDFS Benefits
- Can store most types of space physics data, from raw telemetry to highly processed science units
- Reversible from science units to raw telemetry
- Usable by data processor, scientist, and data archiver
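
Three of the properties above (calibrations kept in a text file, science units derived rather than stored, and reversibility back to raw telemetry) can be illustrated with a small sketch. It is not the IDFS format or API. The `key value` calibration layout, the linear gain/offset model, and all function names are assumptions chosen to keep the example short.

```python
def parse_calibration(text):
    """Parse 'key value' lines into a calibration dict; '#' starts a comment."""
    fields = {}
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()
        if not line:
            continue
        key, value = line.split(None, 1)
        fields[key] = value.strip()
    return {
        "gain": float(fields["gain"]),
        "offset": float(fields["offset"]),
        "units": fields["units"],
    }


def to_science_units(raw_counts, cal):
    """Derive science values from stored raw counts (hypothetical linear model)."""
    return [cal["gain"] * count + cal["offset"] for count in raw_counts]


def to_raw_counts(science_values, cal):
    """Invert the linear calibration to recover integer raw counts."""
    return [round((value - cal["offset"]) / cal["gain"]) for value in science_values]
```

Because only raw counts are stored, correcting a calibration means editing the text file; the data themselves need no reprocessing. The inverse shown here only works for an invertible calibration (nonzero gain in this linear case).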

Things wrong with IDFS
- Overly complex format and API
- Not enough support in other tools (poor buy-in)
- Analysis routines merged with the file format (tried to do too much!)

Implementation Plan
- Develop a simple file format that can contain any and all types of time series space physics data
- Develop tools that allow someone to create and inspect files in this format
- Merge in the best parts of IDFS, CDF, netCDF, HDF, FITS, etc. without breaking the paradigm of simplicity

Simple File Format
- Format might already exist:
  – HDF5
  – XML
  – JSON
  – Other data models?
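
To make the idea of a simple, self-describing time series model concrete, here is one possible sketch layered on JSON, one of the candidates listed above. The class names `Variable` and `TimeSeriesFile` and their fields are hypothetical; an HDF5 or XML container could carry the same model.

```python
import json
from dataclasses import asdict, dataclass, field


@dataclass
class Variable:
    name: str
    units: str
    description: str
    values: list  # one entry per time tag; an entry may be a scalar or a list


@dataclass
class TimeSeriesFile:
    times: list
    variables: list = field(default_factory=list)

    def add(self, variable):
        """Attach a variable, requiring one value per time tag."""
        if len(variable.values) != len(self.times):
            raise ValueError(f"{variable.name}: expected {len(self.times)} values")
        self.variables.append(variable)

    def to_json(self):
        """Serialize times, variables, and their metadata to JSON text."""
        return json.dumps(asdict(self), indent=2)

    @classmethod
    def from_json(cls, text):
        """Rebuild a TimeSeriesFile from text produced by to_json."""
        data = json.loads(text)
        result = cls(times=data["times"])
        for entry in data["variables"]:
            result.add(Variable(**entry))
        return result
```

Each variable carries its own units and description, so the file documents itself. Letting an entry be either a scalar or a list allows one structure to hold, for example, a density and an energy spectrum on the same time tags.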

Making it useful
- Get buy-in from visualization tools (SDDAS, DataShop, VisBard, IDL DLM, etc.)
- Get buy-in from archive sites (PDS, PSA, NSSDC, etc.)
- Seed money is essential

Advantages
- Providers
- Users
- Management

Advantages: Providers
- Instrument teams now have something to work toward
- Can develop expertise

Advantages: Users
- Quick ways to create plots or access data
- Expertise again!

Advantages: Management
- Homogeneous archives are far easier to manage and maintain
- Value-added services are a natural extension of quality archives

Conclusion
- Why now? Because SPASE is gaining traction, this is the next logical step.
- This will save money for everyone in the long run.
- Everyone benefits from value-added services.