Page 1 of 18 2005 Joint Assembly: AGU, SEG, NABS & SPD/AAS Session U08: eGY: e-Science for Geoscience May 24, 2005New Orleans, LA Volodya Papitashvili,

Slides:



Advertisements
Similar presentations
How to Set Up a System for Teaching Files, Conferences, and Clinical Trials Medical Imaging Resource Center.
Advertisements

Remote Visualisation System (RVS) By: Anil Chandra.
The Biosafety Clearing-House of the Cartagena Protocol on Biosafety Tutorial – BCH Resources.
Chapter 10: Designing Databases
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
16.1 © 2004 Pearson Education, Inc. Exam Planning, Implementing, and Maintaining a Microsoft® Windows® Server 2003 Active Directory Infrastructure.
Tools and Services for the Long Term Preservation and Access of Digital Archives Joseph JaJa, Mike Smorul, and Sangchul Song Institute for Advanced Computer.
Chapter 9: Moving to Design
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation Mike Smorul, Joseph JaJa, Yang Wang, and Fritz McCall.
Distributed Systems: Client/Server Computing
Mgt 240 Lecture Website Construction: Software and Language Alternatives March 29, 2005.
1 Chapter 20 — Creating Web Projects Microsoft Visual Basic.NET, Introduction to Programming.
Presented by Mina Haratiannezhadi 1.  publishing, editing and modifying content  maintenance  central interface  manage workflows 2.
Creating Web Page Forms
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
TPAC Digital Library Talk Overview Presenter:Glenn Hyland Tasmanian Partnership for Advanced Computing & Australian Antarctic Division Outline: TPAC Overview.
ODBC Open DataBase Connectivity a standard database access method developed by Microsoft to access data from any application regardless of which database.
Module 4 Managing Client Access. Module Overview Configuring the Client Access Server Role Configuring Client Access Services for Outlook Clients Configuring.
With Internet Explorer 8© 2011 Pearson Education, Inc. Publishing as Prentice Hall1 Go! with Internet Explorer 8 Getting Started.
Test Review. What is the main advantage to using shadow copies?
For more notes and topics visit:
1 The SpaceWire Internet Tunnel and the Advantages It Provides For Spacecraft Integration Stuart Mills, Steve Parkes Space Technology Centre University.
Chapter 9 Elements of Systems Design
DATA COMMUNICATION DONE BY: ALVIN SAMPATH CARLVIN SAMPATH.
XHTML Introductory1 Forms Chapter 7. XHTML Introductory2 Objectives In this chapter, you will: Study elements Learn about input fields Use the element.
NASA/ESA Interoperability Efforts CEOS Subgroup - CINTEX Alexandria, Sept 12, 2002 Ananth Rao Yonsook Enloe SGT, Inc.
VGMO.NET Vladimir Papitashvili Anshuman Saxena Valeriy Petrov umich.edu Robert Clauer
THE OSI MODEL AND THE TCP/IP PROTOCOL SUITE CS 1202 Lectur3 part2.
Page 1 of European Geosciences Union Assembly Session US5: The International Polar Year April 27, 2005Vienna, Austria Vladimir Papitashvili,
SITools Enhanced Use of Laboratory Services and Data Romain Conseil
DISTRIBUTED COMPUTING
Networks – Network Architecture Network architecture is specification of design principles (including data formats and procedures) for creating a network.
Flexibility and user-friendliness of grid portals: the PROGRESS approach Michal Kosiedowski
SUSE Linux Enterprise Desktop Administration Chapter 12 Administer Printing.
The OSI Model and the TCP/IP Protocol Suite Outline: 1.Protocol Layers 2.OSI Model 3.TCP/IP Model 4.Addressing 1.
X-Road – Estonian Interoperability Platform
Volodya Papitashvili Anshuman Saxena Valeriy Petrov Robert Clauer Page 1 of 16 VGMO NET NASA/LWS Workshop: Virtual Observatories in Space and Solar Physics.
IGY+50, The IPY, and The electronic Geophysical Year (eGY) D.N. Baker Laboratory for Atmospheric and Space Physics University of Colorado, Boulder C. Barton.
Guide to Linux Installation and Administration, 2e1 Chapter 2 Planning Your System.
Unit – I CLIENT / SERVER ARCHITECTURE. Unit Structure  Evolution of Client/Server Architecture  Client/Server Model  Characteristics of Client/Server.
Page 1 of 18 NASA/LWS Workshop: Virtual Observatories in Space and Solar Physics October 27-29, 2004; Greenbelt, Maryland Dan Baker, Charlie Barton, Brian.
August 2003 At A Glance VMOC-CE is an application framework that facilitates real- time, remote cooperative work among geographically dispersed mission.
Dr. Azeddine Chikh IS444: Modern tools for applications development.
Chapter 15 – Part 2 Networks The Internal Operating System The Architecture of Computer Hardware and Systems Software: An Information Technology Approach.
Computer Networking From LANs to WANs: Hardware, Software, and Security Chapter 13 FTP and Telnet.
9 Systems Analysis and Design in a Changing World, Fourth Edition.
Presented by Scientific Annotation Middleware Software infrastructure to support rich scientific records and the processes that produce them Jens Schwidder.
IODE Ocean Data Portal - ODP  The objective of the IODE Ocean Data Portal (ODP) is to facilitate and promote the exchange and dissemination of marine.
9 Systems Analysis and Design in a Changing World, Fourth Edition.
IPY Education and Outreach 21 July 2005 The eGY Opportunity: Education and Outreach Emily CoBabe-Ammann eGY_Team.
Presented by Jens Schwidder Tara D. Gibson James D. Myers Computing & Computational Sciences Directorate Oak Ridge National Laboratory Scientific Annotation.
Evaluating & Maintaining a Site Domain 6. Conduct Technical Tests Dreamweaver provides many tools to assist in finalizing and testing your website for.
The eGY Legacy: A framework for e-Discovery and e-Access (eDeA) to Scientific Data Vladimir Papitashvili, AOSS, University of Michigan The eGY General.
How to Set Up a System for Teaching Files, Conferences, and Clinical Trials Medical Imaging Resource Center.
August 2003 At A Glance The IRC is a platform independent, extensible, and adaptive framework that provides robust, interactive, and distributed control.
The Electronic Geophysical Year (eGY) D.N. Baker, C. Barton, B. Fraser, V. Papitashvili, A. Rodger, B. Thompson, J.H. Allen, B. Arora, D. Kerridge,
AHM04: Sep 2004 Nottingham CCLRC e-Science Centre eMinerals: Environment from the Molecular Level Managing simulation data Lisa Blanshard e- Science Data.
Distributed Data Servers and Web Interface in the Climate Data Portal Willa H. Zhu Joint Institute for the Study of Ocean and Atmosphere University of.
5/19/05 New Geoscience Applications 1 A DISTRIBUTED WORKFLOW DATABASE DESIGNED FOR COREWALL APPLICATIONS Bill KampBill Kamp, Lumnilogical Research Center,
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
9 Systems Analysis and Design in a Changing World, Fifth Edition.
E-Business Infrastructure PRESENTED BY IKA NOVITA DEWI, MCS.
Chapter 8 Environments, Alternatives, and Decisions.
Russian Academy of Sciences
The OSI Model and the TCP/IP Protocol Suite
The OSI Model and the TCP/IP Protocol Suite
Information Technology Ms. Abeer Helwa
Chapter 15 – Part 2 Networks The Internal Operating System
eGY Planning Meeting Boulder, February 2005
The OSI Model and the TCP/IP Protocol Suite
Presentation transcript:

Page 1 of Joint Assembly: AGU, SEG, NABS & SPD/AAS Session U08: eGY: e-Science for Geoscience May 24, 2005New Orleans, LA Volodya Papitashvili, Valeriy Petrov, Bob Clauer SPRL, University of Michigan, Ann Arbor, Michigan Anshuman Saxena TATA Consultancy Services Euro-Labs, Aalborg, Denmark Natasha Papitashvili SPDF & QSS Inc., NASA/GSFC, Greenbelt, Maryland Virtual Global Magnetic Observatory VGMO.NET: A Component of the Electronic Geophysical Year

Page 2 of 18 IGY Legacies: Allowed scientists from different countries to participate in global observations of geophysical phenomena using similar instruments and data processing methodologies Gathered unprecedented volume of geophysical data from around the World Launched first Earth artificial satellites and established the World Data Center System Our Motivation An overwhelming success of the International Geophysical Year ( )

Page 3 of 18 Data Collection Process since IGY: To get scientific data from various physically distributed sources, a scientist has had to: Ever Increasing Requirements: Geospace and Earth Systems Science Higher resolution in space and time Assimilation into models 5.Finally, do some real science with the collected data! 4.Process collected data using mostly proprietary codes, run models… and then… 3.Then ingest retrieved data into a personal (local) database… 2.Get data via snail-mail and air-mail, but only recently via and World Wide Web… 1.Search through a number of World Data Centers, various research institutions, physical observatories, contact colleagues...

Page 4 of th Century Paradigm of Sharing Data: Data Must be Submitted to Data Centers WDCs require continuous support for data acquisition, storage, and distribution Data submission to WDCs remains voluntary Collected data are often not suitable for the World Data Centers System For example, WDCs accept only absolute geomagnetic observations “Push Data” Concept Centralized distribution schemes – World Data Centers System (WDC,  ): Courtesy of the RAND Corporation

Page 5 of 18 A 21 st Century Paradigm: Sharing Distributed Geoscience Data via Virtual Observatories Now Deployed in Cyberspace Publishing and sharing Geoscience data through World Wide Web: Allows to avoid additional steps in the data preparation for submission to WDCs - they will be now pulling data from the providers Data providers achieve greater visibility amongst scientific and user communities A Grid (or Fabric) of interconnected data nodes is a new vision of distributed, self- populating data repositories and centers World Data Centers  become an integral and important part of the World Wide Data Fabric, serving as “clearing houses” to preserve at least 2-3 copies of a particular dataset across the network “Pull Data” Concept Courtesy of the RAND Corporation

Page 6 of 18 Main Elements of a Virtual Observatory Distributed data bases are accessed through the World Wide Web Data Portals and VO nodes Data Visualization Format Conversion Data Acquisition Location Discovery “Virtual Observatory” is a basic concept of the Electronic Geophysical Year we offer to IPY, IHY, IYPE, and World Data Centers

Page 7 of 18 The proposed VGMO.NET is a middleware that provides a new way for the worldwide geomagnetic community to share data and functionality in a platform-independent and location- neutral environment Design Goals Identify prospective geomagnetic data repositories on the Web and provide transparent access to the remote databases through a common interface: VGMO Data Portals Perform online acquisition and processing of geomagnetic data from remote datasets and construct self-populating databases on the VGMO portals and individual user nodes These self-sustained data nodes can then be made easily available to other users through future requests, thus building Data GRID-type (Data Fabric) access and computing VGMO.NET – A Virtual Global Magnetic Observatory Network

Page 8 of 18 A four-tier architecture of the proposed VGMO.NET LOCATION DISCOVERY Web Crawler Data Acquisition via World Wide Web and Internet DATA ACQUISITION via FTP, SSL, XML, HTTP, OPeNDAP… FORMAT CONVERSION (A2F) Flat File Manager IDL MATLAB Simulink Integrated Visualization Layer Highest Level of Data Analysis “ASCII to Flat File Format” for ingestion of downloaded data into the Web-based Portal or GRID-node databases Lowest layer - Location Discovery Module VGMO.NET – Architecture Unleashed

Page 9 of 18 Web-based Portal – runs at  A secure, scalable, platform independent, and user-friendly software for remote access to the portal’s Flat File Manager  The Flat File Manager Client is written to a Java 2 platform that requires a Java Web Start (Java Network Launching Protocol) Standalone Self-Populating Data Node – get from the Web site above  An alternate version to create, manage, and populate user’s local databases, building the VGMO “GRID” access and computing Two Implementations of the VGMO.NET framework UsersPortal

Page 10 of 18 VGMO.NET Highlights Remote (Client) Machine Requirements Java Runtime Environment (JRE), version or later Java Web Start (available for Windows 98/ME/NT/2000/XP, Linux, and Solaris OE) The library and “Java thin client” for the FFMN Client Server Requirements Any standard Web server configured for JNLP (Java Network Launching Protocol) Flat File Manager DLLs and Flat File Manager Server software Platform Independence FFMN Server can be deployed on a wide-variety of platforms (Linux, Solaris OE, Windows 98/ME/NT/2000/XP) and launched remotely from any platform Client Side Security and Notification of Application’s Origin The FFMN service provider signs the downloadable code to ensure that no other party can impersonate the application on the Web; thus, the VGMO framework provides flexibility without compromising security. The user is shown a dialog displaying the application's origin (based on the signer certificate) before the application is launched; thereby, the user can make an informed decision whether to grant additional privileges to the downloaded code If the user trusts the FFMN service provider, he/she can choose to grant additional system privileges, such as a write access to a local disk

Page 11 of 18 VGMO.NET Lookup Tables and Java Interfaces Remote Site Info Format Info Conversion Pointer ftp.iki.rssi.ru--- ftp.abs.xyz.edu---. Remote Site InfoFormat Information Conver- sion Pointer ftp.dmi.dk /pub/wdcc1/obsdata/ 1minval/ YYYY/ dmi.exe ftp.ngdc. noaa.gov /STP/GEOMAGNETIC_DA TA/ONE_ MINUTE_VALUES/YYYY/ ngdc.exe …………………………………………………………………… Prospective Sites Geo Magnetic Crawler (GeoMaC) A2F - Any Format to Flat File Conversion Module FFMN Flat File Manager INTERNET Active Sites

Page 12 of 18 VGMO.NET - Local Database oGeomagnetic data are published in widely different, often proprietary formats oWe convert all downloaded data sets into a Flat-File database oDatabases built via VGMO.NET conform to the Flat-File DBMS architecture Flat DBMS revisited [A. Smith, C. R. Clauer, 1984] oEach dataset consists of two files: a header file, which is an ASCII description of the dataset and a binary data file that is the data itself oLeverages advantages of ASCII presentation (readable and editable data description), as well as binary presentation (compact data storage and fast random access) oA sample header file: Name of header and data files: VOS01 Date files created: 13-May-2002 Record length of data file, in bytes: 20 Number of columns: 4 Number of rows: Flag for missing data: -0.10E+33 # name units source type loc 1 Time seconds T 1 2 VOCE nT Antarctic magnetometer R 9 3 VOSH nT Antarctic magnetometer R 13 4 VOSZ nT Antarctic magnetometer R 17 NOTES: Start time = 01-JAN-01 00:02: End time = 31-DEC-01 23:58: Antarctic magnetometer high resolution data END Note that the local database can hold a mixture of various “flat files”, like the interplanetary magnetic field/solar wind data, ionospheric data, etc.

Page 13 of 18 VGMO.NET Local Database (cont’d) File Name consists of three parts – a station IAGA 3-letter code, followed by a timestamp in YYYYMMDD format and some special tags that are attached for housekeeping purposes: Special Tags: absolute measurements: a variation measurements: v public access: p restricted access: r rate of data sampling (in sec): 60/30/1/ For example, a publicly accessible dataset consisting of 60-sec samples of absolute geomagnetic measurements from Antarctic magnetic observatory VOSTOK for December 2002 will be stored in the flat files named: \2000\06\MAG\VOS _60pa.hed VOS _60pa.dat Directory structure and naming convention

Page 14 of 18 VGMO.NET at Work FFMN Main Menu allows the user to select up to three data sets (File), then do certain operations with selected data sets (Action) by setting Options The File item allows the user to open the server database files or to create a temporary data set for the selected geomagnetic stations (selected either by names or geographic location) If the selected data are found in the server’s database, then the FFMN Server retrieves requested data for the plotting (and possible uploading) to the remote, FFMN client machine In addition, if the “Search worldwide” box is checked, the FFMN Server will look for the selected data on a number of remote FTP sites (listed in the FFMN Lookup File); these data are then downloaded, converted to flat files, and added to the FFMN server database When new FTP sites with geomagnetic data are found, they can be easily linked through additions to the FFMN Lookup File

Page 15 of 18 VGMO.NET Search & Plot Examples

Page 16 of 18 VGMO.NET: WWW Search By default all the sites presented in the list are contacted for world wide search The user can drop some sites from the list by making appropriate selections Each site remains in one of the following states Not connected - Site has not yet been contacted Connecting - Synchronization with the site is in progress Completed - Synchronization with the site has been completed Matching observatories found are listed against each site

Page 17 of 18  Existing World Data Centers continue to serve the worldwide scientific community in providing free access to global geophysical databases  Recently many digital geomagnetic datasets have been placed on the Web, often in near-real time, but some of these data are not even submitted to any data center  In this study, we formulated the concept and showed the developed prototype of the Virtual Global Magnetic Observatory (VGMO) Network  The Virtual Observatory concept is developed within the framework of the Electronic Geophysical Year Summary

Page 18 of 18  Saving retrieved data locally from multiple requests, a VGMO.NET user can build a personal data sub- center, avoiding the Web search if a new request falls within a span of earlier downloaded data  If this self-sustained sub-center is made available to other VGMO users, then the newly “Webbed” data node is integrated into the global DATA GRID (Data Fabric) of users/centers, where the crawling over the Web for data is absolutely transparent to users  However, more studies are needed to learn how the newly “Webbed” digital geomagnetic data can be automatically identified on the Web – and a Semantic Web approach looks the most promising Summary (cont’d)