The eGY Legacy: A framework for e-Discovery and e-Access (eDeA) to Scientific Data Vladimir Papitashvili, AOSS, University of Michigan The eGY General.

Slides:



Advertisements
Similar presentations
Business Development Suit Presented by Thomas Mathews.
Advertisements

DSpace: the MIT Libraries Institutional Repository MacKenzie Smith, MIT EDUCAUSE 2003, November 5 th Copyright MacKenzie Smith, This work is the.
Unit 1: Module 1 Objective 10 identify tools used in the entry, retrieval, processing, storage, presentation, transmission and dissemination of information;
SACNAS, Sept 29-Oct 1, 2005, Denver, CO What is Cyberinfrastructure? The Computer Science Perspective Dr. Chaitan Baru Project Director, The Geosciences.
High Performance Computing Course Notes Grid Computing.
Information Types and Registries Giridhar Manepalli Corporation for National Research Initiatives Strategies for Discovering Online Data BRDI Symposium.
Study Period Report: Metamodel for On Demand Model Selection (ODMS) Wang Jian, He Keqing, He Yangfan, Wang Chong State Key Lab of Software Engineering,
DEVA Data Management Workshop Devil’s Hole Pupfish Project Data Management Workshop Devil’s Hole Pupfish Program Death Valley National Park Introduction.
DataGrid Kimmo Soikkeli Ilkka Sormunen. What is DataGrid? DataGrid is a project that aims to enable access to geographically distributed computing power.
Milos Kobliha Alejandro Cimadevilla Luis de Alba Parallel Computing Seminar GROUP 12.
Mike Smorul Saurabh Channan Digital Preservation and Archiving at the Institute for Advanced Computer Studies University of Maryland, College Park.
1 CYBERINFRASTRUCTURE FOR THE GEOSCIENCES Global Earth Observation Grid Workshop, Bangkok, Thailand, March Integration Platform.
April 2009 OSG Grid School - RDU 1 Open Science Grid John McGee – Renaissance Computing Institute University of North Carolina, Chapel.
SPRING 2011 CLOUD COMPUTING Cloud Computing San José State University Computer Architecture (CS 147) Professor Sin-Min Lee Presentation by Vladimir Serdyukov.
Lee Romero blog.leeromero.org November 2010 Enterprise taxonomy Six components of a vision.
1 Building National Cyberinfrastructure Alan Blatecky Office of Cyberinfrastructure EPSCoR Meeting May 21,
Digital Library Architecture and Technology
New Generation SDI and Cyber-Infrastructure Prof. Guoqing Li CEODE/CAS March 29, 2009, Newport Beach, USA Presented to 4th China-US Roundtable Meeting.
A.V. Bogdanov Private cloud vs personal supercomputer.
Key integrating concepts Groups Formal Community Groups Ad-hoc special purpose/ interest groups Fine-grained access control and membership Linked All content.
Web 2.0 Technology & Social Media 1. Web 2.0 Space Some of them are technological components (e.g., AJAX, RIA‘s, and XML/DHTML) Some are principles (e.g.,
Distributed Access to Data Resources: Metadata Experiences from the NESSTAR Project Simon Musgrave Data Archive, University of Essex.
Research Data at NCAR 1 August, 2002 Steven Worley Scientific Computing Division Data Support Section.
WEB TERMINOLOGIES. Page or web page: a file that can be read over the world wide web Pages or web pages: the global collection of documents associated.
University Libraries Library Systems Office. Life on MARS Mason Archival Repository Service Dorothea Salo Digital Repository Services Librarian Library.
CI Days: Planning Your Campus Cyberinfrastructure Strategy Russ Hobby, Internet2 Internet2 Member Meeting 9 October 2007.
Page 1 of European Geosciences Union Assembly Session US5: The International Polar Year April 27, 2005Vienna, Austria Vladimir Papitashvili,
Cloud Computing.
Dept. of Architecture Ina Smith UPSpace Manager.
J. WILLARD MARRIOTT LIBRARY Preserving, Promoting and Presenting Research Posters: USpace’s New Poster Archiving Service Lisa Chaufty Western CONTENTdm.
Preserving Digital Collections for Future Scholarship Oya Y. Rieger Cornell University
Volodya Papitashvili Anshuman Saxena Valeriy Petrov Robert Clauer Page 1 of 16 VGMO NET NASA/LWS Workshop: Virtual Observatories in Space and Solar Physics.
IGY+50, The IPY, and The electronic Geophysical Year (eGY) D.N. Baker Laboratory for Atmospheric and Space Physics University of Colorado, Boulder C. Barton.
Open Science Grid For CI-Days Elizabeth City State University Jan-2008 John McGee – OSG Engagement Manager Manager, Cyberinfrastructure.
Page 1 of Joint Assembly: AGU, SEG, NABS & SPD/AAS Session U08: eGY: e-Science for Geoscience May 24, 2005New Orleans, LA Volodya Papitashvili,
Page 1 of 18 NASA/LWS Workshop: Virtual Observatories in Space and Solar Physics October 27-29, 2004; Greenbelt, Maryland Dan Baker, Charlie Barton, Brian.
WDCs and GSDI David M. Clark World Data Center Panel Global Data Access and Integration Workshop May 8-9, 2000, Canberra, Australia.
What is Cyberinfrastructure? Russ Hobby, Internet2 Clemson University CI Days 20 May 2008.
Streaming Media A technique for transferring data on the Internet so it can be processed as a steady and continuous stream.
CBSOR,Indian Statistical Institute 30th March 07, ISI,Kokata 1 Digital Repository support for Consortium Dr. Devika P. Madalli Documentation Research &
DATABASE MANAGEMENT SYSTEMS CMAM301. Introduction to database management systems  What is Database?  What is Database Systems?  Types of Database.
1 Computing Challenges for the Square Kilometre Array Mathai Joseph & Harrick Vin Tata Research Development & Design Centre Pune, India CHEP Mumbai 16.
National Center for Supercomputing Applications Barbara S. Minsker, Ph.D. Associate Professor National Center for Supercomputing Applications and Department.
This presentation describes the development and implementation of WSU Research Exchange, a permanent digital repository system that is being, adding WSU.
Cyberinfrastructure What is it? Russ Hobby Internet2 Joint Techs, 18 July 2007.
IPY Education and Outreach 21 July 2005 The eGY Opportunity: Education and Outreach Emily CoBabe-Ammann eGY_Team.
CSE 102 Introduction to Computer Engineering What is Computer Engineering?
Distributed Data Analysis & Dissemination System (D-DADS ) Special Interest Group on Data Integration June 2000.
Overviews of the Library of Texas & ZLOT Project Dr. William E. Moen Principal Investigator.
Cyberinfrastructure Overview Russ Hobby, Internet2 ECSU CI Days 4 January 2008.
Cyberinfrastructure: Many Things to Many People Russ Hobby Program Manager Internet2.
System Development & Operations NSF DataNet site visit to MIT February 8, /8/20101NSF Site Visit to MIT DataSpace DataSpace.
Managing Access at the University of Oregon : a Case Study of Scholars’ Bank by Carol Hixson Head, Metadata and Digital Library Services
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Global Change Master Directory (GCMD) Mission “To assist the scientific community in the discovery of Earth science data, related services, and ancillary.
National Geospatial Enterprise Architecture N S D I National Spatial Data Infrastructure An Architectural Process Overview Presented by Eliot Christian.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Electronic Business: Concept and Applications Department of Electrical Engineering Gadjah Mada University.
Collection-Based Persistent Archives Arcot Rajasekar, Richard Marciano, Reagan Moore San Diego Supercomputer Center Presented by: Preetham A Gowda.
Introduction: AstroGrid increases scientific research possibilities by enabling access to distributed astronomical data and information resources. AstroGrid.
Dr. Ir. Yeffry Handoko Putra
Brief overview on GridICE and Ticketing System
Using computers to search electronic databases
Digital library for Earth System Education Teaching Boxes
VI-SEEM Data Repository
WIS Strategy – WIS 2.0 Submitted by: Matteo Dell’Acqua(CBS) (Doc 5b)
Unit# 5: Internet and Worldwide Web
eGY Planning Meeting Boulder, February 2005
DELNET – Developing Library Network
Presentation transcript:

The eGY Legacy: A framework for e-Discovery and e-Access (eDeA) to Scientific Data Vladimir Papitashvili, AOSS, University of Michigan The eGY General Meeting, Boulder, Colorado, March 13-14, 2007

A Legacy of IGY - World Data Centers System 20 th Century Paradigm of Sharing Data: Data were to submitted to Data Centers  Data submissions to World Data Centers (  ) were and remains voluntary.  World Data Centers require significant and continuous support (financial & manpower) for data acquisition and storage.  Many types of collected scientific data are often not suitable for World Data Centers; e.g., the quality of geomagnetic variation data does not satisfy the WDC criteria, set mainly for the standard magnetic observatory data. “Push Data” Concept ÊAlthough at present the World Data Centers provide most of their data online, they still constitute a quasi-centralized system of data collection, storage, and dissemination. Courtesy of the RAND Corporation

21 st Century Paradigm: Data are published, visualized, and shared via World Wide Web  Sharing data via multiple Virtual Observatories allows data providers achieve greater visibility among scientific & user communities.  This eliminates the ‘voluntary’ need of submitting data to World Data Centers (  ) – the centers can “pull data” from the data provider Web sites.  A fabric of interconnected data nodes (providers and secondary archives) is a new vision for distributed, self-populating data repositories. “Pull Data” Concept Courtesy of the RAND Corporation ÊBeing integrated in this (Data Fabric) cyber- infrastructure, the World Data Centers will be playing even a more important role - as clearinghouses they would need to watch the always evolving “Data Fabric” and preserve at least 2-3 copies of a particular dataset across the global network of data.

Google and many other searches engines help finding INFORMATION about scientific data in cyberspace (“data discovery”) – this is mainly based on the keywords search. What is needed? - Geo-SML descriptors to list actual data sets on the World Wide Web. These descriptors would allow Google (and others) to search actual SCIENTIFIC DATA on the Web creating “look- up” tables for real e-Access to these data (eDeA). Wikipedia: Service Modeling Language (SML) is an XML-based specification by leading information technology companies that defines a consistent way to express how computer networks, applications, servers and other IT resources are described or modeled so businesses can more easily manage the services that are built on these resources.

Google and many other searches engines help finding INFORMATION about scientific data in cyberspace (“data discovery”) – this is mainly based on the keywords search. What is needed? - Geo-SML descriptors to list actual data sets on the World Wide Web. These descriptors would allow Google (and others) to search actual SCIENTIFIC DATA on the Web creating “look- up” tables for real e-Access to these data (eDeA). Thus, a major legacy of eGY could be a common framework (Geo-SML descriptors and appropriate cyber infrastructure) developed for scientific data representing various geoscience disciplines.