Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre

Slides:



Advertisements
Similar presentations
Putting the Pieces Together Grace Agnew Slide User Description Rights Holder Authentication Rights Video Object Permission Administration.
Advertisements

The Replica Location Service In wide area computing systems, it is often desirable to create copies (replicas) of data objects. Replication can be used.
DELOS Highlights COSTANTINO THANOS ITALIAN NATIONAL RESEARCH COUNCIL.
Retrieval of Information from Distributed Databases By Ananth Anandhakrishnan.
ASCR Data Science Centers Infrastructure Demonstration S. Canon, N. Desai, M. Ernst, K. Kleese-Van Dam, G. Shipman, B. Tierney.
An Operational Metadata Framework For Searching, Indexing, and Retrieving Distributed GIServices on the Internet By Ming-Hsiang.
Spatial Data Infrastructure: Concepts and Components Geog 458: Map Sources and Errors March 6, 2006.
Connect. Communicate. Collaborate Click to edit Master title style MODULE 1: perfSONAR TECHNICAL OVERVIEW.
GGF Toronto Spitfire A Relational DB Service for the Grid Peter Z. Kunszt European DataGrid Data Management CERN Database Group.
Robust Tools for Archiving and Preserving Digital Data Joseph JaJa, Mike Smorul, and Mike McGann Institute for Advanced Computer Studies Department of.
Internet Resources Discovery (IRD) IBM DB2 Digital Library Thanks to Zvika Michnik and Avital Greenberg.
Grids and Grid Technologies for Wide-Area Distributed Computing Mark Baker, Rajkumar Buyya and Domenico Laforenza.
Development of Japanese GIS Tool for use in the Humanities ○ Masatoshi ISHIKAWA †, Yoichi KAWANISHI ††, Hidefumi OKUMURA †††, Shoichiro HARA †††† † University.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
Digital Library Architecture and Technology
Interoperability ERRA System.
The GeoConnections Discovery Portal Michael Robson MacDonald Dettwiler and Associates Brian McLeod, Michael Adair Natural Resources Canada.
Data Management Kelly Clynes Caitlin Minteer. Agenda Globus Toolkit Basic Data Management Systems Overview of Data Management Data Movement Grid FTP Reliable.
Presenter: Dipesh Gautam.  Introduction  Why Data Grid?  High Level View  Design Considerations  Data Grid Services  Topology  Grids and Cloud.
DISTRIBUTED COMPUTING
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
1 School of Computer, National University of Defense Technology A Profile on the Grid Data Engine (GridDaEn) Xiao Nong
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
Jean François Doyon Tom Kralidis June 2003 Services Overview.
OEI’s Services Portfolio December 13, 2007 Draft / Working Concepts.
DATABASE MANAGEMENT SYSTEMS IN DATA INTENSIVE ENVIRONMENNTS Leon Guzenda Chief Technology Officer.
Production Data Grids SRB - iRODS Storage Resource Broker Reagan W. Moore
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES Data Replication Service Sandeep Chandra GEON Systems Group San Diego Supercomputer Center.
Alexandria Digital Earth ProtoType DIGITAL LIBRARIES AND ENVIRONMENTAL INFORMATION Terence R. Smith Alexandria Digital Library Project.
Freelib: A Self-sustainable Digital Library for Education Community Ashraf Amrou, Kurt Maly, Mohammad Zubair Computer Science Dept., Old Dominion University.
National Center for Supercomputing Applications Barbara S. Minsker, Ph.D. Associate Professor National Center for Supercomputing Applications and Department.
The Replica Location Service The Globus Project™ And The DataGrid Project Copyright (c) 2002 University of Chicago and The University of Southern California.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
GO-ESSP Workshop, LLNL, Livermore, CA, Jun 19-21, 2006, Center for ATmosphere sciences and Earthquake Researches Construction of e-science Environment.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
State Key Laboratory of Resources and Environmental Information System China Integration of Grid Service and Web Processing Service Gao Ang State Key Laboratory.
GIS data sources; catalogs of data and services. USGS: National Mapping.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
GBIF Data Access and Database Interoperability 2003 Work Programme Overview Donald Hobern, GBIF Programme Officer for Data Access and Database Interoperability.
Mercury – A Service Oriented Web-based system for finding and retrieving Biogeochemical, Ecological and other land- based data National Aeronautics and.
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
7. Grid Computing Systems and Resource Management
Distributed Data Analysis & Dissemination System (D-DADS ) Special Interest Group on Data Integration June 2000.
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
National Geospatial Enterprise Architecture N S D I National Spatial Data Infrastructure An Architectural Process Overview Presented by Eliot Christian.
May 2010 GGIM, New York City The National System for Coordination of Territorial Information SNIT NSDI of Chile.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
System Software Laboratory Databases and the Grid by Paul Watson University of Newcastle Grid Computing: Making the Global Infrastructure a Reality June.
Towards a High Performance Extensible Grid Architecture Klaus Krauter Muthucumaru Maheswaran {krauter,
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
Service Oriented Architecture (SOA) Prof. Wenwen Li School of Geographical Sciences and Urban Planning 5644 Coor Hall
Chapter 1 Characterization of Distributed Systems
GeoNetwork OpenSource: Geographic data sharing for everyone
GISELA & CHAIN Workshop Digital Cultural Heritage Network
Distributed Cache Technology in Cloud Computing and its Application in the GIS Software Wang Qi Zhu Yitong Peng Cheng
Access Grid and USAID November 14, 2007
Flanders Marine Institute (VLIZ)
GSAF Grid Storage Access Framework
GSAF Grid Storage Access Framework
Distributed Systems Bina Ramamurthy 11/30/2018 B.Ramamurthy.
Geospatial Data Use and sharing Concepts
Distributed Systems Bina Ramamurthy 12/2/2018 B.Ramamurthy.
Large Scale Distributed Computing
GISELA & CHAIN Workshop Digital Cultural Heritage Network
Google Sky.
Distributed Systems Bina Ramamurthy 4/22/2019 B.Ramamurthy.
Introduction To Distributed Systems
Presentation transcript:

Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre

Content  Introduction to Grid  Demands from Digital Archive  DAGS Infrastructure and Architecture MSC ( Metadata & Storage Controller)  DORE Grid Service  Geospatial Grid Service  Summary

What is Grid   the Grid is a service for sharing computer power and data storage capacity over the Internet. The Grid goes well beyond simple communication between computers, and aims ultimately to turn the global network of computers into one vast computational resource (Web is a service for sharing information over the Internet)

Important Issues of NDAP  Intellectual property rights  Time, Space and Language Coordination  Multi-lingual issues (encoding)  Public information systems  Meta-language and Documentation Metadata Content Markup References and Linking  Dissemination and Sharing  Cooperation and collaboration  Scalability, Adaptability and Durability

Demands of Digital Archive  Persistent digital objects,  Well-organized information structure for effective content management  Efficient and accurate information retrieval mechanism  Flexible services for variant users needs  Consistency  Integrate relationship between information management and data management  High-performance remote data access  Authentication and authorization  Resource discovery and monitoring

What we shall supply  Metadata System Integration  Reliable and efficient storage system Reliable replication system -> replica locating mechanism Reduced query latency ->query routing scheme Load sharing Robust, high availability Manageability High Throughput Adaptive Transparency of location and protocol

Challenge  Big Challenge of IT for cataloging, searching, retrieval, management, identification, knowledge discovery, and integration  Integration and Retrieval of Information Resources

Approach  Develop Grid Services that can integrate heterogeneous metadata systems, distributed database management systems and geospatial information systems.  Provide a framework to exchange different metadata XML documents (EAD, DC, FGDC … ) in “ National Digital Archives Program ”.

Digital Archive Grid Service Infrastructure Data Grid Nodes Digital Archive Portal Participant Node Service Metadata Participant Node Service Metadata Object Index Data Detailed Object Data Aggregated Data Detailed Object Data Aggregated Data XML Data Service Metadata User Requests

Building Grid Service for DORE  DORE (Document REtrieval) is A middleware A library A tool for programmers to develop metadata database applications  DORE is a tool in Open Digital Archive Environment (ODAE).  Migrate DORE applications to Grid Infrastructure, and also have backward compatibility to existing system.

The UML of DORE Grid Service

Dore Grid Service GUI Client

Next Steps of DORE Grid  Security Issue Add CA authority in framework and archieveinter-organizational data sharing. Security management.  Deploy DORE Grid Service in organizations  Other organization could build their own client application to use this framework  DORE Grid systems deploy in orgs.  Data sharing between organizations.  WSRF & GT4 ?

Geospatial Grid Service  Three basic categories of GIS Grid Services: Data Services Processing Services Catalog Services

Services Architecture Approach Applications e.g. Historical research planning, Administrative boundary Changes Create map Services e.g., Metadata Service, Gazetteer service, Web Map Service Data e.g., topographic, thematic, imagery, toponymy, metadata Users Other Applications For Example… Find a historical map? Find place names in Qing Dynasty? uses Metadata Service, Gazetteer Service, Web map service based on Base historical maps, Geographical Names, Map features

Distributed Spatial Data Infrastructure Environment Service Catalog DGS DOREGrid service interface GGS Symbol Key web client DGSWMS clients other data GGS Catalog geodata DGS data metadata register application server Project Site other data GGS Catalog geodata DGS data metadata other data GGS geodata data metadata Search Gateway multiviewer Node 1Node 3Node 2 other distributed servers GeospatialGrid services interface

Metadata Service for Geospatial Data

MSC  MetaData & Storage controller

Plan the strategy DAGS is evolving to be an interoperable network of databases and information technology tools using Web services and Grid technologies.  In the near term, DAP will provide a national metadata registry of the available data with open interfaces through Grid service.  Building on the contents of this registry, DAGS will provide its own central portal that enables simultaneous queries against different databases held by distributed, even worldwide sources.  In the long term, different level objects can be linked to the system.  These will facilitate and enable data mining of unprecedented utility and e-Science.

Summary  Achievements 1. The Grid services cooperate with Geospatial Information system was developed and tested. 2.The DORE Grid middleware was implemented and rebuilded. 3. The metadata register of different provider and databases were completed. 4. Prototype of MSC (Metadata & Storage Controller)  Future Works 1.Integrate heterogeneous storage system and metadata system. 2.Refining the technologies of Data Grid 3.Developing the knowledge and e-Science discovery

Question? (demo)