Scientific Data Grid & China-VO Kai Nan Computer Network Information Center Chinese Academy of Sciences November 27, 2003.

Slides:



Advertisements
Similar presentations
Inetrconnection of CNGrid and European Grid Infrastructure Depei Qian Beihang University Feb. 20, 2006.
Advertisements

Data Management Expert Panel - WP2. WP2 Overview.
FP7-INFRA Enabling Grids for E-sciencE EGEE Induction Grid training for users, Institute of Physics Belgrade, Serbia Sep. 19, 2008.
High Performance Computing Course Notes Grid Computing.
Hydra Partners Meeting March 2012 Bill Branan DuraCloud Technical Lead.
DataGrid is a project funded by the European Union 22 September 2003 – n° 1 EDG WP4 Fabric Management: Fabric Monitoring and Fault Tolerance
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
USING THE GLOBUS TOOLKIT This summary by: Asad Samar / CALTECH/CMS Ben Segal / CERN-IT FULL INFO AT:
CoreGRID Workpackage 5 Virtual Institute on Grid Information and Monitoring Services Authorizing Grid Resource Access and Consumption Erik Elmroth, Michał.
Massimo Cafaro GridLab Review GridLab WP10 Information Services Massimo Cafaro CACT/ISUFI University of Lecce, Italy.
GGF Toronto Spitfire A Relational DB Service for the Grid Peter Z. Kunszt European DataGrid Data Management CERN Database Group.
A Model for Grid User Management Rich Baker Dantong Yu Tomasz Wlodek Brookhaven National Lab.
DataGrid Kimmo Soikkeli Ilkka Sormunen. What is DataGrid? DataGrid is a project that aims to enable access to geographically distributed computing power.
Introduction to Scientific Data Grid Kai Nan Computer Network Information Center, CAS
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Makrand Siddhabhatti Tata Institute of Fundamental Research Mumbai 17 Aug
QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.
Apache Airavata GSOC Knowledge and Expertise Computational Resources Scientific Instruments Algorithms and Models Archived Data and Metadata Advanced.
Scientific Data Infrastructure in CAS Dr. Jianhui Scientific Data Center Computer Network Information Center Chinese Academy of Sciences.
Grid Information Systems. Two grid information problems Two problems  Monitoring  Discovery We can use similar techniques for both.
Presenter: Dipesh Gautam.  Introduction  Why Data Grid?  High Level View  Design Considerations  Data Grid Services  Topology  Grids and Cloud.
A Metadata Catalog Service for Data Intensive Applications Presented by Chin-Yi Tsai.
Scientific Data Grid on NGI Kai Nan Computer Network Information Center Chinese Academy of Sciences CANS 2004, Miami.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
1 School of Computer, National University of Defense Technology A Profile on the Grid Data Engine (GridDaEn) Xiao Nong
ESP workshop, Sept 2003 the Earth System Grid data portal presented by Luca Cinquini (NCAR/SCD/VETS) Acknowledgments: ESG.
QCDGrid Progress James Perry, Andrew Jackson, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
Grid Technologies  Slide text. What is Grid?  The World Wide Web provides seamless access to information that is stored in many millions of different.
The ACGT Workflow Editing & Enactment Environment Giorgos Zacharioudakis Institute of Computer Science, Foundation for Research & Technology – Hellas (ICS-FORTH)
Using NMI Components in MGRID: A Campus Grid Infrastructure Andy Adamson Center for Information Technology Integration University of Michigan, USA.
The Grid System Design Liu Xiangrui Beijing Institute of Technology.
1 4/23/2007 Introduction to Grid computing Sunil Avutu Graduate Student Dept.of Computer Science.
09/02 ID099-1 September 9, 2002Grid Technology Panel Patrick Dreher Technical Panel Discussion: Progress in Developing a Web Services Data Analysis Grid.
Ames Research CenterDivision 1 Information Power Grid (IPG) Overview Anthony Lisotta Computer Sciences Corporation NASA Ames May 2,
Institute For Digital Research and Education Implementation of the UCLA Grid Using the Globus Toolkit Grid Center’s 2005 Community Workshop University.
Holding slide prior to starting show. A Portlet Interface for Computational Electromagnetics on the Grid Maria Lin and David Walker Cardiff University.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
INFSO-RI Enabling Grids for E-sciencE OGSA DAI Data Access and Integration Marek Ciglan Institute of Informatics, Slovac Academy.
Presented by Scientific Annotation Middleware Software infrastructure to support rich scientific records and the processes that produce them Jens Schwidder.
GRIDS Center Middleware Overview Sandra Redman Information Technology and Systems Center and Information Technology Research Center National Space Science.
Cyberinfrastructure What is it? Russ Hobby Internet2 Joint Techs, 18 July 2007.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Presented by Jens Schwidder Tara D. Gibson James D. Myers Computing & Computational Sciences Directorate Oak Ridge National Laboratory Scientific Annotation.
The Global Land Cover Facility is sponsored by NASA and the University of Maryland.The GLCF is a founding member of the Federation of Earth Science Information.
Overview of Privilege Project at Fermilab (compilation of multiple talks and documents written by various authors) Tanya Levshina.
Scalable Grid system– VDHA_Grid: an e-Science Grid with virtual and dynamic hierarchical architecture Huang Lican College of Computer.
US LHC OSG Technology Roadmap May 4-5th, 2005 Welcome. Thank you to Deirdre for the arrangements.
Enabling e-Research in Combustion Research Community T.V Pham 1, P.M. Dew 1, L.M.S. Lau 1 and M.J. Pilling 2 1 School of Computing 2 School of Chemistry.
1 e-Science AHM st Aug – 3 rd Sept 2004 Nottingham Distributed Storage management using SRB on UK National Grid Service Manandhar A, Haines K,
NeuroLOG ANR-06-TLOG-024 Software technologies for integration of process and data in medical imaging A transitional.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
Cyberinfrastructure Overview Russ Hobby, Internet2 ECSU CI Days 4 January 2008.
Cyberinfrastructure: Many Things to Many People Russ Hobby Program Manager Internet2.
A Technical Overview Bill Branan DuraCloud Technical Lead.
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
1 AHM, 2–4 Sept 2003 e-Science Centre GRID Authorization Framework for CCLRC Data Portal Ananta Manandhar.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI How to integrate portals with the EGI monitoring system Dusan Vudragovic.
MGRID Architecture Andy Adamson Center for Information Technology Integration University of Michigan, USA.
The GRIDS Center, part of the NSF Middleware Initiative Grid Security Overview presented by Von Welch National Center for Supercomputing.
Gennaro Tortone, Sergio Fantinel – Bologna, LCG-EDT Monitoring Service DataTAG WP4 Monitoring Group DataTAG WP4 meeting Bologna –
All Hands Meeting 2005 BIRN-CC: Building, Maintaining and Maturing a National Information Infrastructure to Enable and Advance Biomedical Research.
Collection-Based Persistent Archives Arcot Rajasekar, Richard Marciano, Reagan Moore San Diego Supercomputer Center Presented by: Preetham A Gowda.
INTRODUCTION TO GRID & CLOUD COMPUTING U. Jhashuva 1 Asst. Professor Dept. of CSE.
E-science grid facility for Europe and Latin America Updates on Information System Annamaria Muoio - INFN Tutorials for trainers 01/07/2008.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
Accessing the VI-SEEM infrastructure
Globus —— Toolkits for Grid Computing
Middleware independent Information Service
Presentation transcript:

Scientific Data Grid & China-VO Kai Nan Computer Network Information Center Chinese Academy of Sciences November 27, 2003

Outline Data Grid & VO Requirements Experiences on SDB Scientific Data Grid (SDG) Progress Update SDG & China-VO

Data Grid Grid –resource sharing –collaborative problem-solving Data Grid –more focus on data –(scientific) data become one footstone of modern sciences and research –data sharing is crucial to most scientists today

Typical Requirements towards Data Grid Identification Provenance Metadata –technical / context / content / management Access Control Universal Access Interface Publishing / Discovery / Retrieval Data Lifecycle …

Simplified 3 Steps Find the data –and get related info. (metadata) Obtain proper rights towards the data Access the data –maybe multiple distributed and heterogeneous databases involved within one request –maybe not just data, but processing and/or analysis these steps seem to be easy, but …

Grid Information Service Step 1-- To find the data Requirements –Define metadata schema resource discovery –answer to “What, How” – intrinsic properties of data –relatively static metadata, generated by man location & monitoring –answer to “Where, When” – extrinsic properties of data –dynamic information, generated by program –Define API Publish / Collect Query

Grid Security System Step 2 -- To ensure that data be accessed rightly Requirements –Single Sign-On –Delegation –Universal credentials –Integration with local policies –Policy management –Data-oriented access control –User-based trust/trusteeship –Logging –Open architecture & Interoperability with other Grids

Uniform Data Access Step 3 -- To get the data easily Requirements –Uniform access interface to single data resource –Coordinated access to multiple data resources –App-oriented, unified and convenient program interface –Schedule policy –Data replication –Data quality assurance

Our Experiences on SDB SDB – Scientific Database –a project funded by CAS since 1986 –a collection of scientific databases, which cover multiple disciplines including chemistry, biology, geography, astronomy, ecology, … By now, SDB has –45 member institutions across China –296 databases –data volume 8.2TB

SDB Characteristics & Challenges Characteristics –Distributed –Heterogeneous Challenges –Requirements for data sharing –More collaborative work across multi-sites and multi- disciplines –More collaborations with colleagues across the world under Knowledge Innovation Program of CAS –The data are from research, and for research.  Scientific Data Grid !

Scientific Data Grid (SDG) one-sentence statement –a grid which focuses on sharing multi-discipline scientific data and advancing cooperative research based on the utilization of scientific data more words –built upon the Scientific Database (SDB) of CAS –started in 2001 –plan to provide service by –for academic and research –built by CAS, open to the world

SDG Vision Resource Level – sharing and development –make scientific data more accessible –data integration –data  information  knowledge App Level – enabling e-Science applications –complex problem-resolving with heavy use of data –cross multiple databases / cross-disciplinary –demand more resources (cycle, storage, bandwidth, instrument, sensor, …)

Security System SDG Middleware Application Grid API Data Res. Broker Uniform Access Int. Local Data System Info. Service coordinated access to multiple data resources uniform access interface to single data resource local data management system, could be DBMS or file system app-oriented, unified program interface applications databases

SDG Information Service SDG Info. Service –DCIS: Data Container Info. Service built on Globus MDS design DIT for SDG (schema, OID, namespace) develop a program which collects information and returns it as LDIF, called info. provider configure a new MDS –MDIS: MetaData Info. Service actually a normal LDAP add ldbm-backend to MDS in order to store static metadata develop the metadata tool to manage MDIS –Compatible with Globus MDS 2.1 –Future plans: extend the infrastructure with Grid Services

SDG Information Service (cont’d) Query GRIP GRRP MDRP SDG GIIS SDG Sub-GIIS SDG Applications C-MDW MDIS C-MDISI-MDIS C-MDIS DCIS

SDG Universal Metadata Tool Requirements –why universal many disciplines in SDG  similarly many or more metadata standards it’s not good for us to develop a tool for every metadata schema individually input metadata for existing databases is more bothersome, so an ease-to-use tool might be must-have in practice –input: a metadata schema (xml DTD) –output: Web-based, customizable UI LDAP-based Storage Management functions (add, delete, modify and query) –back-end is MDIS

SDG Universal Metadata Tool MDIS (LDAP) interim XML MD schema User page Process (Java bean) XML engine install & configure universal, extensible customizable -metadata is tree-like and more flexible than fix-column tables, difficult to deal with on web UI -use xml files to store interim results

SDG Security System Services –Authentication (Based on Globus GSI) secure connection user proxy management –Authorization mapping global certificates to local roles role-based access control local role management –Accounting

SDG Security System (cont’d) Client CUKUCUKU C UP1 K UP1 App. GAPI DRB UAI SDG-IS C MDS /K MDS C APP /K APP C UP2 /K UP2 C DRB /K DRB C UP3 /K UP3 C UAI /K UAI DBMSLACL Map Step1 Create user proxy Authen. C UP1 VS C APP Step2 Step4 Authen. C UP2 VS C DRB Step3 Authen. C UP2 VS C IS Authen. C UP3 VS C IS Step5 Step6 Authen. C UP3 VS C UAI Step7 Authen. C UAI VS C IS Step10 Access data Step8 Map global cert to local role Step9 Role-based access control C X, K X X’s Cert & Key UP1, UP2, … User Proxy, 2nd-level User Proxy, … Full Process of security-related operations under SDG Security System

SDG Uniform Access Interface Interne t Oracle SQLServer FileSystem mySQL DB2 Foxpro …… Information Service …… Application ClientsGrid Level Services Member Institutes Node Level Services & Data Resources

SDG Uniform Access Interface (cont’d) OGSA-based Two Levels Services –Node level Data services on single node –Grid level Data services cross multiple nodes Data services –Data Query –Data Analysis –Data Processing –Data Replica –……

Progress Update SDG Middleware Tools and Services –Universal Metadata Tool, V2.0 –Local Access Control Tool, V1.0 –Certificate Management System, V1.0 –Statistics Services of Data Volume (SAT), V1.1 –Image Process Services, V1.0

SAT Architecture

Deployment on node- level institutes

Web Application Client of SAT Grid Level Service

China-VO on SDG Supported by MOST (863 Program) an application grid of CNGrid China-VO is an app. of SDG in this project

Thank you!