Scientific Data Grid on NGI Kai Nan Computer Network Information Center Chinese Academy of Sciences CANS 2004, Miami.

Slides:



Advertisements
Similar presentations
Grid Computing and Applications in China Kai Nan Computer Network Information Center (CNIC) Chinese Academy of Sciences (CAS) 17 Mar 2008.
Advertisements

CHINA REMOTE SENSING SATELLITE GROUND STATION, CHINESE ACADEMY OF SCIENCES China Grid Research Update Li Guoqing, RSGS, CAS, China.
The National Grid Service and OGSA-DAI Mike Mineter
Development of China-VO ZHAO Yongheng NAOC, Beijing Nov
FP7-INFRA Enabling Grids for E-sciencE EGEE Induction Grid training for users, Institute of Physics Belgrade, Serbia Sep. 19, 2008.
MTA SZTAKI Hungarian Academy of Sciences Grid Computing Course Porto, January Introduction to Grid portals Gergely Sipos
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
1 CENTER FOR PARALLEL COMPUTERS An Introduction to Globus Toolkit® 3 -Developing Interoperable Grid services.
Grid Activity in China Sun Gongxing, IHEP, Beijing.
The Cactus Portal A Case Study in Grid Portal Development Michael Paul Russell Dept of Computer Science The University of Chicago
Massimo Cafaro GridLab Review GridLab WP10 Information Services Massimo Cafaro CACT/ISUFI University of Lecce, Italy.
GGF Toronto Spitfire A Relational DB Service for the Grid Peter Z. Kunszt European DataGrid Data Management CERN Database Group.
Office of Science U.S. Department of Energy Grids and Portals at NERSC Presented by Steve Chan.
DataGrid Kimmo Soikkeli Ilkka Sormunen. What is DataGrid? DataGrid is a project that aims to enable access to geographically distributed computing power.
Grid Services at NERSC Shreyas Cholia Open Software and Programming Group, NERSC NERSC User Group Meeting September 17, 2007.
International Workshop APAN 24, Current State of Grid Computing Researches and Applications in Vietnam Nguyen Thanh Thuy 1, Nguyen Kim Khanh 1,
GIS at SDSC Domains: –From geology, environmental science, hydrology, ocean biodiversity, regional development, Katrina response, archaeology, to neuroscience.
Introduction to Scientific Data Grid Kai Nan Computer Network Information Center, CAS
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.
Scientific Data Infrastructure in CAS Dr. Jianhui Scientific Data Center Computer Network Information Center Chinese Academy of Sciences.
Enterprise Manager
1 School of Computer, National University of Defense Technology A Profile on the Grid Data Engine (GridDaEn) Xiao Nong
Service Computation 2010November 21-26, Lisbon.
QCDGrid Progress James Perry, Andrew Jackson, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
National Center for Supercomputing Applications NCSA OPIE Presentation November 2000.
Contents 1.Introduction, architecture 2.Live demonstration 3.Extensibility.
DATABASE MANAGEMENT SYSTEMS IN DATA INTENSIVE ENVIRONMENNTS Leon Guzenda Chief Technology Officer.
Data Management BIRN supports data intensive activities including: – Imaging, Microscopy, Genomics, Time Series, Analytics and more… BIRN utilities scale:
E-Science Data Information and Knowledge Transformation Edikt : e-Science Data, Information and Knowledge Transformation E-Science Centres of Excellence.
Resource Brokering in the PROGRESS Project Juliusz Pukacki Grid Resource Management Workshop, October 2003.
WP8 Meeting Glenn Patrick1 LHCb Grid Activities in UK Grid WP8 Meeting, 16th November 2000 Glenn Patrick (RAL)
An Introduction to Scientific Data Grid LUO Ze Computer Network Information Centre, Chinese Academy of Sciences.
Large Scale Parallel File System and Cluster Management ICT, CAS.
Tools for collaboration How to share your duck tales…
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
INFSO-RI Enabling Grids for E-sciencE OGSA DAI Data Access and Integration Marek Ciglan Institute of Informatics, Slovac Academy.
MTA SZTAKI Hungarian Academy of Sciences Introduction to Grid portals Gergely Sipos
EGEE-II INFSO-RI Enabling Grids for E-sciencE The GILDA training infrastructure.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Scientific Data Grid & China-VO Kai Nan Computer Network Information Center Chinese Academy of Sciences November 27, 2003.
August 3, March, The AC3 GRID An investment in the future of Atlantic Canadian R&D Infrastructure Dr. Virendra C. Bhavsar UNB, Fredericton.
Edinburgh e-Science MSc Bob Mann Institute for Astronomy & NeSC University of Edinburgh.
Internet2 AdvCollab Apps 1 Access Grid Vision To create virtual spaces where distributed people can work together. Challenges:
1 e-Science AHM st Aug – 3 rd Sept 2004 Nottingham Distributed Storage management using SRB on UK National Grid Service Manandhar A, Haines K,
A Collaborative Research Environment for Avian Flu Research Luo Ze Computer Network Information Center, CAS
Globus and PlanetLab Resource Management Solutions Compared M. Ripeanu, M. Bowman, J. Chase, I. Foster, M. Milenkovic Presented by Dionysis Logothetis.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
PROGRESS: GEW'2003 Using Resources of Multiple Grids with the Grid Service Provider Michał Kosiedowski.
DGC Paris Spitfire A Relational DB Service for the Grid Leanne Guy Peter Z. Kunszt Gavin McCance William Bell European DataGrid Data Management.
The National Grid Service Mike Mineter.
China Grid Activities update Li Guoqing NRSCC, China 13,Sept, 2005 Present for WGISS-20, Kiev.
The Storage Resource Broker and.
Gennaro Tortone, Sergio Fantinel – Bologna, LCG-EDT Monitoring Service DataTAG WP4 Monitoring Group DataTAG WP4 meeting Bologna –
Gang Chen, Institute of High Energy Physics Feb. 27, 2012, CHAIN workshop,Taipei Co-ordination & Harmonisation of Advanced e-Infrastructures Research Infrastructures.
A System for Monitoring and Management of Computational Grids Warren Smith Computer Sciences Corporation NASA Ames Research Center.
European and Chinese Cooperation on Grid Virtual Laboratory: Exploring e-Science in CAS CNIC,CAS Jianjun Yu
Enabling Grids for E-sciencE EGEE-II INFSO-RI Status of SRB/SRM interface development Fu-Ming Tsai Academia Sinica Grid Computing.
ChinaGrid: National Education and Research Infrastructure Hai Jin Huazhong University of Science and Technology
The Holmes Platform and Applications
Sun Gongxing, IHEP, Beijing
Open Source distributed document DB for an enterprise
StratusLab Final Periodic Review
StratusLab Final Periodic Review
Globus —— Toolkits for Grid Computing
Grid Portal Services IeSE (the Integrated e-Science Environment)
Scientific Data Grid and e-Science
Signet & Privilege Management
Google Sky.
Computer Network Information Center, Chinese Academy of Sciences
Presentation transcript:

Scientific Data Grid on NGI Kai Nan Computer Network Information Center Chinese Academy of Sciences CANS 2004, Miami

Agenda Background & History Scientific Data Grid Next Steps with NGI

Background SDG is built upon the mass scientific data resources of the Scientific Database (SDB). Scientific Data Grid (SDG) is a typical project of CAS e-Science, also a pilot. The vision of SDG is to take valuable data resources into full play by benefiting from advanced information technologies, in particular, the Grid technology.

Scientific Database (SDB) SDB is a long-term project since 1983, in which there are multi-disciplinary scientific data accumulated through the course of science activities in CAS. many institutes involved long-term, large-scale collaboration data from research, for research

SDB status 45 institutes across 16 cities 388 databases (322 online) 13TB total volume (7.7TB online)

e-Science CAS Informatization Program –e-Science and ARP Scientific data is one of three poles of the e-infrastructure –Networks –Computing –Data SDG is a project of CAS e-Science

Milestones In 2000, the Scientific Database (SDB) project renewed fund by CAS 10th Five-year Program In March 2001, proposed “Scientific Data Grid” In October 2002, SDG joined the China National Grid (fund from MOST) In Nov 2003, SDG Middleware v1.0 released In July 2004, SDG got fund from NSFC In Sep 2004, SDG renewed fund from MOST In Oct 2004, DeepComp 6800 for SDG installed In Nov 2004, SDG Middleware v2.0 released

Supported by Chinese Academy of Sciences (CAS) –Informatization Program Ministry of Science and Technology of China (MOST) –863 Program/China National Grid National Science Foundation of China (NSFC) –Network-based Science and Research Environment (aka. NSFC e-Science)

What we do for SDG System Platform SDG Middleware Demo Applications

SDG System Platform Data Center –59 nodes of DeepComp 6800 –SAN Storage 20TB Disk Array 50TB Tape Library –TFLOPS-scale computing

SDG Software Modules

Security System SDG Middleware Application Grid API Data Res. Broker Uniform Access Int. Local Data System Info. Service coordinated access to multiple data resources uniform access interface to single data resource local data management system, could be DBMS or file system app-oriented, unified program interface applications databases

SDG Middleware and ToolKits SDG Middleware – Grid Information System –SDG Data Access System –SDG Security System SDG Toolkits SDG GIS V1.0 Universal Metadata Tool V2.0 Statistics Tool V1.1

SDG GIS V1.0 Backend MDS/LDAP Two types of Information –System info –Metadata Management and Service –Centralized –Distributed Query GRIP GRRP MDRP SDG GIIS SDG Sub-GIIS SDG Applications MDW MDIS C-MDISI-MDIS C-MDIS DCIS

SDG Universal Metadata Tool MDIS (LDAP) interim XML MD schema User page Process (Java bean) XML engine install & configure universal, extensible customizable - -metadata is tree-like and more flexible than fix-column tables, difficult to deal with on web UI - -use xml files to store interim results

Universal Metadata Management Tool

Windows 2k/xp Java 1.4 GT3 Core Statistics Services

Statistics & Analysis Tool (SAT) for Data Volume Features –Win2000/XP, Linux –Java 1.4 –Globus Toolkit 3 Core –Oracle, SQL Server, File System Deploy –Data nodes: 45 institutes at CAS, across 16 cities in China –Mediator: CNIC –Service Monitor

SDG Middleware and ToolKits SDG Middleware –Grid Information System – SDG Data Access System –SDG Security System SDG Toolkits Data Access Subsystem 1.0

SDG Data Access Service Framework Interne t Oracle SQLServer FileSystem mySQL DB2 Foxpro …… Information Service …… Application ClientsGrid Level Services Member Institutes Node Level Services & Data Resources

Data Access

SDG Middleware and ToolKits SDG Middleware –Grid Information System –SDG Data Access System – SDG Security System SDG Toolkits SDG CA V1.0 Access Control Toolkit V1.1

SDG Security System Client CUKUCUKU C UP1 K UP1 App. GAPI DRB UAI SDG-IS C MDS /K MDS C APP /K APP C UP2 /K UP2 C DRB /K DRB C UP3 /K UP3 C UAI /K UAI DBMSLACL Map Step1 Create user proxy Authen. C UP1 VS C APP Step2 Step4 Authen. C UP2 VS C DRB Step3 Authen. C UP2 VS C IS Authen. C UP3 VS C IS Step5 Step6 Authen. C UP3 VS C UAI Step7 Authen. C UAI VS C IS Step10 Access data Step8 Map global cert to local role Step9 Role-based access control C X, K X X’s Cert & Key UP1, UP2, … User Proxy, 2nd-level User Proxy, … Full Process of security-related operations under SDG Security System GSI based Use certificates to identify users Role-based local access control

Security Subsystem

SDG Middleware and ToolKits SDG Middleware –Grid Information System –SDG Uniform Access Interface –SDG Security System SDG Toolkits SDG Portal Image Process Tool 1.0 Storage Sharing Service

Demo Applications China Virtual Observatory –National Astronomical Observatory, CAS –Grid Services wrapping up astronomical data and code –quite a few services ready now HEP …

Training and Deployment SDB Technical Training –more than 100 participants –once a year SDB Work Evaluation Online –important to impel deployment of SDG middleware Distance training with partners –CNIC-UCSD/SDSC, February 2004

PRAGMA

SDG - good application on CNGI

Next Steps with NGI With higher bandwidth (e.g.. CNGI) –Mass data transmission, better data services –Data intensive applications using distributed superservers (not easy now) –Share data securely (often underestimated)

Next Steps with NGI 11th five-year Program ( ) –to build some subject data centers, which are well connected by NGI –to run SDG system platform routinely –to get SDG Middleware aware of NGI, and steady –to expand SDG beyond CAS –to develop more *real* science applications

Thank you!