Presentation is loading. Please wait.

Presentation is loading. Please wait.

Introduction to Scientific Data Grid Kai Nan Computer Network Information Center, CAS

Similar presentations


Presentation on theme: "Introduction to Scientific Data Grid Kai Nan Computer Network Information Center, CAS"— Presentation transcript:

1 Introduction to Scientific Data Grid Kai Nan Computer Network Information Center, CAS nankai@sdb.ac.cn

2 2015-7-16CANS '2002 Shanghai2 What is Scientific Data Grid one-sentence statement a grid which focuses on sharing multi-discipline scientific data and advancing cooperative research based on the utilization of scientific data more words built upon the Scientific Databases of CAS started in 2001 plan to provide service by 2004-2005 for academic and research built by CAS, open to the world

3 2015-7-16CANS '2002 Shanghai3 Scientific Databases (SDB) SDB is a project funded by CAS since 1986 SDB is a collection of scientific databases, which cover multiple disciplines including chemistry, biology, geography, astronomy, ecology, … By 2005, SDB will be 40+ member institutions across China 300+ databases data volume 10TB+ Distributed & Heterogeneous

4 2015-7-16CANS '2002 Shanghai4 Grid Computing Information Power Grid

5 2015-7-16CANS '2002 Shanghai5 why SDG – motivation resource level – sharing and development make the scientific data more accessible data integration data – information – knowledge app level – emerging scientific applications do what we can ’ t do before rely on data cross multiple databases / cross-disciplinary demand more resources (cycle, storage, bandwidth, instrument, sensor, … )

6 2015-7-16CANS '2002 Shanghai6 Requirements Identification Provenance Metadata technical/context/content/management Access Control Universal Access Interface Publishing/Discovery/Retrieval Data Lifecycle …

7 2015-7-16CANS '2002 Shanghai7 Simplified 3 steps find the data and get related info. (metadata) obtain proper rights towards the data access the data maybe multiple distributed and heterogeneous databases involved within one request maybe not just data, but processing and/or analysis these steps seem to be easy, but …

8 2015-7-16CANS '2002 Shanghai8 Tasks Testbed One data center Three subject centers Middleware Information Service Security System Data Access Interface Application chemistry/biology/astronomy/geoscience/ …

9 2015-7-16CANS '2002 Shanghai9 Data Center (CNIC) Cluster 16 nodes 15TB Bio Center Cluster 8 nodes 1-2TB Beijing Chemistry Center Cluster 8 nodes 1-2TB Shanghai SDG Resources: 20 TB 4 PC Clusters CSTNET Geo Center Cluster 8 nodes 1-2TB Beijing 1000M 155M

10 2015-7-16CANS '2002 Shanghai10 Mass StorageDatabase Application Server MDS Server CA Server Portal Server SDG Data Center Supercomputers at CNIC ~2 TFLOPS

11 2015-7-16CANS '2002 Shanghai11 Grid Middleware Globus Resource Management (GRAM) Information Service (MDS) Data Management (GridFTP) Security (GSI) Storage Request Broker (SRB) SDSC ’ s solution for data grid

12 2015-7-16CANS '2002 Shanghai12 SDG Middleware Application GAPI DRB UAI Local DBMS xMDS GSI coordinated access to multiple data resources universal access interface to single data resource local data management system, could be DBMS or file system app-oriented, unified program interface applications databases

13 2015-7-16CANS '2002 Shanghai13 Use case (1) GAPI DRB UAI DBMS Node A App GAPI DRB UAI DBMS Node H App X MDS

14 2015-7-16CANS '2002 Shanghai14 Use case (2) GAPI DRB UAI DBMS Node A App GAPI DRB UAI DBMS Node H App X MDS GAPI DRB UAI DBMS Node B App GAPI DRB UAI DBMS Node C App 1.Single sign-on 2.Query MDS 3.App  GAPI  DRB 4.DRB(H)  UAI(A, B, C) 5.UAI  local DBMS  DB

15 2015-7-16CANS '2002 Shanghai15 Use case (3) GAPI DRB UAI DBMS Node A App GAPI DRB UAI DBMS Node H App X MDS GAPI DRB UAI DBMS Node B App GAPI DRB UAI DBMS Node C App GAPI DRB UAI DBMS Node Z App X

16 2015-7-16CANS '2002 Shanghai16 Projects CAS the Tenth Five-year Program (2001-2005) – funded (37M RMB) 863 Program (by MOST) a special program for grid – proposed

17 2015-7-16CANS '2002 Shanghai17 Milestones mid 2003 testbed built end 2003 middleware developed 2004 deployment and test run 2005 applications developed and production run

18 2015-7-16CANS '2002 Shanghai18 Collaboration PRAGMA APGrid SDSC KISTI ASCC Texas A&M Univ. …

19 2015-7-16CANS '2002 Shanghai19 Thank you !


Download ppt "Introduction to Scientific Data Grid Kai Nan Computer Network Information Center, CAS"

Similar presentations


Ads by Google