Tieniu TAN Deputy Secretary-General Chinese Academy of Sciences (CAS) 29 Mar. 2010, Irvine, USA The 4th China-US Roundtable on Scientific Data Cooperation.

Slides:



Advertisements
Similar presentations
Microsoft Research Microsoft Research Jim Gray Distinguished Engineer Microsoft Research San Francisco SKYSERVER.
Advertisements

Inetrconnection of CNGrid and European Grid Infrastructure Depei Qian Beihang University Feb. 20, 2006.
Jorge Gasós Grid Technologies Unit European Commission The EU e Infrastructures Programme Workshop, Beijing, June 2005.
Grid-enabled Research Activities in CAS Kai Nan Computer Network Information Center (CNIC) Chinese Academy of Sciences (CAS) Shanghai, 21 Feb 2006.
Introduction to the Cooperation between CAS and DRIVER National Science Library,CAS Jianxia Ma Xiaolin Zhang Zhongming Zhu DRIVER Confederation.
Development of China-VO ZHAO Yongheng NAOC, Beijing Nov
China Academy of Transportation Sciences March 07, 2012 Dr. Jia wenzheng.
1 Cyberinfrastructure Framework for 21st Century Science & Engineering (CIF21) NSF-wide Cyberinfrastructure Vision People, Sustainability, Innovation,
1 Cyberinfrastructure Framework for 21st Century Science & Engineering (CF21) IRNC Kick-Off Workshop July 13,
FP6−2004−Infrastructures−6-SSA Data Grid Infrastructure for YBJ-ARGO Cosmic-Ray Project Gang CHEN, Hongmei ZHANG - IHEP.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
E-Science Workshop, Santiago de Chile, 23./ KIT ( Frank Schmitz Forschungszentrum Karlsruhe Institut.
Is 'Designing' Cyberinfrastructure - or, Even, Defining It - Possible? Peter A. Freeman National Science Foundation January 29, 2007 The views expressed.
Introduction to Scientific Data Grid Kai Nan Computer Network Information Center, CAS
New Generation SDI and Cyber-Infrastructure Prof. Guoqing Li CEODE/CAS March 29, 2009, Newport Beach, USA Presented to 4th China-US Roundtable Meeting.
HPC and e-Infrastructure Development in China’s High- tech R&D Program Danfeng Zhu Sino-German Joint Software Institute (JSI), Beihang University Dec.
China’s Scientific Data Sharing Initiatives and Future Perspective Pro. Peng, Jie Dr. Liu, Runda 5 March 2012,
Scientific Data Infrastructure in CAS Dr. Jianhui Scientific Data Center Computer Network Information Center Chinese Academy of Sciences.
CNGI Applications in CSTNET QingHua Zhang CSTNET January 2007.
CI Days: Planning Your Campus Cyberinfrastructure Strategy Russ Hobby, Internet2 Internet2 Member Meeting 9 October 2007.
Division Report Computing Center CHEN Gang Computing Center Oct. 24, 2013 October 24 ,
Grid Activities in China CHEP’06 Mumbai, 16/Feb/2006 Gang CHEN Institute of High Energy Physics, CAS Baoping YAN Computer Network Information Center, CAS.
Status of CNGrid and Interests on International Testbed Depei Qian Sino-German Joint Software Institute (JSI) Beihang University SC12 BoF on Computing.
HEP Grid Computing in China Gang Workshop on Future PRC-U.S. Cooperation in High Energy Physics.
Scientific Data Grid on NGI Kai Nan Computer Network Information Center Chinese Academy of Sciences CANS 2004, Miami.
Scientific data cloud infrastructure and services in Chinese Academy of Sciences Jianhui Yuanke
Data GRID Activity in Japan Yoshiyuki WATASE KEK (High energy Accelerator Research Organization) Tsukuba, Japan
A short introduction to the Worldwide LHC Computing Grid Maarten Litmaath (CERN)
Date donald smits center for information technology Centre of Information Technology RUG Robert Janz Centre of Information Technology University.
What is Cyberinfrastructure? Russ Hobby, Internet2 Clemson University CI Days 20 May 2008.
Scientific Database and Virtual Museums
The LHC Computing Grid – February 2008 The Worldwide LHC Computing Grid Dr Ian Bird LCG Project Leader 25 th April 2012.
Welcome to KISTI KISTI-CCIN2P3 FKPPL Workshop December 1, 2008 Minsun LEE.
BESIII Production with Distributed Computing Xiaomei Zhang, Tian Yan, Xianghu Zhao Institute of High Energy Physics, Chinese Academy of Sciences, Beijing.
14 Aug 08DOE Review John Huth ATLAS Computing at Harvard John Huth.
An Introduction to Scientific Data Grid LUO Ze Computer Network Information Centre, Chinese Academy of Sciences.
Grid Architecture William E. Johnston Lawrence Berkeley National Lab and NASA Ames Research Center (These slides are available at grid.lbl.gov/~wej/Grids)
ORIENT/ORIENTplus - Connecting Academic Networks in China and Europe Jennifer(Jie) An CERNET Center/Tsinghua University 14 Feb.2012.
IPv6 Development in CSTNET Xiaodan Zhang Computer Network Information Center,CAS 27 th APAN Kaosiung Mar. 2-6, 2008.
IHEP Computing Center Site Report Shi, Jingyan Computing Center, IHEP.
KISTI-GSDC SITE REPORT Sang-Un Ahn, Jin Kim On the behalf of KISTI GSDC 24 March 2015 HEPiX Spring 2015 Workshop Oxford University, Oxford, UK.
IPV6 activity in CSTNET Jiangning Chen
Cyberinfrastructure What is it? Russ Hobby Internet2 Joint Techs, 18 July 2007.
Les Les Robertson LCG Project Leader High Energy Physics using a worldwide computing grid Torino December 2005.
FP6−2004−Infrastructures−6-SSA Interconnection & Interoperability of Grids between Europe and China the EUChinaGRID Project F. Ruggieri – INFN Project.
IHEP(Beijing LCG2) Site Report Fazhi.Qi, Gang Chen Computing Center,IHEP.
Current Status and Future Prospects of Public Understanding of Science based on Internet in China Prof. Yan Baoping Computer Network Information Center.
EScience: Techniques and Technologies for 21st Century Discovery Ed Lazowska Bill & Melinda Gates Chair in Computer Science & Engineering Computer Science.
ORACLE IN CHINA An Emerging Giant … #6 18 Years in China 7,000+ Customers (6,800+ Tech and 700+ Apps Customers) 600+ Partners 1,600+ Staff 150,000 Strong.
| nectar.org.au NECTAR TRAINING Module 2 Virtual Laboratories and eResearch Tools.
Cyberinfrastructure Overview Russ Hobby, Internet2 ECSU CI Days 4 January 2008.
Cyberinfrastructure: Many Things to Many People Russ Hobby Program Manager Internet2.
CSTNET and Its Applications Computer Network Info. Center, CAS.
INFSO-RI Enabling Grids for E-sciencE The EGEE Project Owen Appleton EGEE Dissemination Officer CERN, Switzerland Danish Grid Forum.
IHEP Computing Site Report Shi, Jingyan Computing Center, IHEP.
11 e-Infrastructures for Agriculture in China: State of the Art and Requirements Meng Xianxue Qian Ping
IPCEI on High performance computing and big data enabled application: a pilot for the European Data Infrastructure Antonio Zoccoli INFN & University of.
Dominique Boutigny December 12, 2006 CC-IN2P3 a Tier-1 for W-LCG 1 st Chinese – French Workshop on LHC Physics and associated Grid Computing IHEP - Beijing.
IHEP Computing Center Site Report Gang Chen Computing Center Institute of High Energy Physics 2011 Spring Meeting.
HPC-related R&D in 863 Program Depei Qian Sino-German Joint Software Institute (JSI) Beihang University Aug. 27, 2010.
The status of IHEP Beijing Site WLCG Asia-Pacific Workshop Yaodong CHENG IHEP, China 01 December 2006.
ChinaGrid: National Education and Research Infrastructure Hai Jin Huazhong University of Science and Technology
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No Support to scientific.
Scientific Computing at Fermilab Lothar Bauerdick, Deputy Head Scientific Computing Division 1 of 7 10k slot tape robots.
Grids and SMEs: Experience and Perspectives Emanouil Atanassov, Todor Gurov, and Aneta Karaivanova Institute for Parallel Processing, Bulgarian Academy.
Scientific Data Processing Portal and Heterogeneous Computing Resources at NRC “Kurchatov Institute” V. Aulov, D. Drizhuk, A. Klimentov, R. Mashinistov,
Sun Gongxing, IHEP, Beijing
EUChinaGRID Applications
China Academy of Transportation Sciences
Gridifying the LHCb Monte Carlo production system
Presentation transcript:

Tieniu TAN Deputy Secretary-General Chinese Academy of Sciences (CAS) 29 Mar. 2010, Irvine, USA The 4th China-US Roundtable on Scientific Data Cooperation Advanced Cyber-infrastructure for Scientific Data Applicationsin CAS The 4th China-US Roundtable on Scientific Data Cooperation Advanced Cyber-infrastructure for Scientific Data Applications in CAS

Outline  Background  Advanced Cyber-Infrastructure in CAS  Typical Data Intensive e-Science Applications in CAS  Conclusion

Scientific Data Deluge  Scientists face a data deluge –Vast volume of scientific data captured by large scientific facilities, ubiquitous sensors, new instruments and computer models  Science and engineering research have become increasingly data- intensive –New scientific opportunities are emerging from increasingly effective data organization, access and usage (NSF, 2007)

Data-intensive scientific discovery: e-Science  The fourth paradigm: data-intensive scientific discovery (Microsoft, 2009) –A Transformed Scientific Method  e-Science is synthesis of information technology and science, giving priority to scientific data lifecycle and data exploration (Jim Gray) –data captured by instruments or generated by simulator; processed by software; information/knowledge stored in computer; scientist analyzes database / files; using data management and statistics

China National Scientific Data Sharing Initiatives  Ministry of Science and Technology (MOST) started the implementation of Scientific Data Sharing Program (SDSP) in 2002 –Supporting almost 20 projects to promote scientific data sharing  National Science & Technology Infrastructure (NSTI) was launched in 2005 by MOST and Ministry of Finance ( ) –Supporting 38 projects for promoting Science and Technology Resources, data and information sharing and Open Access –Total funding ~2 billion RMB

High Speed Network -CSTNET -CSTNET-CNGI -GLORIAD 1.Field observation stations 2.Large scientific facilities 3.others Advanced CI for Data Lifecycle in CAS Application Generation &Collection Trans- mission Computing &Analysis Storage &Curation Data Information Stream Data Centers -storage &preservation -Curation -Sharing and Service Supercomputing Grid -Computing -Analysis -Mining -visualization Data intensive e- Science Applications

Data generation  Large scientific facilities produce huge data –+20 in operation –+20 under construction  Long-term field observation stations –+100 stations covering Ecology, Environment, Space, etc.  Other research data, including experiments, modeling, computing, etc. –100 institutes, more than researchers in CAS

Network Field Observation  Network expanded to link field observations –Real Time Data Collection  CERN  China Ecology system Research Network  Disaster and Environment Observation  Astronomy and space observation

Meridian Space Weather Monitoring Program  More than 10TB data will be generated and transmitted to Beijing per year  data analysis needs 20Tflops  A data system and processing infrastructure being built

Cosmic-ray observatory: ARGO/AS   Cosmic-ray observatory at Yangbajing in Tibet: –ARGO: China-Italy –AS  : China-Japan  ~200TB raw data per year.  Data transferred from YBJ- ARGO and processed at IHEP and INFN  Rec. data accessible by collaborators.

BEPCII / BESIII BEPC: Beijing Electron-Positron Collider –upgrade: BEPCII/BESIII, operational in 2008 –2.0 ~ 4.6 GeV/C –(3~10)×10 32 cm -2 s -1 –36 Institutions from China, US, Germany, Russian, and Japan –4000+ KSI2K for data process and physics analysis –5+ PB in five years

Data Transmission-High Speed Network  China Science and Technology Network ( CSTNet )  Non-profitable, academic and research networks in China to support advanced science applications and research on next generation Internet  Connect some 200 institutes, and 1,000,000 end users

Lanzhou Xinjiang Xian Shenyang Changchun Chengdu Kunming Wuhan Guangzhou Shanghai Hefei Lasa Qingdao Haerbin Xining Dalian Guiyang Yangbajing Xishuangbanna Changsha TianJin 2.5Gb/s 155Mb/s < 155Mb/s Figure HongKong 1Gb/s Taiwan Shenzhen Fuzhou Ningbo Nanjing Shanxi Shijiazhuang Beijing CSTNET Backbone

Interconnecting with other Networks Russia Netherland USA KISTI Korea NICT Japan AS Hongkong GOOGLE Hongkong HKIX Hongkong CUHK Hongkong China169 China Unicom ChinaNet TELECOM CERNET HKOEPCSTNET Gloriad 10G 2.5G 1G 2.5G 2G 155M 700M BJ NAP 2.5G Hongkong 2G Internet Beijing

上海 Jiling 辽宁 Guangzhou 兰州 XinJiang Beijing 10Gbps International Link10G 羊八井 100+ Institutes 40+ Field stations and big science facilities Computing facilities and storage facilities CSTNET-CNGI An IPv6 Network for Science based on CSTNET will start to build this year Chengdu XI’AN Kunmin g WuHan Hefei Nanjing

Data Storage and Curation  A General Scientific Data Center –Common data infrastructure construction, operation –Data archive and preservation  Some domain specific scientific data centers –Discipline data curation and sharing service  A CAS scientific data app project –Multi-discipline data sharing and applications  A series of domain-based scientific data sharing systems and institute level data sharing infrastructure

Data Resource Center  A General Scientific Data Center  A new organization responsible for data preservation, curation and access service in CAS Mass data backup Data online service Mass data analysis and process Long-term preservation of important data Data Resource Center Technology service Network storage space system environment Application service mass data Managemen t system collaborator staff

Massive Storage System in Data Resource Center  Massive Storage System –Scientific data archive system (5PB tape) –Online data storage system (1PB disk array)  Internet-based service (Cloud Service) –Data backup –Archiving and curation –on-line data access and analysis

Domain Specific Scientific Data Centers  World Data Center ( World Data System ) in CAS –Natural Resource Environment Data Center –Astronomy Data Center –Space Data Center –Geophysics Data Center –Glacier and Frozen Earth Data Center

Scientific Databases (SDB)  A Long-term mission started in 1986 which was funded by CAS –data from research, for research  Collecting multi-discipline research data and promoting data sharing –More than 350 research databases and 400 datasets by 61 institutes –Over 60TB data available to open access and download

Scientific Databases (cont.)  8 Resource databases –Geo-Science –Biodiversity –Chemistry –Astronomy –Space Science –Micro biology and virus –Material science –Environment  2 Reference databases –China Species –compound  4 Application-Oriented databases –High Energy (ITER) –Western Environment Research –Ecology research –Qinghai Lake Research

Scientific Data Grid Scientific Data and databases Scientific Data Grid Middleware Scientific Data Grid Applications Bioscience GatewayGeosciences Gateway Chemistry Gateway Other Gateways CAS Scientific Data Grid  Integrating distributed scientific data into a com- prehensive service and application environment  Linking all data canters as a data net

Scientific Computing Grid Scientific Computing Grid Access Through network Local/Remote User Resource Abstracting Cooperation Resource Interconnection Other network resource and environment Database, e-Science, ARP, website, science, TRP CNGRID & environment Super Computing Grid Application service and Technical supporting System, Uniform System operating, Supporting & Service. Uniform Regulations SCCAS, 120+Tflops Computing capacity 8+ Branches : 50 Tflops common Computing capacity Institute Computing Resource 50 Tflops common Computing capacity Lenovo 7000, Peak: 143TeraFLOPS

Scientific Computing Grid HPC, Cluster, Workstation, Storage Windows / Linux Clients Web Portal Grid Middleware

HEP Grid in China  Access to the LHC data for scientific research: A grid computing system is built in CAS  WLCG MoU signed with CERN in 2006 to build a Tier-2 center at IHEP for both the ATLAS and CMS experiments. IHEP PKU SDU USTC NJU

Tier-2 site at IHEP  WLCG site based on EGEE/gLite  Associated with CC-IN2P3 in Lyon  Work nodes with 1600 cores  400 TB disk space

Typical data intensive e-Science Applications  Developing a series of pilot e-Science applications –Most are data intensive

Pt>20 GeV/c Tracks ttH(2l2b4j2  ) full simulation event display ttH-2L selection ttbar mimic to ttHWW HEP Grid Applications: ATLAS MC Study

Rosetta Early/Late Stage HEP Grid application: protein prediction  Explore the non-natural protein sequence space  Set up a massive protein structure prediction environment  Develop web tools for the biology community  Result of EUChinaGrid project (EU FP6 project) KWCWPFASHNDLKVQSQ WYVEPPDTIPPYNKYGTN FIKHCQYIAHMQGDTHFF NRVRMHQLWKIIVDCAY

ChinaFLUX Built in 2002 for climate change and environment research

31 Data System Observation system Modeling and visualization Data transmission ChinaFLUX e-Science Environment

Real data from sensors to field stations, then to institutes, finally to data centers to process and share Cyberinfrastructure for data collection

Data intensive application environment  Data synthesis and integration  Data analysis and modeling  visualization

OPEN SCIENCE CLOUD IaaS Network Service Computing Service Storage Service … IaaS Network Service Computing Service Storage Service … Conclusion Paas Data intensive application environment … Paas Data intensive application environment … Saas Software and tools for data curation, analysis, mining and visualization … Saas Software and tools for data curation, analysis, mining and visualization … Building an Open Science Cloud serving not only CAS researchers, but also the wider scientific community! DaaS Scientific data and databases Service DaaS Scientific data and databases Service

Thank you !