Building Virtual Scientific Computing Environment with Openstack Yaodong Cheng, CC-IHEP, CAS ISGC 2015.

Slides:



Advertisements
Similar presentations
Cloud computing is used to describe a variety of computing concepts that involve a large number of computers connected through a real-time communication.
Advertisements

© 2012 IBM Corporation Architecture of Quantum Folsom Release Yong Sheng Gong ( 龚永生 ) gongysh #openstack-dev Quantum Core developer.
SLA-Oriented Resource Provisioning for Cloud Computing
Chapter 22: Cloud Computing and Related Security Issues Guide to Computer Network Security.
An Approach to Secure Cloud Computing Architectures By Y. Serge Joseph FAU security Group February 24th, 2011.
1 Security on OpenStack 11/7/2013 Brian Chong – Global Technology Strategist.
CHEP 2012 – New York City 1.  LHC Delivers bunch crossing at 40MHz  LHCb reduces the rate with a two level trigger system: ◦ First Level (L0) – Hardware.
Tunis, Tunisia, 28 April 2014 Business Values of Virtualization Mounir Ferjani, Senior Product Manager, Huawei Technologies 2.
IHEP Site Status Jingyan Shi, Computing Center, IHEP 2015 Spring HEPiX Workshop.
Virtualization and the Cloud
M.A.Doman Model for enabling the delivery of computing as a SERVICE.
Cloud computing Tahani aljehani.
Network+ Guide to Networks 6 th Edition Chapter 10 Virtual Networks and Remote Access.
Testing as a Service with HammerCloud Ramón Medrano Llamas CERN, IT-SDC
Cloud Computing Why is it called the cloud?.
Clouds on IT horizon Faculty of Maritime Studies University of Rijeka Sanja Mohorovičić INFuture 2009, Zagreb, 5 November 2009.
Customized cloud platform for computing on your terms !
Cloud Computing. What is Cloud Computing? Cloud computing is a model for enabling convenient, on-demand network access to a shared pool of configurable.
BESIII physical offline data analysis on virtualization platform Qiulan Huang Computing Center, IHEP,CAS CHEP 2015.
Ceph Storage in OpenStack Part 2 openstack-ch,
M.A.Doman Short video intro Model for enabling the delivery of computing as a SERVICE.
Cloud Computing in NASA Missions Dan Whorton CTO, Stinger Ghaffarian Technologies June 25, 2010 All material in RED will be updated.
Presented by: Sanketh Beerabbi University of Central Florida COP Cloud Computing.
YAN, Tian On behalf of distributed computing group Institute of High Energy Physics (IHEP), CAS, China CHEP-2015, Apr th, OIST, Okinawa.
Mehdi Ghayoumi Kent State University Computer Science Department Summer 2015 Exposition on Cyber Infrastructure and Big Data.
608D CloudStack 3.0 Omer Palo Readiness Specialist, WW Tech Support Readiness May 8, 2012.
From Virtualization Management to Private Cloud with SCVMM 2012 Dan Stolts Sr. IT Pro Evangelist Microsoft Corporation
CloudNaaS: A Cloud Networking Platform for Enterprise Applications Theophilus Benson*, Aditya Akella*, Anees Shaikh +, Sambit Sahu + (*University of Wisconsin,
BESIII Production with Distributed Computing Xiaomei Zhang, Tian Yan, Xianghu Zhao Institute of High Energy Physics, Chinese Academy of Sciences, Beijing.
WNoDeS – Worker Nodes on Demand Service on EMI2 WNoDeS – Worker Nodes on Demand Service on EMI2 Local batch jobs can be run on both real and virtual execution.
Tim Bell 24/09/2015 2Tim Bell - RDA.
IHEP Computing Center Site Report Shi, Jingyan Computing Center, IHEP.
Jose Castro Leon CERN – IT/OIS CERN Agile Infrastructure Infrastructure as a Service.
Windows Azure Virtual Machines Anton Boyko. A Continuous Offering From Private to Public Cloud.
IHEP(Beijing LCG2) Site Report Fazhi.Qi, Gang Chen Computing Center,IHEP.
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
Architecture & Cybersecurity – Module 3 ELO-100Identify the features of virtualization. (Figure 3) ELO-060Identify the different components of a cloud.
CLOUD COMPUTING RICH SANGPROM. What is cloud computing? “Cloud computing is a model for enabling ubiquitous, convenient, on-demand network access to a.
Web Technologies Lecture 13 Introduction to cloud computing.
Cloud Computing – UNIT - II. VIRTUALIZATION Virtualization Hiding the reality The mantra of smart computing is to intelligently hide the reality Binary->
OpenStack Chances and Practice at IHEP Haibo, Li Computing Center, the Institute of High Energy Physics, CAS, China 2012/10/15.
INRNE's participation in LCG Elena Puncheva Preslav Konstantinov IT Department.
Virtual Cluster Computing in IHEPCloud Haibo Li, Yaodong Cheng, Jingyan Shi, Tao Cui Computer Center, IHEP HEPIX Spring 2016.
© 2012 Eucalyptus Systems, Inc. Cloud Computing Introduction Eucalyptus Education Services 2.
Claudio Grandi INFN Bologna Virtual Pools for Interactive Analysis and Software Development through an Integrated Cloud Environment Claudio Grandi (INFN.
INFN/IGI contributions Federated Clouds Task Force F2F meeting November 24, 2011, Amsterdam.
Update on Computing/Cloud Marco Destefanis Università degli Studi di Torino 1 BESIII Ferrara, Italy October 21, 2014 Stefano Bagnasco, Flavio Astorino,
IHEP Computing Center Site Report Gang Chen Computing Center Institute of High Energy Physics 2011 Spring Meeting.
The status of IHEP Beijing Site WLCG Asia-Pacific Workshop Yaodong CHENG IHEP, China 01 December 2006.
Status of BESIII Distributed Computing BESIII Collaboration Meeting, Nov 2014 Xiaomei Zhang On Behalf of the BESIII Distributed Computing Group.
Hello everyone I am rajul and I’ll be speaking on a case study of elastic slurm Case Study of Elastic Slurm Rajul Kumar, Northeastern University
The advances in IHEP Cloud facility
Elastic Computing Resource Management Based on HTCondor
Integration of Openstack Cloud Resources in BES III Computing Cluster
Report of Dubna discussion
Yaodong CHENG Computing Center, IHEP, CAS 2016 Fall HEPiX Workshop
Chapter 21: Cloud Computing and Related Security Issues
Introduction to Cloud Computing
The Scheduling Strategy and Experience of IHEP HTCondor Cluster
Chapter 22: Cloud Computing Technology and Security
OpenStack Ani Bicaku 18/04/ © (SG)² Konsortium.
Cloud Technology Group
Real IBM C exam questions and answers
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
HC Hyper-V Module GUI Portal VPS Templates Web Console
* Introduction to Cloud computing * Introduction to OpenStack * OpenStack Design & Architecture * Demonstration of OpenStack Cloud.
Cloud Computing: Concepts
Status and prospects of computing at IHEP
Building an Elastic Batch System with Private and Public Clouds
06 | SQL Server and the Cloud
Presentation transcript:

Building Virtual Scientific Computing Environment with Openstack Yaodong Cheng, CC-IHEP, CAS ISGC 2015

2/25 International Symposium on Grids and Clouds (ISGC) 2015 Contents  Requirements of scientific computing  IHEP cloud platform  Virtual machine types  Virtual computing cluster  Dirac distributed computing  Conclusion

3/25 International Symposium on Grids and Clouds (ISGC) 2015 Large science facilities  IHEP: The largest fundamental research center in China  IHEP serves as the backbone of China’s large science facilities  Beijing Electron Positron Collider BEPCII/BESIII  Yangbajing Cosmic Ray Observatory: ASg & ARGO  Daya Bay Neutrino Experiment  China Spallation Neutron Source (CSNS)  Hard X-ray Modulation Telescope(HXMT)  Accelerator-driven Sub-critical System (ADS)  Jiangmen Neutrino Underground Observatory (JUNO)  Under planning: BAPS, LHAASO, XTP, HERD, …

4/25 International Symposium on Grids and Clouds (ISGC) 2015 BEPCII/BESIII  36 Institutions from China, US, Germany, Russian, Japan,…  > 5PB in next 5 years  ~ 5000 CPU cores  simulation, reconstruction, analysis, …  long-term data preservation  data sharing between partners

5/25 International Symposium on Grids and Clouds (ISGC) 2015 Other experiments  Daya Bay Neutrino Experiment  ~200TB per year  JUNO: Jiangmen Neutrino Experiment  ~500TB per year  LHAASO  2PB per year after 2017,  accumulate 20PB+ in 10 years  Atlas and CMS Tier2 site  940TB disk, 1088 CPU cores  CSNS, HXMT, … 5PB data one year!!

6/25 International Symposium on Grids and Clouds (ISGC) 2015 Computing resources status  ~ CPU cores  ~ 50 queues, managed by Torque/PBS  difficult to share  ~ 5PB disk  Lustre, Glustre, dCache/DPM, …  ~ 5PB LTO4 tape  two IBM 3584 tape libraries  modified CERN CASTOR 1.7 PC farm built with blades Tape libraries

7/25 International Symposium on Grids and Clouds (ISGC) 2015 In the future, …  More HEP experiments, need to manage twice or more servers as today  but, no possibility of significant increase in staff numbers  Is cloud a good solution ?  Is cloud suitable for Scientific Computing?  Time to change IT strategy!!

8/25 International Symposium on Grids and Clouds (ISGC) 2015 What is Cloud?  NIST: Best Definitions  Essential characteristics  On-demand self-service  Broad network access  Resource pooling  Rapid elasticity  Measured service  Service models  IaaS, PaaS, SaaS  Deployment models  Public, private, hybrid ications/nistpubs/ /SP pdf Is cloud beneficial to scientific computing ?

9/25 International Symposium on Grids and Clouds (ISGC) 2015 Easy to Maintain  Hardware: services become independent of underlying physical machine  Cloud services: single set of services for managing access to computing resources  Scientific platforms: become separate layer deployed, controlled and managed by domain experts

10/25 International Symposium on Grids and Clouds (ISGC) 2015 Customized Environment  Operating systems suited to your application  Your applications preinstalled and preconfigured  CPU, memory, and swap sized for your needs

11/25 International Symposium on Grids and Clouds (ISGC) 2015 Dynamic Provisioning  New storage and compute resources in minutes (or less)  Resources freed just as quickly to facilitate sharing  Create temporary platforms for variable workloads

12/25 International Symposium on Grids and Clouds (ISGC) 2015 IHEPCloud: a Private IaaS platform  Launched in May 2014  Three use scenario  User self-Service virtual machine platform  User register and destroy VM on-demand  Virtual Computing Cluster  Job will be allocated to virtual queue automatically when physical queue is busy  Distributed computing system  Work as a cloud site: Dirac call cloud interface to start or stop virtual work nodes

13/25 International Symposium on Grids and Clouds (ISGC) 2015 IHEPCloud services  Who can use?  any user who has IHEP account  How many resources for user?  By default, each user has 3 CPU cores and 15GB memory  VM types  testing machine  full root privilege, no public storage  UI node  AFS authentication, No root privilege, public storage  No some limitations like memory, CPU time, process  OS types  SL 55, SL 58, SL 65, SL 7 64 bits, SL bits, Win7 64 bits  add new types depends on user requirement  VM IP address  internal IP address ( *.*) is allocated automatically  foreign IP address ( *) need the approval of administrator

14/25 International Symposium on Grids and Clouds (ISGC) 2015 Why does end user need IHEPCloud?  Virtual testing machine  Develop program or do some testing  generates VM in a few minutes  Login VM via ssh/VNC, remote desktop  Virtual UI node  debug program in computing environment  login node: lxslcxx.ihep.ac.cn  Limitations: Memory, CPU time, user processes, … cputime > 45m && %CPU > 60% KILL it!  Affected by other users  VMs: owned only by one user; no limitations

15/25 International Symposium on Grids and Clouds (ISGC) 2015 Virtual computing cluster  If a job queue is busy, the jobs can be allocated to a virtual queue  plan to run the service this year junoq: 128 CPU cores Submit job Cloud Scheduler Check Queue load IHEPCloud Start/stop VM Virtual queue Forward job Detailed: see haibo’s talk

16/25 International Symposium on Grids and Clouds (ISGC) 2015 Distributed computing  Distributed computing has integrated cloud resources based on pilot schema, implementing dynamic scheduling  Cloud resources used can be shrunk and extended dynamically according to job requirements VM1, VM2, … Cloud Distributed Computing Job1, Job2, Job3… Cloud Distributed Computing No Jobs User Job Submission Create Get Job VM Cloud Distributed Computing No Jobs Job Finished Delete

17/25 International Symposium on Grids and Clouds (ISGC) 2015 Cloud sites  5 cloud sites from Torino, JINR, CERN and IHEP have been set up and connected to distributed computing system  About 320 CPU cores, 400GB Memory, 10TB disk

18/25 International Symposium on Grids and Clouds (ISGC) 2015 Cloud tests  More than 4500 jobs have been done with 96% success rate  Failure reason is lack of disk space  Disk space will be extended in IHEP cloud

19/25 International Symposium on Grids and Clouds (ISGC) 2015 Performance and Physics validation  Performance tests has shown that running time in the cloud sites are comparable with other production sites  Simulation, Reconstruction, Download random trigger data  Physics validation has proved that physics results are highly consistent between clusters and cloud sites

20/25 International Symposium on Grids and Clouds (ISGC) 2015 OpenStack Dashboard CEPH NetworkDB Dirac Virtual Cluster API LDAP UMT (IHEP ) API interactive Push info. Get info. Storage path DNS API Backend storage Configuration management Register Puppet Get VM info. Register DNS Register Nagios Log Analysis Host Monitor Service monitor authentication UMT (CAS CLOUD) Interoperation Architecture

21/25 International Symposium on Grids and Clouds (ISGC) 2015 IHEPCloud components  Core middleware: openstack  open source cloud management system  most popular  Configuration management tool: Puppet  create VM image  manage applications in VM  keep the consistency of VM and computing environment  Authentication  IHEP account and password  AFS authentication for UI node  Network management  centric NetworkDB record MAC, IP, hostname, user, …  each VM has a hostname, *.v.ihep.ac.cn  network traffic accounting  External storage  Currently, images and instances are stored in local disk  evaluating CEPH to support GLANCE, NOVA and Cinder

22/25 International Symposium on Grids and Clouds (ISGC) 2015 Network in IHEPCloud  multiple IP subnets on one physical machine  Vlan mode (Just L2, no router) in Openstack neutron  IP gateway and 802.1Q in hardware switch  Problem: trunck  Big mac table  Pre-config  Risk of Broadcast storm  Future network  Core layer: Vxlan(hardware-based)  Access layer: Openstack vlan mode

23/25 International Symposium on Grids and Clouds (ISGC) 2015 IHEPCloud Current status  Released in 18 November, 2014  Built on openstack icehouse  1 control node  7 computing nodes  112 physical CPU cores / 224 Virtual CPU cores, 896GB memory totally  Active 96 VMs, 172 CPU cores, 628GB memory by 11 March Control Node Computing Node VM

24/25 International Symposium on Grids and Clouds (ISGC) 2015 Conclusion  cloud computing is widely accepted by industrial and scientific domain  Scientific computing are preparing the move to cloud  IHEPCloud aims at providing self-service virtual machine platform for IHEP user  IHEPCloud also supports virtual cluster and distributed computing  One small Cloud platform has been built and open to IHEP user freely  More resources (1000+) will be added to IHEPCloud this year  Investigate shibboleth to build federated cloud

Thank you! Any Questions?