Cloud Computing at the RAL Tier 1 Ian Collier STFC RAL Tier 1 GridPP 30, Glasgow, 26th March 2013.

Slides:



Advertisements
Similar presentations
Condor use in Department of Computing, Imperial College Stephen M c Gough, David McBride London e-Science Centre.
Advertisements

Tier-1 Evolution and Futures GridPP 29, Oxford Ian Collier September 27 th 2012.
Windows Azure for SharePoint people Dennis – Solution Architect Microsoft Windows Azure.
2  Industry trends and challenges  Windows Server 2012: Beyond virtualization  Complete virtualization platform  Improved scalability and performance.
 What Is Desktop Virtualization?  How Does Application Virtualization Help?  How does V3 Systems help?  Getting Started AGENDA.
System Center 2012 R2 Overview
SCD in Horizon 2020 Ian Collier RAL Tier 1 GridPP 33, Ambleside, August 22 nd 2014.
Cloud & Virtualisation Update at the RAL Tier 1 Ian Collier Andrew Lahiff STFC RAL Tier 1 HEPiX, Lincoln, NEBRASKA, 17 th October 2014.
VxWorks Real-Time Kernel Connectivity
“It’s going to take a month to get a proof of concept going.” “I know VMM, but don’t know how it works with SPF and the Portal” “I know Azure, but.
WLCG Cloud Traceability Working Group progress Ian Collier Pre-GDB Amsterdam 10th March 2015.
Be the Next Private Cloud Expert Now Dan Stolts (ITProguru) Sr. IT Pro Evangelist
COS302. = Managed for YouStandalone Servers IaaSPaaSSaaS Applications Runtimes Database Operating System Virtualization Server Storage Networking.
5205 – IT Service Delivery and Support
An Introduction to Cloud Computing. The challenge Add new services for your users quickly and cost effectively.
Tier-1 experience with provisioning virtualised worker nodes on demand Andrew Lahiff, Ian Collier STFC Rutherford Appleton Laboratory, Harwell Oxford,
Andrew McNab - Manchester HEP - 22 April 2002 UK Rollout and Support Plan Aim of this talk is to the answer question “As a site admin, what are the steps.
CERN IT Department CH-1211 Genève 23 Switzerland t Next generation of virtual infrastructure with Hyper-V Michal Kwiatek, Juraj Sucik, Rafal.
VAP What is a Virtual Application ? A virtual application is an application that has been optimized to run on virtual infrastructure. The application software.
Copyright © 2010 Platform Computing Corporation. All Rights Reserved.1 The CERN Cloud Computing Project William Lu, Ph.D. Platform Computing.
Cloud computing is the use of computing resources (hardware and software) that are delivered as a service over the Internet. Cloud is the metaphor for.
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
Cloud Models – Iaas, Paas, SaaS, Chapter- 7 Introduction of cloud computing.
Virtualisation Cloud Computing at the RAL Tier 1 Ian Collier STFC RAL Tier 1 HEPiX, Bologna, 18 th April 2013.
Microsoft’s Vision for IT as a Service The Server to Virtualized Datacenter to Private & Public Cloud Continuum David Greschler, Director, Microsoft Kondwani.
M.A.Doman Short video intro Model for enabling the delivery of computing as a SERVICE.
Datacenters of the Past StorageNetworkCompute Today’s datacenter.
Get More out of SQL Server 2012 in the Microsoft Private Cloud environment Steven Wort, Xin Jin Microsoft Corporation.
INFSO-RI Enabling Grids for E-sciencE SA1: Cookbook (DSA1.7) Ian Bird CERN 18 January 2006.
QCDGrid Progress James Perry, Andrew Jackson, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
From Virtualization Management to Private Cloud with SCVMM 2012 Dan Stolts Sr. IT Pro Evangelist Microsoft Corporation
MKS at ITT: Managing ATM Projects A Presentation for WDCA CMWG Phyllis High Jones ADS-B Configuration Manager October 11, 2011 Surveillance and Broadcast.
WLCG Cloud Traceability Working Group face to face report Ian Collier 11 February 2015.
An Agile Service Deployment Framework and its Application Quattor System Management Tool and HyperV Virtualisation applied to CASTOR Hierarchical Storage.
Virtualisation & Cloud Computing at RAL Ian Collier- RAL Tier 1 HEPiX Prague 25 April 2012.
Microsoft Management Seminar Series SMS 2003 Change Management.
Windows Azure Virtual Machines Anton Boyko. A Continuous Offering From Private to Public Cloud.
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. 1 Automate your way to.
Stairway to the cloud or can we take the highway? Taivo Liik.
| nectar.org.au NECTAR TRAINING Module 5 The Research Cloud Lifecycle.
Virtualisation at the RAL Tier 1 Ian Collier STFC RAL Tier 1 HEPiX, Annecy, 23rd May 2014.
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
Web Technologies Lecture 13 Introduction to cloud computing.
Final Implementation of a High Performance Computing Cluster at Florida Tech P. FORD, X. FAVE, K. GNANVO, R. HOCH, M. HOHLMANN, D. MITRA Physics and Space.
Evolving Security in WLCG Ian Collier, STFC Rutherford Appleton Laboratory Group info (if required) 1 st February 2016, WLCG Workshop Lisbon.
RAL PPD Tier 2 (and stuff) Site Report Rob Harper HEP SysMan 30 th June
CERN IT Department CH-1211 Genève 23 Switzerland t Migration from ELFMs to Agile Infrastructure CERN, IT Department.
1 Update at RAL and in the Quattor community Ian Collier - RAL Tier1 HEPiX FAll 2010, Cornell.
Grid-Ireland test facilities Stephen Childs Dept. of Computer Science Trinity College Dublin.
Ian Collier, STFC, Romain Wartel, CERN Maintaining Traceability in an Evolving Distributed Computing Environment Introduction Security.
Tier 1 Experience Provisioning Virtualized Worker Nodes on Demand Ian Collier, Andrew Lahiff UK Tier 1 Centre, RAL ISGC 2014.
Going Hybrid – part 2 Moving to Hybrid Cloud with Windows Azure Virtual Machines & System Center 2012 R2.
Experience integrating a production private cloud in a Tier 1 Grid site Ian Collier Andrew Lahiff, George Ryall STFC RAL Tier 1 ISGC 2015 – March 20 th.
Clouding with Microsoft Azure
Cloud Technology and the NGS Steve Thorn Edinburgh University (Matteo Turilli, Oxford University)‏ Presented by David Fergusson.
Dag Toppe Larsen UiB/CERN CERN,
Dag Toppe Larsen UiB/CERN CERN,
HEPiX Spring 2014 Annecy-le Vieux May Martin Bly, STFC-RAL
Software Defined Storage
An Introduction to Cloud Computing
SCD Cloud at STFC By Alexander Dibbo.
JASMIN Success Stories
Introduction to Cloud Computing
Windows Server 2016 Software Defined Storage
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
Upgrading Your Private Cloud with Windows Server 2012 R2
Cloud Computing: Concepts
Fundamental Concepts and Models
Productive + Hybrid + Intelligent + Trusted
06 | SQL Server and the Cloud
Presentation transcript:

Cloud Computing at the RAL Tier 1 Ian Collier STFC RAL Tier 1 GridPP 30, Glasgow, 26th March 2013

RAL Context at RAL Hyper-V Services Platform Scientific Computing Department Cloud Summary

What Do We Mean By Cloud For these purposes Computing on demand does not require administrator intervention Service owners dont have to care about where things run Resources expand to meet requirements seemlessly

Context at RAL Historically requests for systems went to fabric team –Procure new HW – could take months –Scavenge old WNs – could take days/weeks Kickstarts & scripts took tailoring for each system Not very dynamic For development systems many users simply run VMs on their desktops – hard to track & risky

Evolution at RAL Many elements play their part –Configuration management system Quattor (introduced in 2009) abstracts hardware from os from payload, automates most deployment Makes migration & upgrades much easier (still not completely trivial) –Databases feeding and driving configuration management system Provisioning new hardware much faster

Virtualisation & RAL Context at RAL Hyper-V Services Platform Scientific Computing Department Cloud Summary

Hyper-V Platform Over last three years –Local storage only in production –~200 VMs Provisioning transformed –Much more dynamic & responsive to changing requirements –Self service basis – requires training all admins in using management tools – but this Progress of high availability shared storage platform (much) slower than wed have liked

Hyper-V Platform Nearly all grid services virtualised now –fts, myproxy, bdii, cream-ce etc. Internal databases & monitoring systems Also test beds (batch system, CEs, bdiis etc) Move to production very smooth –Team had good period to become familiar with environment & tools

Hyper-V Platform When a Tier 1 admin needs to set up a new machine all they have to request is a DNS entry Everything else they do themselves Maintenance of underlying hardware platform can be done with (almost) no service interruption. This is already much, much better – especially more responsive – than what went before. Has many characteristics of private cloud –But we wouldnt usually call it cloud

Hyper-V Platform However, Windows administration is not friction or effort free (we are mostly Linux admins….) –Share management server with STFC corporate IT – but they do not have resources to support our use –Troubleshooting means even more learning –Some just dont like it Hyper-V continues to throw up problems supporting Linux –None show stoppers, but they drain effort and limit things –Ease of management otherwise compensates for now Since we began open source tools have moved on –We are not wedded to Hyper-V

Virtualisation & RAL Context at RAL Hyper-V Services Platform Scientific Computing Department Cloud Summary

SCD Cloud Prototype E-Science Department IaaS cloud platform –Specific use case was internal development and testing systems Began as small experiment 18 months ago Using StratusLab –Share Quattor configuration templates with other sites –Very quick and easy to get working –But has been a moving target as it develops Deployment done by graduates on 6 month rotation –Disruptive & variable progress

SCD Cloud Initially treat systems much like any Tier 1 system We allow users in whom we have high levels of trust –Monitor that central logging is active, sw updates are happening Cautiously introducing new user groups Plan to implement further network separation –Waiting for major reengineering of Tier 1 Network later this year One deployed use case effectively PaaS (Virtual frint ends to SCARF cluster)

SCD Cloud Resources –Began with 20 (very) old worker nodes ~80 cores Filled up very quickly 1 year ago added 120 cores in new Dell R410s – and also a few more old WNs This month adding 160 cores in more R410s Current –~300 cores – enough to Run a meaningful service – continue development to cover further use cases

SCD Cloud Usage 30 or so regular users (dept of ~200) ~200 vms at any one time –Typically running at 90-95% full Exploratory users from other departments Also adding very selective external (GridPP) users Proof of concept more than successful –Full time permanent staff in plan –It is busy – lots of testing & development People notice when it is not available

SCD Cloud Future Develop to full resilient service to users across STFC Participation in cloud federations Still evaluating storage solutions –For image store/sharing and S3 storage service –Ceph is looking very promising for both Have new hardware delivered for 80TB ceph cluster Integrating cloud resources in to Tier 1 grid work Reexamine platform itself. –Things have moved on since we started with StratusLab

Virtualisation & RAL Context at RAL Hyper-V Services Platform Scientific Computing Department Cloud Summary

Range of technologies in use Many ways our provisioning & workflows have become more dynamic, agile Private cloud has developed from a small experiment to beginning to provide real services –With constrained effort –Slower than we would have liked –Even the experimental platform is proving extremely useful We look forward to being able to replace Hyper-V for resilient services