Bending Ironic for Big Iron

Slides:



Advertisements
Similar presentations
© 2012 IBM Corporation Architecture of Quantum Folsom Release Yong Sheng Gong ( 龚永生 ) gongysh #openstack-dev Quantum Core developer.
Advertisements

Seamless migration from Nova-network to Neutron in eBay production Chengyuan Li, Han Zhou.
Profit from the cloud TM Parallels Dynamic Infrastructure AndOpenStack.
DCM Embedded Software Infrastructure, Build Environment and Kernel Modules A.Norman (U.Virginia) 1 July '09 NOvA Collaboration Mtg.
Storage Systems: Advanced Topics Learning Objectives: To understand limitations of “one file system per partition” model To understand Logical Volume Management.
11 HDS TECHNOLOGY DEMONSTRATION Steve Sonnenberg May 12, 2014 © Hitachi Data Systems Corporation All Rights Reserved.
Open-File Backup & App. Consistent Snapshots Microsoft Volume Shadow Copy Service Introduced 2002 –VSS Broadly Adopted by 3 rd Party Backup Vendors –Enables.
ITE PC v4.0 Chapter 1 1 Operating Systems Computer Networks– 2.
Best of.
Installing software on personal computer
Automating Linux Installations at CERN G. Cancio, L. Cons, P. Defert, M. Olive, I. Reguero, C. Rossi IT/PDP, CERN presented by G. Cancio.
Richard Smith Senior Consultant – Management, Operations and Deployment Microsoft UK Simple Deployments with Windows AIK and Windows DS.
Installing Windows Deployment Service
System Center 2012 Setup The components of system center App Controller Data Protection Manager Operations Manager Orchestrator Service.
Discussion: DNS for NIDs High Performance Data Division Doug Oucharek April 18, 2013 Intel® High Performance Data Division * Other names and brands may.
IT:NETWORK:MICROSOFT SERVER 2 DHCP AND WINDOWS DEPLOYMENT SERVICES.
Module 9: Configuring Storage
86% 50% Infrastructure provisioning Enterprise-class multi- tenant infrastructure for hybrid environments System Center capabilities Application.
Presented by: Sanketh Beerabbi University of Central Florida COP Cloud Computing.
Enabling Palacios PXE-Boot Chen Jin Bharath Pattabiraman Patrick Foley.
Deploying Windows 7 Lesson 3. Objectives Understand enterprise deployments Capture an image file Modify an image file Deploy an image file.
Eucalyptus: An Open-source Infrastructure for Cloud Computing Rich Wolski Eucalyptus Systems Inc.
WINDOWS XP PROFESSIONAL AUTOMATING THE WINDOWS XP INSTALLATION Bilal Munir Mughal Chapter-2 1.
Sandor Acs 05/07/
Scott Drucker, Systems Engineer Migrating to Microsoft Vista with WinINSTALL.
Mark E. Fuller Senior Principal Instructor Oracle University Oracle Corporation.
EXPOSING OVS STATISTICS FOR Q UANTUM USERS Tomer Shani Advanced Topics in Storage Systems Spring 2013.
Microsoft Windows XP Professional MCSE Exam
CoprHD and OpenStack Ideas for future.
ISCSI. iSCSI Terms An iSCSI initiator is something that requests disk blocks, aka a client An iSCSI target is something that provides disk blocks, aka.
The 2001 Tier-1 prototype for LHCb-Italy Vincenzo Vagnoni Genève, November 2000.
OpenStack overview of the project Belmiro Daniel Rodrigues Moreira CERN IT-PES-PS January 2011 Disclaimer: This presentation reflects the experience and.
ITMT Windows 7 Configuration Chapter 3 – Deploying Windows 7 ITMT 1371 – Windows 7 Configuration 1.
IBM Software Group © 2008 IBM Corporation IBM Tivoli Provisioning Manager 7.1 OS Management with TPM for OS Deployment.
Co-ordination & Harmonisation of Advanced e-Infrastructures for Research and Education Data Sharing Grant.
 systemD  FirewallD  Network manager (NMCLI)  Target CLI (iscsi targets)  GRUB 2 (Booting process)  Network teamnig & bridging.
Running clusters on a Shoestring US Lattice QCD Fermilab SC 2007.
OpenQRM is not Dead the lightning version Building a cloud in 5 mnutes by Kris Buytaert.
Elara Introduction Wentao Zhang? (NOTE: PASTE IN PORTRAIT AND SEND BEHIND FOREGROUND GRAPHIC FOR CROP)
Canadian Bioinformatics Workshops
BY: SALMAN 1.
OpenStack.
Use of HLT farm and Clouds in ALICE
BY: SALMAN.
The Architecture of oVirt Node
OPEN-O Multiple VIM Driver Project Use Cases
oVirt Node Project Douglas Schilling Landgraf
Heterogeneous Computation Team HybriLIT
Quattor in Amazon Cloud
StratusLab Final Periodic Review
StratusLab Final Periodic Review
Ops Manager API, Puppet and OpenStack – Fully automated orchestration from scratch! MongoDB World 2016.
Deploying and Maintaining Server Images
Oracle Solaris Zones Study Purpose Only
Deploy OpenStack with Ubuntu Autopilot
OpenStack Ani Bicaku 18/04/ © (SG)² Konsortium.
Multisite BP and OpenStack Kingbird Discussion
OPNFV Arno Installation & Validation Walk-Through
OpenStack Cloud DR PoC NEC India.
Networking for Home and Small Businesses – Chapter 2
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
Neutron at Scale Justin Hammond - Developer
Cloud computing mechanisms
Networking for Home and Small Businesses – Chapter 2
Operating Systems Networking for Home and Small Businesses – Chapter 2 – Introduction To Networking.
Quattor Advanced Tutorial, LAL
Ch 17 - Binding Protocol Addresses
Robert Down & Pranay Sadarangani Nov 8th 2011
OpenStack Summit; Operations Track
Presentation transcript:

Bending Ironic for Big Iron Doug Szumski, Mark Goddard, Forest Godfrey

Overview OpenStack We need to support: The foundation of Cray’s next generation system management software We need to support: Booting large numbers of diskless compute nodes Cinder integration for Ironic Flexible provisioning of diskful nodes Bareon agent for Ironic

Cinder integration for Ironic Based on upstream spec by Satoru Moriya https://review.openstack.org/#/c/200496 Configured by instance_info fields No additional database tables or changes to any APIs Supports: Booting disklessly from Cinder via iSCSI (no FC) In-band connection to the iSCSI target Attachment of additional volumes at boot time (but not dynamically) Extended support for Dracut based ramdisks through the generation of the PXE config file We’ve shared our implementation here: https://review.openstack.org/#/c/265856 XC series compute blade (4 nodes) up to 48 blades per cabinet 100s of cabinets

Diskless boot 1. Nova boot from CLI 2. Request IP for instance 3. Get storage port MAC address (or IP) and IQN from Ironic node driver info 4. Lookup IP address of the node on the storage network from the storage port MAC address 5. Prepare Cinder volumes and retrieve iSCSI target info 6. Patch Ironic with block device info 7. Call Ironic to begin deployment 8. Cache kernel and ramdisk, build the kernel cmdline using Jinja2 9. Configure TFTP server 10. Setup DHCP for PXE boot 11. Set boot device to PXE 12. Reboot target node 13. Target node broadcasts, DHCP server responds with an IP and the location of the bootloader 14. PXE boot the kernel and ramdisk 15. Mount iSCSI targets and pivot into the rootfs

Bareon (Fuel) Agent What is Bareon? Why does Cray use Bareon? “flexible and data driven interface to perform actions which are related to operating system installation” - wiki.openstack.org/wiki/Bareon In particular, Cray uses the Bareon agent with Ironic Similar in concept to the Ironic Python Agent (IPA) Why does Cray use Bareon? Deploy baremetal nodes in a flexible, perhaps non-cloud like way Deploying multiple images / multi-boot Support complex partitioning schemes Eg. Creation of shared partitions, LVM groups, consistent identification of block devices. Rsync deploy – useful for upgrades / updates Run arbitrary actions during or post deploy https://github.com/openstack/bareon

Bareon agent 1. Nova boot from CLI 2. Request IP 3. Nova calls Ironic 4. Configure TFTP server 5. Cache images (deploy kernel & ramdisk, filesystem, cloud_default_deploy_config, deploy_config and driver_actions) and write provision script for Bareon agent 6. Update MAC and PXE config 7. Set boot device to PXE 8. Reboot target node 9. Target node gets IP 10. PXE boot the Bareon agent 11. Bareon agent calls back 12. SFTP across provision script and forward rsync server port by SSH 13. Trigger provisioning by SSH: provision --data_driver ironic --deploy_driver rsync 14. Partition and clean local storage, mount partitions, rsync filesystem across, write fstab, configure bootloader and unmount partitions 15. Run driver actions over SSH, eg update BIOS, SFTP file across from Swift 16. Set boot device to local disk 17. Reboot node

Scaling Ironic Where are we at? Immediate focus point Diskless boot tested on a 128 node system Read only Cinder volume with multi-attach and overlay filesystem Ironic multi-conductor Immediate focus point Deploying OpenStack with Kolla Support scaling of OpenStack services Where do we want to go? 100,000 (?) nodes by 2018 for Shasta http://www.cray.com/blog/the-cray-shasta-system/

Thank you for listening