Xrootd and clouds Doug Benjamin Duke University. Introduction Cloud computing is here to stay – likely more than just Hype (Gartner Research Hype Cycle.

Slides:



Advertisements
Similar presentations
SLA-Oriented Resource Provisioning for Cloud Computing
Advertisements

What’s New: Windows Server 2012 R2 Tim Vander Kooi Systems Architect
Locality-Aware Dynamic VM Reconfiguration on MapReduce Clouds Jongse Park, Daewoo Lee, Bokyeong Kim, Jaehyuk Huh, Seungryoul Maeng.
Duke Atlas Tier 3 Site Doug Benjamin (Duke University)
Duke and ANL ASC Tier 3 (stand alone Tier 3’s) Doug Benjamin Duke University.
“It’s going to take a month to get a proof of concept going.” “I know VMM, but don’t know how it works with SPF and the Portal” “I know Azure, but.
Ceph vs Local Storage for Virtual Machine 26 th March 2015 HEPiX Spring 2015, Oxford Alexander Dibbo George Ryall, Ian Collier, Andrew Lahiff, Frazer Barnsley.
CMS Diverse use of clouds David Colling
Chien-Chung Shen Google Compute Engine Chien-Chung Shen
Installing software on personal computer
1 Bridging Clouds with CernVM: ATLAS/PanDA example Wenjing Wu
Microsoft Load Balancing and Clustering. Outline Introduction Load balancing Clustering.
Ronen Gabbay Microsoft Regional Director Yside / Hi-Tech College
Analysis support issues, Virtual analysis facilities (ie Tier 3s in the cloud) Doug Benjamin (Duke University) on behalf of Sergey Panitkin, Val Hendrix,
Additional SugarCRM details for complete, functional, and portable deployment.
System Center 2012 Setup The components of system center App Controller Data Protection Manager Operations Manager Orchestrator Service.
Tier 3g Infrastructure Doug Benjamin Duke University.
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
Introduction To Windows Azure Cloud
OSG Public Storage and iRODS
Appendix B Planning a Virtualization Strategy for Exchange Server 2010.
Grid Appliance – On the Design of Self-Organizing, Decentralized Grids David Wolinsky, Arjun Prakash, and Renato Figueiredo ACIS Lab at the University.
Grid Developers’ use of FermiCloud (to be integrated with master slides)
Tier 3 Data Management, Tier 3 Rucio Caches Doug Benjamin Duke University.
Large Scale Test of a storage solution based on an Industry Standard Michael Ernst Brookhaven National Laboratory ADC Retreat Naples, Italy February 2,
Module 11: Implementing ISA Server 2004 Enterprise Edition.
Distributed Video Rendering using Blender, VirtualBox, and BOINC. Christopher J. Reynolds. Centre for Parallel Computing, University of Westminster.
Advanced Topics StratusLab Tutorial (Orsay, France) 28 November 2012.
D C a c h e Michael Ernst Patrick Fuhrmann Tigran Mkrtchyan d C a c h e M. Ernst, P. Fuhrmann, T. Mkrtchyan Chep 2003 Chep2003 UCSD, California.
Support in setting up a non-grid Atlas Tier 3 Doug Benjamin Duke University.
Atlas Tier 3 Virtualization Project Doug Benjamin Duke University.
Ocean Observatories Initiative CEI Demonstrations Tim Freeman OOI Cyberinfrastructure Life Cycle Objectives Milestone Review, Release 1 San Diego, CA February.
NA61/NA49 virtualisation: status and plans Dag Toppe Larsen CERN
GLIDEINWMS - PARAG MHASHILKAR Department Meeting, August 07, 2013.
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
CSE 548 Advanced Computer Network Security Trust in MobiCloud using Hadoop Framework Updates Sayan Kole Jaya Chakladar Group No: 1.
Doug Benjamin Duke University. 2 ESD/AOD, D 1 PD, D 2 PD - POOL based D 3 PD - flat ntuple Contents defined by physics group(s) - made in official production.
(ITI310) By Eng. BASSEM ALSAID SESSIONS 9: Dynamic Host Configuration Protocol (DHCP)
Turn Bare Metal Into Silver Lining With SCVMM 2012, Today! Mark Rhodes OBS SESSION CODE: SEC313 (c) 2011 Microsoft. All rights reserved.
US Atlas Tier 3 Overview Doug Benjamin Duke University.
Atlas Software Structure Complicated system maintained at CERN – Framework for Monte Carlo and real data (Athena) MC data generation, simulation and reconstruction.
RALPP Site Report HEP Sys Man, 11 th May 2012 Rob Harper.
Launch Amazon Instance. Amazon EC2 Amazon Elastic Compute Cloud (Amazon EC2) provides resizable computing capacity in the Amazon Web Services (AWS) cloud.
Latest Improvements in the PROOF system Bleeding Edge Physics with Bleeding Edge Computing Fons Rademakers, Gerri Ganis, Jan Iwaszkiewicz CERN.
Wide-Area Xrootd testing at MWT2 Sarah Williams Indiana University US ATLAS Tier2/Tier3 Workshop at UChicago
Capacity Planning in a Virtual Environment Chris Chesley, Sr. Systems Engineer
Deploying Highly Available SQL Server in Windows Azure A Presentation and Demonstration by Microsoft Cluster MVP David Bermingham.
Cloud computing and federated storage Doug Benjamin Duke University.
Oracle 10g database installation kit  A bundle of scripts which allows to install Oracle 10g database server on a single node: Useful for both experienced.
Claudio Grandi INFN Bologna Virtual Pools for Interactive Analysis and Software Development through an Integrated Cloud Environment Claudio Grandi (INFN.
PRIN STOA-LHC: STATUS BARI BOLOGNA-18 GIUGNO 2014 Giorgia MINIELLO G. MAGGI, G. DONVITO, D. Elia INFN Sezione di Bari e Dipartimento Interateneo.
Storage discovery in AliEn
A MAS to Manage and Monitor SLA for Cloud Computing
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING CLOUD COMPUTING
Bentley Systems, Incorporated
ALICE & Clouds GDB Meeting 15/01/2013
Doug Benjamin Duke University On Behalf of the Atlas Collaboration
Introduction to Distributed Platforms
Blueprint of Persistent Infrastructure as a Service
Dag Toppe Larsen UiB/CERN CERN,
Progress on NA61/NA49 software virtualisation Dag Toppe Larsen Wrocław
Dag Toppe Larsen UiB/CERN CERN,
ATLAS Cloud Operations
StratusLab Final Periodic Review
StratusLab Final Periodic Review
StratusLab Tutorial (Bordeaux, France)
WLCG Collaboration Workshop;
VMDIRAC status Vanessa HAMAR CC-IN2P3.
NTC 324 Teaching Effectively-- snaptutorial.com
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
Presentation transcript:

Xrootd and clouds Doug Benjamin Duke University

Introduction Cloud computing is here to stay – likely more than just Hype (Gartner Research Hype Cycle 2011)

Storage and clouds What is our Cloud compute/storage model? o Mostly computational (MC generation/simulation) Xrootd does not play much of a role here o User analysis? Storage is clearly an issue o Does one use the Storage in the cloud (EC3)? o Does one use the Storage provided by each VM How do we stitch it all together? o Should federate the storage within a cloud cluster o Xrootd could be a solution here (multi CPU VM’s for data serving and computation) o Initial work focusing on federating the worker node storage

Current status Using Fermi-Cloud for initial testing o Open Nebulus configuration o “static” VM’s Instantiated by hand based on configuration files OS is an approved image (1.5 GB SL 5 based) o Preconfigured and “static” Static / dynamically assigned IP External address (at least tested within FNAL campus) o Given 6 VM’s Redirector 3 data servers Xrootd proxy machine Interactive machine Have begun to reconfigure VM (add more disk space) and reinstantiate

Lessons Learned so far Need to have contextualization/configuration mechanism in place o Several options: o CernVM Copilot approach Jabber/XMPP messaging approach (used by Google Chat etc) o Puppet to configure the machines on the “fly” Need to setup the puppet master first (likely on Xrootd redirector for simplicity) o All nodes need to know the redirector IP and IP addresses are dynamic so need to discover this right away and place information into configuration files Need to fetch in small script to run setup puppet clients before puppet Install Xrootd on the fly through puppet Teach Puppet to setup Xrootd o Need to understand the security and account issues o Some combination of Both

Next steps Collaborate with CernVM team on Lxcloud to study the contextualization issues o Contextualize a VM creation o Contextualize with some information at start and connect to “central” server (puppet master) Prepare for the dynamic nature of the cloud o Could/will have more than 64 data servers in a cloud cluster o Need to dynamically setup Xrootd supervisors as the data server count grows Test effect of Xrootd redirector/Xrootd supervisors on VM’s vs all on one physical machine Understand virtualization I/O penalty better o Use the suite of typical user analysis jobs being collected within US ATLAS analysis support team

Longer term goals See if can use Xrootd to cluster ephemeral cloud worker storage o Automatic discovery and configuration the key here. Can we then federate this storage into the Xrootd federation? What are the cost model for this useage o Network/Storage costs Can we launch a cloud cluster/ load data / grow the cluster as needed, extract the precious data and shutdown the cluster? See if this all can be part of a Tier 3 cluster in the cloud (local analysis site, with interactive access)

Conclusions Cloud computing is here to stay Just starting the process of evaluating federating storage on cloud services – o Are we at the “peak of inflated expectations”? o “Through of disillusionment”? Time is short – 2014 is closer than we think Is this something that there is multi experiment appeal? How should we proceed?