1 Use of AFS in the nanoCMOS project Christopher Bayliss

Slides:



Advertisements
Similar presentations
Overview of local security issues in Campus Grid environments Bruce Beckles University of Cambridge Computing Service.
Advertisements

The National Grid Service and OGSA-DAI Mike Mineter
Futures – Alpha Cloud Deployment and Application Management.
Privileged Account Management Jason Fehrenbach, Product Manager.
Operating-System Structures
A Computation Management Agent for Multi-Institutional Grids
Network+ Guide to Networks, Fourth Edition Chapter 10 Netware-Based Networking.
K.Harrison CERN, 23rd October 2002 HOW TO COMMISSION A NEW CENTRE FOR LHCb PRODUCTION - Overview of LHCb distributed production system - Configuration.
Secure Private Cloud Storage for Business. The Market Trend File Sharing Any Device Any Where Public clouds are good enough to personal users but security.
Amazon EC2 Quick Start adapted from EC2_GetStarted.html.
Module 1: Installing Active Directory Domain Services
Test Review. What is the main advantage to using shadow copies?
Microsoft Windows 2003 Server. Client/Server Environment Many client computers connect to a server.
Chapter 2: Installing and Upgrading to Windows Server 2008 R2 BAI617.
AFS & Kerberos Best Practices Workshop 2008 Design Goals Functions that require authentication Solution Space Kerberos, GSSAPI or SASL (Decide on your.
OM. Brad Gall Senior Consultant
Web Servers Web server software is a product that works with the operating system The server computer can run more than one software product such as .
IT:Network:Applications.  Installation by CD/DVD? ◦ OK for 1 or 2 machines, not 100’s ◦ Consistency  Share distribution point and let user install it.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America GENIUS server installation and configuration.
INFSO-RI Enabling Grids for E-sciencE The GENIUS Grid portal Tony Calanducci INFN Catania - Italy First Latin American Workshop.
Ideas to Improve SharePoint Usage 4. What are these 4 Ideas? 1. 7 Steps to check SharePoint Health 2. Avoid common Deployment Mistakes 3. Analyze SharePoint.
Grids and Portals for VLAB Marlon Pierce Community Grids Lab Indiana University.
Module 5: Designing a Terminal Services Infrastructure.
FTP Server and FTP Commands By Nanda Ganesan, Ph.D. © Nanda Ganesan, All Rights Reserved.
Nov 1, 2000Site report DESY1 DESY Site Report Wolfgang Friebel DESY Nov 1, 2000 HEPiX Fall
MCTS Guide to Microsoft Windows Server 2008 Applications Infrastructure Configuration (Exam # ) Chapter Four Windows Server 2008 Remote Desktop Services,
Module 7 Active Directory and Account Management.
Module 11: Implementing ISA Server 2004 Enterprise Edition.
1 Introduction to Microsoft Windows 2000 Windows 2000 Overview Windows 2000 Architecture Overview Windows 2000 Directory Services Overview Logging On to.
RAL PPD Computing A tier 2, a tier 3 and a load of other stuff Rob Harper, June 2011.
Support in setting up a non-grid Atlas Tier 3 Doug Benjamin Duke University.
Module 1: Implementing Active Directory ® Domain Services.
Andrew McNab - Manchester HEP - 11 May 2001 Packaging / installation Ready to take globus from prerelease to release. Alex has prepared GSI openssh.
Satisfy Your Technical Curiosity Specialists Enterprise Desktop -
Module 4 Planning for Group Policy. Module Overview Planning Group Policy Application Planning Group Policy Processing Planning the Management of Group.
CEOS Working Group on Information Systems and Services - 1 Data Services Task Team Discussions on GRID and GRIDftp Stuart Doescher, USGS WGISS-15 May 2003.
11 CLUSTERING AND AVAILABILITY Chapter 11. Chapter 11: CLUSTERING AND AVAILABILITY2 OVERVIEW  Describe the clustering capabilities of Microsoft Windows.
Lesson 12: Configuring Remote Management
EXPOSING OVS STATISTICS FOR Q UANTUM USERS Tomer Shani Advanced Topics in Storage Systems Spring 2013.
Configuring and Troubleshooting Identity and Access Solutions with Windows Server® 2008 Active Directory®
Intro to Datazen.
1 e-Science AHM st Aug – 3 rd Sept 2004 Nottingham Distributed Storage management using SRB on UK National Grid Service Manandhar A, Haines K,
Stairway to the cloud or can we take the highway? Taivo Liik.
Creating SmartArt 1.Create a slide and select Insert > SmartArt. 2.Choose a SmartArt design and type your text. (Choose any format to start. You can change.
Module 10: Windows Firewall and Caching Fundamentals.
Linux Operations and Administration
1 Active Directory Service in Windows 2000 Li Yang SID: November 2000.
USGS GRID Exploratory Status Review Stuart Doescher Mike Neiers USGS/EDC May
Status of Globus activities Massimo Sgaravatto INFN Padova for the INFN Globus group
MGRID Architecture Andy Adamson Center for Information Technology Integration University of Michigan, USA.
Directory Services CS5493/7493. Directory Services Directory services represent a technological breakthrough by integrating into a single management tool:
CEG 2400 FALL 2012 Linux/UNIX Network Operating Systems.
Anetd and the Abone SRI International Livio Ricciulli.
Automating Installations by Using the Microsoft Windows 2000 Setup Manager Create setup scripts simply and easily. Create and modify answer files and UDFs.
Grid Execution Management for Legacy Code Architecture Exposing legacy applications as Grid services: the GEMLCA approach Centre.
Active Directory Domain Services (AD DS). Identity and Access (IDA) – An IDA infrastructure should: Store information about users, groups, computers and.
A System for Monitoring and Management of Computational Grids Warren Smith Computer Sciences Corporation NASA Ames Research Center.
Microsoft Installing & Configuring Windows Server Exam Questions Answers Powered By:
Pass Microsoft Installing and Configuring Windows Server 2012 exam in just 24 HOURS! 100% REAL EXAM QUESTIONS ANSWERS Microsoft Installing.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Barracuda SSL VPN Remote, Authenticated Access to Applications and Data.
Barracuda SSL VPN Remote, Authenticated Access to Applications and Data.
Create setup scripts simply and easily.
2016 Citrix presentation.
Cloud based Open Source Backup/Restore Tool
Unit 27: Network Operating Systems
Software Defined Networking (SDN)
Chapter 2: System Structures
Jerald Overstreet, GISP Server Portal SQL Manager Admin
Presentation transcript:

1 Use of AFS in the nanoCMOS project Christopher Bayliss

2 The nanoCMOS project Designing > 40n-10n scale CMOS circuits. EPSRC funded project. 6 Academic and 6 Industrial partners. Simulates CPUs from transistor up. Based on transistor and circuit designs from industrial partners. 2

33

44

5

6

7

8

9 Current Status of nanoCMOS Still in development Running small device simulations on internal and partner clusters. –NGS deployment in testing. Non-developer users soon. 9

10 nanoCMOS A grid project –x509 certificates GSI proxies –SOAP / HTTP –Globus / OMII Required for NGS access. –Ideological attachment to grid solutions. Uses the Virtual Organisation model. –No overarching organisation. –Set of subsets –Lots of administrative boundaries. 10

11 Virtual Organisations Common pattern –User creates proxy certificate –Proxy used to access service –Service accesses user’s resources. Traditionally x509 certificates and GSI proxy certificates. –Users have problems with them –Complications with CAs Kerberos friendlier. –Built in or easily available support. –Many apps support it. –Drop to PAM when not natively supported. 11

12 Why choose AFS? Need for some form of distributed storage. –Secure –WAN Initial options AFS, SRB and custom code. –Those that had used it said SRB was horrible. –We liked the idea of a conventional file system. –We had some AFS expertise on the project. Two sites run AFS cells. –BaBar tried this before. According to my first suggested Oct 2007 as part of some other work. 12

13 Heterogeneous with a capital H 6 academic partners –No policies governing HW or SW. Using 3rd party compute resources. –No common job submission. –No common software environment. –No common architecture. Uses purchased and “acquired” equipment. 13

14 What does nanoCMOS use AFS for? Hosting: –Input data –Simulation software –Authentication tools –User space Storage for: –Simulation log files. –Simulation results. –Individual user accounts. 14

15 Current AFS infrastructure One AFS cell. –NESC.GLA.AC.UK –Hosted at NeSC Glasgow 2x Sun 12TB X GB currently in use Solaris ZFS SMF management scripts Currently OpenAFS Kerberos cell. –Also at NeSC. Single master / slave pair. GSSklog 15

16 The Cell 16

17 Current AFS infrastructure cont. Users have ssh access to an AFS client –Play around without installing. –Can push / pull files straight away. 17

18 Where? Client deployments –Windows, OS X and Linux clients –Clients on developer’s systems at all 6 sites. –Clients installed on 5 clusters. EE at Glasgow nodes [1160 cores] ScotGrid at Glasgow nodes [1916 cores] NeSC test cluster at Glasgow - 14 nodes [28 cores]. Manchester - 48 nodes [256 cores] Edinburgh nodes [1456 cores] 18

19

20 Issues The batch systems we use have x509 based authentication. –No Kerberos on the worker. –GSSklog stored in the cell readable by anyuser Admins not expecting to let a UDP protocol through their firewall. –Shouting / Patience. –tcpdump 20

21 Getting AFS installed Most clusters complex and fragile –Multiple submission mechanisms –Multiple users Differing and conflicting needs. –Heavily loaded Admins do not like the words kernel, firewall or reboot. –A constant stream of 30 day jobs on some workers. 21

22 Issues Bursty load –x000 jobs reading and writing from the same file, directory or volume. Many clients. –5000 cores available on partner clusters. ~8000 when primary NGS sites included. Capacity –Users with dozens of 1GB files to store. No metadata –Built own metadata service Directory permissions 22

23 Pitfalls The network is outside our control. –Debugging connectivity issues is painful and slow. Most clusters outside our control. –Must ask local admins nicely to add or fix AFS. –Must work through local nanoCMOS people. Most clients outside our control. –Random versions of AFS on a random OS. –Homogenisation not an option. 23

24 Good aspects of AFS Allows us to bypass staging data and executables to nodes. –Can be a real problem on the NGS for several reasons. pre-WS GRAM job submission. Unreliable environment at sites. PAGsh –Useful when running jobs on a shared cluster node. Easier to set up than equivalent NFSv4 Kerberos useful for services other than AFS. 24

25 Good aspects of –We run a lot of code from AFS. 25

26 Bad aspects of AFS Requires kernel module. –Not easy thing to get system admins to install. Debugging –Either no information or too much NAT –Fiddly to set up. –Almost all clusters use NAT. Not simple to modify client config. –You can’t simply make changes to the config and restart afsd. –Can’t modify client if you need some setting Currently test then continue / quit. 26

27 Good features of Kerberos Users understand passwords Can use Kerberos for other services. –Currently ssh with forwarding. Create users with expired passwords. –If they can log in and change their password they probably set it up OK. 27

28 Wish List GSSAPI authentication. Better NAT behaviour. Better out of the box settings. –Especially for clients. Single cache on a cluster. –Save bandwidth on parameter sweep jobs. Consistent command line parameters 28

29 Future AFS usage at Glasgow Starting to be used for data for other projects from other departments. –We need somewhere to store data that isn’t a drawer of USB keys. Quick way to backup data from servers. 29

30 Future AFS usage in nanoCMOS Improve server design. Probable move to storing cell and realm details in DNS – Centralise configuration somewhere we control. 30

31 Global file space simplifies user’s problems –Put data onto AFS from desktop –Copy paths into job –Retrieve data from path specified on desktop. 31