The Next Generation Root File Server Andrew Hanushevsky Stanford Linear Accelerator Center 27-September-2004

Slides:



Advertisements
Similar presentations
Operating System.
Advertisements

Distributed Systems Major Design Issues Presented by: Christopher Hector CS8320 – Advanced Operating Systems Spring 2007 – Section 2.6 Presentation Dr.
System Area Network Abhiram Shandilya 12/06/01. Overview Introduction to System Area Networks SAN Design and Examples SAN Applications.
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 9 Distributed Systems Architectures Slide 1 1 Chapter 9 Distributed Systems Architectures.
Lecturer: Sebastian Coope Ashton Building, Room G.18 COMP 201 web-page: Lecture.
Chapter 7 LAN Operating Systems LAN Software Software Compatibility Network Operating System (NOP) Architecture NOP Functions NOP Trends.
Serverless Network File Systems. Network File Systems Allow sharing among independent file systems in a transparent manner Mounting a remote directory.
Highly Available Central Services An Intelligent Router Approach Thomas Finnern Thorsten Witt DESY/IT.
CMPT 300: Final Review Chapters 8 – Memory Management: Ch. 8, 9 Address spaces Logical (virtual): generated by the CPU Physical: seen by the memory.
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
Distributed Systems Architectures
File System Implementation
CMPT 300: Final Review Chapters 8 – Memory Management: Ch. 8, 9 Address spaces Logical (virtual): generated by the CPU Physical: seen by the memory.
16: Distributed Systems1 DISTRIBUTED SYSTEM STRUCTURES NETWORK OPERATING SYSTEMS The users are aware of the physical structure of the network. Each site.
Chapter 8: Network Operating Systems and Windows Server 2003-Based Networking Network+ Guide to Networks Third Edition.
PRASHANTHI NARAYAN NETTEM.
.NET Mobile Application Development Introduction to Mobile and Distributed Applications.
Frangipani: A Scalable Distributed File System C. A. Thekkath, T. Mann, and E. K. Lee Systems Research Center Digital Equipment Corporation.
N-Tier Architecture.
File Systems and N/W attached storage (NAS) | VTU NOTES | QUESTION PAPERS | NEWS | VTU RESULTS | FORUM | BOOKSPAR ANDROID APP.
Selling the Database Edition for Oracle on HP-UX November 2000.
Experiences Deploying Xrootd at RAL Chris Brew (RAL)
Data Center Infrastructure
Computer System Architectures Computer System Software
Xrootd Andrew Hanushevsky Stanford Linear Accelerator Center 30-May-03.
©Ian Sommerville 2006Software Engineering, 8th edition. Chapter 12 Slide 1 Distributed Systems Architectures.
File System Access (XRootd) Andrew Hanushevsky Stanford Linear Accelerator Center 13-Jan-03.
UNIX SVR4 COSC513 Zhaohui Chen Jiefei Huang. UNIX SVR4 UNIX system V release 4 is a major new release of the UNIX operating system, developed by AT&T.
Operating System. Architecture of Computer System Hardware Operating System (OS) Programming Language (e.g. PASCAL) Application Programs (e.g. WORD, EXCEL)
Institute of Computer and Communication Network Engineering OFC/NFOEC, 6-10 March 2011, Los Angeles, CA Lessons Learned From Implementing a Path Computation.
Operating System Review September 10, 2012Introduction to Computer Security ©2004 Matt Bishop Slide #1-1.
Designing Authentication for a Microsoft Windows 2000 Network Designing Authentication in a Microsoft Windows 2000 Network Designing Kerberos Authentication.
Orbited Scaling Bi-directional web applications A presentation by Michael Carter
What is a Distributed File System?? Allows transparent access to remote files over a network. Examples: Network File System (NFS) by Sun Microsystems.
Module 11: Implementing ISA Server 2004 Enterprise Edition.
Selling the Storage Edition for Oracle November 2000.
Can we use the XROOTD infrastructure in the PROOF context ? The need and functionality of a PROOF Master coordinator has been discussed during the meeting.
Issues Autonomic operation (fault tolerance) Minimize interference to applications Hardware support for new operating systems Resource management (global.
Disco: Running Commodity Operating Systems on Scalable Multiprocessors Edouard et al. Madhura S Rama.
Xrootd Update Andrew Hanushevsky Stanford Linear Accelerator Center 15-Feb-05
ROOT and Federated Data Stores What Features We Would Like Fons Rademakers CERN CC-IN2P3, Nov, 2011, Lyon, France.
Network Operating Systems : Tasks and Examples Instructor: Dr. Najla Al-Nabhan
Performance and Scalability of xrootd Andrew Hanushevsky (SLAC), Wilko Kroeger (SLAC), Bill Weeks (SLAC), Fabrizio Furano (INFN/Padova), Gerardo Ganis.
Xrootd Present & Future The Drama Continues Andrew Hanushevsky Stanford Linear Accelerator Center Stanford University HEPiX 13-October-05
VMware vSphere Configuration and Management v6
Silberschatz, Galvin and Gagne ©2009 Operating System Concepts – 8 th Edition File System Implementation.
Full and Para Virtualization
Copyright © 2006, GemStone Systems Inc. All Rights Reserved. Increasing computation throughput with Grid Data Caching Jags Ramnarayan Chief Architect GemStone.
6.894: Distributed Operating System Engineering Lecturers: Frans Kaashoek Robert Morris
Xrootd Proxy Service Andrew Hanushevsky Heinz Stockinger Stanford Linear Accelerator Center SAG September-04
Install, configure and test ICT Networks
PROOF tests at BNL Sergey Panitkin, Robert Petkus, Ofer Rind BNL May 28, 2008 Ann Arbor, MI.
Cloud Computing – UNIT - II. VIRTUALIZATION Virtualization Hiding the reality The mantra of smart computing is to intelligently hide the reality Binary->
Latest Improvements in the PROOF system Bleeding Edge Physics with Bleeding Edge Computing Fons Rademakers, Gerri Ganis, Jan Iwaszkiewicz CERN.
Modul ke: Fakultas Program Studi Teknologi Pusat Data 13 FASILKOM Teknik Informatika Infrastruktur Pusat Data.
Background Computer System Architectures Computer System Software.
Cofax Scalability Document Version Scaling Cofax in General The scalability of Cofax is directly related to the system software, hardware and network.
Lecture 13 Parallel Processing. 2 What is Parallel Computing? Traditionally software has been written for serial computation. Parallel computing is the.
Networking Week #10 OBJECTIVES Chapter #6 Questions Review Chapter #8.
Distributed Systems Architectures Chapter 12. Objectives  To explain the advantages and disadvantages of different distributed systems architectures.
N-Tier Architecture.
Operating System.
Introduction to Networks
TYPES OFF OPERATING SYSTEM
Storage Virtualization
#01 Client/Server Computing
Support for ”interactive batch”
Database System Architectures
#01 Client/Server Computing
Presentation transcript:

The Next Generation Root File Server Andrew Hanushevsky Stanford Linear Accelerator Center 27-September

27-Sep-042: xrootd What is xrootd? File Server Provides high performance file-based access Scalable, extensible, naively usable Fault tolerant Server failures handled in a natural way Servers may be dynamically added and removed Secure Framework allows use of almost any protocol Rootd Compatible

27-Sep-043: xrootd Goals II Simplicity Can run xrootd out of the box No config file needed for non-complicated/small installations Generality Can configure xrootd for ultimate performance Meant for intermediate to large-scale sites

27-Sep-044: xrootd How Is high performance achieved? Rich but efficient server protocol Combines file serving with P2P elements Allows client hints for improved performance Pre-read, prepare, client access & processing hints, Multiplexed request stream Multiple parallel requests allowed per client An extensible base architecture Heavily multi-threaded Clients are dedicated threads whenever possible Extensive use of OS I/O features Async I/O, device polling, etc. Load adaptive reconfiguration.

27-Sep-045: xrootd xrootd Server Architecture Protocol Layer Filesystem Logical Layer Filesystem Physical Layer Filesystem Implementation Protocol & Thread Manager (included in distribution) p2p heart

27-Sep-046: xrootd Rootd Bilateral Compatibility xrootd rootd xrootd rootd compability TNetFile Application TNetFile TXNetFile Application rootd Client-Side Compatibility Server-Side Compatibility client

27-Sep-047: xrootd How performant is it? Can deliver data at disk speeds (streaming mode) Assuming good network & proper TCP buffer size Low CPU overhead 75% less CPU than NFS for same data load It is memory hungry, however. General requirements Middling speed machine The more CPU’s the better 1-2GB of RAM

27-Sep-048: xrootd How is scalability achieved? Protocol allows server scalability Server directed I/O segmenting Request deferral to pace client load Unsolicited responses for ad hoc client steering P2P elements for lashing servers together Request redirection key element Integrated with a P2P control network olbd servers provide control information

27-Sep-049: xrootd How does it scale? xrootd scales in multiple dimensions Can run multiple load balanced xrootd’s Provides single uniform name and data space Scales from 1 to over 32,000 cooperating data servers Architected as self-configuring structured peer-to-peer (SP 2 ) data servers Servers can be added & removed at any time Client (TXNetFile) understands SP 2 configurations xrootd informs client when running in this mode Client has more recovery options in the event of failure

27-Sep-0410: xrootd Load Balancing Implementation Control Interface (olbd) Load balancing meta operations Find files, change status, forwarded requests Data Interface (xrootd) Data is provided to clients Interfaces to olbd via the ofs layer Separation is important Allows use of any protocol Client need not know the control protocol

27-Sep-0411: xrootd Entities & Relationships data xrootd olbdxrootd olbd Data Clients Redirectors Data Servers M S ctl olbd Control Network Managers & Servers (resource info, file location) xrootd Data Network (redirectors steer clients to data Data servers provide data)

27-Sep-0412: xrootd Typical SP 2 Configuration Dynamic Selection redirector subscribe subscribe

27-Sep-0413: xrootd Example: SLAC Configuration client machines kan01kan02kan03kan04 kanxx bbr-olb03bbr-olb04 kanolb-a

27-Sep-0414: xrootd Why do this? Can transparently & incrementally scale Servers can come and go Load balancing effects recovery New servers can be added at any time Servers may be brought down for maintenance Files can be moved around in real-time Client simply adjust to the new configuration TXNetFile object handles recovery protocol

27-Sep-0415: xrootd What we have seen For a single server: 1,000 simultaneous clients 2,200 simultaneous open files Bottlenecks Disk I/O (memory next behind)

27-Sep-0416: xrootd What Have We Heard The system is too stable Users run extra-long jobs (1-2 weeks) now Error not discovered until weeks later The system is too aggressive New servers are immediately taken over Easy configuration but startling for administrators

27-Sep-0417: xrootd Next: Getting remote data SLAC IN2P3 RAL xrootd’s RAL proxy IN2P3 proxy Firewalls require Proxy servers

27-Sep-0418: xrootd Proxy Service Attempts to address competing goals Security Deal with firewalls Scalability Administrative Configuration Performance Ad hoc forwarding for near-zero wait time Intelligent caching in local domain

27-Sep-0419: xrootd Proxy Implementation Uses capabilities of olbd and xrootd Simply an extension of local load balancing Implemented as a special file system type Interfaces in the ofs layer Functions in the oss layer Primary developer is Heinz Stockinger

27-Sep-0420: xrootd Proxy Interactions client machines red01data02data03proxy01 local olb data01data02data03data04 proxy olb local olb RAL SLAC

27-Sep-0421: xrootd Why This Arrangement? Minimizes cross-domain knowledge Necessary for scalability in all areas Security Configuration Fault tolerance & recovery

27-Sep-0422: xrootd Scalable Proxy Security SLAC PROXY OLBDRAL PROXY OLBD Authenticate & develop session key 2 2 Distribute session key to authenticated subscribers 3 3 Data servers can log into each other using session key Data Servers

27-Sep-0423: xrootd Proxy Performance Introduces minimal latency overhead Virtually undetectably from US/Europe Negligible on faster links 2% slower on fast US/US links 10% slower on LAN Can be further improved Parallel streams Better window size calculation Asynchronous I/O

27-Sep-0424: xrootd Conclusion xrootd provides high performance file access Unique performance, usability, scalability, security, compatibility, and recoverability characteristics Should scale to tens of thousand clients Can support tens of thousand of servers Distributed as part of the CERN root package Open software, supported by SLAC (server) and INFN-Padova (client)