File System Numbers 4/18/2002 Michael Ferguson

Slides:



Advertisements
Similar presentations
Overview of local security issues in Campus Grid environments Bruce Beckles University of Cambridge Computing Service.
Advertisements

High Performance Cloud Storage Technical Overview.
October 15, 2002MASCOTS WebTraff: A GUI for Web Proxy Cache Workload Modeling and Analysis Nayden Markatchev Carey Williamson Department of Computer.
Web Canary -- client honey pot UTSA. Architecture of Web canary. 2.
Module 1: Installing Windows XP Professional
1 Module 1 The Windows NT 4.0 Environment. 2  Overview The Microsoft Operating System Family Windows NT Architecture Overview Workgroups and Domains.
Effective Discovery Techniques In Computer Crime Cases.
An Adaptable Benchmark for MPFS Performance Testing A Master Thesis Presentation Yubing Wang Advisor: Prof. Mark Claypool.
File Management Systems
Chapter 14 Chapter 14: Server Monitoring and Optimization.
Case Study: Windows 2000 Part I Will Richards CPSC 550 Spring 2001.
OS Fall ’ 02 Performance Evaluation Operating Systems Fall 2002.
Performance Evaluation
Chapter 10: File-System Interface Silberschatz, Galvin and Gagne ©2005 Operating System Concepts Chapter 10: File-System Interface File Concept.
Chapter 12 File Management Systems
1 Operating Systems Ch An Overview. Architecture of Computer Hardware and Systems Software Irv Englander, John Wiley, Bare Bones Computer.
OS Fall ’ 02 Performance Evaluation Operating Systems Fall 2002.
Hands-On Microsoft Windows Server 2003 Administration Chapter 6 Managing Printers, Publishing, Auditing, and Desk Resources.
Copyright Arshi Khan1 System Programming Instructor Arshi Khan.
Operating Systems.
Session 3 Windows Platform Dina Alkhoudari. Learning Objectives Understanding Server Storage Technologies Direct Attached Storage DAS Network-Attached.
Module 8: Monitoring SQL Server for Performance. Overview Why to Monitor SQL Server Performance Monitoring and Tuning Tools for Monitoring SQL Server.
NovaBACKUP 10 xSP Technical Training By: Nathan Fouarge
70-293: MCSE Guide to Planning a Microsoft Windows Server 2003 Network, Enhanced Chapter 14: Problem Recovery.
1 Introducing Windows Backup There are different methods for starting Windows 2000 Backup. Requirements for running Windows 2000 Backup All users can back.
Windows 2000 Memory Management Computing Department, Lancaster University, UK.
Chapter 2 IT Foundation Data: facts about objects Store data in computer: – binary data – bits – bytes Five types of data.
Offline Performance Monitoring for Linux Abhishek Shukla.
1 Chapter 12 File Management Systems. 2 Systems Architecture Chapter 12.
Managing and Monitoring Windows 7 Performance Lesson 8.
CHAPTER 2: COMPUTER-SYSTEM STRUCTURES Computer system operation Computer system operation I/O structure I/O structure Storage structure Storage structure.
Lecture No 11 Storage Devices
Transparent Process Migration: Design Alternatives and the Sprite Implementation Fred Douglis and John Ousterhout.
Distributed File Systems Overview  A file system is an abstract data type – an abstraction of a storage device.  A distributed file system is available.
Chapter 10: File-System Interface Silberschatz, Galvin and Gagne ©2005 Operating System Concepts – 7 th Edition, Jan 1, 2005 Chapter 10: File-System.
Silberschatz, Galvin and Gagne  Operating System Concepts Chapter 3: Operating-System Structures System Components Operating System Services.
Types of Operating Systems
Module 1: Installing Microsoft Windows XP Professional.
Background: Operating Systems Brad Karp UCL Computer Science CS GZ03 / M th November, 2008.
The Design and Implementation of Log-Structure File System M. Rosenblum and J. Ousterhout.
Performance Analysis of Real Traffic Carried with Encrypted Cover Flows Nabil Schear David M. Nicol University of Illinois at Urbana-Champaign Department.
Embedded System Lab. 서동화 The Design and Implementation of a Log-Structured File System - Mendel Rosenblum and John K. Ousterhout.
Types of Operating Systems 1 Computer Engineering Department Distributed Systems Course Assoc. Prof. Dr. Ahmet Sayar Kocaeli University - Fall 2015.
Monitoring and Managing Server Performance. Server Monitoring To become familiar with the server’s performance – typical behavior Prevent problems before.
Chapter 8: Installing Linux The Complete Guide To Linux System Administration.
11.1 Silberschatz, Galvin and Gagne ©2005 Operating System Principles 11.5 Free-Space Management Bit vector (n blocks) … 012n-1 bit[i] =  1  block[i]
European Laboratory for Particle Physics Window NT 4 Scaling/Performance Tests Alberto Di Meglio CERN IT/DIS/NCS.
THE WINDOWS OPERATING SYSTEM Computer Basics 1.2.
System Programming Basics Cha#2 H.M.Bilal. Operating Systems An operating system is the software on a computer that manages the way different programs.
Chapter Five Distributed file systems. 2 Contents Distributed file system design Distributed file system implementation Trends in distributed file systems.
Chapter 11 Analysis Methodology Spring Incident Response & Computer Forensics.
CITA 171 Section 1 DOS/Windows Introduction. DOS Disk operating system (DOS) –Term most often associated with MS-DOS –Single-tasking operating system.
Operating System (Reference : OS[Silberschatz] + Norton 6e book slides)
ITMT 1371 – Window 7 Configuration 1 ITMT Windows 7 Configuration Chapter 8 – Managing and Monitoring Windows 7 Performance.
Beyond Application Profiling to System Aware Analysis Elena Laskavaia, QNX Bill Graham, QNX.
1 Design and Implementation of a High-Performance Distributed Web Crawler Polytechnic University Vladislav Shkapenyuk, Torsten Suel 06/13/2006 석사 2 학기.
A Comparison of File System Workloads D. Roselli J. Lorch T. Anderson University of California, Berkeley.
Lesson 9: SOFTWARE ICT Fundamentals 2nd Semester SY
SQL Server Monitoring Overview
CPSC 641: LAN Measurement Carey Williamson
Chapter 1 Introduction to Operating System Part 5
Lecture 15 Reading: Bacon 7.6, 7.7
Distributed File Systems
Ch 9 – Distributed Filesystem
Carey Williamson Department of Computer Science University of Calgary
Operating Systems Structure
File System Interface (cont)
IBM Tivoli Storage Manager
The Design and Implementation of a Log-Structured File System
Presentation transcript:

File System Numbers 4/18/2002 Michael Ferguson

Why?  Make trace studies of filesystems to Inform development See trends in file system usage  Ask these questions How do people actually use filesystems? What to they store and how do they access their data? What caching strategies are best?  Filesystem statistics have wider implications Network activity may depend on these filesystem statistics (think of a web server)

What data do we gather?  User activity – e.g. number of users, amount of data transferred?  File access patterns – e.g. was the file read sequentially from start to finish?  File lifetimes – e.g. what percentage of files exist for less than a second?

File System Trace Studies  BSD Numbers from 1985 (Ousterhout & others)  Sprite Numbers from 1991(Ousterhout & others)  Windows NT numbers from 1999 (Vogels)

The BSD Study  Local BSD 4.2 filesystem on a 3 VAX-11/780s Ucbarpa – used by graduate students for program development and document formatting – 4 Mb of memory Ucbernie – used by grad students and by administration – 8 Mb of memory Ucbead – used to run CAD programs for EE – 16 Mb of memory  Average file accesses only a few hundred bytes/sec/user  75% of files open for less than ½ second  Many files only exist for a few seconds  File accesses tend to be sequential  Most file accesses are to short files but most bytes transferred are from large ones

Sprite Overview  Network-Oriented OS  File system servers and diskless workstations  Supports process migration

Sprite Study - Environment  mips workstations running Sprite  4 are fileservers  Memory averages 24Mb/workstation  Pmake commonly used to migrate processes and make use of idle workstations

Sprite Users  ~ ¼ OS researchers  ~ ¼ Architecture researchers design and simulate IO subsystems  ~ ¼ Researchers studying VSLI design and parallel processing  ~ ¼ Administrators, graphics researchers, and other people

Sprite – Measurement Approach  Instrumented kernels on file servers Kernel records trace of activity (open, close, delete, lseek, etc but not read or write) Kernel gives log to user process which records it in a file Can deduce exact range of bytes accessed lseek was modified to call file server Removed trace-file records and tape backup records  Total statistics are gathered in-kernel  I’ll talk about results in comparison with Windows

Windows NT Measurements  1998 – used 45 Windows NT 4 systems  Systems are used by one person at a time  Statistics are gathered with File system snapshots A transparent filter device driver for tracing

Windows trace summary

User Activity Comparison

File Access Pattern Comparison

File Lifetimes  Windows NT  Sprite

Sequential Runs - Comparison  Windows NT  Sprite

File Size Distribution - Comparison  Windows NT  Sprite

File Open Times - Comparison  Windows NT  Sprite

Windows NT interesting notes  Time between sequential reads and writes different – 90 microseconds for reads, 30 microseconds for writes  74% of sessions were opening files for control – not read or write common operation checks whether or not the volume is mounted

Statistical Gotcha!  The data from the Windows NT trace is not a Poisson process – it is better modeled by the Pareto distribution

Open requests vs. Poisson Process

What does it mean?  There is extreme variance at all time scales  Mean and variance of request distribution does not stabilize over time!  Other components have heavy-tail distribution as well: Process lifetime Number of DLLs accessed Number of files open per process Spacing of file accesses

File Size Distribution  File Sizes are not normally distributed!

Bottom Line – WinNT traces  Although all systems were interactive and used by a single person at a time 92% of file system operations were from processes that have no direct user input Even explorer.exe’s behavior does not come directly from the user “It is the structure and content of the filesystem that determines explorer’s file system interactions, not the user requests.”

Summary  We’ve followed several statistics through Sprite and Windows NT measurements Network filesystems are still feasible but Access is quite bursty Most accesses are for controlling files  But beware! Several statistical assumptions about filesystems seem to be just plain wrong

Summary