A Filesystem for Lots Of Little Files (LOL FS) Allison Regier and Karen Hollingsworth University of Notre Dame.

Slides:



Advertisements
Similar presentations
Memory Address Decoding
Advertisements

Chapter 4 : File Systems What is a file system?
Computer Forensics BACS 371
Day 29 File System.
Memory Management (II)
File System Implementation CSCI 444/544 Operating Systems Fall 2008.
File Systems Implementation
Ceng Operating Systems
Data Representation CS105. Data Representation Types of data: – Numbers – Text – Audio – Images & Graphics – Video.
Data Representation Kieran Mathieson. Outline Digital constraints Data types Integer Real Character Boolean Memory address.
assumes basic arithmetic
Unit 3—Part A Computer Memory
Manipulating Strings.
Bits and Data Storage. Basic Hardware Units of a Computer.
Communications Technology 2104 Mercedes Lahey. Bit 1. bit=From a shortening of the words “binary digit” 2. the basic unit of information for computers.
11 Aug 2015Computer introduction1 Storage devices Bits, Bytes, Kilobytes, MB, GB, Terabytes Hardware Moore’s law Disks Internal hard disk TB (
Bits and Bytes in a computers memory Inside the computer are millions of electronic switches. These are grouped together in bundles of 8. A switch can.
Aloha Aloha What you see: What the computer sees: binary number columns binary number columns
CS 111 – Aug – 1.3 –Information arranged in memory –Types of memory –Disk properties Commitment for next day: –Read pp , In other.
Data Representation A series of eight bits is called a byte. A byte can be used to represent a number or a character. As you’ll see in the following table,
Hardware Data Storage.
Data Representation and Storage Lecture 5. Representations A number value can be represented in many ways: 5 Five V IIIII Cinq Hold up my hand.
Fill in the blanks: (1) _________ has only two possible values 0 and 1. (2) There are __________bits in a byte. (3) 1 kilobyte of memory space can store.
What do computers know?  All they really know is on or off.  Kind of like a light switch  Computers aren’t nearly as smart as you are!
Bits & Bytes Created by Chris McAbee For AAMU AGB199 Extra Credit Created from information copied and pasted from
OBJECTIVES  Explain why a computer represents data in the form of binary  Explain the terms related to data storage: bit, byte, character, word  Calculate.
Getting to know Storage Media 1.Stores information 2.Retrieve information for later use.
Inside your computer. Hardware Review Motherboard Processor / CPU Bus Bios chip Memory Hard drive Video Card Sound Card Monitor/printer Ports.
Digital Literacy Lesson 3. The Role of Memory A computer stores data in the memory when a task is performed. Data is stored in the form of 0s and 1s.
Operating Systems COMP 4850/CISG 5550 File Systems Files Dr. James Money.
Unit 2—Part A Computer Memory Computer Technology (S1 Obj 2-3)
Chapter 1 Data Storage © 2007 Pearson Addison-Wesley. All rights reserved.
Chapter 4 Memory Management Virtual Memory.
Chapter 14: Files and Streams. 2Microsoft Visual C# 2012, Fifth Edition Files and the File and Directory Classes Temporary storage – Usually called computer.
Sorting by the Numbers Sorting Part Four. Question Suppose you are given the task of writing an application to sort a big data file. What do you need.
Data Storage © 2007 Pearson Addison-Wesley. All rights reserved.
FAT File Allocation Table
Measuring Memory and Storage
IT Essentials Chapter 1 Part C. 1.5 Math for a Digital Age.
Data Storage © 2007 Pearson Addison-Wesley. All rights reserved.
Computer Performance. Hard Drive - HDD Stores your files, programs, and information. If it gets full, you can’t save any more. Measured in bytes (KB,
Computer Fundamental - ITD13821 Lecture 01 Extra Numbering System.
 Computers are 2-state devices › Pulse – No pulse › On – Off  Represented by › 1 – 0  BINARY.
Binary a. express numbers in binary, binary-coded decimal (BCD), octal and hexadecimal;
CC111 Lec#2 The System Unit The System Unit: Processing and Memory Lecture 2 Binary System.
File I/O. I/O Flags Flags are passed to give some information about how the file is to be used. – Read only file – flag=0x0 – Write only file – flag=0x1.
CENG334 Introduction to Operating Systems
Memory COMPUTER ARCHITECTURE
Data Representation N4/N5.
The Data Types and Data Structures
Linux 103 Training MOdule Basic System Mgmt.
CS703 - Advanced Operating Systems
Storage Hardware This icon indicates the slide contains activities created in Flash. These activities are not editable. For more detailed instructions,
Computer Memory Digital Literacy.
Bits and bytes September 19, 2017.
Memory Parts of a computer
What is Binary? Binary is a two-digit (Base-2) numerical system, which computers use to process and store data. The reason computers use the binary system.
Intermediate 2 Computing
File Sizes Bits and Bytes.
3 - STORAGE: DATA CAPACITY CALCULATIONS
Unit 2 Computer Memory Computer Technology (S1 Obj 2-3)
M3 - Estimating the size of a database
Unit 3—Part A Computer Memory
Unit 3—Part A Computer Memory
Storage Devices and Data Capacity
Bits, Bytes, and Storage.
The Design and Implementation of a Log-Structured File System
Presentation transcript:

A Filesystem for Lots Of Little Files (LOL FS) Allison Regier and Karen Hollingsworth University of Notre Dame

Both of us use many small files in our research projects. The Problem y1 x1 r1 y2 x2 r2 24 byte files composed of approximately 24 ascii characters: Files that are 200 to 1000 bytes each, of ascii text: AAGTCACTCG…

Lots of little files require lots of overhead space and time! The Problem Amount of Data Total Disk Space Used Time Single File1,000,000 Bytes = 977 KB 984 KB< 1 second 100,000 Small Files 1,000,000 Bytes = 977 KB 394 MB1106 seconds

We use a single large file behind-the-scenes for many small files of data. The Solution Name size flags filedata Name size flags filedata Name … variable Number of bytes for each field

 Assumes files will be written once and not modified  Achieves excellent space performance!  Needs improvements in time performance Characteristics our file system The Solution

Writing 10,000 files, 10 bytes each Initial Results TimeSpace Linux seconds kilobytes LOL FS seconds 348 kilobytes

The Road Ahead  Improve the time performance of our filesystem  Fix memory errors that occur during large-scale testing  Perform more tests to compare our filesystem to a traditional file system Future Work