Lecture 2: CS623 2/3/2004 © Joel Wein 2003, modified by T. Suel
2
Roadmap Brief Trail through the LINUX Kernel – http://www.win.tue.nl/~aeb/linux/vfs/trail.html Memory Management LINUX – Page Frame Management – Memory Area Management Virtual Memory
3
Memory Management Issues/Requirements Techniques – Fixed Partitioning – Dynamic Partitioning – Simple Paging – Simple Segmentation – Important Concept: Physical vs. Logical Addresses
4
Memory Although the cost of memory has dropped substantially, there is never enough main memory to hold all of the programs and data structures needed by active processes and by the operating system. Need to bring in and swap out blocks of data from secondary memory. – Chapter 7: Memory Management. Basic techniques. – Chapter 8: Virtual Memory. Allow each process to behave as if it had unlimited main memory at its disposal.
5
Memory Management Requirements Relocation – Need the ability to relocate a process to different areas of memory, and to handle its memory references correctly afterward. Protection – Programs in other processes should not be able to reference memory locations in a process without permission. Complications: relocation, and dynamic calculation of addresses at run time. Note: memory protection must be enforced by the processor rather than the OS. Can't pre-screen a program; it is only possible to assess the permissibility of a memory reference at the time the referencing instruction executes. Processor hardware must have that capability.
6
More Requirements Sharing – Several processes access the same main memory areas. Logical Organization – If the OS/hardware can effectively deal with user programs and data in the form of modules of some sort, then… Modules can be written and compiled independently Different degrees of protection for different modules Sharing of modules is easier. Physical Organization – Handle the two levels of main and secondary memory.
7
Memory Management Techniques Principal operation of memory management is to bring programs into main memory for execution by the processor. Simple Techniques: – Fixed Partitioning – Dynamic Partitioning – Simple Paging – Simple Segmentation Built upon these: virtual memory.
8
Fixed Partitioning Main memory is divided into a number of static partitions at system generation time. A process may be loaded into a partition of equal or greater size. – Equal-size partitions: If a program is too big, it needs overlays. Internal fragmentation -- the block of data loaded is smaller than the partition. – Unequal-sized partitions help somewhat Placement: – Smallest partition into which it will fit (one process queue per partition) – Smallest available partition into which it will fit (one queue)
9
Dynamic Partitioning Partitions are of variable length and number. Partitions are created dynamically, so that each process is loaded into a partition of exactly the same size as the process. Holes are created when processes are swapped out. – External fragmentation: memory that is external to all partitions becomes increasingly fragmented. Compaction: Shift processes so they are contiguous. Pro: No internal fragmentation! More efficient use of main memory Con: Compaction is CPU intensive. Con: Complex to maintain
10
Placement Algorithm for Dynamic Partitioning Best-Fit – Block that is closest in size. – Leaves behind small fragments First-fit – Scan from beginning of memory – Pretty good! Next-fit – Scan from location of last placement – Quickly chews up end of memory which otherwise would usually be the largest block.
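The three placement strategies can be sketched over a simple list of free holes. Everything here (the `(start, size)` hole format and the function names) is illustrative, not an actual allocator:

```python
# Sketch of the three placement strategies over a list of free holes,
# each hole given as a (start, size) pair. Names and data are hypothetical.

def best_fit(holes, request):
    """Return start of the smallest hole that fits, or None."""
    fitting = [h for h in holes if h[1] >= request]
    return min(fitting, key=lambda h: h[1])[0] if fitting else None

def first_fit(holes, request):
    """Return start of the first (lowest-address) hole that fits."""
    for start, size in sorted(holes):
        if size >= request:
            return start
    return None

def next_fit(holes, request, last):
    """Like first-fit, but resume scanning after address `last`."""
    ordered = sorted(holes)
    rotated = ([h for h in ordered if h[0] > last] +
               [h for h in ordered if h[0] <= last])
    for start, size in rotated:
        if size >= request:
            return start
    return None

holes = [(0, 100), (300, 40), (500, 200)]
print(best_fit(holes, 40))        # smallest fitting hole: 300
print(first_fit(holes, 40))       # lowest-address fitting hole: 0
print(next_fit(holes, 40, 300))   # resumes after 300: hole at 500
```

Note how next-fit, by resuming after the last placement, tends to carve up the large hole at the end of memory, exactly the drawback the slide mentions.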
11
Relocation Issues Fixed Partitioning: – Could expect that process always assigned to same partition. (One process queue per partition.) – In this case all relative memory references in code could be replaced by absolute main memory addresses, determined by base address of loaded process.
12
Relocation II If a process can be swapped back into different memory locations, or if we use compaction, the locations of data and instructions referenced by the process are not fixed. We need to distinguish between: – Logical address: a reference to a memory location independent of the current assignment of data to memory. A translation is needed to actually use it. Relative address: an address expressed as a location relative to some known point, like the start point of the program. – Physical address/Absolute address: an actual location in the main memory chips.
13
Relocation III Programs that use relative addresses are loaded using dynamic run-time loading. – All memory references in loaded process are relative to the origin of the program. – Need hardware mechanism to translate relative addresses to physical main memory addresses at time of execution.
14
Paging Combat internal and external fragmentation. Main memory is divided into a number of equal-size frames. Each process is divided into a number of equal-size pages of the same length as frames. A process is loaded by loading all of its pages into available, not necessarily contiguous, frames. OS maintains a page table for each process. Shows frame location for each page of the process. Use page sizes that are powers of 2.
15
How Does it Work? Within the program, each logical address consists of page number and offset within the page. Processor hardware still does logical-to-physical translation. – Now processor must know how to access page table of the current process. – Presented with logical address (page number, offset) it uses page table to produce (frame number, offset).
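With a power-of-two page size, the split into (page number, offset) and the table lookup are just arithmetic. A minimal sketch; the page-table contents below are made up:

```python
# Minimal sketch of paging address translation with a power-of-two page
# size: the page number indexes a per-process page table, and the offset
# is carried through unchanged. The page table here is hypothetical.

PAGE_SIZE = 4096  # 4 KB, so the offset is the low 12 bits

page_table = {0: 5, 1: 2, 2: 7}  # page number -> frame number

def translate(logical):
    page, offset = divmod(logical, PAGE_SIZE)
    frame = page_table[page]          # a missing entry would be a fault
    return frame * PAGE_SIZE + offset

print(hex(translate(0x1234)))  # page 1 -> frame 2, offset 0x234 -> 0x2234
```

Because the page size is a power of two, the hardware performs this split with a shift and a mask rather than a division, which is why the slide insists on power-of-2 page sizes.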
16
Simple Paging No external fragmentation! A small amount of internal fragmentation!
17
Simple Segmentation Each process is divided into a number of segments of potentially different sizes. A process is loaded by loading all of its segments into dynamic partitions that need not be contiguous. – Logical address is now (segment number, offset). No internal fragmentation, like dynamic partitioning. Comparison with dynamic partitioning: – Program may occupy more than one (non-contiguous) partition. – Suffers from external fragmentation, but not as much because process broken up into a number of smaller pieces.
18
LINUX Memory Management LINUX takes advantage of the 80x86's segmentation and paging circuits to translate logical addresses into physical ones. Some portion of RAM is permanently assigned to the kernel; the remaining part of RAM is dynamic memory. – Need a robust and efficient strategy for allocating groups of contiguous page frames. The 80x86 supports two page sizes: 4KB and 4MB Three memory zones: DMA, NORMAL, HIGHMEM
19
LINUX: Buddy System Goal: combat (external) fragmentation. – Just use the paging circuitry to make noncontiguous frames look contiguous – Or have a clever strategy to keep things contiguous – The second approach is better because… Sometimes you really need contiguous page frames – buffers for a DMA processor – DMA ignores the paging circuitry. Leaves kernel page tables unchanged (TLB performance issues) Can also use 4MB pages of contiguous memory – makes things faster.
20
LINUX: Buddy System A compromise between: – Fixed: may use space inefficiently; limits the number of active processes. – Dynamic: complex, with compaction overhead. All free page frames are grouped into 10 lists of blocks that contain groups of 1, 2, 4, …, 512 contiguous 4KB page frames (powers of two), respectively.
21
Buddy System, continued Let's say you need a block of 128 page frames. – If one is on the 128-list, grab it. – If not, look on the 256-list. If one is there, take 128 frames and put the other 128 on the 128-list. If not, look on the 512-list – Take 128 – Put 256 on the 256-list – Put the other 128 on the 128-list.
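That allocation walk can be sketched as a recursive split over free lists indexed by block size (in page frames). The starting free-list contents are hypothetical:

```python
# Sketch of buddy allocation: if the requested size is unavailable, split
# the next larger block and keep the unused half as a free buddy.
# The initial free-list state (one free 512-frame block) is made up.

free_lists = {128: [], 256: [], 512: [0]}  # size -> starting frame numbers

def alloc(size):
    if free_lists[size]:
        return free_lists[size].pop()
    bigger = size * 2
    if bigger not in free_lists:
        raise MemoryError("no block large enough")
    start = alloc(bigger)                   # recursively split a larger block
    free_lists[size].append(start + size)   # second half becomes a free buddy
    return start

block = alloc(128)
print(block, free_lists)  # block 0; buddies at frames 128 and 256 stay free
```

Requesting 128 frames from a single free 512-frame block leaves exactly the state the slide describes: one 256-frame buddy and one 128-frame buddy back on their lists.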
22
Buddy System: Releasing Blocks Attempt to merge pairs of free buddy blocks of size b together into a single block of size 2b. Two blocks considered buddies if – Both blocks have same size b. – They are located in contiguous physical addresses. – The physical address of the first page frame of the first block is a multiple of 2*b*(4K)
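Under these conditions, the buddy of a block starting at frame f with size b (in page frames) is simply the block at frame f XOR b, and the lower block of any buddy pair starts at a multiple of 2*b frames. A sketch of the test, with made-up frame numbers:

```python
# Sketch of the buddy test described above: for block size b (in 4 KB page
# frames), the buddy of the block at frame f is the block at frame f ^ b,
# and the lower block of the pair starts at a multiple of 2*b frames.

def buddy_of(frame, b):
    return frame ^ b

def can_merge(f1, f2, b):
    lower = min(f1, f2)
    return abs(f1 - f2) == b and lower % (2 * b) == 0

print(buddy_of(128, 128))        # 0
print(can_merge(0, 128, 128))    # True: merge into a 256-frame block at 0
print(can_merge(128, 256, 128))  # False: adjacent but not buddies
```

The last case shows why adjacency alone is not enough: frames 128 and 256 are contiguous 128-frame blocks, but merging them would produce a 256-frame block at frame 128, which is not aligned to 256 frames.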
23
LINUX Memory Area Management How to deal with requests for small memory areas and avoid internal fragmentation? The Slab Allocator, based on Solaris 2.4: – To avoid initializing objects repeatedly, the slab allocator does not discard objects that have been allocated and then released, but instead saves them in memory. – Kernel functions tend to request memory of the same type repeatedly (e.g., new process creation). Save the page frames backing the same kind of memory area in a cache and reuse them quickly – Reusing/caching
24
Slab Allocator cont. The slab allocator groups objects into caches – a cache is a store of objects of the same type E.g. when a file is opened, the memory area needed to store the corresponding “open file” object is taken from a slab allocator cache named “filp.” The area of main memory that contains a cache is divided into slabs. – Each slab consists of one or more contiguous page frames that contain both allocated and free objects. The slab allocator never releases the page frames of an empty slab on its own, since it would not know when the free memory is needed elsewhere.
25
Virtual Memory Chapter 8
26
Outline Basic Premise Locality and Virtual Memory Hardware and Control Structures – Paging Page Table Structure Translation Lookaside Buffer Page Size – Segmentation Operating System Software Fetch, Placement, Replacement Policies Resident Set Management Cleaning Policy Load Control
27
Basic Premise Paging and Segmentation give: – All memory references are logical addresses that are dynamically translated to physical addresses at run-time. (Can occupy different parts of main memory at different times.) – A process may be broken up into a number of pieces (pages or segments) that need not be contiguously located in main memory during execution. Using dynamic run-time address translation and page/segment table.
28
And so… If the previous characteristics are present, it is NOT necessary that all of the pages or segments of a process be in main memory during execution. – If the piece (segment or page) that holds the next instruction to be fetched and the piece that holds the next data location to be accessed are in main memory, then at least for a time execution may proceed.
29
How Does it Work? Resident set: the portion of a process in main memory. If the processor encounters a logical address that is not in main memory: – It generates an interrupt indicating a memory access fault. – The OS puts the interrupted process in a blocked state and takes control. – To resume the process, the OS needs to bring into main memory the piece of the process containing the logical address that caused the access fault. – Disk I/O request. – When the I/O interrupt is issued, control returns to the OS, which places the affected process in the Ready state.
30
Questions, Implications Efficient? Implications: – More processes may be maintained in main memory. – A process may be larger than all of main memory. The programmer perceives a much larger, virtual, memory.
31
Locality and Virtual Memory It works because processes typically use only a small part of a program at any time. – Principle of locality: program and data references within a process tend to cluster. – It should be possible to make intelligent guesses about which pieces of a process are needed in the near future, to avoid thrashing.
32
Hardware & Software For VM to work we need: – Hardware support for a paging and/or segmentation scheme. – OS software to manage the movement of pages and/or segments between secondary memory and main memory.
33
Hardware Support: Paging The page table becomes more complex. – A present bit P indicates whether the page is in main memory or not. – If P is set, the entry also includes the frame number of that page. – A modify bit M: have the contents been altered since the page was last loaded into main memory? If M is not set, there is no need to write the page out when it is replaced in the frame it occupies.
34
Page Table Structure The basic required mechanism is translation from (page #, offset) to (frame #, offset) using the page table. – The page table is of variable length, so it can't be held in registers; it must be in main memory. – When a process is running, a register holds the start address of its page table. The page number is used to index it and look up the frame number. – Note that if the VM is large (2^32 bytes) the page table could be large (2^20 entries) and need to be stored in virtual memory as well – Huh? – Some processors (Pentium) make use of a two-level scheme: a page directory in which each entry points to a page table.
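A rough sketch of the two-level walk, assuming the classic x86 split of 10 directory bits, 10 table bits, and 12 offset bits; the directory contents are made up:

```python
# Sketch of a two-level page-table lookup on a 32-bit address, assuming
# the 10/10/12 bit split used by the Pentium. The directory is hypothetical.

def split(vaddr):
    return (vaddr >> 22) & 0x3FF, (vaddr >> 12) & 0x3FF, vaddr & 0xFFF

directory = {0: {1: 7}}   # dir entry 0 -> page table mapping page 1 -> frame 7

def walk(vaddr):
    d, t, off = split(vaddr)
    table = directory[d]    # either level can be absent: a fault
    return table[t] * 4096 + off

print(hex(walk(0x1234)))    # dir 0, table 1, offset 0x234 -> frame 7
```

The two-level scheme answers the "Huh?" above: only the 4 KB directory must always be resident, while the individual page tables can themselves be paged out.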
35
Translation Lookaside Buffer Problem: Each VM reference can cause 2 physical memory accesses: one for the page table entry and one for the desired data. Solution: a high-speed cache for page table entries, the translation lookaside buffer. – Given a virtual address, the processor first examines the TLB. If hit, great. If miss, look up the page table entry; if the page is present, retrieve the frame number and update the TLB. If the page is not present, a memory access fault (page fault) occurs. TLB misses are expensive: random accesses to large data sets (e.g., the Sparc IIe's small TLB caused problems).
36
TLB: Additional Details The TLB contains only some of the page table entries; we cannot index into it based on page number. Therefore each entry must contain the page number as well as the complete page table entry. The processor has hardware that allows it to simultaneously check a number of TLB entries to see if there is a match on the page number. This technique is referred to as associative mapping.
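A toy model of that flow, where a Python dict stands in for the hardware's parallel associative search; the page-table contents are made up:

```python
# Toy model of the TLB check in front of the page-table walk. A dict keyed
# by page number stands in for the parallel associative hardware search;
# on a miss we fall back to the page table and refill the TLB.

page_table = {0: 5, 1: 2, 2: 7}   # hypothetical page -> frame mapping
tlb = {}

def lookup(page):
    if page in tlb:
        return tlb[page], "hit"
    frame = page_table[page]       # full walk; a missing page would fault
    tlb[page] = frame              # refill so the next access hits
    return frame, "miss"

print(lookup(1))  # first access: a miss that fills the TLB
print(lookup(1))  # second access: a hit
```

Locality is what makes this pay off: if references cluster, the handful of entries the TLB can hold cover most accesses and the second memory access is usually avoided.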
37
Page Size Considerations: – Internal fragmentation: the smaller the page size, the less internal fragmentation. (Good) The smaller the page, the greater the number of pages required per process, the larger the page tables, and some portion of the page tables of active processes may not be in main memory. (Bad) – Page size vs. fault rate: Small pages: lots of pages in memory, locality captured, not too many faults. Middle: each page contains references further afield. Large: page size approaches process size. – Contemporary programming techniques used in large programs reduce locality. OO techniques: many small programs and modules. Multithreaded applications.
38
Operating System Software Fetch Policy Replacement Policy Resident Set Management
39
Issues Want to minimize page faults to minimize software overhead. – Deciding which pages to replace – I/O of exchanging pages – Scheduling another process to run during page I/O. Relevant Issues in the Choice of a Policy: – Main memory size – Relative speed of main vs. secondary memory – Size and number of processes competing for resources – Execution behavior of individual programs.
40
Fetch Policy When should a page be brought into main memory? – Demand Paging. Flurry at start. Eventually locality kicks in. – Pre-paging (prefetching)
41
Placement Policy Where in real memory a process piece is to reside. – Pure Segmentation: the placement policy is an important design issue (remember the discussion of best-fit, first-fit, etc.) – Paging or Paging/Segmentation: placement is not a big deal, as the address translation hardware can handle any page-frame combination with equal efficiency.
42
Replacement Policy Deals with selection of page in memory to be replaced when a new page needs to be brought in. Three issues that get lumped together. – How many page frames/process. – Whether page frames considered for replacement should be limited to those of the process that caused the page fault or encompass all the page frames in main memory. – Among the set of pages considered, which particular page should be selected for replacement? Call the first two Resident Set Management; third is Replacement Policy.
43
Replacement Policy Frame Locking: Some frames might be locked (kernel, OS). Basic Algorithms: – Optimal – Least Recently Used (LRU) – FIFO – Clock
44
Optimal Algorithm Select for replacement the page for which the time to next access is longest. This results in the fewest page faults. Not implementable (requires clairvoyance). Running Example: 3 frames, reference string 2 3 2 1 5 2 4 5 3 2 5 2
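A small simulation of the optimal policy on the running example above (3 frames, reference string 2 3 2 1 5 2 4 5 3 2 5 2); here the fault count includes the three cold-start loads:

```python
# Sketch of the optimal (clairvoyant) policy: on a fault with full frames,
# evict the resident page whose next use lies farthest in the future.

def opt_faults(refs, nframes):
    frames, faults = [], 0
    for i, page in enumerate(refs):
        if page in frames:
            continue
        faults += 1
        if len(frames) < nframes:
            frames.append(page)
            continue
        future = refs[i + 1:]
        victim = max(frames,
                     key=lambda p: future.index(p) if p in future
                                   else len(future) + 1)
        frames[frames.index(victim)] = page
    return faults

# 3 replacement faults plus the 3 cold-start loads -> 6
print(opt_faults([2, 3, 2, 1, 5, 2, 4, 5, 3, 2, 5, 2], 3))
```

No realizable policy can beat this count on the same reference string, which is why the optimal algorithm serves as the benchmark for LRU, FIFO, and Clock below.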
45
LRU Replace the page not used for the longest time Principle of locality: least likely to be used in the future Performs pretty well Hard to implement! (sort of) Not always good …
46
FIFO Simple to implement Get rid of page in memory the longest Reasoning will often be wrong Exception: repeated scans!
47
Clock Policy Try to emulate LRU. Associate a “use bit” with each frame. – When a page is first loaded into a frame in memory, the use bit for that frame is set to 1. – When the page is subsequently referenced, the bit is set to 1 again. – The set of frames that are candidates for replacement is treated as a circular buffer with a pointer. – When a page is replaced, the pointer is placed on the next frame. – When it is time to replace a frame, scan for a frame with use bit 0. Each time a use bit 1 is encountered, set it to 0. – Like FIFO, except frames with use bit 1 are skipped.
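The steps above can be sketched as a small simulation; the class and the page numbers fed to it are illustrative:

```python
# Sketch of the clock policy: frames form a circular buffer, each with a
# use bit; on replacement, sweep from the pointer, clearing use bits,
# until a frame with use bit 0 is found.

class Clock:
    def __init__(self, nframes):
        self.frames = [None] * nframes   # each slot: (page, use_bit) or None
        self.hand = 0

    def access(self, page):
        for i, slot in enumerate(self.frames):
            if slot and slot[0] == page:
                self.frames[i] = (page, 1)   # referenced: set use bit
                return "hit"
        while True:                          # fault: sweep for use bit 0
            slot = self.frames[self.hand]
            if slot is None or slot[1] == 0:
                self.frames[self.hand] = (page, 1)
                self.hand = (self.hand + 1) % len(self.frames)
                return "fault"
            # use bit was 1: give the page a second chance and move on
            self.frames[self.hand] = (slot[0], 0)
            self.hand = (self.hand + 1) % len(self.frames)

c = Clock(3)
print([c.access(p) for p in [2, 3, 2, 1, 5]])
# ['fault', 'fault', 'hit', 'fault', 'fault']
```

On the last access, the sweep clears the use bits of pages 2, 3, and 1 before circling back to evict page 2, whose bit was cleared first, which is the FIFO-with-second-chance behavior the slide describes.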
48
Resident Set Management Resident Set Size: How much main memory to give to a particular process? Replacement Scope: What set of potential replacements do you choose from?
49
Resident Set Size Factors: – The smaller the assigned memory, the more processes in main memory. Increases probability that OS will find at least one ready process and avoid swapping – If a relatively small number of pages of a process are in main memory, then rate of page faults will be high – Beyond a certain size adding more memory not that useful
50
Resident Set Size II Two approaches: – Fixed allocation determined at initial load time – Variable Allocation: varies over lifetime of a process. Give more frames to processes that are faulting a lot.
51
Replacement Scope Local replacement policy: choose among resident pages of process that generated the fault. Global replacement policy: consider all unlocked pages in main memory as candidates to replace.
52
Possible Combinations Fixed Allocation, Local Scope – Drawback: If allocations too large or too small no good way to recover. Too small: lots of page faults Too large: processor idle time or lots of swapping Variable Allocation, Global Scope – Easiest to implement, widely adopted. – Processes that fault a lot should get helped out. – Hard to get a good replacement policy – not easy to figure out which process is best to choose from.
53
Combinations Variable Allocation, Local Scope – Try to overcome the problems of a global-scope strategy. – From time to time, reevaluate the allocation to each process. – The question: How do you determine the resident set size for each process, and how do you time the changes?