Presentation is loading. Please wait.

Presentation is loading. Please wait.

File Management Chapter 12. Files and File systems File system provides the resource abstractions typically associated with secondary storage. It permit.

Similar presentations


Presentation on theme: "File Management Chapter 12. Files and File systems File system provides the resource abstractions typically associated with secondary storage. It permit."— Presentation transcript:

1 File Management Chapter 12

2 Files and File systems File system provides the resource abstractions typically associated with secondary storage. It permit user to create data collections, called files with the following properties: Long-term existence Files are stored on disk/other secondary storage do not disappear when a user logs off. Sharable between processes Files have names and can have associated access permissions that permit controlled sharing

3 Files and File systems Structure file can have internal structure that convenient for a particular applications. It can be organized in hierararchical structure. A collection of functions that can be performed on files: Create New file is define and positioned within the structure of files Delete A file is removed from the file structure and destroyed

4 Open An existing file is declared to be “opened” by a process, allowing the process to perform function on the file Close The file is closed with to respect to a process, process no longer may perform functions on file Read a process reads all /portion of data in a file Write Process updates a file, add new data /changing values. Files and File systems

5 File structure Terms are commonly used when discussing about files: Field Basic element of data An individual field contains single value, e.g. employee‘s name It’s characterized by its length and data type Can be fixed or variable length depending on file design Can contains subfields

6 File structure Record Collection of related fields Can be treated as a unit by some application program. Exp: employee record have fields such as name, social sec number, date hired etc… Can be fixed/variable length File A collection of similar records Treated as s single entity by users and applications and may be referenced by name May be created and deleted Applying access control

7 File structure Database Collection of related data. Essential aspects of database are that the relationships that exist among elements of data are explicit and the database is designed for use by number of different applications. May contain all of the info related to an organization. Consists one/more types of files

8 File structure Operations that must be supported when to use files: Retrieve_All Retrieve all the record of a file. Required for an application that must process all of the info in the file at one time. E.g.: application that produces a summary of the info in the file This operation is often equated with the term sequential processing –since all records are access in sequence.O

9 File structure Retrieve_One: Just retrieve one record E.g.: interactive transaction-oriented applications need this operation. Retrieve_Next Retrieve the record that is “next” in some logical sequence to the most recently retrieved record. E.g.: interactive application like filling in forms, performing a search operation.

10 Retrieve_Previous Record that is “previous” to the currently accessed record is retrieved. Insert_One Insert new record into the file. Delete_One Delete an existing record. Update_one Retrieve a record, update one/more of its field and rewrite the updated record back into the file. File structure

11 Retrieve_Few Retrieve a number of record. The nature of the operations that are most commonly performed on a file will influence the way the file is organized. File structure

12 File Management Systems (FMS)  set of system software that provides services to users an applications in the use of files. Users/application may access files in through the FMS. Objectives: To meet the data mgmt needs and requirements of the user, which include storage of data and the ability to perform the operation required. To guarantee, to the extend possible, that the data in the file are valid.

13 To optimize performance, both from the system point of view in terms of overall throughput and from user’s point of view in term of response time. To provide I/O support for a variety of storage device types. To minimize/eliminate the potential for lost /destroyed data To provide a standardize set of I/O interface routines to use processes To provide I/O support for multiple users. File Management Systems (FMS)

14 For objective 1: meeting user requirement Requirements depends on the variety of applications and the environment in which the computer system will be used. For an interactive general-purpose system, the following constitute a minimal set of requirements: each user should be able to create, delete, read,write,modify files. Each user may have controlled access to other users’s files File Management Systems (FMS)

15 Each user may control what types of accesses are allowed to the user’s files Each user should be able to restructure the user’s files in a form appropriate to the problem. Each user should be able to move data between files Each user should be able to back up and recover the user’s files in case of damage Each user should be able to access the user’s files by using symbolic names File Management Systems (FMS)

16 Need to look at software organization in order to understand file mgmt. Figure 12.1 show the File system software architecture. Lowest level: device drivers communicate directly with peripheral devices Device driver responsible for starting I/O operations on a device and processing the completion of an I/O request. Exp: disk and tape. Part of OS. File System Architecture

17

18 Basic file system/physical I/O: Primary interface with the environment outside of the computer system. It deals with blocks of data that are exchanged with disk/tape Concerns with the placement of those blocks on the 2 nd storage And on the buffering in main memory Part of OS File System Architecture

19 Basic I/O supervisor Responsible for all file I/O initiation and termination Control structures are maintained that deals with device I/O, scheduling and file status Part of OS File System Architecture

20 Logical I/O Enables users and applications to access records Deals with file records. Provides a general-purpose record I/O capability and maintained basic data about files. Access method Level that closest to the user Provide standard interface between application and the file system and devices that hold the data Different access methods reflect different file structures and way of accessing and processing the data File System Architecture

21 File Management Functions Another way of viewing the functions of a file system is shown in Figure 12.2 User and application program interact with the file system by means of commands for creating and deleting files and performing operations on files. Before performing any operation the file system identify and locate a selected file Use a directory to describe the location of all files plus their attributes

22 File Management Functions On a shared system enforce user access control Only authorized users are allowed to access files. Basic operations may perform on a file are performed at record level Files are viewed as some structure that organizes the record Sequential structure – employee name stored alphabetically by last name Thus, to translate user commands into specific file manipulation commands, the access method appropriate to this file structure must be employed.

23 File Management Functions I/O is done on block basis. The records of a file must be blocked for output and unblocked after input. To support block I/O files: Secondary storage must be managed Allocating files to free blocks Managing free storage for available blocks.

24

25 File Organization and access File organization refer to the logical structuring of the records as determined by the way in which they are accessed. Criteria need to look when choosing a file organization: Short access time Ease of update Economy of storage Simple maintenance Reliability

26 Continue.. Focus on 5 organizations : The pile The sequential file The indexed sequential file The indexed file The direct/hashed file

27 The pile Least complicated Data are collected in the order in which they arrive Each record consists of one burst of data Purpose: simply to accumulate the mass of data and save it. Records may have different fields/similar fields in different order Each field should be self-describing, filed name as well as value The length of the field must be implicitly indicated by delimiters No structure to the pile record, record access is by exhaustive search.

28 The pile… cont i.e: need to find record that contains a particular field with a particular value, necessary to examine each record in the pile until found/not found. Pile files are encountered when data are collected and stored prior to processing/when data not easy to organize Uses space well when the stored data vary in size and structure Perfectly adequate for exhaustive searches, easy to update Not suit for most applications.

29 Pile

30 The sequential file Most common A fixed format is used for records All records are of the same length, consisting of the same number of fixed-length fields in a particular order First field in each record is referred as key field. The key field uniquely identifies the record Usually used for batch application Easily stored on tape/disk For interactive application that involve queries- poor performance

31 The sequential file… cont The Sequential File New records are placed in a log file or transaction file Batch update is performed to merge the log file with the master file

32 Sequential File

33 The Indexed Sequential File Maintains the key characteristic of the sequential file Records are organized in sequence based on key field. Add two features: An index to the file to support random access An overflow file Index provides lookup capability to reach quickly Overflow similar to log file used with sequential file but is integrated so that record in the overflow file is located by following a pointer from its predecessor record.

34 The Indexed Sequential File… cont Comparison of sequential and indexed sequential Example: a file contains 1 million records On average 500,00 accesses are required to find a record in a sequential file If an index contains 1000 entries, it will take on average 500 accesses to find the key, followed by 500 accesses in the main file. Now on average it is 1000 accesses

35 Indexed Sequential File

36 Uses multiple indexes for different key fields May contain an exhaustive index that contains one entry for every record in the main file May contain a partial index – contains entries to records where the field of interest exists. When new record is added to main file, all of the index files must be updated. Used in applications where timeliness of info is critical i.e airline reservation system, inventory control system. Indexed File

37

38 Directly access a block at a known address Key field required for each record Make use of hashing function on the key value. Often used when very rapid access is required, where fixed length length record sre used and where records are always accessed one at a time. i.e directories, pricing The Direct or Hashed File

39

40 File Directories Contains information about files Attributes Location Ownership Directory itself is a file owned by the operating system Provides mapping between file names and the files themselves

41

42

43

44 Simple Structure for a Directory List of entries, one for each file Sequential file with the name of the file serving as the key Provides no help in organizing the files Forces user to be careful not to use the same name for two different files

45 Two-level Scheme for a Directory One directory for each user and a master directory Master directory contains entry for each user Provides address and access control information Each user directory is a simple list of files for that user Still provides no help in structuring collections of files

46 Hierarchical, or Tree-Structured Directory Master directory with user directories underneath it Each user directory may have subdirectories and files as entries

47

48

49 Hierarchical, or Tree-Structured Directory Files can be located by following a path from the root, or master, directory down various branches This is the pathname for the file Can have several files with the same file name as long as they have unique path names

50 Hierarchical, or Tree-Structured Directory Current directory is the working directory Files are referenced relative to the working directory

51 File Sharing In multiuser system, allow files to be shared among users Two issues Access rights Management of simultaneous access

52 Access Rights-exp on access right None User may not know of the existence of the file User is not allowed to read the user directory that includes the file Knowledge User can only determine that the file exists and who its owner is

53 Access Rights Execution The user can load and execute a program but cannot copy it Reading The user can read the file for any purpose, including copying and execution Appending The user can add data to the file but cannot modify or delete any of the file’s contents

54 Access Rights Updating The user can modify, deleted, and add to the file’s data. This includes creating the file, rewriting it, and removing all or part of the data Changing protection User can change access rights granted to other users Deletion User can delete the file

55 Access Rights Owners Has all rights previously listed May grant rights to others using the following classes of users Specific user User groups All for public files

56 Simultaneous Access User may lock entire file when it is to be updated User may lock the individual records during the update Mutual exclusion and deadlock are issues for shared access

57 Record Blocking Records are the logical unit of access of a structured file Blocks are the unit of I/O with secondary storage. For I/O to be performed, records must be organized as blocks. Several issues to be consider : Should blocks be fixed or variable length? Most system, blocks are of fixed length What should the relative size of a block to be compared to the average record size? Larger the block, more record can be passed to I/O, with sequentially processed it’s an advantage. With random access-result in unnecessary transfer of unused records.

58 Record Blocking… cont Given the size of the block, there are 3 methods of blocking that can be used Fixed blocking Variable-length spanned blocking Variable –length unspanned blocking

59 Record Blocking… cont Fixed blocking Fixed-length records are used An integral number of records is stored in a block There may be unused space at the end of each block –internal fragmentation

60 Fixed Blocking

61 Record Blocking… cont Variable-length spanned blocking Variable –length records are used Packs into blocks with no unused space Some records must span two blocks, with the continuation indicated by a pointer to the successor block

62 Variable Blocking: Spanned

63 Record Blocking… cont Variable –length unspanned blocking Variable-length records are used, not employed spanning There is waste space in most block because of the inability to use the remainder of a block if the next record is larger than the remaining unused space.

64 Variable Blocking Unspanned

65 Secondary Storage Management Space must be allocated to files Must keep track of the space available for allocation

66 Preallocation vs Dynamic Allocation Preallocation Need the maximum size for the file at the time of creation Difficult to reliably estimate the maximum potential size of the file Tend to overestimated file size so as not to run out of space Dynamic allocation Allocates space to a file in portions as needed

67 Portion size Size of the portion allocated to a file Contiguity of space increases performance Large number of small portion increases the size of tables needed to manage the allocation info Fixed sized portion simplifies the reallocation of space Variable size/small fixed size portion minimizes waste of unused storage due to over allocation

68 Portion size… cont Alternatives: variable,large contiguous portion=better performance Variable size avoid waste, file allocation tables are small Space is hard to reuse Blocks= small fixed portion provide greater flexibility

69 Methods of File Allocation Contiguous allocation Single set of blocks is allocated to a file at the time of creation Only a single entry in the file allocation table Starting block and length of the file External fragmentation will occur Need to perform compaction

70

71

72 Methods of File Allocation Chained allocation Allocation on basis of individual block Each block contains a pointer to the next block in the chain Only single entry in the file allocation table Starting block and length of file No external fragmentation Best for sequential files No accommodation of the principle of locality

73

74

75 Methods of File Allocation Indexed allocation File allocation table contains a separate one- level index for each file The index has one entry for each portion allocated to the file The file allocation table contains block number for the index

76

77


Download ppt "File Management Chapter 12. Files and File systems File system provides the resource abstractions typically associated with secondary storage. It permit."

Similar presentations


Ads by Google