Download presentation
Presentation is loading. Please wait.
Published byBrooke Peters Modified over 8 years ago
1
Parallel IO for Cluster Computing Tran, Van Hoai
2
Why data is concerned? Ultimate performance of the machine depends heavily on the quality of input/output (I/O) operations People often say – let us assume that the data has been pre-loaded in the various processors – suppose that data can be sent to the processors in linear time – or other false premises
3
Technology development Moore's law: the doubling of transistors on a chip every 18 months or so Real disk storage densities progress at a rate of 60 to 80% per year Disk access time improvement has been less than 10% per year Typical disk drive is 10 5 times slower in performing a random access than is achieved in the main memory of a computer
4
Amdahl's law Sequential IO operations decrease overall performance
5
Main topics in Parallel IO File Systems and Parallel I/O for Clusters Data Distribution and Load Balancing in the Presence of I/O Operations Novel Hardware and Software I/O Architectures Parallel Disk Models and Algorithms Parallel I/O Support for Databases I/O Performance Analysis: Resources and Tools for Benchmarking Drivers and Application Programming Interfaces Advances in Storage Technology Tools for Operating and Managing I/O Operations Compilers Techniques for High Performance I/O Operations Language and Runtime Libraries Network Attached Storage and Storage Area Network Standards and Industrial Experiences with Massive Data Sets
6
Driver level Direct readings/writing in disk transfer
7
Driver level (2) Activity of buses with two concurrent readings
8
Driver level (3) Re-write SCSI driver of Linux to avoid one copy towards memory
9
Direct access vs normal access 25% throughput improved
10
Parallel File System A parallel file system transparently stripes data across multiple disks and I/O nodes It provides a global name space which results in simplified file management and flexible access to files
11
Parallel Virtual File System (PVFS) User-level system utilizing TCP and the existing file system on each I/O node File data is striped across I/O nodes according to user specification Client-server – Client: a daemon that controls reading and writing for that node. – Single manager: a daemon stores meta-data and controls file operations
12
Parallel Virtual File System
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.