Download presentation
Presentation is loading. Please wait.
1
1 The Case for Versatile Storage System NetSysLab The University of British Columbia Samer Al-Kiswany, Abdullah Gharaibeh, Matei Ripeanu
2
2 Introduction HotStorage ‘09 Versatile Storage System for large-scale platforms: Underutilized resources Application specialization The Deployment Approach: Configured at deployment time Coupled with the target application Potential: Higher performance and scalability
3
3 Platform Example – Argonne Blue Gene/P 160K cores 10 Gb/s Switch Complex GPFS 24 servers IO rate : 8GBps= 51KBps / core !! HotStorage ‘09 2.5K IO Nodes Torus Network 2.5 GBps per node 3D Torus 850 MBps per 64 nodes Tree Under utilized resources.
4
4 Workload Characteristics HotStorage ‘09 Workflows – Execution stages communicating through intermediate temporary files Source [Zhao et. al. SIGMOD record ‘05] Input file Output file Compute
5
5 Workload Characteristics HotStorage ‘09 Workflows – Execution stages communicating through intermediate temporary files Tibi Stef-Praun, et. al. [ e-Social Science ‘07 ]
6
6 Workload Characteristics Workflows – Execution stages communicating through intermediate temporary files HotStorage ‘09 AxesOptimizations Data life time (temporary ) Application informed caching Read (Seq. )Read-ahead Write (Seq. ) Asynch. write Consistency (no )Relaxed Consistency Workflows
7
7 Workload Characteristics Data Analysis – Analyze/search large data sets (e.g. BLAST) HotStorage ‘09 BLAST Match new sequences with a data set of known sequences (linear search) AxesOptimizations Data life time (temporary ) Application informed caching Read (Seq. )Read-ahead Write (Seq. ) Asynch. write Consistency (no ) Relaxed Consistency Locality Caching Workflows – Data Analysis
8
8 Workload Characteristics Checkpointing HotStorage ‘09 AxesOptimizations Data life time (temporary ) Application informed caching Read (Seq. )Read-ahead Write (Seq. ) Asynch. write Consistency (no ) Relaxed Consistency Locality Caching Compressibility Similarity detection Workflows Data Analysis Checkpointing
9
9 Workload Characteristics HotStorage ‘09 Workflows Data Analysis Checkpointing AxesOptimizations Data life time (temporary ) Application informed caching Read (Seq. )Read-ahead Write (Seq. ) Asynch. write Consistency (no ) Relaxed Consistency Locality Caching Compressibility Similarity detection Security Tunable sec. levels
10
10 Opportunities Specialization: Application specialized storage Under utilized resources Compute node storage space Interconnect bandwidth HotStorage ‘09
11
11 Our Solution Versatile Storage System: Application specialized The Deployment Approach: Configured at deployment time Life time coupled with the target application Potential : Higher performance and scalability HotStorage ‘09
12
12 Versatile Storage System Architecture Manager (Metadata management) HotStorage ‘09 Access Module Storage Node Compute Node
13
13 Configurable / Extensible IO Pipeline HotStorage ‘09 Application IO Queue Dispatcher Buffer Manag. … Consistency Metadata Operations Content Addressability Data Security Communication Agent Application IO Queue Dispatcher Buffer Manag. Metadata Operations Access Module Storage Node
14
14 Configurable / Extensible IO Pipeline HotStorage ‘09 Application IO Queue Dispatcher Buffer Manag. … Consistency Metadata Operations Content Addressability Data Security Communication Agent Dispatcher … Consistency Content Addressability Data Security Communication Agent Access Module Storage Node
15
15 Configurable / Extensible Support HotStorage ‘09 Metadata Service API Dispatcher Request New Module Support … Application IO Queue Dispatcher Buffer Manag. … Metadata Operations NM Communication Agent Access Module Storage Node Manager Access Module Header Request data
16
16 Preliminary Evaluation – Real Application HotStorage ‘09 DOCK6 workflow: Overall: 1.52x Stages Read input, compute, and write temporary results Summarize, sort, and select Archive Versatile Storage Optimizations Cache the input data Cache temporary files Asynch. flush results to GPFS Results (8K processors) 1.06x 11.76x 1.51x
17
17 Summary HotStorage ‘09 Versatile Storage System Underutilized resources Application specialization The Deployment Approach: Configured at deployment time Coupled with the target application Potential: Higher performance and scalability
18
18 Not addressed – Future work HotStorage ‘09 Configurability / extensibility evaluation Complete prototype Evaluation with a diverse set of applications Configuration Application profiling File system automated configuration
19
19 Thank you netsyslab.ece.ubc.ca
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.