Energy Efficient Prefetching – from models to Implementation 6/19/2015 1 Adam Manzanares and Xiao Qin Department of Computer Science and Software Engineering.

Slides:



Advertisements
Similar presentations
Reducing Energy Consumption of Disk Storage Using Power Aware Cache Management Qingbo Zhu, Francis M. David, Christo F. Deveraj, Zhenmin Li, Yuanyuan Zhou.
Advertisements

Conserving Disk Energy in Network Servers ACM 17th annual international conference on Supercomputing Presented by Hsu Hao Chen.
Energy Efficiency through Burstiness Athanasios E. Papathanasiou and Michael L. Scott University of Rochester, Computer Science Department Rochester, NY.
Managing Wire Delay in Large CMP Caches Bradford M. Beckmann David A. Wood Multifacet Project University of Wisconsin-Madison MICRO /8/04.
Daniel Schall, Volker Höfner, Prof. Dr. Theo Härder TU Kaiserslautern.
1 Storage-Aware Caching: Revisiting Caching for Heterogeneous Systems Brian Forney Andrea Arpaci-Dusseau Remzi Arpaci-Dusseau Wisconsin Network Disks University.
1 Conserving Energy in RAID Systems with Conventional Disks Dong Li, Jun Wang Dept. of Computer Science & Engineering University of Nebraska-Lincoln Peter.
1 Web Server Performance in a WAN Environment Vincent W. Freeh Computer Science North Carolina State Vsevolod V. Panteleenko Computer Science & Engineering.
RIMAC: Redundancy-based hierarchical I/O cache architecture for energy-efficient, high- performance storage systems Xiaoyu Yao and Jun Wang Computer Architecture.
What will my performance be? Resource Advisor for DB admins Dushyanth Narayanan, Paul Barham Microsoft Research, Cambridge Eno Thereska, Anastassia Ailamaki.
Embedded Network Controller with Web Interface Bradley University Department of Electrical & Computer Engineering By: Ed Siok Advisor: Dr. Malinowski.
Micro-Pages: Increasing DRAM Efficiency with Locality-Aware Data Placement Kshitij Sudan, Niladrish Chatterjee, David Nellans, Manu Awasthi, Rajeev Balasubramonian,
MobiDesk: Mobile Virtual Desktop Computing Ricardo A. Baratto, Shaya Potter, Gong Su, Jason Nieh Network Computing Laboratory Columbia University September.
Reducing the Energy Usage of Office Applications Jason Flinn M. Satyanarayanan Carnegie Mellon University Eyal de Lara Dan S. Wallach Willy Zwaenepoel.
Energy Efficient Prefetching with Buffer Disks for Cluster File Systems 6/19/ Adam Manzanares and Xiao Qin Department of Computer Science and Software.
Ziliang Zong, Adam Manzanares, and Xiao Qin Department of Computer Science and Software Engineering Auburn University Energy Efficient Scheduling for High-Performance.
A Server-less Architecture for Building Scalable, Reliable, and Cost-Effective Video-on-demand Systems Presented by: Raymond Leung Wai Tak Supervisor:
Gordon: Using Flash Memory to Build Fast, Power-efficient Clusters for Data-intensive Applications A. Caulfield, L. Grupp, S. Swanson, UCSD, ASPLOS’09.
Scalable Server Load Balancing Inside Data Centers Dana Butnariu Princeton University Computer Science Department July – September 2010 Joint work with.
CS246 Search Engine Scale. Junghoo "John" Cho (UCLA Computer Science) 2 High-Level Architecture  Major modules for a search engine? 1. Crawler  Page.
RAID-x: A New Distributed Disk Array for I/O-Centric Cluster Computing Kai Hwang, Hai Jin, and Roy Ho.
Analyzing the Energy Efficiency of a Database Server Hanskamal Patel SE 521.
1 Exploring Data Reliability Tradeoffs in Replicated Storage Systems NetSysLab The University of British Columbia Abdullah Gharaibeh Advisor: Professor.
Reliability Analysis of An Energy-Aware RAID System Shu Yin Xiao Qin Auburn University.
Abstract Load balancing in the cloud computing environment has an important impact on the performance. Good load balancing makes cloud computing more.
Performance Evaluation of Hybrid MPI/OpenMP Implementation of a Lattice Boltzmann Application on Multicore Systems Department of Computer Science and Engineering,
PARAID: The Gear-Shifting Power-Aware RAID Charles Weddle, Mathew Oldham, An-I Andy Wang – Florida State University Peter Reiher – University of California,
Dynamic Resource Allocation Using Virtual Machines for Cloud Computing Environment.
TRACK-ALIGNED EXTENTS: MATCHING ACCESS PATTERNS TO DISK DRIVE CHARACTERISTICS J. Schindler J.-L.Griffin C. R. Lumb G. R. Ganger Carnegie Mellon University.
OPTIMAL SERVER PROVISIONING AND FREQUENCY ADJUSTMENT IN SERVER CLUSTERS Presented by: Xinying Zheng 09/13/ XINYING ZHENG, YU CAI MICHIGAN TECHNOLOGICAL.
Test Of Distributed Data Quality Monitoring Of CMS Tracker Dataset H->ZZ->2e2mu with PileUp - 10,000 events ( ~ 50,000 hits for events) The monitoring.
November , 2009SERVICE COMPUTATION 2009 Analysis of Energy Efficiency in Clouds H. AbdelSalamK. Maly R. MukkamalaM. Zubair Department.
The Center for Autonomic Computing is supported by the National Science Foundation under Grant No NSF CAC Seminannual Meeting, October 5 & 6,
Improving Network I/O Virtualization for Cloud Computing.
The Red Storm High Performance Computer March 19, 2008 Sue Kelly Sandia National Laboratories Abstract: Sandia National.
Building a Parallel File System Simulator E Molina-Estolano, C Maltzahn, etc. UCSC Lab, UC Santa Cruz. Published in Journal of Physics, 2009.
1 Distributed Energy-Efficient Scheduling for Data-Intensive Applications with Deadline Constraints on Data Grids Cong Liu and Xiao Qin Auburn University.
Eneryg Efficiency for MapReduce Workloads: An Indepth Study Boliang Feng Renmin University of China Dec 19.
PARAID: A Gear-Shifting Power-Aware RAID Charles Weddle, Mathew Oldham, Jin Qian, An-I Andy Wang – Florida St. University Peter Reiher – University of.
Increasing Web Server Throughput with Network Interface Data Caching October 9, 2002 Hyong-youb Kim, Vijay S. Pai, and Scott Rixner Rice Computer Architecture.
PRoPHET+: An Adaptive PRoPHET- Based Routing Protocol for Opportunistic Network Ting-Kai Huang, Chia-Keng Lee and Ling-Jyh Chen.
Data Replication and Power Consumption in Data Grids Susan V. Vrbsky, Ming Lei, Karl Smith and Jeff Byrd Department of Computer Science The University.
Latency Reduction Techniques for Remote Memory Access in ANEMONE Mark Lewandowski Department of Computer Science Florida State University.
11 CLUSTERING AND AVAILABILITY Chapter 11. Chapter 11: CLUSTERING AND AVAILABILITY2 OVERVIEW  Describe the clustering capabilities of Microsoft Windows.
Improving Disk Throughput in Data-Intensive Servers Enrique V. Carrera and Ricardo Bianchini Department of Computer Science Rutgers University.
June 30 - July 2, 2009AIMS 2009 Towards Energy Efficient Change Management in A Cloud Computing Environment: A Pro-Active Approach H. AbdelSalamK. Maly.
Adaptive Sleep Scheduling for Energy-efficient Movement-predicted Wireless Communication David K. Y. Yau Purdue University Department of Computer Science.
A Measurement Based Memory Performance Evaluation of Streaming Media Servers Garba Isa Yau and Abdul Waheed Department of Computer Engineering King Fahd.
Module 9 Planning and Implementing Monitoring and Maintenance.
Accounting for Load Variation in Energy-Efficient Data Centers
Università di Perugia Enabling Grids for E-sciencE Status of and requirements for Computational Chemistry NA4 – SA1 Meeting – 6 th April.
Ensieea Rizwani An energy-efficient management mechanism for large-scale server clusters By: Zhenghua Xue, Dong, Ma, Fan, Mei 1.
Improving the Reliability of Commodity Operating Systems Michael M. Swift, Brian N. Bershad, Henry M. Levy Presented by Ya-Yun Lo EECS 582 – W161.
GPFS: A Shared-Disk File System for Large Computing Clusters Frank Schmuck & Roger Haskin IBM Almaden Research Center.
Tackling I/O Issues 1 David Race 16 March 2010.
RobuSTore: Performance Isolation for Distributed Storage and Parallel Disk Arrays Justin Burke, Huaxia Xia, and Andrew A. Chien Department of Computer.
Load Rebalancing for Distributed File Systems in Clouds.
Capacity Planning in a Virtual Environment Chris Chesley, Sr. Systems Engineer
COMP7500 Advanced Operating Systems I/O-Aware Load Balancing Techniques Dr. Xiao Qin Auburn University
Input and Output Optimization in Linux for Appropriate Resource Allocation and Management James Avery King.
Optimizing Distributed Actor Systems for Dynamic Interactive Services
PA an Coordinated Memory Caching for Parallel Jobs
Auburn University COMP7500 Advanced Operating Systems I/O-Aware Load Balancing Techniques (2) Dr. Xiao Qin Auburn University.
Overview Introduction VPS Understanding VPS Architecture
Part V Memory System Design
Zhen Xiao, Qi Chen, and Haipeng Luo May 2013
Qingbo Zhu, Asim Shankar and Yuanyuan Zhou
CS246 Search Engine Scale.
Energy-Efficient Storage Systems
CS246: Search-Engine Scale
Presentation transcript:

Energy Efficient Prefetching – from models to Implementation 6/19/ Adam Manzanares and Xiao Qin Department of Computer Science and Software Engineering Auburn University

Adam Manzanares Ph.D. May 2010.

About me Ph.D.’04, U. of Nebraska-Lincoln 04-07, New Mexico Tech 07-10, Auburn University

About My Research Group

Presentation Outline Motivation Modeling Work DiskSim Modifications Energy Efficient Virtual File System (EEVFS) Parallel Striping Groups in EEVFS Conclusion 6/19/2015 5

Motivation EPA Report to Congress on Server and Data Center Energy Efficiency, /19/2015 6

Motivation  Using 2010 Historical Trends Scenario ◦ Server and Data Centers Consume 110 Billion kWh per year ◦ Assume average commercial end user is charged 9.46 kWh ◦ Disk systems can account for 27% of the energy cost of data centers 6/19/2015 7

Buffer Disk Architecture RAM Buffer m buffer disks n data disks Buffer Disk Controller Data Partitioning Security Model Load Balancing Power Management Prefetching Disk Requests Energy-Related Reliability Model 6/19/2015 8

IBM Ultrastar 36Z15 6/19/ Transfer Rate55 MB/sSpin Down Time: T D 1.5 s Active Power: P A 13.5 WSpin Up Time: T U 10.9 s Idle Power: P I 10.2 WSpin Down Energy: E D 13 J Standby Power: P A 2.5 WSpin Up Energy: E U 135 J Break-Even Time: T BE 15.2 S

Prefetching Disk 1 Disk 2 Disk 3 Buffer Disk 6/19/

Why Modeling & Simulation Allows us to determine the potential of our research ideas Can quickly evaluate many simulation parameters Allows us to test architectures and hardware without having the physical resources 6/19/

Modeling & Simulation Work  Developed Mathematical Model ◦ Disk Energy Consumption ◦ Conditions to prefetch  Developed Energy Saving Principles ◦ Investigated cases that exploit the energy saving principles  Implemented model in JAVA based simulator 6/19/

Energy Saving Principles  Energy Saving Principle One ◦ Increase the length and number of idle periods larger than the disk break-even time T BE  Energy Saving Principle Two ◦ Reduce the number of power-state transitions 6/19/

Paramaters Tested ParameterValues Data Size1, 5, 10, 25 MB # of Data Disks4, 8, 12 Inter-arrival Delay0, 0.1, 0.5, 1 S Hit Rate85, 90, 95, 100% 6/19/

Energy Savings Hit Rate 85% 6/19/

State Transitions 6/19/

Parameter Generalizations Larger data sizes produce greater energy savings and less state transitions Increasing the inter-arrival delay increases energy savings More data disks per buffer disks increases energy efficiency High hit rates produce the greatest energy efficiency 6/19/

Modeling & Sim. Summary  Hit Rate, Inter-arrival Delay, & Data Size combine to produce Idle Windows  Transitions important to reduce energy consumption ◦ May increase/decrease to reduce energy consumption  Disk parameters have large impact on energy savings  Model and simulator developed in-house 6/19/

DiskSim Event driven simulator developed at CMU Simulates disks at the block level The simulator has been validated Discrete event based simulator Provides a large amount of statistics Lacks Disk Power Models Ability to simulate large storage systems 6/19/

File System Simulator Large files important to energy savings Popularity of data is also useful Developed a block to file translator Interacts with DiskSim 6/19/

DiskSim with File System Simulator 6/19/

Modified DiskSim Results 6/19/

Modified DiskSim Summary Provides us with accurate disk statistics Only the changes to DiskSim need to be validated Heavily dependent upon disk parameters May miss details that can only be found in implementation 6/19/

Why a Cluster File System Block level prefetching difficult Natural place to track file accesses Control placement of data among storage nodes, and data disks Tiered approach simplifies management of files and disk states Eliminates some shortcomings of modeling and simulation 6/19/

Energy Efficient Virtual File System 6/19/

EEVFS Process Flow 6/19/

EEVFS Testbed ParameterStorage ServerStorage Node Type 1 Storage Node Type 2 CPUP4 2.0 GHzP4 3.2 GHzP4 2.4 GHz Memory (MB) Network Interconnect Disk TypeSATAATA/133 Disk Capacity120 GB80 GB Disk Bandwidth100 MB/s58 MB/s34 MB/s 6/19/

Energy Savings 6/19/

State Transitions 6/19/

Response Times 6/19/

Berkeley Web Trace 6/19/

EEVFS Summary Knowledge of requests assumed and may be hard to come by Performance tied to one of the buffer disks 6/19/

Parallel Striping Groups Disk 1 Disk 2 Group 1 Buffer Disk Storage Node 1 Disk 3 Disk 4 Buffer Disk Storage Node 2 Disk 5 Disk 6 Group 2 Buffer Disk Storage Node 3 Disk 7 Disk 8 Buffer Disk Storage Node 4 File 1File 2File 3File 4 6/19/

Striping Within a Group Disk 1 Disk 2 Group 1 Buffer Disk Storage Node 1 Disk 3 Disk 4 Buffer Disk Storage Node File 1 File /19/

Striping Within a Group Number of disks in a group can be matched to nearest bottleneck Striping within the group maintains relatively high performance Allows us to use a buffer disk for each storage node, while still maintaining file striping level 6/19/

Testbed ParameterStorage ServerStorage Node CPUCeleron 2.2 GHz Memory (MB)2000 Network Interconnect 1000 Disk TypeSATA Disk Capacity160 GB480 GB Disk Bandwidth126 MB/s 6/19/

Measured Results 6/19/

Measured Results 6/19/

Berkeley Web Trace 6/19/

Response Time Comparison Energy efficiency is slightly improved Response time gain is significant ParameterStripingNo Striping Energy Consumption (J)2,088,1132,100,243 Response Time (S) /19/

Parallel Striping Groups Summary Improves the energy efficiency and performance of a storage system Designed to scale –Needs to be tested on large scale storage system 6/19/

Conclusions Modeling and simulation used to test our ideas –System, Disk, Trace Parameters varied to study their impacts DiskSim Modifications –Added disk power models to DiskSim –Implemented block to file translator Energy Aware Virtual Cluster File System (EEVFS) –Implemented a prototype –Added parallel striping groups to improve the energy efficiency 6/19/

Future Work Improve the EEVFS prototype for production use Run EEVFS on large scale storage system –Investigate scaling effects 6/19/

Download the presentation slides

Questions