1 The University of Texas at Austin Ali Shafiee, A. Gundu, M. Shevgoor, R. Balasubramonian and M. Tiwari.

Slides:



Advertisements
Similar presentations
Posted CAS for DDR-II SDRAM Bill Gervasi Technology Analyst October, 2002 Posted CAS.
Advertisements

Jaewoong Sim Alaa R. Alameldeen Zeshan Chishti Chris Wilkerson Hyesoon Kim MICRO-47 | December 2014.
A Case for Refresh Pausing in DRAM Memory Systems
Memory Controller Innovations for High-Performance Systems
Application-Aware Memory Channel Partitioning † Sai Prashanth Muralidhara § Lavanya Subramanian † † Onur Mutlu † Mahmut Kandemir § ‡ Thomas Moscibroda.
LEMap: Controlling Leakage in Large Chip-multiprocessor Caches via Profile-guided Virtual Address Translation Jugash Chandarlapati Mainak Chaudhuri Indian.
University of Michigan Electrical Engineering and Computer Science University of Michigan Electrical Engineering and Computer Science University of Michigan.
4/17/20151 Improving Memory Bank-Level Parallelism in the Presence of Prefetching Chang Joo Lee Veynu Narasiman Onur Mutlu* Yale N. Patt Electrical and.
Rethinking DRAM Design and Organization for Energy-Constrained Multi-Cores Aniruddha N. Udipi, Naveen Muralimanohar*, Niladrish Chatterjee, Rajeev Balasubramonian,
Smart Refresh: An Enhanced Memory Controller Design for Reducing Energy in Conventional and 3D Die-Stacked DRAMs Mrinmoy Ghosh Hsien-Hsin S. Lee School.
Effects of Virtual Cache Aliasing on the Performance of the NetBSD Operating System Rafal Boni CS 535 Project Presentation.
1 MemScale: Active Low-Power Modes for Main Memory Qingyuan Deng, David Meisner*, Luiz Ramos, Thomas F. Wenisch*, and Ricardo Bianchini Rutgers University.
1 Adaptive History-Based Memory Schedulers Ibrahim Hur and Calvin Lin IBM Austin The University of Texas at Austin.
1 DIEF: An Accurate Interference Feedback Mechanism for Chip Multiprocessor Memory Systems Magnus Jahre †, Marius Grannaes † ‡ and Lasse Natvig † † Norwegian.
Micro-Pages: Increasing DRAM Efficiency with Locality-Aware Data Placement Kshitij Sudan, Niladrish Chatterjee, David Nellans, Manu Awasthi, Rajeev Balasubramonian,
Lecture 12: DRAM Basics Today: DRAM terminology and basics, energy innovations.
DRAM. Any read or write cycle starts with the falling edge of the RAS signal. –As a result the address applied in the address lines will be latched.
Handling the Problems and Opportunities Posed by Multiple On-Chip Memory Controllers Manu Awasthi, David Nellans, Kshitij Sudan, Rajeev Balasubramonian,
The many-core architecture 1. The System One clock Scheduler (ideal) distributes tasks to the Cores according to a task map Cores 256 simple RISC Cores,
DRAM Visualization Tool -- Austin, Prashant, Vincent1 DRAM SIMULATOR VISUALIZATION TOOL ENEE 759 H Austin Lanham Prashant Bhoot Vincent Chan.
Thermal-Aware SoC Test Scheduling with Test Set Partitioning and Interleaving Zhiyuan He 1, Zebo Peng 1, Petru Eles 1 Paul Rosinger 2, Bashir M. Al-Hashimi.
Systems I Locality and Caching
QUANTIFYING THE RELATIONSHIP BETWEEN THE POWER DELIVERY NETWORK AND ARCHITECTURAL POLICIES IN A 3D-STACKED MEMORY DEVICE Manjunath Shevgoor, Niladrish.
1 Towards Scalable and Energy-Efficient Memory System Architectures Rajeev Balasubramonian School of Computing University of Utah.
Memory Technology “Non-so-random” Access Technology:
McRouter: Multicast within a Router for High Performance NoCs
Spring 2007W. Rhett DavisNC State UniversityECE 747Slide 1 ECE 747 Digital Signal Processing Architecture SoC Lecture – Working with DRAM April 3, 2007.
Niladrish Chatterjee Manjunath Shevgoor Rajeev Balasubramonian
Leveraging Heterogeneity in DRAM Main Memories to Accelerate Critical Word Access Niladrish Chatterjee Manjunath Shevgoor Rajeev Balasubramonian Al Davis.
UNDERSTANDING THE ROLE OF THE POWER DELIVERY NETWORK IN 3-D STACKED MEMORY DEVICES Manjunath Shevgoor, Niladrish Chatterjee, Rajeev Balasubramonian, Al.
How Hardware and Software Work Together
1 Efficient Data Access in Future Memory Hierarchies Rajeev Balasubramonian School of Computing Research Buffet, Fall 2010.
Survey of Existing Memory Devices Renee Gayle M. Chua.
Reducing Refresh Power in Mobile Devices with Morphable ECC
1 Reducing DRAM Latencies with an Integrated Memory Hierarchy Design Authors Wei-fen Lin and Steven K. Reinhardt, University of Michigan Doug Burger, University.
Page Overlays An Enhanced Virtual Memory Framework to Enable Fine-grained Memory Management Vivek Seshadri Gennady Pekhimenko, Olatunji Ruwase, Onur Mutlu,
Timing Channel Protection for a Shared Memory Controller Yao Wang, Andrew Ferraiuolo, G. Edward Suh Feb 17 th 2014.
Embedded System Lab. 최 길 모최 길 모 Kilmo Choi A Software Memory Partition Approach for Eliminating Bank-level Interference in Multicore.
1 Fair Queuing Memory Systems Kyle Nesbit, Nidhi Aggarwal, Jim Laudon *, and Jim Smith University of Wisconsin – Madison Department of Electrical and Computer.
The Memory Hierarchy 21/05/2009Lecture 32_CA&O_Engr Umbreen Sabir.
Analyzing Performance Vulnerability due to Resource Denial-Of-Service Attack on Chip Multiprocessors Dong Hyuk WooGeorgia Tech Hsien-Hsin “Sean” LeeGeorgia.
1 Presented By: Michael Bieniek. Embedded systems are increasingly using chip multiprocessors (CMPs) due to their low power and high performance capabilities.
MIAO ZHOU, YU DU, BRUCE CHILDERS, RAMI MELHEM, DANIEL MOSSÉ UNIVERSITY OF PITTSBURGH Writeback-Aware Bandwidth Partitioning for Multi-core Systems with.
DTM and Reliability High temperature greatly degrades reliability
High-Performance DRAM System Design Constraints and Considerations by: Joseph Gross August 2, 2010.
Parallelism-Aware Batch Scheduling Enhancing both Performance and Fairness of Shared DRAM Systems Onur Mutlu and Thomas Moscibroda Computer Architecture.
AN ASYNCHRONOUS BUS BRIDGE FOR PARTITIONED MULTI-SOC ARCHITECTURES ON FPGAS REPORTER: HSUAN-JU LI 2014/04/09 Field Programmable Logic and Applications.
1 Adapted from UC Berkeley CS252 S01 Lecture 18: Reducing Cache Hit Time and Main Memory Design Virtucal Cache, pipelined cache, cache summary, main memory.
Efficiently Prefetching Complex Address Patterns Manjunath Shevgoor, Sahil Koladiya, Rajeev Balasubramonian University of Utah Chris Wilkerson, Zeshan.
33 rd IEEE International Conference on Computer Design ICCD rd IEEE International Conference on Computer Design ICCD 2015 Improving Memristor Memory.
Managing DRAM Latency Divergence in Irregular GPGPU Applications Niladrish Chatterjee Mike O’Connor Gabriel H. Loh Nuwan Jayasena Rajeev Balasubramonian.
Fall 2000M.B. Ibáñez Lecture 14 Memory Management II Contiguous Allocation.
Cache memory. Cache memory Overview CPU Cache Main memory Transfer of words Transfer of blocks of words.
Secure Dynamic Memory Scheduling against Timing Channel Attacks
Seth Pugsley, Jeffrey Jestes,
Good morning everyone, my name is Arjun Deb
Computer Memory.
Virtual Memory Use main memory as a “cache” for secondary (disk) storage Managed jointly by CPU hardware and the operating system (OS) Programs share main.
A Requests Bundling DRAM Controller for Mixed-Criticality System
Niladrish Chatterjee Manjunath Shevgoor Rajeev Balasubramonian
Bank-aware Dynamic Cache Partitioning for Multicore Architectures
Software Cache Coherent Control by Parallelizing Compiler
التأهيل التربوي المملكة العربية السعودية جامعة الملك عبدالعزيز
Ali Shafiee Rajeev Balasubramonian Feifei Li
Massachusetts Institute of Technology
Manjunath Shevgoor, Rajeev Balasubramonian, University of Utah
Niladrish Chatterjee Manjunath Shevgoor Rajeev Balasubramonian
Enabling Transparent Memory-Compression for Commodity Memory Systems
Rajeev Balasubramonian
Presentation transcript:

1 The University of Texas at Austin Ali Shafiee, A. Gundu, M. Shevgoor, R. Balasubramonian and M. Tiwari

Core1 $ $ MC Core0 $ $ 2 Avoiding Information Leakage in the Memory Controller with Fixed Service Policies

3 Core 0 $ $ MC 3rd party software Core 1 $ $ Core 0: load changed Core 1: access latency changed Avoiding Information Leakage in the Memory Controller with Fixed Service Policies

RD DM WR DM WR RD time Slot L Quantum Q= 4xL Goal: Minimize L Such that L is enough to transfer one read or one write While Satisfying cmd-to-cmd min time gaps Data Placement relaxes time gaps smart data placement  shorter L CPU Avoiding Information Leakage in the Memory Controller with Fixed Service Policies

Data Bus CA Bus Rank 0 Rank 1 Bank Memory Access = ACT+ CAS ACT CAS 5 Avoiding Information Leakage in the Memory Controller with Fixed Service Policies

6 ACTCAS ACT00 CAS06 Rank(A)  Rank(B) ACTCAS ACT50 CAS04 Rank(0) Bank(A)  Bank(B) ACTCAS ACT511 CAS284 Rank(0) Bank(A)  Bank(A) t BURST t RTRS t BURST t RRD t FAW t CCD t RCD t RAS t RC t RP t RTP Avoiding Information Leakage in the Memory Controller with Fixed Service Policies

Rank-Part: Rank Partitioning Bank-Part: Bank Partitioning No-Part: No Partitioning Core 0 Core 1 Core 2 Core 0 Core 1 Core 2 Core 3 7 Avoiding Information Leakage in the Memory Controller with Fixed Service Policies

RD WR RD WR time Data Bus 0L 1L 2L 3L 4L 5L time CA Bus ACT CAS T RCD T CAS CAS ACT T RCD T CWD KL-T CAS KL-(T CAS +T RCD ) KL-T CWD KL-(T CWD +T RCD ) Fixed Periodic Transfer 8 Avoiding Information Leakage in the Memory Controller with Fixed Service Policies

Enough Time to Transfer No Collision on CA Bus L ≥T BURST +T RTRS  L ≥ 6 CAS(RD) ≠ CAS (WR) KL-11 ≠ K’L-5 Rank-Part  L=7 Bank-Part  L=15 No-Part  L=43 9 (K-K’)L ≠6  L≠6 Avoiding Information Leakage in the Memory Controller with Fixed Service Policies

R R W W R R R R W W R R W W W W L=15 R R W W R R R R W W R R W W W W L=6 L=15 Return to CPU en masse Q=120 Q=63 10 Avoiding Information Leakage in the Memory Controller with Fixed Service Policies

L=43 Q= L= Q= Memory Bank = + + 3x15=45>43 11 CPU = Avoiding Information Leakage in the Memory Controller with Fixed Service Policies

Simics Simics – 8 4-way superscalar cores – L1I (32K)/L1D (32KB)/L2 (1MB)  per core USIMM USIMM – 1channel, 8 ranks, 8 banks Benchmark Benchmark – SPEC 2006 – NPB Temporal Partitioning Compared with Temporal Partitioning (HPCA’14) 12 Avoiding Information Leakage in the Memory Controller with Fixed Service Policies

Increase OS complexity RANK PARTITIONINGNO PARTITIONINGBANK PARTITIONING PERFORMANCE NON-SECURE BASELINE FS FS: RD/WR-REORDER FS: TRIPLE ALTERNATION TP % 12% 72% Avoiding Information Leakage in the Memory Controller with Fixed Service Policies

14 Avoiding Information Leakage in the Memory Controller with Fixed Service Policies

Thank You 15 Avoiding Information Leakage in the Memory Controller with Fixed Service Policies