Aya Fukami, Saugata Ghose, Yixin Luo, Yu Cai, Onur Mutlu

Slides:



Advertisements
Similar presentations
Floating Gate Devices Kyle Craig.
Advertisements

COEN 180 Flash Memory.
Circuit Modeling of Non-volatile Memory Devices
Flash storage memory and Design Trade offs for SSD performance
Thank you for your introduction.
The Performance of Polar Codes for Multi-level Flash Memories
1 A 90nm 512Mb 166MHz Multilevel Cell Flash Memory with 1.5MByte/s Programming Adopted from ISSCC Dig. Tech. Papers, Feb.2005, Intel Corporation[2.6] Presented.
February 25, 2009Chaitanya: MEE Project Defense1 Fully configurable hierarchical transaction level verifier for functional verification Master’s Defense.
S. Reda EN160 SP’07 Design and Implementation of VLSI Systems (EN0160) Lecture 32: Array Subsystems (DRAM/ROM) Prof. Sherief Reda Division of Engineering,
1 Eitan Yaakobi, Laura Grupp Steven Swanson, Paul H. Siegel, and Jack K. Wolf Flash Memory Summit, August 2010 University of California San Diego Efficient.
Avishai Wool lecture Introduction to Systems Programming Lecture 8.3 Non-volatile Memory Flash.
1 Error Correction Coding for Flash Memories Eitan Yaakobi, Jing Ma, Adrian Caulfield, Laura Grupp Steven Swanson, Paul H. Siegel, Jack K. Wolf Flash Memory.
Coding for Flash Memories
Yinglei Wang, Wing-kei Yu, Sarah Q. Xu, Edwin Kan, and G. Edward Suh Cornell University Tuan Tran.
Error Analysis and Management for MLC NAND Flash Memory Onur Mutlu (joint work with Yu Cai, Gulay Yalcin, Erich Haratsch, Ken Mai, Adrian.
Justin Meza Qiang Wu Sanjeev Kumar Onur Mutlu Revisiting Memory Errors in Large-Scale Production Data Centers Analysis and Modeling of New Trends from.
Yu Cai1 Gulay Yalcin2 Onur Mutlu1 Erich F. Haratsch3
3/20/2013 Threshold Voltage Distribution in MLC NAND Flash: Characterization, Analysis, and Modeling Yu Cai 1, Erich F. Haratsch 2, Onur Mutlu 1, and Ken.
Yu Cai1, Erich F. Haratsch2 , Onur Mutlu1 and Ken Mai1
Thank you for your introduction.
Program Interference in MLC NAND Flash Memory: Characterization, Modeling, and Mitigation Yu Cai 1 Onur Mutlu 1 Erich F. Haratsch 2 Ken Mai 1 1 Carnegie.
/38 Lifetime Management of Flash-Based SSDs Using Recovery-Aware Dynamic Throttling Sungjin Lee, Taejin Kim, Kyungho Kim, and Jihong Kim Seoul.
National Institute of Science & Technology Technical Seminar Presentation-2004 Presented By: Arjun Sabat [EE ] Flash Memory By Arjun Sabat Roll.
Yu Cai, Yixin Luo, Erich F. Haratsch*, Ken Mai, Onur Mutlu
2010 IEEE ICECS - Athens, Greece, December1 Using Flash memories as SIMO channels for extending the lifetime of Solid-State Drives Maria Varsamou.
Extracting Robust Keys from NAND Flash Physical Unclonable Functions Shijie Jia, Luning Xia, Zhan Wang, Jingqiang Lin, Guozhu Zhang and Yafei Ji Institute.
STORAGE DEVICES Presentation By: Saurabh Mishra. A data storage device is a device for recording (storing) information (data). CD, Hard Disk and Flash.
Embedded System Lab. Daeyeon Son Neighbor-Cell Assisted Error Correction for MLC NAND Flash Memories Yu Cai 1, Gulay Yalcin 2, Onur Mutlu 1, Erich F. Haratsch.
Towards minimizing read time for NAND Flash Towards minimizing read time for NAND Flash Globecom December 5 th, 2012 Borja Peleato, Rajiv Agarwal, John.
Physical Memory and Physical Addressing By Alex Ames.
Neighbor-Cell Assisted Error Correction for MLC NAND Flash Memories Yu Cai 1 Gulay Yalcin 2 Onur Mutlu 1 Erich F. Haratsch 3 Adrian Cristal 2 Osman S.
TM Freescale™ and the Freescale logo are trademarks of Freescale Semiconductor, Inc. All other product or service names are the property of their respective.
Carnegie Mellon University, *Seagate Technology
COEN 180 Flash Memory. Floating Gate Fundamentals Floating Gate between control gate and channel in MOSFET. Not directly connected to an outside line.
Data Retention in MLC NAND FLASH Memory: Characterization, Optimization, and Recovery. 서동화
Eitan Yaakobi, Laura Grupp Steven Swanson, Paul H. Siegel, and Jack K. Wolf Flash Memory Summit, August 2011 University of California San Diego Error-Correcting.
FaridehShiran Department of Electronics Carleton University, Ottawa, ON, Canada SmartReflex Power and Performance Management Technologies.
Carnegie Mellon University, *Seagate Technology
Yixin Luo, Saugata Ghose, Yu Cai, Erich F. Haratsch, Onur Mutlu Carnegie Mellon University, Seagate Technology Online Flash Channel Modeling and Its Applications.
COS 518: Advanced Computer Systems Lecture 8 Michael Freedman
Smruti R. Sarangi IIT Delhi
Understanding Modern Flash Memory Systems
What you should know about Flash Storage
Neighbor-Cell Assisted Error Correction for MLC NAND Flash Memories
On-Chip ECC for Low-Power SRAM Design
DuraCache: A Durable SSD cache Using MLC NAND Flash Ren-Shuo Liu, Chia-Lin Yang, Cheng-Hsuan Li, Geng-You Chen IEEE Design Automation Conference.
Understanding Latency Variation in Modern DRAM Chips Experimental Characterization, Analysis, and Optimization Kevin Chang Abhijith Kashyap, Hasan Hassan,
Hardware Accelerator Test Bench for Error-Correcting Algorithms
Write-hotness Aware Retention Management: Efficient Hot/Cold Data Partitioning for SSDs Saugata Ghose ▪ joint.
Ambit In-Memory Accelerator for Bulk Bitwise Operations Using Commodity DRAM Technology Vivek Seshadri Donghyuk Lee, Thomas Mullins, Hasan Hassan, Amirali.
Neighbor-cell Assisted Error Correction for MLC NAND Flash Memories
Yixin Luo Saugata Ghose Yu Cai Erich F. Haratsch Onur Mutlu
Information Storage and Spintronics 09
Hasan Hassan, Nandita Vijaykumar, Samira Khan,
Vulnerabilities in MLC NAND Flash Memory Programming: Experimental Analysis, Exploits, and Mitigation Techniques Yu Cai, Saugata Ghose, Yixin Luo, Ken.
Information Storage and Spintronics 10
COS 518: Advanced Computer Systems Lecture 8 Michael Freedman
Vulnerabilities in MLC NAND Flash Memory Programming: Experimental Analysis, Exploits, and Mitigation Techniques HPCA Session 3A – Monday, 3:15 pm,
Yixin Luo Saugata Ghose Yu Cai Erich F. Haratsch Onur Mutlu
Memory.
Semiconductor Memories
Memory Architecture and Storage Systems
Committee: Onur Mutlu (Chair) Phillip B. Gibbons James C. Hoe
Yixin Luo Saugata Ghose Yu Cai Erich F. Haratsch Onur Mutlu
COS 518: Advanced Computer Systems Lecture 9 Michael Freedman
Prof. Onur Mutlu ETH Zürich Fall November 2018
RAIDR: Retention-Aware Intelligent DRAM Refresh
Yu Cai, Yixin Luo, Erich F. Haratsch, Ken Mai, Onur Mutlu
Saugata Ghose Carnegie Mellon University
Information Storage and Spintronics 08
Presentation transcript:

Aya Fukami, Saugata Ghose, Yixin Luo, Yu Cai, Onur Mutlu Improving the Reliability of Chip-Off Forensic Analysis of NAND Flash Memory Devices Aya Fukami, Saugata Ghose, Yixin Luo, Yu Cai, Onur Mutlu

Brief Summary of the Paper Our Goal: Identify error sources in NAND flash memory during chip-off Quantify errors in NAND flash memory introduced in chip-off Identify a mitigation process to reduce errors introduced during chip-off analysis Our findings: Long storage time of devices increases errors in NAND flash memory Heat in chip-off increases uncorrectable errors Read-retry mechanism can reduce errors introduced during chip-off

Talk Outline Background Testing Methodology and Experimental Results Basic operation of NAND flash memory Testing Methodology and Experimental Results Retention error Errors introduced by heat How to Improve Reliability of Chip-off Analysis Read-retry operation

Talk Outline Background Testing Methodology and Experimental Results Basic operation of NAND flash memory Testing Methodology and Experimental Results Retention error Errors introduced by heat How to Improve Reliability of Chip-off Analysis Read-retry operation

MLC NAND Flash Memory Cell Operation Substrate Drain Source Control Gate Floating Gate Oxide – – – – – – – – – – – – – – – – – – – – – – – – – – – – Stored data: 11    01 00 10 Threshold voltage Current[A] Voltage[V] Amount of charge = Threshold voltage of the cell = Stored data value

MLC NAND Cell Vth Distribution Read voltage 01 00 10 11 Number of cells Threshold Voltage Threshold voltages need to be between each read voltage

Retention Error on MLC NAND Flash Cell Substrate Drain Source Control Gate Floating Gate Oxide – – – – – – – – – – – – – – – – – – – – – – – – – – – – Read voltage 01 00 10 11 Number of cells Error! Error! Error! Threshold Voltage Charge leakage over time causes threshold voltage shifts Data error in result is called retention error

NAND Flash Cell Degradation – – – – Oxide – – – – – – – – – – – – – – – – – – – – – – – Programming Erasing Degraded Cell 00 10 Number of cells Error! Threshold Voltage Repeated programming and erasing (P/E cycle) accelerates charge leakage

NAND Flash Error Sources During Chip-off Heat guns or electrical rework machines De-solder NAND flash memory chips with heat Required temperature and duration: 250 °C (482 °F), ~2 minutes High temperature accelerates charge leakage

Error Correction Codes (ECC) Flash memory controllers store ECC codewords to correct errors in data Typical correction capability for recent chip: 40 bits correction capability per 1KB Errors exceeding ECC correction capability: uncorrectable errors

Talk Outline Background Testing Methodology and Experimental Results Basic operation of NAND flash memory Testing Methodology and Experimental Results Retention error Errors introduced by heat How to Improve Reliability of Chip-off Analysis Read-retry operation

Testing Environment Test chips: New 2y-nm NAND flash memory chips from two different vendors (hereafter called Chip A and Chip B) Controller: Altera DE0 FPGA

Testing Methodology: Retention Error Evaluation Repeated programming/erasing cycles (P/E cycles) 10, 300, 1000, 2500, and 4000 cycles Raw bit error rate (RBER) measurement at multiple retention age (=wait time after programming) Day 0 and 1, Week 1, 2, 3 and 4

Experimental Result: Retention Error: Chip A 10-2 10-3 10-4 10-5 ECC Error Correction Capability 18× 12× RBER grows as P/E cycle count and retention age increase

Experimental Result: Retention Error: Chip B 10-2 10-3 10-4 10-5 ECC Error Correction Capability 4× 3× RBER grows as P/E cycle count and retention age increase

Testing Methodology: Thermal Effect Evaluation Baking target chips at 250 °C for 2 mins at different retention age (simulating chip-off procedures) 1 Week 4 Weeks Raw bit error rate (RBER) measurement after baking

Experimental Result: Errors Introduced by Heat (Chip A) 100 10-1 10-2 10-3 10-4 10-5 ECC Error Correction Capability 51× 33× Heat introduces errors more than ECC can correct

Experimental Result: Errors Introduced by Heat (Chip B) 100 10-1 10-2 10-3 10-4 10-5 ECC Error Correction Capability 37× 51x Heat introduces errors more than ECC can correct

Experimental Result: Uncorrectable Errors after Baking Fraction of pages that contains uncorrectable errors (P/E cycle=300) Retention Period before Baking Chip A Chip B 1 Week 29.1% 78.1% 4 Weeks 84.2% 83.6% Heat introduces uncorrectable errors even when the chip has been only lightly used

Talk Outline Background Testing Methodology and Experimental Results Basic operation of NAND flash memory Testing Methodology and Experimental Results Retention error Errors introduced by heat How to Improve Reliability of Chip-off Analysis Read-retry operation

to reduce errors caused by threshold voltage shifts Read-Retry Mechanism Default read voltage Shifted read voltage Shifted read voltage 11 01 Number of cells 01 → 11 01 → 11 Threshold Voltage Read-retry mechanism shifts the read voltage to reduce errors caused by threshold voltage shifts

Testing Methodology: Read-Retry Evaluation Read-Retry command found on chip B Implemented as a vendor specific command Read operation with read-retry Evaluation of 2 modes (mode A and B)

Experimental Result: Error Reduction with Read-Retry 10-1 10-2 10-3 10-4 -94.6% ECC Error Correction Capability -88.6% Read-retry can reduce errors introduced by thermal-based chip-off procedure

Uncorrectable Error Reduction by Read-Retry Fraction of pages that contains uncorrectable errors (Chip B, after baking) Read Mode P/E Cycle Count 300 1000 Default 83.6% 99.7% Read-Retry A 0% 12.1% Read-Retry B Read-retry can reduce errors introduced by thermal-based chip-off procedure

Conclusions and Recommendations Wait time increases errors Conduct data extraction at the earliest possible time after receiving a device Heat introduces uncorrectable errors Keep the temperature as low as possible Read-retry can reduce errors Use read-retry after chip-off when available

Aya Fukami, Saugata Ghose, Yixin Luo, Yu Cai, Onur Mutlu Improving the Reliability of Chip-Off Forensic Analysis of NAND Flash Memory Devices Aya Fukami, Saugata Ghose, Yixin Luo, Yu Cai, Onur Mutlu