Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
156 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

AERO: Adaptive Erase Operation for Improving Lifetime and Performance of Modern NAND Flash-Based SSDs (2404.10355v1)

Published 16 Apr 2024 in cs.AR

Abstract: This work investigates a new erase scheme in NAND flash memory to improve the lifetime and performance of modern solid-state drives (SSDs). In NAND flash memory, an erase operation applies a high voltage (e.g., > 20 V) to flash cells for a long time (e.g., > 3.5 ms), which degrades cell endurance and potentially delays user I/O requests. While a large body of prior work has proposed various techniques to mitigate the negative impact of erase operations, no work has yet investigated how erase latency should be set to fully exploit the potential of NAND flash memory; most existing techniques use a fixed latency for every erase operation which is set to cover the worst-case operating conditions. To address this, we propose AERO (Adaptive ERase Operation), a new erase scheme that dynamically adjusts erase latency to be just long enough for reliably erasing target cells, depending on the cells' current erase characteristics. AERO accurately predicts such near-optimal erase latency based on the number of fail bits during an erase operation. To maximize its benefits, we further optimize AERO in two aspects. First, at the beginning of an erase operation, AERO attempts to erase the cells for a short time (e.g., 1 ms), which enables AERO to always obtain the number of fail bits necessary to accurately predict the near-optimal erase latency. Second, AERO aggressively yet safely reduces erase latency by leveraging a large reliability margin present in modern SSDs. We demonstrate the feasibility and reliability of AERO using 160 real 3D NAND flash chips, showing that it enhances SSD lifetime over the conventional erase scheme by 43% without change to existing NAND flash chips. Our system-level evaluation using eleven real-world workloads shows that an AERO-enabled SSD reduces read tail latency by 34% on average over a state-of-the-art technique.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (86)
  1. Samsung. Samsung enterprise SSDs, 2023. https://semiconductor.samsung.com/ssd/enterprise-ssd.
  2. SK Hynix. SK Hynix enterprise SSDs, 2023. https://product.skhynix.com/products/ssd/essd.go.
  3. Micron. Micron enterprise SSDs, 2023. https://www.micron.com/products/ssd/product-lines/9400.
  4. Western Digital. Western Digital Data Center SSDs, 2023. https://www.westerndigital.com/solutions/data-center/high-performance-ssd.
  5. Phase Change Memory. Proceedings of the IEEE, 2010.
  6. Design and Optimization of Nonvolatile Multibit 1T1R Resistive RAM. IEEE Transactions on Very Large Scale Integration (VLSI) Systems, 2013.
  7. Sanjeev Aggarwal. STT–MRAM: High Density Persistent Memory Solution. https://www.flashmemorysummit.com/Proceedings2019/08-07-Wednesday/20190807_NEWM-202B-1_Aggarwal.pdf, 2019.
  8. FeRAM. Springer US, 2009.
  9. 30.3 A 512Gb 3b/Cell 7 th-Generation 3D-NAND Flash Memory with 184MB/s Write Throughput and 2.0 Gb/s Interface. In Proceedings of the 2021 IEEE International Solid-State Circuits Conference (ISSCC), 2021.
  10. A 1Tb 3b/Cell 8th-Generation 3D-NAND Flash Memory with 164MB/s Write Throughput and a 2.4Gb/s Interface. In Proceedings of the 2022 IEEE International Solid-State Circuits Conference (ISSCC), 2022.
  11. GuardedErase: Extending SSD Lifetimes by Protecting Weak Wordlines. In Proceedings of the 20th USENIX Conference on File and Storage Technologies (FAST), 2022.
  12. Reducing SSD Read Latency via NAND Flash Program and Erase Suspension. In Proceedings of the 10th USENIX Conference on File and Storage Technologies (FAST), 2012.
  13. Practical Erase Suspension for Modern Low-latency SSDs. In Proceedings of the 2019 USENIX Annual Technical Conference (ATC), 2019.
  14. Error Characterization, Mitigation, and Recovery in Flash-Memory-Based Solid-State Drives. Proceedings of the IEEE, 2017.
  15. RealWear: Improving performance and lifetime of SSDs using a NAND aging marker. Performance Evaluation, 2021.
  16. The Operation Algorithm for Improving the Reliability of TLC (Triple Level Cell) NAND Flash Characteristics. In Proceedings of the 2011 3rd IEEE International Memory Workshop (IMW), 2011.
  17. A Semi-Preemptive Garbage Collector for Solid State Drives. In Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2011.
  18. ShadowGC: Cooperative Garbage Collection with Multi-level Buffer for Performance Improvement in NAND flash-based SSDs. In Proceedings of the 2018 Design, Automation & Test in Europe Conference & Exhibition (DATE), 2018.
  19. Dynamic Management of Key States for Reinforcement Learning-Assisted Garbage Collection to Reduce Long Tail Latency in SSD. In Proceedings of the 55th Annual Design Automation Conference (DAC), 2018.
  20. Exploring the Potentials of Parallel Garbage Collection in SSDs for Enterprise Storage Systems. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2016.
  21. Parallelizing Garbage Collection with I/O to Improve Flash Resource Utilization. In Proceedings of the 27th International Symposium on High-Performance Parallel and Distributed Computing (HPDC), 2018.
  22. Parallelism and Garbage Collection Aware I/O Scheduler with Improved SSD Performance. In Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2017.
  23. Reinforcement Learning-Assisted Garbage Collection to Mitigate Long-Tail Latency in SSD. ACM Transactions on Embedded Computing Systems (TECS), 2017.
  24. Reducing Garbage Collection Overhead in SSD Based on Workload Prediction. In Proceedings of the 11th USENIX Workshop on Hot Topics in Storage and File Systems (HotStorage), 2019.
  25. Preemptible I/O Scheduling of Garbage Collection for Solid State Drives. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), 2013.
  26. Rejuvenator: A Static Wear Leveling Algorithm for NAND Flash Memory with Minimized Overhead. In Proceedings of the 2011 IEEE 27th Symposium on Mass Storage Systems and Technologies (MSST), 2011.
  27. Pattern-based Write Scheduling and Read Balance-oriented Wear-Leveling for Solid State Drivers. In Proceedings of the 2019 35th Symposium on Mass Storage Systems and Technologies (MSST), 2019.
  28. LLSM: A Lifetime-Aware Wear-Leveling for LSM-Tree on NAND Flash Memory. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), 2022.
  29. Improving NAND Endurance by Dynamic Program and Erase Scaling. In Proceedings of the 5th USENIX Workshop on Hot Topics in Storage and File Systems (HotStorage), 2013.
  30. Lifetime Improvement of NAND Flash-based Storage Systems Using Dynamic Program and Erase Scaling. In Proceedings of the 12th USENIX Conference on File and Storage Technologies (FAST), 2014.
  31. Dynamic Erase Voltage and Time Scaling for Extending Lifetime of NAND Flash-Based SSDs. IEEE Transactions on Computers (TC), 2017.
  32. Exploiting Process Similarity of 3D Flash Memory for High Performance SSDs. In Proceedings of the 52nd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), 2019.
  33. Improving 3D NAND Flash Memory Lifetime by Tolerating Early Retention Loss and Process Variation. Proceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS), 2018.
  34. P-Alloc: Process-Variation Tolerant Reliability Management for 3D Charge-Trapping Flash Memory. ACM Transactions on Embedded Computing Systems (TECS), 2017.
  35. Boosting the Performance of 3D Charge Trap NAND Flash with Asymmetric Feature Process Size Characteristic. In Proceedings of the 2017 54th ACM/EDAC/IEEE Design Automation Conference (DAC), 2017.
  36. Layer-Aware Program-and-Read Schemes for 3D Stackable Vertical-Gate BE-SONOS NAND Flash Against Cross-Layer Process Variations. IEEE Journal of Solid-State Circuits (JSSC), 2015.
  37. Efficient Bad Block Management with Cluster Similarity. In Proceedings of the 2022 IEEE International Symposium on High-Performance Computer Architecture (HPCA), 2022.
  38. Shaving Retries with Sentinels for Fast Read over High-Density 3D Flash. In Proceedings of the 2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), 2020.
  39. Threshold Voltage Distribution in MLC NAND Flash Memory: Characterization, Analysis, and Modeling. In Proceedings of the 2013 Design, Automation & Test in Europe Conference & Exhibition (DATE), 2013.
  40. Read Disturb Errors in MLC NAND Flash Memory: Characterization, Mitigation, and Recovery. In Proceedings of the 45th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), 2015.
  41. An Erase Efficiency Boosting Strategy for 3D Charge Trap NAND Flash. IEEE Transactions on Computers (TC), 2018.
  42. A Novel Double-density, Single-Gate Vertical Channel (SGVC) 3D NAND Flash That Is Tolerant to Deep Vertical Etching CD Variation and Possesses Robust Read-disturb Immunity. In Proceedings of the 2015 IEEE International Electron Devices Meeting (IEDM), 2015.
  43. Reducing Solid-State Drive Read Latency by Optimizing Read-Retry. In Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2021.
  44. MQSim: A Framework for Enabling Realistic Studies of Modern Multi-Queue SSD Devices. In Proceedings of the 16th USENIX Conference on File and Storage Technologies (FAST), 2018.
  45. An Integrated Approach for Managing Read Disturbs in High-Density NAND Flash Memory. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), 2016.
  46. Prolonging 3D NAND SSD Lifetime via Read Latency Relaxation. In Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2021.
  47. Program Interference in MLC NAND Flash Memory: Characterization, Modeling, and Mitigation. In Proceedings of the IEEE 31st International Conference on Computer Design (ICCD), 2013.
  48. Improving Performance and Lifetime of NAND Storage Systems Using Relaxed Program Sequence. In Proceeindgs of the 53nd ACM/EDAC/IEEE Design Automation Conference (DAC), 2016.
  49. Improving Performance and Lifetime of Large-page NAND Storages Using Erase-Free Subpage Programming. In Proceedings of the 54th ACM/EDAC/IEEE Design Automation Conference (DAC), 2017.
  50. Vulnerabilities in MLC NAND Flash Memory Programming: Experimental Analysis, Exploits, and Mitigation Techniques. In Proceedings of the IEEE International Symposium on High Performance Computer Architecture (HPCA), 2017.
  51. Robert Gallager. Low-Density Parity-Check Codes. IRE Transactions on information theory, 1962.
  52. JEDEC. JEDEC Solid State Technology Assn., Method for Developing Acceleration Models for Electronic Component Failure Mechanisms, 2022. https://www.jedec.org/standards-documents/docs/jesd91B.
  53. A 1-Tb 4b/Cell 4-Plane 162-Layer 3D Flash Memory With a 2.4-Gb/s I/O Speed Interface. In Proceedings of the 2022 IEEE International Solid- State Circuits Conference (ISSCC), 2022.
  54. 28.2 A High-Performance 1Tb 3b/Cell 3D-NAND Flash with a 194MB/s Write Throughput on over 300 Layers đť—‚đť—‚\mathsf{i}sansserif_i. In Proceedings of the 2023 IEEE International Solid- State Circuits Conference (ISSCC), 2023.
  55. Inside NAND Flash Memories. 2010.
  56. Erase verify scheme for NAND flash, 1999. US Patent 6,009,014.
  57. Seiichi Aritome. NAND Flash Memory Technologies. 2015.
  58. SONOS Split-Gate eFlash Memory. Embedded Flash Memory for Embedded Systems: Technology, Design for Sub-systems, and Innovations, 2018.
  59. Data Randomization Scheme for Endurance Enhancement and Interference Mitigation of Multilevel Flash Memory Devices. Etri Journal, 2013.
  60. A Scalable Bidimensional Randomization Scheme for TLC 3D NAND Flash Memories. Micromachines, 2021.
  61. ONFI Workgroup. Open NAND Flash Interface Specification 4.1. http://www.onfi.org/~/media/onfi/specs/onfi_4_1_gold.pdf?la=en.
  62. 256Gb 3b/cell V-NAND Flash Memory with 48 Stacked WL Layers. In Proceedings of the 2016 IEEE International Solid-State Circuits Conference (ISSCC), 2016.
  63. JEDEC. JESD218B.02: Solid-State Drive (SSD) Requirements and Endurance Test Method, 2022. https://www.jedec.org/standards-documents/docs/jesd218b01.
  64. Svante Arrhenius. Über die Dissociationswärme und den Einfluss der Temperatur auf den Dissociationsgrad der Elektrolyte. Zeitschrift für Physikalische Chemie, 1889.
  65. Rino Micheloni. 3D Flash Memories. 2016.
  66. Kristian Vättö. Samsung SSD 840: Testing the endurance of TLC NAND, 2012. https://www.anandtech.com/show/6459/samsung-ssd-840-testing-the-endurance-of-tlc-nand.
  67. Billy Tallis. The Samsung 960 Pro (2TB) SSD review, 2016. https://www.anandtech.com/show/10754/samsung-960-pro-ssd-review.
  68. Three-Dimensional 128Gb MLC Vertical NAND Flash-Memory with 24-WL Stacked Layers and 50MB/s High-Speed Programming. In 2014 IEEE International Solid-State Circuits Conference Digest of Technical Papers (ISSCC), 2014.
  69. A 64Gb 533Mb/s DDR Interface MLC NAND Flash in Sub-20nm Technology. In 2012 IEEE International Solid-State Circuits Conference (ISSCC), 2012.
  70. DFTL: A Flash Translation Layer Employing Demand-Based Selective Caching of Page-Level Address Mappings. In Proceedings of the 14th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2009.
  71. Osso Vahabzadeh. LDPC codes: Principles and implementation aspects. In Flash Memory Summit (FMS), 2016.
  72. David Declercq. Improving Waterfall Performance of low cost FAID LDPC Decoders. In Flash Memory Summit (FMS), 2019.
  73. LDPC-in-SSD: Making Advanced Error Correction Codes Work Effectively in Solid State Drives. In 11th USENIX Conference on File and Storage Technologies (FAST), 2013.
  74. Flash-Cosmos: In-Flash Bulk Bitwise Operations Using Inherent Computation Capability of NAND Flash Memory. In Proceedings of the 2022 55th IEEE/ACM International Symposium on Microarchitecture (MICRO), 2022.
  75. GenStore: A High-Performance In-Storage Processing System for Genome Sequence Analysis. In Proceedings of the 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2022.
  76. Lyle Smith. Samsung 990 Pro SSD Review (2TB), 2022. https://www.storagereview.com/review/samsung-990-pro-ssd-review-2tb.
  77. An Adaptive Striping Architecture for Flash Memory Storage Systems of Embedded Systems. In Proceedings of the Eighth IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS), 2002.
  78. An In-Depth Analysis of Cloud Block Storage Workloads in Large-Scale Production. In Proceedings of the 2020 IEEE International Symposium on Workload Characterization (IISWC), 2020.
  79. Write Off-Loading: Practical Power Management for Enterprise Storage. ACM Transactions on Storage (TOS), 2008.
  80. Venice: Improving Solid-State Drive Parallelism at Low Cost via Conflict-Free Accesses. In Proceedings of the 50th Annual International Symposium on Computer Architecture (ISCA), 2023.
  81. Improving Fairness for SSD Devices through DRAM Over-Provisioning Cache Management. IEEE Transactions on Parallel and Distributed Systems (TPDS), 2022.
  82. Self-Adapting Channel Allocation for Multiple Tenants Sharing SSD Devices. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), 2022.
  83. Access Characteristic Guided Partition for NAND Flash based High-Density SSDs. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), 2023.
  84. Adaptive Switch on Wear Leveling for Enhancing I/O Latency and Lifetime of High-Density SSDs. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), 2022.
  85. Dynamo: Amazon’s Highly Available Key-Value Store. In Proceedings of Twenty-First ACM SIGOPS Symposium on Operating Systems Principles (SOSP), 2007.
  86. Why Does the Cloud Stop Computing? Lessons from Hundreds of Service Outages. In Proceedings of the Seventh ACM Symposium on Cloud Computing (SoCC), 2016.
Citations (1)

Summary

We haven't generated a summary for this paper yet.