A Method to Implement Low Energy Read Operations, and Single Cycle Write after Read in Subthreshold SRAMs Arijit Banerjee Dated: 12/10/2012 VLSI Robust 6332 Low PProject ower VLSI Why did We Moved to Subthreshold Supply Voltages? ο§ πΈππ π΄π/ππ¦π = 1 2 πΆπππ ππ·π· 2 πππ π΄π = 1 2 πΆπππ ππ·π· ππππ₯ 2 ο§ Reducing Ceff is Costly in terms of design effort ο§ For Low frequency(in KHz) Medical Circuits, Vdd scaling to Subthreshold Voltages is Very Effective 2 Problems in 6T based Subthreshold Bitcells ο§ For 6T based 8T, 9T 10T … Half select Problem in Subthreshold domain ο§ Read Stress SNM in Half Selected Cells while Writing ο§ No Column Muxing a must ο§ Writeback (Two Cycle Write after Read ) a must in Writing 3 How to lower Energy in 6T based Subthreshold SRAMs? ο§ Possibly Voltage Scaling? ο§ Operating in Subthreshold Domain ο§ Voltage Scaling further? Not a good Idea! ο§ Vmin is Limited by Worst case RSNM, HSNM, VDRV, and so on 4 Minimum Operation Voltage (Vmin) Dependency ο§ Worse case μ - 3σ Read SNM(RSNM or RNM) for 6T Worst Case IBM 130nm data provided by James Boley BSN Chip Group @ UVa 5 Vmin Dependency ο§ Worse case μ - 3σ Hold SNM(HSNM) for 6T based 10T Worst Case MIT 180xlp data provided by James Boley BSN Chip Group @ UVa 6 Vmin Dependency ο§ Worse case μ + 3σ Data Retention Voltage (VDRV) Worst Case IBM 130nm data provided by James Boley BSN Chip Group @ UVa 7 Possible Solutions to “How to lower Energy in 6T based SRAMs” ο§ New Type of Bitcells ο§ Novel Read/Write Methods ο§ New SRAM Architectures 8 Earlier Works in SRAM Dynamic Energy/Power Mitigation Prior Work Ali Valaee … SRAM Read-Assist Scheme, ISOCC, 2011 Energy/Power Savings 21.30% S. Yoshimoto… Low-Energy Disturb Mitigation Scheme IEEE Symposium on VLSI Circuits Digest of Technical Papers , 2011 32% Atsushi Kawasumi… Bitline Amplitude Limiting (BAL) Scheme, IEEE Asian Solid-State Circuits Conference on, 2011 26% Mohammad Sharifkhani … Segmented Virtual Grounding Architecture, ISLPED 2006 44% A. Kawasumi … Energy Saving without Voltage Reduction ICICDT, 2012 60% 9 Can We do Better? ο§ Using RAS-CAS DRAM Timing Concept in SRAM ο§ Low Energy Read (LER)? ο§ Do Not operate Decoders, WL Drivers ο§ “N-1” LER Operations per one Read in N word Row ο§ Auto detection of LER 10 Single Cycle Write after Read(WAR)? ο§ Earlier Approach was Two Cycle Write after Read ο§ Current Approach is Single Cycle Write after Read ο§ Pulsed Read and Write Word Line Generation in WAR ο§ Controllable WAR Margins through External Pins ο§ Using Intermediate Latch to Latch the Read Row Before Write 11 Block Diagram of the 4KB Subthreshold Data Memory Row and Bank Decoders Column Decoders Address Input Flops Precharge and column Circuitry Global Bitline Mux/Demux 128 Bit Intermediate Latch Write after Read Control, Timing Control, LER Support Logic, and Power Control Logic 128 Bit to 16 Bit Bus Interface Logic 16 bit I/O Bus Control Signals 11 Bit Address Bus Memory Array 4 Banks X 1 KB 12 4KB Subthreshold SRAM Layout 1KB Bank 1 Decoders and Word Line Drivers 1KB Bank 2 Write after Read Control, Timing Control, LER Support Logic, and Power Control Logic Address Bus, Data Input Bus Control Signals Data Output Bus 128 Bit to 16 Bit Bus Mux 1KB Bank 3 1KB Bank 4 13 What about Half Select Problem? ο§ We do not solve it; We bypass it ο§ No Column Mux used for avoiding half select problem ο§ Gains? Penalty? 14 Energy Comparison: Read vs. LER LER Energy or Read Energy in Joules Comparison of Read Energy @ 0.3V 27C with LER Energy @ 0. 5V 27C in 4KB Subthreshold Memory 8.00E-13 7.00E-13 6.00E-13 5.00E-13 3X Savings 2.5X Savings 4.00E-13 LER Energy @ 0.5V 27C TT Read Energy @ 0.3V 27C TT LER Enrgy @ 0.5 27C FF 3.00E-13 Read Energy @ 0.3v 27C FF 2.00E-13 1.00E-13 0.00E+00 Supply voltage in Volts 15 Subthreshold LER Energy Savings Trend with this Scheme LER Energy Savings vs. Supply Voltage @ 27C in 4KB Subthreshold SRAM LER Energy Savings vs. Supply Voltage @ 27C in 4KB Subthreshold SRAM 10 8 6 TT FF 4 SS FS 2 SF SS, FS fails @ 0.35V and 0.3V, SF fails @ 0.3V 0 0.25 0.3 0.35 0.4 0.45 LER Energy Savings LER Energy Savings 10 8 6 TT FF 4 SS FS 2 SF 0 0.5 Supply voltage in Volts 0.4 0.45 0.5 Supply voltage in Volts 16 Above Subthreshold LER Energy Savings Trend LER Energy Savings vs. Supply Voltage @ 27C in 2KB High Speed SRAM LER Energy Savings vs. Supply Voltage @ 27C in 2KB High Speed SRAM 10 8 TT FF SS 6 FS SF LER Energy Savings LER Energy Savings 10 9 8 7 6 TT 5 FF 4 SS 3 FS 2 SF 1 0 4 0.8 0.9 1 1.1 1.2 1.3 Supply voltage in Volts 0.9 1 1.2 Supply voltage in Volts 17 How is LER Energy vs. Cycle Time? LER Energy vs. Cycle Time @ 27C in 4KB Subthreshold Memory LER Energy in Joules 3.64E-12 1.82E-12 9.09E-13 TT FF FS 4.55E-13 SF SS 2.27E-13 SS, FS fails @ 0.35V and 0.3V, SF fails @ 0.3V 1.14E-13 1.20E-08 5.01E-06 1.00E-05 1.50E-05 2.00E-05 Cycle Time in Seconds 18 How is Read Energy vs. Cycle Time? Read Energy vs. Cycle Time @ 27C in 4KB Subthreshold Memory Read Energy in Joules 7.28E-12 3.64E-12 TT 1.82E-12 FF FS SF SS 9.09E-13 SS, FS fails @ 0.35V and 0.3V, SF fails @ 0.3V 4.55E-13 1.20E-08 2.01E-06 4.01E-06 6.01E-06 8.01E-06 1.00E-05 Cycle Time in Seconds 19 How is Write after Read Energy vs. Cycle Time? Write after Read Energy in Joules Write after Read Energy vs. Cycle Time @ 27C in 4KB Subthreshold Memory 3.64E-12 1.82E-12 TT 9.09E-13 FF FS SF 4.55E-13 SS SS, FS fails @ 0.35V and 0.3V, SF fails @ 0.3V 2.27E-13 1.20E-08 5.01E-06 1.00E-05 1.50E-05 2.00E-05 Cycle Time in Seconds 20 Temperature Dependency of Cycle Time and Access Time Temperature vs. Access Time in 4KB Subthreshold Memory@ 0.5V 7.63E-06 3.81E-06 1.91E-06 9.54E-07 4.77E-07 TT 2.38E-07 FF SS 1.19E-07 FS 5.96E-08 SF 2.98E-08 -45 5 55 Temperature in degree Celsius Access Time in Seconds Cycle Time in Seconds Temperature vs. Cycle Time in 4KB Subthreshold Memory @ 0.5V 7.63E-06 3.81E-06 1.91E-06 9.54E-07 4.77E-07 TT 2.38E-07 FF SS 1.19E-07 FS 5.96E-08 SF 2.98E-08 -45 5 55 Temperature in degree Celsius 21 Comparison of Read Energy: Old vs. New Design (Old Design) Processes Average Rd Energy (New Design) Average @ 0.5V 27C (in Joules) Read Energy (in Joules) (New Design) Rd Energy Savings TT 1.74E-12 1.73E-12 1X FF 1.15E-12 1.85E-12 0.6X SS 8.21E-13 1.47E-12 0.55X FS 2.39E-12 2.26E-12 1.05X SF 7.51E-13 1.34E-12 0.56X (New Design) Average LER Energy (in Joules) (New Design) LER Energy Savings 2.47E-13 7X 2.86E-13 6.5X 2.42E-13 6X 3.07E-13 7.4X 2.34E-13 5.7X 22 Comparison of Write after Read (WAR) Energy: Old vs. New Design (Old Design) Average Wr + Average Rd (New Design) Average Processes Energy in two Cycle WAR or Wr Energy in @ 0.5V 27C (in Joules) one Cycle (in Joules) TT 2.36E-12 1.48E-12 FF 3.12E-12 1.51E-12 SS 1.30E-12 1.60E-12 FS 3.80E-12 1.53E-12 SF 9.76E-13 1.28E-12 Old(Wr+Rd)/new(WAR) Energy Savings 1.6X 2X 0.8X 2.5X 0.75X 23 Comparison of Leakage Current: Old vs. New Design (Old Design) Total Standby (New Design) Total Processes @ 0.5V Leakage Current Standby Leakage 27C (in Amps) Current (in Amps) Leakage Savings TT 7.05E-06 6.77E-06 1.04X FF 2.27E-05 2.32E-05 0.97X SS 2.55E-06 2.17E-06 1.17X FS 1.23E-05 1.26E-05 0.98X SF 6.90E-06 6.77E-06 1.01X 24 Comparison of Area and Cycle Time: Old vs. New Design Parameters Area in square microns Cycle Time @ TT, 0.5V,27C in nS Access Time @ TT, 0.5V, 27C in nS Old Design 547199.521 1401 735 New Design 584920.376 230 198 Gains -7% 6X 3.7X 25 Comparison With Prior Works Prior Work Ali Valaee … SRAM Read-Assist Scheme, ISOCC, 2011 Energy/Power Savings 21.30% S. Yoshimoto… Low-Energy Disturb Mitigation Scheme IEEE Symposium on VLSI Circuits Digest of Technical Papers , 2011 32% Atsushi Kawasumi… Bitline Amplitude Limiting (BAL) Scheme, IEEE Asian Solid-State Circuits Conference on, 2011 26% Mohammad Sharifkhani … Segmented Virtual Grounding Architecture, ISLPED 2006 44% A. Kawasumi … Energy Saving without Voltage Reduction ICICDT, 2012 This Work 60% 5.7X @ 0.5V SF 27C, 5.1X @ 0.45 SS 27C, 1.67X @ 0.4 FS 27C 26 Conclusion ο§ 7 LER Operations per one Read in 8 word Row ο§ WAR Margins are Externally Controllable ο§ Penalty of 7% area, 3% worst case Standby Leakage, 25% worst case WAR Energy, and 45% worst case Read Energy ο§ Worst case 5.7X LER energy savings in KHz frequencies @ 0.5V 27C 27 References ο§ ο§ ο§ ο§ ο§ ο§ ο§ ο§ ο§ ο§ ο§ ο§ ο§ ο§ [1] J. P. Kulkarni, K. Kim, and K. Roy, “A 160 mV Robust Schmitt Trigger Based Subthreshold SRAM,” IEEE Journal of Solid-State Circuits, vol. 42, no. 10, pp. 2303–2313, Oct. 2007. [2] I. J. Chang, J.-J. Kim, S. P. Park, and K. Roy, “A 32 kb 10T Sub-Threshold SRAM Array With Bit-Interleaving and Differential Read Scheme in 90 nm CMOS,” IEEE Journal of Solid-State Circuits, vol. 44, no. 2, pp. 650–658, Feb. 2009. [3] T. H. Kim, J. Liu, J. Keane, and C. H. Kim, “A high-density subthreshold SRAM with data-independent bitline leakage and virtual ground replica scheme,” in Solid-State Circuits Conference, 2007. ISSCC 2007. Digest of Technical Papers. IEEE International, 2007, pp. 330–606. [4] B. H. Calhoun and A. Chandrakasan, “A 256kb sub-threshold SRAM in 65nm CMOS,” in Solid-State Circuits Conference, 2006. ISSCC 2006. Digest of Technical Papers. IEEE International, 2006, pp. 2592–2601. [5] G. K. Reddy, K. Jainwal, J. Singh, and S. P. Mohanty, “Process variation tolerant 9T SRAM bitcell design,” in Quality Electronic Design (ISQED), 2012 13th International Symposium on, 2012, pp. 493–497. [6] Ali Valaee, Asim J. Al-Khalili, “SRAM Read-Assist Scheme for High Performance Low Power Applications” in International SoC Design Conference (ISOCC ) on , 2011, pp. 179-182. [7] S. Yoshimoto, M. Terada, S. Okumura, T. Suzuki, S. Miyano, H. Kqwaguchi and M. Yoshimoto, “A 40-nm 0.5-V 20.1-µW/MHz 8T SRAM with Low-Energy Disturb Mitigation Scheme,” in IEEE Symposium on VLSI Circuits Digest of Technical Papers on, 2011, pp. 72-73. [8] Atsushi Kawasumi, Toshikazu Suzuki, Shinich Moriwaki and Shinji Miyano, “ Energy Efficiency Degradation Caused by Random Variation in Low-Voltage SRAM and 26% Energy Reduction by Bitline Amplitude Limiting (BAL) Scheme,” in IEEE Asian Solid-State Circuits Conference on, 2011, pp. 165-168. [9] Mohammad Sharifkhani, Manoj Sachdev, “A Low Power SRAM Architecture Based on Segmented Virtual Grounding,” in International symposium on Low Power Electronics and Design (ISLPED) on, 2006, pp. 256-261. [10] A. Kawasumi, Y. Takeyama, O. Hirabayashi, K. Kushida, F. Tachibana. Y. Niki, S. Sasaki and T. Yabe, “Energy Efficiency Deterioration by Variability in SRAM and Circuit Techniques for Energy Saving without Voltage Reduction,” in IC Design & Technology (ICICDT), 2012 IEEE International Conference on, 2012. [11] Mohammed Shareef I, Pradeep Nair, Bharadwaj Amrutur, “Energy Reduction in SRAM using Dynamic Voltage and Frequency Management,” in 2008 21st International Conference on VLSI Design on, 2008, pp. 503-508. [12] http://download.micron.com/pdf/datasheets/psram/8mb_asyncpage_p23z.pdf [13] http://www.issi.com/pdf/41C-LV16256C.pdf [14] http://www.chiplus.com/Uploads/DataSheet/Pseudo%20SRAM/Pseudo_CS26LV32163%20(2.7).pdf 28 Acknowledgements ο§ Grad Students @ BSN Chip Team: James Boley , Yousef Shakhsheer, Alicia Klinefelter, Yanqing Zhang, Kyle Craig ο§ Mateja Putic, Grad Student, UVa ο§ Gary Lee, SEAS IT Administrator ο§ Professor Mircea Stan, ECE, UVa ο§ Professor Ben Calhoun, ECE, UVa 29 Thank You! Questions? 30