ISSCC 2003 / SESSION 22 / TD: EMBEDDED TECHNOLOGIES / PAPER 22.8 22.8 Zigzag Super Cut-off CMOS (ZSCCMOS) Block Activation with Self-Adaptive Voltage Level Controller: An Alternative to Clock-Gating Scheme in Leakage Dominant Era Kyeong-Sik Min1, Hiroshi Kawaguchi, Takayasu Sakurai University of Tokyo, Tokyo, Japan 1 Now wih Kookmin University, Seoul, Republic of Korea Leakage power is expected to be dominant in sub-70nm technology. In the leakage dominant environments, a clock gating scheme loses its effectiveness although currently it is the most effective way to reduce power consumption. A clock-gating substitute is proposed which achieves a 200ps wake-up time and 3 orders of magnitude leakage reduction for leakage dominant LSI’s and experimentally verifies the effectiveness of the scheme. Super Cut-off CMOS (SCCMOS) in Fig. 22.8.1a, which does not use a high-VTH switch, can be operated at sub-1V VDD [1] and is scalable. However, it suffers from a long wake-up time and a high current peak at the sleep-to-active transition. This prevents the scheme from being used as a clock-gating substitute with less than one cycle latency. Node (L) voltage and VSSV (virtual VSS line) are initially low but they become high during sleep because the switch, MN, is cut off completely. Once they become high, the charge associated with all low internal gate nodes and VSSV is restored low at wake-up. This makes the wake-up time long and the peak current high. Figure 22.8.1b shows Zigzag Super Cutoff CMOS (ZSCCMOS) that is proposed to improve the operating speed by eliminating the series-connected switches while achieving the relaxation of the high-voltage stress at the cut-off switch [2]. The zigzag configuration reduces the wake-up time and rush current, because the voltage of node (L) does not change during sleep and VSSV changes very little by means of a self-reverse-biasing effect. Though this zigzagging concept is previously proposed in [3], the cut-off switch with high VTH degrades both the wakeup time and operating speed severely and moreover, the wake-up feature was not discussed in [3]. The same effects of ZSCCMOS are expected for another zigzag scheme called Zigzag Boosted Gate MOS (ZBGMOS) and shown in Fig. 22.8.1c. The MN and MP switch MOSFET’s are made with thick gate oxide and high VTH. Unlike the ZSCCMOS, the power consumption in generating VGP and VGN which are higher and lower than VDD and VSS, respectively, is zero in a sleep mode but finite in an active mode (although it is negligible in most applications). Figure 22.8.1d shows a voltage stress plot of the cut-off switch, MN, in Figs. 22.8.1a and 22.8.1b. This shows that a voltage over-stress higher than VDD does not occur as long as |VGN| is smaller than 0.3V in the ZSCCMOS. Alternatively, in the conventional SCCMOS, |VGD| begins to exceed VDD when |VGN| is more than 0.05V and this small allowance of |VGN| limits the cut-off magnitude. In this clock-gating method both the switching and leakage power are reduced by block-wise activation. The ZSCCMOS block activation scheme is composed of a combinational logic block (CLB), F/F’s, and a control circuit as shown in Fig. 22.8.2a. A local clock (LCLK) is controlled by a block enable signal (EN). The fast recovery feature of the ZSCCMOS is best suited to block activation where the fast wake-up time of less than a cycle is essential. Figure 22.8.2b shows the schematic waveforms for Fig.22.8.2a. VSSV should be restored to VSS before the LCLK arrives at the input F/F, implying the necessity of the fast wake-up. To design the combinational logic block using ZSCCMOS, every node voltage in the block should be predictable at the sleep mode, since cut-off switches should be inserted in series to the off MOSFET’s in the combinational block. Unfortunately, node • 2003 IEEE International Solid-State Circuits Conference voltages are generally not predictable, since they are determined by the last inputs to the block. To make every node voltage predictable, a new phase-forcing circuit is added to the input F/F depicted in Fig. 22.8.3a, where P and Q are forced to be low or high, even though D is unknown during sleep. Without input phase forcing, the internal node voltages of the block may be changing from H to L or L to H during sleep. When the wake-up signal comes, if the internal node voltages are in between VDD and VSS, a large short-circuit peak current may arise and large wake-up delay ruins the scheme. Figure 22.8.3b shows a leakage-suppressed output F/F. Since Q of the output F/F is connected to the other logic block driven by a different LCLK or a global clock (GCLK), the Q is not be forced to H or L and keeps its state during the sleep. Therefore, G1, G2, and G3 are not be connected to the virtual power lines. Since G2 and G3 are not on the critical path, they can be built with highVTH devices. G1 is connected to both NMOS and PMOS cut-off switches to suppress the leakage. Figure 22.8.4 shows a VN (negative voltage) generator for the NMOS cut-off switch. It is composed of a self-adaptive voltagelevel detector (SAVLD) and a charge pump. In addition, an oxidestress-relaxed level shifter is also shown in Fig. 22.8.4, where negative voltage lower than VSS is distributed to VGN without giving rise to any over-stress [4]. In the SAVLD, if VN is too close to VSS, VLDO turns on the charge pump to lower VN. It should be noted that VN changes self-adaptively with the process, voltage and temperature variation. The lower the VGN of the cut-off switch is, the smaller the leakage is but at the same time, the lower VGN pushes up VSSV higher during sleep and thus increases the wake-up time and rush current. The self-adaptive VN generator makes a tradeoff between wake-up time and leakage suppression. Figure 22.8.5a shows the measured waveforms of the VP generator fabricated in the 0.6µm technology when VDD =1.1V. If VP goes lower than a target voltage of 1.2V even by 30mV, the SAVLD turns on the charge pump. Figure 22.8.5b shows the measured ∆VP with varying the temperature from 25OC to 75OC. This figure shows good agreement between measurement and simulation. Figure 22.8.6a shows the measured wake-up time of the ZSCCMOS and SCCMOS that are fabricated using the 0.6µm technology. The comparison shows that the wake-up time of the ZSCCMOS is 8 times faster than that of the SCCMOS. The inverter delay with a fan-out of 3 is also shown in Fig. 22.8.6a. From this figure, it is seen that the wake-up time of ZSCCMOS is almost the same with the inverter delay with a fan out of 3. Figure 22.8.6b compares the VSSV wake-up time of various schemes in future designs, where the VSSV wake-up time is defined by the time of VSSV being restored to 95% of VSS. Simulation is carried out by using the projected 70nm MOS models provided by the BPTM [5]. The wake-up time of the ZSCCMOS is 12 times faster than the SCCMOS in 70nm technology. As in Fig. 22.8.6a, the wake-up time of ZSCCMOS in 70nm technology in Fig. 22.8.6b seems to be almost the same with the inverter delay with a fan out of 3. Acknowledgements The authors acknowledge VLSI Design and Education Center (VDEC) for support in preparing the test chip and K. Kanda, K. Nose, and K. Inagaki for discussions. References [1] H. Kawaguchi et al., IEEE J. Solid State Circuits, vol. 35, pp. 14981501, Oct. 2000. [2] K. Min et al., 28th European Solid-State Circuits Conference, pp. 679682, Sep. 2002. [3] M. Horiguchi et al., IEEE J. Solid State Circuits, vol. 28, pp. 11311135, Nov. 1993. [4] Y. Nakagome et al., IEEE J. Solid State Circuits, vol. 26, pp. 10031010, July 1991. [5] Berkeley Predictive Technology Model web site (BPTM) 0-7803-7707-9/03/$17.00 ©2003 IEEE ISSCC 2003 / February 12, 2003 / Salon 1-6 / 4:45 PM 93 JHQHUDWRU %ORFN HQDEOH (1 /&/. / /HYHO VKLIWHU 9''9 9669 /9669 9 669 $FW966 $FW 6OHHS9'' D (1 (1/ /&/. 0103KLJK97+ WKLFNR[LGH 9669 9*1 F 9''9 9*3 + / 9''9 /RJLF EORFN 9669 9ROWDJHVWUHVV_9*'_ 96699*19 6OHHS 9*3$FWLYH9666OHHS9''9 9'' 03 + 966 *&/. 9*1$FWLYH 9''96OHHS9 01 D / / 9669 01 9*1 91 91 JHQHUDWRU + &RPELQDWLRQDO 03 ORJLFEORFN *&/.% 9*1$FWLYH9'' 6OHHS9 01 6OHHS =LJ]DJ/RJLFEORFN /RJLF EORFN / 9669 + /RFDO FON *OREDOFON *&/. + / (1/ 9'' 9*3 /RZOHDNDJH ))287 9'' 9*3$FWLYH9 6OHHS9'' 03 9'' 9''9 3KDVHIRUFLQJ )),1 9'' 9 93 /HYHO VKLIWHU /DWFK =6&&026RI)LJE 6&&026RI)LJD Figure 22.8.2: (a) ZSCCMOS block activation scheme (b) Its waveforms. R 7 &QPWHFK : &87: 727$/ /&/. 9*1$FWLYH9''6OHHS9 01 6PDOOVZLQJ 9669 9669$FWLYH9666OHHS9 6OHHS $FW 9''9$FWLYH9''6OHHS9 / 6OHHS $FWLYH E (1/% FNE FN 9'' 9 FNE LQ RXW FN 4+ 3/ '; 9*19 D ,19ZLWKKLJK97+ /&/. E G 9''9 Figure 22.8.1: (a) Super cut-off CMOS (SCCMOS) Scheme (b) Zigzag super cut-off CMOS (ZSCCMOS) Scheme (c) Zigzag Boosted Gate MOS (ZBGMOS) Scheme (d) Voltage stress plot of the cut-off switch of Figure 22.8.1(a) and (b). WCUT and WTOTAL mean the width of cut-off switch and the total width of MOSFET’s that are connected to the cut-off switch, respectively. 9'' 9*3 3/ '+ * 9669 966 4; 9*1 * * E Figure 22.8.3: (a) Input flip-flop with phase-forcing circuit (b) Low-leakage output flip-flop. 93 ,5() P9 9/'2 9'' 9 9%,$6 9/'2 9 (1/ 9'' 9 9*1 01 9 &KDUJHSXPS 2[LGHVWUHVVUHOD[HG /HYHOVKLIWHU Figure 22.8.4: VN (negative voltage) generator for NMOS cut-off switch with self-adaptive voltage level detector, charge pump, and oxide-stress-relaxed level shifter. • 2003 IEEE International Solid-State Circuits Conference ∆939 6HOIDGDSWLYH YROWDJHOHYHO GHWHFWRU6$9/' 22 91 9 0HDVXUHPHQW 6LPXODWLRQ D µP7HFK 9'' 9 R E 7HPSHUDWXUH & Figure 22.8.5: (a) Measured operating waveforms of VP (positive voltage) generator (b) Measured ∆VP with varying temperature. 0-7803-7707-9/03/$17.00 ©2003 IEEE 0HDVXUHGZDNHXSWLPHQV 6LPXODWHGZDNHXSWLPHSV 0HDVXUHGLQYHUWHUGHOD\)2 =6&&026 6&&026 6LPXODWLRQ ,2 EXIIHUV 5LQJFRQILJXUDWLRQWRPHDVXUHZDNHXSWLPH R 7 &µP7HFK:&87:727$/ D 9''9 =6&&02697+ 99*1 9/ µP 6&&02697+ 99*1 9/ µP 07&02697+ 99*1 9/ µP %*02697+ 99*1 9/ µP =%*02697+ 99*1 9/ µP ,QYHUWHUGHOD\97+ 9/ µP)2 6HOI$GDSWLYH ,2 EXIIHUV 9ROWDJH/HYHO 'HWHFWRU 93 JHQHUDWRU 6$9/' R 7 &9'' 9QP7HFK :&87:727$/ &KDUJHSXPS E Figure 22.8.6: (a) Measured wake-up times in 0.6µm technology; (b) Simulated wake-up times in the projected 70-nm technology. Figure 22.8.7: Test chip micrograph. 22 • 2003 IEEE International Solid-State Circuits Conference 0-7803-7707-9/03/$17.00 ©2003 IEEE 9'' 9 9'' 9*3$FWLYH9 6OHHS9'' 03 + / 9669 9*1$FWLYH9'' 6OHHS9 01 6OHHS =LJ]DJ/RJLFEORFN /RJLF EORFN / + /9669 9 669 $FW966 $FW 6OHHS9'' 9669 9*1$FWLYH 9''96OHHS9 01 0103KLJK97+ WKLFNR[LGH / + + / 9''9 /RJLF EORFN 9669 F 9ROWDJHVWUHVV_9*'_ 96699*19 D 9*3$FWLYH9666OHHS9''9 9'' 03 =6&&026RI)LJE 6&&026RI)LJD R 7 &QPWHFK : &87: 727$/ 9*1$FWLYH9''6OHHS9 01 6PDOOVZLQJ 9669 9669$FWLYH9666OHHS9 6OHHS $FW 9''9$FWLYH9''6OHHS9 / E 9''9 9'' 9 9*19 G Figure 22.8.1: (a) Super cut-off CMOS (SCCMOS) Scheme (b) Zigzag super cut-off CMOS (ZSCCMOS) Scheme (c) Zigzag Boosted Gate MOS (ZBGMOS) Scheme (d) Voltage stress plot of the cut-off switch of Figure 22.8.1(a) and (b). WCUT and WTOTAL mean the width of cut-off switch and the total width of MOSFET’s that are connected to the cut-off switch, respectively. • 2003 IEEE International Solid-State Circuits Conference 0-7803-7707-9/03/$17.00 ©2003 IEEE 93 JHQHUDWRU %ORFN HQDEOH (1 /DWFK 93 /HYHO VKLIWHU /&/. /HYHO VKLIWHU 91 JHQHUDWRU 91 9'' 9*3 / 9*1 D + 01 &RPELQDWLRQDO 03 ORJLFEORFN / /RZOHDNDJH ))287 /RFDO FON 3KDVHIRUFLQJ )),1 (1/ *OREDOFON *&/. 9'' 9''9 9669 966 *&/. *&/.% (1 (1/ /&/. 9*1 9669 9''9 9*3 6OHHS $FWLYH 6OHHS E Figure 22.8.2: (a) ZSCCMOS block activation scheme (b) Its waveforms. • 2003 IEEE International Solid-State Circuits Conference 0-7803-7707-9/03/$17.00 ©2003 IEEE /&/. (1/% FNE FN LQ FNE RXW FN 4+ 3/ '; D ,19ZLWKKLJK97+ /&/. 9''9 966 9'' '+ 9669 9*3 3/ 4; * 9*1 * * E Figure 22.8.3: (a) Input flip-flop with phase-forcing circuit (b) Low-leakage output flip-flop. • 2003 IEEE International Solid-State Circuits Conference 0-7803-7707-9/03/$17.00 ©2003 IEEE ,5() 9/'2 9'' 9 (1/ 9%,$6 9*1 01 9 91 9 6HOIDGDSWLYH YROWDJHOHYHO GHWHFWRU6$9/' &KDUJHSXPS 2[LGHVWUHVVUHOD[HG /HYHOVKLIWHU Figure 22.8.4: VN (negative voltage) generator for NMOS cut-off switch with self-adaptive voltage level detector, charge pump, and oxide-stress-relaxed level shifter. • 2003 IEEE International Solid-State Circuits Conference 0-7803-7707-9/03/$17.00 ©2003 IEEE 93 P9 9/'2 9 9'' 9 ∆939 0HDVXUHPHQW 6LPXODWLRQ D µP7HFK 9'' 9 R 7HPSHUDWXUH & E Figure 22.8.5: (a) Measured operating waveforms of VP (positive voltage) generator (b) Measured ∆VP with varying temperature. • 2003 IEEE International Solid-State Circuits Conference 0-7803-7707-9/03/$17.00 ©2003 IEEE 0HDVXUHGZDNHXSWLPHQV 6LPXODWHGZDNHXSWLPHSV 0HDVXUHGLQYHUWHUGHOD\)2 =6&&026 6&&026 6LPXODWLRQ R 7 &µP7HFK:&87:727$/ D 9''9 =6&&02697+ 99*1 9/ µP 6&&02697+ 99*1 9/ µP 07&02697+ 99*1 9/ µP %*02697+ 99*1 9/ µP =%*02697+ 99*1 9/ µP ,QYHUWHUGHOD\97+ 9/ µP)2 R 7 &9'' 9QP7HFK :&87:727$/ E Figure 22.8.6: (a) Measured wake-up times in 0.6µm technology; (b) Simulated wake-up times in the projected 70-nm technology. • 2003 IEEE International Solid-State Circuits Conference 0-7803-7707-9/03/$17.00 ©2003 IEEE ,2 EXIIHUV 5LQJFRQILJXUDWLRQWRPHDVXUHZDNHXSWLPH &KDUJHSXPS 6HOI$GDSWLYH ,2 EXIIHUV 9ROWDJH/HYHO 'HWHFWRU 93 JHQHUDWRU 6$9/' Figure 22.8.7: Test chip micrograph. • 2003 IEEE International Solid-State Circuits Conference 0-7803-7707-9/03/$17.00 ©2003 IEEE