Dynamic-Threshold Logic for Low-Power VLSI Design Alan J. Drake University of Michigan ajdrake@umich.edu Richard B. Brown University of Michigan brown@umich.edu IBM Technical Contact Jeffrey L. Burns Austin Research Lab jlburns@us.ibm.com Abstract Power dissipation is a serious concern for circuit designers. Partially-depleted SOI provides a Dynamic Threshold MOS transistor that may be useful in reducing static power and dynamic power. DTMOS can be used to choke off leakage current and improve performance of transistors under lower voltage conditions, but suffers from high bodycontact resistance, Miller capacitance, area penalties, and limited operating voltage. Driving the body with a separate conditioning signal and careful design are proposed as ways to offset the above problems and still take advantage of DTMOS. A ring oscillator has been implemented to verify the use of this technology in various configurations. This information will be helpful in the design of circuits for datapath elements. 1. Introduction Power dissipation is a problem of increasing concern to designers of VLSI circuits. Figure 1 shows the estimated power dissipation trend of high-performance microprocessors through 2005 taken from the SIA roadmap [1]. Low200 Total Power Static Limit Static Power 150 )sttaW( rewoP 100 2. SOI technology 50 0 1999 power microprocessors follow a similar curve. Both the dynamic and static components are increasing rapidly. The power dissipation of digital VLSI systems is composed mostly of dynamic power that is derived from the equation 2 f , where αC is the effective or switching P dyn = αC L V dd L capacitance, Vdd is the power supply voltage, and f is the operating frequency [2]. Since frequency is associated with improved performance, the way to reduce dynamic power is though a reduction of Vdd or the switching capacitance. Static power is dissipated largely in source and drain leakage currents whose magnitudes are roughly dependent on the threshold voltage, VT, and the subthreshold slope of the transistors used. The higher VT is with respect to the voltage used to turn off the transistor, the lower the leakage current will be. Unfortunately, the drive strength of transistors depends on the relationship Vdd-VT so leakage currents are increasing as Vdd and VT scale to maintain drive current. The roadmap estimates that designs must eliminate 88% of the static power in low-power designs and 81% of the static power in high-performance designs by 2005 in order to return static power dissipation to an acceptable level, and suggests using multi-VT circuits as a solution. SOI technology provides for a dynamic-threshold transistor (DTMOS) that can be used to address both static and dynamic power dissipation by providing both low-leakage transistors and low-Vdd operation. The pros and cons of DTMOS and a ring-oscillator experiment designed to test DTMOS configurations will be described. 2.1. SOI technology overview 2000 2001 2002 2003 2004 Year Figure 1: Projected power dissipation 2005 The main advantage of SOI is its reduced junction capacitance due to oxide isolation of individual circuit ele- ments, resulting in lower-power operation. Additional advantages are good short-channel effects due to shallow source/drain junction depth, latchup immunity, and good soft-error immunity. There are two types of SOI technology, fully-depleted (FD) and partially-depleted (PD). FDSOI fully depletes the silicon under the channel of charge carriers for the best short-channel behavior in SOI and no floating-body effects; but it suffers from high source/drain resistance and poor processing control of VT. PD-SOI, shown in Figure 2, has good VT processing control but does not fully deplete the silicon under the gate of charge carriers, leaving a floating body that affects the performance of the transistor. The floating body causes hysteretic timing patterns, increased subthreshold leakage currents, and the parasitic bipolar effect. The increased subthreshold current occurs due to charge accumulating on the body which lowers VT. The parasitic bipolar effect occurs when the source, drain, and body are at the same potential and then the source is quickly pulled down, creat- Gate–[G] N+ N+ tSi [B] P O x id e Oxide [D] tOX [S] 2.2. SOI design techniques tBOX Buried Oxide (BOX) P Substrate–[X] (a) G nFET S D I npn Cgb Csb Csx Isb Ibd Figure 3: DTMOS NFET and threshold-voltage variation curve. [10] ing a large voltage drop from the body to the source, which forms a base-emitter pair. The parasitic bipolar transistor turns on and conducts current when the device should be off [3]. In spite of these problems, the potential for 20-35% improvement in performance at a given power load over comparable bulk CMOS technology makes SOI the technology of choice for some modern microprocessors [4]-[7]. Cdb Cdx B Cbx X (b) Figure 2: (a) Schematic cross section of a PDSOI NFET. (b) Equivalent circuit model. (Inpn is the parasitic lateral npn current, the diodes are the base-emitter and base-collector junction diodes, Isb and Ibd are impact ionization currents.) [3] All major circuit families can be implemented in SOI. Leakage currents can be offset with weak keepers, levelrestoring transistors, or complementary pull-up networks. Floating body effects are mitigated by one or more of the following methods: pre-discharging internal nodes, alternately pre-charging and pre-discharging nodes, re-mapping logic to eliminate large soft nodes, moving parallel transistors closer to ground, and re-arranging or cross-coupling inputs [7][8]. A small margin may also be added to latch hold times or timing requirements may be relaxed. SOI designs can take advantage of current tool sets but there is a disadvantage in that the tools are structured around bulk design techniques which do not leverage SOI’s benefits such as increased series stacks [7]. 3. DTMOS An added degree of freedom in PD-SOI arises from the floating back-gate which can be controlled as an additional port. If the gate of the transistor is connected to the body, see Figure 3, a dynamic-threshold transistor (DTMOS)— also called Body Controlled MOS and Variable Threshold MOS—is created [10]. Notice that as the voltage on the gate rises, the body voltage is forward-biased with respect (a) (b) ref (c) (d) (e) ref ref Logic Circuits ref Boosted Gnd Vreff (f) (g) (h) Figure 4: Body-biasing schemes in the literature [12] to the source and VT drops. At 0 V gate input, VT is the same as a normal, floating-body transistor. DTMOS transistors have the advantages of low leakage current, high current drive, and little history effect while operating at low Vdd. There are also a number of disadvantages: the body-source and body-drain diodes limit the power supply voltage to below 0.7 V to prevent forward biasing these diodes; the high resistance of the body limits the operating frequency of the circuits; the body-drain capacitor forms a Miller capacitance that may eliminate any gain from added current drive; and the body contacts reduce integration levels [11]. The resistive body connection and low integration levels are technology constraints that exist regardless of circuit topology and must be mitigated with careful design. Such techniques include contacting both sides of devices, using multi-fingered FETs, and carefully selecting which transistors need body contacts. The other DTMOS problems of low supply voltage and Miller capacitance depend on the circuit topology. A variety of DTMOS body connections in inverters that try to solve one or more of these problems have been described [12]–[14]. The n-networks are shown in Figure 4 with the p-networks being the dual of those shown. Circuits (a)–(j) extend the operating voltage range and switching speed of DTMOS transistors [12] [13] and the circuit in (j) is used to boost the ground and limit the voltage drop across the circuits to about 0.5–0.7 V [14]. Simulations show that below 1.3V the delay and power-delay product of inverters designed using (a)–(i) was better than floating-body or bulk-like circuits for slightly more power dissipation due to the added transistor [12]. The configurations of Figure 4 lose some of the DTMOS advantages by adding extra tran- (i) (j) [14] sistors and thus load capacitance. Since operating voltages are expected to drop to 0.8 V for low power and 1.1V for performance chips by 2005, which is only 100 to 400 mV higher than the 0.7 V for DTMOS, this added complexity may be unwarranted. The threshold-lowering of DTMOS at 0.7 V could make it a higher performer than floatingbody circuits at 0.8 V and higher. If the Miller capacitance is too high, a separate signal, such as a pre-charge, can condition the body voltage without the added transistors shown in Figure 4. This technique, of driving the body with a conditioning voltage separate from the gate, will be referred to as decoupled DTMOS. The potential of DTMOS was demonstrated in a 32-bit ALU designed using pass-gate logic [14]. At 0.5V operating voltage, the off-state VT was 0.15V and the on-state VT was -0.05V while the ALU operated at 260 MHz and dissipated 2.5 mW. The power dissipation, was 1/50 and the power-delay-product was 1/20 that of a bulk pass-gate design with similar design rules. 4. Controlled body conditioning Another problem with DTMOS is that the conditioning on the body is delayed with respect to the input and a positive body condition is not established until after switching has begun or completed. A more aggressive approach involves pre-conditioning the body to establish better switching conditions before the input arrives. For example, the third inverter in Figure 5(a) has the body being driven by an inverter controlled by some external signal. It would be preferable to have the n-transistor body high before the input goes high to maximize drive current in the 5. Ring-oscillator test chip Vfb’ Vin Vout Vin Vout Vin Vout Vfb (a) Ring-oscillator inverters Vin Vfb’ Strong N Strong P Vout (b) Positive body conditioning Vin Vfb’ Strong P Strong N Vout (c) Negative body conditioning Figure 5: Ring-oscillator inverters and timing diagrams for body contacts n-transistor as it pulls down the output as in Figure 5(b). This may be referred to as positive body condition since the body voltage positively affects the drive strength of the transistor. Figur e5(c) shows the opposite case, or negative body condition, where the n-transistor body is low, causing a weak transistor trying to pull down the output against a strong p-transistor. By varying the shape and position of the body bias, favorable conditions can be obtained to maximize gain and minimize leakage currents in the transistors. The ideal case would be a positive body condition only while the transistor is driving the output and a negative body condition at all other times to minimize leakage. This technique is particularly applicable to synchronous circuits where clock shaping and conditioning already takes place and phased signals are readily available. A separate clock phase, or a phase already generated, can be routed to the circuits in the datapath to provide biasing in a manner that enhances the drive of transistors in the circuit while reducing leakage currents. The challenge will be in shaping that signal and aligning it for maximum benefit. A ring-oscillator test macro with 17 ring-oscillators has been designed in a 0.13 µm, 8 metal layer, 1.2V, copper PD-SOI process to compare floating-body versus DTMOS inverters and to explore the effects of body conditioning. Each oscillator is 41 stages and implements either floatingbody, standard DTMOS, or decoupled DTMOS as show in Figure 5(a). The signal of each ring-oscillator is output on a common bus and divider. An additional bus and divider are provided to output the feed-back body-conditioning signal in the decoupled DTMOS ring-oscillators. To simulate the effect of shifting the body signal with respect to the input signal, decoupled DTMOS is implemented with the output of each inverter stage being fed forward or fed backward to drive the body of another inverter in the ring-oscillator. Figure 6 shows the implementation of feed-forward one inverter (a) and feed-back one inverter (b). Ring-oscillators implement feeding the body conditioning signal forward from one to seven inverters and backward from 0 to 7 inverters, covering a wide range of body conditions with respect to the input signal. An inverter was added to drive the body of the de-coupled oscillators to isolate the body signal from the gate and remove the extra Miller capacitance of DTMOS. This macro will allow power dissipation and frequency to be measured and compared between SOI configurations over a range of power supply voltages and body conditions. Mature circuit models for body-contacted transistors have recently become available and the ring oscillator data will help verify the validity of those models. s1 s2 s4 0 s4 1 Vb Vb Vb Vb (a) Feed-back 1 oscillator s1 s2 s40 s41 Vb Vb Vb Vb (b) Feed-forward 1 oscillator Figure 6: Feed-back and feed-forward configurations 6. Conclusion SOI has become a mainstream technology used for some modern high-performance microprocessors. It is seen to be beneficial for performance and power dissipation. The controllability of the body of the transistors allows for DTMOS circuits to be implemented. DTMOS provides both low leakage and high current drive but must be carefully used to offset decreased area and lower supply voltages. In spite of its limitations, DTMOS shows promise for power-conscious designs. When off, a DTMOS transistor chokes off leakage current and when on it enhances its current drive to offset its low operating voltage. A ring-oscillator macro was developed to test the performance of DTMOS in a 0.13 µm, 8 metal, 1.1V, copper SOI process. Data from this macro will be used in developing low-power circuit styles to use for datapath elements. These circuits will be tested in a multiplyaccumulate unit which will be built in the next phase of this research. 7. References [1] Semiconductor Industry Association, “The International Technology Roadmap for Semiconductors: 2000 Update.” [2] Rabaey, J. M., Digital Integrated Circuits: A Design Perspective, 1st ed Upper Saddle River, New Jersey: Prentice Hall, 1996. [3] Chuang, C. T., Lu, P. F., Anderson, C. J., “SOI for Digital CMOS VLSI: Design Considerations and Advances,” Proceedings of the IEEE, vol. 86, no. 4, pp. 689-720, Apr. 1998. [4] Park, S. B. et. al., “A 0.25-µm, 600-MHz, 1.5-V, Fully Depleted SOI CMOS 64-Bit Microprocessor,” IEEE J. SolidState Circuits, vol. 34, no. 11, pp. 1436-1444, Nov. 1999. [5] Canada, M. et. al., “A 580MHz RISC Microprocessor in SOI,” ISSCC, pp. 430-431, Feb. 1999. [6] Aipperspach, A. G. et. al., “A 0.2-µm, 1.8-V, SOI, 550-MHz, 64-b PowerPC Microprocessor with Copper Interconnects,” IEEE J. Solid-State Circuits, vol. 34, no. 11, pp. 1430-1434, Nov. 1999. [7] Allen, D., Behrends, D., Stanisic, B., “Converting a 64b PowerPC Processor from CMOS Bulk to SOI Technology,” Design Automation Conference, pp. 892-897, June 1999. [8] Das, K. K. and Brown, R. B., “Evaluation of Circuit Design Approaches in Partially Depleted SOI-CMOS using a 32-bit PowerPC Design Vehicle,” IEEE Int. SOI Conference, pp. 98-99, Oct. 2000. [9] Sleight, J., Mistry, K. R., and Antoniadis, D. A., “Transient Measurements of SOI Body Contact Effectiveness,” IEEE Electron Device Letters, vol. 19, no. 12, pp. 499-501, Dec. 1998. [10] Assaderaghi, F., “DTMOS: Its Derivatives and Variations, and their Potential Applications,” Int. Conf. on Microelectronics, pp. 9-10, Nov. 2000. [11] Bernstein, K. and Rohrer, N. J., SOI Circuit Design Concepts, Boston: Kluwer Academic Publishers, 2001. [12] Casu, M. R. et. al., “Comparative Analysis of PD-SOI Active Body-Biasing Circuits,” Proc. IEEE International SOI Conference, pp. 94-95, Oct. 2000. [13] Gil, J. et. al., “A High Speed and Low Power SOI Inverter using Active Body-Bias,” Int. Symp. Low Power Electronics and Design, pp. 59-63, Aug. 1998. [14] Fuse, T. et. al., “A 0.5V 200MHz 1-Stage 32b ALU using a Body Bias Controlled SOI Pass-Gate Logic,” ISSCC, pp. 286287, Feb. 1997.