
A Gain-Boosted 90-dB Dynamic Range Fast Settling OTA with 7.8-mW Power Consumption

Yun Chiu, Ken Wojciechowski
Electrical Engineering and Computer Science Department
University of California, Berkeley
Berkeley, CA 94720
EE240, Spring 2000 Final Project Report

Abstract - A fully differential high-dynamic range operational transconductance amplifier (OTA) to be used in switched-capacitor filters and/or over-sampled A/D converters is presented. The architecture chosen for this design is folded cascode with fully differential gain boosting. Design analysis and HSPICE simulation demonstrate that this structure realizes the best trade-off between power consumption, speed, and dynamic range for this design. The OTA achieves a constant large-signal DC gain (ADC) of > 84 dB over process and temperature variations. It is designed in a 0.35-µm CMOS process and draws a DC power of 7.8 mW from a 3-V supply. The achieved rms output noise voltage (Vo,n) is 50.1 µV in the worst case. With a peak-to-peak output voltage swing (Vo,pp) of 4.5 V, the achieved output dynamic range (DR) is 90 dB. The OTA also demonstrates excellent settling behavior with a single-pole roll-off frequency response; no overshoot or ringing is observed at the output when the required maximum 4-V transient is applied at the input. The worst-case settling time to < 0.05% accuracy is ~ 19.5 ns.

1. INTRODUCTION TO GAIN-BOOSTING TECHNIQUE

As CMOS design scales into the low-power, low-voltage regime, designing analog functional blocks under a limited supply voltage becomes more and more difficult. One typical example is the basic gain stage. Cascoding is the most widely used technique to achieve high gain; it is preferred over 2-stage designs because of its superior frequency response. However, we quickly run into headroom problems when trying to cascode more transistors in a stack.

[Figure 1: CMOS gain-boosting techniques under limited supply voltage. Panels: single-ended gain boosting; fully differential gain boosting.]

The gain-boosting technique [1] was introduced to remedy this problem. It increases the DC gain of the operational amplifier (op amp) without sacrificing the output swing of a regular cascode structure. Furthermore, it has been pointed out [2] that the gain-boosting technique decouples the DC gain and the frequency response of the amplifier. It is therefore possible to achieve high speed and high gain at the same time. These features are especially desirable in high-speed, high-dynamic range applications like switched-capacitor filters and A/D converters.

Typical gain-boosting structures are shown in Fig. 1. The single-ended boosting architecture uses four auxiliary op amps to regulate the transconductance (gm) of the cascode transistors and enhance the gain realized by the original cascoding effect. It was first reported by K. Bult and G. Geelen [1]. However, single-ended boosters have some undesired effects, mainly:

• The signal usually travels a longer path inside the booster and sees an extra pole from an internal current mirror. Therefore, the frequency response of the booster suffers, especially when optimizing the pole-zero doublet effect.

• Noise contributions generated by the biasing circuitry inside the boosters are not correlated. This means the noise overhead associated with gain boosting is higher.

These problems are addressed by using a fully differential gain-boosting scheme, which improves the settling behavior and reduces noise as well. A fully differential amplifier, however, requires common-mode feedback. In this design, an input common-mode voltage control circuit is adopted; it is described in detail later.

2. AMPLIFIER DESIGN

2.1. Optimum Architecture

Three candidates are available: a telescopic differential OTA with gain boosting, a folded-cascode OTA with gain boosting, and a 2-stage OTA with a class-A output stage. The folded-cascode architecture with gain boosting is chosen for the following reasons:

• Compared to the folded-cascode structure, a telescopic amplifier has a lower output swing range but twice the power efficiency. Its output has to accommodate a minimum of 5 Vdsat's and 1 Vth (not taking into account that the tail current source may need to be cascoded as well), while the folded-cascode structure has an overhead of only 4 Vdsat's. Given the required DR of 90 dB, a quick calculation shows whether the telescopic structure offers an advantage. Assume Vdsat = 250 mV and Vth = 650 mV. For a 3-V supply, the telescopic structure yields a maximum output swing Vo,max = 3 - 5*0.25 - 0.65 = 1.1 V, while the folded-cascode architecture achieves Vo,max = 3 - 4*0.25 = 2 V. To reach the same dynamic range, the noise power of the telescopic amplifier must therefore be reduced by roughly 3 to 4 times ((2/1.1)² ≈ 3.3). On the other hand, a telescopic amplifier has mainly 4 transistors contributing to the output noise while a folded-cascode one has 6, so the latter shows a noise factor (N) about 5/3 times larger. Combining the two effects, we conclude that, although the telescopic structure features a higher power efficiency and a lower noise factor, the dynamic range constraint more than offsets these advantages. In a situation where less dynamic range is demanded or enough supply room is available, the telescopic architecture will yield a better design.

• A 2-stage OTA, on the other hand, offers an even larger output swing, since only 2 Vdsat's of headroom are needed. If a telescopic amplifier is used for the 1st stage, the architecture also offers a lower noise factor (ignoring the noise contribution from the 2nd stage).
But, unlike cascoding, cascading gain stages has a negative impact on the frequency response of the amplifier, and thus frequency compensation (Miller/cascode compensation) has to be employed to trade bandwidth for phase margin/stability. The 2nd stage usually has to burn a substantial amount of power in order to push the non-dominant pole (ωp2 = gm2/CL) far away from the unity-gain frequency (ωu = gm1/Cc). A 2-stage OTA is therefore usually used in interfacing applications where an output stage for driving external resistive loads is indispensable. To drive on-chip capacitive loads, the superior frequency response and power efficiency of single-stage amplifiers generally offer better performance.

• The folded-cascode architecture with gain boosting is chosen as the optimum structure for this design. It offers a relatively large output swing and a good single-pole roll-off frequency response. For applications that require both high speed and high dynamic range, the folded-cascode structure offers the best trade-off compared to the two aforementioned architectures. It also achieves a much larger input common-mode voltage range. A properly designed gain-boosting amplifier offers plenty of gain (usually ~ (gm ro)^4) for a regular cascode structure yet still preserves its frequency response characteristics.

Lastly, it is well known that a pole-zero doublet is often associated with gain boosting. Later in this report, we present the result of a small-signal analysis of the gain-boosting technique. It will become clear that the pole-zero doublet and its consequence (slow settling) can be very well suppressed.

2.2. Dynamic Range

The dynamic range is defined as

$$ DR = \frac{P_{signal}}{P_{noise}} = \frac{1}{8} \cdot \frac{V_{o,pp}^2}{\overline{V_{o,n}^2}} = 10^{9}. $$

We assume a maximum output swing Vo,pp = 4 V based on the above analysis. So

$$ \overline{V_{o,n}^2} = \frac{1}{8} \cdot 10^{-9} \cdot V_{o,pp}^2 \approx (45\ \mu V)^2 . $$
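As a quick numerical check of the dynamic-range definition above, the following minimal Python sketch computes the rms noise budget for a given swing and DR target. The function names are illustrative and not part of the original report; the only inputs are the numbers quoted in the text.

```python
import math


def noise_budget_rms(v_pp, dr_db):
    """RMS output noise allowed for a peak-to-peak swing v_pp (V) and a DR target (dB).

    Uses DR = (1/8) * V_pp^2 / V_n^2, the definition given in section 2.2.
    """
    dr_linear = 10.0 ** (dr_db / 10.0)
    return math.sqrt(v_pp ** 2 / (8.0 * dr_linear))


def dynamic_range_db(v_pp, v_n_rms):
    """Dynamic range in dB for a given swing (V) and rms output noise (V)."""
    return 10.0 * math.log10(v_pp ** 2 / (8.0 * v_n_rms ** 2))


# Budget for the 4-V swing and 90-dB target assumed in the text.
v_n = noise_budget_rms(4.0, 90.0)
print(f"allowed rms output noise: {v_n * 1e6:.1f} uV")
```

With the 4-V swing and 90-dB target this gives roughly 45 µV, in line with the estimate above. Feeding the worst-case figures quoted in the abstract (4.5-V swing, 50.1-µV noise) into `dynamic_range_db` returns about 90 dB.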
Assuming a single-pole frequency response, the closed-loop output noise variance is

$$ \overline{V_{o,n}^2} = N \cdot \frac{2}{3} \cdot \frac{1}{\beta} \cdot \frac{kT}{C_L}, $$

where N is the noise factor, β is the feedback factor, and CL is the load capacitance. We ignore the Flicker noise for the time being and only count the thermal noise because of the large bandwidth requirement for fast settling. Flicker noise has an upper cut-off frequency of ~ 100 kHz; for broadband systems it contributes only a negligible output noise power.

[Figure 2: Diagrams for noise and settling time calculation. Panels: input-referred noise calculation; settling time calculation (gain in dB versus time, slope = -8.7 dB/τ, settling toward the DC gain).]

For a folded-cascode amplifier, 4 current sources and 2 input diff. pair transistors contribute most of the output noise (Fig. 2). To calculate the noise, we refer the current-source noise voltages V²n,p and V²n,n to the gate of the input transistor. Since the 2 PMOS current sources carry double the current of the input transistor, the noise factor for these PMOS devices is

$$ N_p = \frac{g_{mp}}{g_{mi}} = \frac{4 I_b}{V_{dsat,p}} \Big/ \frac{2 I_b}{V_{dsat,i}} = \frac{2 V_{dsat,i}}{V_{dsat,p}}, $$

and for the NMOS devices it is

$$ N_n = \frac{g_{mn}}{g_{mi}} = \frac{2 I_b}{V_{dsat,n}} \Big/ \frac{2 I_b}{V_{dsat,i}} = \frac{V_{dsat,i}}{V_{dsat,n}}. $$

So the total noise factor is N = 2 ⋅ (Np + Nn + 1) = 4 if we assume the gate overdrive voltage of the input diff. pair is 1/3 of that of the current sources. Since we assumed a 250-mV Vdsat for the current source transistors, this means that we need to reduce the gate overdrive of the diff. pair in order to minimize N; the input transistors are therefore relatively large. Let us assume that Ci of the input transistor is about the same as Cs. Therefore,

$$ \beta = \frac{C_f}{C_s + C_f + C_i} \approx \frac{2}{3}. $$

Thus we obtain CL,min = 8.1 pF, the minimum load capacitance required to achieve a 90-dB dynamic range.
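The chain from noise factor to minimum load capacitance can be sketched in Python as follows. This is an illustration only: the temperature (300 K) is an assumption, since the report does not state the value used, and the capacitor values are expressed in relative units using the closed-loop gain Cs/Cf = 1/4 quoted later in section 2.4.

```python
BOLTZMANN = 1.380649e-23  # J/K


def noise_factor(vdsat_input, vdsat_pmos, vdsat_nmos):
    """N = 2 * (Np + Nn + 1) with Np = 2*Vdsat,i/Vdsat,p and Nn = Vdsat,i/Vdsat,n."""
    n_p = 2.0 * vdsat_input / vdsat_pmos
    n_n = vdsat_input / vdsat_nmos
    return 2.0 * (n_p + n_n + 1.0)


def feedback_factor(c_s, c_f, c_i):
    """beta = Cf / (Cs + Cf + Ci)."""
    return c_f / (c_s + c_f + c_i)


def min_load_capacitance(v_n_rms, n, beta, temperature_k=300.0):
    """Solve V_n^2 = N * (2/3) * (1/beta) * kT / C_L for C_L (temperature is assumed)."""
    return n * (2.0 / 3.0) / beta * BOLTZMANN * temperature_k / v_n_rms ** 2


# Input overdrive is 1/3 of the 250-mV current-source overdrive, as in the text.
n = noise_factor(0.25 / 3.0, 0.25, 0.25)
# Relative capacitor units: Cs = 1, Cf = 4 (Acl = 1/4), Ci ~ Cs.
beta = feedback_factor(1.0, 4.0, 1.0)
c_l = min_load_capacitance(45e-6, n, beta)
print(f"N = {n:.2f}, beta = {beta:.3f}, CL,min = {c_l * 1e12:.2f} pF")
```

Under these assumptions the result lands within a few percent of the 8.1 pF quoted above; the exact figure depends on the assumed temperature and on how the 45-µV budget is rounded.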
We ignored the noise contribution from the gain boosters in the above calculation. The reason is that the noise factor associated with "cascode" transistors is usually much smaller than that of the current sources calculated above. However, gain boosters are usually much smaller (less gm) than the main amplifier in order to save power (they only need to drive the gate of the cascode). As a result, gain boosters may contribute a considerable amount of output noise power in a real design. The noise breakdown is shown later in the simulation stage. In this design, about 20%-30% of the output noise power is generated by the gain-boosting circuits.

2.3. Settling Speed and Small-Signal Bandwidth

Referring to Fig. 2, a single-pole frequency response results in a single settling slope of -8.7 dB/τ. Here we assume slewing is not present. The settling accuracy is determined by the transient loop-gain achieved by the amplifier. We need to settle to < 0.05% in 20 ns. With a feedback factor β = 2/3, we obtain

$$ \frac{-8.7\ \mathrm{dB}}{\tau} \cdot 20\ \mathrm{ns} = 20 \cdot \log_{10}(\beta \cdot 0.05\%) \approx -70\ \mathrm{dB} \;\Rightarrow\; \tau = 2.5\ \mathrm{ns}, $$

but

$$ \tau = \frac{1}{\beta \omega_u} \;\Rightarrow\; \omega_u \approx 2\pi \times 95\ \mathrm{MHz}, $$

so the open-loop unity-gain frequency is about 95 MHz. From ωu = gm/CL, we calculate gm = 4.83 mA/V. Finally, using the square-law relationship gm = 2Id/Vdsat with the previous assumption of Vdsat = (250/3) mV for the input pair, we arrive at the biasing current needed, Id = 200 µA.

This looks promising. But with a 200-µA maximum output current delivered into an 8.1-pF load, it is immediately clear that the amplifier is going to slew. So this power budget is too optimistic.

2.4. Slew-rate Limit

Assuming single-time-constant settling, the output voltage can be expressed as

$$ V_o(t) = V_o \left( 1 - e^{-t/\tau} \right), \quad t \ge 0 . $$

Thus the slew rate is

$$ SR = \left. \frac{dV_o(t)}{dt} \right|_{t=0} = \frac{V_o}{\tau} = \frac{I_{o,max}}{C_L}. $$

For this design, the maximum differential output swing is Acl*4V = 1 V (with Acl = 1/4).
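Before continuing, the first-cut numbers of section 2.3, which this slew-rate estimate builds on, can be reproduced with a short Python sketch. The function and key names are illustrative; the inputs are the values quoted in the text (20 ns, 0.05%, β = 2/3, CL = 8.1 pF, input Vdsat = 250/3 mV), and the square-law relation gm = 2Id/Vdsat is assumed throughout.

```python
import math


def first_cut_design(t_settle, accuracy, beta, c_load, vdsat_input):
    """Linear-settling estimate: time constant, bandwidth, gm, bias current, slew rate."""
    db_per_tau = 20.0 * math.log10(math.e)               # about 8.7 dB per time constant
    settle_db = -20.0 * math.log10(beta * accuracy)      # about 70 dB, as in the text
    tau = t_settle * db_per_tau / settle_db
    omega_u = 1.0 / (beta * tau)                         # rad/s
    g_m = omega_u * c_load
    i_d = g_m * vdsat_input / 2.0                        # square law: gm = 2*Id/Vdsat
    return {
        "tau": tau,
        "f_u": omega_u / (2.0 * math.pi),
        "g_m": g_m,
        "i_d": i_d,
        "slew_available": i_d / c_load,                  # V/s with Id as max output current
    }


est = first_cut_design(20e-9, 0.0005, 2.0 / 3.0, 8.1e-12, 0.25 / 3.0)
for name, value in est.items():
    print(f"{name}: {value:.4g}")
```

The sketch returns a time constant near 2.5 ns, a unity-gain frequency near 95 MHz, a gm just under 5 mA/V, and a bias current of about 200 µA, which can supply only about 25 MV/s into the 8.1-pF load.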
So SR = 200 MV/s. With Id = 200 µA, the SR is limited to 25 MV/s. Thus, settling to 0.05% accuracy within 20 ns without slewing requires 8 times the biasing current calculated in section 2.3. This yields a minimum biasing current of 1.6 mA per input transistor, i.e. Id = 3.2 mA for the input diff. pair. The total biasing current for the OTA (excluding biasing and gain boosters) is then 6.4 mA, and the total power consumption is about 20 mW.

The above estimate is pessimistic in that the NO SLEWING presumption is unnecessary. Later, in the section "Slewing is Favorable in High-Speed, Low-Power Design", we recalculate settling with slewing present, and it will be seen that letting slewing dominate the settling yields a low-power design.

2.5. DC Gain

As mentioned before, the settling accuracy is determined by the loop-gain achieved by the amplifier. With β = 2/3, the minimum open-loop DC gain needed is ADC = 70 dB to achieve 0.05% (-66 dB) settling accuracy, assuming the OTA fully settles to this gain within 20 ns. Given process and temperature variations, some margin should be left on this figure. Also, it is generally known that BSIM models tend to overestimate the intrinsic output resistance (ro) of devices when they are biased on the edge of saturation. In low-voltage designs, output transistors very often fall into this biasing range. In summary, a rule of thumb is to allow a 10-20 dB margin for ADC in real designs. We set ADC,min to > 80 dB. With the fully differential cascoded gain-boosting technique, even with minimum channel length transistors used in the main amplifier, we are able to achieve a constant ADC of over 84 dB over process and temperature variations.

3. HIGH-DYNAMIC RANGE, HIGH-SPEED, AND LOW-POWER DESIGN TECHNIQUES

In this section, we present the design techniques and analysis used to achieve low power consumption in high-speed, high-accuracy analog designs.
These concepts are the core of this design and have been verified through theoretical analysis and HSPICE simulations. They are what enable us to achieve a 7.8-mW power consumption under the stringent design specs of this project.

3.1. Minimum Channel Length for Analog Design

Short-channel devices are often shunned by analog designers because of mismatch, lower output resistance, lower intrinsic gain, hot carrier stress, lifetime, etc. By using devices longer than minimum, we give up the gains accumulated over generations of technology scaling, and pay a big price in speed, power, and chip area. So, is this a wise choice of deep-submicron analog design methodology? Before answering this question, we first clarify some common misconceptions.

• Long channel does not eliminate the short-channel effects (SCE) of deep-submicron CMOS devices. It is commonly believed that using a channel length far larger than minimum will eliminate short-channel effects such as mobility degradation, velocity saturation, drain-induced barrier lowering (DIBL), and hot electron effect. However, if a device engineer is asked what the dominant device parameter in CMOS process scaling is, the answer will be the gate oxide thickness (Tox), not the channel length. Without analog options, a deep-submicron process usually offers a uniform thin oxide throughout the wafer. Therefore, simply drawing the transistors longer will not reduce the vertical field, which directly impacts the channel carrier mobility.

• Matching is a function of active gate area, NOT only of the channel length. Conventionally, matching is believed to be a strong function of channel length only. This is a common misconception. What actually determines the matching is the active gate area, not just the length [3]. Pelgrom [3] and Razavi [4] have pointed out the spatial averaging effect of devices with large width but minimum length, which improves matching.
Therefore, using wide minimum-length devices does not undermine matching as long as the gate area is large. At the same time, minimum channel length also favors high-speed design. After all, a good design is one that eliminates the matching requirement at the architecture/algorithm level, not at the device level.

• Scaling channel length to reduce hot carrier effect is ineffective for low-voltage design. High-performance analog circuits usually conduct a DC biasing current at all times (class-A type). This stresses the devices more than in digital circuits, where devices only turn on during transitions. Therefore it is often believed that minimum channel length should not be used for analog circuits. But if we take a look at the device equations, we find that using long-channel devices is not really effective in reducing hot carrier effect. Hot carrier effect is directly related to the maximum horizontal channel electric field (Em), which is expressed as

$$ E_m = \frac{V_{ds} - V_{dsat}}{l'}, \quad \text{where } l' = \sqrt{3\, T_{ox} X_j}. $$

We note that l' is not a function of channel length at all. The only place where L plays a role is Vdsat, which is given by

$$ \frac{1}{V_{dsat}} = \frac{1}{V_{gs} - V_{th}} + \frac{1}{E_{sat} \cdot L}. $$

When devices are biased with small overdrive voltages (as in low-voltage design), a very large L does not help reduce Em. Furthermore, we can always use circuit techniques (such as cascoding) to reduce Vds, which is more direct and far more effective than resorting to longer channel length.

• Lower output resistance. Many circuit techniques can address this problem (gain boosting is one). Using long-channel devices to achieve high DC gain is often fruitless and painful.

To summarize, long-channel devices should NOT be used indiscriminately.
If problems can be solved using circuit techniques, one should not resort to long-channel options, because the speed of the device is inversely proportional to L² (square-law model). Therefore, low-power, high-speed designs definitely do not favor long-channel devices. In this project, the low power consumption achieved is partly attributable to using minimum channel length devices consistently throughout the signal path.

3.2. Slewing is Favorable in High-Speed, Low-Power Design

A common misconception is that slewing hurts settling. This is true in the sense that the so-called "linear settling" or "proportional settling" concept does not work under slewing. However, in a sampled-data system, what actually interests us is the final value of settling, not the settling process. As long as the amplifier achieves its final accuracy within the specified time, why impose the extra constraint that the settling process be purely exponential? On the contrary, a mostly slewing amplifier delivers all its current to the load more often than a non-slewing one, and therefore yields higher power efficiency. Burning a lot of power to avoid slewing is obviously not the optimum solution for low-power applications (the "class-A dilemma").

So how do we design for slewing? The answer is to use the minimum allowable overdrive voltage for the input diff. pair. On one hand, with a fixed biasing current, this introduces more slewing because the input transistor is easily cut off during large transients. On the other hand, the resulting larger gm of the input devices leads to a faster exponential settling after slewing, with a smaller time constant. Combining the two aspects yields an optimum low-power yet high-speed design.

[Figure 3: A simple model of the slewing behavior of an op amp. Panels: linear feedback amplifier model with a limiting diff. pair (limiting voltage VT, maximum output current Io, feedback factor β); waveforms Vi(t), Vx(t), and Vo(t) versus time, with slewing between 0 and ts.]

To further illustrate this point, we present the following calculation.
As shown in Fig. 3, slewing is modeled by the limiting characteristic of the input differential pair of an amplifier [5]. Other than the limiting, the feedback loop is linear with a constant, frequency-independent feedback factor β. A common method to calculate the slewing time is to assume a limiting voltage of the diff. pair, VT, which is defined as Io/gm according to the simplified piece-wise linear model. Io is the maximum output current that can be delivered to the load. Looking at the input of the diff. pair, we observe that the amplifier is slewing (the f/b loop is open) between time 0 and ts, when Vx > VT. The settling time can then be calculated with some simple algebra [5].

Here, however, we propose a simpler method of calculating the slewing time, which observes the amplifier output instead of the input. A slew-rate limited settling will not enter the "linear region" until the output voltage reaches the level where the instantaneous slope of a non-slewing settling with the same time constant (curve (1) in Fig. 4) equals the actual slew rate of the amplifier. t1 is the time at which this occurs on curve (1). This in turn indicates that t2 is the time when the amplifier gets out of slewing (t2 is equivalent to ts in Fig. 3). If we express the exponential settling as

$$ V_o(t) = V_o \left( 1 - e^{-t/\tau} \right), \quad t \ge 0, $$

we obtain

$$ t_1 = \tau \cdot \ln\!\left( \frac{V_o C_L}{I \cdot \tau} \right) \;\Rightarrow\; t_2 = \frac{C_L V_o}{I} - \tau, $$

hence

$$ V_o(t) = V_o \left[ 1 - e^{-\frac{t - (t_2 - t_1)}{\tau}} \right], \quad t \ge t_2 . $$
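The breakpoints t1 and t2 and the shifted exponential above can be checked numerically with a minimal Python sketch of the piece-wise model: a constant-slope ramp up to t2, followed by the time-shifted exponential. The function names and the example values (1-V step, 8.1 pF, 400 µA, 2.5 ns) are hypothetical and chosen only for illustration.

```python
import math


def slewing_breakpoints(v_final, c_load, i_max, tau):
    """Return (t1, t2) from the output-side slewing model; valid only if slewing occurs."""
    ratio = v_final * c_load / (i_max * tau)   # initial linear slope over slew rate
    if ratio <= 1.0:
        raise ValueError("initial slope does not exceed the slew rate: no slewing")
    t1 = tau * math.log(ratio)
    t2 = v_final * c_load / i_max - tau
    return t1, t2


def output_voltage(t, v_final, c_load, i_max, tau):
    """Output for t >= 0: slewing ramp until t2, shifted exponential afterwards."""
    t1, t2 = slewing_breakpoints(v_final, c_load, i_max, tau)
    if t < t2:
        return (i_max / c_load) * t
    return v_final * (1.0 - math.exp(-(t - (t2 - t1)) / tau))


# Hypothetical example values, not taken from the report's final design.
V_FINAL, C_LOAD, I_MAX, TAU = 1.0, 8.1e-12, 400e-6, 2.5e-9
t1, t2 = slewing_breakpoints(V_FINAL, C_LOAD, I_MAX, TAU)
print(f"t1 = {t1 * 1e9:.2f} ns, t2 = {t2 * 1e9:.2f} ns")
```

At t2 the ramp and the shifted exponential agree in both value and slope, which is exactly the condition used to define t1 and t2 in the derivation.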
To settle to an accuracy of 1/A, the settling time with slewing present is

$$ t_{settle} = \left( \frac{V_o C_L}{I} - \tau \right) - \tau \cdot \ln\!\left( \frac{V_o C_L}{I \cdot \tau} \right) + \tau \cdot \ln A . $$

Using the square-law model, we can easily derive the minimum biasing current needed,

$$ I_{d,min} = \frac{V_o C_L}{t_{settle}} \left[ 1 - \frac{V_{gt}}{2 \beta V_o} - \frac{V_{gt}}{2 \beta V_o} \ln\!\left( \frac{2 \beta V_o}{V_{gt}} \right) + \frac{V_{gt}}{2 \beta V_o} \ln A \right], $$

where Vgt = Vgs - Vth.

[Figure 4: Minimum biasing current calculation with slewing present. Left panel: Vo(t) versus time, with t1 and t2 marked; curve (1): linear settling; curve (2): slewing followed by linear settling. Right panel, "Settling Time vs. Slewing": minimum biasing current Id (µA, 0 to 1600) versus Vgs - Vth (mV, 100 to 700); annotation: the curve saturates at low (Vgs - Vth) due to gm saturation in the weak-inversion region.]

The result is plotted in Fig. 4. The plot shows that, with the "help" of slewing, a much lower biasing current is enough to achieve the settling speed (in contrast to a purely exponential settling, which requires a minimum biasing current of about 1.6 mA per transistor as calculated in section 2.4). Note that a few assumptions are made in reaching this conclusion (square law, constant β). In reality, the curve tends to saturate at the lower-current end because gm saturates when the device is pushed towards the weak-inversion region. Also, the larger input capacitance associated with a larger input device reduces the f/b factor and therefore the loop bandwidth (and the loop-gain).

Lastly, we summarize by listing the pros and cons of using a small Vdsat for the input transistors:

• Using a small overdrive, and hence a large device size, results in better matching and lower Flicker noise for the input diff. pair, which is often advantageous. According to [3], we have the variance of the output current mismatch of the input diff.
pair expressed as

$$ \frac{\sigma^2(I_d)}{I_d^2} = \frac{4 \sigma^2(V_{T0})}{V_{gt}^2} + \frac{\sigma^2(\beta)}{\beta^2} \approx \frac{4}{V_{gt}^2} \cdot \frac{A_{VT0}^2}{WL} + \frac{A_\beta^2}{WL}, $$

where AVT0 and Aβ are process-related constants (β in this expression is the device current factor, not the feedback factor). For a fixed tail current Id and device gate length L, the first term in the expression stays constant as we increase W, but the second term diminishes for large W. Therefore we achieve better matching by using a smaller overdrive for the diff. pair.

• The loop gain-bandwidth product is βωu = βgm/CL. A small overdrive leads to a larger bandwidth with a fixed biasing current. But excessive slewing usually results in the cascode transistors being cut off during large transients. This must be watched because it may lead to ringing (which is not observable in AC analysis).

• The noise factor N is a strong function of the Vdsat of the input devices. A smaller Vdsat of the diff. pair produces a smaller N and therefore minimizes the output noise with a fixed CL.

• The 2nd pole frequency moves in and the stability suffers as the diff. pair puts more capacitance on the current folding nodes. The feedback factor β also decreases as a result of larger input parasitics, which reduces speed and loop-gain.

Lacking an analytical model of gm in the operating range close to weak inversion, the optimum sizing of the input transistors is largely determined by HSPICE simulation.

3.3. Pole-Zero Doublet and Slow Settling

Settling time to a certain accuracy is usually critical to the performance of sampled-data systems. The time-domain response of a single-pole amplifier shows a single-time-constant exponential settling behavior without overshoot or ringing. The settling accuracy within a specified time window is therefore easily identified and well predictable. This is the ideal settling behavior a good designer is after (not the one with 60° phase margin, which is usually not a robust design).
Although the gain-boosting technique is able to achieve an extremely large DC gain, it does not come for free. K. Bult and G. Geelen [2] first analyzed the pole-zero doublet and the slow settling behavior inherent to the CMOS gain-boosting architecture. They proposed the following heuristic for optimizing the settling behavior,

$$ \beta \omega_u < \omega_{add} < \omega_{p2}, $$

where βωu is the closed-loop dominant pole frequency, ωu is the open-loop unity-gain frequency, ωadd is the unity-gain frequency of the boosting amplifier, and ωp2 is the 2nd pole frequency of the main amplifier (also the 2nd pole of the local gain-boosting loop).

To better understand the pole-zero doublet and its effect on slow settling behavior, a small-signal frequency-domain analysis of the gain-boosting technique was performed. Owing to the complexity of this analysis, the details are not presented here, but some important conclusions drawn from it are stated below.

1) As shown in Fig. 5, the pole-zero doublet not only introduces a slow settling component, it can also cause overshoot. The settling behavior is categorized into 3 cases depending on the pole/zero frequencies and their interaction with the main amplifier.

2) For typical situations encountered in CMOS gain boosting, the slow settling component always starts from the original gain Ao (it is not possible to obtain an "accurate" doublet). The time constant of the slow settling component is determined by the doublet frequency. The solution proposed by K. Bult and G. Geelen is essentially to push the doublet frequency beyond the closed-loop dominant pole frequency of the amplifier (a "fast" doublet). Therefore, slow settling is not slow anymore!

[Figure 5: Settling behavior of pole-zero doublet and Cc compensation. Left: gain (dB) versus time (s), with gain levels Ao, Aadd, and Atot, for (a) critically damped, (b) under-damped, and (c) over-damped (slow settling) cases. Right: booster (In, Ref, Out) with capacitors Cc and Cp.]

In high-speed designs with deep feedback, βωu is often not much smaller than ωp2.
Therefore, optimizing the unity-gain bandwidth of the gain booster becomes difficult. A technique used by Bult and Geelen [2] is to place a small capacitor Cc (usually a MOS capacitor) at the output of the booster to fine-tune ωadd, as shown in Fig. 5. This technique is implemented in this design. However, the final design should be insensitive to the value of Cc; checking this is an essential part of the verification process. In our design, large variations of Cc are forced in order to monitor their impact on the settling behavior of the OTA.

3.4. Minimizing Feedforward Zero Effect

The capacitive f/b network provides a feedforward path for the input signal to bypass the OTA. Since the f/b is negative, an instantaneous input jump introduces a big spike at the output which, unfortunately, has the opposite polarity to the final output voltage. A straightforward calculation yields the following expression for this spike,

$$ \Delta V_o = V_o \cdot \beta \cdot \frac{C_f}{C_f + C_L}. $$

Since the f/b factor is largely fixed by the closed-loop gain, the only parameter we can control to minimize the initial spike is the ratio CL/Cf. Therefore, after the total load capacitance is determined by the noise calculation, we can partition this capacitance between Cs, Cf, and CL in favor of a smaller spike. But too small a Cs and Cf will also reduce the f/b factor and hence the settling speed. Since this expression does not have a minimum, the optimum value is determined through HSPICE simulation. It is found that with a CL/Cf ratio of > 4:1, the feedforward zero effect is negligible.

4. HSPICE SIMULATION AND VERIFICATION

In this section, we first summarize the performance achieved by the final design and then present the implementation details, the optimization procedures, and the verification of the design for robustness.

4.1. Overall Performance Summary

The following corner conditions are used in verification of the OTA design.
For commercial products, the temperature usually varies from 0°C to 75°C. These temperatures are combined with process variations to derive the worst-case scenarios. For example, the highest temperature gives rise to the highest thermal noise. Combined with the slow process parameters (which run into headroom problems due to a large Vth), it places the toughest test on the dynamic range performance. This is also the corner with the lowest speed. In contrast, the fast process corner combined with low temperature is the most likely to introduce stability problems.

Table I Corner conditions used for design verification

| Process variation | Typical | Slow | Fast |
| Temperature variation | 25°C | 75°C | 0°C |

Table II summarizes the achieved performance of the fully differential gain-boosted OTA. Table III lists the chosen component values. The design specs are met throughout the corner conditions. The worst-case power consumption including bias is 7.81 mW. Table IV summarizes the operating point of every device in the OTA, as generated by HSPICE simulation.

Table II Fully diff. gain-boosted OTA performance summary

| Performance Metric | Typical, 25°C | Slow, 75°C | Fast, 0°C |
| Diff. Output Voltage Swing (peak-to-peak) | 4.5 V | 4.5 V | 4.5 V |
| RMS Output Noise (1 Hz to 100 GHz) | 49.15 µV | 50.14 µV | 46.12 µV |
| RMS Output Noise (w/o gain-boosting) | 40.43 µV | 44.20 µV | 39.26 µV |
| Output Dynamic Range | 90.2 dB | 90.0 dB | 90.8 dB |
| Large-Signal ADC (Vo,pp = 0 V) | 87.4 dB | 87.7 dB | 84.7 dB |
| Large-Signal ADC (Vo,pp = 4.5 V) | 86.4 dB | 85.8 dB | 84.1 dB |
| Small-Signal AV (Vo,pp = 0 V) | 87.4 dB | 87.7 dB | 84.7 dB |
| Small-Signal AV (Vo,pp = 4.5 V) | 79.5 dB | 72.8 dB | 80.3 dB |
| Diff. Loop-Gain (Vo,pp = 0 V) | 82.9 dB | 83.7 dB | 79.9 dB |
| Diff. Loop Phase Margin (Vo,pp = 0 V) | 80.0° | 83.2° | 80.6° |
| Diff. Loop Unity-Gain BW (Vo,pp = 0 V) | 96.0 MHz | 87.5 MHz | 106.1 MHz |
| C.M. Loop-Gain (Vo,pp = 0 V) | 50.5 dB | 50.6 dB | 49.2 dB |
| C.M. Loop Phase Margin (Vo,pp = 0 V) | 88.2° | 88.3° | 88.8° |
| C.M. Loop Unity-Gain BW (Vo,pp = 0 V) | 35.6 MHz | 33.4 MHz | 42.7 MHz |
| Settling Time (Vo = +1 V up to Vo,max) | 19.4 ns | 18.8 ns | 16.3 ns |
| Settling Time (Vo = -1 V down from …) | 18.0 ns | 16.8 ns | 15.4 ns |
| Settling Accuracy | < 0.05% | < 0.05% | < 0.05% |
| Total Power Consumption | 7.79 mW | 7.81 mW | 7.76 mW |

Table III Component values

| Cs | Cf | CL | Ccmfb | Ib,master |
| 0.5 pF | 2 pF | 7 pF | 0.6 pF | 50 µA |

Table IV Transistor biasing condition

| Block | Device Name | Id (A) | gm (A/V) | gm/Id (V^-1) |
| Main OTA | NIP | 400u | 8.18m | 20.45 |
| Main OTA | NIN | 400u | 8.18m | 20.45 |
| Main OTA | NBB | 800u | 4.62m | 5.78 |
| Main OTA | NBC | 800u | 13.09m | 16.36 |
| Main OTA | PBP | 800u | 5.38m | 6.73 |
| Main OTA | PBN | 800u | 5.38m | 6.73 |
| Main OTA | PCP | 400u | 5.72m | 14.3 |
| Main OTA | PCN | 400u | 5.72m | 14.3 |
| Main OTA | NBP | 400u | 2.20m | 5.5 |
| Main OTA | NBN | 400u | 2.20m | 5.5 |
| Main OTA | NCP | 400u | 6.38m | 15.95 |
| Main OTA | NCN | 400u | 6.38m | 15.95 |
| P-side Booster | NIL, NIR | 80u | 1.74m | 21.75 |
| P-side Booster | PBL, PBR | 160u | 737u | 4.61 |
| P-side Booster | PCL, PCR | 80u | 892u | 11.15 |
| P-side Booster | NBL, NBR | 80u | 622u | 7.78 |
| P-side Booster | NCL, NCR | 80u | 638u | 8.0 |
| P-side Booster | NCM | 80u | 1.69m | 21.12 |
| N-side Booster | PIL, PIR | 40u | 618u | 15.45 |
| N-side Booster | PBL, PBR | 40u | 315u | 7.88 |
| N-side Booster | PCL, PCR | 40u | 323u | 8.08 |
| N-side Booster | NBL, NBR | 80u | 365u | 4.56 |
| N-side Booster | NCL, NCR | 40u | 444u | 11.1 |
| N-side Booster | PCM | 40u | 598u | 14.95 |

In Table II, two types of DC gain are distinguished: the large-signal DC gain (ADC) and the small-signal DC gain (AV). As shown in Fig. 6, the large-signal gain ADC is the slope of the straight line that runs through an operating point on the voltage transfer curve (VTC) and the origin. The small-signal gain AV is defined as the local derivative of the VTC. ADC is usually larger than AV because the VTC generally exhibits limiting characteristics. Both gains are simulated. We want to point out that the settling accuracy is dictated by ADC, not AV, as a straightforward calculation shows. It is this important observation that leads to the appropriate (not excessive) sizing of the cascode transistors.

[Figure 6: ADC vs. AV, shown on the voltage transfer curve (Vout versus Vin).]

Also, in simulating the common-mode f/b loop characteristics, we did not connect the OTA input to an AC ground. Instead, we kept the differential f/b network connected and fully functional.
It is believed that this is the most accurate representation of the loop behavior. First, the common-mode loading due to the capacitive f/b network is precisely modeled. Second, due to the finite output resistance of the cascode tail current source, the common-mode response of the diff. f/b loop is not negligible, and it degenerates the loop gain of the common-mode f/b loop. The c.m. open-loop gain shown in the table is about 50 dB instead of 70 dB, the number measured when the diff. f/b loop is disabled.

4.2. Biasing

High-swing cascode biasing is used for the main amplifier and the gain boosters. A lot of effort is invested in this part of the design in order to maximize the output swing while keeping the design robust over process variations. Transistor sizes are kept strictly scaled in all biasing circuits, which minimizes the systematic mismatch error. Wide transistors are broken into fingers in order to make the biasing circuit easily scalable and layout friendly. The available headroom of about 0.4 V on both the PMOS and NMOS sides is partitioned between the current-source and cascode transistors. The partitioning is guided by the following considerations:

• A large Vdsat for the current sources minimizes their noise factor; they are the dominant noise contributors besides the input diff. pair. This is especially true for the P-side, since the two PMOS FETs conduct double the current and therefore have double the noise factor.
• A smaller Vdsat for the cascode transistors increases their gm and therefore the 2nd pole frequency. It can be shown that the 2nd pole is optimized when the cascode contributes half of the total cap. at the cascoding node (assuming square law).
• The optimization is guided by a reasonable headroom of about 100-300 mV per transistor. Excessive scaling in order to maximize output swing is not wise: pushing too far into weak inversion increases parasitic capacitance and therefore costs speed and phase margin.
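The half-capacitance condition for the 2nd pole can be checked numerically. The sketch below is only an illustration under stated assumptions: the cascode carries a fixed bias current, so its square-law gm grows as the square root of its width, while its own capacitance at the cascoding node grows linearly with width and adds to a fixed capacitance from the other devices. The capacitance values and the width grid are hypothetical, not taken from the design.

```python
import math

def second_pole(width, c_fixed, cap_per_width):
    """Relative 2nd-pole frequency gm/C at the cascoding node (arbitrary units).

    Square law at fixed bias current: gm is proportional to sqrt(width).
    The cascode's own capacitance is proportional to width.
    """
    gm = math.sqrt(width)
    c_total = c_fixed + cap_per_width * width
    return gm / c_total

def best_width(c_fixed, cap_per_width, widths):
    """Width on the grid that maximizes the relative 2nd-pole frequency."""
    return max(widths, key=lambda w: second_pole(w, c_fixed, cap_per_width))

# Hypothetical numbers, for illustration only.
C_FIXED = 100.0        # fF contributed by the other devices at the node
CAP_PER_WIDTH = 2.0    # fF per um of cascode width

widths = [0.1 * i for i in range(1, 5001)]  # 0.1 um to 500 um
w_opt = best_width(C_FIXED, CAP_PER_WIDTH, widths)
cascode_cap = CAP_PER_WIDTH * w_opt
cascode_share = cascode_cap / (C_FIXED + cascode_cap)

print(f"optimum width = {w_opt:.1f} um, cascode share of node cap = {cascode_share:.2f}")
```

The same result follows analytically: setting the derivative of sqrt(W)/(C0 + kW) to zero gives kW = C0, so the cascode capacitance equals the fixed capacitance at the optimum. Real devices depart from the square law, so the optimum in a simulated design may shift somewhat.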
In this design, all tail current sources are realized using cascode structures as well. This increases the ro of the current sources and therefore the common-mode rejection ratio. With a folded-cascode structure for both the main amplifier and the boosters, the tail current source biasing can be shared with the high-swing cascode biasing, which saves design effort and improves matching.

Table V Power consumption breakdown

Total Power | Main OTA | P Gain-Booster | N Gain-Booster | Biasing
7.8 mW | 4.8 mW | 1.2 mW | 0.6 mW | 1.2 mW

Table V summarizes the power consumption breakdown of the OTA. A single 50-µA master biasing current is assumed available, and long-channel cascode current mirrors (we do use long-channel devices sometimes) are used to distribute copies of this current into the amplifier and the boosters. It is essential that only currents (not voltages) are distributed to bias the amplifier. This improves the robustness of the biasing network. No attempt is made to excessively minimize the power consumption of the biasing circuitry. Under high-speed operation, due to capacitive coupling, the biasing circuits also require enough bandwidth to settle. Therefore, it is not appropriate to shrink the biasing circuit excessively in order to save power. A rule of thumb is not to use a ratio higher than 5:1 for high-speed design. The master current should be copied as few times as possible in order to reduce mirroring inaccuracy due to channel-length modulation (in fact, we mirror it only once).

4.3. Choice of Architecture

An NMOS-input folded-cascode structure is chosen in order to minimize the virtual-ground parasitics that reduce the f/b factor. Since flicker noise is negligible in this design, a PMOS input would not improve the noise performance. The lower mobility of PMOS FETs leads to about 3 times larger parasitics at the input and also 3 times larger Miller capacitance. Although an NMOS diff. pair is susceptible to body effect in an N-well process and therefore has a worse common-mode rejection ratio, the input devices are large and can be laid out very carefully to minimize mismatch. We conclude that an NMOS input is preferable for high-speed applications.

The PMOS-side gain booster has to accommodate a high (close to VDD) input common-mode voltage, so it is chosen as an NMOS-input folded-cascode structure as well. For the NMOS-side booster, the situation is exactly the other way around, so a PMOS-input folded-cascode structure is chosen. Biasing the boosters is much easier than biasing the main amplifier because they exhibit much smaller output swings. The available headroom is further exploited to minimize the noise factor of the current-source transistors inside the boosters.

4.4. Common-Mode Control

A dynamic switched-capacitor output common-mode control circuit is implemented for its simplicity, ease of design, low power consumption, and linearity. The c.m. control point is set on the N-side for the following reasons:

• Placing the c.m. f/b control on the N-side gives the common-mode f/b loop the minimum overlap with the differential signal path, which goes through the PMOS side.
• The N-side cascode structure has a higher 2nd pole frequency than the P-side, which improves the stability of the common-mode f/b loop.

By using the "UIC" option in transient and AC simulations, we are able to initialize the switched-cap. c.m. f/b network. This is important because an ideal model of the c.m. f/b control is inadequate to verify the actual performance of the circuit. Three cases are compared in transient simulation: the ideal VCVS c.m. f/b, the switched-cap. c.m. f/b, and the switched-cap. c.m. f/b with a 1-pF loop-gain attenuation cap. The c.m. f/b loop is very stable (almost 90° phase margin), and no effect on diff. signal settling is observed.

Fully differential gain boosters are ideal for improving noise performance.
However, they need common-mode control as well. The switched-capacitor c.m. f/b circuit used for the main amplifier is cumbersome to use here because the boosters are much smaller and therefore require only tiny c.m. f/b caps. More troublesome, the switches cannot be made arbitrarily small: with small capacitors, the charge injected by the switches dominates the error of the common-mode control. We therefore use an input-controlled c.m. f/b circuit instead. It is continuous-time and hence not prone to switching errors. Since the boosters consume much less power than the main OTA, burning an extra DC current for the c.m. f/b transistor does not add much to the power budget. The c.m. f/b loop bandwidth is easily controlled by sizing the c.m. f/b transistor; making it the same size as the input diff. pair yields a c.m. f/b loop bandwidth 1/3 of that of the differential loop.

4.5. Optimizing Performance of Gain-Boosting

The pole-zero doublet can introduce slow settling. It is therefore important to optimize the bandwidth of the booster. The unity-gain frequency of the booster cannot be less than βωu, but it cannot be much higher either, for the stability of the gain-boosting loop (the 2nd pole of the main amplifier is also the 2nd pole of the gain-boosting loop). A MOS cap. is implemented as Cc, the fine-tuning cap., in order to track process variation. If even higher settling speed is required, it becomes harder to design with the gain-boosting technique. A pole isolation technique proposed by Y. Chiu [6] can be used in such situations to improve the performance and robustness of the conventional gain-boosting technique.

From the noise perspective, a large booster bandwidth (with small load capacitance) increases its excess noise contribution. Furthermore, since gain boosters are usually small (they only need to drive the cascode gate cap.), the noise contribution from them is enhanced.
Therefore, to design for low power consumption, optimizing the bandwidth of the booster is essential. Fortunately, the Cc technique introduced by K. Bult is useful here. Thanks to the use of minimum channel length for the cascode transistors, the optimization is not challenging, since the 2nd pole is far out in the gigahertz range.

It is also observed (Fig. 9) that the low-frequency noise of the cascode transistor is suppressed by the gain-boosting loop. At higher frequencies, the loop gain drops and the suppression gradually diminishes. This is the main reason that the noise floor of the cascode device shows a "hump" at high frequency. The total integrated noise of the cascode, however, remains largely the same.

In summary, we achieved relatively low output noise from the gain boosters (about 20%-30% of the total output noise power) without burning a lot of power in them. The slow settling component is pushed beyond the dominant pole frequency of the main OTA. With 100% variation of the absolute value of Cc, the settling speed remains the same, and the slow settling component is consistently suppressed.

4.6. Noise Breakdown

In Fig. 8, the output noise floor and the integrated output noise power are plotted using HSPICE. The major noise contributions arise from the input diff. pair, the PMOS current sources, and the NMOS current sources. The cascode devices and gain boosters contribute less, but not negligible, output noise power.

5. CONCLUSION

A fully differential gain-boosted CMOS operational transconductance amplifier is designed and simulated using HSPICE. The low-power, low-voltage, high-speed analog design techniques introduced in this report greatly facilitate optimizing the design of this OTA, both for performance and for robustness. All design specs have been achieved with a low power consumption of 7.8 mW.
The gain-boosting technique is analyzed, and improved performance is observed with a better understanding of pole-zero doublet effects. The design is verified across process corners and temperature variations, and consistent performance is observed. We conclude that this architecture is suitable for high-speed, high-performance OTAs used in on-chip switched-capacitor applications.

6. REFERENCES

[1] K. Bult and G. Geelen, "A fast-settling CMOS op amp with 90-dB DC gain and 116 MHz unity-gain frequency," ISSCC Dig. Tech. Papers, Feb. 1990, pp. 108-109.
[2] K. Bult and G. Geelen, "The CMOS gain-boosting technique," Analog Integrated Circuits and Signal Processing, vol. 1, no. 2, Oct. 1991, pp. 119-135.
[3] M.J.M. Pelgrom, A.C.J. Duinmaijer, and A.P.G. Welbers, "Matching properties of MOS transistors," IEEE Journal of Solid-State Circuits, vol. 24, no. 5, Oct. 1989, pp. 1433-1439.
[4] B. Razavi, "Design of Analog CMOS Integrated Circuits," preview ed., McGraw-Hill, 2000.
[5] B.Y.T. Kamath, R.G. Meyer, and P.R. Gray, "Relationship between frequency response and settling time of operational amplifiers," IEEE Journal of Solid-State Circuits, vol. SC-9, no. 6, Dec. 1974, pp. 347-352.
[6] Y. Chiu, "Improved CMOS Gain-Boosting Scheme using Pole Isolation Technique," US patent pending, application serial No. 09/200,180.

7. APPENDIX. SIMULATION RESULTS

Figure 7 ADC (bottom) vs. AV (top); max. output swing = 4.5 V.
Figure 8 Output noise breakdown. Panels: output noise spectrum density breakdown (Sq-V/Hz vs. frequency, log axis) and integrated output noise breakdown. Traces: input diff. pair, PMOS current source, NMOS current source, P/NMOS cascode, pboost noise, nboost noise. Most of the output noise comes from the input diff. pair and the P/N current sources; the gain-boosting amps contribute about 20-30% of the total output noise power.
Figure 9 Low-freq. noise of cascode transistors is suppressed by the gain-boosting loop (cascode transistor noise, shown for large and small gain-boosting f/b loop b/w).
Figure 10 Effect of gain-boosting loop b/w on noise (gain-boosting f/b loop with large b/w vs. small b/w).
Figure 11 Differential loop-gain and phase margin (unity-gain freq. marked).
Figure 12 Switched-cap. common-mode f/b loop-gain and phase margin (unity-gain freq. marked).
Figure 13 Transient gain (top) and diff. output voltage (down) of up-going settling (slewing, linear settling, error bound, and 20-ns time limit marked).
Figure 14 Voltage waveforms at output and virtual ground of up-going settling.
Figure 15 Transient gain (top) and diff. output voltage (down) of down-going settling (slewing, linear settling, error bound, and 20-ns time limit marked).
Figure 16 Voltage waveforms at output and virtual ground of down-going settling.
Figure 17 Common-mode (down) settling of up-going transition (ideal VCVS cmfb, switched-cap. cmfb, switched-cap. cmfb w/ 1-pF attenuation). C.m. settling has no observable effect on diff. mode settling.
Figure 18 Common-mode (down) settling of down-going transition (ideal VCVS cmfb, switched-cap. cmfb, switched-cap. cmfb w/ 1-pF attenuation). C.m. settling has no observable effect on diff. mode settling.
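The dynamic range and settling numbers behind these plots can be cross-checked by hand. The sketch below is a rough sanity check, not the authors' procedure. It assumes that dynamic range is defined as the rms value of a full-scale sine (Vo,pp divided by 2√2) over the rms output noise, and that linear settling follows a single-pole response whose time constant is set by the differential loop unity-gain bandwidth. The function names are hypothetical; the input values are those of Table II.

```python
import math

def dynamic_range_db(v_out_pp, v_noise_rms):
    """DR in dB: rms of a full-scale sine (Vpp / (2*sqrt(2))) over the rms noise."""
    v_signal_rms = v_out_pp / (2.0 * math.sqrt(2.0))
    return 20.0 * math.log10(v_signal_rms / v_noise_rms)

def linear_settling_time(f_unity_hz, accuracy):
    """Time for a single-pole loop to settle to `accuracy` (slewing excluded)."""
    tau = 1.0 / (2.0 * math.pi * f_unity_hz)
    return tau * math.log(1.0 / accuracy)

# Table II values: 4.5-V peak-to-peak swing and rms output noise per corner.
noise_rms = {"typical": 49.15e-6, "slow": 50.14e-6, "fast": 46.12e-6}
dr = {corner: dynamic_range_db(4.5, vn) for corner, vn in noise_rms.items()}

# Typical corner: 96.0-MHz diff. loop unity-gain BW, 0.05% settling accuracy.
t_linear = linear_settling_time(96.0e6, 0.0005)

for corner, value in dr.items():
    print(f"{corner}: DR = {value:.1f} dB")
print(f"linear settling (typical) = {t_linear * 1e9:.1f} ns")
```

Under these assumptions the computed dynamic range agrees with Table II to within rounding. The linear-settling estimate for the typical corner comes out at roughly 12.6 ns, shorter than the simulated 19.4 ns, which is consistent with part of the transient being spent slewing, as marked in Figs. 13 and 15.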
