A Low-Power Fully-Differential CMOS Operational Transconductance Amplifier for A/D Converters Andrea Gerosa and David A. Sobel ABSTRACT The design of a fully differential CMOS transconductance amplifier is presented in this paper. This amplifier is to be used in the first stage of a 13-bit pipelined A/D converter, and was designed to meet the necessary specifications at a minimum level of power consumption. The amplifier presented below was based on a two-stage design of a telescopic cascode input stage followed by a common-source output gain stage. The amplifier was compensated with a variation of Miller compensation in order to achieve maximum bandwidth. The amplifier designed meets all performance specifications and consumes less than 2 mW of power. 1 INTRODUCTION Pipelined analog-to-digital converters (A/D’s) convert an analog input to a high-resolution digital output (i.e. high number of output bits) through a pipelined cascade of lower-resolution A/D converters, each with a resolution of typically 1 to 1.5 bits. Each pipeline stage calculates the digital output for its input, and then passes on its analog error residue to the next stage for a similar conversion. In order to ease the conversion process for later stages, an inter-stage gain of two is applied to the error residue to keep this signal at a reasonable magnitude. Hence, aside from the actual comparator, pipelined A/D’s also require a circuit that can provide an adding function and accurate gain. Both of these functions can be provided by an operational transconductance amplifier (OTA) connected in a switched-capacitor (SC) gain configuration. It is worth noting two facts about the performance requirements for an OTA within a pipelined A/D converter. First, the circuit specifications are determined by the overall performance metrics of the entire A/D converter. Therefore, pipelined A/D converters with high bit resolutions require more precise analog circuits than those converters with low-bit resolutions. Also, these circuit requirements are relaxed as the signal propagates down the pipeline. Hence, the performance specifications are most rigorous for the OTA in the first pipeline stage of the converter and scaling techniques can be used in subsequent stages of the A/D in order to minimize power dissipation [Cho95]. In this project, we designed a fully-differential OTA to be used as the first stage of a 13-bit pipelined A/D converter. The specifications to be met can be seen in Table 1, and the circuit has to be robust over process variations. Supply Dynamic Range Open-loop gain Settling Time Accuracy Power 3V > 80 dB > 10,000 < 100ns > 0.01% Minimum Table 1: OTA specifications The rest of this paper proceeds as follows: Section 2 includes a discussion of circuit topology considerations for low power. Section 3 outlines the approach we took to meeting the required specifications at a minimum power level. Section 4 presents our two-stage OTA design and the simulation results. Section 5 concludes this paper with a discussion of the practicality of our design and its effectiveness at achieving a low-power solution. 2 TOPOLOGY CONSIDERATIONS FOR LOW POWER An a priori design decision was the choice of a class A amplifier. While this choice does increase power dissipation, the tradeoff of much simpler circuit design was viewed as a net gain. With this decision in mind, the circuit specifications present the following considerations in choosing an OTA topology. 2.1 Transistor network topology With the process parameters given, an open-loop gain of 10,000 is on the order of (gmro)3. Therefore, at least 3 gain devices must be placed within the signal path in order to achieve the necessary gain. The following topologies are candidates for such a specification: • A single-stage “regulated” cascode OTA • A cascaded three-stage common-source OTA • A two-stage folded cascode/common-source OTA • A two-stage source OTA telescopic severely degrading output swing or introducing a complicated dynamic level-shifting scheme [Feldman97]. An OTA with a telescopic first stage would most likely have a PMOS first stage and an NMOS second stage in order to create a more stable topology. Therefore, for a given settling constraint and a fixed compensation technique, an OTA with a PMOS first stage would consume roughly (µn/µp) more power than an OTA with an NMOS first stage. For the process parameters given, this ratio roughly equals 2.5. Therefore, it can be seen from a power perspective, the two topologies are roughly equivalent; the power increase for the folded first-stage demanded by criterion (1) and (2) are roughly balanced by the power increase demanded by criterion (3) for the telescopic first-stage. As there is no clear power advantage to choosing one topology over another, we decided on the OTA with the telescopic cascode as it seemed easier to implement. It is worth noting that two main advantages of the folded first-stage unrelated to power consumption are not relevant in this circuit. Namely, the folded-first stage allows for a greater input common-mode range and a greater output swing. As the required input common-mode range of the circuit is not specified, and common-mode feedback is employed, the commonmode input range of the telescopic first-stage is more than sufficient. Also, the gain of the second stage of the OTA will be high enough such that the limited output range of the telescopic first stage has no effect on the overall output swing of the OTA. cascode/common- The high dynamic range requirement of the circuit is significantly easier to meet with a larger output signal swing. Therefore, the regulated cascode, with its cascoded devices on its output node, was deemed to be a less favorable topology. The three-stage common source cascade presents three main disadvantages to its implementation. First, its nested Miller compensation will narrow-band the system, as each level of compensation tends to reduce the bandwidth by a factor of two. Second, as the gain of the first stage is only on the order (gmro) noise contributions from the second stage will be appreciable. Finally, as this OTA is to be fully differential, a common-mode feedback circuit is necessary for all topologies. It seems feasible that a single CMFB circuit may not be enough to keep all three stages in saturation [Flores96]. The tradeoffs between the other two topologies— folded-cascode/common-source and telescopiccascode/common-source—are more subtle. Before the tradeoffs are considered, we should first introduce the primary characteristics desired for a two-stage OTA for minimum power: [Feldman97] 1) minimum number of current legs, 2) minimum number of devices contributing significant thermal noise, and 3) an all NMOS signal path 2.2 A folded-cascode stage has extra current legs which dissipate more static power than a similar telescopic stage. For a given settling constraint and a fixed compensation technique, this extra current leg will in effect double the power dissipated in the first stage of the folded cascode in comparison to a similar telescopic stage. Therefore, by criterion (1) presented above, a telescopic stage is favorable to a folded stage. A folded-cascode stage also has more devices which contribute significant input-referred thermal noise to the signal. The extra current source transistors comprising the “fold” add directly to the noise factor of the input stage. While the Vdsat’s of these transistor can be increased to minimize the added noise, the extra noise inserted requires a commensurate increase in power dissipation for a fixed dynamic range specification. Therefore, according to criterion (2) listed above, the telescopic stage is preferable to the folded stage. With careful biasing, however, an OTA with a folded-cascode first stage can consist of an all-NMOS signal path while still achieving reasonable output swing [Flores96]. An OTA with a telescopic first stage cannot achieve such a signal path without either Compensation Topology As the OTA topology selected is multi-stage a compensation technique is necessary. As described in appendix C, we sought to design an OTA with 70 degrees phase margin at the unity-gain frequency of the loop configuration. Standard Miller compensation can be employed for a simple solution, but this compensation technique introduces a zero is the right half-plane. In order to increase phase margin, techniques to push that zero out to infinity were explored. Putting a resistor in the feedback path is one possible solution. In order to achieve a reasonably sized resistance without consuming an excessive amount of chip area, transistors operating in triode would be needed to be placed in the compensation feedback path. This can be troublesome in low-voltage topologies, as it can be difficult to maintain a linear resistance in a triode device without a very high gate voltage. Furthermore, the extra devices were deemed an unnecessarily complex addition and alternate solutions were looked into. Unilateral feedback could also be employed, but it has been shown that such a technique not only adds to total static power consumption, but it also can limit slew rate and hence 2 degrade settling time. It was determined that the extra power consumption needed to make the slew rate limitation negligible was too excessive for the lowpower constraints of the design. Instead, a variant of standard Miller compensation was employed where the compensation capacitor is connected to the drain of the input device (rather that the drain of the cascode device). It has been determined that such a technique pushes the right half-plane zero out to infinity while adding another non-dominant pole at the complex conjugate of the primary non-dominant pole [Nakamura95]. As is shown in appendix C, this technique will result in a better phase margin than standard Miller compensation techniques and can lead to a more power-conscious compensation technique. 3 noise level (refer to appendix A for the precise math). Finally it is worth considering that the noise level will be increased by the flicker noise contribution; this expression for flicker noise power is: 2 2 o ,1 / f v The total noise power is the sum of these two contributions. 3.2 Settling Time The specification of settling time requires a settling accuracy of 0.01% within a period of 100ns. This specification can be directly translated into a trade-off between ωu and the vdsat of the input pair, as a greater ωu reduces the linear settling time and a greater vdsat cuts down on the slewing time for a given voltage swing. As shown in appendix B, the settling time can be expressed by the following equation: DESIGN ANALYSIS AND OPTIMIZATION In this section, we present the results from our analysis of the tradeoffs involved in meeting the specifications presented. We will use these results in presenting our device sizing in Section 4. 3.1 K g2 f 1 1 K fp = + fn m2 7 ⋅ ln H f FB Cox W1L1 W7 L7 g m1 1Hz t settle = 2 ωu vi 1 − v dsat 2 f FB v 1 − ln ξ i f FB ⋅ ω u v dsat Dynamic Range The design of high resolution A/D converters sets serious limitations on the allowable noise in the OTA in order to keep the overall system resolution. A typical requirement for this kind of system is a dynamic range (DR) greater than 80 dB. Note that this specification refers only to the amplifier noise and not to noise of the overall system. Therefore the sampling noise from switches, necessarily present in the switched capacitor (SC) circuits, is not considered in the following analysis. DR depends on both the output swing and the output noise power. Therefore a tradeoff between achievable swing and injected noise has to be found. In our design approach, the output swing was fixed to 2.2V (single-ended), considering the power supply value and reliable values for the saturation voltages of output stage devices. As shown in appendix A, the output power for thermal noise is given by: vth2 = 2 KT where vi is the maximum input swing seen, fFB is the feedback factor (slightly greater than 3, due to the parasitic input capacitance of the OTA), and ξ is the required settling accuracy. 3.3 Open-loop Voltage Gain As shown in appendix D, the gain of the telescopiccascode/common-source OTA is on the order of (gmro)3. A more first-order precise calculation of gain, leads to the following expression: ( ) (v Avo = α1α 2 2 λ 3 sat d1 ⋅ vdsat3 ⋅ vdsat9 ) −1 where α1, α2 < 1 to account for the loading effects of the current source loads of the first and second stages, respectively. In order to maximize gain, it is desired to keep the loading factors as close to unity as possible. This is achieved by making the load transistors longer than minimum length. 2 nf 1 , 3 f FB C C 3.4 where CC is the compensation capacitor, nf is the noise factor of the amplifier and fFB is the feedback factor. This relation clearly shows that a lower limit on the size of CC is imposed, in order to limit the noise level. Also, there is a lower limit on the sampling capacitor (CS) even though CS does not appear explicitly in the formula: indeed if CS is too small with respect to the input pair gate capacitance, the 1/f term would be appreciably higher than 3 and would result in a higher Summary of Design Tradeoffs and Optimization As can be seen in the above analysis, each of the OTA specifications present a set of constraints to be met by setting circuit parameters appropriately. These specifications put conflicting sets of constraints on the circuit parameters, and an optimal (i.e. minimal power) solution must be found within the subspace of acceptable circuit parameter sets. This is a difficult process not amenable to simple hand calculations as the constraints cannot be reduced to a closed-form 3 problem. Therefore, in our design process, we used simple hand calculations to set rough limitations on acceptable parameters, and then we developed and used a computer optimization technique to find an approximate optimal solution. This process is described in detail in appendix E. The results of the optimization procedure are summarized in Table 2: all the saturation voltages and currents for the circuit are determined, therefore the OTA dimensioning can be now easily obtained. These values refer to Figure 1 and the expected performances for slow process parameters are reported in Table 3. Vdsat [mV] 0.983 1.5 1.5 0.46 1 150uA 200uA Iss I2 4.1 gm [mS] 160 100 100 320 350 300 M1/M2 M3/M4 M5/M6 M7/M8 M9/M10 M11/M12 between the previewed and the actual performance, which are summarized in the following section. 1.5 pF 0.8 pF Cc Cs Table 2: Results of optimization process 81.5 dB 468,750 72 ns 71° 2.3V Dynamic Range Gain Settling time Phase margin Output Swing (s.e.) Table 3: Theoretical results for slow parameters from optimization parameters V V g0 Vg V + C M 11 1 1 i D D M0 M1 V - Vg C M2 i Vg V + o C 1 1 M 12 C C M3 3 M4 V o C S M9 S M5 Vg 5 Vg 7 M6 M7 M8 M 10 Design modifications One of the parameters deviating significantly from the ideal hand-analysis behavior has been the transconductance of the input stage, which was roughly 20% lower than originally expected. This is due to the fact that our optimization procedure did not account for weak inversion effects; as in any power-conscious design the input devices reside near the transition to the weak inversion region. We accounted for the transconductance sag by increasing the current of the input pair slightly. It is worth noting that—before compensating with an increased bias current—this effect was significantly reducing the unity gain bandwidth, therefore causing the system to be slower than predicted. Another deviation from ideal behavior is represented by variation of λ as the width of the device varies. This phenomenon directly reduces the second stage gain. In the first approach, the length of output devices was increased to enhance their resistance; however the width was also increased in order to keep constant the saturation voltage, and this resulted in a balancing effect on λ, leaving the resistance unchanged. Therefore in order to meet the gain specification in worst-case conditions (fast parameters process) a scaling of saturation voltages was necessary, according to the relations developed in appendix D. As a consequence of this last modification the output swing is decreased; moreover in the ideal calculation the minimum drain to source voltage across the output devices was assumed to be exactly their saturation voltage. In reality the output resistance of the transistors starts to drop before such critical voltage is reached. Therefore the overall gain is greater than 10,000 only for smaller voltage swing than predicted. This drop accounts for a safety margin included in the dynamic range calculation. With this last modification, the OTA is dimensioned and the device size and main parameters are summarized in Table 4. The simulations result, reported in Section 4.4 will show the performance obtained by this design. Figure 1: Two-Stage OTA 4 M1-M2 M3-M4 M5-M6 M7-M8 M9-M10 M11-M12 TWO-STAGE OTA Although the design developed in the previous section meets all the specifications according to hand calculations, the real behavior of the used technology deviates from the simple models used for hand analysis. In fact the more precise models used by SPICE have underlined some important differences Size (µm) 91.6/0.6 213.8/0.6 91.6/0.6 17.6/2.2 33/0.6 608/4.8 vdsat (mV) 160 100 100 320 300 330 Ibias (µA) 80 80 80 80 245 245 gm (mS) 0.821 1.13 Table 4: Final Device Parameters 4 1.02 4.2 Common Mode Feedback 4.3 As shown in Figure 2 dynamic switch capacitor common mode feedback circuit (CMFB) was integrated into the OTA. As the OTA is a two-stage amplifier, the common-mode gain from input to output is a positive value, and therefore CMFB cannot be applied directly from output to input; instead an inversion is needed. The necessary inversion is created by utilizing a differential pair; for instance, as the common mode output goes down, transistor MF1 steals additional bias current from the input pair, which in turn raises the common mode output back to its nominal value. As shown in Figure 3, the CMFB circuit has an open loop gain of 11,000 and unity gain frequency of 9.5 MHz. While this is lower than the unity gain frequency of the OTA, it should be sufficient as long as the common-mode input signal does not have significant signal strength above the CMFB circuit’s unity gain frequency. Bias Network The design of the bias network is a fundamental step in the OTA project, as this part ensures that all devices are working in the expected conditions. The sub-circuits of the OTA which require to be biased are: • second stage active load (M11/M12) • input pair current source (M0) • first stage cascode active load (M5 to M8) • first stage input pair cascode (M3/M4) • CMFB circuit The bias network is shown in Figure 4, and is explained in detail below. V MB1 Vg11 DD MB3 MC7 MC1 MC6 MC3 I CMFBRef MB4 A CMFBRef B MC5 Vg5 MC4 r ef Vg3 C M F B re f A MB5 VCM 0 CMBias Vg7 MC2 Figure 4: Bias Network The second stage active load has been biased using a mirrored version of the reference current, as the gate of Mb1 is directly connected to the gates of M11 and V + M12. The reference current is mirrored in the same M F1 way in the input pair and in all the other bias lags. Note C M F B re f M F2 that non-minimum current and devices have been V - chosen, in order to overcome mismatch in the mirrored currents. Moreover it would be more difficult to C M B ia s overcome current differences using cascoded current mirrors, because the power supply is quite low. The biasing of the first stage active load is particularly delicate: indeed, referring to the OTA schematic, the voltage across the cascode M5-M7 is C M F B re f V C M 0 fixed by the Vgs of M9, which can't be very high. Figure 2: Common-mode Feedback Circuit Moreover, during slew rate settling, M9 is expected to turn off completely. Therefore, the voltage at the gate of M9 is expected to go below threshold while still keeping the first stage cascode well in saturation. For these reasons a high-swing biasing has been chosen to generate Vg5 and Vg7. This part of the biasing network is formed by devices from MC1 to MC9. The current in the second stage is fixed biasing M7/M8, which is fed by a current mirror (MC2). The gate of M5 has to be fixed at the lowest possible voltage, without pushing M7 to the triode region, therefore: V D D o B o B vG 5 = vgs 5 + Vdsat 7 In reality we must account for a safety margin (~100mV), because the output resistance of M7 will start to decrease before the triode threshold. In the adopted circuit the bias voltage vg5 is given by: Figure 3: CMFB open-loop response 5 vg 5 = vgs 3 + vds 4 4.5 Table 6 shows the power breakdown of our circuit as reported by SPICE. The power dissipation of the first and second stage are reasonably low, comprising a little less than 2 mW under all process parameters. MC4 and MC5 are dimensioned in order to make Vds4 equal to the Vd7sat plus the safety margin. The equation regulating the circuit are: ( ) ( ) W L5 V sat = V sat ⋅ d7 d5 W L7 W sat L 5 vds 4 = Vd 5 ⋅ 1 − 1 − W L 4 ( ) ( ) First Stage Second Stage CMFB network Bias network Total of first and second stage Total of entire OTA The circuit has been dimensioned in order to have vds 4 = 1.18 ⋅ Vdsat 7 V =V sat d5 Process Nominal 485 µW 1480 µW 175 µW 1790 µW Fast 490 µW 1490 µW 180 µW 1845 µW 1.94 mW 1.96 mW 1.98 mW 3.82 mW 3.93 mW 4.01 mW The bias network consumes a large amount of power in comparison to the power of the other parts of the circuit. The power is purposefully made relatively large, because large bias currents are used so that matching problems are minimized when mirroring currents. ⇒ vgs 3 = v gs5 Note that this circuit is also insensitive to body effect. Typically, the cascoded input devices are biased indirectly off of the common source of the input pair. In a previous revision, this was attempted, but was deemed unsuccessful. This is because, during slew rate setting, one of the input devices is by definition turned off. Therefore, the common source no longer appears to be a virtual ground, thus causing the bias voltage of the cascode device to deviate from its DC value. This caused a slow-settling transient error, as the changing bias on the cascode effectively modulated the output voltage. Instead, a more robust biasing scheme— utilizing MB4 and MB5—was used in order to create a stable bias voltage on the order of vdsatB4. An added improvement of this biasing scheme is the independence of the bias voltage on VTH variations. The references needed by the CMFB were generated using currents mirrors and other diode connected transistors. 4.4 Slow 480 µW 1460 µW 165 µW 1710 µW Table 6: Power dissipation of two stage design MC3 has the same dimension of M5 and is driving the same current, so sat d3 Power Dissipation 4.6 Open-Loop Gain Simulation Figure 5 to 7 show the DC open-loop gain response of the circuit for all 3 processes. In each figure, the top plot shows the double-ended output response and the bottom plot shows the differential output. In all plots, the x-axis is the input single-ended voltage—the input differential voltage is simply twice that value. As can be observed on all three plots, the DC gain remains above 10,000 between +2.2V to –2,2V differential output. Results The circuit was simulated in SPICE and a summary of results are as follows: DR (dB) Av0 Settling time Slow 80.3 43,700 79 ns Process Nominal 80.4 27,500 75 ns Fast 80.6 17,000 68 ns Figure 5: DC open-loop response, slow parameters Table 5: Performance summary As can be seen above, all specifications were met. In fact, the settling time is well under specification. Part of this is due to the fact that a fair amount of safety margin was included in the original design in order to ensure proper operation. 6 Figure 6: DC open-loop response, nominal parameters Figure 9: AC open-loop response, nominal parameters Figure 7: DC open-loop response, fast parameters Figure 10: AC open-loop response, fast parameters 4.7 Frequency and Step Response Simulation Figure 8 to 10 show the AC open-loop response of the circuit for all 3 processes. As derived in appendix C, the phase margin is f ⋅I ⋅v C φ m = 90 − 2 ⋅ tan −1 FB 1 dsat 9 s I 2 ⋅ v dsat1 C c The plots show that all frequency responses have a phase margin of approximately 70 degrees at fFBωu = ωu/3. The plots show that the unity-gain frequency is slightly higher for the fast process than the slow process. This is to be expected, as the fast process has a higher kp, and also has slightly higher bias currents. Figure 11 shows the double-ended step response used in transient simulations. The output transitions from a +2.2V differential output to a –2.2V differential output, with the input step applied at 350ns. Figure 8: AC open-loop response, slow parameters 7 Figure 11: Typical step response Figures 12 to 14 show the step response of the differential output for each process. In all processes, the transient settles to 0.01% of its final value within 80ns. The transient does display a static settling error, but this is due to parasitic capacitive loading at the input. In order to minimize this static error, gains much greater than 10,000 would be necessary. Figure 13: Step response, nominal parameters Figure 12: Step response, slow parameters Figure 14: Step response, fast parameters 8 4.8 Noise Simulation In order to evaluate the dynamic range performances the output noise density was calculated by simulations. Figures 15 to 17 show the obtained output noise densities for the three process parameters. In Table 7, the corresponding dynamic range and noise factor are reported. The former quantity has been calculated integrating the noise density from 1Hz up to infinity, due to the fact that the presence of switches in the circuit will fold very high frequency noise into the base band. Table 8 reports the noise power contribution from each device. It should be noted that, in all cases, devices other than M1/M2 and M7/M8 contributed a total of less than 0.15% of the total noise density. Figure 17: Noise density, fast parameters Dynamic Range 80.3 dB 80.4 dB 80.6 dB Slow Nominal Fast Noise factor 1.37 1.35 1.34 4.9 Table 7: Dynamic Range and Noise Factor Slow M1/M2 M7/M8 Th 72% 28% 1/f 67% 33% Nominal Th 1/f 73% 71% 27% 29% Fast Th 74% 2%6 Design Summary As shown in the previous sections, the hand analysis of the circuit was supported by a computerbased optimization routine, in order to find the minimum power solution for the design. This analysis was then calibrated and modified in order to compensate for the main discrepancies between the ideal model and the actual behavior of the circuit. This approach has lead to a design solution which can be addressed as minimum power while meeting the specifications required by the specific application of the pipelined A/D converter. As SPICE simulations can present some errors, deviating from real circuit operation, a safety margin is advisable. We can say that the proposed solution satisfies this criterion, with the exception of the DR values. However increasing the DR requires a substantially bigger capacitor; as this implies a increase the currents by the same amount, it is to be avoided. 1/f 77% 23% Table 8: Thermal and Flicker Noise Contributions, by device 5 CONCLUSIONS The OTA presented is a two-stage amplifier. Its first stage is a PMOS telescopic cascode, and the second stage is a common-source gain stage. It is compensated using a variant of standard Miller compensation, where the compensation capacitor is connected to the drain of the input device rather than the drain of the cascode device. The OTA meets all performance specifications with a reasonable safety margin across all process variations, and dissipates slightly less than 2 mW of power. We believe that this OTA is a good design for a two-stage OTA with the required specifications. As discussed in section 2, the use of a telescopic first stage will give roughly the same performance/power tradeoff as a folded first stage, and the compensation technique allows for a more power-conscious design while keeping 70 degrees of phase margin. It is Figure 15: Noise density, slow parameters Figure 16: Noise density, nominal parameters 9 conceivable that a lower power design could be obtained with this topology for the following two reasons. First of all, in our optimization technique, we did not take into account weak inversion effects of the input pair. Therefore, in modifying the design for SPICE verification, the device sizes did not “track” the optimal transconductance/power trade-off as determined by SPICE. Also, our design has a good degree of safety margin included in it. In particular, the settling time for the slow process has a safety margin of roughly 20%. This margin could be reduced by reducing the input pair bias current, and thus the total power could be reduced. Still, due to process and performance variations not modeled by SPICE, we believe that this safety margin makes our design more realizable in real-world applications. The CMRR and PSRR of this circuit will be reasonable, but not exceptional. Both of these measures could have been improved by raising the output resistance of the input current source. Such a modification could be achieved by cascoding the input current source. This was not pursued, as the lowvoltage power supply and telescopic structure of the first stage did not leave enough voltage headroom for such a modification. Due to the high output resolution of the A/D system being designed, it seems as if device matching would present a very difficult problem. Indeed, with 13-bit precision, a single LSB error is only 268 µV. Therefore, common-centroid layout techniques and offset-cancellation circuitry in the SC structure seem a necessity in order to keep overall system resolution. density from an RC circuit; therefore the output power noise can be expressed as: vo2,th = KT f FB = vi2,1 / f K fn g m2 7 1 1 K fp = + , ∆f f C ox W1 L1 W7 L7 g m2 1 where f indicates a frequency and the other parameters have the usual meaning. As the corner frequency of flicker noise will be lower than the dominant pole of the amplifier, this noise is not band-limited by the OTA, therefore the output power is directly the integral of the noise density, multiplied by the squared feedback factor, to refer to the output of the amplifier. So the noise power for flicker noise becomes: 2 2 o ,1 / f v ∆f = 4 KT V DR(dB) = 10 ⋅ log 10 sw 2 ( 2 Vo 2,th 1 + Vo 2,1 / f ) B. SETTLING TIME CALCULATIONS Settling time consists of two components: slew rate settling and linear settling Slew rate limited settling is due to the class A configuration of the OTA and the limited amount of current the first stage of the amplifier can deliver to charge up the compensation capacitor. As derived in lecture, the slew rate is: 2 nf , 3 g m1 where gm1 is the transconductance of the input device and the noise factor, n f = 1 + K fn g m2 7 f H 1 1 K fp = + ⋅ ln 2 f FB Cox W1L1 W7 L7 g m1 1Hz where the upper integration limit fH has to be chosen higher enough than the corner frequency. Finally the total output noise power is given by the sum of the two powers calculated, multiplied by two, to account for the doubled structure of the differential amplifier. The expression for dynamic range then becomes: The total output noise power in a two-stage compensated OTA can be obtained by considering the input referred noise density of the amplifier. For simplicity we consider thermal and flicker noise separately, starting with the former. The input referred thermal noise density, can be expressed as v Cs 3C s + C gs1 where fFB is the feedback factor for the closed loop configuration of the amplifier. In a similar way the input density for the flicker noise can be considered: A. NOISE ANALYSIS CALCULATIONS 2 i ,th 2 nf 1 3 f FB CC Vdsat 1 , is accounting for Vdsat 7 SR = Iss/Cc the contribution to noise of other devices in the amplifier. Approximating the frequency response of the amplifier with a single pole response, where the dominant pole in determined by the compensation capacitance CC, the calculation of noise power can be simply expressed as the known integral of the noise which yields a slewing time of: t slew = 10 2 ωu vi 1 − v dsat 2 f FB where vi is the maximum differential voltage swing seen at the input and fFB is the feedback factor. The maximum voltage swing seen at the input will be 2.2V, as the output has a differential swing of 4.4V (+2.2V to –2.2V). Also, the feedback factor will be slightly greater than 3, due to the input parasitic capacitance of the OTA. The linear settling time is: t linear = − v 1 ln ξ i f FB ⋅ ω u v dsat The compensation technique used is not standard Miller compensation. Instead, it is a variant of this, where the compensation capacitor is attached to the drain of the input device rather than the drain of the cascoded device. The end result of this compensation is to push the right-half plane zero out to infinity and introduce a new non-dominant pole at the complex conjugate of the first non-dominant pole. As the zero is pushed out to infinity, there is no concern over a polezero doublet occurring near the frequency band of concern. This compensation technique increases phase margin slightly over standard Miller compensation as the frequency location of the Miller zero is: which yields a total settling time of: t settle = 2 ωu vi 1 − v dsat 2 f FB v 1 − ln ξ i f FB ⋅ ω u v dsat zero = and the frequency of the new pole is: pole = The above results are based on the assumption that the slew rate is set by the slewing capability of the first stage and is not at all limited by the slewing of the second stage of the OTA. In order for this assumption to be correct, two conditions must hold, namely: (1) (2) g m9 ωu g m1 g m 9 Cc g m1 C s ω u Therefore, for Cc greater than Cs, (which is always the case), the additional pole is at a higher frequency than the old zero. Thus, a net gain in phase margin is obtained. As the OTA has a global feedback network with a feedback factor of roughly 1/3, the phase margin is measured at the location where the open-loop gain is approximately 3. The equation for phase margin becomes: The drain of the cascode devices must be able to completely turn off the NMOS transistor in the second stage. The current supplied by the PMOS load of the second stage is sufficient to supply the necessary current to the slew-limited first stage and simultaneously supply enough current to charge up the load capacitance. f ⋅ω φ m = 90 − 2 ⋅ tan −1 FB u ω p1 Making some simple substitutions: The first condition is met by biasing the NMOS loads of the first stage with a high-swing biasing technique—as described in Section 4.3—to allow the output voltage of the first stage to go below the threshold voltage of the second stage input. The second condition is met by setting the second stage bias current as follows: C + Cs I 2 ≥ I1 c Cc f ⋅I ⋅v C φ m = 90 − 2 ⋅ tan −1 FB 1 dsat 9 s I 2 ⋅ v dsat1 C c This expression for phase margin, with a requirement of 70 degrees, was used in the optimization procedure in order to size the devices correctly. D. DC GAIN CALCULATIONS It can be shown that the DC gain for a two-stage telescopic-cascode/common-source is as follows: where I2 is the bias current of M9 and M10 and I1 is the bias current of M1 and M2. Av 0 = g m1 [(ro1 g m 3 ro 3 ) (ro 5 g m 5 ro 7 )]× g m 9 (ro 9 r011 ) C. PHASE MARGIN CALCULATIONS If the following substitutions are made: g m = 2 I vdsat and ro = 1 λI Dynamic range calculations discussed in appendix A set rough guidelines on the value of Cc, and feedback factor requirements set rough guidelines on the value of Cs. With the parameters set as outlined above, the requirements for 70 degree phase margin can be calculated. the equation for gain can be expressed as follows: ( ) (v Avo = α1α 2 2 λ 11 3 sat d1 ⋅ vdsat3 ⋅ vdsat9 ) −1 where α1, α2 < 1 are attenuation factors due to the parasitic resistance of the current source loads. The above expression shows an inverse relationship between open loop gain and the vdsat’s of the transistors in the signal path. This relationship was used in the optimization procedure to size the signal path transistors in order to meet the required gain specification. It is worth noting that the above expression is only correct to the first order. This is due to the assumption that λ is independent of device width. While this is assumed to be true in first-order transistor models, it does not hold in actual process parameters. Therefore, the above expression was used as a guideline, but a gain safety margin was included in the optimization procedure in order to compensate for λ inaccuracies. requirements, another relation limit is imposed on I2 and this time the lower limit in decreasing as a function of Vd1sat. The two relations are reported below: 2 Vstep 3 3 Vstep CsVdsat 9 I 2 ≥ − − ln ε sat 2 ts Vd1 90° − ϕm t s Vdsat 1 6 tan 2 sat I 2 ≥ Vd1 ωu (CC + Cs ) 1 2 As it can be easily observed, these two constraints let us find an optimum point, where both the inequalities are satisfied and the current is minimum: that value has been assumed for Vd1sat. Once determined Vd1sat, Vd7sat can be fixed in order to satisfy the requirement chosen for the noise factor. Finally the two input cascodes are to be dimensioned. Their saturation voltages have been chosen as low as possible: this criterion leads to an easier biasing, boosts the gain (see appendix D), and helps move the second non-dominant pole introduced by the compensation network to a higher frequency. As mentioned in Section 3, this procedure allowed us to determine all currents and saturation voltages for the OTA. Therefore the transistor dimensioning becomes straightforward. According to the developed math, these values are the minimum power compromise which meets the required specs; in reality currents and capacitor values have been increased to grant a safety margin in the real circuit. E. OPTIMIZATION ROUTINE In the previous appendix, various circuit characteristics were expressed analytically as functions of circuit parameters. The assigned specifications can be used with these relations in order to dimension the amplifier. However some degrees of freedom are left and finding an optimum solution in terms of power dissipation becomes a challenge. For this reason the various relations were combined to find a target functions that was minimized in order to find the minimum power configuration. As a first step the output swing and the noise factor were fixed to some achievable values (respectively 2.2V and 1.5). The value of output swing allows the calculation of the maximum saturation voltages for the output stage and the amplitude of the input step for settling time testing. Moreover fixing these parameters, from the dynamic range expression, it is possible to determine the value for CC. CS was arbitrarily fixed at a value that would not overly degrade the feedback factor. At this point we can choose the saturation voltage of M1/M2 in order to minimize the total current. From appendix B, ISS results a function of ωu and Vd1sat; however ωu can be expressed as a function of Vd1sat; therefore Iss only depends on Vd1sat: I ss ≥ [Cho95] Cho, T. “Low-Power, Low-Voltage Analogto-Digital Conversion Techniques using Pipelined Architectures.” PhD Thesis, University of California, Berkeley. 1995. [Flores96] Flores, A., Daniel, L., “Design of a Minimum Power, Low-Voltage Supply FullyDifferential Transconductance Amplifier for A/D Converters.” EE240 Project Report. University of California, Berkeley. 1996. [Feldman97] Feldman, A. R., “High-Speed, LowPower Sigma-Delta Modulators for RF Baseband Channel Applications.” PhD Dissertation, University of California, Berkeley. 1997. [Nakamura95] Nakamura, K., Carley, L., “An 85mW, 10b, 40 Msample/s CMOS Parallel-Pipelined ADC.” IEEE Journal of Solid-State Circuits, Vol. 30, No. 3, March 1995. 2CC 3 sat 3C sat Vstep Vstep − Vd1 − c Vd 1 ln ε sat 2 ts ts Vd1 This relation is a monotonically increasing function of Vd1sat. Therefore to minimize the current in the first stage we can minimize the saturation voltage of input devices. However the current in the second stage also has to be considered. From phase margin requirements, I2 can be expressed as a function of ISS/Vd1sat, therefore again as a function of only Vd1sat, which in this case is monotonic decreasing. Moreover from slew rate 12