An On Chip Low Skew Optical Clock Receiver by Allan Marn Loy Lum Submitted to the Department of Electrical Engineering and Computer Science in partial fulllment of the requirements for the degree of Master of Engineering in Electrical Engineering at the MASSACHUSETTS INSTITUTE OF TECHNOLOGY August 2001 c Massachusetts Institute of Technology 2001. All rights reserved. Author . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Department of Electrical Engineering and Computer Science August 30, 2001 Certied by. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Duane Boning Associate Professor of Electrical Engineering and Computer Science Thesis Supervisor Accepted by . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Arthur C. Smith Chairman, Department Committee on Graduate Students 2 An On Chip Low Skew Optical Clock Receiver by Allan Marn Loy Lum Submitted to the Department of Electrical Engineering and Computer Science on August 30, 2001, in partial fulllment of the requirements for the degree of Master of Engineering in Electrical Engineering Abstract Clock distribution across a digital chip raises several serious issues in current and future integrated circuit technology. The distribution of clocks with frequencies in the giga-hertz range is dicult because of interconnect parasitics. Clock skew due to variation sources is becoming dicult to control with traditional balanced distribution networks. Optical interconnects are currently being evaluated as a technique to distribute a clock signal throughout a digital chip. Optics provides the potential for very low skew distribution. This thesis presents a design for an optoelectric clock receiver operating at 1 GHz. The design has been simulated successfully and a full circuit layout has been completed. The impact of variation sources from the input signal, process, and the environment have been evaluated. Process variation is found to contribute most to skew. A strategy has been prepared for testing the fabricated chip. The result is a simulation and layout of a fully functional CMOS optical clock receiver in 0.18m technology operating at 1 GHz. Thesis Supervisor: Duane Boning Title: Associate Professor of Electrical Engineering and Computer Science 3 4 Acknowledgments These past ve years here at MIT have been a very good experience for me. I am very fortunate that it is nally culminating with the completion of this thesis. It has been a long ride though, and as a result, there are several people that I would like to thank for their support. I would rst like to thank my thesis advisor, Professor Duane Boning, for taking me on as a student for such a short time frame. He encouraged me along through continual faith and sound advice. I would have never been able to complete this thesis in eight months without his support. I would also like to thank Professor Anantha Chandrakasan for referring me to Professor Boning and for advising me throughout the thesis. Ron Roscoe, thank you for giving me the opportunity to be TA. I have learned many wonderful things from that experience. Bunnie and Shamik, thank you for your friendship and awless technical support throughout my MIT experience. They have both pulled me out of countless dicult situations. Special thanks to Bunnie, whose Cadence tutorials cut a month o of my thesis schedule. I would also like to thank Bginzz, Pedro, Bobby, and all my other friends at ZBT for putting up with my complaints and rants of frustration over the years. I would next like to thank my oce colleagues. Mike and I were working on similar thesis projects and compared their diculty to \trying to teach a monkey how to speak Chinese." Well Mike, the monkey can now say a few words; I leave it up to you to nish the job. Aaron, thank you for your computer and network assistance and for introducing me to the great hot chocolate machine. Joe, Karen, Vikas, Han, Dave and Brian, thank you for all the interesting conversations and for contributing to an enjoyable work environment. I would also like to thank my family for their support throughout the years. Mom, Dad, thank you for shielding me from many problems and allowing me to focus on my interests. Randy, Lyn, Shane, Jimmy, Bernie, Mike, Stephanie, and Laressa, thank 5 you for your encouragement and words of wisdom. This work has been supported in part by MARCO and DARPA under the Interconnect Focus Center program. 6 Contents 1 Introduction 17 2 Design Strategy to Minimize Skew and Eects of Variation 23 1.1 Motivation for On-Chip Opto-Electronic Clock Distribution . . . . . 1.2 Previous Receiver Designs . . . . . . . . . . . . . . . . . . . . . . . . 1.3 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.1 Generalized Topology . . . . . . . 2.2 Variation vs. Low Skew . . . . . 2.3 Design Implications/Challenges . 2.3.1 Photodetector . . . . . . . 2.3.2 Transimpedance Amplier 2.3.3 Voltage Amplication . . . 2.4 Receiver Topology Overview . . . 2.5 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 Circuit Design and Operation 17 19 21 23 24 24 25 27 28 29 30 31 3.1 Operation Amplier Designs . . . . 3.1.1 Two-Stage Opamp . . . . . 3.1.2 Folded Cascode Opamp . . 3.2 Transimpedance Amplier . . . . . 3.3 High Bandwidth Voltage Ampliers 3.3.1 Cascode Architecture . . . . 3.3.2 Feedback Biasing . . . . . . 7 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31 31 33 34 35 36 36 3.4 Output Stage . . . . . . . . . . . . . . . . . . . 3.5 Linear Voltage Regulator . . . . . . . . . . . . . 3.6 Biasing . . . . . . . . . . . . . . . . . . . . . . . 3.6.1 Bandgap Reference . . . . . . . . . . . . 3.6.2 Current Sources and Voltage References 3.6.3 Passive Elements . . . . . . . . . . . . . 3.7 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 Special Design Considerations and Tradeos 4.1 Power Consumption Analysis . . . . . . . . . . . . . . 4.2 Asymmetric Clipping . . . . . . . . . . . . . . . . . . . 4.3 Automatic Gain Control . . . . . . . . . . . . . . . . . 4.3.1 Variable Gain Amplier . . . . . . . . . . . . . 4.3.2 Peak Detector . . . . . . . . . . . . . . . . . . . 4.4 Convergence Issues . . . . . . . . . . . . . . . . . . . . 4.5 Balanced Switching Thresholds . . . . . . . . . . . . . 4.6 Voltage Reference Comparison - Bandgap vs. Self Bias 4.7 Summary . . . . . . . . . . . . . . . . . . . . . . . . . 5 Variation Analysis 5.1 Functionality Due to Input Signal Variation . . . . . 5.1.1 Input Frequency and Photodiode Capacitance 5.1.2 Realistic Photodiode Input Waveform . . . . . 5.1.3 Detailed Photodiode Model . . . . . . . . . . 5.1.4 Input Signal Intensity . . . . . . . . . . . . . 5.2 Process Variation . . . . . . . . . . . . . . . . . . . . 5.2.1 Channel Length Variation . . . . . . . . . . . 5.2.2 Threshold Voltage Variation . . . . . . . . . . 5.3 Environmental Variation . . . . . . . . . . . . . . . . 5.3.1 Power Supply Variation . . . . . . . . . . . . 5.3.2 Temperature Variation . . . . . . . . . . . . . 8 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41 43 44 44 48 49 50 51 51 53 54 54 55 57 58 59 60 61 61 62 63 64 64 66 67 68 69 69 70 5.4 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 Silicon Layout 70 73 6.1 Implementation Overview . . . . . . . . . . . . . . . . . . . . 6.2 Cadence Technology Library . . . . . . . . . . . . . . . . . . . 6.3 Layout Hierarchy . . . . . . . . . . . . . . . . . . . . . . . . . 6.3.1 Folded Cascode Opamp Layout . . . . . . . . . . . . . 6.3.2 Two-Stage Opamp Layout . . . . . . . . . . . . . . . . 6.3.3 Transimpedance Amplier Layout . . . . . . . . . . . . 6.3.4 High Bandwidth Voltage Amplier Layout . . . . . . . 6.3.5 Output Stage Layout . . . . . . . . . . . . . . . . . . . 6.3.6 Linear Voltage Regulator Layout . . . . . . . . . . . . 6.3.7 Bandgap Voltage Reference Layout . . . . . . . . . . . 6.3.8 Full Layout with Biasing . . . . . . . . . . . . . . . . . 6.3.9 Design Verication, Extraction, and Layout Simulation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 Testing Strategy 73 74 75 75 76 78 78 78 79 79 79 82 85 7.1 Functionality and Skew Testing . . . . . . . . . . . . 7.2 Circuit Layout Variants . . . . . . . . . . . . . . . . 7.2.1 Normal Receiver . . . . . . . . . . . . . . . . 7.2.2 Normal Receiver With Isolated Power Supply 7.2.3 External Power Supply . . . . . . . . . . . . . 7.2.4 External Voltage Reference . . . . . . . . . . . 7.2.5 Current Input Signal Injection . . . . . . . . . 7.2.6 Voltage Input Signal Injection . . . . . . . . . 7.2.7 Photodiode Variants . . . . . . . . . . . . . . 7.3 Overall Chip Layout Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8 Conclusion 85 86 86 86 87 87 87 87 87 88 89 8.1 Evaluation of Variation Performance . . . . . . . . . . . . . . . . . . 8.2 Alternative Techniques Not Used . . . . . . . . . . . . . . . . . . . . 9 89 89 8.2.1 Adaptive Biasing . . . . . . . . 8.2.2 Amplifying and Hard Limiting . 8.2.3 Resoanant Tank Ampliers . . . 8.2.4 Phase Locked Loops . . . . . . 8.3 Future Improvements . . . . . . . . . . 8.4 Summary of Contributions . . . . . . . 10 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 90 90 90 90 91 91 List of Figures 1-1 Typical H-Tree Clock Distribution Network [1] . . . . . . . . . . . . . 1-2 Global Optical Clock Network . . . . . . . . . . . . . . . . . . . . . . 18 18 2-1 2-2 2-3 2-4 2-5 Typical Overall Block Diagram . . . . Eect of Light Absorbed in Substrate . Traditional Transimpedance Amplier Linear Power Supply Regulator . . . . Receiver Block Diagram . . . . . . . . . . . . . . . . . . . . . . . 24 26 27 29 30 3-1 3-2 3-3 3-4 3-5 3-6 3-7 3-8 3-9 3-10 3-11 3-12 3-13 3-14 3-15 Two-Stage Opamp . . . . . . . . . . . . . . . . . . . . . . . . . . Folded Cascode Opamp . . . . . . . . . . . . . . . . . . . . . . . . Transimpedance Amplier . . . . . . . . . . . . . . . . . . . . . . Transimpedance Amplier Circuit . . . . . . . . . . . . . . . . . . High Bandwidth Voltage Amplier . . . . . . . . . . . . . . . . . Gain Block Representation for High Bandwidth Voltage Amplier Gain Block Grouping for High Bandwidth Voltage Amplier . . . Feedback Biasing Circuit . . . . . . . . . . . . . . . . . . . . . . . Feedback Biasing Loop At DC . . . . . . . . . . . . . . . . . . . . High Frequency Feedback Biasing Loop Model . . . . . . . . . . . High Bandwidth Voltage Amplier Frequency Response . . . . . . Output Stage . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Voltage Regulator . . . . . . . . . . . . . . . . . . . . . . . . . . . Bandgap Temperature Independence Example . . . . . . . . . . . Bandgap Reference . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32 33 35 35 36 37 37 38 40 40 42 42 43 45 46 11 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3-16 3-17 3-18 3-19 3-20 Bandgap Voltage Reference Circuit . . . . . . . . . . . Parasitic PNP Bipolar Transistor . . . . . . . . . . . . Current Source and Voltage Reference Biasing Circuits Receiver Block Biasing . . . . . . . . . . . . . . . . . . N-Well Resistor . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 47 47 48 48 49 4-1 4-2 4-3 4-4 4-5 4-6 4-7 Power Consumption Tree . . . . . . . . . . . . . . . . . . . . Block Diagram With AGC . . . . . . . . . . . . . . . . . . . Variable Gain Amplier for AGC . . . . . . . . . . . . . . . Peak Detector Circuit . . . . . . . . . . . . . . . . . . . . . Sample Peak Detector Waveform . . . . . . . . . . . . . . . Balanced and Unbalanced Switching Threshold Comparison Self Biasing Voltage Reference . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52 54 55 56 57 58 59 5-1 5-2 5-3 5-4 5-5 5-6 5-7 5-8 5-9 5-10 100MHz Output Waveform With Input Capacitance Variation . . Realistic Photodiode Input Waveform . . . . . . . . . . . . . . . . Output Waveform Due To Realistic Photodiode Input Waveform . Detailed Photodiode Model . . . . . . . . . . . . . . . . . . . . . Output Waveform Using Detailed Photodiode Model . . . . . . . Output Waveform Due to Input Signal Intensity Variation . . . . Output Waveform Due To Channel Length Variation . . . . . . . Output Waveform Due To Threshold Voltage Variation . . . . . . Output Waveform Due To Power Supply Variation . . . . . . . . Output Waveform Due To Temperature Variation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62 63 63 64 65 65 67 68 70 71 6-1 6-2 6-3 6-4 6-5 6-6 Guide to Folded Cascode Opamp Layout . . . . . . . . Folded Cascode Opamp Layout . . . . . . . . . . . . . Guide to Two-Stage Opamp Layout . . . . . . . . . . . Two-Stage Opamp Layout . . . . . . . . . . . . . . . . Two-Stage Opamp (within Bandgap Reference) Layout Guide to Transimpedance Amplier Layout . . . . . . . . . . . . . . . . . . 75 76 76 77 77 78 12 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6-7 6-8 6-9 6-10 6-11 6-12 6-13 6-14 6-15 Transimpedance Amplier Layout . . . . . . . . . Voltage Amplication Stage Layout . . . . . . . . Output Stage Layout . . . . . . . . . . . . . . . . Linear Voltage Regulator Layout . . . . . . . . . Guide to Bandgap Reference Layout . . . . . . . Bandgap Reference Layout . . . . . . . . . . . . . Guide to Full Receiver Layout . . . . . . . . . . . Full Receiver Layout . . . . . . . . . . . . . . . . Layout Extraction Simulation Output Waveform . 13 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79 80 80 80 81 81 82 83 84 14 List of Tables 3.1 Opamp Performance Comparison . . . . . . . . . . . . . . . . . . . . 3.2 Feedback Biasing Loop Poles . . . . . . . . . . . . . . . . . . . . . . . 3.3 Bandgap Variation Analysis . . . . . . . . . . . . . . . . . . . . . . . 34 41 48 4.1 Comparison Between Bandgap and Self Biasing References . . . . . . 60 5.1 5.2 5.3 5.4 5.5 66 67 68 69 70 Duty Cycle Changes Due to Input Signal Intensity . Skew Due to Channel Length Variation . . . . . . . Skew Due to Threshold Voltage Variation . . . . . . Skew Due to Power Supply Variation . . . . . . . . Skew Due to Temperature Variation . . . . . . . . . 15 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16 Chapter 1 Introduction This chapter presents the motivation for on-chip optical clock distribution on a digital chip. Previous receiver designs are analyzed and their limitations are discussed. This thesis attempts to overcome these limitations and create a fully functional optical clock receiver operating at 1GHz. 1.1 Motivation for On-Chip Opto-Electronic Clock Distribution Optics have been widely used in the telecommunications area to transmit high speed data. This concept can be applied to sending high speed data across a small digital chip. Miller [2] presents a historical summary of the transistion of optics from large scale telecommunications networks to small, on-chip devices. In addition to transmitting data through on-chip optics, this idea can also be applied to on-chip clock distribution. Clock distribution is becoming a serious issue in modern digital CMOS chips. Existing schemes for clock distribution such as H-trees and matched delay lines rely on symmetry to achieve minimal skew (dierence in signal arrival times) across all leaf nodes on the chip (Figure 1-1). The skew of a clock distribution network is increased in the presence of process and environmental variations. The individual 17 lines themselves are especially prone to metal width variations. Clock buers are often added to decrease the dependency of skew on individual delay lines. These buers along with the initial clock driver circuitry at the heart of the distribution network account for a signicant percentage of total power dissipation on a digital chip. Figure 1-1: Typical H-Tree Clock Distribution Network [1] The fundamental idea of this project is to use light to distribute the clock signal to multiple receivers across the chip (Figure 1-2). Light can be distributed to multiple areas of the chip with very low skew. If this light can be reliably converted to an electrical clock signal, this method could serve as a good replacement for current clock distribution techniques. Optical Clock Source Waveguides Local Electrical Clock Network Figure 1-2: Global Optical Clock Network In order for this technique to be useful, a circuit must be constructed to convert the light signal to an electrical clock with minimal skew. The thesis presents a design 18 for a circuit that receives a globally distributed optical clock signal through an onchip low skew opto-electrical interface and amplication network. Once the signal is converted to an electrical clock signal, it can be distributed to individual leaf nodes through a local clock distribution network. This thesis evaluates various aspects of the receiver design. These include circuit issues, design tradeos, and receiver susceptibility to variation eects. The receiver is capable of handling a 1GHz clock signal. A previous design [3] has shown that the most signicant variations sources are from the power supply ripple and transistor channel length. The proposed design is engineered to yield consistent results in the face of these and other process and environmental variations. The next part of this chapter will discuss previous work related to optical clock distribution and receivers. Chapter 2 describes the overall strategy of the receiver topology. It also cites specic design choices to reject the eects of variation. Chapter 3 discusses the design and operation of the opto-electrical receiver. All functional blocks in the receiver design will be covered here. Next, Chapter 4 will cover, in detail, the more subtle design issues in the receiver design. Receiver performance with respect to clock skew in the presence of input, environment, and process variation will be evaluated in Chapter 5. The circuit layout and extraction simulation results will be presented in Chapter 6. This thesis does not encompass the fabrication and testing of the receiver. Because of this, Chapter 7 includes a comprehensive testing strategy for the fabricated circuit. Finally, Chapter 8 will discuss overall receiver performance and will present alternative ideas not implemented in this thesis. 1.2 Previous Receiver Designs To this date, there has been little published research on on-chip optical receivers for the purpose of clock distribution. There are many documented optical receiver design techniques for sensor and telecommunications applications. These techniques have fundamental problems in their immediate relation to the on-chip optical clock application. 19 In Alexander's text [4], general high frequency optical receiver designs are presented. Although there are many useful ideas here, the majority of the circuits that are reported cannot be applied to the task of on-chip optical clock distribution. These designs use inductors, BJT's, non silicon photodetectors, and other components that are not available on a standard CMOS digital process. The architecture of these receiver designs are discussed next. However, each has a set of limitations that prevent the design from having useful application to optical clock distribution. Woodward et al. [5] present a simple single ended solution based on inverters. The transimpedance amplier is an inverter with a MOSFET used as a feedback resistor. The voltage amplication stage uses a string of inverters. Tanabe et al. [6] use a dierential structure whose transimpedance amplier is a common source amplier. This design diers from the more conventional common source followed by a source follower stage. By doing this, gain is sacriced while making the circuit more independent to Vth variations. Ingels et al. [7] return to the inverter based transimpedance amplier. Its post amplier consists of a string of inverters whose biasing is set through a complex replica biasing network. This circuit is the basis of Sam's thesis [3]. The receiver designs presented by Woodward [5], Tanabe [6], and Ingels [7] do not consider the eect of variation. Each of their transimpedance ampliers are based on transistors simulating resistors. This raises the tolerance issue that will be covered in Chapter 2. The voltage amplication stages in these circuits are also highly prone to power supply variation. Because these receivers were not designed specically for a clock distribution application, skew induced by process or operational variation is not a substantial concern. The designs are just concerned that the absolute speed of the receiver is not adversely aected with variation and do not consider matching signal arrival times. In the case of on-chip optical clock distribution, however, skew is a fundamental concern and the receiver presented in this thesis has been designed with variation in mind. Two recent publications have examined the feasibility of on-chip optical waveguides rather than focusing on the receiver. Verma et al. [8] attempt to demonstrate 20 an integrated, all silicon based, optical interconnect system working in the GHz range. However, these claims are not supported in the paper. The design is performed using discrete components rather than integrated circuits, consumes high power, and is only shown to work up to 100kHz. Finally, Verma et al. [8] suggests that \the receiver circuit is not expected to cause any problems for the nal development of integrated on-chip optical interconnects." The diculty in achieving robust, high performance optical receivers has been demonstrated by Sam [3]. Mule et al. [9] consider issues in the design of an on-chip optical waveguide network. The system uses a board level vertical cavity surface emitting laser (VCSEL) to feed a clock distribution network. This network is made up of a high fanout, single split junction that connects to an optical waveguide single-split H-tree. Mule demonstrates less than 9ps of maximum skew for a fanout of 256 and a die area of 817mm . The receiver designed in this thesis would perform well at each node of this single-split balanced optical distribution network. A previous thesis at MIT, written by Shiou Lin Sam [3], seeks to demonstrate a functional optical clock receiver. Sam's design faces a variety of problems. First, Sam uses a TSMC 0.35 process whose analog properties are very poor at 1GHz. High gains at 1GHz are impossible to attain and are crucial in typical receiver designs. In addition, Sam's design attempts to use silicon PIN diodes as photodetectors. Silicon photodetectors highly attenuate and distort the clock signal in the light to current conversion. The goal of the second generation receiver proposed in this thesis is to overcome many of the limitations identied by Sam's Work. 2 1.3 Summary Optical clock distribution is a possible alternative to traditional balanced electrical distribution networks. To prove the feasibility of an optical network, a receiver must be designed to have low skew at high clock frequencies with minimal variation dependence. The following chapters present the design and analysis of such a design. 21 22 Chapter 2 Design Strategy to Minimize Skew and Eects of Variation Current optical receiver designs share similar functional blocks. However, traditional implementation of the photodetector, transimpedance amplier, and voltage amplication stage have limitations when applied to optical clock distribution. This chapter will present these limitations along with an overview of the total receiver topology used in this thesis. 2.1 Generalized Topology The optical clock signal is distributed through the chip to each receiver. The receiver must take the optical signal and convert it into a corresponding CMOS compatible rail to rail voltage signal. A standard receiver topology has three main functional parts, as illustrated in Figure 2-1. First, a photodetector is used to convert light into a current signal. Next, this current must be changed into a voltage through a transimpedance conversion. Lastly, the voltage signal must be amplied to a rail to rail square wave in order to be used in a standard digital system. 23 Photodetector Transimpedance Amplifier Voltage Amplifier Optical Clock from Waveguide I in Full Swing Clock Power Supply Independent Voltage Gain High Gain R Figure 2-1: Typical Overall Block Diagram 2.2 Variation vs. Low Skew The receiver is designed to be part of a global clock distribution network. Because this network distributes a clock signal, skew between receivers must be minimized. Skew is the time measurement between misaligned clock signals, considered at the output of each copy of the receiver. One approach to minimizing skew is to accurately predict and reproduce delay in all copies of this circuit. This approach is impossible due to random process and environmental variations across the die. Another approach is to design the circuit to reject most sources of variation. If this is done, skew will remain constant and can be minimized. This is the approach taken in this thesis. 2.3 Design Implications/Challenges The key issue that signicantly constrains the performance of the receiver is the integration with a standard CMOS digital process. The TSMC 0.18 MOSIS logic process is chosen for the test chip. The receiver must transform an optical clock signal to an electrical equivalent while running side by side with other digital logic. This imposes two key restrictions. First, the photodetector must either be made of silicon 24 or be made on a process that can be integrated or bonded to a silicon die. On the electrical side, the maximum speed that the receiver can run at is inherently limited by the speed of the MOSFETs. For example, a proof of concept for the receiver was rst designed using ideal opamps. Replacing these ideal elements with realistic opamps revealed a host of serious issues. These problems are at the beginning of a list of challenges addressed in the receiver design. The photodetector, transimpedance amplier, and voltage amplier will now be examined. 2.3.1 Photodetector The photodetector serves as the interface between the optical and electrical domains. Photodiodes are used in this application as detectors. On a basic level, a photodiode is a PN junction that is reverse biased and then exposed to light. The reverse bias causes an electric eld to form across the depletion region of the diode. Exposure to light will create free electron-hole pairs in the depletion region. These mobile carriers will be swept to the diode contacts by the electric eld. The total carrier movement translates to an electrical current. In the ideal case, a single photon will be enough for a single electron/hole pair to be generated. These mobile carriers should also instantaneously exit the depletion region and contribute to current. This implies that a square wave of light input will result in a square wave of current output. This is not the case in reality. Photoemitters require a minimum light wavelength to modulate an optical signal at 1GHz. At this given wavelength, silicon has a very low absorbtion coecient, and a silicon photodiode must be very thick to absorb all exposed light. Because of this transparency eect, silicon's conversion of light to current is very inecient. Ideally, all light exposed to the photodiode is absorbed in the depletion region. However, in any actual implementation of the photodiode, the substrate sits underneath the device. Light shines onto the photodiode, but not all photons are absorbed in the depletion region. A number of them penetrate into the substrate and create electron-hole pairs there (Figure 2-2). Due to the absence of an electric eld in the substrate, the carriers take a long time to migrate back up to the depletion region. 25 This results in low frequency gain where a square wave of light is converted into a highly distorted square wave. Light From Waveguide metal 1 Mobile carriers generated in depletion region are swept to contacts by high electric field metal 1 Vdd n+ n well Mobile carriers generated in substrate take longer to reach depletion region Depletion Region p substate (biased lower than Vdd) Figure 2-2: Eect of Light Absorbed in Substrate Rooman et al. [10] presented a receiver with a novel photodetector based on spatial modulation. The photodetector consists of a row of photodiode strips. Every other photodiode is covered with metal so that it will not be exposed to light. When light shines on this photodetector, some photons are absorbed in the depletion regions of the uncovered photodiodes. The rest penetrates into the substrate and creates mobile carriers there. These slow carriers now have equal probablity of reaching a covered photodiode as an uncovered photodiode. Therefore, the source of distortion will split evenly between the covered and uncovered devices. Each photodiode creates a current which is then converted to a voltage. The dierence between the voltages produced will be an undistorted square wave because the distortion is common mode and thus cancels out. The spatial modulation technique was not used in this thesis; instead, our focus is on the receiver circuit. In the design in this thesis, the test chip will contain both silicon and GaAs photodiodes. The silicon diodes are diusion to substrate PN junctions. GaAs has a much higher absorption coecient than silicon. These photodiodes will be fabricated on a separate process and then later bonded to the CMOS chip. Many copies of each type are on the test chip to account for defects and fabrication diculties. 26 2.3.2 Transimpedance Amplier The transimpedance amplier (TIA) converts current into a proportional voltage. Intuitively (through Ohm's law) this current to voltage relationship will be highly dependent on a resistance. Therefore, any changes in resistance will alter the transimpedance relationship and will ultimately alter skew. The most basic of transimpedance ampliers is a simple resistor. Input current enters and creates a proportional voltage across the resistor. However, to use this voltage, the non-grounded resistor node must be read. This will introduce some degree of resistive loading and will disturb the transimpedance relationship. Current From Photodetector High Gain To Voltage Amplification Stage R Figure 2-3: Traditional Transimpedance Amplier The method that is most frequently used in building transimpedance ampliers is based on negative feedback (Figure 2-3). The idea relies on an amplier with a high negative gain and high input impedance. A feedback resistor connects the input and output nodes. This conguration results in the following transimpedance ARf . If the gain (A) of the amplier is much greater than 1, relationship: VIout A in = then Vout=Iin will be equal to Rf . In a MOS implementation, the amplier gain is based on a gm Rload product. Therefore, if the transconductance is high and Rload changes, the transimpedance relationship will be preserved. There are several problems associated with this negative feedback transimpedance conversion approach. The rst is that it requires a high amplier gain at signal frequency (1GHz). Using a 0.18 process, the highest gain that can be attained is about 10. In most cases, higher gain can be achieved by cascading multiple stages. However, this is impossible here because each stage adds another dominant pole and with it, 90 of negative phase. When a feedback loop is closed around this amplier, (1+ ) 27 the phase margin will be severely degraded and instability will occur. Aside from the gain issue, the negative feedback TIA approach has a serious problem because it is still dependent on a distinct resistance. In a CMOS process, resistor values commonly may vary up to 20%. This will result in an unpredictable transimpedance relationship. Also, in order to achieve a high transimpedance gain, a very large resistor must be used. Using a eld eect transistor (FET) as a resistor overcomes this last problem. The receiver design in this thesis uses a transimpedance amplier that has no feedback. It therefore does not require a single stage high gain amplier. Furthermore, the TIA relies on a MOSFET small signal output impedance as its principle resistance. This allows for a large transimpedance gain with less susceptibility to variation than a passive resistor. 2.3.3 Voltage Amplication An output stage is required in a typical receiver. This stage will, at the very least, be a rail to rail voltage buer. More often, the stage will also need to provide voltage gain. Sam [3] uses a topology based on CMOS inverters. Although this method fullls the above criteria for an output stage, it introduces large skew in the presence of power supply variation. When Vdd changes, the switching thresholds of the inverters also change. Sam found that a 10% variation in power supply caused 100ps of skew over the overall amplifer. The voltage amplication stage in this thesis is split into two parts. First, high bandwidth cascode ampliers are used to amplify small voltage signals. The signals are amplied to give ample noise margin in the subsequent inverter amplier stages. To account for power supply variation, a linear voltage regulator is constructed to x Vdd at 1.8V (Figure 2-4). This method requires a separate supply higher than Vdd and a voltage reference. Power supply variation skew tests are performed on the inverter with and without power supply regulation. A skew test measures the time dierence between the output 28 Vdd High (3V) vddref + − Vddreg (1.8V) 10nF Figure 2-4: Linear Power Supply Regulator from a 90% Vdd amplier and a 110% Vdd amplier. A normal inverter has 15ps of skew while one with a simple regulated power supply has 2.5ps skew. Power supply regulation therefore provides more than an 80% reduction in skew. 1 2.4 Receiver Topology Overview The overall receiver topology can be seen in Figure 2-5. The optical signal is converted to a current through the backbiased photodiode. Since the photodiode is a single element, it is grouped with the next stage. The current is changed to a small voltage through transimpedance conversion. From this point forward in the circuit, the output voltage from each stage is biased at VDD =2. This ensures proper input signal biasing of the individual stages and helps prevent skew. The method used is feedback biasing, which uses a high gain negative feedback loop to keep a constant DC level while preserving the gain of the amplier at 1GHz. After the transimpedance conversion, the voltage is amplied through a high bandwidth voltage amplier with feedback biasing. Up until this point in the circuit, linear system analysis can be used since the voltage signals are small. Gain and bandwidth are primary concerns in the linear stages. However, this changes when the voltage signal becomes rail to rail after 1 Amplier used in test is a single stage dierential pair. 29 passing through the inverter output stage. Now, the system becomes a digital system where fast transition square waves are the primary goal. O the signal path are a collection of circuits that support receiver operation. First, a linear voltage regulator rejects power supply ripple and provides a consistent Vdd to every other circuit in the receiver. This circuit is critical since the output of every stage is biased at VDD =2. Next, a biasing network provides various voltage references and current sources. The main current reference is constructed with a bandgap reference feeding a MOSFET. Bias stability is a primary concern for maintaining low skew. The bandgap reference proved to be very independent of Vth, L, and temperature variation. Linear Voltage Regulator (Vdd) Light from Waveguides Transimpedance Amplifier (TIA) High Bandwidth Voltage Amplifiers Inverter Amplifiers To Local Clock Distribution Circuit Figure 2-5: Receiver Block Diagram 2.5 Summary The challenges of building a high speed optical clock receiver have been discussed in this chapter. Existing architectures of the photodetector, transimpedance amplier, and voltage amplication stage have limitations that make it dicult to meet the requirements of this application. Instead, a design overview for a new receiver architecture is proposed. The following chapter will detail this receiver's design and operation. 30 Chapter 3 Circuit Design and Operation A complete design documentation is presented in this chapter. First, the two styles of operational ampliers (opamps) are discussed. These opamps will be used in the construction of larger functional blocks. Next, the discussion focuses on the three blocks directly in the signal path (transimpedance amplier, high bandwidth voltage amplication, and output stage). Finally, the linear voltage regulator and the biasing network are described in detail. 3.1 Operation Amplier Designs The opamps described in this section are needed as building blocks in other sections of the receiver. Six opamps are needed in the complete receiver design. Two main opamp topologies are chosen and each has two variants. 3.1.1 Two-Stage Opamp The rst opamp topology chosen is the standard two-stage architecture (Figure 3-1). This opamp consists of a current mirror loaded dierential pair followed by an actively loaded common source amplier. The dierential pair bias current is supplied by a current mirror to ensure high common mode rejection. The two-stage opamp has a system function of H (s) = s ADC s with two ( 1 +1)( 2 +1) 31 Vdd Vout Vin- Ccomp Rcomp Vin-+ Figure 3-1: Two-Stage Opamp dominant poles. One pole is caused by the high impedance node at the output of the dierential pair stage. The second pole is caused by the high load capacitance at the output of the second gain stage. In the frequency domain, each of these poles contributes 90 of negative phase. Because of this, the total phase margin at unity gain crossover is very small. If not corrected, this will contribute to instability when a feedback loop around the opamp is connected. First, a Miller capacitor is inserted across the gate-drain junction. This has the eect of signicantly lowering the dominant pole of the system through the Miller eect while pushing the second pole up in frequecy. However, the addition of this capacitor also brings a right half plane zero into the frequency range of interest. This zero degrades phase margin by providing a magnitude boost while adding negative phase. To compensate for this, a resistor (Rcomp) is added in series with the capacitor (Ccomp) to move the right half plane zero into the left half plane. The receiver uses three two-stage opamps. The rst two are used in the high bandwidth amplier stages as voltage buers. After compensation, these opamps have a gain of 20.8, bandwidth of 12.1MHz, and a phase margin of 90. This opamp is an ideal choice for a buer because its unity feedback closed loop response is at past 1GHz. The last two-stage opamp is used in the bandgap circuit. Its DC characteristics are adjusted to the values relevant to the bandgap reference. 32 3.1.2 Folded Cascode Opamp The folded cascode opamp (Figure 3-2) is chosen for use in the receiver because it can achieve a higher gain and bandwidth than the two-stage opamp [11]. Its input stage is a dierential stage whose current is supplied by a current mirror. The output stage has a cascoded current mirror PMOS structure along with a biased cascode NMOS structure. The cascode conguration on both sides gives the output node very high output impedance. The input stage is coupled directly into the output stage by steering current to the NMOS transistors. Vdd Vout Vin+ VinVbias2 Vbias1 Figure 3-2: Folded Cascode Opamp The system function for the folded cascode opamp is H (s) = sADC . There is only one high impedance node in this circuit because all other nodes see 1=gm. It is located at the output node of the opamp and is caused by the high output resistance of the opamp and the load capacitance. The pole dominates the frequency characteristics of the opamp. This gives the amplier a single dominant pole at the output node. Using this feature, the folded cascode can be built to have a reasonable gain with high bandwidth. Two of these high bandwidth opamps are used in the high bandwidth amplier stages. Each has a limited gain of 5.86, but a bandwidth of 192MHz with 90 of phase margin. However, because of the higher order poles and zeros, the poor unity gain closed loop response of this opamp makes it not t for a unity gain buer. An alternative use for the single high impedance output node is to make an opamp ( 33 +1) with very high gain and phase margin. The linear voltage regulator does not require high bandwidth because it mainly deals with DC signals (Section 3.5). A special folded cascode opamp is designed for this application, featuring a gain of 355, bandwidth of 21.8MHz, and phase margin of 90. Table 3.1 compares all opamps used in the receiver in terms of gain and bandwidth. Topology Circuit Used In Quantity Gain Bandwidth Two Stage High BW Amplier Stage 2 20.8 12.1MHz Two Stage Bandgap Reference 1 14.6 10.5MHz Folded Cascode High BW Amplier Stage 2 5.86 192MHz Folded Cascode Linear Voltage Regulator 1 355 21.8MHz Table 3.1: Opamp Performance Comparison 3.2 Transimpedance Amplier As discussed in Section 2.3.2, the transimpedance amplier of this receiver is not the typical resistor feedback around a high gain amplier. Instead, the open-loop topology of Figure 3-3 is chosen. Note that the transimpedance amplier shares the rst cascode amplier with the high bandwidth voltage amplication stage. Light is converted to current in the photodiode. The photodiode is reverse biased with one end connected to Vdd and the other to a current mirror. The current mirror is constantly biased to the on state using a PMOS device in parallel with the photodiode. The bias current is 54uA and is purposely kept large compared to the photodiode current so that the output impedance of the diode connected NMOS device remains fairly constant. Therefore, the photodiode current rides on top of a larger DC current. Current is converted through voltage as it passes through the diode connected NMOS device. Its eects are amplied further in that the current mirror ratio is 1 to 6. This requires that the DC current be kept at a reasonable size such that the mirrored current in the next stage is not too large. At 1GHz, the transimpedance gain is 1:18kV=A. The transistor level schematic of the TIA is seen in Figure 3-4. 34 Transimpedance Input Stage First Cascode Amplifier Stage Vdd Vref1 Ibias 54uA Iphoto Photodiode Feedback Biasing mpl2 2.28u/.18u mpin 1u/.18u Vref2 mnm1 1u/.18u mnc2 3u/.18u mnm2 6u/.18u Figure 3-3: Transimpedance Amplier 3.3 High Bandwidth Voltage Ampliers Once the signal is converted to voltage, its peak to peak voltage is only about 17mV. This voltage must be amplied to the point when normal inverters can take over and rail the signal. Two key architectures are used in this circuit. Cascode Two-Stage Opamp Vdd Vdd Folded Cascode Opamp Photodiode Figure 3-4: Transimpedance Amplier Circuit 35 vref5 vref6 vref3 bf bt Vout vref2 vref1 Low-Pass Filter Vdd p1 + + Two Stage Opamp − Folded Cascode Opamp − Vref Vout Vbias Vin n2 n1 Figure 3-5: High Bandwidth Voltage Amplier 3.3.1 Cascode Architecture The actual voltage amplication is done through the cascode architecture (Figure 3-5). MOS transistors n1 and p1 form a standard common-source voltage amplier. The lower device, n2, exists to isolate the input node from the output node, thus killing the eect of the Miller capacitance and extending the bandwidth of the amplier. The output impedance of the amplier is also increased by n2, thus raising its gain. The result is a conventional high gain, high bandwidth voltage amplier. 3.3.2 Feedback Biasing The central strategy of this receiver design is to bias the outputs of each amplier at VDD =2. This reduces skew by standardizing the bias points of each amplier and preventing clipping before railing the signal at the inverter stages. The output of the high bandwidth voltage ampliers is set to VDD =2 using a feedback biasing technique. This technique is meant to hold a specic DC bias without interfering with the small signal voltage amplication of the cascode stage. To fully understand the subtleties of feedback biasing, linear analysis must be 36 -A 2 A Vin -A R 1 3 out Vout Figure 3-6: Gain Block Representation for High Bandwidth Voltage Amplier performed. The full high bandwidth voltage amplier (cascode stage with feedback biasing) can be modeled with Figure 3-6. In this model the gain blocks A , A , and A represent the transconductances of various sections of the amplier (Figure 3-7). The output resistance block, Rout , is the parallel combination of small signal output resistances: 1 2 3 Rout = ron jjrop 2 1 Vdd - A2 p1 + + Two Stage Opamp − Folded Cascode Opamp − A3 Vref Vout Vbias Vin n2 n1 - A1 Figure 3-7: Gain Block Grouping for High Bandwidth Voltage Amplier A transfer function for each transconductance gain block must be found. First, the small signal transconductance of the entire amplier is gmn . Because of the cascode conguration, the dominant pole of A is across gate and source of n1. 1 1 37 A = 1 ( gmn1 a1 s+1) A has a similar transfer function because it relates the change from the feedback biasing loop to the output. 2 A = 2 ( gmp1 a2 s+1) Next is the analysis of the feedback biasing loop itself (Figure 3-8). A is the compilation of three smaller transfer functions. 3 Vdd + H 1(s) + R Two Stage Opamp − C − R To next stage H 2(s) Folded Cascode Opamp Vref Figure 3-8: Feedback Biasing Circuit H is the unity gain closed loop transfer function of the two-stage buer opamp (Section 3.1.1). Since it has two stages, there are two dominant poles in the open loop response. 1 H (s) = 1 s 1 s ( 1 +1)( 2 +1) When the loop is closed in unity gain, a single high frequency dominant pole is created. H cl(s) = 1 s 1 s ( 1 + 1 +1)( 2 +1) 1 ( 3 +1) s H is the open loop transfer function of the folded cascode opamp (Section 3.1.2). This opamp has a single dominant pole around 200MHz. 2 H (s) = 2 38 2 s ( 4 +1) H is the transfer function of the RC low pass lter. The small output resistance of the two-stage opamp (rots ) does not aect the time constant here because the resistors themselves are much larger. 3 H (s) = R 3 R RCs 1+ rots + 1+RRCs + = R R R rots )+(R+rots )RCs ( + + A = H cl(s)H H = 3 1 2 3 s 2 s ( 3 +1)( 4 +1)(2+ 1 (2+ RCs) RCs) Now that all the blocks have been found, feedback analysis can proceed. The feedback biasing circuit must do two things: It must bias the DC output voltage at VDD =2. It must not interfere with the high freqency voltage amplication of the cascode amplier. The following sections will discuss these specic tasks. Biasing the DC Output Voltage at VDD =2 The main feature of feedback biasing is that it allows the output bias point to be accurately set. At DC, the cascode amplier has no gain. It can therefore be thought of as a constant current source with high output impedance. The rest of the loop can be seen in Figure 3-9. The frequency components of all the blocks are removed because this analysis is at DC. The forward path gain becomes the product of the folded cascode opamp's DC gain, , and the non-linear square law component of p1. The return feedback path gain is simply the resistor divider ratio, , because the two-stage opamp is connected as a buer. The negative feedback forces the output node to be twice Vref . It also forces the gate to source voltage of p1 to accomodate this output voltage given the DC bias current set up by n1. Vref is set to 0.44V to account for the non-innite forward loop gain. This circuit succeeds in biasing the output voltage of the amplier at VDD =2. 2 1 2 39 - H2 Vout A2 v H1 v H3 v v Σ v + Vref Figure 3-9: Feedback Biasing Loop At DC Non-Interference With High Frequency Voltage Amplication The initial high frequency gain of the cascode amplier must be preserved. Figure 3-10 illustrates how feedback biasing aects the high frequency gain. + Σ - v 1 v v -A R out Vout v Vin A2 A 3 Figure 3-10: High Frequency Feedback Biasing Loop Model At high frequencies, the DC voltage reference is shorted to ground. The overall transfer function from input to output is given by: Vout Vin = ;A ( 1 1 1+ 2 A A3 )Rout = ;agmn s 1 1 +1 1 gmp 1 2 1+ ( s+1)( s+1)( s+1)(2+RCs) 2 3 4 (ron jjrop ) 2 1 Notice the dierence between this overall transfer function and that of a normal cascode amplier. The middle term represents the eect of feedback biasing on the gain of the amplier. In order for the gain to be unaected by feedback biasing, the open loop gain term must be much less than unity at the frequencies of interest (1GHz). Therefore, the feedback biasing open loop response must be designed such that its unity gain crossover occurs before 1GHz. The other serious design issue is loop stability. The feedback loop must be designed to have a very stable (high phase margin) open loop response. If not, a magnitude spike will develop in the frequency response which will give low frequency gain. If this gain spike is high enough, the transient response will oscillate at the spike's frequency. 40 Pole Frequency Location 20MHz 1GHz 200MHz 1.5MHz RC Table 3.2: Feedback Biasing Loop Poles 1 2 2 2 1 1 2 2 3 4 2 Stabilizing this feedback loop is very dicult. Table 3.2 shows the frequencies of the amplier poles. Two main requirements have to be met. First, the open loop frequency response of the feedback biasing has to drop below unity gain before 1GHz. Second, this unity gain crossover has to occur at a point with enough phase margin or the amplier will oscillate. To obtain the required phase margin, dominant pole compensation is used. The obvious choice for a dominant pole is the one used to set the low pass lter. Since the next dominant pole in the loop is at 20MHz, the dominant pole needs to be placed below 1.5MHz to maintain stability. However, this pole is set by an RC time constant. It cannot be set arbitrarily low or else the capacitor and resistor sizes will make silicon fabrication impractical. A pair of 100k resistors and a 2pF capacitor are used in this design to set the time constant. The result is a feedback biasing circuit that does not interfere with the high gain of the cascode amplier. Figure 3-11 shows the frequency response of the two high bandwidth voltage ampliers. Note that the rst one is integrated with the transimpedance amplier (Section 3.2). 3.4 Output Stage After the signal is amplied to 700mV peak to peak, it is put through a series of inverters (Figure 3-12). This method provides additional amplication to the rails, thus turning the signal into a square wave with fast rise and fall times. Each inverter is sized such that its switching threshold (VM ), is exactly at Vdd=2. This helps to preserve the duty ratio of the signal. This fact also makes these inverters 41 Figure 3-11: High Bandwidth Voltage Amplier Frequency Response Vdd mpi1 2u/.18u mpi2 2u/.18u mpi3 4u/.18u mpi4 7u/.18u mpi5 16u/.18u mni1 .5u/.18u mni2 .5u/.18u mni3 1u/.18u mni4 2u/.18u mni5 4u/.18u Figure 3-12: Output Stage very sensitive to power supply variation. That is the main justication for using the regulated power supply. The inverter sizes progressively increase as they get closer to the output. This prevents excessive loading from large capacitive loads placed on the output. The number of these inverters may vary in the actual use of the receiver, and will depend primarily on the local clock distribution network that takes the electrical clock signal from the receiver. The end user can customize the size and number of inverters to a specic application. 42 3.5 Linear Voltage Regulator Power supply regulation is needed to provide a constant Vdd supply to the receiver. Across a digital chip, the power supply voltage can easily vary by 10% due to IR drops and have signicant ripple from the swiching digital logic. A linear voltage regulator topology (Figure 2-4) is chosen because of its simple concept and the availability of a good opamp as presented in Section 3.1.2. The performance of the voltage regulator is based on four factors. First is the presence of a solid Vdd (1.8V) reference which must remain constant despite the presence of variation. The bandgap reference, discussed later in Section 3.6.1, provides a very solid voltage source from which the VDD reference can be derived. Once a good reference is obtained, the opamp is the next thing to consider. The regulator topology relies entirely on negative feedback. A single large NMOS device controls the ow of power. It is connected in a follower conguration with a voltage drop between its gate and source. The negative feedback forces the source of the transistor to the reference voltage in order to minimize the error between the two opamp terminals. A customized folded cascode opamp (Section 3.1.2) is constructed for this application. This opamp has a high DC gain with a low bandwidth. Figure 3-13 is a transistor level circuit diagram of the voltage regulator. Vdd High (3V) Vdd Reg (1.8V) Power FET 10nF vref5 vref6 vddref breg Folded Cascode Opamp Figure 3-13: Voltage Regulator 43 Because the opamp has a low bandwidth, its gain at signal frequencies (i.e., 1GHz) is very low. To compensate for this, a large decoupling capacitor is needed. The capacitor creates a very low cuto low pass lter on the regulated voltage node. However, because the value of the capacitor is so large, it is more feasible to run it o chip instead of trying to fabricate it using silicon. high power The largest drawback of this circuit is that it requires a separate VDD supply. This contributes to added overall power dissipation. The power required by this circuit is equal to the power draw of the opamp (< 1mA) plus the product of high and the total current draw of the receiver and the voltage dierence between VDD regulated . VDD high for proper functionality. The NMOS power Constraints must be placed on VDD high must be kept larger FET is a maximum sized device so its VDsat is minimized. VDD regulated or else the power FET will come out of saturation. V high than VDsat plus VDD SS must also be kept low enough such that the maximum VDS , specied at 1.8V, is not exceeded. 3.6 Biasing The next step after designing the funtional blocks of the receiver is to construct a stable biasing system. The biasing circuits must provide reliable voltage references and current sources across all variation sources. In a sense, robustness to variation begins with stable biasing. There are three parts to the biasing scheme: the bandgap reference, current sources, and voltage references. 3.6.1 Bandgap Reference In order to construct the voltage references and current sources of the biasing network, a single, stable current source is needed. Since every other voltage and current used in the receiver is derived from this source, performance will be severely aected if a drastic change occurs with variation. That is why the bandgap topology is chosen; the bandgap method is known to be an eective strategy for designing against tem44 perature dependence. The circuit is based on the bandgap of silicon and is designed to output the a constant voltage (1.24V) around a specic temperature (300 K). The bandgap's temperature independence is based on adding two voltages with opposite dependencies on temperature. The voltage drop across the base-emitter (PN) junction of a bipolar transistor has a negative temperature dependence (;2mV=C ). However, subtracting the base-emitter drops of two bipolar transistors generates a dierence that is proportional to absolute temperature (PTAT): Vth = kTq VBE = Vth ln IICS VBE = Vth ln IICS VBE = VBE ; VBE = Vth ln IICC 1 1 2 2 1 2 1 2 After adding the two voltages, there exists a specic temperature where the two have equal magnitude and opposite sign. At this point, their temperature dependencies cancel each other out and virtual temperature independence is achieved for a wide range of temperatures (Figure 3-14). Figure 3-14: Bandgap Temperature Independence Example Figure 3-15 shows the functional implementation of the bandgap reference. The negative feedback forces the opamp input nodes to be at equal potential, and the ratio of R to R sets the ratio of currents through m and m . Because of the dierence in current, the base-emitter drops of m and m will not be equal. Their dierence 1 2 1 1 2 45 2 R1 R2 + (bgp) Vbg − (bgn) R3 m1 m2 Figure 3-15: Bandgap Reference (PTAT) will be reected in the voltage drop across R . The PTAT voltage across R is summed with the base-emitter drop of m and the collector currents are set such that the desired reference voltage (1.24V) is formed at the output of the opamp. 3 3 2 VBE :75V VR = Vbg ; VBE = :49V R = VIR = : AV = 61k R = R II = 6:1k k = :: V ;: V = 8:248 R = Rk = 7:0k 1 1 49 8 3 3 2 1 2 3 1 24 0258 2 1 75 ln 10 3 The transistor level circuit diagram of the bandgap is shown in Figure 3-16. The bipolar PNP transistors are made from parasitic diusion-well-substrate PNP transistors (Figure 3-17). Although these transistors have very poor current gains ( 3), they are adequate for use in this circuit. The opamp has a two-stage architecture and is customized to the appropriate DC levels (Section 3.1.1). Besides showing independence to temperature variation, the bandgap also rejects the other major variation sources. The voltage reference itself is not based on any MOSFETS so channel length and threshold voltage variation will only aect the bandgap circuit through the opamp. The same argument applies to power supply 46 Vdd R1 6.1k R3 61k (bgp) Vbg Ccomp Rcomp (bgn) R2 7.1k m1 m2 Figure 3-16: Bandgap Voltage Reference Circuit E C B P+ N+ B E P+ diff C B C N+ P+ N−Well P−Substrate Figure 3-17: Parasitic PNP Bipolar Transistor because the only connection to Vdd comes from the opamp. In addition, the negative feedback of the circuit only requires the gain of the opamp to be large; it need not be constant. Table 3.3 shows how well the bandgap performs with variation of temperature, power supply, channel length, threshold voltage, and resistance. Note that in the resistor test, all three resistors were changed in the same direction; the worst case combination is not represented. Once the bandgap reference is established, it can be connected across the gate and source of a MOSFET to create the single current source that the rest of the biasing scheme is based on. 47 Variation Source % Variation Change % Output Change l +10% 0.25% Vth +10% 0.27% VDD +10% 1.02% Temperature +100% 0.54% Resistance +10% 0.016% Table 3.3: Bandgap Variation Analysis Vdd mpv2 10u/.45u mpv1 10u/.45u mpb1 10u/.8u mpb2 10u/.8u mpb5 20u/.8u mpb6 15u/.8u mpb7 15u/.8u mpb3 10u/.8u mpb4 9.15u/1.2u b1 ref3 (.44V) ref2 (1.56V) b2 mnb5 20u/.8u ref1 (.8V) Vbandgap mnv2 1u/1.5u mnv1 1.45u/.9u mibiasa 1.55u/1.1u mnb4 2.7u/.8u ref4 (1.1V) mnb3 2.7u/.8u ref5 (.9V) breg (200uA) ref6 (.7V) mnb6 3.8u/.8u mnb7 11u/.8u mnb1 10u/.8u mnb2 13.3u/.8u mnb8 20u/.8u b1t (100uA) mnb9 10u/.8u b1f (200uA) mnb10 20u/.8u b2t (100uA) mnb11 10u/.8u b2f (200uA) mnb12 20u/.8u bg1 (100uA) mnb13 10u/.8u Figure 3-18: Current Source and Voltage Reference Biasing Circuits 3.6.2 Current Sources and Voltage References The remaining current sources and voltage references (Figure 3-18) are based on the bandgap reference. Current mirrors reect and scale the current to provide multiple current sources. The voltage references are created by injecting the reected current through diode connected MOSFETS. By adjusting the size of these transistors, the VDS (equal to VGS ) can be changed. The voltage reference is then taken directly from this. Transimpedance Amplifier (TIA) Linear Voltage Regulator (Vdd) High Bandwidth Voltage Amplifiers breg Inverter Amplifiers b2t b2f ref4 ref3 ref5 ref6 Light from Waveguides Bandgap Reference ref1 b1t b1f ref2 ref3 ref5 ref6 bg1 Figure 3-19: Receiver Block Biasing 48 To Local Clock Distribution Circuit Figure 3-19 shows where in the receiver the various current sources and voltage references are used. Nodes bg1, b1t, and b2t connect to 100A current sources for two-stage opamps. Nodes breg, b2t, and b2t connect to 200A current sources for folded cascode opamps. Finally, the various ref nodes are voltage references used in various parts of the receiver. 3.6.3 Passive Elements The receiver design uses many passive elements in addition to transistors. These resistors and capacitors must be fabricated in silicon along side the active elements. A total of ten resistors are needed in this receiver. They are made using the resistance of n-well's (Figure 3-20). To implement the very large (100k ) resistors, multiple smaller resistors are placed in series. Special care must be taken to ensure that the substrate around the well resistor is kept at ground potential. This ensures that the parasitic diode formed by the PN junction remains reverse biased. M1 Oxide M1 Oxide N+ Oxide N+ N−Well P−Substrate Figure 3-20: N-Well Resistor Two types of capacitors are used in this design. Smaller capacitances (< 25fF ) are obtained with M2-M3 parallel plate capacitors. Any capacitance larger than this is made with a MOS capacitor. Here, the capacitance non-linearly varies with the voltage across the capacitor. In this design, the DC voltage across each MOS capacitor is xed to a known voltage. Several decoupling parallel plate and MOS capacitors are included in the layout to reduce power supply noise. 49 3.7 Summary This chapter has presented the full design of the optical clock receiver. Individual modules were considered in detail and their interaction with one another was discussed. The next chapter will summarize the more subtle design issues of this project. 50 Chapter 4 Special Design Considerations and Tradeos A variety of additional special issues have been considered during the design process. Many of them require subtle design decisions that might otherwise be overlooked. This chapter is dedicated to discussing certain key issues and design tradeos. In Section 4.1, the power dissipation of the receiver is analyzed. Section 4.2, the concerns regarding duty cycle variation due to asymmetric clipping are addressed. An automatic gain control approach has been developed, and is described in Section 4.3. Convergence related issues are discussed in Section 4.4. The importance of having balancing switching thresholds is emphasized in Section 4.5. Finally, Section 4.6 presents a comparison between the bandgap reference used in this receiver design and an alternative self bias approach. 4.1 Power Consumption Analysis The receiver consumes a total of 19.27mW of power during steady state operation. Figure 4-1 explains the power breakdown throughout the individual components. Power reduction is not a top priorityin this design; rather, functionality at 1GHz and variation independence are the primary goals. There are two key reasons that this cirucit consumes so much power. First, note 51 Full Optical Clock Receiver 19.27mW Bandgap Reference 0.4785mW Transimpedance Amplifier w/ First Voltage Amplification Stage 2.953mW Cascode Amplifier w/ Input Stage 0.7282mW High Bandwidth Voltage Amplifier 2.518mW Output Stage 0.4955mW Cascode Amplifier 0.2932mW Two Stage Opamp 1.388mW Two Stage Opamp 1.388mW Folded Cascode Opamp 0.8368mW Folded Cascode Opamp 0.8368mW Linear Voltage Regulator 8.5114mW Biasing Network 4.3136mW Folded Cascode Opamp 1.339mW Power FET 7.1724mW Figure 4-1: Power Consumption Tree that until the output stage, this circuit is fully analog. It contains many linear ampliers and opamps which dissipate power statically in their biasing currents. The circuit must be mostly analog in order to obtain the transimpedance conversion and high bandwidth voltage amplication functions. The other major reason that this circuit consumes so much power is the linear voltage regulator. This component gives the receiver full power supply independence but consumes 44% of the total power. The vast majority of this power loss appears across the power FET. The biasing network consumes 22% of the total power. By monitoring the current through each path of the biasing network, it can be optimized to pull less current and dissipate less power. Also, each opamp has redundant current mirrors in their biasing. This provides modularity but adds extra current paths and therefore more power loss. The receiver would consume much less power if the voltage regulator was removed. However, power supply rejection would greatly be aected. Instead, chip area can be traded o with power to reduce consumption. The main power loss is across the power FET and is equal to the voltage dierence of the regulated and unregulated supplies times the current draw of the receiver. Thus, the goal should be to reduce the unregulated supply. The unregulated supply has been chosen to have a value of 3V. 52 It needs to be higher than the regulated voltage plus the VGS drop of the power FET to ensure that the opamp will be able to keep the regulated voltage equal to VDDref through negative feedback. One possible x to this problem is to provide a third power supply line with a value in between the unregulated and regulated voltages. This will be attached to the source of the power FET and will proportionally reduce the power loss. Only the opamp really needs to have a high power supply. Another possible option is to add more power FET's in parallel. This adds more current source capability but more importantly, reduces VGS in the power FET. Therefore, the unregulated power supply can be reduced and with it, power dissipation is reduced. 4.2 Asymmetric Clipping One of the largest sources of skew and duty cycle variation is asymmetric clipping in the high bandwidth voltage ampliers. The core cascode amplier architecture makes the amplier saturate on the ground rail before it saturates on the power rail. This is due to the amplier requiring two VDsat drops from the output node to ground. Once clipped, nonlinearities will be introduced to the signal and will be passed through to later stages. This will ultimately lead to skew and duty cycle variation at the receiver output node. There are a number of techniques that can be used to prevent asymmetric clipping. The rst was to reduce the number of cascode amplier stages from three to two. Once the signal is large enough to pass the inverter thresholds, additional linear amplication is unnecessary. Next, the input signal intensity (Section 5.1.4) must be limited to ensure duty cycle integrity. Finally, automatic gain control can be used to equalize the signal magnitude before passing through the high bandwidth cascode ampliers. While the ultimate decision is to not include automatic gain control in the present receiver, a design is presented in the next section for potential future use. 53 4.3 Automatic Gain Control Automatic gain control was considered in the receiver design. Implementation proved to be dicult so the idea was not used in the nal receiver design. The purpose of the automatic gain control (AGC) stage is to account for amplitude variations. The largest source of amplitude variation comes from the photodiode; however, this stage would also account for gain variations in the cascode stages and prevent asymmetric clipping. Finally, any amplitude variation should be corrected before railing the signal with the inverter ampliers. The AGC consists of three parts: the variable gain amplier (VGA), peak detector, and feedback biased (FBB) load (Figure 4-3). The peak detector stores the peak at the AGC's output. The VGA uses the stored peak value to alter its gain in the input signal. The FBB load, as explained earlier, biases the output signal around a known voltage. Linear Voltage Regulator (Vdd) Light from Waveguides Transimpedance Amplifier (TIA) Automatic Gain Control (AGC) High Bandwidth Voltage Amplifiers Inverter Amplifiers To Local Clock Distribution Circuit Figure 4-2: Block Diagram With AGC 4.3.1 Variable Gain Amplier The VGA (Figure 4-3) uses a feedback loop to keep the input transistor, mn1, in its linear triode region of operation [12]. The negative feedback attempts to keep the two inputs to the opamp, node a and node c, at the same voltage. The opamp sets the gate voltage of mn2 (node b ) such that the VDS of mn1 is xed to the output voltage of the peak detector. Assuming now that mn1 is kept in its triode region, its drain current and transconductance can be expressed by: ID = ( WL )nCox (VGS ; VTn ; VDS )VDS 2 54 Vdd mp1 (d) Feedback Biasing Vout mn2 − (a) Vin mn1 (b) + (c) Peak Detector Figure 4-3: Variable Gain Amplier for AGC D = ( W ) C V gm = dVdIGS L n ox DS Therefore, as long as mn1 is kept in its triode region, its transconductance will be proportional to its VDS . Since mn2 acts as a common gate amplier and has no aect on the ID , the overall gain of the VGA is ;gm (ron jjrop ). 2 1 4.3.2 Peak Detector The peak detector controls the gain of the AGC. An exclusively CMOS topology [13] is chosen to avoid diode non-idealities. If the output voltage amplitude is too high, its sampled peak signal to the VGA must lower. The opposite applies to an output voltage amplitude that is too low. Note the inverse relationship. The output signal is biased at a known positive DC voltage. The peak detector in Figure 4-4 records the low peaks of this signal. The peak detector stores information of the output signal peak on Cstore. The voltage on Cstore is buered once to prevent loading and then again to give the necessary level shift for proper VGA operation. Figure 4-5 shows the ideal operation of the peak detector. Assume rst that the value stored on Cstore is higher than the lower peak of the output signal, so that the 55 Vdd mp1 Cstore (b) (c) Vin (d) Vpeak (a) mn1 Biasing Differential Pair Biasing for mp1 mn2 Current Mirror plus Capacitive Storage Buffer 1 Buffer2 Biasing Figure 4-4: Peak Detector Circuit voltage at Vin is higher than the voltage at node d. Node a is at the inverting output of the dierential pair. In this case, the dierential pair pulls node a down as low as it can go until its current mirror saturates. Because node a is very low, the current mirror in the next stage does not turn on. However, node a also feeds a follower amplier which biases mp1 in the on state. This current leaks down through mn2 and the voltage on Cstore remains unchanged. Assume now that the voltage at Vin is lower than the voltage at node d. Then node a will be forced to a high potential, thus turning on the current mirror at mn1 and biasing mp1 o. The current that ows through mn2 will discharge Cstore to the point where node d is equal to the low peak value. This phase is known as the tracking mode. After the peak is found, the peak detector remains in the hold mode in which the voltage on Cstore remains relatively unchanged. This voltage can change through leakage discharge or a change in the output voltage amplitude. 56 Output Stored Peak Input Signal 1 0.8 0.6 0.4 0.2 0 −0.2 −0.4 −0.6 −0.8 | | | −1 Hold 0 1 2 3 Track 4 | Hold 5 6 7 Figure 4-5: Sample Peak Detector Waveform 4.4 Convergence Issues During the course of this thesis, many Hspice convergence issues were encountered. Convergence is needed to obtain valid circuit simulation results. The models used in the receiver simulation are the TSMC 0.18 analog/mixed signal models. These include complete modeling of parasitic components. Because of their detail, Hspice often does not converge if the models are used at high frequencies (1GHz). This problem was never xed; a simple move to the TSMC 0.18 digital models was made. These models do not include the complex parasitic modeling and converged while maintaining enough accuracy for this thesis. The other reoccurring issue with convergence is observed when complex feedback loops are used. Techniques such as feedback biasing cause Hspice to lose DC convergence and fail at transient simulation. After researching through the Avant! Hspice Manual [14], methods for xing this problem were found. The rst and more ideal way is to use initial conditions for any node that might oat on startup. The other is to add conductance ( 0:5u0) and capacitance ( 1fF ) from every node in the circuit to ground. This helps stabilize the circuit during transient simulation. However, this method could cause problems if the conductance and capacitance interferes with normal circuit operation. The nal receiver design Hspice simulation uses initial 57 conditions and 1fF of capacitance at all nodes to ground. 4.5 Balanced Switching Thresholds Careful sizing of transistors is required to ensure balanced switching thresholds. Balanced switching thresholds are crucial to the design methodology of biasing every amplier's output at VDD =2. Figure 4-6 shows a comparison between balanced and unbalanced swithing thresholds. The rst example has an input signal biased at 0.9V and an amplier with a balanced threshold at 0.9V. The second has the same input signal passing through an amplier with a switching threshold at 0.8V. The DC voltage dierence between the input signal bias and the switching threshold is amplied. The output signal ceases to be centered at 0.9V. Amplification of 3 With Balanced Switching Thresholds Input Output 1.5 1 0.5 0 0 10 20 30 40 50 60 70 80 90 100 Amplification of 3 With Unbalanced Switching Thresholds 1.5 1 0.5 0 Input Output 0 10 20 30 40 50 60 70 80 90 100 Figure 4-6: Balanced and Unbalanced Switching Threshold Comparison There are two crucial circuit blocks that require balanced switching thresholds. In the high bandwidth cascode voltage ampliers, the sizing of the transistors in the 58 signal path is crucial to keeping them in saturation. This must also be done with each inverter amplier to ensure equal switching between the PMOS and NMOS devices. In both these cases, unbalanced switching thresholds result in duty cycle variation and, in extreme cases, a complete loss of functionality. For these reasons, special emphasis is put on the sizing of transistors. 4.6 Voltage Reference Comparison - Bandgap vs. Self Bias One of the keys to variation independence is having a solid central voltage reference (Section 3.6). Originally, the bandgap circuit used in this thesis was seen as impractical because of the bipolar transistors needed to make it work. Instead, a self biasing circuit (Figure 4-7) was used as the central voltage reference for the receiver. Vdd Vdd High (3V) mp1 30u/.45u reset mp2 30u/.45u mrb 1u/.18u mrc 1u/.18u Ibias mn2 30u/.45u mn1 30u/.45u mibias 19.5u/.45u 3k Figure 4-7: Self Biasing Voltage Reference This circuit relies on current mirror PMOS devices to equalize the current in both legs. The voltage drop across the resistor will be balanced with the gate to source voltage of mn1. The current owing through each leg will be that voltage divided by the resistance. 59 This circuit has two stable operating points. One is when the voltage on both sides of the resistor is at ground and there is no drain to source voltage across mp1. At that point, no current ows through either leg and a useful voltage cannot be extracted. The other operating point occurs when the voltage across the resistor is larger than the threshold voltage of mn1. That voltage can be fed into the gate of another NMOS device to create a useful current reference. To make sure the circuit is in this operating point, mrb and mrc are added as reset devices. One of the largest advantages of using this circuit is its power supply independence. If VDD changes, the voltage across the resistor will remain nearly constant because of the high output impedance of the transistors. However, because this circuit relies on matching a VGS with a resistance, it performs poorly when faced with threshold voltage, channel length, and resistance variation. Variation % Var. Change % Bandgap Change % Self Biasing Change l +10% 0.25% 4% Vth +10% 0.27% 5% VDD +10% 1.02% 0.2% Temperature +100% 0.54% 1% Resistance +10% 0.016% 7% Table 4.1: Comparison Between Bandgap and Self Biasing References Table 4.1 compares the performance of the bandgap and self biasing references under variation. The bandgap reference is clearly far superior with respect to all these variations except power supply deviations. This weakness of the bandgap approach is corrected with the addition of the voltage regulator described in Section 3.6.1. 4.7 Summary This chapter has covered the special considerations and decisions that were made in the design of the optical clock receiver. Previous to this, Chapter 3 discussed its full design and operation. Next, in Chapter 5, the performance of the receiver design will be analyzed with respect to variation dependencies. 60 Chapter 5 Variation Analysis If all process and environmental parameters were kept constant over every receiver in the chip, skew between recovered clock signals would not exist. In reality, however, variation in these parameters creates non-zero skew. Variation analysis for the purposes of this thesis focuses on intra (within) die variation and will not include analysis of the waveguides and photoemitters. The rst section will describe how functionality is aected by realistic input signal changes. Sensitivity analysis has been performed on the receiver circuit. Next, the eects of various process and environmental variation is observed. In each, the eects of perturbing the variation source are compared to a nominal, variation free, output. 5.1 Functionality Due to Input Signal Variation The nominal input signal is a 1GHz, 10A peak-to-peak square wave. The photodiode is modeled as an ideal current source in parallel with a 100fF capacitor. Tests on input frequency, photodiode capacitance, realistic photodiode input waveform, realistic photodiode model, and input signal intensity are performed to analyze the non-idealities of the input waveform and photodiode model to ensure functionality. 61 5.1.1 Input Frequency and Photodiode Capacitance The rst test is to determine if the circuit will still function at lower frequencies and larger photodiode capacitance. The circuit must function at lower frequencies in case the test equipment itself cannot operate at 1GHz. Figure 3-11 shows that in the 100MHz band, the linear portion of the circuit actually has more gain. Therefore, when simulated at 100MHz, the receiver circuit will not only function, but the output waveform will have faster rise and fall times. Running the circuit at a slower clock speed might also be a valid application to reduce power consumption across the total chip. Also, the full implementation of this circuit must operate with either Si or GaAs photodiodes. The silicon photodiodes have large amounts of capacitance (1pF) associated with them. The input photodiode capacitance is a limiting factor in the speed of the entire receiver design. This capacitance creates a large dominant pole at the front of the receiver, resulting in an immediate reduction in bandwidth. This will ultimately cause the receiver to not function properly at 1GHz. Figure 5-1: 100MHz Output Waveform With Input Capacitance Variation The circuit is successfully simulated with a 100MHz input signal and the assumed 100fF input capacitance (generally achievable with GaAs photodiodes). The circuit also functions when the photodiode capacitance is increased to 1pF (Figure 5-1), 62 which is a typical capacitance with Si photodiodes. 5.1.2 Realistic Photodiode Input Waveform Section 2.3.1 covered the non-idealities of a real photodiode. Because of the slow carrier low frequency gain, the nominal square wave input current waveform will be distorted to the waveform in Figure 5-2, using the extended equivalent model of Section 2.3.1. 15 current (uA) 10 5 0 −5 0 0.5 1 1.5 time (ns) 2 2.5 3 Figure 5-2: Realistic Photodiode Input Waveform Figure 5-3: Output Waveform Due To Realistic Photodiode Input Waveform 63 When simulated with the more realistic input current waveform at 1GHz, the receiver circuit can still recover a square wave clock (Figure 5-3). The time delay between the waveforms is due to the fact that the realistic current waveform has diminished non-fundamental harmonics. This contributes to non-zero rise and fall times as the input waveform is amplied. 5.1.3 Detailed Photodiode Model The nominal receiver simulation does not include a complete photodiode. Two additional resistances are added in this section to reveal a more detailed photodiode model (Figure 5-4). The Rdark resistor appears in parallel with the current source and capacitor. It represents the non-zero dark current that ows when the photodiode is reverse biased in the absence of light. This is represented by a small signal resistance of 100k . The Rcontact resistor is represents the series resistance of the n+ and p+ highly doped regions and the resistance of the metal-silicon junctions. This resistor is represented with a 50 series resistor. Rcontact Iphoto Cdiode Rdark To Receiver Figure 5-4: Detailed Photodiode Model Figure 5-5 shows the output waveform that results from the detailed photodiode model. No signicant dierence is present when this waveform is compared to the nominal output waveform. 5.1.4 Input Signal Intensity Variation is likely to exist in the intensity of the signal that the waveguide feeds to the photodiode. This type of variation is a serious concern for this receiver circuit, as it causes both skew and change in duty cycle (Figure 5-6). Because of the change 64 Figure 5-5: Output Waveform Using Detailed Photodiode Model of duty cycle, an accurate measurement for skew cannot be made. Instead, Table 5.1 displays the change in duty cycle when the input waveform is changed. Figure 5-6: Output Waveform Due to Input Signal Intensity Variation Certain trends can be observed here. First, the receiver will fail to output a clock signal at 1GHz if the input current is smaller than 3A peak-to-peak. If the input current is too low, the linear ampliers will not be able to raise the peak to peak voltage levels above the inverter noise margins (thresholds). This leads to loss of cycles and an inconsistent output waveform. Second, as the input current is raised, the duty cycle is diminished. There are 65 IinP ;P (uA) Duty Cycle % High 3 48% 10 48% 20 44% 30 40% 40 37% 50 34% Table 5.1: Duty Cycle Changes Due to Input Signal Intensity two reasons for this. The input stage bias current is only 54A. If the input current becomes too high, it will become a signicant fraction of the input current and the small signal approximation will be violated. This will change the operating point and cause non-linearities. To design against this problem, the input stage bias current would need to be increased. However, this will either lead to added power dissipation or lower gain due to the current mirror amplication of the transimpedance amplier (Section 2.3.2). The second reason for the duty cycle change is the cascode architecture of the high bandwidth voltage ampliers. The cascode architecture allows for asymmetric clipping. This will also contribute to non-linearity before the signal is railed in the inverter output stage. 5.2 Process Variation In the previous section, the eects of realistic operating conditions with relation to input signal and photodiode variation have been discussed. In this section, the two dominant variation sources from the processing of the silicon are analyzed. Because it is dicult to design against these variation sources, they are the dominant sources of skew in this receiver design. 66 5.2.1 Channel Length Variation The MOSFET channel length is typically the smallest manufactured dimension on the chip. Because of this, it is especially prone to lithography and etch errors during fabrication. In this test, all devices (both PMOS and NMOS) changed together. Figure 5-7 and Table 5.2 summarize the skew generated from channel length variation. Note that only positive percentages are tested; simulations for -5% and -10% failed because the models are not dened below the minimum device sizes. Figure 5-7: Output Waveform Due To Channel Length Variation % Variation From Nominal Skew +5% +20ps +10% +80ps Table 5.2: Skew Due to Channel Length Variation There are two ways to design against channel length variation. The rst is to make the performance of the circuit dependent on the ratios of channel lengths instead of their absolute values, to take advantage of beter local matching of channel lengths (compared to global or within chip channel length variation). This is done in areas such as current mirrors. However, this design practice has limited application. Another approach is to scale the ( WL ) ratio of sensitive transistors. To rst order, this 67 should have no eect on normal circuit operation while reducing the sensitivity to channel length variation. This design practice has limitations also in that increasing the ratios adds signicant capacitance to adjacent nodes. This leads to a degraded frequency response and in some cases, instability. Both these design methodologies are used as much as possible in the receiver design. 5.2.2 Threshold Voltage Variation Threshold voltage variation is caused by a variety of factors. The most dominant cause of Vth variation is ion implantation and doping level variation. Figure 5-8 and Table 5.3 summarize the skew generated from threshold voltage variation. Figure 5-8: Output Waveform Due To Threshold Voltage Variation % Variation From Nominal Skew +5% -20ps +10% -55ps -5% +20ps -10% +70ps Table 5.3: Skew Due to Threshold Voltage Variation Overall, this is the most dicult variation source to design against. However, there are methods to somewhat reduce its eect on circuit performance. First, nothing in 68 the circuit should be dependent on a DC VGS drop. This eliminates the use of typical source follower buers. Next, in common source type ampliers, the input VGS should be kept much larger than Vth. This will ensure that the transistor is strongly inverted and will reduce the sensitivity to threshold voltage variation. Both of these design methodologies are used, where applicable, in the receiver design, resulting in the sekw results shown in Table 5.3. 5.3 Environmental Variation After the circuit is fabricated in silicon, its performance is aected by certain environmental factors, particularly power supply and temperature variations. However, additional design techniques are used in the receiver design to minimize skew in their presence. 5.3.1 Power Supply Variation Across the chip, the power supply can easily vary up to 10% in either direction. Ripple is often caused by switching logic and IR drops can cause DC shifts in VDD . Sam's variation analysis [3] showed that power supply was the largest form of variation for her particular design. The new proposed receiver design uses a linear voltage regulator to control the power supply that the signal path sees (Section 3.5). Table 5.4 shows that the current receiver design performs remarkably well with respect to VDD variation. % Variation From Nominal Skew +5% -3ps +10% -20ps -5% +0.5ps -10% +24ps Table 5.4: Skew Due to Power Supply Variation 69 Figure 5-9: Output Waveform Due To Power Supply Variation 5.3.2 Temperature Variation It is dicult to predict how much temperature will vary without knowing the surrounding circuitry. Across a chip, hot spots may force certain receivers to perform at temperatures much higher than nominal room temperature. The addition of the bandgap reference signicantly reduces ths receiver design sensitivity to temperature. Table 5.5 demonstrates insensitivity to temperatures up to 200% higher than ambient temperature (25 C ). % Temperature (C) Skew 0 (-100%) -5ps 50 (+100%) +18ps 75 (+200%) +36ps Table 5.5: Skew Due to Temperature Variation 5.4 Summary The newly proposed receiver design performs well under realistic operating conditions and environmental variation. It is still sensitive to process variations such as channel 70 Figure 5-10: Output Waveform Due To Temperature Variation length and threshold voltage; however, these dependencies produce skew within the acceptable limits. The conceptual design of the optical clock receiver has been completed. In the next chapter, the actual layout implementation of the circuit will be discussed. 71 72 Chapter 6 Silicon Layout This chapter presents an overview of the optical receiver circuit layout in silicon. The process of converting the circuit from netlist to a practical layout will also be covered. Hierarchical construction and analysis is used to reduce the overall circuit into several managable blocks. Finally, the circuit blocks are pieced together to form the total layout of the circuit. This layout is extracted and simulated to verify functionality. 6.1 Implementation Overview The inital design and functionality verication is done using Hspice, as described in Chapters 3 and 4. The layout is constructed in Cadence. Several steps need to be taken to convert the Spice deck into a Cadence layout. First, a Cadence schematic representation of the receiver circuit is needed. The Spice netlist must be manually entered into a new schematic view in Cadence. This schematic should then be simulated in the analog environment of Cadence. However, this step was skipped and as a result, simple connectivity problems were later found. Once the schematic is entered into Cadence, a primitive layout can be generated from the schematic source. This layout contains the transistors and passive devices of proper size and shows connectivity between nodes. The next step is to place and route all elements in the circuit. After the layout is completed, the design rule checker must be run to catch viola73 tions. Next, the layout vs. schematic checker can be run to check node connections. This last step was skipped also due to problems with certain parameterized transistor denitions. This will be discussed in Section 6.2. The circuit layout is then extracted with parasitics. The Analog Artist tool is used to generate a partial, but sucient, netlist, and this netlist is then taken and formatted to be compatible with Hspice. A full Hspice simulation can then be run to verify layout functionality. 6.2 Cadence Technology Library The receiver layout complies with the TSMC 0:18 MOSIS design rules. However, the library les obtained from MOSIS for this work were not complete and needed to be altered. The cmosp18 and analoglib (generic analog components) were the main libraries available. The cmosp18 library contained the symbol and simulation information for all devices (active and passive). The layout for each device needed to be copied from the analoglib library and linked to the symbol through a new schematic. After this was done, the device was ready to be used in Cadence. Two additional modications have been made to complete the cmosp18 library. First, the layouts for capacitors and resistors have been altered. The capacitor layout view has been changed into an M 2 ; M 3 structure to allow for stacking above active devices and therefore, area reduction. The resistor has been changed from a poly resistor into an N-well resistor to yield larger resistors. In addition to altering the passive devices, the active devices have also been changed to accommodate transistor ngering. The ngering of MOSFET's is a technique used to break large devices into smaller ones. The smaller devices are placed side by side and share sources and drains in an attempt to reduce parasitic capacitance. However, Cadence will not support a simple placement of transistors directly next to each other. Therefore, a special parameterized layout cell is needed to allow transistor ngering. Fixed nger, four gate, NMOS and PMOS devices have been constructed. Although these devices performed properly after extraction, Cadence 74 still views each as four devices. Because of this, the layout vs. schematic check fails. Instead of xing this immediate problem, a simulation of the layout extraction was used to verify circuit connectivity and functionality (Section 6.3.9). The nal addition to the cmosp18 library is a parasitic bipolar PNP transistor. No layout view exists within either library. A new layout for the PNP has been constructed using the design rules as a guide. 6.3 Layout Hierarchy The layout design has been performed in a hierarchical fashion. First, the small opamp blocks are laid out. Next are the larger sub-blocks (transimpedance amplier, high bandwidth voltage amplier, output stage, bandgap reference, and linear voltage regulator). Finally, each sub-block is connected globally and the biasing transistors are added. 6.3.1 Folded Cascode Opamp Layout The folded cascode opamp (Section 3.1.2) layout uses all ngered devices. Its layout is simple and corresponds directly to schematic placement. Figure 6-1 presents a guide to the layout (Figure 6-2). Biasing Biasing Active Cascode Load Input Differential Pair Folded NMOS Cascode Devices Figure 6-1: Guide to Folded Cascode Opamp Layout 75 Figure 6-2: Folded Cascode Opamp Layout 6.3.2 Two-Stage Opamp Layout Biasing Compensation Resistor Compensation Capacitor Biasing Active Load Input Differential Pair Current Mirror Load Second Gain Stage Figure 6-3: Guide to Two-Stage Opamp Layout The two-stage opamp (Section 3.1.1) layout is again very simple and similar to schematic placement. This circuit requires a resistor and capacitor that are constructed next to the active devices (Figure 6-3). Figure 6-4 shows the layout for the two-stage opamp used as buers in the feedback biasing stages. Figure 6-5 presents the layout for the opamp used in the bandgap reference. 76 Figure 6-4: Two-Stage Opamp Layout Figure 6-5: Two-Stage Opamp (within Bandgap Reference) Layout 77 6.3.3 Transimpedance Amplier Layout The transimpedance amplier (Section 2.3.2) layout includes the rst voltage amplication stage and consists of the input bias transistor, mirror input transistor, cascode amplier, two-stage opamp buer, low pass lter, and folded cascode opamp (Figure 6-6). The low pass lter requires large resistors and capacitors so the layout area is dominated by the passive components. The resistors are laid out adjacent to each other to promote matching. Figure 6-7 shows the complete layout for the transimpedance amplier. 100k Ohm N−Well Resistor 100k Ohm N−Well Resistor 2nF MOS Capacitor Two−Stage Opamp TIA Folded Cascode Opamp Figure 6-6: Guide to Transimpedance Amplier Layout 6.3.4 High Bandwidth Voltage Amplier Layout As stated earlier, the transimpedance amplier contains the rst voltage amplication stage. Therefore, the second high bandwidth voltage amplier (Section 3.3) layout, as seen in Figure 6-8 looks almost exactly like the rst. The only dierence is the lack of input bias and mirrored input device. 6.3.5 Output Stage Layout The output stage (Section 3.4) is simply a chain of progressively sized inverters. The rst four are small enough to construct with single gate devices but the last inverter is large enough to use ngering. The layout for the output stage is seen in Figure 6-9. 78 Figure 6-7: Transimpedance Amplier Layout 6.3.6 Linear Voltage Regulator Layout The voltage regulator (Section 3.5) layout, shown in Figure 6-10, is simply an opamp connected to a very large power FET (top-right of the circuit). The FET is ngered but still dominates the area of the sub-circuit. This circuit, with the appropriate biasing transistors, is the only circuit that sees the VDDhigh power line. 6.3.7 Bandgap Voltage Reference Layout The bandgap sub-circuit layout is the most diverse: in addition to using an opamp, it uses three resistors and two parasitic PNP bipolar transistors. The resistors are laid out directly next to each other to promote matching in the face of process variation. Figure 6-11 presents a guide to the layout (Figure 6-12). 6.3.8 Full Layout with Biasing After all the sub-circuit layouts have been constructed, the nal layout of the full receiver is completed. Figure 6-13 describes the placement of the sub-circuits. After 79 Figure 6-8: Voltage Amplication Stage Layout Figure 6-9: Output Stage Layout Figure 6-10: Linear Voltage Regulator Layout 80 Two−Stage Opamp 7.1k Resistor 6.1k Resistor 61k Resistor PNP PNP Figure 6-11: Guide to Bandgap Reference Layout Figure 6-12: Bandgap Reference Layout 81 the sub-blocks are laid out, the biasing transistors are added. Each transistor of the biasing network is placed in a row to ensure proper matching. The biasing row is placed in the middle of the full receiver so that certain references have a central distribution point. Bypass capacitors are added on the power supply and at noise sensitive nodes. The capacitors are a combination of MOS capacitors and stacked M 1 ; M 2 ; M 3 capacitors. They ll the empty space within the layout. Note that the voltage regulator does require an external bypass capacitor on the power supply line. The nal receiver design, without the photodiode, is 205:5m 170:0m or 0:035105mm (Figure 6-14). 2 Transimpedance Amplifier High Bandwidth Voltage Amplifier Output Stage Biasing Transistors Voltage Regulator Bandgap Voltage Reference Biasing Figure 6-13: Guide to Full Receiver Layout 6.3.9 Design Verication, Extraction, and Layout Simulation Once the entire circuit has been laid out, the design rule checker is run. The nal design is DRC clean except for one pair of warnings. These warnings apply to the N-wells of the parasitic PNP bipolar devices. Normally, the N-well is tied to the most positive potential to ensure that the PN junctions formed with the diusion and substrate do not forward bias. However, the bandgap circuit relies on the PN junction between the emitter and base turning on. Therefore, this feature is desired 82 Figure 6-14: Full Receiver Layout 83 and the warning is ignored. Next, the circuit is extracted, including the extraction of parasitic capacitances. The Analog Artist tool of Cadence is used to generate a netlist from the extracted view. The netlist only contains resistor nodes; it does not contain resistance values. The resistance values, however, are present in the extracted view. In addition, the bipolar devices are completely absent from the netlist. These problems are the result of an incomplete set of library les. Both the resistor and PNP libraries have been altered during this project because of their lack of cell views. To complete the netlist, these elements are entered in manually, in order to create an accurate netlist. The netlist is put into Hspice format and simulated. The output waveform that results from the layout extraction simulation is seen in Figure 6-15. In this gure, the waveform is compared to the original Hspice simulation. The overall shape of the extracted waveform matches the Spice simulation but there are minor dierences in duty cycle and time delay. These deviations are caused by sizing dierences between the original simulation and the layout implementation. Because the two waveforms of Figure 6-15 match, the layout of the optical clock receiver is veried. Figure 6-15: Layout Extraction Simulation Output Waveform 84 Chapter 7 Testing Strategy This chapter is devoted to the strategy for testing the optical receiver circuit. The actual fabrication and testing of the chip has not been completed as part of this thesis, and is expected to be accomplished in future work in parallel with on-chip optical data receiver research [15]. The testing strategy for the optical clock receiver circuit in this thesis is catered to demonstrating full functionality rather than measuring actual skew. 7.1 Functionality and Skew Testing Special issues must be considered because the circuit receives and processes a clock signal at 1 GHz. A 1 GHz signal is very dicult to measure using conventional methods because of the parasitic capacitance in the pads and probes. Special non-intrusive or ultra-low capacitance equipment can be used to measure the high frequency signal. This method is very costly and not available for this project. Alternatively, a special circuit can be built to process the receiver output signal. This circuit can be as simple as a frequency divider that outputs a frequency at a lower, more measureable, clock speed. A more sophisticated circuit can be built that actually compares two receiver outputs to determine skew [16]. Such a circuit can then output the skew data at a much slower rate. When adding circuits to the outputs of the receiver, special attention must be taken to ensure that the post-receiver circuits themselves do not 85 add skew. To test the receiver in this thesis, a simple frequency divider should be built. After frequency division, a progressively sized inverter chain should be added to drive each pad. These circuits should allow the receiver to be tested with conventional non-specialized equipment. Skew measurement should be the main focus on the second run of the chip. This chip should contain multiple identical copies of the circuit to test the eects of variation across the chip. 7.2 Circuit Layout Variants On the current test chip, several replicate copies of the receiver should be used to test functionality. Ideally, each sub-block of the receiver design should be tested to ensure individual functionality. After that, any diculties can be attributed to biasing or interfacing. Since this approach is not practical, an alternative strategy is to use external stimulus to simulate various sub-blocks of the receiver. The following variants should be used in the nal chip layout. 7.2.1 Normal Receiver Since the voltage regulator dissipates a lot of power and requires an external bypass capacitor, a single voltage regulator should supply the power supply line to multiple receivers. A pair of receivers should be built to share a single voltage regulator. 7.2.2 Normal Receiver With Isolated Power Supply One copy of the receiver should have an exclusive voltage regulator. This will reduce loading on the regulator signicantly. 86 7.2.3 External Power Supply On one of the variants, the voltage regulator should be entirely removed and an external 1.8V power supply line should power this receiver. This will test for voltage regulator defects. 7.2.4 External Voltage Reference The bandgap reference should be bypassed and an external 1.243V reference should be added to this variant. This will test for defects in the bandgap reference. 7.2.5 Current Input Signal Injection A single receiver should be added with its input node connected to a pad instead of a photodiode. A pulse train of current can be used to test receiver operation. This will test for photodiode defects. However, because of parasitic capacitance in the pad, the speed that the receiver can be tested at will be limited. 7.2.6 Voltage Input Signal Injection Instead of using a current pulse train to test the receiver, a voltage input can be used in place of the photodiode. To enable this feature, however, the input biasing transistor and the rst input mirror transistor need to be removed. The voltage should be inserted into the gate of the rst cascode input transistor, and it should be a small voltage pulse train with an appropriate DC bias. 7.2.7 Photodiode Variants Both silicon and GaAs photodiodes will be used in the test chip. Multiple copies of each photodiode-receiver set should be built to account for photodiode defects. Depending on the space constraints on the chip, one of each photodiode could be used for each of the above variants. 87 7.3 Overall Chip Layout Summary As discussed in Section 5.1.1, the circuit should also operate at a clock speed of 100MHz. Therefore, in addition to testing at 1GHz, a 100MHz signal may also be used to verify functionality of the receiver. The nal chip should include as many variants as the chip area permits. Each variant should have a frequency divider and inverter buer pad driver. If there are more variants than can be supported with pads, an analog multiplexing scheme can be used before the pad drivers. This testing strategy will be capable of demonstrating functionality of the receiver in the fabricated chip. 88 Chapter 8 Conclusion This thesis has presented a design for a fully functional optical clock receiver operating at 1GHz. A layout has been constructed in Cadence and its extraction has been successfully simulated with Hspice. The result is a circuit that converts an optical signal into a rail to rail voltage clock signal. The receiver awaits fabrication. 8.1 Evaluation of Variation Performance Sensitivity analysis of receiver variation dependencies reveals that process variations including threshold voltage and channel length have the greatest impact on skew. The receiver performs remarkably well through environmental variations in power supply voltage and temperature. Performance is not aected signicantly when non-idealities in the photodiode and input signal are added to the model. 8.2 Alternative Techniques Not Used Several ideas were generated during this thesis but were not included because of implementation problems or time constraints. These ideas present alternative approaches that may be used in future designs. 89 8.2.1 Adaptive Biasing This technique uses a oating bias for an NMOS input single stage analog amplier. The bias tracks the power supply so that the VGS of the load remains constant. Ideally, a voltage source would connect the power supply to the load transistor's gate. Instead, a diode with a DC path to ground can be used as a voltage source. By keeping the load transistor's VGS constant, the gain of the amplier will become less sensitive to power supply variation. This technique can be used in the voltage amplication stage instead of using feedback biasing (Section 3.3.2). 8.2.2 Amplifying and Hard Limiting Another method that was considered for reducing skew was to use a stage with tremendous gain to amplify the small voltage signal o of the transimpedance amplier. That signal would then be hard limited at the rail voltage. If the input signal were centered around a constant DC voltage, skew would reduce with increasing gain. As long as the gain is large, the actual magnitude of the gain is not of concern. 8.2.3 Resoanant Tank Ampliers Many existing discrete optical ampliers use resonant tanks [4]. This concept can be applied to a single chip design using the spiral coil inductors found on newer processes. Resonant tanks allow an extension in a normal linear amplier's bandwidth. This provides higher gain at a higher frequency. However, it is non-trivial to use these ampliers within a feedback loop as the inductance causes many adverse phase eects. 8.2.4 Phase Locked Loops One of the largest challenges for this design and future optical clock receivers is the high frequency of the signal. As demands for faster and faster clock speed increase, the traditional strategies will fail to provide useful designs. One technique that can be used is to optically distribute a slower multiple of the high speed clock, convert 90 the signal to a rail to rail voltage, and then use a phase lock loop (PLL) to frequency multiply the signal to the desired frequency. In this case, the skew of the receiver will be multiplied and the PLL itself may add skew. However, the bandwidth demands on the receiver will be signicantly lower and low skew architectures may be used. 8.3 Future Improvements Although this thesis presents a fully functional design, several aspects may be pursued to build on this work. The receiver was designed using a 0:18 process. A move to a smaller process would increase the speed of operation for the receiver by allowing higher bandwidth ampliers. This design was focused on achieving functionality at 1 GHz. Therefore, it may easily be optimized for power consumption and area. Additional work may also be done to decrease its sensitivity to process variation. Finally, the frequency of the optical distribution network will not be limited by the receiver design; rather it will rst be limited by the photodiode. One option is to use special layout techniques such as ngering or spatial modulation [10]. However, these techniques will eventually be limited. Instead, integration of CMOS circuits with alternative materials to silicon need to be investigated in order to achieve a fast photodiode design. 8.4 Summary of Contributions The fundamental goal of this thesis research has been to explore a robust, high performance, standard CMOS circuit design for potential use in on-chip optical clock distribution. The key challenges are high speed and low skew. Previous work by Sam [3] indicated that process and operating environment variations can induce large skew in a baseline optical receiver design. In this work, we have contributed a new design that features a balanced DC biased signal path using the feedback biasing 91 technique. The design also includes power supply regulation and a variation independent bandgap voltage reference. Given integrated high performance photodiodes (requiring SiGe or GaAs), the proposed circuit is expected to achieve robust and low skew operation at 1GHz. Future fabrication and testing of the circuit will evaluate this performance. Additional investigation of methods to further reduce the eects of process variation induced skew is also needed to achieve even higher optical clock receiver frequencies. 92 Bibliography [1] P. Hofstee, N. Aoki, D. Boerstler, P. Coulman, S. Dhong, B. Flachs, N. Kojima, O. Kwon, K. Lee, D. Meltzer, K. Nowka, J. Park, J. Peter, S. Posluszny, M. Shapiro, J. Silberman, O. Takahashi, and B. Weinberger. A 1 GHz Single-Issue 64 b PowerPC Processor. Proc. ISSCC, pages 92{93, February 2000. [2] D. A. B. Miller. Optical Interconnects to Silicon. IEEE Journal on Selected Topics in Quantum Electronics, 6(6):1312{1317, 2000. [3] Shiou Lin Sam. Characterization of Optical Interconnects. Master of science thesis, Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2000. [4] S. B. Alexander. Optical Communication Receiver Design. SPIE - The International Society for Optical Engineering, The Institution of Electrical Engineers, 1997. [5] T. K. Woodward and A. V. Krishnamoorthy. A 1 Gbit/s CMOS Photodetector Operating at 850nm. Electronic Letters, 34:1252{1253, 1998. [6] A. Tanabe, M. Soda, Y. Nakahara, T. Tamura, K. Yoshida, and A. Furukawa. A Single-Chip 2.4Gb/s CMOS Optical Receiver IC with Low Substrate Cross-Talk Preamplier. IEEE Journal of Solid-State Circuits, 33(12):2148{2153, 1998. [7] M. Ingels and M. S. J. Steyaert. A 1-Gb/s .7 CMOS Optical Receiver with Full Rail-to-Rail Swing. IEEE Journal of Solid-State Circuits, 34(7):971{977, July 1999. 93 [8] A. Verma, A. Chatterjee, and B. Bhuva. All Si-based Optical Interconnect for Signal Transmission. IEEE IITC, 2001. [9] A. Mule, S. Schultz, E. Glytsis, T. Gaylord, and J. Meindl. Input Coupling and Guided-wave Distribution Schemes for Board-level Intra-chip Guided-Wave Optical Clock Distribution Network Using Volume Grating Coupler Technology. IEEE IITC, 2001. [10] C. Rooman, D. Coppee, and Maarten Kuijk. Asynchronous 250-Mb/s Optical Receivers with Integrated Detector in Standard CMOS Technology for Optocoupler Applications. IEEE Journal of Solid-State Circuits, 35(7):953{958, 2000. [11] David A. Johns and Ken Martin. Analog Integrated Circuit Design. John Wiley and Sons, 1997. [12] Michael Green and Sridevi Joshi. A 1.5V CMOS VGA Based on PseudoDierential Structures. IEEE International Symposium on Circuits and Systems, May 2000. [13] M. W. Kruiskamp and D. M. W. Leenaerts. A CMOS Peak Detector Sample and Hold Circuit. IEEE Transactions on Nuclear Science, 41(1), 1994. [14] Avant! Star-Hspice Manual Index. http://www.ece.orst.edu/moon/hspice98/. [15] Michael Mills. An On-Chip Data Transmission Ciruit. Master of Engineering Thesis Proposal, Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, May 2001. [16] V. Gutnik and A. Chandrakasan. On-chip Picosecond Time Measurement. Symposium on VLSI Circuits Digest of Technical Papers, pages 52{53, 2000. [17] Payman Zerkesh-Ha et al. Characterization and Modeling of Clock Skew with Process Variations. IEEE Custom Integrated Circuits Conference, 1999. [18] H. Zimmerman. Integrated Silicon Opto-electronics. Springer, 2000. 94 [19] Paul R. Gray and Robert G. Meyer. Analysis and Design of Analog Integrated Circuits. John Wiley and Sons, 1977. [20] Roger T. Howe and Charles G. Sodini. Microelectronics - An Integrated Approach. Prentice-Hall, 1997. [21] Thomas H. Lee. The Design of CMOS Radio-Frequency Integrated Circuits. Cambridge University Press, 1998. [22] Yannis Tsividis. Operation and Modeling of the MOS Transistor. McGraw-Hill, 1999. [23] Cadence. Virtuoso Layout Editor User's Guide, 2000. 95