8 Chapter 2 Polarization-mode dispersion, polarization-dependent loss and receiver modeling 2.1 Polarization-mode dispersion Polarization-mode dispersion arises in single-mode fiber and fiber-optic components due to a small difference in refractive index for a particular pair of orthogonal polarization states called the fast and the slow polarization states. This property is called birefringence [31]. This index difference results in a difference in the propagation time called differential group delay (DGD) for waves traveling in these two polarization modes. In single mode fibers, the birefringence originates from a slight ellipticity of the core and from non-circularly symmetric stresses. It varies randomly from fiber to fiber because of the randomness of the geometry of the core and stress irregularities. As a result, coherent interference between the polarization modes causes the polarization-mode dispersion to vary randomly with the wavelength and ambient temperature [32]. In specially fabricated polarization maintaining (PM) fiber, the core of the fiber is exposed to an asymmetric field that is uniform along the fiber’s length. The polarization modes are maintained 9 along the fiber and energy does not couple between the modes. The polarization modes and DGD of a PM fiber are independent of wavelength. The propagation of a pulse through a long length of fiber can be very complicated since the birefringence varies randomly along the fiber [32–33]. However, there are two special orthogonal polarization states at the fiber input for which the output pulse is undistorted to first order. These two polarization states are called principal states of polarization (PSP). The PSP model was originally developed by Poole [34]. This model provides both a time domain and a frequency domain characterization of PMD. In the time domain, when a pulse is not launched in one of the two principal states, the output pulse is a combination of two delayed versions of the input pulse propagated in the two principal states, each with a fraction of the energy of the input signal. As a result the output pulse is typically broadened due to the differential group delay between the two polarization modes. In the frequency domain, a PSP is defined as the input polarization state for which the output polarization state is independent of frequency to first order. In the absence of polarization-dependent loss, the two PSPs are orthogonal to each other. For each pair of PSPs at the fiber input, there exist a corresponding pair of orthogonal PSPs at the output. The input and output PSPs are related by the fiber’s transmission matrix. PMD causes a variety of impairments in optical fiber transmission systems. One impairment is the inter-symbol interference (ISI) within a channel due to the pulse broadening that is caused by DGD when the input polarization state does not coincide with either of the two PSPs [7]. In this first-order PMD effect, a fraction of the energy of the input signal is coupled into one PSP and the rest of the energy is coupled into the other PSP. Although the pulse broadening resembles that of chromatic dispersion, the two effects 10 differ in many ways. Chromatic dispersion is a relatively stable phenomenon. The total chromatic dispersion of an optical fiber communication system is the sum of the chromatic dispersion in each of its subsections, and the location and amount of dispersion compensation can be planned in advance. In contrast, the PMD of a single-mode optical fiber at a given wavelength is not stable, forcing system designers to make statistical predictions of PMD effects and making passive compensation impossible [35]. In addition, system impairment due to second- and higher order PMD occurs for larger bandwidth signals, particularly when the PMD components combine with chromatic dispersion or signal chirp. PMD impairment also occurs in WDM systems where PMD induces coherent crosstalk between multiplexed channels [36]. In polarization-multiplexed transmission systems, adjacent channels are launched with orthogonal polarization states in order to suppress nonlinear impairments such as cross-phase modulation (XPM) or four-wave mixing (FWM). PMD destroys the orthogonality of these polarization states [37]. Impairments due to second- and higher order PMD and in WDM systems are not covered here since they are not the goal of the current work. 2.2 Polarization-dependent loss PDL is an important polarization effect in long-haul optical fiber communication systems. PDL occurs when the loss through a device depends on the input state of polarization of the signal. The combined contribution of PDL in each of the components of a system can cause large overall power fluctuations at the receiver. This in turn may seriously affect the overall system performance [13]. In general, the total PDL of several concate- 11 nated components cannot be obtained by simply adding up the individual PDLs because the PDL axes of the components are usually randomly aligned and the alignments are not stable due to the drift of the fiber realization in the fibers that connect the PDL elements [38]. However, the sum of the individual PDL values yields the worst case system PDL. In optical fiber communication systems, polarization-sensitive elements such as isolators and WDM couplers in optical amplifiers are the main source of polarizationdependent loss. The effect of PDL is to cause excess loss in one of two orthogonal polarization states. One simple method to characterize the PDL is called the maximumminimum power method. By scanning all the possible polarization states at the input of the device and determining the power variation at the output, one can calculate the ratio of the maximum power Pmax and minimum power Pmin and obtain the PDL value in dB as P PDL 10log10 max Pmin . (2.1) Typically, for a single device the PDL level can vary from a few tenths of a dB to less than a tenth of dB. In transoceanic systems, the requirement for the PDL in each EDFA is very stringent. It must be below 0.1 dB to avoid significant re-polarization of the unpolarized noise [14]. In terrestrial systems, the PDL allowed for each EDFA is higher and can be a few tenths of a dB due to the shorter propagation distance and fewer PDL elements. Although the PDL in the system does not significantly degrade the average signalto-noise ratio (SNR), it may cause large variations of the SNR [15–16]. Furthermore, by polarizing the initially unpolarized signal and noise, it can seed further degradations due to PMD. 12 2.3 Stokes representation and Poincaré sphere A polarized lightwave signal that propagates in the fiber is represented by an electric field vector that lies in a plane transverse to the direction of propagation. The state of polarization of the light is defined in terms of the curve traced out in this transverse plane by the electric field vector as a function of time or distance along the fiber. Unpolarized light can be represented by an electric field vector whose direction varies randomly in the transverse plane. For light to be regarded as unpolarized, the time required for the direction of electric field vector to randomize must be short compared to the time that it takes to measure the polarization state. This observation is important because there are telecommunications applications in which polarized light is scrambled at a rate fast enough to appear unpolarized to the optical amplifiers [39], even though in a short enough time window the light may appear partially or completely polarized. Partially polarized light can be modeled as the superposition of completely polarized and unpolarized light. The degree of polarization (DOP) is defined as DOP Ppol Ppol Punpol , (2.2) where Ppol and Punpol are the power of polarized and unpolarized light on a linear scale, respectively. In free space, the light will maintain its DOP. However, when the light propagates in an optical fiber with PMD, the DOP can decrease depending on the spectral width and the input polarization state of the signal. I now clarify the notation conventions I use for vectors in this dissertation. For Jones vectors with two components, I will use lower case bold symbols without arrows, e.g. e. For a vector with three components, I will use a capital bold symbol, e.g. S. For all vec- 13 tors with four components, I will use capital bold symbols in the font of Monotype Corsiva, e.g. S. In order to unambiguously describe a state of polarization, an explicit coordinate system must be specified. If the z-axis is the axis of propagation, then polarized light can be represented in terms of the x- and y-axis projections of the electric field vector by E x ( z, t ) E0 x cos(t x ) (2.3) E y ( z , t ) E0 y cos(t y ) , (2.4) where E0x and E0y are the amplitudes, and δx and δy are the corresponding phases. R. C. Jones [40] developed a new polarization calculus based on optical fields rather than intensities. In the Jones calculus, polarized light can be represented by a two-element complex vector. The elements of the vector specify the magnitude and phase of the x- and ycomponents of the electric field. The Jones vector takes the form Ex E0 x exp(i x ) e , E E exp( i ) y 0 y y (2.5) where E0x, E0y, δx and δy are real and represent the amplitude and the phase of each of the polarizations respectively. The transmission property of a two port optical device can be described by a complex 2 × 2 Jones matrix. The input and output of the Jones vector can be related by the Jones matrix J as eout = J ein. (2.6) Even though the Jones calculus provides a simple description of the polarization state of light in an optical fiber communication system, it is not convenient to measure the electric field of an optical signal experimentally, especially when the signal is partially polarized. Methods have been developed to express polarizations in easily measurable 14 terms, such as optical power. One widely used tool is the Stokes vector, whose components are defined in terms of the optical powers of components of the light in specific polarization states. The Stokes vector, which uniquely describes the polarization state of a signal, takes the form S1 S S2 . S3 (2.7) If S0 denotes the total power of the light, then S0, S1, S2, S3 are called the Stokes parameters. The Stokes vector is related to the Jones vector using the 2 × 2 Pauli spin matrices [41]. The meaning of the Stokes parameters is as follows: S0 = Total power, S1 = Power through linear horizontal polarizer (LHP), S2 = The Power difference between light transmitted through a linear + 45o polarizer (L + 45o) and through a linear – 45o polarizer (L – 45o), and S3 = The power difference between light transmitted through a right circular polarizer (RCP) and through a left circular polarizer (LCP). For monochromatic light, one can show that S 02 S12 S 22 S 32 . (2.8) In general, the power in the polarized part of the signal is given by Ppol S12 S 22 S 32 , (2.9) and the DOP of the signal is given by DOP S12 S 22 S 32 S0 . (2.10) 15 The Poincaré sphere is a graphical tool that allows any state of polarization to be uniquely represented in a real, three-dimensional space, on or within the unit sphere. A few special polarization states are plotted on the Poincaré sphere in Fig. 2.1. S3 RCP (0,0,1) LVP (–1,0,0) L + 45o (0,1,0) LHP (1,0,0) S2 S1 L – 45o (0,–1,0) LCP (0,0,–1) Fig. 2.1 Poincaré sphere representation of the polarized light Fully polarized light is represented by a point on the surface of the sphere. The DOP of depolarized light is less than one and its state of polarization lies within the sphere. In Fig. 2.1, circular states are at the poles, linearly polarized states are on the equator, and right-hand and left-hand elliptical states lie on the northern and southern hemisphere respectively. One important difference between the Poincaré sphere representation and the Jones space representation is that the angle between two polarization states in the Jones space representation is always half of the angle in the Poincaré sphere representation. For example, when there is no PDL in the transmission line, the PSPs are orthogonal in the Jones space representation whereas they are antipodal in the Poincaré sphere representation. 16 Since any polarization state is uniquely represented by a point on the Poincaré sphere, a continuous evolution of polarization states will trace out a smooth curve on the Poincaré sphere. When a single pulse is launched into a piece of PM fiber with negligible second- and higher order PMD, the signal power is coupled to the two principal polarization states. Their relative powers are determined by the launching angle between the SOP of the signal and the principal states of the fiber. At the output of the PM fiber, the pulses in each of the principal states are offset from each other by an amount given by the DGD of the PM fiber. On an oscilloscope, one can see two pulses on two sides of the center of the bit slot with a delay of one half of the DGD relative to the center. On the Poincaré sphere, the SOP of the signal as a function of time will be located at one of the two principals states at times when the power in one principal state is small compared to the power in the other. At times when both principal states are present, the SOP of the signal may transition from one PSP to the other along an arc of a great circle, as shown in Fig. 2.2. DGD S1 PSP+ S0 S3 PSP– S2 Center of bit slot S1 (a) (b) Fig. 2.2 (a) SOP of the output pulse as a function of time. (b) Normalized SOP as a function of time on the Poincaré sphere. 17 In Fig. 2.2, I simulated the SOP of a single pulse after propagation through a PM fiber with 57 ps of DGD. The PMD-induced depolarization of the output pulse can be seen in Fig. 2.2(a) and (b) as large variations of the Stokes vector (S1, S2, S3) as a function of time. In Fig. 2.2 (a), the left half of the bit slot is filled mostly with the energy from one principal mode and the right half of the bit slot is filled with the other principal mode. The transition of the SOP of the signal from one principal state to the other principal state happens at the center of the bit slot. This large variation of the Stokes vector across the bit slot is one of the fundamental differences between pulse distortions induced by chromatic dispersion and by PMD. Fig. 2.2 will be very useful in later chapters to help understand the performance variation caused by pulse distortion induced by PMD and partially polarized noise induced by PDL. 2.4 Receiver models 2.4.1 System performance measures Fiber optic systems consist of three major subsystems: the transmitter, the transmission path, and the receiver. These subsystems jointly determine the system performance and they are all critical in the design of a fiber optic system. In particular, the system performance is evaluated at the receiver. Given a transmitter and a transmission path, an important task is to design a receiver to maximize the system performance. To achieve this goal, one must identify the transmission impairments and design a receiver to minimize the effects that these impairments have on the system performance. Much work has been done on optical receivers, including the comparison of different modulation formats, re- 18 ceiver optimization and studies of receiver sensitivity [22–30]. The most widely used parameter to characterize the system performance is the bit-error ratio (BER). Typically, optical fiber transmission systems are designed and operated at a BER below 10–12. This BER limit can be raised to 10–4 if forward-error correction codes are used [42]. However, to reduce the possibility of exceeding the BER limit, practical systems always operate substantially below the BER limit. As a result, direct BER measurement may require hours of transmission experiments [42] and direct BER computation may take days of standard Monte Carlo simulations [43]. Alternatively, another widely used performance measure is the Q-factor. The Q-factor is a function of the mean and standard deviation of the received currents in the marks and spaces and it is often defined as [44] Q where i1 and i1 i0 1 0 , (2.11) i0 are the mean and 1 and 0 are the standard deviation in the marks and spaces respectively. The measurement and computation of the Q-factor is much less time consuming than that of BER. When the electric currents in the marks and spaces are both Gaussian-distributed, the Q-factor is a good estimate of the BER and is related to the BER by BER erfc( Q / 2 ) / 2 exp( Q 2 / 2) /( 2 Q) . (2.12) Although the actual distributions of the marks and spaces are generalized χ2 distributions, the Q-factor is still a good estimate of BER when the BER is not too low [42]. Both the BER and the Q-factor depend on the characteristics of the receiver, the polarization state of the signal and the noise, and on the shape of the optical pulses after transmission. 19 2.4.2 Previous receiver models Receiver studies in optical fiber communication systems were first conducted more than two decades ago [45]. In the early 1990s, Marcuse [22, 46] derived the bit error probability of binary OOK signals for a simple system in a closed analytical form. In his work, the optical filter has a rectangular shape and its bandwidth is wide enough to let the signal pass undistorted. This assumption is not valid in modern systems with narrow optical filters that have bandwidths as small as 10 GHz. In fact, studies [25,29] have shown that the optimum bandwidth of the optical filter has to be narrow enough to reduce the ASE noise that enters the receiver as much as possible without significantly attenuating the amplitude of the signal. One important assumption in his work is that the signal is polarized and the optical noise is either unpolarized or completely polarized and copolarized with the polarization state of the signal. These assumptions do not hold for optical fiber transmission systems in which PMD and PDL induce penalties. Although Marcuse only dealt with simplified signals and receiver structures, his work represents a milestone in receiver modeling and has been widely referenced by other researchers [25, 30, 47–49]. More recently, Winzer et al. [25] performed a comprehensive study of receivers. They compared the receiver sensitivities of return-to-zero (RZ) and non-returnto-zero (NRZ) modulation formats using realistic optical and electrical filter shapes and analyzed the impact of degrading effects on receiver sensitivities and optimum receiver bandwidths, including finite extinction ratio, chirp and optical-carrier fluctuations. They also compared the performance of RZ and NRZ formats in the presence of first-order PMD. However, their work does not account for partially polarized noise and their experimental results showed large scatter compared to his simulation results [50]. 20 Important work that is closely related to my study was conducted by Lima, et al. [27– 30] and Sun, et al. [20]. Lima, et al. generalized Marcuse’s and Humlet and Azizoglu’s results by deriving a formula for the Q-factor as a function of the optical signal-noiseratio (OSNR) for arbitrary pulse shapes, realistic receiver filters and arbitrarily polarized noise. In this part of their work, the effect of PMD on the signal is assumed to be negligible. They also studied the case where the signal is depolarized by PMD; however in this case they assumed that the optical noise is unpolarized. Sun, et al. [20] derived an expression for the distribution of the Q-factor for a fixed signal-to-noise ratio and a fixed SOP of the signal when the angle between the SOP of the signal and the polarized part of noise is randomly varied. They also conducted an experimental study of the effect of partially polarized noise on the system performance at the receiver in a back-to-back system. In this work, the system was not impaired by PMD. In this study, I will extend the work of Lima, et al. and Sun, et al. by studying the combined effects that a depolarized signal due to PMD and partially polarized noise due to PDL have at the receiver. In particular, I will derive a theoretical formula for calculating the Q-factor that accounts for such combination of polarization effects. In comparison to calculating the system performance using Monte Carlo simulations or by conducting experiments, the analytical formula has two advantages. First, it can significantly reduce the cost of long computation and measurement times. Second, it can be used to understand exactly how and when the system performance is affected by polarization effects. I will validate the results by comparing the results obtained from experiments and Monte Carlo simulations using the formula. I will show that, even with a fixed OSNR, the performance of the system can vary widely, depending on the power splitting ratio of the 21 signal, the DOP of the noise and on the relation between the polarization states of the signal and the polarized part of the noise. This receiver model can be used in any on-off keyed (OOK) system in which the PMD and PDL induce significant performance impairments.