as a PDF

advertisement
An Analytical Crosstalk Model with Application to ULSI Interconnect Scaling
Dennis Sylvester, O. Sam Nakagawa *, and Chenming Hu
Department of Electrical Engineering and Computer Sciences
University of California, Berkeley
* ULSI Laboratory, Hewlett-Packard Company, Palo Alto, CA
Cc
Ra
Abstract
A new analytical model is presented to accurately determine
crosstalk noise due to on-chip interconnections. This model takes
+
into account interconnect line resistance and driver strength,
which have not been adequately considered in previous crosstalk
+
Vdd
Cv Rv Vx
Ca
models. Our model can be used in conjunction with simple
Tr
analytical device timing expressions to provide close agreement
with SPICE simulations for peak crosstalk in 0.25 µm technology
generations and beyond. Furthermore, the model is applied to
future ULSI interconnect scaling scenarios to analyze their impact Fig. 1 Simplified circuit schematic described in the text. Aggressor
is modeled as a voltage source with ramp rate Vdd/Tr, and crosstalk is
on noise issues.
denoted as Vx. Rv is the sum of victim driver and line resistances.
Introduction
On-chip crosstalk is a major concern in ULSI circuits due to
scaling linewidths, increasing aspect ratios and larger die sizes.
Also, due to decreasing noise margins and larger ground bounce,
noise issues become even more important. There is a need for
accurate yet fast methods to analyze on-chip crosstalk.
Previously presented crosstalk models typically ignore wiring
resistance [1,2] or apply only to step inputs [3]. Our model is
simple, accurate and provides an excellent basis for a crosstalk
screening tool. We demonstrate its accuracy in future generation
logic gates, concentrating on inverters and NAND gates. The
model can be used in conjunction with timing macromodels or
other timing models, as driver rise time is an important parameter
in crosstalk calculation. We introduce one such timing model for
use in this paper. Finally, we use our model to analyze future
developments in ULSI interconnect and their impact on peak
crosstalk. Specifically, we focus on the move to low-k dielectrics
and copper wiring.
Model Derivation
Crosstalk is a complex form of electromagnetic coupling
between two or more conducting lines. In order to obtain a
tractable analytical expression for peak crosstalk noise, several
assumptions must be made. First, the aggressor gate is modeled
as a ramp voltage source with rise/fall time, Tr. This rise time can
be obtained by use of a circuit timing simulator or through use of
analytical expressions such as those presented in [4]. A new
timing macromodel is presented in the next section for this
purpose. A second assumption is that all interconnect and load
capacitances (including fan-out gates and drain junction
capacitances) are modeled as a lumped capacitance to ground,
excluding the coupling capacitance. Finally, the victim line
driver is modeled as an effective resistance. This resistance can
be found using the slope of a device’s Id-Vd curves at the origin
and need only be found once for a given technology as it scales
with device width.
The resulting equivalent circuit is shown in fig. 1. From circuit
analysis principles, the victim line voltage, Vx, can be found to
be:
Vx =
Vx =
R v Cc Vdd
τ0 Tr
 
R v Cc Vdd  1  −
e
τ0Tr τ1 


 
t
τ1
− e
−
−t
−t


τ1
τ2
e
e
τ
+
τ
−
τ
1
2
 0



1 −
−
 τ e

2 


(t − Tr 
)

τ1
t
τ2
− e
−





( t − Tr )
τ2
(1)






(2)
In these equations, Tr is the rise time at the output of the
aggressor driver and τ0, τ1 and τ2 represent different time
constants of the circuit. Definitions of these time constants are
contained in the appendix. Eqn. (1) is applicable during the
aggressor rise time, Tr, and (2) applies when t > Tr. The peak
value of crosstalk noise, Vmax, can be found by differentiating (2)
as this peak always occurs at t > Tr.
Vmax

 Y1
R v C c Vdd 
=
τ1 Y1 
Y
τ 0 Tr 
 2

τ2
 Y1
 τ1 − τ2

− τ 2 Y2 
Y

 2

τ1 
 τ1 − τ2 






(3)
Here Y1 = exp(-Tr / τ1) –1 and Y2 = exp(-Tr / τ2) –1. For slow rise
times (Tr >> τ2), (3) can be seen to approach a limiting value of
RvCcVdd / Tr. Also, the model presented in [3] is a special case of
(3) when Ra = Rv, Ca = Cv and the gate output is a step. Thus, the
model can be seen to have wider applicability than those
presented previously.
Deep submicron interconnect cannot be effectively modeled by
a lumped RC model. However, to derive a simple analytical
model, a lumped topology must be assumed. So, for our model to
accurately represent on-chip interconnect the capacitances must
be scaled to account for the distributed properties of an RC line.
Fig. 2 shows the representative cross-section of the 3-line system
utilized throughout this paper. Based on the Elmore delay model,
the ground capacitances Ca and Cv are scaled by a factor of 0.5
(for an N-step ladder, the factor (N+1) / 2N should be used) since
the distributed capacitances only see the upstream resistance,
rather than the total line resistance [5]. Coupling capacitance is
scaled by a different, larger which is technology independent and
given by:
α = (1 − χ)e
− Tr
τ0
+ χ
(4)
W
S
45
Cc
Cc
Ca
Cv
Ca
Ground Plane
Fig. 2 Cross-sectional representation of the 3-line conductor system
used in this paper. Worst-case crosstalk occurs when two aggressor
lines switch simultaneously, acting upon a single victim line.
The parameter χ is a simple expression accounting for the victim
driver resistance and is equal to 1 for shorter lines (devicedominated) and 0.5 for long lines (interconnect-dominated). As
mentioned earlier, our model can be used with ECAD tools that
provide gate output rise times to determine peak crosstalk. Fig. 3
demonstrates the accuracy of the model, including the capacitance
scaling scheme, when the aggressor gate rise time is known. In
this and all subsequent cases, a distributed RC network is used in
SPICE to simulate crosstalk using 0.25 µm device and
interconnect parameters unless otherwise specified.
All
interconnect parameters used in this paper are included in table 1,
and capacitance values were determined using 3-D simulations.
V max / V dd (%)
T
H
45
SPICE simulation
Timing level model
40
40
35
35
30
30
25
25
20
20
15
15
10
10
5
5
0
100
1000
0
10000
Line Length ( µm)
Fig. 3 This plot demonstrates the model’s accuracy when the
rise time is known (Tr = 250 ps). Maximum error is about 4
percent and is due to the lumped approximations of our model.
capacitances. In the case of a 0.25 µm inverter analyzed in this
work, tr2 constitutes 31% of the total rise time at a line length of 1
mm when the input rise time is 200 ps. Inaccurate modeling of
this term, or the assumption of a step input, will result in large
errors in rise time estimation.
Finally, capacitive loading comprises the largest portion of Tr.
A modified analytical model based on [6] results in the
expressions:
V − 0.1Vdd ( Vdd − Vt ) 19Vdd − 20Vt 
Transistor-level Model

t r 3 = 1.2λC L  t
ln 
+


This section will introduce a new timing macromodel for use
2I dsat
Vdd



 I dsat

with (3) and (4) above. The rise time of a gate driving an
4
(7)
interconnect can be broken into 3 distinct parts. First is the


R a,int


intrinsic rise time of the device with no load and a step input at
λ= 1 −
(8)

R
the gate. The value of this delay is proportional to the effective
 a, int + 3 R a,dev 
resistance of the device multiplied by the capacitance at the
In these expressions, λaccounts for the shielding of capacitance
output node.
(5) by line resistance, CL is the interconnect capacitive load (Ca + Cc
t r1 = 0.5R sat Cout
in fig. 2), and the factor of 1.2 accounts for short-circuit current.
where Rsat is calculated as Vdd / Idsat. This term is small, in the Ra,int and Ra,dev refer to the aggressor line resistance and the
range of 5-10 ps for inverters, and is independent of device width aggressor device resistance. Short-circuit current is overas the geometry dependencies of Rsat and Cout cancel. For older estimated for large line lengths but does not result in large errors
technologies or large gates (e.g. 4-input NAND), this term can be in Tr. Eqn. (8) is an empirical determination of resistive shielding
and could be optimized for individual technologies. By summing
sizable.
The second component of rise time is the input waveform tr1, tr2 and tr3, a simple expression for Tr is obtained. Results are
dependency. Since the input to any gate is not an ideal step, there shown in fig. 4 for 2 different gate topologies and loading
is less drive current supplied initially due to the fact that Vgs < conditions. Overestimation at long line lengths is due to the fact
Vdd. Simulations show this input waveform dependency to be that short-circuit current is nearly negligible when CL is very
linear, both in propagation delay and rise time. Thus, the large. However, the sensitivity of Vmax to Tr is very high for short
lines but drops dramatically for long line lengths. For example, at
following expression describes this dependency well:
a line length of 300 µm, a 15% increase in Tr results in a 13%
t r 2 = k i t in
(6) drop in V for a 0.25 µm inverter. At a line length of 4 mm, a
max
Here ki is a constant for a given technology and varies between 15% change in T yields less than a 0.5% difference in V . For
r
max
0.1 and 0.2, signifying that 10 to 20% of the input transition time this reason, overestimation for large loads is tolerable to obtain
is reflected in the output delay or rise time. This term can be very higher accuracy at short line lengths.
significant, especially for slow input signals and small load
Table 1. Nominal interconnect parameters used in this study, based loosely on SIA National Technology Roadmap [7]
Technology
W=S
T
H
Resistance
Cv
Ca
Generation
(µm)
(µm)
(µm)
(Ω / µm)
aF / µm
aF / µm
0.25 µm
0.45
0.6
0.8
0.1055
46.9
82
0.32
0.55
0.7
0.1620
40.9
77.4
0.18 µm
0.23
0.45
0.6
0.2755
36.1
74.1
0.13 µm
0.17
0.4
0.55
0.4193
31.1
70.4
0.10 µm
0.12
0.35
0.5
0.6789
26.2
67.1
0.07 µm
Cc
aF / µm
78.4
93.9
104
120
141.1
Output Rise Time (ps)
700
700
600
600
500
500
400
400
sizes are varied to obtain a large range of rise times. The timing
model is used to calculate Tr in each case and the crosstalk values
are still in very good agreement with SPICE. These results are
also shown in fig. 6 and verify that both the crosstalk model and
the timing model are valid over a wide range of parameters.
Interconnect Scaling
Several technological breakthroughs in the area of on-chip
interconnections loom in the near future. Foremost among these
SPICE NAND2
200
200
are the introduction of copper for ULSI wiring, and the use of
Model NAND2
SPICE inverter
100
100
new, low-k dielectrics in place of SiO2. Copper provides the dual
Model inverter
advantages of higher electromigration resistance and lower
0
0
resistivity over aluminum-based wiring. Low-k dielectrics will
0
2000
4000
6000
8000
10000
reduce the capacitive load on gates, allowing for faster switching
Line Length ( µm)
Fig. 4 Comparison of rise times for 3-input NAND’s and inverters times and clock speeds. While both these advancements have
faced processing difficulties, both copper wiring and low-k
with results from SPICE and the model presented.
dielectrics are expected to be incorporated into 0.18 µm
45
45
processes, if not earlier. By inspection of (3), reductions in line
Analytical Model, NAND3
40
40
resistance and coupling capacitance can be seen to have direct
SPICE Simulation, NAND3
impact on noise issues. This section aims to accurately assess the
Analytical
Model,
Inverter
35
35
SPICE Simulation, Inverter
impact of new interconnect materials on crosstalk using our new
30
30
analytical models.
25
25
The interconnect parameters in table 1 are calculated within the
20
20
traditional Al/SiO2 scenario. In this study, copper wiring is
estimated to yield 40% less resistance than Al and low-k
15
15
dielectrics exhibit a relative dielectric constant of 2.5 (compared
10
10
to 4.0 for SiO2). Several different scaling scenarios are
5
5
examined:
300
V max / V dd (%)
300
0
10000
0
1)
2)
Fig. 5 Crosstalk model fit when using timing macromodel to obtain 3)
Tr. Note the NAND gate has higher crosstalk due to a larger effective
4)
100
1000
Line Length ( µm)
victim resistance (3 NMOS in series).
Victim Driver Resistance ( Ω )
V max / V dd (%)
0
50
350
50
45
45
40
40
35
35
30
30
25
25
20
20
15
50
100
150
200
250
300
15
SPICE, Inverter
Model
SPICE, 3-input NAND
Model
10
5
10
5
0
0
0
200
400
600
800
1000
Rise Time at output of gate (ps)
Fig. 6 Peak crosstalk is plotted as a function of victim driver size in a
3-input NAND gate, and as a function of aggressor rise time for an
inverter topology. Fit is within 12% for all rise times and 7% for all
driver sizes. Line length in both cases is fixed at 2.5 mm.
Fig. 5 demonstrates the fit of the crosstalk model for a 3-input
NAND and inverter when using the above timing model to obtain
Tr. Error is less than 10% for all line lengths up to 1 cm and in
most cases is less than 5%. To demonstrate the wide applicability
of this model, the victim driver size is varied by an order of
magnitude in a 3-input NAND gate (Wn = 10 to 100 µm) and the
resulting crosstalk is plotted in fig. 6. Finally, aggressor driver
5)
6)
Nominal case: Al-alloy wiring, SiO2 dielectric
Copper case: Copper wiring, SiO2 dielectric
Low-k case: Al-alloy wiring, low-k replaces SiO2
Split low-k case: Al-alloy wiring, low-k used between wires
on same level, SiO2 between levels
Low-k + Copper
Split low-k + Copper
The split low-k case is a realistic compromise between an all
low-k approach and the current SiO2 processes. Due to
processing issues, such as the lack of low-k chemical-mechanical
polishing (CMP) processes, the stacking of low-k on SiO2 may be
necessary in future technologies, especially 0.18 µm.
A critical line length, Lcrit, is defined as the line length at which
crosstalk reaches 20% of Vdd. To analyze the impact of each
scaling scenario on crosstalk, Lcrit is plotted in fig. 7, focusing on
the next 12 years of CMOS technological evolution. Cases 4 and
6 are not shown for the sake of clarity, as these scenarios result in
nearly identical Lcrit values as cases 3 and 5 respectively. As
expected, cases 5 and 6 above offer the biggest gains in terms of
Lcrit increases. It is surprising to see that the split low-k case does
not offer significant advantages over the all low-k scenario. In its
most basic form, crosstalk is modeled as a simple capacitive
divider; Vmax = Vdd * Cc / Ctot. Since split low-k dielectrics lead to
larger reductions in Cc than Ctot, it is expected that this case will
yield the largest gains in crosstalk reduction. However, at short
line lengths (L < 1 mm) total capacitance is dominated by
junction capacitances and fan-out. Therefore, the reduction in
crosstalk is directly proportional to the reduction in coupling
capacitance. For this reason, the all low-k approach yields lower
crosstalk at short line lengths but approaches the nominal case
crosstalk for long lines. This point is better illustrated in fig. 8,
which demonstrates the reduction in crosstalk in comparison to
2000
2000
eqns. (7) and (8)) and it is the first effect that dominates, leading
to lower crosstalk
Finally, fig. 9 demonstrates the effectiveness of several novel
1600
1600
approaches to crosstalk reduction. Specifically, split low-k is
1400
1400
shown to have slightly worse noise margins than an all low-k
1200
1200
approach, as discussed above. Another approach is the use of
1000
1000
“flat” wires; copper wires with aspect ratio reduced by 30 to 40%
800
800
from current projections. This will yield line resistances
600
600
comparable to aluminum wiring at the original aspect ratio but
400
400
will cut coupling capacitance by > 25%. The “flat” wiring
approach was taken by IBM in designing the first 1 GHz
200
200
microprocessor [8]. In addition, we recommend the use of
0
0
0.05
0.10
0.15
0.20
0.25
asymmetric pitches, where spacing is somewhat larger than
Technology Generation ( µm)
linewidth, in noise-critical cases. Area penalties are minimized
Fig. 7 Critical line lengths for various interconnect scenarios. This while still obtaining substantial reduction in crosstalk noise.
Nominal
Copper
All low-k
All low-k + Copper
1800
Critical Line Length (
µm)
1800
plot spans the 0.25 to 0.07 µm generations.
Conclusions
This paper introduces a new analytical crosstalk model that is
accurate, simple, and more general than those presented in the
literature. In addition, a timing model is demonstrated for use
30
30
with our crosstalk model in providing fast estimations of noise in
deep submicron technologies. All model parameters are typically
available in an EDA environment, therefore the model is ideal for
20 Split low-k ( ε = 2.5)
20
rapid crosstalk estimation and signal integrity verification. The
impact of copper wiring and low-k dielectrics on crosstalk is
investigated. It is determined that even these processing
breakthroughs will not provide adequate crosstalk reduction at
10
10
long line lengths beyond the 0.25 µm generation. Designers will
Copper
need to make tradeoffs between noise and wiring density in these
cases. Both copper and low-k dielectrics have their maximum
0
0
10
100
1000
10000
impact on crosstalk in short lines (L < 1 mm). These two
Line Length ( µm)
conclusions make repeaters an attractive alternative to the use of
Fig. 8 Crosstalk reduction due to new materials is primarily achieved long interconnects.
40
0.18 µm inverter, C fan-out small
0.25 µm NAND3, C fan-out large
40
% reduction in peak crosstalk
All low-k ( ε = 2.5)
at short line lengths. Circuits with larger loads benefit more from
new materials according to the charge-sharing principle.
30
% drop in peak crosstalk
25
30
L = 2 mm
Cu + ε = 2.5
25
20
20
15
15
10
5
10
Low-k Stack
"Flat" Wiring
Asymmetric Pitch with 1.5X Spacing
Appendix
The time constants used in eqns. (1) – (4) are defined as follows:
1
2

R (C + C c )+ R v (C v + C c )] 
[
2
τ0 =  a a


− 4 R v R a (C v C c + C v C a + C c C a )

τ1 =
[2R v R a (CvCc + C vCa + CcCa )]
[R a (Ca + Cc )+ R v (Cv + Cc )+ τ0 ]
τ2 =
[2R v R a (C vCc + C v Ca + CcCa )]
[R a (Ca + Cc )+ R v (C v + Cc )− τ0 ]
5
0
0
-5
-5
-10
(A1)
(A2)
(A3)
-10
0.10
0.15
0.20
Technology Generation ( µm)
0.25
Fig. 9 Percentage drop in peak crosstalk at L=2 mm using novel
noise avoidance techniques. “Flat” wiring becomes more beneficial
with technology scaling.
the nominal case for scenarios 2-4 above. It should be noted that
the use of copper provides some degree of crosstalk reduction at
intermediate lengths but very little for long wires or large loads.
This is due to the fact that a lower line resistance for both the
aggressor and victim ultimately has a counteracting effect, with
the victim more tightly held to ground and the aggressor
switching more quickly. At short line lengths, the aggressor
transition time has very little dependency on line resistance (see
References
[1] A. Vittal and M. Marek-Sadowska, “Crosstalk reduction for VLSI,” IEEE
Trans. CAD of Int. Circuits and Systems, v. 16, pp. 290-298, March 1997.
[2] E. Sicard and A. Rubio, “Analysis of crosstalk interference in CMOS integrated
circuits,” IEEE Trans. Emag. Compat., v. 34, pp. 124-129, May 1992.
[3] T. Sakurai, “Closed-form expressions for interconnection delay, coupling and
crosstalk in VLSI,” IEEE Trans. Elec. Dev., v. 40, pp. 118-124, Jan 1993.
[4] T. Sakurai and R. Newton, “Alpha-power law MOSFET model and its
applications to CMOS inverter delay and other formulas,” IEEE Jour. of SolidState Circuits, v. 25, pp. 584-593, April 1990.
[5] M. Shoji, CMOS Digital Circuit Technology, Prentice-Hall, New Jersey, 1988.
[6] N. Weste and K. Eshraghian, Principles of CMOS VLSI Design, AddisonWesley, New York, 1993.
[7] National Technology Roadmap for Semiconductors, Semiconductor Industry
Association, 1997.
[8] J. Silberman et al., “A 1.0 GHz single-issue 64-bit PowerPC integer processor,”
ISSCC Dig. Tech. Papers, 1998, pp. 230-231.
Download