Next Generation Nanotechnoloy Electronic Design

advertisement
Next Generation Carbon Nanotube Based
Electronic Design
Outline
1
2
3
4
5
6
7
Introduction
Carbon Nanotubes (CNT) Interconnect
Buffering CNT Interconnect for Timing Optimization
Preliminary Results for CNT Buffering
Fabrication Variation Aware CNT Buffering
Fabrication Variation Aware CNT based VLSI Synthesis
Conclusion
2
Interconnect Delay Dominates
Copper
3
Copper resistivity (uOhms.cm)
Bottleneck of Prevailing Copper Technology
Copper Resistivity
Copper interconnect technology has its
fundamental physical limit, interconnect
delay due to ever increasing wire
resistivity has greatly limited the circuit
miniaturization.
6
5
4
3
2
1
0
80
65
45
32
22
14
Technology node (nm)
4
Copper Global Interconnect Delay
Driver
Load
Interconnect delay (ps)
Copper Global Interconnect
Technology node
(nm)
Minimum pitch
(nm)
68
210
52
156
1500
40
120
1000
32
96
22
66
18
54
14
42
3500
3000
2500
2000
500
0
68 59 52 45 40 36 32 28 25 22 20 18 16 14
Technology node (nm)
Interconnect RC delay (ps) for a 1 mm length
minimum pitch Cu global wire (ITRS 2007)
5
Carbon Nanomaterial
Carbon Nanotube (CNT) is one of the material of carbon as well as
Graphenes.
Graphenes
CNT
6
Nobel Prize
The Nobel Prize in Physics 2010 was
awarded jointly to Andre Geim and
Konstantin Novoselov “
for groundbreaking experiments
regarding the two-dimensional material
graphene"
7
SWCNTs: Single-Walled Carbon Nanotubes
Single-walled carbon nanotube (SWCNT) can be envisioned as a rolled up
graphene sheet into a seamless cylinder with fullerene caps.
Diameters of SWCNTs are typically 0.5 to 3nm.
CNT lengths range from less than 100 nm to several centimeters
8
Bundled SWCNTs
1nm
0.32nm
Bundled SWCNTs consist of a bundle of parallel single SWCNTs
9
Mean Free Path of CNT
40nm
1000nm
All particles suffer from collisions with other particles such that their path
through space is very short for high densities. This typical path length is called
the mean free path.
The longer mean free path results in smaller resistivity.
Resistivity ๐œŒ ∝
1
๐‘™
10
Advantage of CNT Compared to Cu
Properties
CNT
Cu
Mean Free Path
1000๐‘›๐‘š
40๐‘›๐‘š
Max. Current Density
1010 ๐ด/๐‘๐‘š2
106 ๐ด/๐‘๐‘š2
Thermal Conductivity
6000๐‘Š/๐‘š๐พ
400๐‘Š/๐‘š๐พ
๏‚ง Copper interconnect technology is approaching its fundamental
physical limit, and issues such as electromigration and ever
increasing wire resistivity which has greatly limited the circuit
miniaturization.
๏‚ง Carbon nanotube (CNT) interconnect is a promising replacement
material. CNT has better performance in mean free path,
maximum current density and thermal conductivity.
11
CNT Fabrication Process: Chemical Vapor Deposition
(CVD)
CVD uses carbon precursors from gas phase to form CNTs that takes place at
relatively low temperatures (500–1000 °C), providing great scalability and
controllability. Thus, CVD is regarded as a highly promising CNT growth
technique for the purpose of large-scale synthesis
12
Quartz Wafer-Scale Aligned CNT Growth
Quartz wafer with catalyst
Aligned CNT growth
13
Transfer CNTs from Quartz Wafers to Silicon Wafers
CNT transfer technique using
Thermal Release Tape.
(a) SEM of CNTs on quartz.
(b) 100 nm of Au evaporated on 4”
quartz wafer after CNT growth.
(c) Thermal release tape is applied
to the Au film and the tape/Au
bilayer is peeled off.
(d) SiO2/Si Wafer with transferred
Au after tape release at 120oC.
(e) SEM images of SWNTs
transferred from quartz to 50
nm SiO2 on Si after gold
etching (KI/I2).
(f) (f) Si wafer after substrategated CNFET fabrication.
•
N. Patil, A. Lin, E. Myers, H.-S. P.Wong, and S. Mitra, “Integrated waferscale growth and
transfer of directional carbon nanotubes and misalignedcarbon-nanotube-immune logic
structures,” in Proc. Symp. VLSI Technol., pp. 205–206, 2008.
14
CNT Fabrication Orientation Control
Surface structures of substrate and state of gas flow can
partially decide the growth orientation of SWCNTs under
a suitable temperature range
15
CNT Fabrication Length Control
To selectively grow SWCNT arrays with certain length, one
easy way is confining the spatial termination position of
growing SWCNTs, which means to obstruct the growth of
SWCNTs by instantaneously stopping the catalysts'
activity possibility with additional barriers at a certain
position.
Rogers et al. introduce a layer of amorphous SiO2 onto
quartz surface, and SWCNTs terminated at the edge of the
SiO2 layer because of the surface relief
16
CNT Fabrication Density Control
Keeping catalyst activity; multiple-cycle CVD growth; adding
in new catalysts to grow new SWCNTs
17
CNT Variations in Density
Aligned arrays of CNTs can take full advantage of
the superior transport characteristics of CNTs,
and therefore are considered to be most suitable
for high-performance circuit applications.
The density of variations will impact on the
resistance and capacitance parameters of
bundled CNTs
18
CNT Fabrication Diameter Control
Depend on the diameter of catalysts and Temperature
19
CNT Variation in Diameter
20
Mis-alignment CNT
State-of-the-art CNT growth techniques today are able to
produce CNTs with >99.9% alignment for low density.
In our design, high density CNTs are desired and misalignment becomes an important issue.
21
Outline
1
2
3
4
Introduction
Carbon Nanotubes (CNT) Interconnect
Buffering CNT Interconnect for Timing Optimization
Preliminary Results for CNT Buffering
5
Fabrication Variation Aware CNT Buffering
6
Fabrication Variation Aware CNT based VLSI Synthesis
7
Conclusion
22
Overview of Bundled SWCNTs Circuit Model
Driver
Load
Bundled SWCNTs
interconnect
๐‘…๐‘‘๐‘Ÿ
๐ถ๐‘‘๐‘Ÿ
๐‘…๐‘_๐‘ข๐‘
๐‘…๐‘„ ๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’
2
๐ถ๐ธ ๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’ โˆ™ ๐‘™
2
๐‘…๐‘†๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’ ๐‘™
๐‘…๐‘„ ๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’
2
๐ถ๐ธ ๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’ โˆ™ ๐‘™
2
๐‘…๐‘_๐‘‘๐‘œ๐‘ค๐‘›
๐ถ๐‘™๐‘œ๐‘Ž๐‘‘
23
Quantum Resistance of Single SWCNT
๏ƒ˜ If the size of the structure is of the same scale as the
mean free path of an electron, Ohm’s law may not
apply, there exist quantum effects
๏ƒ˜ Quantum resistance (the lowest possible resistance
of an isolated SWCNT)
โ„Ž
๐‘…๐‘„ = 2 = 6.45๐‘˜Ω
4๐‘’
โ„Ž =6.626×10−34 J·s -- Plank’s constant
๐‘’ =1.602×10−19 Coulombs -- the electronic charge
24
Scattering Resistance of Single SWCNT
๏ƒ˜ Scattering unit resistance
โ„Ž
๐‘…๐‘† = 2 = 6.45๐‘˜Ω/๐œ‡๐‘š, when ๐‘™ > λ
4๐‘’ ๐œ†
For simplicity, define ๐‘…๐‘† = 0, when ๐‘™ ≤ λ
๏ƒ˜ Total resistance of a single SWCNT
๐‘…๐‘–๐‘ ๐‘œ๐‘™๐‘Ž๐‘ก๐‘’๐‘‘ = ๐‘…๐‘„ + ๐‘…๐‘† ๐‘™
โ„Ž =6.626×10−34 J·s -- Plank’s constant
๐‘’ =1.602×10−19 Coulombs -- the electronic charge
๐œ† is the mean free path of electrons for a CNT
๐‘™ is the length of CNT
25
Resistance of Bundled SWCNTs
๏ƒ˜ Total resistance of bundled SWCNTs
๐‘…๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’ = ๐‘…๐‘–๐‘ ๐‘œ๐‘™๐‘Ž๐‘ก๐‘’๐‘‘ /๐‘๐‘๐‘›๐‘ก
๐‘…๐‘–๐‘ ๐‘œ๐‘™๐‘Ž๐‘ก๐‘’๐‘‘ is total resistance of a single
SWCNT
๐‘๐‘๐‘›๐‘ก is the number of SWCNTs
contained in the bundle
For global interconnect, the scattering resistance dominates, thus,
๐‘…๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’ = ๐‘…๐‘–๐‘ ๐‘œ๐‘™๐‘Ž๐‘ก๐‘’๐‘‘ /๐‘๐‘๐‘›๐‘ก =๐‘…๐‘† ๐‘™/๐‘๐‘๐‘›๐‘ก
26
Contact Resistance
Imperfect contacts between copper and carbon nanotubes
CNT interconnect layer
Contact resistance
Some research groups have accomplished to fabricate the contact
resistances ranging from a few hundred ohms to a few kilohms
27
Quantum Capacitance of Single SWCNT and
Bundled SWCNTs
๏ƒ˜ Quantum unit capacitance
๐ถ๐‘„ =
2๐‘’ 2
โ„Ž๐‘ฃ๐‘“
where ๐‘ฃ๐‘“ is the Fermi velocity (๐‘ฃ๐‘“ ≈ 8 × 105 ๐‘š/๐‘ )
๏ƒ˜ Since an SWCNT has four conducting channels, the
net quantum capacitance of a single SWCNT is
๐ถ๐‘„๐ถ๐‘๐‘‡ = 4๐ถ๐‘„
๏ƒ˜ Quantum capacitance of a bundled SWCNTs is
๐ถ๐‘„๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’ = ๐‘๐‘๐‘›๐‘ก ๐ถ๐‘„๐ถ๐‘๐‘‡
28
Electrostatic Capacitance of Single SWCNT and
Bundled SWCNTs
๏ƒ˜ Electrostatic unit capacitance
2๐œ‹๐œ–
๐ถ๐ธ =
−1
๐‘๐‘œ๐‘ โ„Ž
(๐‘ฆ/๐‘‘)
๏ƒ˜ FastCap is used to calculate
the electrostatic capacitance
of each single SWCNT
29
Effective Capacitance of Bundled SWCNTs
The effective capacitance of an SWCNT
bundle interconnect (Cbundle) is given by
the series combination of its electrostatic
capacitance and quantum capacitance.
As shown in left figure, the effective
SWCNT bundle capacitance is nearly
equal to its electrostatic capacitance and
the effect of the quantum capacitance is
small.
N. Srivastava, H. Li, F. Kreupl and K. Banerjee, “On the Applicability of Single-Walled Carbon Nanotubes
as VLSI Interconnects,” IEEE Transactions on Nanotechnology, vol. 8, no. 4, pp. 542–559, 2009.
30
Inductance of Single SWCNT and Bundled SWCNTs
๏ƒ˜Kinetic inductance of single SWCNT
โ„Ž
๐‘–๐‘ ๐‘œ๐‘™๐‘Ž๐‘ก๐‘’๐‘‘
๐ฟ๐พ
= 2
8๐‘’ ๐‘ฃ๐น
๏ƒ˜Magnetic inductance of single SWCNT
๐œ‡
๐‘ฆ
๐‘–๐‘ ๐‘œ๐‘™๐‘Ž๐‘ก๐‘’๐‘‘
−1
๐ฟ๐‘€
=
๐‘๐‘œ๐‘ โ„Ž ( )
2๐œ‹
๐‘‘
๏ƒ˜Inductance of bundled SWCNTs
๐ฟ๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’ =
๐ฟ๐พ
๐‘–๐‘ ๐‘œ๐‘™๐‘Ž๐‘ก๐‘’๐‘‘
+ ๐ฟ๐‘€
๐‘–๐‘ ๐‘œ๐‘™๐‘Ž๐‘ก๐‘’๐‘‘
๐‘๐‘๐‘›๐‘ก
31
Inductive Effect of Bundle SWCNTs is Not Important
1
๐‘…๐‘‘๐‘Ÿ ๐ถ๐‘™ < ๐‘…๐‘™๐ถ๐‘™ < ๐ฟ๐ถ๐‘™
2
๐‘…๐‘‘๐‘Ÿ ๐ถ๐‘™
where Rdr is the driver impedance and R, C and L are
the per unit length interconnect resistance, capacitance
and inductance.
(1/2)๐‘…๐‘™๐ถ๐‘™
๐ฟ๐ถ๐‘™
The above inequality never holds, inductance can be ignored.
N. Srivastava, H. Li, F. Kreupl and K. Banerjee, “On the Applicability of Single-Walled Carbon Nanotubes
as VLSI Interconnects,” IEEE Transactions on Nanotechnology, vol. 8, no. 4, pp. 542–559, 2009.
32
Bundled SWCNTs Interconnect Model
CNT interconnect layer
Driver
Load
Bundled SWCNTs
interconnect
๐‘…๐‘„๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’
๐‘…๐‘‘๐‘Ÿ ๐‘…๐‘,๐‘‘๐‘œ๐‘ค๐‘›๐‘ ๐‘ก๐‘Ÿ๐‘’๐‘Ž๐‘š
2
๐‘…๐‘†
๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’
๐ถ๐‘„๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’
๐ถ๐ธ๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’
๐‘…๐‘†
๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’
๐‘…๐‘†
๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’
๐ถ๐‘„๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’
๐ถ๐‘„๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’
๐ถ๐ธ๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’
๐ถ๐ธ๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’
๐‘…๐‘„๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’
2
๐‘…๐‘,๐‘ข๐‘๐‘ ๐‘ก๐‘Ÿ๐‘’๐‘Ž๐‘š
๐ถ๐‘™๐‘œ๐‘Ž๐‘‘
33
Bundled SWCNT Interconnect π Model
๐‘…๐‘‘๐‘Ÿ
๐ถ๐‘‘๐‘Ÿ
๐‘…๐‘,๐‘‘๐‘œ๐‘ค๐‘›๐‘ ๐‘ก๐‘Ÿ๐‘’๐‘Ž๐‘š
๐‘…๐‘„๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’ + ๐‘…๐‘† ๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’ ๐‘™
๐ถ๐‘„๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’ โˆ™ ๐ถ๐ธ ๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’ โˆ™ ๐‘™
2(๐ถ๐‘„๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’ + ๐ถ๐ธ๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’ )
๐ถ๐‘„๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’ โˆ™ ๐ถ๐ธ ๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’ โˆ™ ๐‘™
2(๐ถ๐‘„๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’ + ๐ถ๐ธ๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’ )
๐‘…๐‘,๐‘ข๐‘๐‘ ๐‘ก๐‘Ÿ๐‘’๐‘Ž๐‘š
๐ถ๐‘™๐‘œ๐‘Ž๐‘‘
34
Bundled SWCNT Global Interconnect Simplified π
Model
๐‘…๐‘‘๐‘Ÿ
๐ถ๐‘‘๐‘Ÿ
๐‘…๐‘†๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’ ๐‘™
๐‘…๐‘,๐‘‘๐‘œ๐‘ค๐‘›๐‘ ๐‘ก๐‘Ÿ๐‘’๐‘Ž๐‘š
๐ถ๐ธ ๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’ โˆ™ ๐‘™
2
๐‘…๐‘,๐‘ข๐‘๐‘ ๐‘ก๐‘Ÿ๐‘’๐‘Ž๐‘š
๐ถ๐ธ ๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’ โˆ™ ๐‘™
2
๐ถ๐‘™๐‘œ๐‘Ž๐‘‘
35
Outline
1
2
3
4
Introduction
Carbon Nanotubes (CNT) Interconnect
Buffering CNT Interconnect for Timing Optimization
Preliminary Results for CNT Buffering
5
Fabrication Variation Aware CNT Buffering
6
Fabrication Variation Aware CNT based VLSI Synthesis
7
Conclusion
36
Delay for A Circuit
Delay = ๏“all Ri ๏“all Cj downstream from Ri Ri*Cj
Elmore delay to n1 R(B)*(C1+C2)
Elmore delay to n2 R(B)*(C1+C2)+R(w)*C2
37
Buffer Insertion For Delay Reduction
38
Why Buffers Can Reduce Delay?
(Rd,Cd)
๐‘Ÿ๐‘™
๐‘…๐‘‘
๐ถ๐‘‘
๐‘๐‘™
2
๐‘Ÿ๐‘™/2
๐‘…๐‘‘
๐‘๐‘™
2
(Rb,Cb)
(Rd,Cd)
(CL)
๐ถ๐ฟ
๐‘๐‘™
๐‘ก๐‘ข๐‘›๐‘๐‘ข๐‘“ = ๐‘…๐‘‘ ๐‘๐‘™ + ๐ถ๐ฟ + ๐‘Ÿ๐‘™( + ๐ถ๐ฟ )
2
Suppose that ๐‘…๐‘‘ = ๐‘…๐‘ , ๐ถ๐‘‘ = ๐ถ๐‘
โˆ†๐‘ก = ๐‘ก๐‘๐‘ข๐‘“ − ๐‘ก๐‘ข๐‘›๐‘๐‘ข๐‘“ = ๐‘…๐‘ ๐ถ๐‘ − ๐‘Ÿ๐‘๐‘™ 2 /4
๐‘๐‘™
4
๐ถ๐‘‘
๐‘ก๐‘๐‘ข๐‘“ = ๐‘…๐‘‘
๐‘๐‘™
4
๐‘…๐‘
๐ถ๐‘
(CL)
๐‘Ÿ๐‘™/2
๐‘๐‘™
4
๐‘๐‘™
4
๐ถ๐ฟ
๐‘๐‘™
๐‘Ÿ๐‘™ ๐‘๐‘™
๐‘๐‘™
๐‘Ÿ๐‘™ ๐‘๐‘™
+ ๐ถ๐‘ +
+ ๐ถ๐‘ + ๐‘…๐‘
+ ๐ถ๐ฟ + ( + ๐ถ๐ฟ )
2
2 4
2
2 4
โˆ†๐‘ก
๐‘™
39
CNT Buffering and Copper Buffering
CNT interconnect layer
Copper interconnect layer
Copper buffering
CNT buffering
40
Existing Works
๏ฑ Some existing works consider the CNT interconnect or buffered
CNT interconnect, however, they only use a two pin mode
interconnect model
๏ฑ None of the existing works consider the deployment of CNT into
VLSI physical design
•
•
•
•
N. Srivastava, H. Li, F. Kreupl and K. Banerjee, “On the Applicability of SingleWalled Carbon Nanotubes as VLSI Interconnects,” IEEE Transactions on
Nanotechnology, vol. 8, no. 4, pp. 542–559, 2009.
A. Nieuwoudt and Y. Massoud, “On the optimal design, performance, and
reliability of future carbon nanotube-based interconnect solutions,” IEEE
Transactions on Electron Devices, vol. 55, no. 8, pp. 2097–2110, 2008.
G. Close and H.-S. Wong, “Assembly and electrical characterization of multiwall
carbon nanotube interconnects,” IEEE Transactions on Nanotechnology, 2008.
A. Naeemi and J. D. Meindl, “Design and performance modeling for singlewall
carbon nanotubes as local, semi-global, and global interconnects in gigascale
integrated systems,” IEEE Transactions on Electron Devices, vol. 54, no. 1, pp.
26–37, 2008.
41
Problem Formulation
Timing Constrained Minimum Cost Buffering for Carbon
Nanotube Interconnects:
Given a binary routing tree with ๐‘› candidate buffer locations in carbon
nanotube interconnect layer, a buffer library and a set of candidate
buffer positions, to compute a buffer assignment solution such that the
timing constraint is satisfied, and the total buffer cost is minimized.
42
Solution Characterization
๏ฎ
To model effect to
downstream, a
candidate solution
is associated with
•
v: a node
•
C: downstream
capacitance
•
Q: required arrival
time
•
W: cumulative
buffer cost
43
Candidate Buffering Solutions
44
Dynamic Programming (DP)
๏‚ง Start from sinks
๏‚ง Candidate solutions
are generated
๏‚ง Three operations
– Add Wire
– Insert Buffer
– Merge
Candidate solutions are
propagated toward the source
๏‚ง Solution Pruning
45
Solution Propagation: Add Buffer
(๐‘„ ๐›พ ′ , ๐ถ ๐›พ ′ , ๐‘Š ๐›พ ′ )
(๐‘„ ๐›พ , ๐ถ ๐›พ , ๐‘Š(๐›พ))
46
Solution Propagation: Add Wire
(๐‘„ ๐›พ๐‘ฃ , ๐ถ ๐›พ๐‘ฃ , ๐‘Š ๐›พ๐‘ฃ )
๐‘ข
(๐‘„ ๐›พ๐‘ข , ๐ถ ๐›พ๐‘ข , ๐‘Š ๐›พ๐‘ข )
๐‘… ๐‘ข, ๐‘ฃ , ๐ถ(๐‘ข, ๐‘ฃ)
๐‘ฃ
47
Solution Propagation: Add Driver
(๐‘„ ๐›พ ′ , ๐ถ ๐›พ ′ , ๐‘Š ๐›พ ′ )
(๐‘„ ๐›พ , ๐ถ ๐›พ , ๐‘Š(๐›พ))
48
Branch Merge
(๐‘„ ๐›พ , ๐ถ ๐›พ , ๐‘Š(๐›พ))
(๐‘„ ๐›พ1 , ๐ถ ๐›พ1 , ๐‘Š ๐›พ1 )
(๐‘„ ๐›พ2 , ๐ถ ๐›พ2 , ๐‘Š ๐›พ2 )
49
Exponential Runtime
16
solutions
8
solutions
4
solutions
2
solutions
n candidate buffer locations lead to 2n solutions
50
Too Many Solutions
๏‚ง Needs solution pruning for acceleration
๏‚ง Two candidate solutions
– (v, c1, q1,w1)
– (v, c2, q2,w2)
๏‚ง Solution 1 is inferior to Solution 2 if
– c1 ๏‚ณ c2 : larger load
– and q1 ๏‚ฃ q2 : tighter timing
– and w1 ๏‚ณw2: larger cost
51
Pruning
(Q1,C1,W1)
inferior/dominated
if C1 ๏‚ณ C2,W1 ๏‚ณ W2
and Q1 ๏‚ฃ Q2
(Q2,C2,W2)
52
Outline
1
2
3
4
5
6
7
Introduction
Carbon Nanotubes (CNT) Interconnect
Buffering CNT Interconnect for Timing Optimization
Preliminary Results for CNT Buffering
Fabrication Variation Aware CNT Buffering
Fabrication Variation Aware CNT based VLSI Synthesis
Conclusion
53
Experimental Environment
The proposed carbon nanotube interconnect based timing
driven minimum cost buffer insertion algorithm is
implemented in C and tested on a machine with 3.40GHz
Intel Pentium CPU and 3GB memory.
Simulation 1: Timing constrained minimum cost buffering
Simulation 2: Timing minimization without considering cost
54
Experimental Setup – Layer RC Information
Cu
Bundled SWCNT
(1000 SWCNTs)
Resistance
(โ„ฆ)
14.50
6.45
Capacitance
(fF)
0.16
0.16
๐œŒ๐‘™
๐ด
CNT density = 1000/(33 × 88) = 0.34๐‘›๐‘š2
๐‘…๐‘๐‘ข =
Contact resistance is set to 100โ„ฆ
55
Experimental Setup – Buffer Library
Property
BUF_X1
BUF_X2
BUF_X4
BUF_X8
BUF_X16
Resistance (โ„ฆ)
2310.0
1201.0
618.9
315.5
159.6
๏ดIntrinsic delay (ps)
0.21
Capacitance (fF)
Linear fitting 2.93
Area (nm2)
15197.6
Property
INV_X1
๏ด ๏€ฝ RC1846.0
๏€ซ๏ด 0
Resistance (โ„ฆ)
0.44Reference
0.88
driver
A2.91
B
2.87
30395.2
60790.4
INV_X2
INV_X4
Buffer w/ unknown
3.51
capacitance
1.76
2.87
121580.8
INV_X8
Reference driver
C
A’
976.5
514.8 B’ 270.2ref
2.87 C
243161.6
INV_X16
139.7 C’
Capacitance (fF)
0.44
0.87
1.74
3.49
6.97
Intrinsic delay (ps)
0.59
0.62
0.61
0.61
0.61
Area (nm2)
C
10115.6
20231.2
40462.4
80924.8
161849.6
56
Experimental Setup – Global Nets
Our experiments are performed to 500 global nets.
Due to the lack of industrial nets in 22nm technology, we
scale wire lengths of old technology nets to 22nm
technology.
57
Experiment 1
Timing constrained minimum cost buffering
Timing minimization without considering cost
58
Experiment 1 Results of 500 Nets On Average
(Normalized)
Chart Title
1.2
1
Ratio
0.8
0.6
0.4
0.2
0
CNT w/o contact
resistance
Area
# Buffers
CNT w/ 100 Ohms
contact resistance
Delay
# Solutions
Cu
CPU
59
Experiment 1 Results on Five Representative Nets
Test cases
CNT w/o contact
resistance
CNT w/ contact
resistance
(100โ„ฆ)
Cu
1
2
3
4
5
Average
Area (nm2)
318666.0
394605.0
222543.0
50578.0
40462.4
205370.88
# Buffers
7
9
5
3
2
5.2
Delays (ps)
754
1128
676
1019
722
859.8
Area (nm2)
379359.0
263151.0
222543.0
80924.8
40462.4
197288.04
# Buffers
7
9
5
4
2
5.4
Delays (ps)
762
1130
691
927
736
849.2
Area (nm2)
955997.0
612308.0
475433.0
202312.0
91040.4
467418.08
# Buffers
18
19
12
10
5
15.0
Delays (ps)
766
1180
702
994
870
902.4
60
Area and Delay Trade-off Curves for Cu and CNT
61
Experiment 2
Timing constrained minimum cost buffering
Timing minimization without considering cost
62
Experiment 2 Results on Five Representative Nets
Test cases
CNT w/o
contact
resistance
CNT w/
contact
resistance
(100โ„ฆ)
Cu
1
2
3
4
5
Average
Area
(nm2)
3307950.0
3738560.0
2477160.0
3039520.0
1945290.0
2913844.00
# Buffers
50
61
44
44
32
46.0
Delays
(ps)
376
724
314
249
188
370.2
Area
(nm2)
1463910.0
1995870.0
1408250.0
1458970.0
1094230.0
1555162.00
# Buffers
36
36
31
24
18
30.8
Delays
(ps)
423
797
347
302
229
419.6
Area
(nm2)
2851920.0
3794220.0
2269040.0
2872350.0
2142860.0
3040382.00
# Buffers
65
67
56
48
36
59.4
Delays
(ps)
479
877
382
363
276
475.4
63
Observations
๏ฑ In order to achieve the similar delay, the CNT buffering saves
more than 50% buffer area over copper buffering
๏ฑ The total number of buffers in CNT buffering is much (about 2X)
smaller than that of copper buffering thanks to the fact that wire
resistivity of bundled SWCNTs is much lower than that of copper
for global interconnect
๏ฑ The contact resistance does not have significant impact on the
performance for CNT interconnect timing constrained minimum
cost buffering
๏ฑ CNT buffering always outperforms the copper buffering in terms
of timing and buffer area
๏ฑ CNT buffering can reduce timing by up to 32% comparing to
copper buffering for buffering timing minimization without
considering cost
64
Summary of CNT Buffering
๏ƒ˜ Carbon nanotube interconnects have become a
promising replacement material for copper
interconnects thanks to their superior conductivity.
๏ƒ˜ This work develops the first timing driven buffer
insertion technique for carbon nanotube interconnects.
๏ƒ˜ In the experimental results, it demonstrates that with
the same timing constraint, CNT buffering can save
over 50% buffer area compared to copper buffering. In
addition, CNT buffering can effectively reduce the delay
by up to 32% without considering cost.
๏ƒ˜ This work is published in ISVLSI2014.
Lin Liu, Yuchen Zhou and Shiyan Hu, “Buffering Single Walled Carbon Nanotube
Bundle Interconnects for Timing Optimization”, to appear in Proceedings of IEEE
Computer Society Annual Symposium on VLSI (ISVLSI), 2014.
65
Outline
1
2
3
4
5
6
7
Introduction
Carbon Nanotubes (CNT) Interconnect
Buffering CNT Interconnect for Timing Optimization
Preliminary Results for CNT Buffering
Fabrication Variation Aware CNT Buffering
Fabrication Variation Aware CNT based VLSI Synthesis
Conclusion
66
Fabrication Imperfectness
• It is very difficult for today’s CNT
processing to produce perfect CNTs.
• New design techniques must be
employed that are immune to these
inherent CNT imperfections.
• These new design techniques must be
compatible with VLSI processing, and
must have minimal impact on existing
VLSI design flows.
67
Existing Works
๏ฑ Some research works studied the variations of CNT and some
proposed robust design considering CNFET
๏ฑ Some research works studied the copper variation based buffer
insetion
•
•
•
•
•
Jie Zhang, et al, "Carbon Nanotube Robust Digital VLSI," Computer-Aided Design of Integrated
Circuits and Systems, IEEE Transactions on , vol.31, no.4, pp.453,471, April 2012
Patil, N.; Lin, A.; Myers, E.R.; Ryu, Koungmin; Badmaev, A.; Chongwu Zhou; Wong, H. -S P;
Mitra, S, "Wafer-Scale Growth and Transfer of Aligned Single-Walled Carbon Nanotubes,"
Nanotechnology, IEEE Transactions on , vol.8, no.4, pp.498,504, July 2009
Jie Zhang; Patil, N.P.; Hazeghi, A.; Wong, H. -S P; Mitra, S, "Characterization and Design of Logic
Circuits in the Presence of Carbon Nanotube Density Variations," Computer-Aided Design of
Integrated Circuits and Systems, IEEE Transactions on , vol.30, no.8, pp.1103,1113, Aug. 2011
Raychowdhury, A.; De, V.K.; Kurtin, Juanita; Borkar, S.Y.; Roy, K.; Keshavarzi, A., "Variation
Tolerance in a Multichannel Carbon-Nanotube Transistor for High-Speed Digital Circuits,"
Electron Devices, IEEE Transactions on , vol.56, no.3, pp.383,392, March 2009
Jinjun Xiong; Lei He, "Probabilistic Transitive-Closure Ordering and Its Application on Variational
Buffer Insertion," Computer-Aided Design of Integrated Circuits and Systems, IEEE Transactions
on , vol.26, no.4, pp.739,742, April 2007
68
CNT Fabrication Variations
Distribution
CNT diameter
variations (d)
Normal
CNT length
variations (l)
Normal
CNT counts (Ncnt)
Normal
๐œ‡๐‘‘ = 1.3๐‘›๐‘š
๐œŽ๐‘‘ = 0.2๐‘›๐‘š
๐œ‡๐‘™
๐œŽ๐‘™ =
๐œ‡๐‘™
cos 10°
๐œ‡๐‘๐‘๐‘›๐‘ก = 1000
๐œŽ๐‘๐‘๐‘›๐‘ก = 22
CNT distance to
ground (y)
Normal
๐œ‡๐‘ฆ
๐œŽ๐‘ฆ = 0.1๐‘›๐‘š
Jie Zhang; Patil, N.P.; Hazeghi, A.; Wong, H. -S P; Mitra, S, "Characterization and Design of Logic
Circuits in the Presence of Carbon Nanotube Density Variations," Computer-Aided Design of Integrated
Circuits and Systems, IEEE Transactions on , vol.30, no.8, pp.1103,1113, Aug. 2011
http://www.sciencedirect.com/science/article/pii/S1369702113000205
71
Parameters Variations
๐‘…๐‘‘๐‘Ÿ
๐‘…๐‘†๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’ ๐‘™
๐‘…๐‘,๐‘‘๐‘œ๐‘ค๐‘›๐‘ ๐‘ก๐‘Ÿ๐‘’๐‘Ž๐‘š
๐ถ๐ธ ๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’ โˆ™ ๐‘™
2
๐ถ๐‘‘๐‘Ÿ
๐‘…๐‘,๐‘ข๐‘๐‘ ๐‘ก๐‘Ÿ๐‘’๐‘Ž๐‘š
๐ถ๐ธ ๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’ โˆ™ ๐‘™
2
๐ถ๐‘™๐‘œ๐‘Ž๐‘‘
๐‘™, ๐‘๐‘๐‘›๐‘ก , ๐‘ฆ and ๐‘‘ follow normal distribution
In a SWCNT bundle that contains large amount of CNTs
in parallel; according to Central Limit Theorem, we have
๐‘…๐‘†๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’ ~๐‘ ๐œ‡๐‘…๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’ , ๐œŽ๐‘…๐‘๐‘ข๐‘›๐‘‘๐‘™๐‘’ 2
๐‘†
๐ถ๐ธ ~๐‘(๐œ‡๐ถ๐ธ , ๐œŽ๐ถ๐ธ
2)
๐‘†
70
Uncertainty Aware Design Idea
1. According to the distribution of each variable, generate
large amount of test cases with deterministic values;
2. For each deterministic test case, perform the
deterministic algorithm;
3. If the yield rate is large enough, the current solution is
return; else generate new test cases and repeat 2.
71
Uncertainty Aware Design on CNT Buffering
Algorithm
Given the maximum and minimum of each variable
Generation the value of the variables by
๐‘‰=๐›ฝ๐‘‰๐‘š๐‘Ž๐‘ฅ+(1−๐›ฝ)๐‘‰๐‘š๐‘–๐‘›, ๐›ฝ is initialized to 0
Generate large amount of test cases given normal
distribution of each variables
Run the deterministic CNT buffering algorithm
Timing constraints yield
rate is satisfied
No
Update ๐›ฝ = ๐›ฝ + โˆ†
Yes
Return the buffering solution
72
Outline
1
2
3
4
5
6
7
Introduction
Carbon Nanotubes (CNT) Interconnect
Buffering CNT Interconnect for Timing Optimization
Preliminary Results for CNT Buffering
Fabrication Variation Aware CNT Buffering
Fabrication Variation Aware CNT based VLSI Synthesis
Conclusion
73
Traditional Physical Synthesis of Copper Based
Design
Given: nets, a set of cells and constraints of timing, power
and area.
Placement: determine the physical location of each cell
Routing: given a placement solution, determine the
routing path of every cell
Buffer insertion: given a routing solution, insert buffer to
reduce the delay
Layer assignment: determine the layer of interconnects
74
Fabrication Variation Aware CNT based VLSI Physical
Synthesis
Placement
Routing
CNT Co-design
Layer Assignment
No
Converged?
CNT Buffering
Yes
Output
75
Cross-Entropy (CE) Method
CE is developed as an efficient estimation technique for
rare-event probabilities in discrete event simulation systems
and is adapted for use in optimization.
Two phases of CE:
1.Generate a random data sample according to a specified
mechanism.
2.Update the parameters of the random mechanism based
on the data to produce a "better" sample in the next
iteration.
76
Placement: SimPL
Initial Placement
Global Placement
Post Global
Placement
Uniformly Distributed
Placement
Look-ahead
Legalization (Upper
Bounds)
Last Upper-bound
Placement
Pseudonets linking
each cell to its
legalized location
Final Legalization
and Detailed
Placement
B2B Graph Update
Linear System
(Lower Bounds)
Legal Placement
Netlist ๏ƒ  Graph
(B2B Net Model)
Linear System
No
Converged?
(Δ HPWL)
Yes
No
Converged?
(Gap+ΔHPWL)
Yes
77
Initialize PDF for each cell
Cross-Entropy Based
Flowchart
Pull cell to available placement according to
PDF of each cell using Monte-Carlo method
Update PDF for
cells
associated with
survivals
Keep Samples
With Smaller
Gaps
Sample 1 (Upper
Bounds)
Sample 2 (Upper
Bounds)
Sample N (Upper
Bounds)
B2B Graph Update
Linear System
(Lower Bounds)
B2B Graph Update
Linear System
(Lower Bounds)
B2B Graph Update
Linear System (Lower
Bounds)
Copper based
Routing
Copper based
Routing
Copper based Routing
CNT Co-Design
Layer Assignment
CNT Co-Design Layer
Assignment
CNT Co-Design Layer
Assignment
CNT Buffering
CNT Buffering
CNT Buffering
Yes
No
Converged?
Finish
78
CNT for Security?
๏‚ง CNT technologies can be utilized to design Physically
Unclonable Function (PUF) for security applications.
๏‚ง Based on a physical system (e.g. random variation during
an IC fabrication process)
๏‚ง For use in crypto applications
๏‚ง Easy to construct and evaluate a PUF
๏‚ง Very hard (“impossible”) to produce two PUFs with similar
challenge-response behavior
Challenge
PUF
Response
79
One Example
C
D
Q
0
x
No change
1
1
1
1
0
0
1
0
1 0
D
1 0
Q
1
0
C
If top path is faster, the D = 0, C = 1, output Q = 0;
If bottom path is faster, the D = 1, C = 0, output Q remains at 1.
The fabrication variation will generate unpredictable and random output.
80
Basic Properties of PUF
๏‚ง Minimum requirements:
๏‚ง For two random PUFs, difference between expected
responses to same challenge, should be large
๏‚ง For single random PUF, difference between two
measured responses to same challenge should be
small
๏‚ง For single random PUF, uncertainty about response to
challenge is large
81
PUFs Classification
๏‚ง Delay based intrinsic PUFs
๏‚ง Utilize the propagation delay between identical
circuits in order to derive a response
๏‚ง e.g. Arbiter PUF, Ring oscillator PUF
๏‚ง Memory based intrinsic PUFs
๏‚ง Produce an output response based on the
unpredictable startup state of feedback-based
CMOS memory structures
๏‚ง e.g. SRAM PUF
82
Arbiter PUF
๏‚ง Initial design
๏‚ง switch block: e.g. two MUX
๏‚ง arbiter: e.g. a latch or a flip-flop
๏‚ง n switch blocks ๏ƒ  2n “different” delays
Lee, J.W.; Lim, D.; Gassend, B.; Suh, G.E.; van Dijk, M.; Devadas, S., "A technique to
build a secret key in integrated circuits for identification and authentication applications," in
VLSI Circuits, 2004. Digest of Technical Papers. 2004 Symposium on , pp.176-179, 2004
83
SRAM PUF
๏‚ง Which state right after power-up?
๏‚ง depends on physical mismatch
between M2 and M4
๐‘„
๐‘ธ
1
0
0
1
2 possible stable states
J. Guajardo, S. S. Kumar, G. 1. Schrijen , and P. Tuyls, "FPGA Intrinsic
PUFs and Their Use for IP Protection ," in CHES, 2007, pp. 63-80.
84
CNT Fabrication Process
CNT fabrication process at the Future Carbon GmbH in Bayeruth, Germany
85
CNT Density Variation
๏‚ง CNT density is defined as the CNT count per unit width (1um).
๏‚ง CNT density variation is caused by randomness of CNT
manufacturing process.
๏‚ง Spacing between alligned CNTs varies significantly, leading to huge
CNT density variation.
(i-1,j)
(i,j)
(i-1,j+1)
(i,j+1)
(i+1,j)
Bundled CNT
interconnect
(i+1,j+1)
86
Parallel Bundled SWCNTs Unit
A
Timing
B comparator
Parallel
Bundled
SWCNTs Unit
Output 1/0
If signal A arrives first, the output is 1; if signal B arrives first, the output is 0.
The two set of bundled SWCNTs are generated in exactly same environment.
87
Using Bundled SWCNT Interconnects to Design PUF
Challenge
Parallel
Bundled
SWCNTs
Unit
Parallel
Bundled
SWCNTs
Unit
Parallel
Bundled
SWCNTs
Unit
Parallel
Bundled
SWCNTs
Unit
Response
88
CNFET Based PUF
CNT based PUFs aim to achieve better reliability, low energy and power
consumption compared to that of silicon based PUFs.
89
PUFs Properties
Challenge x
๏‚ง Evaluatable
PUF
Response y
๏‚ง y = PUF (x) is easy
๏‚ง Unique
๏‚ง PUF(x) contains some unique information
๏‚ง Reproducible
๏‚ง PUF() has only small error
๏‚ง Unclonable
๏‚ง Hard to make PUF’(x) given PUF(x)
๏‚ง Unpredictable
๏‚ง Hard to find yN=PUF(xN) given other x, y pairs
๏‚ง One-way
๏‚ง Given y and PUF(), cannot find x
๏‚ง Tamper evident
๏‚ง Tampering changes PUF()
90
Conclusion
๏ฑ Carbon nanotube interconnects have become a
promising replacement material for copper
interconnects thanks to their superior conductivity.
๏ฑ We develop the first carbon nanotubes based
fabrication variation aware VLSI physical synthesis
targeting timing optimization.
๏ฑ We develop the first timing driven buffer insertion
technique for carbon nanotube interconnects.
๏ฑ Uncertainty aware method and probabilistic method
are proposed to handle fabrication variations.
๏ฑ CNT technologies can be used for security
applications (PUF designs).
91
92
Download