Next Generation Carbon Nanotube Based Electronic Design Outline 1 2 3 4 5 6 7 Introduction Carbon Nanotubes (CNT) Interconnect Buffering CNT Interconnect for Timing Optimization Preliminary Results for CNT Buffering Fabrication Variation Aware CNT Buffering Fabrication Variation Aware CNT based VLSI Synthesis Conclusion 2 Interconnect Delay Dominates Copper 3 Copper resistivity (uOhms.cm) Bottleneck of Prevailing Copper Technology Copper Resistivity Copper interconnect technology has its fundamental physical limit, interconnect delay due to ever increasing wire resistivity has greatly limited the circuit miniaturization. 6 5 4 3 2 1 0 80 65 45 32 22 14 Technology node (nm) 4 Copper Global Interconnect Delay Driver Load Interconnect delay (ps) Copper Global Interconnect Technology node (nm) Minimum pitch (nm) 68 210 52 156 1500 40 120 1000 32 96 22 66 18 54 14 42 3500 3000 2500 2000 500 0 68 59 52 45 40 36 32 28 25 22 20 18 16 14 Technology node (nm) Interconnect RC delay (ps) for a 1 mm length minimum pitch Cu global wire (ITRS 2007) 5 Carbon Nanomaterial Carbon Nanotube (CNT) is one of the material of carbon as well as Graphenes. Graphenes CNT 6 Nobel Prize The Nobel Prize in Physics 2010 was awarded jointly to Andre Geim and Konstantin Novoselov “ for groundbreaking experiments regarding the two-dimensional material graphene" 7 SWCNTs: Single-Walled Carbon Nanotubes Single-walled carbon nanotube (SWCNT) can be envisioned as a rolled up graphene sheet into a seamless cylinder with fullerene caps. Diameters of SWCNTs are typically 0.5 to 3nm. CNT lengths range from less than 100 nm to several centimeters 8 Bundled SWCNTs 1nm 0.32nm Bundled SWCNTs consist of a bundle of parallel single SWCNTs 9 Mean Free Path of CNT 40nm 1000nm All particles suffer from collisions with other particles such that their path through space is very short for high densities. This typical path length is called the mean free path. The longer mean free path results in smaller resistivity. Resistivity ๐ ∝ 1 ๐ 10 Advantage of CNT Compared to Cu Properties CNT Cu Mean Free Path 1000๐๐ 40๐๐ Max. Current Density 1010 ๐ด/๐๐2 106 ๐ด/๐๐2 Thermal Conductivity 6000๐/๐๐พ 400๐/๐๐พ ๏ง Copper interconnect technology is approaching its fundamental physical limit, and issues such as electromigration and ever increasing wire resistivity which has greatly limited the circuit miniaturization. ๏ง Carbon nanotube (CNT) interconnect is a promising replacement material. CNT has better performance in mean free path, maximum current density and thermal conductivity. 11 CNT Fabrication Process: Chemical Vapor Deposition (CVD) CVD uses carbon precursors from gas phase to form CNTs that takes place at relatively low temperatures (500–1000 °C), providing great scalability and controllability. Thus, CVD is regarded as a highly promising CNT growth technique for the purpose of large-scale synthesis 12 Quartz Wafer-Scale Aligned CNT Growth Quartz wafer with catalyst Aligned CNT growth 13 Transfer CNTs from Quartz Wafers to Silicon Wafers CNT transfer technique using Thermal Release Tape. (a) SEM of CNTs on quartz. (b) 100 nm of Au evaporated on 4” quartz wafer after CNT growth. (c) Thermal release tape is applied to the Au film and the tape/Au bilayer is peeled off. (d) SiO2/Si Wafer with transferred Au after tape release at 120oC. (e) SEM images of SWNTs transferred from quartz to 50 nm SiO2 on Si after gold etching (KI/I2). (f) (f) Si wafer after substrategated CNFET fabrication. • N. Patil, A. Lin, E. Myers, H.-S. P.Wong, and S. Mitra, “Integrated waferscale growth and transfer of directional carbon nanotubes and misalignedcarbon-nanotube-immune logic structures,” in Proc. Symp. VLSI Technol., pp. 205–206, 2008. 14 CNT Fabrication Orientation Control Surface structures of substrate and state of gas flow can partially decide the growth orientation of SWCNTs under a suitable temperature range 15 CNT Fabrication Length Control To selectively grow SWCNT arrays with certain length, one easy way is confining the spatial termination position of growing SWCNTs, which means to obstruct the growth of SWCNTs by instantaneously stopping the catalysts' activity possibility with additional barriers at a certain position. Rogers et al. introduce a layer of amorphous SiO2 onto quartz surface, and SWCNTs terminated at the edge of the SiO2 layer because of the surface relief 16 CNT Fabrication Density Control Keeping catalyst activity; multiple-cycle CVD growth; adding in new catalysts to grow new SWCNTs 17 CNT Variations in Density Aligned arrays of CNTs can take full advantage of the superior transport characteristics of CNTs, and therefore are considered to be most suitable for high-performance circuit applications. The density of variations will impact on the resistance and capacitance parameters of bundled CNTs 18 CNT Fabrication Diameter Control Depend on the diameter of catalysts and Temperature 19 CNT Variation in Diameter 20 Mis-alignment CNT State-of-the-art CNT growth techniques today are able to produce CNTs with >99.9% alignment for low density. In our design, high density CNTs are desired and misalignment becomes an important issue. 21 Outline 1 2 3 4 Introduction Carbon Nanotubes (CNT) Interconnect Buffering CNT Interconnect for Timing Optimization Preliminary Results for CNT Buffering 5 Fabrication Variation Aware CNT Buffering 6 Fabrication Variation Aware CNT based VLSI Synthesis 7 Conclusion 22 Overview of Bundled SWCNTs Circuit Model Driver Load Bundled SWCNTs interconnect ๐ ๐๐ ๐ถ๐๐ ๐ ๐_๐ข๐ ๐ ๐ ๐๐ข๐๐๐๐ 2 ๐ถ๐ธ ๐๐ข๐๐๐๐ โ ๐ 2 ๐ ๐๐๐ข๐๐๐๐ ๐ ๐ ๐ ๐๐ข๐๐๐๐ 2 ๐ถ๐ธ ๐๐ข๐๐๐๐ โ ๐ 2 ๐ ๐_๐๐๐ค๐ ๐ถ๐๐๐๐ 23 Quantum Resistance of Single SWCNT ๏ If the size of the structure is of the same scale as the mean free path of an electron, Ohm’s law may not apply, there exist quantum effects ๏ Quantum resistance (the lowest possible resistance of an isolated SWCNT) โ ๐ ๐ = 2 = 6.45๐Ω 4๐ โ =6.626×10−34 J·s -- Plank’s constant ๐ =1.602×10−19 Coulombs -- the electronic charge 24 Scattering Resistance of Single SWCNT ๏ Scattering unit resistance โ ๐ ๐ = 2 = 6.45๐Ω/๐๐, when ๐ > λ 4๐ ๐ For simplicity, define ๐ ๐ = 0, when ๐ ≤ λ ๏ Total resistance of a single SWCNT ๐ ๐๐ ๐๐๐๐ก๐๐ = ๐ ๐ + ๐ ๐ ๐ โ =6.626×10−34 J·s -- Plank’s constant ๐ =1.602×10−19 Coulombs -- the electronic charge ๐ is the mean free path of electrons for a CNT ๐ is the length of CNT 25 Resistance of Bundled SWCNTs ๏ Total resistance of bundled SWCNTs ๐ ๐๐ข๐๐๐๐ = ๐ ๐๐ ๐๐๐๐ก๐๐ /๐๐๐๐ก ๐ ๐๐ ๐๐๐๐ก๐๐ is total resistance of a single SWCNT ๐๐๐๐ก is the number of SWCNTs contained in the bundle For global interconnect, the scattering resistance dominates, thus, ๐ ๐๐ข๐๐๐๐ = ๐ ๐๐ ๐๐๐๐ก๐๐ /๐๐๐๐ก =๐ ๐ ๐/๐๐๐๐ก 26 Contact Resistance Imperfect contacts between copper and carbon nanotubes CNT interconnect layer Contact resistance Some research groups have accomplished to fabricate the contact resistances ranging from a few hundred ohms to a few kilohms 27 Quantum Capacitance of Single SWCNT and Bundled SWCNTs ๏ Quantum unit capacitance ๐ถ๐ = 2๐ 2 โ๐ฃ๐ where ๐ฃ๐ is the Fermi velocity (๐ฃ๐ ≈ 8 × 105 ๐/๐ ) ๏ Since an SWCNT has four conducting channels, the net quantum capacitance of a single SWCNT is ๐ถ๐๐ถ๐๐ = 4๐ถ๐ ๏ Quantum capacitance of a bundled SWCNTs is ๐ถ๐๐๐ข๐๐๐๐ = ๐๐๐๐ก ๐ถ๐๐ถ๐๐ 28 Electrostatic Capacitance of Single SWCNT and Bundled SWCNTs ๏ Electrostatic unit capacitance 2๐๐ ๐ถ๐ธ = −1 ๐๐๐ โ (๐ฆ/๐) ๏ FastCap is used to calculate the electrostatic capacitance of each single SWCNT 29 Effective Capacitance of Bundled SWCNTs The effective capacitance of an SWCNT bundle interconnect (Cbundle) is given by the series combination of its electrostatic capacitance and quantum capacitance. As shown in left figure, the effective SWCNT bundle capacitance is nearly equal to its electrostatic capacitance and the effect of the quantum capacitance is small. N. Srivastava, H. Li, F. Kreupl and K. Banerjee, “On the Applicability of Single-Walled Carbon Nanotubes as VLSI Interconnects,” IEEE Transactions on Nanotechnology, vol. 8, no. 4, pp. 542–559, 2009. 30 Inductance of Single SWCNT and Bundled SWCNTs ๏Kinetic inductance of single SWCNT โ ๐๐ ๐๐๐๐ก๐๐ ๐ฟ๐พ = 2 8๐ ๐ฃ๐น ๏Magnetic inductance of single SWCNT ๐ ๐ฆ ๐๐ ๐๐๐๐ก๐๐ −1 ๐ฟ๐ = ๐๐๐ โ ( ) 2๐ ๐ ๏Inductance of bundled SWCNTs ๐ฟ๐๐ข๐๐๐๐ = ๐ฟ๐พ ๐๐ ๐๐๐๐ก๐๐ + ๐ฟ๐ ๐๐ ๐๐๐๐ก๐๐ ๐๐๐๐ก 31 Inductive Effect of Bundle SWCNTs is Not Important 1 ๐ ๐๐ ๐ถ๐ < ๐ ๐๐ถ๐ < ๐ฟ๐ถ๐ 2 ๐ ๐๐ ๐ถ๐ where Rdr is the driver impedance and R, C and L are the per unit length interconnect resistance, capacitance and inductance. (1/2)๐ ๐๐ถ๐ ๐ฟ๐ถ๐ The above inequality never holds, inductance can be ignored. N. Srivastava, H. Li, F. Kreupl and K. Banerjee, “On the Applicability of Single-Walled Carbon Nanotubes as VLSI Interconnects,” IEEE Transactions on Nanotechnology, vol. 8, no. 4, pp. 542–559, 2009. 32 Bundled SWCNTs Interconnect Model CNT interconnect layer Driver Load Bundled SWCNTs interconnect ๐ ๐๐๐ข๐๐๐๐ ๐ ๐๐ ๐ ๐,๐๐๐ค๐๐ ๐ก๐๐๐๐ 2 ๐ ๐ ๐๐ข๐๐๐๐ ๐ถ๐๐๐ข๐๐๐๐ ๐ถ๐ธ๐๐ข๐๐๐๐ ๐ ๐ ๐๐ข๐๐๐๐ ๐ ๐ ๐๐ข๐๐๐๐ ๐ถ๐๐๐ข๐๐๐๐ ๐ถ๐๐๐ข๐๐๐๐ ๐ถ๐ธ๐๐ข๐๐๐๐ ๐ถ๐ธ๐๐ข๐๐๐๐ ๐ ๐๐๐ข๐๐๐๐ 2 ๐ ๐,๐ข๐๐ ๐ก๐๐๐๐ ๐ถ๐๐๐๐ 33 Bundled SWCNT Interconnect π Model ๐ ๐๐ ๐ถ๐๐ ๐ ๐,๐๐๐ค๐๐ ๐ก๐๐๐๐ ๐ ๐๐๐ข๐๐๐๐ + ๐ ๐ ๐๐ข๐๐๐๐ ๐ ๐ถ๐๐๐ข๐๐๐๐ โ ๐ถ๐ธ ๐๐ข๐๐๐๐ โ ๐ 2(๐ถ๐๐๐ข๐๐๐๐ + ๐ถ๐ธ๐๐ข๐๐๐๐ ) ๐ถ๐๐๐ข๐๐๐๐ โ ๐ถ๐ธ ๐๐ข๐๐๐๐ โ ๐ 2(๐ถ๐๐๐ข๐๐๐๐ + ๐ถ๐ธ๐๐ข๐๐๐๐ ) ๐ ๐,๐ข๐๐ ๐ก๐๐๐๐ ๐ถ๐๐๐๐ 34 Bundled SWCNT Global Interconnect Simplified π Model ๐ ๐๐ ๐ถ๐๐ ๐ ๐๐๐ข๐๐๐๐ ๐ ๐ ๐,๐๐๐ค๐๐ ๐ก๐๐๐๐ ๐ถ๐ธ ๐๐ข๐๐๐๐ โ ๐ 2 ๐ ๐,๐ข๐๐ ๐ก๐๐๐๐ ๐ถ๐ธ ๐๐ข๐๐๐๐ โ ๐ 2 ๐ถ๐๐๐๐ 35 Outline 1 2 3 4 Introduction Carbon Nanotubes (CNT) Interconnect Buffering CNT Interconnect for Timing Optimization Preliminary Results for CNT Buffering 5 Fabrication Variation Aware CNT Buffering 6 Fabrication Variation Aware CNT based VLSI Synthesis 7 Conclusion 36 Delay for A Circuit Delay = ๏all Ri ๏all Cj downstream from Ri Ri*Cj Elmore delay to n1 R(B)*(C1+C2) Elmore delay to n2 R(B)*(C1+C2)+R(w)*C2 37 Buffer Insertion For Delay Reduction 38 Why Buffers Can Reduce Delay? (Rd,Cd) ๐๐ ๐ ๐ ๐ถ๐ ๐๐ 2 ๐๐/2 ๐ ๐ ๐๐ 2 (Rb,Cb) (Rd,Cd) (CL) ๐ถ๐ฟ ๐๐ ๐ก๐ข๐๐๐ข๐ = ๐ ๐ ๐๐ + ๐ถ๐ฟ + ๐๐( + ๐ถ๐ฟ ) 2 Suppose that ๐ ๐ = ๐ ๐ , ๐ถ๐ = ๐ถ๐ โ๐ก = ๐ก๐๐ข๐ − ๐ก๐ข๐๐๐ข๐ = ๐ ๐ ๐ถ๐ − ๐๐๐ 2 /4 ๐๐ 4 ๐ถ๐ ๐ก๐๐ข๐ = ๐ ๐ ๐๐ 4 ๐ ๐ ๐ถ๐ (CL) ๐๐/2 ๐๐ 4 ๐๐ 4 ๐ถ๐ฟ ๐๐ ๐๐ ๐๐ ๐๐ ๐๐ ๐๐ + ๐ถ๐ + + ๐ถ๐ + ๐ ๐ + ๐ถ๐ฟ + ( + ๐ถ๐ฟ ) 2 2 4 2 2 4 โ๐ก ๐ 39 CNT Buffering and Copper Buffering CNT interconnect layer Copper interconnect layer Copper buffering CNT buffering 40 Existing Works ๏ฑ Some existing works consider the CNT interconnect or buffered CNT interconnect, however, they only use a two pin mode interconnect model ๏ฑ None of the existing works consider the deployment of CNT into VLSI physical design • • • • N. Srivastava, H. Li, F. Kreupl and K. Banerjee, “On the Applicability of SingleWalled Carbon Nanotubes as VLSI Interconnects,” IEEE Transactions on Nanotechnology, vol. 8, no. 4, pp. 542–559, 2009. A. Nieuwoudt and Y. Massoud, “On the optimal design, performance, and reliability of future carbon nanotube-based interconnect solutions,” IEEE Transactions on Electron Devices, vol. 55, no. 8, pp. 2097–2110, 2008. G. Close and H.-S. Wong, “Assembly and electrical characterization of multiwall carbon nanotube interconnects,” IEEE Transactions on Nanotechnology, 2008. A. Naeemi and J. D. Meindl, “Design and performance modeling for singlewall carbon nanotubes as local, semi-global, and global interconnects in gigascale integrated systems,” IEEE Transactions on Electron Devices, vol. 54, no. 1, pp. 26–37, 2008. 41 Problem Formulation Timing Constrained Minimum Cost Buffering for Carbon Nanotube Interconnects: Given a binary routing tree with ๐ candidate buffer locations in carbon nanotube interconnect layer, a buffer library and a set of candidate buffer positions, to compute a buffer assignment solution such that the timing constraint is satisfied, and the total buffer cost is minimized. 42 Solution Characterization ๏ฎ To model effect to downstream, a candidate solution is associated with • v: a node • C: downstream capacitance • Q: required arrival time • W: cumulative buffer cost 43 Candidate Buffering Solutions 44 Dynamic Programming (DP) ๏ง Start from sinks ๏ง Candidate solutions are generated ๏ง Three operations – Add Wire – Insert Buffer – Merge Candidate solutions are propagated toward the source ๏ง Solution Pruning 45 Solution Propagation: Add Buffer (๐ ๐พ ′ , ๐ถ ๐พ ′ , ๐ ๐พ ′ ) (๐ ๐พ , ๐ถ ๐พ , ๐(๐พ)) 46 Solution Propagation: Add Wire (๐ ๐พ๐ฃ , ๐ถ ๐พ๐ฃ , ๐ ๐พ๐ฃ ) ๐ข (๐ ๐พ๐ข , ๐ถ ๐พ๐ข , ๐ ๐พ๐ข ) ๐ ๐ข, ๐ฃ , ๐ถ(๐ข, ๐ฃ) ๐ฃ 47 Solution Propagation: Add Driver (๐ ๐พ ′ , ๐ถ ๐พ ′ , ๐ ๐พ ′ ) (๐ ๐พ , ๐ถ ๐พ , ๐(๐พ)) 48 Branch Merge (๐ ๐พ , ๐ถ ๐พ , ๐(๐พ)) (๐ ๐พ1 , ๐ถ ๐พ1 , ๐ ๐พ1 ) (๐ ๐พ2 , ๐ถ ๐พ2 , ๐ ๐พ2 ) 49 Exponential Runtime 16 solutions 8 solutions 4 solutions 2 solutions n candidate buffer locations lead to 2n solutions 50 Too Many Solutions ๏ง Needs solution pruning for acceleration ๏ง Two candidate solutions – (v, c1, q1,w1) – (v, c2, q2,w2) ๏ง Solution 1 is inferior to Solution 2 if – c1 ๏ณ c2 : larger load – and q1 ๏ฃ q2 : tighter timing – and w1 ๏ณw2: larger cost 51 Pruning (Q1,C1,W1) inferior/dominated if C1 ๏ณ C2,W1 ๏ณ W2 and Q1 ๏ฃ Q2 (Q2,C2,W2) 52 Outline 1 2 3 4 5 6 7 Introduction Carbon Nanotubes (CNT) Interconnect Buffering CNT Interconnect for Timing Optimization Preliminary Results for CNT Buffering Fabrication Variation Aware CNT Buffering Fabrication Variation Aware CNT based VLSI Synthesis Conclusion 53 Experimental Environment The proposed carbon nanotube interconnect based timing driven minimum cost buffer insertion algorithm is implemented in C and tested on a machine with 3.40GHz Intel Pentium CPU and 3GB memory. Simulation 1: Timing constrained minimum cost buffering Simulation 2: Timing minimization without considering cost 54 Experimental Setup – Layer RC Information Cu Bundled SWCNT (1000 SWCNTs) Resistance (โฆ) 14.50 6.45 Capacitance (fF) 0.16 0.16 ๐๐ ๐ด CNT density = 1000/(33 × 88) = 0.34๐๐2 ๐ ๐๐ข = Contact resistance is set to 100โฆ 55 Experimental Setup – Buffer Library Property BUF_X1 BUF_X2 BUF_X4 BUF_X8 BUF_X16 Resistance (โฆ) 2310.0 1201.0 618.9 315.5 159.6 ๏ดIntrinsic delay (ps) 0.21 Capacitance (fF) Linear fitting 2.93 Area (nm2) 15197.6 Property INV_X1 ๏ด ๏ฝ RC1846.0 ๏ซ๏ด 0 Resistance (โฆ) 0.44Reference 0.88 driver A2.91 B 2.87 30395.2 60790.4 INV_X2 INV_X4 Buffer w/ unknown 3.51 capacitance 1.76 2.87 121580.8 INV_X8 Reference driver C A’ 976.5 514.8 B’ 270.2ref 2.87 C 243161.6 INV_X16 139.7 C’ Capacitance (fF) 0.44 0.87 1.74 3.49 6.97 Intrinsic delay (ps) 0.59 0.62 0.61 0.61 0.61 Area (nm2) C 10115.6 20231.2 40462.4 80924.8 161849.6 56 Experimental Setup – Global Nets Our experiments are performed to 500 global nets. Due to the lack of industrial nets in 22nm technology, we scale wire lengths of old technology nets to 22nm technology. 57 Experiment 1 Timing constrained minimum cost buffering Timing minimization without considering cost 58 Experiment 1 Results of 500 Nets On Average (Normalized) Chart Title 1.2 1 Ratio 0.8 0.6 0.4 0.2 0 CNT w/o contact resistance Area # Buffers CNT w/ 100 Ohms contact resistance Delay # Solutions Cu CPU 59 Experiment 1 Results on Five Representative Nets Test cases CNT w/o contact resistance CNT w/ contact resistance (100โฆ) Cu 1 2 3 4 5 Average Area (nm2) 318666.0 394605.0 222543.0 50578.0 40462.4 205370.88 # Buffers 7 9 5 3 2 5.2 Delays (ps) 754 1128 676 1019 722 859.8 Area (nm2) 379359.0 263151.0 222543.0 80924.8 40462.4 197288.04 # Buffers 7 9 5 4 2 5.4 Delays (ps) 762 1130 691 927 736 849.2 Area (nm2) 955997.0 612308.0 475433.0 202312.0 91040.4 467418.08 # Buffers 18 19 12 10 5 15.0 Delays (ps) 766 1180 702 994 870 902.4 60 Area and Delay Trade-off Curves for Cu and CNT 61 Experiment 2 Timing constrained minimum cost buffering Timing minimization without considering cost 62 Experiment 2 Results on Five Representative Nets Test cases CNT w/o contact resistance CNT w/ contact resistance (100โฆ) Cu 1 2 3 4 5 Average Area (nm2) 3307950.0 3738560.0 2477160.0 3039520.0 1945290.0 2913844.00 # Buffers 50 61 44 44 32 46.0 Delays (ps) 376 724 314 249 188 370.2 Area (nm2) 1463910.0 1995870.0 1408250.0 1458970.0 1094230.0 1555162.00 # Buffers 36 36 31 24 18 30.8 Delays (ps) 423 797 347 302 229 419.6 Area (nm2) 2851920.0 3794220.0 2269040.0 2872350.0 2142860.0 3040382.00 # Buffers 65 67 56 48 36 59.4 Delays (ps) 479 877 382 363 276 475.4 63 Observations ๏ฑ In order to achieve the similar delay, the CNT buffering saves more than 50% buffer area over copper buffering ๏ฑ The total number of buffers in CNT buffering is much (about 2X) smaller than that of copper buffering thanks to the fact that wire resistivity of bundled SWCNTs is much lower than that of copper for global interconnect ๏ฑ The contact resistance does not have significant impact on the performance for CNT interconnect timing constrained minimum cost buffering ๏ฑ CNT buffering always outperforms the copper buffering in terms of timing and buffer area ๏ฑ CNT buffering can reduce timing by up to 32% comparing to copper buffering for buffering timing minimization without considering cost 64 Summary of CNT Buffering ๏ Carbon nanotube interconnects have become a promising replacement material for copper interconnects thanks to their superior conductivity. ๏ This work develops the first timing driven buffer insertion technique for carbon nanotube interconnects. ๏ In the experimental results, it demonstrates that with the same timing constraint, CNT buffering can save over 50% buffer area compared to copper buffering. In addition, CNT buffering can effectively reduce the delay by up to 32% without considering cost. ๏ This work is published in ISVLSI2014. Lin Liu, Yuchen Zhou and Shiyan Hu, “Buffering Single Walled Carbon Nanotube Bundle Interconnects for Timing Optimization”, to appear in Proceedings of IEEE Computer Society Annual Symposium on VLSI (ISVLSI), 2014. 65 Outline 1 2 3 4 5 6 7 Introduction Carbon Nanotubes (CNT) Interconnect Buffering CNT Interconnect for Timing Optimization Preliminary Results for CNT Buffering Fabrication Variation Aware CNT Buffering Fabrication Variation Aware CNT based VLSI Synthesis Conclusion 66 Fabrication Imperfectness • It is very difficult for today’s CNT processing to produce perfect CNTs. • New design techniques must be employed that are immune to these inherent CNT imperfections. • These new design techniques must be compatible with VLSI processing, and must have minimal impact on existing VLSI design flows. 67 Existing Works ๏ฑ Some research works studied the variations of CNT and some proposed robust design considering CNFET ๏ฑ Some research works studied the copper variation based buffer insetion • • • • • Jie Zhang, et al, "Carbon Nanotube Robust Digital VLSI," Computer-Aided Design of Integrated Circuits and Systems, IEEE Transactions on , vol.31, no.4, pp.453,471, April 2012 Patil, N.; Lin, A.; Myers, E.R.; Ryu, Koungmin; Badmaev, A.; Chongwu Zhou; Wong, H. -S P; Mitra, S, "Wafer-Scale Growth and Transfer of Aligned Single-Walled Carbon Nanotubes," Nanotechnology, IEEE Transactions on , vol.8, no.4, pp.498,504, July 2009 Jie Zhang; Patil, N.P.; Hazeghi, A.; Wong, H. -S P; Mitra, S, "Characterization and Design of Logic Circuits in the Presence of Carbon Nanotube Density Variations," Computer-Aided Design of Integrated Circuits and Systems, IEEE Transactions on , vol.30, no.8, pp.1103,1113, Aug. 2011 Raychowdhury, A.; De, V.K.; Kurtin, Juanita; Borkar, S.Y.; Roy, K.; Keshavarzi, A., "Variation Tolerance in a Multichannel Carbon-Nanotube Transistor for High-Speed Digital Circuits," Electron Devices, IEEE Transactions on , vol.56, no.3, pp.383,392, March 2009 Jinjun Xiong; Lei He, "Probabilistic Transitive-Closure Ordering and Its Application on Variational Buffer Insertion," Computer-Aided Design of Integrated Circuits and Systems, IEEE Transactions on , vol.26, no.4, pp.739,742, April 2007 68 CNT Fabrication Variations Distribution CNT diameter variations (d) Normal CNT length variations (l) Normal CNT counts (Ncnt) Normal ๐๐ = 1.3๐๐ ๐๐ = 0.2๐๐ ๐๐ ๐๐ = ๐๐ cos 10° ๐๐๐๐๐ก = 1000 ๐๐๐๐๐ก = 22 CNT distance to ground (y) Normal ๐๐ฆ ๐๐ฆ = 0.1๐๐ Jie Zhang; Patil, N.P.; Hazeghi, A.; Wong, H. -S P; Mitra, S, "Characterization and Design of Logic Circuits in the Presence of Carbon Nanotube Density Variations," Computer-Aided Design of Integrated Circuits and Systems, IEEE Transactions on , vol.30, no.8, pp.1103,1113, Aug. 2011 http://www.sciencedirect.com/science/article/pii/S1369702113000205 71 Parameters Variations ๐ ๐๐ ๐ ๐๐๐ข๐๐๐๐ ๐ ๐ ๐,๐๐๐ค๐๐ ๐ก๐๐๐๐ ๐ถ๐ธ ๐๐ข๐๐๐๐ โ ๐ 2 ๐ถ๐๐ ๐ ๐,๐ข๐๐ ๐ก๐๐๐๐ ๐ถ๐ธ ๐๐ข๐๐๐๐ โ ๐ 2 ๐ถ๐๐๐๐ ๐, ๐๐๐๐ก , ๐ฆ and ๐ follow normal distribution In a SWCNT bundle that contains large amount of CNTs in parallel; according to Central Limit Theorem, we have ๐ ๐๐๐ข๐๐๐๐ ~๐ ๐๐ ๐๐ข๐๐๐๐ , ๐๐ ๐๐ข๐๐๐๐ 2 ๐ ๐ถ๐ธ ~๐(๐๐ถ๐ธ , ๐๐ถ๐ธ 2) ๐ 70 Uncertainty Aware Design Idea 1. According to the distribution of each variable, generate large amount of test cases with deterministic values; 2. For each deterministic test case, perform the deterministic algorithm; 3. If the yield rate is large enough, the current solution is return; else generate new test cases and repeat 2. 71 Uncertainty Aware Design on CNT Buffering Algorithm Given the maximum and minimum of each variable Generation the value of the variables by ๐=๐ฝ๐๐๐๐ฅ+(1−๐ฝ)๐๐๐๐, ๐ฝ is initialized to 0 Generate large amount of test cases given normal distribution of each variables Run the deterministic CNT buffering algorithm Timing constraints yield rate is satisfied No Update ๐ฝ = ๐ฝ + โ Yes Return the buffering solution 72 Outline 1 2 3 4 5 6 7 Introduction Carbon Nanotubes (CNT) Interconnect Buffering CNT Interconnect for Timing Optimization Preliminary Results for CNT Buffering Fabrication Variation Aware CNT Buffering Fabrication Variation Aware CNT based VLSI Synthesis Conclusion 73 Traditional Physical Synthesis of Copper Based Design Given: nets, a set of cells and constraints of timing, power and area. Placement: determine the physical location of each cell Routing: given a placement solution, determine the routing path of every cell Buffer insertion: given a routing solution, insert buffer to reduce the delay Layer assignment: determine the layer of interconnects 74 Fabrication Variation Aware CNT based VLSI Physical Synthesis Placement Routing CNT Co-design Layer Assignment No Converged? CNT Buffering Yes Output 75 Cross-Entropy (CE) Method CE is developed as an efficient estimation technique for rare-event probabilities in discrete event simulation systems and is adapted for use in optimization. Two phases of CE: 1.Generate a random data sample according to a specified mechanism. 2.Update the parameters of the random mechanism based on the data to produce a "better" sample in the next iteration. 76 Placement: SimPL Initial Placement Global Placement Post Global Placement Uniformly Distributed Placement Look-ahead Legalization (Upper Bounds) Last Upper-bound Placement Pseudonets linking each cell to its legalized location Final Legalization and Detailed Placement B2B Graph Update Linear System (Lower Bounds) Legal Placement Netlist ๏ Graph (B2B Net Model) Linear System No Converged? (Δ HPWL) Yes No Converged? (Gap+ΔHPWL) Yes 77 Initialize PDF for each cell Cross-Entropy Based Flowchart Pull cell to available placement according to PDF of each cell using Monte-Carlo method Update PDF for cells associated with survivals Keep Samples With Smaller Gaps Sample 1 (Upper Bounds) Sample 2 (Upper Bounds) Sample N (Upper Bounds) B2B Graph Update Linear System (Lower Bounds) B2B Graph Update Linear System (Lower Bounds) B2B Graph Update Linear System (Lower Bounds) Copper based Routing Copper based Routing Copper based Routing CNT Co-Design Layer Assignment CNT Co-Design Layer Assignment CNT Co-Design Layer Assignment CNT Buffering CNT Buffering CNT Buffering Yes No Converged? Finish 78 CNT for Security? ๏ง CNT technologies can be utilized to design Physically Unclonable Function (PUF) for security applications. ๏ง Based on a physical system (e.g. random variation during an IC fabrication process) ๏ง For use in crypto applications ๏ง Easy to construct and evaluate a PUF ๏ง Very hard (“impossible”) to produce two PUFs with similar challenge-response behavior Challenge PUF Response 79 One Example C D Q 0 x No change 1 1 1 1 0 0 1 0 1 0 D 1 0 Q 1 0 C If top path is faster, the D = 0, C = 1, output Q = 0; If bottom path is faster, the D = 1, C = 0, output Q remains at 1. The fabrication variation will generate unpredictable and random output. 80 Basic Properties of PUF ๏ง Minimum requirements: ๏ง For two random PUFs, difference between expected responses to same challenge, should be large ๏ง For single random PUF, difference between two measured responses to same challenge should be small ๏ง For single random PUF, uncertainty about response to challenge is large 81 PUFs Classification ๏ง Delay based intrinsic PUFs ๏ง Utilize the propagation delay between identical circuits in order to derive a response ๏ง e.g. Arbiter PUF, Ring oscillator PUF ๏ง Memory based intrinsic PUFs ๏ง Produce an output response based on the unpredictable startup state of feedback-based CMOS memory structures ๏ง e.g. SRAM PUF 82 Arbiter PUF ๏ง Initial design ๏ง switch block: e.g. two MUX ๏ง arbiter: e.g. a latch or a flip-flop ๏ง n switch blocks ๏ 2n “different” delays Lee, J.W.; Lim, D.; Gassend, B.; Suh, G.E.; van Dijk, M.; Devadas, S., "A technique to build a secret key in integrated circuits for identification and authentication applications," in VLSI Circuits, 2004. Digest of Technical Papers. 2004 Symposium on , pp.176-179, 2004 83 SRAM PUF ๏ง Which state right after power-up? ๏ง depends on physical mismatch between M2 and M4 ๐ ๐ธ 1 0 0 1 2 possible stable states J. Guajardo, S. S. Kumar, G. 1. Schrijen , and P. Tuyls, "FPGA Intrinsic PUFs and Their Use for IP Protection ," in CHES, 2007, pp. 63-80. 84 CNT Fabrication Process CNT fabrication process at the Future Carbon GmbH in Bayeruth, Germany 85 CNT Density Variation ๏ง CNT density is defined as the CNT count per unit width (1um). ๏ง CNT density variation is caused by randomness of CNT manufacturing process. ๏ง Spacing between alligned CNTs varies significantly, leading to huge CNT density variation. (i-1,j) (i,j) (i-1,j+1) (i,j+1) (i+1,j) Bundled CNT interconnect (i+1,j+1) 86 Parallel Bundled SWCNTs Unit A Timing B comparator Parallel Bundled SWCNTs Unit Output 1/0 If signal A arrives first, the output is 1; if signal B arrives first, the output is 0. The two set of bundled SWCNTs are generated in exactly same environment. 87 Using Bundled SWCNT Interconnects to Design PUF Challenge Parallel Bundled SWCNTs Unit Parallel Bundled SWCNTs Unit Parallel Bundled SWCNTs Unit Parallel Bundled SWCNTs Unit Response 88 CNFET Based PUF CNT based PUFs aim to achieve better reliability, low energy and power consumption compared to that of silicon based PUFs. 89 PUFs Properties Challenge x ๏ง Evaluatable PUF Response y ๏ง y = PUF (x) is easy ๏ง Unique ๏ง PUF(x) contains some unique information ๏ง Reproducible ๏ง PUF() has only small error ๏ง Unclonable ๏ง Hard to make PUF’(x) given PUF(x) ๏ง Unpredictable ๏ง Hard to find yN=PUF(xN) given other x, y pairs ๏ง One-way ๏ง Given y and PUF(), cannot find x ๏ง Tamper evident ๏ง Tampering changes PUF() 90 Conclusion ๏ฑ Carbon nanotube interconnects have become a promising replacement material for copper interconnects thanks to their superior conductivity. ๏ฑ We develop the first carbon nanotubes based fabrication variation aware VLSI physical synthesis targeting timing optimization. ๏ฑ We develop the first timing driven buffer insertion technique for carbon nanotube interconnects. ๏ฑ Uncertainty aware method and probabilistic method are proposed to handle fabrication variations. ๏ฑ CNT technologies can be used for security applications (PUF designs). 91 92