2012 2nd International Conference on Information Communication and Management (ICICM 2012) IPCSIT vol. 55 (2012) © (2012) IACSIT Press, Singapore DOI: 10.7763/IPCSIT.2012.V55.26 Efficient On-Chip Crosstalk Avoidance CODECs using FTF-FNS Venkateswara Rao J 1, Sudhakara Rao P 2 1 Associate Professor, Vignan Institute of Technology & Science 2 Dean R & D, Vignan Institute of Technology & Science Abstract. VLSI design has marched into the realm of Deep Submicron (DSM) processes, where the minimum feature size is well below 1 µm. These advanced processes enable designers to implement faster, bigger and more complex designs. Integrated Circuit(IC) design has seen a revolutionary progress in the past two decades with shrinking sizes of VLSI fabrication processes. This has an advantage of fabricating millions of transistors in a single chip IC. On the other hand it also creates many challenges in Deep Sub-Micron (DSM) technologies. One of the greatest challenges in DSM designs is inter-wire crosstalk, which becomes significant due to coupling capacitance between wires. The effect of inter wire crosstalk is that it greatly limits speed and increases power consumption of IC. This paper focuses on design and implementation of an efficient CODEC which uses Forbidden Transition Free (FTF), Fibonacci based Number System (FNS) for bus encoding. Our approach of CODEC design greatly increases the speed (approximately greater than 2.5 times) and decreases the power consumption with the best existing technologies. Keywords: Crosstalk, Deep Sub-Micron, Forbidden Transition Free, Fibonacci Number System, On-Chip bus, Crosstalk avoidance Codes, CODEC. 1. Introduction With shrinking device sizes, increasing chip complexity and faster clock speeds, wire delay is becoming increasingly significant [11]. The propagation delay through long cross-chip buses is proving to be a limiting factor in the speed of some designs, and this trend is only expected to get worse. It has been shown that the delay through a long bus is strongly dependent on the coupling capacitance between the wires. In particular, the crosstalk effect when adjacent wires simultaneously transition in opposite directions is particularly detrimental to the delay. When the cross-coupling capacitance is comparable to or exceeds the loading capacitance on the wires, the delay of such a transition may be twice or more than that of a wire transitioning next to a steady signal. This delay penalty is commonly referred to as the capacitive crosstalk delay. The capacitive crosstalk delay strongly depends on the transition activities of the adjacent signals, hence the crosstalk type. Type-4 and type-3 crosstalk have the worst delay characteristics, followed by type-2 and then type-1. A few techniques involving selective skewing of bus data signals [12], transistor sizing [13], and repeater sizing [14] to reduce capacitive crosstalk induced delay have been proposed. Encoding is one of the more effective ways to reduce capacitive crosstalk delays. Here we present encoding techniques that focus on reducing crosstalk delay. The rest of the paper is organized as follows. Section II explains about Crosstalk Classification, Section III discusses about forbidden transition free crosstalk avoidance codes (FPF-CAC). In Section IV, we discuss about Fibonacci Number System. In Section V, We discuss about circuit implementation and experimental results. We conclude the paper in Section 6. 2. Crosstalk Classification Figure-1 illustrates a simplified on-chip bus model with crosstalk. For simplicity we assumed three concurrent wire lines. In the figure, CL denotes the load capacitance, which includes the receiver gate capacitance and also the parasitic wire-to-substrate parasitic capacitance. CI is the inter-wire coupling 137 capacitance between adjacent signal lines of the bus. In practice, this bus structure is typically modelled as a distributed RC network, which includes the non-zero resistance of the wire as well. Physical Layout B A C Circuit Model Aggressor net A CI Victim net B Driver CL Receiver 2 Receiver 1 Aggressor net C Fig. 1: On-chip bus model with crosstalk Fig. 2: Delay impact of different crosstalk classes types The on-chip bus crosstalk is classified into five types as shown in Table-1. Table. 1: Transition Pattern Crosstalk Classification Crosstalk Class Sample Transition Patterns Ceff 0C CL 000→111 1C CL(1+λ) 011→000 2C CL(1+2λ) 010→000 3C CL(1+3λ) 010→100 4C CL(1+4λ) 010→101 This classification is based on the effective capacitance, in the jth line in a bus as, Ceff, j Ceff, j = CL [1 + λ ((1 − δj, j−1) + (1 − δj, j+1))] (1) = CL + Clw, j + Crw, j It separates Ceff, j into three components: the intrinsic capacitance CL, the crosstalk capacitance to the wire on the left side, Clw, j = λ (1−δj, j−1) CL, and the capacitance to the wire on the right side, Crw, j = λ (1−δj, j+1) CL. It is easy to see that Clw, j, Crw, j ∈ {0, 1CI, 2CI}. 3. Forbidden Transition Free Based Crosstalk Avoidance Codes A forbidden transition is defined as the simultaneous transition (in opposite directions) on two adjacent bits, i.e., 01 → 10 or 10 → 01. We first observe that to guarantee forbidden transition freedom on the boundary djdj+1 between any two code-words in an FTF-CAC, the 01 and 10 patterns cannot coexist in the same set of code-words. This can be easily confirmed by examining the transitions among codes in {00, 01, and 11}, or {00, 10, 11}. If we eliminate 01 or 10 from all the boundaries in the code-words in a set of codewords R, we can guarantee that R is forbidden transition free. Therefore, once again, the problem of eliminating forbidden transitions is transformed into a problem of eliminating specific patterns. • Start with a complete set of 2n vectors and remove code-words that do not satisfy the boundary constraints. • Start with a set consisting of a single class 1 code-word and grow the FTF-CAC code-words by adding compatible code-words to the set. • Start from a small FTF set (say 2-bit FTF codes) and inductively append bits to the code-words in the set until the code-word length reaches n-bits. Clearly, the first (pruning) method is impractical when n is 138 large, since a complete set of code-words have 2n entries and searching through n − 1 boundaries requires O (2(n−1) n) searches. Both the second and the third methods listed above actually “grow” the FTF code-words instead of “pruning”, and therefore require less computation. The method of “growing” code-words by appending bits to code-words in an existing set is given in Algorith-1. Algorithm-1 is the pseudo code for generating the FTF code-words. Table. 2: FTF-CAC Code-words generation Algorithm. Algorithm-1 FTF code-word generation S2 = {00, 01, 11} else for m > 2 do for ∀Vm−1 ∈ Sm−1 do if m is odd then add 0 · Vm−1 to Sm; for ∀Vm−1 ∈ Sm−1 do if dm−1 = 1 then add 1 · Vm−1 to Sm; add 1 · Vm−1 to Sm; if dm−1 = 0 then end if add 0 · Vm−1 to Sm; end for end if end if end for end for Table. 3: FTF-CAC Code-words for 2, 3, 4 & 5 Bit Busses. 2 bits 00 01 11 3 bits 000 001 100 101 111 4 bits 0000 0001 0100 0101 0111 1100 1101 1111 5 bits 00000 00001 00100 00101 00111 10000 10001 10100 10101 10111 11100 11101 11111 4. Fibonacci Number System(FNS) The Fibonacci binary numeral system (FNS) was first mentioned in the context of CAC designs by Mutyam in [4]. The author proposed an inductive code-word generation algorithm for the forbidden Pattern free code. The algorithm is similar to those proposed in [6, 8]. However, [4] failed to address the mapping scheme and CODEC design. We next describe our FNS-based mapping, and the resulting CODEC designs. A numeral system is “a framework where numbers are represented by numerals in a consistent manner”. The most commonly used numeral system in digital design is the binary numeral system, which uses powers of two as the basis. For a number v, its binary representation is defined in Eq. 2. The binary numeral system is complete and unambiguous, which means that each number has one and only one representation in the binary numeral system. Definition-1 v= bk ∈ {0, 1} (2) = dk ∈ {0, 1} (3) The Fibonacci-based numeral system N(fm, {0, 1}) is the numeral system that uses Fibonacci sequence as the basis. The definition of the Fibonacci sequence is given in Eq. 4. A number v is represented as the summation of some Fibonacci numbers, and no Fibonacci number is in the summation more than once, as indicated in Eq. 5. Definition-2 fm = {0 if m=0, 139 {1 if {fm−1 + fm−2 if m=1, (4) m ≥ 2. Similar to the binary numeral system, the Fibonacci-based numeral system is complete, and therefore any number v can be represented in this system. However, the Fibonacci-based numeral system is ambiguous. As an example, there are six 7-digit vectors in the Fibonacci numeral system for the decimal number 19: {0111101, 0111110, 1001101, 1001110, 1010001 and 1010010}. For clarity, we refer to a vector in the binary numeral system as a binary vector or binary code; a vector in the Fibonacci numeral system is referred to as a Fibonacci vector or Fibonacci code. All the Fibonacci vectors that represent the same value are defined as equivalent vectors. Another very important identity of the Fibonacci sequence is fm = ∑ m -2 k =0 fk (5) n n The n-bit binary vector can represent numbers in the range of [0, 2 -1], and therefore a total of 2 values can be represented by n-bit binary vectors. From Eq. 5, we know that the range of an m-bit Fibonacci vector is [0, fm+2−1], where the minimum value 0 corresponds to all the bits dk being 0, and the maximum value corresponds to all dk being 1. Hence a total of fm+2 distinct values can be represented by m-bit Fibonacci vectors. 5. Implementation & Experimental Results Table-4 shows 6 bit code-words for 4 bit data-words which are generated using FTF-FNS codes. The coded busses in the simulation were 6-bits wide. A 4- to-6-bit encoder and a 6-to-4-bit decoder logic were implemented using mapping of data-words to code-words using FTF code-words. Our simulations using Synopsys Design Compiler show that the maximum delay of both the encoder and the decoder was 10.23 ns. Table. 4: 6 Bit FTF-FNS Code-words for 4 Bit Data-words. 4 bit data-words (d3d2d1d0) 6 bit code-words 4 bit data-words (c5c4c3c2c1c0) (d3d2d1d0) 6 bit code-words (c5c4c3c2c1c0) 0000 000000 1000 010101 0001 000001 1001 010111 0010 000100 1010 011100 0011 000101 1011 011101 0100 000111 1100 011111 0101 010000 1101 110000 0110 010001 1110 110001 0111 010100 1111 110100 Fig. 3: Gate Level Schematic for FTF-FNS Encoder Fig. 4: Gate Level Schematic for FTF-FNS Decoder This paper is based on the concepts proposed in [1]. The encoder and decoder presented in fig.3 and fig. 4 have a data arrival time of 4.08 ns and 6.15 ns respectively. The encoder and decoder proposed in [1] have 140 a data arrival time of 17.88 ns and 7.63 ns respectively. Thus the total delay for the CODEC is 10.23 ns compared to encoder proposed in [1] have a total delay of 25.51 ns. So, we can say that our CODEC is around 2.5 times faster than the CODEC proposed in [1]. When the data arrival time is small, it creates a more positive slack, which makes the chip to work at higher frequencies. Table. 5: Timing reports for FTF-FNS Encoder and Decoder. Timing reports for FTF-FNS Encoder Port/Point Timing reports for FTF-FNS Decoder Incr. Path Delay (ns) Delay ( ns) Port/Point Incr. Path Delay (ns) Delay ( ns) Input/external delay 0.00 0.00 r Input/external delay 0.00 0.00 f b[2] (in) 0.00 0.00r c[2] (in) 0.00 0.00 f U34/Z (IV) 0.47 0.47 f U53/Z (IV) 0.32 0.32 f U33/Z (ND2) 1.25 1.72 r U48/Z (NR2) 1.62 1.94 r U29/Z (NR2) 0.34 2.06 f U47/Z (ND2) 0.37 2.31 f U28/Z (AO2) 1.56 3.63 r U46/Z (IV) 0.93 3.25 r U24/Z (AO3) 0.45 4.08 f U44/Z (ND3) 0.70 3.94 f c[2] (out) 0.00 4.08 f U43/Z (AO7) 1.18 5.13 r 4.08 U42/Z (MUX21L) 0.45 5.58 f U38/Z (ND2) 0.57 6.15 r b[0] (out) 0.00 6.15 r Data arrival time Data arrival time 6.15 Table-5 shows the timing report of the data arrival time for the FTF-FNS encoder from input side to output side. Prime-Time reports the worst delay path from input to output. The b[2] is the input port of the encoder and the name in the bracket is the reference name for that port. Incr. path is the incremental path delay. The delay from input port, b[2] to the output of and-or gate (U24/Z) which is the output port of encoder, c(2) is 4.08 ns (See Fig-3). The f in the third column indicates a transition from 1 to 0 and r indicates a 0 to 1 transition.Table-5 also shows the timing report of the data arrival time for the FTF-FNS decoder from input side to output side. The c[2] is the input port of the decoder and the delay from input port, c[2] to the output of nand2(U38/Z) which is the output port of decoder, b(0) is 6.15 ns (See Fig-4). Table. 6: Timing, Power and Area Report. Delay(ns) Power(µw) Area(µm2) Encoder 4.08 4.7919 25 Decoder 6.15 6.4275 40 Any CODEC performance is quantified by measuring the area, power consumption and speed of the CODEC which is required for any VLSI chip. Table-6 shows the reports generated using Design Compiler Tool for both encoder and decoder. It has observed that the encoder and decoder occupied very small area with low power consumption and very small delay. 6. Conclusion FNS-based FTF encoding algorithms are very efficient in terms of area overhead. In this work, we presented a novel method to avoid crosstalk using Forbidden Transition Free, Fibonacci Number System CODEC. This work is much superior in terms of Speed, Power consumption and Area required for implementing this CODEC. This CODEC has got an improved speed of around 2.5 times than the implementation technique specified in [1]. 141 7. References [1] Chunjie Duan, Victor Cordero, Sunil P. Khatri “Efficient On-Chip Crosstalk avoidance CODEC Design” Proceedings of IEEE VLSI Systems, pp 551-560, April-2009. [2] A. Bastani and C.A. Zukowski. “A low-leakage high-speed monotonic static CMOS 64b adder in a dual gate oxide 65-nm CMOS technology”. In Proceedings of Int. Symposium on Quality Electronic Devices, pages312–317, 2006. [3] C. Duan, K. Gulati and S.P. Khatri, “Memory-based Cross-talk Cancelling CODECs for On-chip busses”, ISCAS, 2006. [4] Madhu Mutyam, “Preventing Crosstalk Delay using Fibonacci Representation”, Intl Conf. on VLSI Design, pp 685-688, 2004. [5] S.R. Sridhara, A. Ahmed, and N.R. Shanbhag, “Area and Energy-Efficient Crosstalk Avoidance Codes for On-Chip busses”, Proc. Of ICCD, PP 12-17, 2004. [6] C. Duan and S.P. Khatri, “Exploiting Crosstalk to Speed up On-Chip busses:, DATE 2004. [7] S.R. Sridhara and N.R. Shanbhag. “Coding for system-on-chip networks: a unified framework”. In Proceedings of Design Automation Conference, pages 103–106. IEEE/ACM, 2004. [8] B.Victor and K.Keutzer. “Bus encoding to prevent crosstalk delay”. In Proceedings of International Conference on Computer Aided Design, pages 57–63. IEEE/ACM, 2001. [9] Duan .C, Tirumala .A and Khatri .S.P.,”Analysis and Avoidance of Cross-talk in On-chip Bus”, Hot Interconnects, pp 133-138, 2001. [10] Ma .J and He .L,”Formulae and applications of interconnect estimation considering shield insertion and net ordering”, ICCAD, pp 327-332, 2001. [11] Davis J. A, al. et, “Interconnect limits on giga-scale integration (GSI) in the 21st century” in Proceedings of the IEEE, pp. 305–324, 2001. [12] Hirose .K and Yasuura .H, “A bus delay reduction technique considering crosstalk” in Proceedings of Design, Automation and Test in Europe, pp. 441–445, 2000. [13] Xiao .T and Sadowska .M, “Crosstalk reduction by transistor sizing” in Proceedings of Asia and South Pacific Design Automation Conference (ASPDAC), pp. 137–140, 1999. [14] Li .D, Pua .A, Srivastava .P and Ko .U, “A repeater optimization methodology for deep submicron, highperformance processors” in IEEE International Conference on Computer Design: VLSI in Computers and Processors (ICCD), Austin, TX, pp. 726–731, 1997. The First Author J.Venkateswara Rao received B.Tech Degree from, VRSEC, Nagarjuna University, Guntur, in 2000. M.Tech. Degree from NITW, Warangal, India in 2003, pursuing Ph.D in the Department of Electronics and Communication Engineering, JNT University, Hyderabad, INDIA. Currently he is working as Associate Professor in the Department of Electronics and communication Engineering, VITS, Hyderabad, INDIA, His research interest includes on chip crosstalk noise reduction in VLSI Circuits. He published 4 papers on “on-chip crosstalk noise reduction in VLSI circuits” in international journals and international conferences. The Second Author Dr.P.Sudhakara Rao Completed Ph.D., Information and Communication Engineering from Anna University, India, Masters in Electronics and Communication Engineering from Anna University, India, Bachelors in Electronics and Communication Engineering from Mysore University, India. Worked as deputy Director, “Central Electronics Engineering Research Institute centre, India” for over 25 years, 2 years as Vice-president, “Sieger Spintech Equipments Ltd., India”, established an electronic department for the development of electronic systems for nearly 2 years. Presently working with Vignan Institute of Technology and Science, Nalgonda District, AP, INDIA as DEAN R&D, HOD ECE. He has one patent and 55 technical publications/ conference papers to his credit. Conducted many international and national conferences as chairman. 142 Author Index A Amnart Pohthong Apisak Khankaewla Arun Swaminathan B B. Arun Prasad Bai Weixiong Kyung-Min Park 108 126 67 1 8 C Chenchen Zhang Cheng-Jung Tsai Chun-Cheng Peng 93 39 39 D Dac-Nhuong Le Deyu Feng Dong Weisheng Dr. V.M. Shenbagaraman 81 76 13 1 E Eun-Ji You 62 F Fu Xiaolong 8 G Guangna Lv H Hoon Choi I Iain Phillips J Jayshree Suresh Jiao Shuhong Jiao Shuhong Jizhang Fan Jo-han Chang Jukkrit Kluabwang L Lin Guan Liu Jiayuan Liu Mingzhu 87 62 98 117 13 51 113 34 126 143 98 51 13, 51 M Md. Shahid Uz Zaman Mehdi Asgarkhani 131 122 N Narongrit Pimkumwong Nguyen Le Dang Nhu Gia Nguyen 126 81 81 P P. Arun Kumar Pan Liu Pei-Cheng Cheng Pengpeng Zheng Phongsathon Fongta Priyanka Ahlawat 1 76 39 44 28 19 R Rajender Singh Chillar 19 S S. Sundararajan Shang Zhigang Shaoying Huang Sudhakara Rao P Supat Sriyanaluk Sutep Tongngam 117 8 44 137 108 72 T Tae-Eun Yoon Tatsana Thomthong Tianyu Ding Ting-Yi Chang 62 126 44 39 U Usha Kumari K 62 19 V V.M. Shenbagaraman Venkateswara Rao J Vinh Trong Le 117 137 81 W Wei Zheng Wilawan Rukpakavong Worasit Choochaiwattana 56 98 103 X Xiangming Wen Xiaohua Guo Xiongfei Tao Y Yan Zhang Yanan Fu Yanhui Wang Yanqiu Li Yen-Lin Chen Yi- ting Chen Yuthapong Somchit Z Zhaoming Lu Zhenmin Zhao Zhicai Zhang Zi Wang 56, 87 56 76 144 76 44 93 113 39 34 28 56 56, 87 87 93