ANALYSIS WITH HIGH SPEED MULTI-OUTPUT 256-BIT CARRY LOOK-AHEAD ADDERS OF 32NM USING DOMINO LOGIC A.Bharathi1, M.Manikandan2, K.Rajasri3 Abstract—This paper presents the carry look-ahead adder algorithm with high speed, less area consuming less power is being analyzed and compared with different technologies of L=32nm.In adder circuit delay is the major drawback. To overcome this domino logic is used to minimize the circuit and reduce the number of transistors size for 256-bit CLA adder using HSPICE simulation tool. For higher speed of the domino circuit in CMOS technology Manchester carry chain adder(MCC) is proposed. Index Terms— Addition, Carry-Look ahead Adder (CLA), High Performance, domino logic, CMOS technology, HSPICE TOOL 1.INTRODUCTION ADDITION is the most commonly used in arithmetic operation and also the speed-limiting element to make faster VLSI processors. The objective of Computer Arithmetic is to develop appropriate algorithms that are utilizing available hardware in the most efficient way. Ultimately, speed, power and chip area are the most often used measures, making a strong link between the algorithms and technology of implementation. Dynamic gates are faster than static gates despite the extra fet in the pull down path because of the reduction in self-loading and the limitation of the pull up short-circuit current during the first part of the output transition. Dynamic gates cannot be cascaded. In integrated circuit design, dynamic logic is a design methodology in combinatory logic circuits, particularly those implemented in MOS technology. It is distinguished from the so-called static logic by exploiting temporary storage of information in stray and gate capacitances. It was popular in a recent resurgence in the design of high speed digital electronics, particularly computer CPUs. Dynamic logic circuits are usually faster than static counterparts, and require less surface area, but are more difficult to design. Dynamic logic has a higher toggle rate than static logic but the capacitive loads are smaller so the overall power consumption of dynamic logic may be higher or lower depending on various tradeoffs. In the most common version of this concept, the output is driven high or low during distinct parts of the clock cycle. In contrast, in dynamic logic, there is not always a mechanism driving the output high or low. During the time intervals when the output is not being actively driven, its impedance causes it to maintain a level within some tolerance range of the driven level. In a basic design, to start waiting, the CPU would write to a register to set a binary latch bit which would be ANDed or ORed with the processor clock, stopping the processor. A signal from a peripheral device would reset this latch, resuming CPU operation. The hardware logic must gate the latch control inputs as necessary to ensure that a latch output transition does not cause the clock signal level to instantaneously change and cause a clock pulse, either high or low, that is shorter than normal. Dynamic logic, when properly designed, can be over twice as fast as static logic. It uses only the faster N transistors, which improve transistor sizing optimizations. Static logic is slower because it has twice the capacitive loading, higher thresholds, and uses slow P transistors for logic 2.RELATED WORK Dynamic logic can be harder to work with, but it may be the only choice when increased processing speed is needed. Most electronics running at over 2 GHz these days require the use of dynamic, although some manufacturers such as Intel have completely switched to static logic to reduce power consumption. Note that reducing power use not only extends the running time with limited power sources such as batteries ,but it also reduces the thermal design requirements, reducing the size of needed heat sinks, fans, etc., which in turn reduces system weight and cost. In this paper presents the carry look-ahead adders in domino circuits for faster requirement of the VLSI processors or digital signal processors. The Manchester carry chain adder (MCC) is the most popular dynamic (domino) CLA, is proposed with an implementation in VLSI. The MCC have enabled the development of multi-output domino gates which have given area and speed improvement with respect to single output. The efficiency of the MCC is trying to transfer its structure to static logic. In a report has been made of dynamic CMOS 4-bit CLA adder in multi-output logic which reduces the number of transistors which considered to a conventional schema. However, the simulation results shown any speed improvement but reduce delay is been analyzed and compared with different technologies having an supply voltage of VDD=1.2V and 1V is defined. The performance of the domino logic circuits of an 256-bit CLA adder has the generate and propagate signal of AND or XOR gates. When CLK is low, dynamic node is precharged high and buffer inverter output is low. NMOS in the next logic block will be off. When CLK goes high, dynamic node is conditionally discharged and the buffer output will conditionally go high. Since discharge can only happen once, buffer output can only make one low-to-high transition. When domino gates are cascaded, as each gate evaluates, if its output rises, it will trigger the evaluation of the next stage, and 1 so on… like a line of dominos falling. Like dominos, once the internal node in a gate falls, it stays fallen until it is “picked up” by the precharge phase of the next cycle. Thus many gates may evaluate in one eval cycle. charge sharing between nodes in the pull down network and the dynamic node can unintentionally reduce the voltage of the dynamic node enough to switch output buffer .The addition of the output inverter makes domino gates non-inverting. One can often design around this limitation, but some circuits cannot be implemented solely using Domino logic unless both polarities (true and complement) of the inputs are available. If both polarities of inputs are available then we can generate both polarities of internal signals with two domino gates so subsequent stages will have both polarities of their inputs available. C1=G0+P0.C0 C2=G1+G0.P1+C0.P0.P1 C3=G2+G1.P2+G0.P1.P2+C0.P0.P1.P2 C4=G3+G2.P3+G1.P2.P3+G0.P1.P2.P3+C0.P0.P1.P2 To determine whether a bit pair will generate a carry, the following logic works: Gi=Ai.Bi 1.1 CIRCUIT DESIGN Since domino gate outputs are low during the precharge phase, gates which have only domino output nodes as inputs don’t need the evaluate nfet since all the nfets in the pull down will be off anyway. if evaluate nfet is removed, precharge will ripple through cascaded gates just like evaluates. Maybe only remove for gates where nfet stack is tall (i.e. resistive) enough that pull up will satisfy anyway before ripple reaches gates and turns off pull downs. To avoid short-circuit current, delay arrival of precharge signal to downstream gate. Fig 2 layout diagrams for generate AND signal Fig.1 schematic diagram of generate AND signal Figure 1 shows the implementation of generate signal in domino logic. The circuit will possesses generally two state precharge state and evaluation state. If the clock signal goes to value ‘0’, then the circuit will enter into precharge state and PMOS will get connected to ground and output will maintain the value of 0. If the clock makes the transition from 0 to 1 then the circuit will enter into evaluation state and the output depends on the input value. Since generate signal possess AND operation if both input are maintained at 1, 1 then the output gi will be maintained at 1 else the output value will be maintained at 0 i.e gi=0. For the example provided, from the preliminary basics the logic for the generate (g) and propagate (p) values are given below. Note Fig.3 schematic diagram of propagate or signal Fig.3 shows When CLK is low, evaluate nfet is off and precharge PMOS is on output node is precharged to VDD, other nodes may precharge to VDD - Vth, depending on values of inputs .When CLK goes, high evaluate nfet is on and precharge PMOS is off output node may be discharged if inputs have configured a conducting path to GND, otherwise output node stays charged high. inputs must be stable before CLK goes high because once output has been discharged it won’t go high again until next cycle for same reason, noise/glitches on inputs cannot exceed nfet threshold, a much more stringent requirement than for static CMOS gates. that the numeric value determines the signal from the circuit above, starting from 0 on the far left to 3 on the far right: C1=G0+P0 .C0 C2=G1+P1 .C1 C3=G2+P2 .C2 C4=G3+P3 .C3 Substituting C1 into C2, then C2 into C3, then C3 into C4 yields the To determine whether a bit pair will propagate a carry, either of the following logic statements work: Pi=Ai xor Bi Pi=Ai +Bi expanded equations 2 The layout diagram has been drawn using microwind tool has shown below Technique 90nm 65nm 32nm Static 2.7096 2.3285 2.0125 Dynamic 1.9969 1.9022 1.0624 Table1: comparisons with technology and sum delay Technique 90nm 65nm 32nm Static 2.6421 2.2405 2.0120 Dynamic 1.9033 1.9022 1.0756 Table2: comparisons with technology and carry delay Fig.4 layout diagram for propagate XOR signal In this paper mainly focus on reducing delay when temperature increases from 25oc to 70 oc. when the temperature increases power dissipation will occur, it may be a static power or dynamic power. so, power consumed for the circuit in the system is less. The system performance gets reduced accordingly. 2. PROPOSED WORK The Manchester Carry-Chain Adder is a chain of pass-transistors that are used to implement the carry chain. During precharge, all intermediate nodes (e.g. Cout0) are charged to Vdd. During the evaluation phase, Cout_k is discharged if there is an incoming carry Cin0 and the previous propagate signals (P 0...Pk-1) are high. Only 4 diffusion capacitances are present at each node, but the distributed RC-nature of the chain results in a delay that is quadratic with the number of bits. Transistor sizing was performed to improve performance. The details are elaborated on in the design strategy section. The Manchester carry chain was designed using dynamic logic and implements the following logical function: Coi=Gi+PiCoi−1 Fig 5 schematic diagram of votage vs temperature In this paper three types of technologies with an adders circuit are presented with L=0.12μm technology and with a supply voltage of 1.2V. These high performance domino styles improve the scalability of multiple bit domino logic adders. Using these methods it is possible to implement the adder circuits with a transistor gate length of L=22nm along with a supply voltage of 0.8V. These adder circuits are superior in performance compared to conventional static logic adders. These adder circuits minimize the chip area, minimize the leakage power, and improve the noise tolerance without much speed degradation. Also the delay between the gates is now reduced to the order of pico seconds. These types of domino logic circuits can be used in high performance microprocessors. By considering this below, comparing these technologies with static and dynamic(domino) gates Delay in Nano seconds Fig.6 4bit Manchester carry chain adder Manchester carry chain adder presents a efficient implementation of multi-output domino CMOS logic. The further work has been analyzed with different technologies and results are observed using HSPICE tool. 3. SIMULATION RESULTS 3 The simulations were performed using L=0.12μm technology along with the supply voltage VDD=1.2V. In this paper a 256-bit adder is constructed using three different domino technologies such as 90nm,65nm,22nm .This adder circuit has the area overhead of an extra NMOS transistor which is connected to the keeper from the dynamic(domino) circuit. This adder circuit has much better noise margin, low leakage current and low power consumption compared to the adder circuits designed L=32nm using domino logic styles with traditional feedback keepers. The adder circuit designed using carry look-ahead adder domino logic is used but it consumes some additional power due to the transistors count has been increased. The following simulation result is Fig 9 Power analysis graph Fig.7 simulation diagram of voltage vs time BITS/DELAY SUM DELAY(ns) CARRY DELAY(ns) POWER(uw) 4BIT 247 242 37 8BIT 335 320 39 16BIT 355 340 42 32BIT 425 415 45 64BIT 545 520 51 128BIT 612 595 65 256BIT 705 687 73 Table.3 comparison of sum delay,carry delay,and power 4. CONCLUSION In this paper the performance of 256-bit adder circuit designed using in dynamic (domino) circuit techniques is analyzed in detail and its performance is compared with static adder circuits. The 256-bit adder circuit is simulated using L=32 nm technology along with supply voltage VDD=0.8V. The experimental results shows that these adder circuits gives superior performance compared to adder circuits designed using conventional domino techniques. Further, Manchester carry chain adder in 256-bit is used for increasing high speed and reduced delay in the domino circuit. Fig .8 comparison of temperature vs sum ,carry, power graph The delay graph is plotted with rest to number of bits and the gate length is 32nm technology. In this graph explain the REFERENCES [1] Jagannath Samanta, Mousam Halder and Bishnu Prasad De, "Performance analysis of high speed low power carry- lookahead adder using different logic styles," International Journal of Soft Computing and Engineering (IJSCE), vol. 2, issue 6, pp. 330-336, Jan- 2013 proposed carry look-ahead adder based domino circuit has less delay but it will increase to increase number of bits for many applications. [2] Nagaraj Y, Shrinivas K, Veeresh K, Veeresh A, Madhu Patil and Dr. Chirag Sharma, "FPGA implementation of different adder architectures," International Journal of Emerging Technology and Advanced Engineering, vol. 2, issue 8, pp. 362-364, August 2012 [3] A. Weinberger and J. L. Smith, “A logic for high speed addition,” Nat. Bureau Stand. Circulation, vol. 591, pp. 3–12, 1958. [4] S. Vassiliadis, “Recursive equations for hardwired binary adders,” Int. J.Electron., vol. 67, no. 2, pp. 201–213, Aug. 1989. 4 [5] P. K. Chan and M. D. F. Schlag, “Analysis and design of CMOS Manchester adders with variable carry-skip,” IEEE Trans. Comput., vol. 39,no. 8, pp. 983–992, Aug. 1990. [6] Z.Wang, G. Jullien,W.Miller, J.Wang, and S. Bizzan, “Fast adders using enhanced multiple-output domino logic,” IEEE J. Solid State Circuits,vol. 32, no. 2, pp. 206–214, Feb. 1997. [7] G. A. Ruiz, “New static multi-output carry look-ahead CMOS adders,”Proc. Inst. Elect. Eng.—Circuits, Devices Syst., vol. 144, no. 6, pp. 350–354, Dec. 1997. [8] C. Efstathiou, H. T. Vergos, and D. Nikolos, “Ling adders in CMOS standard cell technologies,” in Proc. 9th ICECS, vol. 2,pp. 485–489,Sep. 2002. [9] S. Perri, P. Corsonello, F. Pezzimenti, and V. Kantabutra, “Fast and energy-efficient Manchester carry-bypass adders,” Proc. Inst. Elect. Eng.—Circuits Devices Syst., vol. 151, no. 6, pp. 497–502, Dec. 2004. [10] M. Osorio, C. Sampaio, A. Reis, and R. Ribas, “Enhanced 32-bit carry look-ahead adder using multiple output enable-disable CMOS differential logic,” in Proc. 17th Symp. Integr. Circuits Syst. Design, pp. 181–185,2004. [11] G. A. Ruiz and M. Granda, “An area-efficient static CMOS carry-select adder based on a compact carry look-ahead unit,” Microelectron. J.,vol. 35, no. 12, pp. 939–944, Dec. 2004. [12] G. Dimitrakopoulos and D. Nikolos, “High-speed parallel-prefix VLSI Ling adders,” IEEE Trans. Comput., vol. 54, no. 2, pp. 225–231,Feb. 2005. [13] A. A. Amin, “Area-efficient high-speed carry chain,” Electron. Lett.,vol. 43, no. 23, pp. 1258–1260, Nov. 2007. [14]M.C.B. Osorio, C.A. Sampaio, A. I. Reis, R.P. Ribas., etc.al. “Enhanced 32-bit Carry Look ahead Adder using Multiple Output Enable-Disable CMOS Differential Logic”. SBCCI, pp. 181-185, 2004. [15] K. Ueda, H. Suzuki, K. Suda, H. Shinohara, K. Mashiko,et.al., “A 64-bit Carry Look Ahead Adder Using Pass.” IEEE Journal of Solid-Stale Circuits, vol. 31, no. 6, pp. 810-818, 1996. [16] Wei, B. W.; Thompson, C. D.; "Area-Time Optimal Adder Design", Computers, IEEE Transactions on, Vol. 39, No. 5, May 1990, pp. 666-675 [17] V. Kantabruta, A recursive carry-lookahead/carry-select hybrid adder, IEEE Transactions on Computers 42 (12) (1993) 1495–1499 5