Low Noise High Performance Wide OR Domino Logic in DSM Technology

advertisement
International Journal of Engineering Trends and Technology (IJETT) – Volume 33 Number 2- March 2016
Low Noise High Performance Wide OR
Domino Logic in DSM Technology
Anshul Namdev #1, Ashish Raghuwanshi #2,
#
PG Student [VLSI], Dept. of ECE, IES college of Engg. Bhopal, RGPV Bhopal, M.P. India
Abstract— Dynamic CMOS circuit exhibits high
performance, faster and more compact than static
CMOS circuit. Also, due to the absence of the PMOS
transistors, the input capacitance is lower. However,
dynamic logic style suffers from charge sharing,
higher power dissipation (due to high switching
activity), clock load and complexity. In this paper we
analyze and compare different topologies of domino
logic at circuit level for low power consumption, high
performance and batter noise immunity. We compare
power, delay, and unit noise gain (UNG) is calculated
by keeping Same W/L ratio of NMOS and PMOS
transistor for fair comparison of results . Simulation
is performed at Cadence virtuoso at 65 nm technology
with supply voltage is of 1 V at 100MHz frequency,
bottleneck operating temperature of 27 degrees
centigrade. Simulation results on 8, 16 and 32 inputs
OR gates with existing and proposed circuit showed
improvements noise immunity and decrease in delay
and power compared with other conventional
techniques.
By scaling down the technology the sensitivity of
the dynamic node to the noise sources has emerged as
a serious design challenge. For improving noise
immunity and reducing leakage the keeper transistor is
added.
However, power dissipation increases and
performance degrades by adding this pMOS keeper
transistor. Upsizing the keeper transistor improves
robustness at a cost of higher power dissipation and
delay. The severity increases many fold in wide
Domino circuits because of higher number of parallel
pull-down branches [2]. Fig. 1 illustrates [12] the
failure mechanism for a 32-in OR gate using FLDL
style at VDD=1 V for temperature 27OC.
Keywords— Wide fan in, CKD, HS, DFD, LCR, UNG.
I. INTRODUCTION
circuit has superior to the static logic circuit [1, 2].
It requires lesser area and reduces output load
capacitance hence enhance the speed [3, 4]. The high
speed domino logic circuit has become one of the
most crucial components of digital circuits. The
scaling of CMOS technology reduces both the
threshold and gate oxide thickness (T ox). The scaling
enhances the leakage current in the domino circuit.
Dynamic circuit typically uses a clock signal to
distinguish between the precharge and evaluation
phases. During the precharge clock (low) phase, the
precharge device is used to pull the output high,
assuming that there does not exist a low impedance
path from the output node to ground [5]. To ensure
that there is no active paths exist through the pull
down network or use evaluation device. An evaluation
device can be used to guarantee that the conducting
paths through the pull-up and pull down networks are
mutually exclusive. During the evaluation clock (high)
phase, the precharge device is turned OFF and pulldown network will either discharge the output node
making high (1) to low (0) transition or stay the same
depending on the input conditions.
ISSN: 2231-5381
Fig. 1 Failure mechanism for 32-in OR gate (FLDL)
Thus trade off exist between delay and power to
improve noise and leakage immunity [3]. Such tradeoff is not acceptable because it may increase the delay
or make the circuit too power hungry. There are
several techniques introduce in the paper to address
this issue.
The rest of the paper is arranged as follows. Section
II, studies five types of circuits that have been
proposed in related literatures, Standard Footless
Domino logic, Conditional Keeper Domino logic,
High Speed domino logic, Leakage Current Replica
diode and Diode Footed Domino logic.
In Section III we have briefly explain the proposed
modified circuit which improves power delay and
UNG. Section IV shows the simulation results and
finally conclusion is offered in Section V.
http://www.ijettjournal.org
Page 90
International Journal of Engineering Trends and Technology (IJETT) – Volume 33 Number 2- March 2016
II. RELATED WORK
In order to maintain the performance of the chip
along with high driving capability at lower supply
voltage, the VTH is reduced. However, the Threshold
Voltage (VTH) scaling results in the substantial
increase of the Subthreshold Leakage Current (I SUB) as
VTH is exponentially proportional to ISUB. In DSM
technology three domination leakage current occurred
in the CMOS device such as ISUB, IGATE, and IBTBT, as
shown in Figure 1.
keeper transistor MP2 turns ON to maintain the
voltage of the dynamic node. During the evaluation
mode, i.e. when the CLK goes HIGH, the dynamic
node is either discharged to ground or remains HIGH
depending on the inputs. Fig.1. Standard Footed
Domino circuit which have Lower Power
consumption by inserting a NMOS transistor Footed
Domino circuit come into existence [2]
VDD
VDD
PRECHARGE TRANSISTOR
CLK
KEEPER TRANSISTOR
MP1
MP2
VDD
G
MP3
DYNAMIC NODE
S
D
Gate Leakage
INn
Substrate
IN2
IN1
OUTPUT
MN1
Subthrehold
Leakage
Reverse Biased
Junction BTBT
Bulk
Fig.2. Standard Footerless Domino Logic Circuit
Fig. 1. Shows Various Leakage current in DSM
technology
2.1 Leakage power dissipation
Total Power dissipation is calculated as:PTotal = PDynamic + PSwatching + PShort-circuit + PStatic
Where PT is the dynamic or switching power
dissipation, occurs due to charging or discharging the
parasitic capacitances in node voltage transition. P ST is
the static or leakage power dissipation, combination of
the subthreshold leakage power due to the not ideal off
state. PShort-circuit is the short circuit power dissipation
occurs during switching operation when both the Pull
Up and Pull Down networks are in ON state. P Static is
the static DC power dissipated
The main power contribution is CMOS technology
is basically Sub-threshols Leakage and gate oxide
leakage current is the dominant in nanometer regime.
B. Conditional Keeper Domino Logic (CKD)
Conditional Keeper employs two keepers, small
keeper and large keeper [5]. In this technique, the
keeper device (PK) in conventional domino is divided
into two smaller ones, PK1 and PK2 as shown in Fig.3.
The keeper sizes are chosen such that PK=PK1+PK2
[6]. Such sizing insures the same level of leakage
tolerance as the conventional gate but yet improving
the speed. The circuit works as follows:
Precharge Phase: In pre-charge phase when clock
is low, the pull-up transistor is on, so the dynamic
node starts being charge up to VDD. Small keeper
transistor PK2 is ON which hold the voltage of
dynamic node to high level.
Evaluation Phase: A high clock pulse is applied. At
the beginning of evaluation phase when clock is high
pre-charge transistors and large keeper PK1 are off.
When all the inputs are at low logic level, i.e. in
standby mode, the dynamic node after the delays of
two inverters remains high, in this condition the output
node of NAND gate goes low, this causes the large
a) Sub-threshols Leakage
keeper PK1 to be turned on. The large keeper is
Sub-threshold leakage current is very significant deployed after a delay for two inverters, to prevent
component of the leakage power and this current erroneous discharge of the dynamic node when all
passes from drain to source through the channel [6-7 inputs remain LOW. The small keeper PK2; however
A. Standard Footless Domino Logic Circuit (SFLD) remain ON to compensate for charge leakage until
The footless scheme [4] is characterized by the PK1 is activated.
fact that discharge of dynamic node is faster. This
property is exploited by the high-performance circuits.
The circuit of the SFLD logic is shown in Fig. 1.
Operation of Footless-Domino is as follows: During
the pre-charge phase, i.e. when then clock (CLK) is
LOW, the dynamic node is charged to VDD and the
ISSN: 2231-5381
http://www.ijettjournal.org
Page 91
International Journal of Engineering Trends and Technology (IJETT) – Volume 33 Number 2- March 2016
VDD
CLK
Delay
PK1
VDD
VDD
PRECHARGE TRANSISTOR
CLK
MP1
PK2
DYNAMIC NODE
OUTPUT
IN2
INn
IN1
Fig.3 Conditional Keeper Domino Logic
C. High Speed Domino Logic (HS)
The circuit of the HS Domino logic is shown in
Fig.4 [7]. In HS domino the keeper transistor is driven
by a combination of the output node and a delayed
clock. The circuit works as follows: At the start of the
evaluation phase, when clock is high, MP 3 turns on
and then the keeper transistor MP2 turns OFF. In this
way, the contention between evaluation network and
keeper transistor is reduced by turning off the keeper
transistor at the beginning of evaluation mode. After
the delay equals the delay of two inverters, transistor
MP3 turns off. At this moment, if the dynamic node
has been discharged to ground, i.e. if any input goes
high, the nMOS transistor MN1 remains OFF. Thus
the voltage at the gate of the keeper goes to VDD-Vth
and not VDD causing higher leakage current though the
keeper transistor [8].
terminals connected together. Fig.5 [9] shows the
Diode Footed Domino configuration. Stacking effect
[10] occur because this transistor M1 is connected in
series with the evaluation network. Thus subthreshold
leakage current reduces as a result of stacking effect.
DFD circuit works as follow:
Preacharge Phase: during this phase a low clock
pulse is applied. Thus during the precharge phase
transistor M3 is ON and it turns off the mirror
transistor M2 to prevent any possible short-circuit
current through M2 during this phase.
Evaluation phase: In this phase high clock pulse is
applied and any of the inputs are allowed to switch to
high level. Due to the leakage current of the
evaluation transistor some voltage drops across the
diode footer M1. Thus a negative voltage exists
between gate and source of the evaluation transistors
that is in OFF mode. This negative voltage
exponentially reduces subthreshold current. Moreover,
the voltage-drop across the diode increases the body
effect of the evaluation transistors, which also helps in
the subthreshold leakage reduction. Thus noise
immunity improves by the higher gate switching
voltage at the expense of speed degradation. Speed is
improved by adding a mirror transistor M 2 as shown in
Fig. 5. By increasing the mirror ratio, the performance
can be increased.
CLK
VDD
VDD
MPRE
MK
DYNAMIC NODE
OUTPUT
Evaluation
Network
VDD
VDD
VDD
N_FOOT
M1
M2
MP3
CLK
MP2
MP1
MN1
CLK
DYNAMIC NODE
OUTPUT
INn
IN2
M3
M4
Fig.5 Diode Footed Domino Logic
IN1
Fig.4 High Speed Domino Logic
D. Diode Footed Domino (DFD)
In diode footed domino we modify the conventional
domino circuit by adding an nMOS transistor M1 in
series with the
foot of the evaluation network. This nMOS
transistor is in diode configuration i.e. gate and drain
ISSN: 2231-5381
E. Leakage Current Replica (LCR)
In Leakage Current Replica Domino logic one extra
pMOS transistor MK1 is stacked above the keeper
transistor as shown in Fig. 6 [11]. Addition to this
pMOS transistor a replica current mirror is added with
MK1. The main function of the current mirror is to
track the leakage current and copies it into the
dynamic gate through the transistor MK1. Construction
of the current mirror is such that it draws current
sf.Ileak where sf is safety factor and Ileak is dynamic gate
leakage current. An extra nMOS transistor M2 is used
in the current mirror. Transistor M2 is in diode
connection and work as a replica of the worst case
leakage current hence its width is set equal to the sum
http://www.ijettjournal.org
Page 92
International Journal of Engineering Trends and Technology (IJETT) – Volume 33 Number 2- March 2016
of the widths of the nMOS transistor in the PDN times
the safety factor. Width of the transistor M1 is equal to
that of the MK1.
In an LCR current mirror, dynamic node must be
pulled close to VDD to avoid compromising the noise
immunity of the dynamic gate, so M1 operates in the
triode region In this mode MK1 exits in saturation
region and continuously rise until MK1’s triode region
current matches the actual leakage current. Rest of the
working is same as that of the conventional domino
logic.
Shared Replica
Current Mirror
VDD
MK1
CLK
MP1
KPR
M1
sf.Ileak
Ileak
MK2
DYNAMIC NODE
M2
OUTPUT
Evaluation
Network
Shared Replica
Current Mirror
VDD
VDD
VDD
VDD
VDD
MK1
CLK
MP1
KPR
M1
sf.Ileak
Ileak
MK2
DYNAMIC NODE
M2
OUTPUT
Evaluation
Network
Fig.6 Leakage Current Replica Domino Logic
III. PROPOSED CIRCUIT
In the previous work, the main focus was directed
towards the feedback loop associated in the domino
logic circuit, which is created by the keeper transistor
and the inverter. This approach was mainly
concentrated for the delay variation in the circuit,
which causes problem for the circuit especially in high
performance application. Therefore the variation in
delay was the centre of point in this circuit analysis. In
this paper we have Modified LCR by inserting NMOS
transistor below dynamic node. At the beginning of
the pre-charge mode the pre-charge device is in active
mode. Therefore the voltage at the dynamic node will
be at 0 V and hence that will pass through an inverter
so the output at the inverter will be VDD. In
consequence the extra added transistor will turn on
and at the beginning of the pre-charge phase there will
be contention of current between the two current
derived from the extra added transistor and the keeper
transistor because the pre-charge. During the
evaluation phase the pre-charge device gets OFF
because at this time the clock switches from logic 0 to
logic 1 as shown in Fig 7.
ISSN: 2231-5381
Fig.7. Modified Diode Current Replica
Despite of having various advantages in domino
logic circuit it suffers a lot by noise and charge
leakage trouble. Therefore in the proposed work the
main area of concentration is the noise immunity of
the circuit without affecting it’s delay performance.
To achieve such performances a new circuit is
proposed. This circuit gives approximately same delay
as the previous circuit with better noise better noise
immunity. The proposed circuit is having an extra
NMOS transistor added to the circuit. The main
function of this extra added transistor is to compensate
the leakage current associated in the circuit.
Considering domino CMOS-OR logic the leakage
current in the circuit is inversely proportional to the
unity noise gain of the circuit. In Fig.8. The proposed
circuit employs nMOS transistors to implement
logical function. The proposed circuit has four
additional transistors, MEval, MDis, MK1, MK2, and a
shared reference circuit compared to standard footless
domino (SFLD). In this topology, the nMOS
transistors MDis turn on conditionally. Gate of the MDis
is connected to the N-FOOT node.
The dynamic power consumption directly depends
on the capacitance, voltage swing, and contention
current on the switching node in the constant
condition for frequency, power supply, and
temperature. Although the proposed circuit has less
dynamic power consumption and low delay at the
expense of some area overhead compared to footless
domino. It should be noted that if the source and body
terminals are not at the same voltage level then body
effect will occurs, thus the leakage current will be
decreased further at the expense of higher deviation
due to process variations.
http://www.ijettjournal.org
Page 93
International Journal of Engineering Trends and Technology (IJETT) – Volume 33 Number 2- March 2016
VDD
Reference circuit for all gate
VDD
S.No.
CLK
MPre
M1
VDD
VDD
M5
M6
1.
SFLD
2.
FDL
M7
3.
CKD
M8
4.
HSD
5.
DFD
6.
LCR
VDD
CLK
Out
IN0
transistors INK
VDD
MK1
MK2
N-FOOT
CLK
M4
MDis
Evaluation
Domino
Circuits
M3
CLK
MEval
Fig.8. Modified Current-Comparison Based
Domino
7.
8.
IV. RESULTS AND DISCUSSION
Simulations are performed by using Spectre
Cadence tool in 65 nm technology at 1 GHz frequency
and VDD of 1 V. width of the keeper is set equal to
0.25µm. The fall/rise times of the waveforms were set
to 1pS. Delay power dissipation and UNG is
calculated for 8, 16 and 32 input OR gate in various
logic style. Delay is calculated by using Cadence
spectra calculator. For delay calculation high voltage
is applied at only one input and remaining all input are
kept at low level in evaluation phase. For calculation
of UNG [12], a pulse noise is applied to all inputs with
amplitude which is a fraction of supply voltage and a
pulse width equal to 30% of duty cycle. Then, the
amplitude of the input noise pulse is increased until
the amplitude of the resulting output noise voltage is
equal to that of the input noise signal. This noise
amplitude is defined as
UNG= Vin,Vnoise = Voutput
Proposed
Modified LCR
Proposed
Modified CCD
8
INPUT
2.31
1
3.76
1
5.76
1
6.34
1
2.52
1
3.12
6
2.01
7
1.89
1
16
INPUT
32
INPUT
5.521
8.346
6.356
9.671
7.447
11.276
9.262
12.34
3.891
5.945
6.103
10.981
5.143
7.892
4.725
7.312
Fig.10 Average Power Consumption
TABLE II.COMPARISION OF UNG (in Volt)
S.
No.
1.
2.
3.
4.
5.
6.
7.
8.
Domino
Circuits
SFLD
FDL
CKD
HSD
DFD
LCR
Proposed
Modified LCR
Proposed
Modified CCD
8
INPUT
16
INPUT
32
INPUT
0.256
0.261
0.286
0.279
0.291
0.281
0.221
0.234
0.231
0.228
0.252
0.241
0.201
0.205
0.210
0.203
0.211
0.204
0.298
0.258
0.221
0.310
0.561
0.224
Fig.9. Transient response of 2 input proposed
domino OR logic
TABLE
I.
COMPARISION
DESSIPATION (in µW)
ISSN: 2231-5381
OF
POWER
http://www.ijettjournal.org
Page 94
International Journal of Engineering Trends and Technology (IJETT) – Volume 33 Number 2- March 2016
threshold voltage improves the noise immunity. Thus
UNG of DFD logic is higher than the other domino
logic. Leakage Current Replica shows best result in
term of speed as compared to other logic. The reason
behind this high speed is that the current mirror tracks
the leakage current and copies it into the dynamic gate.
REFERENCES
[1] L. T. Clarke, G. F. Taylor, “High fan-in circuit design,” IEEE
Journal of Solid-State Circuits, vol. 31, Issue 1, January 1996,
pp.91-96.
Fig.11 UNG Comparison
TABLE III.COMPARISION OF DELAY (in ps)
S.
No.
Domino Logic
8 INPUT
16 INPUT
32 INPUT
1.
SFLD
14.20
19.53
28.59
2.
FLD
15.23
20.65
29.35
3.
CKD
17.10
22.13
31.23
4.
HSD
14.41
18.32
28.43
5.
DFD
17.56
24.21
33.12
6.
LCR
13.93
17.51
25.34
12.35
18.27
27.18
11.98
17.93
26.56
7.
8.
Proposed
Modified LCR
Proposed
Modified CCD
[2] Farshad Moradi, TuanVuCao, ElenaI.Vatajelu, AliPeiravi,
Hamid
Mahmoodi, DagT.Wisland, “Domino logic designs for
high-performance and leakage-tolerant applications,” Elsevier
INTEGRATION, the VLSI journal, Issue 24 April 2012.
[3] Farshad Moradi, AliPeiravi, HamidMahmoodi, “A New
Leakage Tolerant Design for High Fan-in Domino circuits, ” 2004
IEEE.
[4]
B.-Y. Tsui, L.-F. Chin, “A comprehensive study of the
FIBL of nanoscale MOSFETs,” IEEE Transactions on Electron
Devices 51 (10) (2004) 1733–1735.
[5] A. Alvandpour, R.K. Krishnamurthy, K. Soumyanath, S.Y.
Borkar, “A sub-130-nm conditional keeper technique,” IEEE
Journal of Solid-State Circuits 37 (2002) 633-638.
[6] A. Alvandpour, R. Krishnamurthy, K. Soumayanath, ands.
Borkar, “ A
Low-Leakage Dynamic Multi Ported
Register File in 0.13 µm CMOS,” in proceedings of international
Symposium on Low Power Electronics and Design, 2001, pp. 68-71.
[7]
M.W. Allam, M.H. Anis, M.I. Elmasry, “High speed
dynamic logic style for scaled- down CMOS and MTCMOS
technologies,” in: Proceedings of the International Symposium on
Low Power Electronics and Design, 2000, pp. 155–160.
[8] M. H. Anis, M. W. Allam, and M. I. Elmasry, “Energyefficient noisetolerant dynamic styles for scaled-down
CMOS and MTCMOS
technologies,” IEEE Trans. Very
Large Scale (VLSI) Syst., 2002.
[9] Hamid Mahmoodi and Kaushik Roy, “Diode-Footed Domino:
A Leakage-Tolerant High Fan-in Dynamic Circuit Design Style,”
IEEE Transactions on Circuits and Systems—i: Regular papers,
VOL. 51, NO. 3, MARCH 2004.
[10] K. Roy, S. Mukhopadhyay, and H. Mahmoodi-Meimand,
“Leakage current mechanisms and leakage reduction techniques in
deep-submicron CMOS circuits,” Proc. IEEE, vol. 91, pp. 305–327,
Feb. 2003.
Fig.12 Delay Comparison
[11] Yolin Lih, Nestoras Tzartzanis and William W. Walker, “A
Leakage Current Replica Keeper for Dynamic Circuits,” IEEE
JOURNAL OF SOLID-STATE CIRCUITS, VOL. 42, NO. 1,
JANUARY 2007.
V. CONCLUSION
In this paper, several domino logic circuit topologies
were proposed for high-speed and leakage-tolerant
design. From the simulation result we can conclude
that power dissipation of the Diode Footed Domino is
minimum due to the stacking effect and body biasing
of the nMOs transistor that reduces the subthreshold
leakage current exponentially while the speed is
degraded because of the increase in switching
threshold voltage. On the other hand this increased
ISSN: 2231-5381
http://www.ijettjournal.org
Page 95
Download