Nanoelectronic Scaling Tradeoffs: What does Physics have to say? Victor V. Zhirnov

advertisement
Nanoelectronic Scaling Tradeoffs: What does
Physics have to say?
Victor V. Zhirnov
Semiconductor Research Corporation
1
Outline
u
u
u
u
u
u
Technology Roadmap on Semiconductors
Fundamentals of information processing,
Fundamental limits to scaling
Thermal Limits
Message: We suggest that the benefits from
nanoelectronics research may, in the -Short term lie with the invention of new structures,
materials and processes that extend the CMOS
technology platform
v
u
Radical thermal solutions are needed
Long term enable invention of entirely new information
processing technologies
2
International Technology Roadmap on Semiconductors
u
A very detailed industrial perspective on the future
requirements for micro/nano electronic technologies
v
u
u
u
u
Goal is to continue exponential gains in performance/price
for the next fifteen years
Built on worldwide consensus of leading industrial,
government, and academic technologists
Provides guidance for the semiconductor industry and for
academic research worldwide
Content: critical requirements and judgment of status
Projects that by 2016, half-pitch spacing of metal lines will
be 22 nanometers and device gate lengths will be 9
nanometers
3
Moore’s Law: Minimum Feature Size
500
Minimum Feature Size (nm)
350
1994
250
1997
1998
180
1999
130
2000
100
70
50
35
25
‘95
‘00
‘05
‘10
4
Devices will soon be on the
“molecular” or “atomistic” scale
1.00E+27
Atoms per Bit
1.00E+24
IC and Bio-based device
converge
ICdensity
and bio-based
device
densities converge
1.00E+21
1.00E+18
1.00E+15
Molecular
Molecular
devices
Devices
1.00E+12
1.00E+09
1.00E+06
1.00E+03
1.00E+00
-60
-50
-40
-30
-20
-10
0
Years from 2000
10
20
30
40
6
Evolution of Electronics
Controllable
resistor
Analog: TV, radio,
communications...
Switch
DIGITAL
INFORMATION
PROCESSING
Digital: Computation
General Purpose Computer (GPC) accepts arbitrary types of data and sets of
instructions to perform arbitrary tasks of transmission, processing, and storing
the information
Parameters of GPC:
u Number of components (integration density/functional complexity)
u Speed
u Energy consumption
7
What is Information?
Information is…
u …Measure of distinguishability
u …A function of a priory probability of a given state or
outcome among the universe of possible states.
I = K ln N
N min = 2
I ( N min ) = 1
1 = K ln 2
The binary choice: YES/NO, 1/0 etc
1
K=
ln 2
N = 2n
1
I = K ln 2 =
n ln 2 = n = log 2 N
ln 2
n
8
Constituents of the Information Theory
u
Constituents of the Information Theory
•
•
u
Sender and recipient
Symbols (microstates) as elementary units of information
Information carriers
Information is physical!
9
The Abacus, an ancient digital
calculating device
Information is represented in digital form
Each column denotes a decimal digit
Binary representation: two possible positions for each bead
A bead in the abacus is a memory device, not a logic gate
10
Source: IBM
Particle Location is an Indicator of
State
1
1
0
0
1 0
12
Two-well bit
a
Eb
w
Eb
a
w
13
A physical system as a computing
medium
u
u
u
u
We need to create a bit first. Information processing always
requires physical carrier, which are material particles.
First requirement to physical realization of a bit implies
creating distinguishable states within a system of such
material particles.
The second requirement is conditional change of state.
The properties of distinguishability and conditional change
of state are two fundamental properties of a material
subsystem to represent information. These properties can
be obtained by creating energy barriers in a material
system.
14
Kroemer’s Lemma of Proven
Ignorance
u
u
If in discussing a semiconductor problem,
you cannot draw an Energy-Band-Diagram,
this shows that you don’t know what are you
talking about
If you can draw one, but don’t, then your
audience won’t know what are you talking
about
15
Barrier engineering in semiconductors
By doping, it is possible to create a built-in field and energy barriers of
controllable height and length within semiconductor. It allows one to achieve
conditional complex electron transport between different energy states inside
semiconductors that is needed in the physical realization of devices for
information processing.
p
n
n
16
Heterojunction barriers
w
Eb
Double barrier
a
Superlattice
17
Example: Field Effect Transistor
Long Channel
Short Channel
It is possible to derive MOSFET I-V equation form the
two-well one-barrier model
18
Ideal von Neumann’s Computer
Designers and Users want:
u Highest possible integration density (n)
v
v
u
Highest possible speed (f=1/t)
v
u
To keep chips size small and increase yields
To increase functionality
Speed sells!
Lowest possible power consumption (P)
v
v
Decrease demands for energy
The generation of too much heat means costly cooling
systems
19
Binary Information Throughput (BIT)
BIT is the maximum number of binary
transition per unit time
BIT = nbit f
- one measure of
computational capability
nbit – the number of binary
states (e.g. transistors)
f-switching frequency
BIT, Mbit/ns
10000
1000
100
10
1
2000
2005
2010
2015
2020
Year
20
Energetics of Computation
P = Ebit nf
Requirements for an ideal computer:
(integration density)
n=max
(switching frequency)
f=max
(power)
P=min
BIT=max
21
Power density will increase
Power Density (W/cm2)
10000
1000
100
Rocket
Nozzle
Sun
Nuclear
Reactor
8086
10 4004
Hot Plate
P6
8008 8085
Pentium® proc
386
286
486
8080
1
1970
1980
1990
2000
2010
Year
Power density too high to keep junctions at low temp
®
14
22
Lowest Barrier:
Distinguishability Barrier
a
Distinguishability D implies low
probability of spontaneous transitions
between two wells (error probability)
D=max,
=0
D=0,
Classic distinguishability:
=0.5 (50%)
Π classic
Eb
Eb
a
Eb
= exp(−
)
k BT
Minimum distinguishable barrier: _=0.5
1
Eb
= exp(−
)
2
k BT
Eb=kTln2
Shannon - von Neumann - Landauer limit
23
Smallest Size:
The Heisenberg Barrier
∆x∆p ≥ h
∆E∆t ≥ h
acrit
h
=
2mEb
t min
h
=
Eb
Eb=kTln2
24
Classic and Quantum
Distinguishability @ =0.5
Π classic
min
b
E
Eb
= exp(−
)
k BT
= k BT ln 2
a
Eb
Eb
a
WKB: (Tunneling)
2 2m
Π quantum = exp(−
a Eb )
h
2
2
h ln 2
min
Eb =
2
8ma
a
Eb
Eb
a
25
Total Distinguishability @
=0.5
Π error = Π classic + Π quantum − Π classic Π quantum =
hEb + 2akT 2mEb
Eb
2 2m
= exp(− ) + exp(−
a Eb ) − exp(−
)
kT
h
hkT
1.2
1.1
min
b
E
2
h (ln 2)
≈ kT ln 2 +
2
8ma
Eb/kT
Generalized expression for
the minimum energy barrier
to create a bit
2
1
0.9
0.8
0.7
kTln2
0.6
0
5
10
15
20
a, nm
26
Least Energy Computer
1) Minimum distance between two distinguishable states (Heisenberg)
xmin
h
=a=
= 1.5nm(300 K )
2mkT ln 2
2) Minimum state switching time (Heisenberg)
h
t st =
= 1.2 × 10 −13 s (300 K )
2kT ln 2
3) Maximum gate density
n=
1
2
xmin
4) Maximum binary throughput BITmax
gate
= 4.6 ×10
cm 2
13
(kT ln 2) 2
7 Tbit
= 2m
= 10
3
h
ps
27
Total Power Consumption at
Minimal Energy per bit - {kTln(2)}
Eb kT ln 2 2
Pbit =
=
= (kT ln 2) 2
t sc
t sc
h
Pchip
W
= 4.74 ×10
2
cm
6
Pchip = nPbit
T=300 K
The circuit would vaporize when it is turned on!
28
Single Electron Devices Don’t Avoid the Power
Problem
Electrostatic energy
to add an electron to
a particle with size w
e-
Ea =
e
4
0
fEa
P = Ea fn = 2
w
f=10 GHz
w
0.4
0.35
0.3
0.25
0.2
0.15
0.1
0.05
0
100000
Power, W/cm2
Ea, eV
w
Ebit>Ea
10000
1000
100
10
1
0
5
10
w, nm
15
20
0
10
20
30
w, nm
In sub-10 nm single electron devices, the minimum energy to
add/remove electron is much larger than kT, and it increases as
size deceases.
29
Power vs. Error trade-off
Πerr
False bit
occurrence
Computation at Πerr=0.5 is impossible
Additional Static
(leakage) power
In useful computation, Πerr <<0.5,
hence much larger total power is
needed
1.E+08
Ptotal , W
1.E+08
8.E+07
6.E+07
a=1nm;
n=1014 bit/cm2;
f=3x1013 Hz
4.E+07
2.E+07
0.E+00
0
0.1
0.2
0.3
0.4
0.5
0.6
err
30
Dynamic and Static Power
Eb
Pd = I onV = ⋅ n ⋅ ef ⋅
= Eb fn
e
Eb Pd
Ps = ioff V = n ⋅ ef ⋅ Π err ( Eb ) ⋅
= Π err ( Eb )
e
1.00E+09
1.00E+09
1.00E+08
1.00E+08
1.00E+07
1.00E+07
P, W
P, W
a=1nm; n=1014 bit/cm2; f=3x1013 Hz
α=1
α=0.05
1.00E+06
1.00E+05
0
kT
0.05
0.1
0.15
Eb, eV
dynamic power
static power
total power
1.00E+05
dynamic power
static power
total power
1.00E+04
1.00E+06
1.00E+04
0.2
0.25
0
0.05
0.1
0.15
Eb, eV
0.2
0.25
31
Will Spintronics Alleviate the Power
Problem?
u
Expectations:
v
v
u
ultra low power ???
ultra high density ???
E↑
1
∆E = 2
r
B
B
B
E↓
A quote: “Spintronics would use much less power than conventional electronics,
because the energy needed to change a spin is a minute fraction of what is
needed to push charge around”
Example:
T=300 K
B=1.5 T
(practically viable)
u
0
∆E=2 µB B = 2_9.27_10-24 J / T_1.5 T =2.78_10-23 J= 1.74_10-4 eV
Π err
 2 BB 
= exp −
 = 0.99
 kT 
(µB = 9.27_10-24 J / T)
Is the very low energy to change state an advantage (e.g. low dynamic
power) or a disadvantage (e.g. high error probability) for applications of
spin devices in information processing?
32
How much heat a solid system
can tolerate?…
ITRS 2001 projects 93 W/cm2 for MPU in 2016
Several hundred W/cm2 is close to known limits of
heat removal from a 2-dimensional solid material
structure with Tmax =125°C
Experimental demonstrations of on-Si cooling systems
(without active devices):
680 W/cm2
thermoelectric (Zheng et al.)
790 W/cm2
microchannel (Tuckerman and Pease)
33
…and implications to the Roadmap:
Inflexion of ITRS vectors?
300 K
l
Heisenberg barrier
1.00E-08
Memory
1
1.00E-09
l is the “cell size”:
t, s
1.00E-10
I=n-1/2
4
lmin=a
Logic
1.00E-11
lMPU=(11-15)a
P=100 W/cm2
3
1.00E-12
2
1.00E-13
1
10
l, nm
100
Heisenberg barrier
1000
34
Implications for Nanoelectronics
Utilizing ElectronTransport
u
Scaling to molecular dimensions may not yield
performance increases
v
u
We will be forced to trade-off between speed and density
Optimal dimensions (depending on speed/density tradeoffs) for electronic switches should range between 5 and
50 nm, and this may be achievable with silicon technology
v
Within the scope of ITRS projections
35
Fundamentals of Heat Removal
Quotes from anonymous scientists working at the frontiers of
nanoelectronics:
“ Heat removal is not an issue. Simply, engineers must invent
better technologies for heat removal and cooling ”.
“Heat can be dissipated somewhere else”
Three fundamentals of heat removal:
1)The Newton's Law of Cooling: q=h(Th-Ta)
(h-heat transfer coefficient)
2) The Ambient: Ta=300 K !!!Wcool
3) The Carnot’s theorem:
Work to
be done
Ta − Tc −
=
Q
Tc
Heat to be
removed
36
Newton’s law of cooling
Q=h*A**(Th-Ta)
Cooling method
.
Air, natural convection
Air, forced convection
.
Water, natural convection
Water, forced convection
max (Th-Ta)=100K
h, W/cm2·K
0.001
0.01
0.1
1
.
Boiling
10
Max P=1000 W/cm2 ? (A=const)
*h
– the heat transfer coefficient
**A - area
37
The ambient interface
Chip interface
Q1
Tj
Ambient interface
A1
Chip
Q2=Q1+W
Refrigerator
R1
W
R2 A2
1
R=
Ah
- thermal
resistance
Air
air, T=300 K
Q2 > Q1
h2 < h1
A2>A1
Surface extension is
essential for removal of
high heat fluxes to the
environment
Lbox
Device
Ahot<<Acold
Boiling38
Thermoelectric cooling
Chip
R1
T=125 C
Hot
side
R2
Heat rejection to the ambient is a
universal element of all cooling
methods and requires surface
extension
Effect of varying R2 (e.g. changing air’s
humidity, temperature etc.) on the heat
removal rate and junction temperature, for
a thermoelectric cooler, compared with air
cooling by a fan.
39
How Much Volume is Needed to Transfer
Chip Heat to the Ambient?
Air: Ta=300 K
Cold wall:
Q>107 W
Tcold=350 K
Example:
h=0.001 W/cm2K
n=1014 bit/cm2
t=0.01 ps
BIT=104 Tbit/ps
Box size:
a=1 nm
P=107
W;Ahot=1
Ebit=0.08 eV (~4kT)
cm2
Lbox
Πerror=10% /device
P=107 W/cm2
Thot=400 K
Acold>2x108 cm2
Lbox>141 m
Very Big Box!
Example: computer min. size vs power
P, W
Q>P
Acold
P
>
hnc ∆T
Approx. box dimensions, cm
1
3x3x0.5
10
10x8x1
100
30x20x8
1000
100x50x40
10000
182x182x182
40
Carnot’s Refrigerator and
Cryogenic Computation
The efficiency of heat engines dramatically drops at T<<Ta
Wcool
Ta − Tc −
=
Q
Tc
Min. total power needed to run a 100 W chip:
at 77 K - 300 W
at 4.2 K - 7 kW
41
Cryogenic Computation with nanodevices
total
Ebit
Ta − Tdev
Ta
Ta
= Ebit +
Ebit =
Ebit =
Tdev
Tdev
Tdev

h 2 (ln 2) 2 
k BTdev ln 2 +
=
2
8ma 

Ta h 2 (ln 2) 2
= k BTa ln 2 +
> k BTa ln 2
2
Tdev 8ma
1000.00
T=300K
Eb, meV (total)
T=4K
100.00
10.00
0
5
10
15
20
Due to tunneling, the
power consumed by the
device depends on both
operating temperature and
size that manifests itself
with unexpectedly dramatic
increases in total power
consumption at cryogenic
temperatures.
a, nm
42
The barriers dilemma
u
u
u
Energy barriers are key components to provide Information
Flow
Energy barriers are negative factor for Heat Flow
Can we think of radically new ways of heat removal based
on coherent heat flows, e.g. heat lasers or solitons?
Reflection
phonons
Transmission
A driver for physical layout?
43
A question – What to do?
Gate
Source
Drain
Substrate
Silicon MOSFET
?
IBM CNTFET
44
Energy efficiency of CMOS
Does practical CMOS operate far from fundamental limits?
u
2016 ITRS 22-nm Node:
Fundamental limits
h
= 1.5nm(300 K )
2mkT ln 2
v
xmin: Channel length
9 nm
xmin = a =
v
Esw: Switching energy
2 x 10-18 J
Esw > kTln2 = 3 x 10-21 J
v
Eb: S-Ch barrier height
~0.4 eV
Eb>kTln2=0.02 eV
v
Electrons/switching event
~50
1
v
Energy/electron
4 x 10-20 J~12 kT 3 x 10-21J ~ kT
45
Can we decrease the energy of
CMOS?
2 x 10-18 J
3 x 10-21J
Yes (in principle):
Decrease the barrier height:
0.4 eV
0.02 eV
Eb
E
Π err = exp(− b )
kT
12kT
0.001%
8kT
0.03%
4kT
2%
kT ln2
Decrease the number of
electron per switching event:
50
1
50%
46
What to do? (Cont’d)
?
Gate
Source
Quantum Computer
Drain
Substrate
Silicon MOSFET
Cellular Non Linear Network
Conventional von Neumann
Architecture
New Information
Processing Architectures
47
2003 ITRS: Emerging Research
Devices
Device
FET
Density
(dev/cm2)
Switch
Speed
Circuit
Speed
Switching
energy, J
Binary
throughput,
GBit/ns/cm2
3E9
RSFQ
1D
struct
RTD
SET
Molecul
ar
QCA
Spin
transistor
106
3E9
3E9
6E10
1012
3E10
3E9
700
1.2 THz
?
1 THz
1GHz
?
30 MHz 700 GHz
GHz
30
400 GHz 30 GHz 30 GHz 100 MHz 1 MHz
1 MHz
30 GHz
GHz
-18
2x 10-19
10-18
]
-16 E: [> 10
-18
2E-18
2E-18 >2E-18
1.3x10
-17 2 x 10
[>1.4E-17
[>1.5E-17]
M:>4x10
86
0.4
86
86
10
?
0.06
86
48
Classic to Quantum transition
u
Classic memory bits become indistinguishable, which limits
our ability to use them for computation
BUT
u
u
The superposition of indistinguishable states is a key
concept of Quantum Computation
A quantum bit or qbit is a physical system with two
quantum states
49
Power of quantum computing
u
Quantum information storage
v
N quantum bits stores 2N complex numbers
Consider information in 300 entangled qubits
2300 = 1090
n
Compare to the total number of atoms in the Universe:
Natoms=1080
n
If dramatic improvement of the information throughput can be
achieved, the cryogenic operation might be affordable
50
Neuromorphic Computing
u
Implies computational schemes and systems resembling
operation of human brains.
v
u
The potential capabilities of neuromorphic computers could
be close to those of the brain, thus enabling e.g. artificial
intelligence
Properties of brain:
v
v
v
v
v
Mass – 1.5 kg
Volume – 1.5 l
Energy consumption – ~10 W
Information stored – 1e14 bits
1e13 bits/s
51
Conclusions
v
Fundamental considerations suggest that the potential
benefits from replacing CMOS devices with new types of
electron transport devices may be limited
v
Search for radically new methods of heat removal is one of
the most critical research directions
v
The exploration of alternative approaches to von Neumann
type computing, such as brain or Reversible/Quantum
Computation, is becoming a strategic imperative.
n
We need a concerted effort in these areas because of the long
lead times for the introduction of radically new technologies
52
Download