Jitter in High

advertisement
ECE 733 Class Notes
Jitter in High-Speed I/O
Dr. Lei
D
L i Luo
L
March 2011
©2011, Dr. Lei Luo
1
ECE 733 Class Notes
Outline
Jitter and BER



Voltage margin and Time margin
Jitter components
BER(RJ)
Jitter sources in High Speed Links





Phase-Locked Loop (PLL)
Duty Cycle Distortion (DCD)
Power Supply Induced Jitter (PSIJ)
Inter-Symbol-Interference (ISI)
Cross-Talk and reflections
Clocking architectures




Clock-Data-Recovery (CDR)
Source Synchronous Link
Clock embedded with Data
Asynchronous link
A case study for jitter tracking
©2011, Dr. Lei Luo
2
ECE 733 Class Notes
A simple high speed link
Sampler
p
D
Transmitter
Channel
TD-to-Clk
Clock
To avoid bit error, two factors are important


©2011, Dr. Lei Luo
Sampling
p gp
position ((clock p
phase))
Eye opening at sampling position
3
ECE 733 Class Notes
Voltage and Time Margin
Time margin is constrained by
 Sampler input swing requirement, which includes


sampler sensitivity, sampler mismatch/offset, sampler noise.
Eye
y diagram
g
which includes

signal swing, edge rate and signal integrity (ISI, crosstalk,
reflections, especially at the dotted sensitive area)
We will only focus on Time Margin although voltage and time really
affects each other.
other
©2011, Dr. Lei Luo
4
ECE 733 Class Notes
Jitter is Phase Wander
The main goal of study Jitter is for BER improvement.
Jitter is undesired phase wander of a clock signal or data signal, comparing
with an ideal periodic time reference.
j
affects the BER. Actually,
y, it is the jitter
j
of TD-to-Clk
Both data jjitter and clock jitter
D to Clk
really matters.
©2011, Dr. Lei Luo
5
ECE 733 Class Notes
Jitter components
Data Uncorrelated Jitter
Data Correlated Jitter
Total Jitter
(TJ)
Deterministic
Jitter (DJ)
(Bounded)
Data Dependent
Jitter (DDJ)
(Bounded)
IInter-symbol
t
b l
interference (ISI)
(Bounded)
Duty-cycle
D
t
l
distortion (DCD)
(Bounded)
Random Jitter
(RJ)
(Unbounded)
Periodic Jitter
(PJ)
(Bounded)
Sub-rate
S
b t Jitter
Jitt
(SRJ)
(Bounded)
Uncorrelated
U
l t d
Bounded Jitter (UBJ)
(Bounded)
B
Bounded/Unbounded
d d/U b
d d


RJ: grows with TIME, following Gaussian distribution
DJ: doesn’t grow with TIME
Data correlated/uncorrelated
©2011, Dr. Lei Luo
6
ECE 733 Class Notes
RJ (Random Jitter)
Source of RJ


External
E
t
l reference
f
RJ noise
i
Internal RJ from PLL, where random voltage noise integrates to random
phase noise
Concept of Jitter(BER)



Jitter is a function of BER, since both ties directly to TIME.
 BER of 10-12 means one error within a TIME range of consecutive
1012 bits.
 Jitter (the RJ part) grows with TIME
S
Specify
if a Jitter
Jitt without
ith t BER is
i nott appropriate
i t
A TJ histogram and Power Spectrum Density (PDF), plot below
©2011, Dr. Lei Luo
7
ECE 733 Class Notes
RJPP(BER)
PDF
Example of BER with
RJPP/RJRMS=1
σ=RJRMS
BER  
RJ PP

1
RJ RMS
 3
 2

0

2
3

x2
1
x2
 exp(
 exp(
)dx
)dx  
2
2
RJ PP
2 RJ RMS
2 RJ RMS
RJ RMS 2
2
RJ PP
 erfc(
)
2 RJ RMS
©2011, Dr. Lei Luo
BER
RJPP/RJRMS
1.00E-03
6.18
1.00E-04
7.438
1.00E-05
8.53
1.00E-06
9.507
1.00E-07
10.399
1.00E-08
11.224
1.00E-09
11.224
1.00E-10
12.723
1.00E-11
13.412
1.00E-12
14.069
1.00E-13
14.698
1.00E-14
15.301
1.00E-15
15.883
1.00E-16
16.444
8
ECE 733 Class Notes
How a BERT measures Jitter(BER)
An accurate way
y
Procedure:
1. Select a sampling point (blue dots)
J + TJ2
J
2. TJJ = TJ1
3. Measure the BER bit by bit: BER=
incorrect bits/total bits
Might take a long time for Low BER. For
example: takes more than 3 years to
measure BER of 10-18 for a 10Gbps link
Measured BER
A practical way is to measure the
TJ1
TJ2
jitter att hi
jitt
high
h BER and
d do
d
extrapolation (curve fitting) to
estimate jitter for lower BER.
Extrapolated BER
©2011, Dr. Lei Luo
9
ECE 733 Class Notes
Outlines
Jitter and BER



Voltage margin and Time margin
Jitter components
Jitter(BER)
Jitt sources iin Hi
Jitter
High
h Speed
S
d Li
Links
k





Phase-Locked Loop (PLL)
Power Supply Induced Jitter (PSIJ)
Duty Cycle Distortion (DCD)
Inter-Symbol-Interference (ISI)
Cross-Talk and Reflections
Clocking architectures




Clock-Data-Recovery (CDR)
Source Synchronous Link
Clock embedded with Data
Asynchronous link
A case study for jitter tracking
©2011, Dr. Lei Luo
10
ECE 733 Class Notes
Jitter Sources in High Speed Links - PLL
PLL:
•Ref Clk
•PSIJ
•Osc
O RJ
PLL
RefClk
©2011, Dr. Lei Luo
11
ECE 733 Class Notes
Jitter Sources – Level Shifters
Level Shifters:
•PSIJ
•DCD
Level
Shifters
f
PLL
RefClk
©2011, Dr. Lei Luo
12
ECE 733 Class Notes
Jitter Sources – Clock buffers
Clk bufffe rs
Clk Buffers:
•PSIJ
•DCD
Level
f
Shifters
PLL
RefClk
©2011, Dr. Lei Luo
13
ECE 733 Class Notes
Jitter Sources - Channel
Sampler
Even Data
Odd Data
Channel:
Driver
Clk b ufffe rs
Clktx
•ISI
•Reflections
Level
f
Shifters
PLL
RefClk
©2011, Dr. Lei Luo
14
ECE 733 Class Notes
Jitter Sources – Cross-Talk
Cross Talk
Sampler
Driver
Sampler
Even Data
Odd Data
Driver
Jitter from CrossTalk
Clk b ufffe rs
Clktx
Level
f
Shifters
PLL
RefClk
©2011, Dr. Lei Luo
15
ECE 733 Class Notes
Jitter Sources – Receiver Clk
Sampler
Driver
Sampler
Even Data
Odd Data
Driver
Clktx
Clk buff
ffe rs
Clk b ufffe rs
Clkrx
Level
f
Shifters
Level
Shifters
PLL
Phase
Rotator
RefClk
Same Clk jitter
at Receiver side
• Ref Clk Jitter
• PSIJ
• PLL Osc Jitter
• DCD
PLL
RefClk
©2011, Dr. Lei Luo
16
ECE 733 Class Notes
Jitter Sources - CDR
Sampler
Driver
Even Data
Odd Data
CDR:
Sampler
Driver
Clktx
CDR Jitter
Jitt due
du to
t
Phase/Freq lag and
overshoot
Clk b uff
ffe rs
Clk b ufffe rs
Clkrx
Level
f
Shifters
Level
Shifters
PLL
Phase
Rotator
CDR
PLL
RefClk
RefClk
©2011, Dr. Lei Luo
17
ECE 733 Class Notes
PLL jitter
Noise transfer functions
• Low bandwidth desired for noisy input
• Jittery
Jitt
reference
f
clock
l k
• High bandwidth required for supply noise
rejection
Noise trans
sfer functions [dB
B]
PLL noise Transfer Functions
10
from
input clock
from
clock buffer supply
0
-10
-20
from
VCO supply
-30
5
©2011, Dr. Lei Luo
10
6
10
7
10
8
10
9
10
10
10
frequency [Hz]
18
ECE 733 Class Notes
PSIJ - Clock Buffer Jitter
Power supply induced clock jitter falls into DJ

Uncorrelated bounded jitter
Typical Clock Buffers and tradeoffs


Inverters: dynamic power
power, full swing
swing, smaller area
CML: low jitter sensitivity to power supply noise
Reduce PSIJJ

Circuitry improvements - reduce jitter sensitivity to power supply noise
©2011, Dr. Lei Luo
19
ECE 733 Class Notes
What Determines On-chip
p Supply
pp y Noise?
I
d
Impedance
Z(f)
Current Profile I(f)
V ( f )  I( f )Z( f )
It is the interaction between I(f) and Z(f)
that determines the supply noise
©2011, Dr. Lei Luo
20
ECE 733 Class Notes
Power Supply Network (PDN) Modeling
((log)
Vnoise(f) = ZPDN(f) · Inoise(f)
ZPD
DN
Typical Profile of
Supply Impedance
Frequency
x0 … x00 MHz
Vnoise
VRM
PCB
Package
(log)
ZPDN
Inoise(f)
On-chip
Package inductance separates spectrum into two regions:
Low-frequency,
o
eque cy, add
addressed
essed by PCB
C capac
capacitors
to s
High-frequency, addressed by on-chip capacitors
©2011, Dr. Lei Luo
21
ECE 733 Class Notes
Jitter Sensitivity Concept
• Linearity
Li
it assumption:
ti
• Supply noise induced jitter component at one frequency is a
linear function of supply
pp y noise component
p
at the same
frequency
J( f )
S( f ) 
[ ps / mV
Vpp ]
V( f )
• Jitter sensitivity:
• Determined only by circuit implementation
• Independent of noise or data pattern
• A link parameter between data activity (noise) and total
jitter impact
©2011, Dr. Lei Luo
22
ECE 733 Class Notes
Supply Noiise Amplitu
ude (log)
~100MHz
z
PDN Reso
onance
~30MHz
PLL Band
dwidth
Jitter Sensitivity [ps//mV] (log)
Noise Spectrum and Jitter Sensitivity
Frequency (log)
Overlap between both parameters (jitter sensitivity and
noise
i spectrum)
t
)d
determines
t
i


Amount of supply noise induced jitter
Spectrum of jitter (needed for jitter tracking)
©2011, Dr. Lei Luo
23
ECE 733 Class Notes
Reduce power supply noise



Improve PDN impedance, at critical frequency range;
Reduce self-introduced power supply noise, reduce di/dt
Use power supply regulator;
B d t th
Budget
the regulator
l t noise
i source:
DC offset due to
Limited amp gain
Device mismatch
AC noise comes from
Vref noise
Loading impedance change
Vdd noise
©2011, Dr. Lei Luo
24
ECE 733 Class Notes
Duty Cycle Distortion (DCD)
Duty-Cycle-Distortion
DCD is the highest
g
frequency
q
y jjitter component
p

Directly affects the timing margin
for DDR signaling
OUTn
DCD mainly comes from



Device mismatch
V l
Voltage
l
level
l shifters
hif
DCD amplification due to limited BW
INp
OUTp
INn
Vbias
DCD reduction circuitry example 1:


DCD also appears as single-ended common-mode DC offset.
A CML high-pass filter reduces the incoming single-ended commonmode
d DC offset
ff and
d thus
h reduces
d
the
h DCD at its
i output.
©2011, Dr. Lei Luo
25
ECE 733 Class Notes
DCD reduction circuitry example 2
LPF
LPF
FSM
The single-ended common-mode is Low-Pass-Filtered and compared. The
result is used to correct the DCD
The DCD from the CLKin is corrected
The DCD introduced in clock buffers is corrected
With help from FSM, this detection-correction loop can run Continuously
on or P
Periodically
i di ll or calibrate
lib t once att power on.
©2011, Dr. Lei Luo
26
ECE 733 Class Notes
ISI - Frequency-dependent
Frequency dependent Loss
Conductors and dielectrics both lossy
Loss in 00.5m
5m of typical FR-4
FR 4
Skin Effect ~ F 1/2
0
-5
Dielectric Loss ~ F
-10
Attenuation (dB)
A
-15
-20
Overall Loss
-25
30
-30
-35
0.001
0.01
-40
40
0.1
1
10
Freq enc (GH
Frequency
(GHz))
•For low-cost PC boards, dielectric loss dominates
©2011, Dr. Lei Luo
27
ECE 733 Class Notes
ISI - Why is attenuation bad?
Frequency-dependent attenuation in channel produces
inter-symbol interference (ISI):
Vin
Lone 1 in
stream of 0
0’s
s
undetectable
Vout
• Hurt both voltage margin and timing margin
• Equalization required
©2011, Dr. Lei Luo
28
ECE 733 Class Notes
ISI - Equalization
Channel
Equalizing Filter
Equalized Channel
Frequency
Frequency
Bit-rate/2
Frequency
Bit-rate/2
Bit-rate/2
Filter can go at either end of the channel..
T
Tx
Tx
©2011, Dr. Lei Luo
Filt
Filter
R
Rx
Filter
Rx
29
ECE 733 Class Notes
ISI - TX FIR
• De
De-emphasis
emphasis
• Reduce ISI
• Increase cross talk
noise
©2011, Dr. Lei Luo
30
ECE 733 Class Notes
ISI - Tx EQ is effective..
effective
Equalization OFF
Equalization ON
0.18µ I/O, 1999
3.125Gb/s
50cm FR-4 + conn’s
pp
-8 dB atten
Approx
2-Tap equalizer
210 PRBS pattern
160 mV eye height
150 ps P-P jitter
0.13µ I/O, 2003
6.25Gb/s
73cm FR-4 + conn’s
Approx
pp
-15 dB atten
4-Tap equalizer
210 PRBS pattern
100 mV eye
y height
g
63 ps P-P jitter
©2011, Dr. Lei Luo
31
ECE 733 Class Notes
ISI - Receiver Equalizers
• Source degenerated amp popular implementation
• About as effective as a 2-tap Tx EQ
• Problems:
° Amplifies
A lifi hi
high-freq
hf
noise
i as well
ll as signal
i
l
° Power-hungry
Gain
OutOut
Out+
In+
3 dB/octave
Min. R
8-10 dB of
‘boost’
In-
Max. R
R
©2011, Dr. Lei Luo
Freq
32
ECE 733 Class Notes
ISI - RX DFE (Decision-Feedback-Equalization)
(Decision Feedback Equalization)
• Non-linear, doesn’t amplify crosstalk!
• Power hungry
• Tight timing for the 1st loop back
• Error
E
propagation
ti
©2011, Dr. Lei Luo
33
ECE 733 Class Notes
ISI - 3-Tap
pp
prDFE (p
(partial-response
p
DFE))
Sampler removed from critical path
Solves path through parallelism
q
offset
Eliminates summer, requires
Consumes more power
©2011, Dr. Lei Luo
34
ECE 733 Class Notes
ISI - ADC & FFE (Andy,
(Andy ISSCC08)
shift
ADC
FFE &
Crosstalk
Cancel
“TAP”
ADAPTION
16
adc samples
MU
X
>
Output
O
data
For very complex BP channel at reasonable data rate
©2011, Dr. Lei Luo
35
ECE 733 Class Notes
Cross Talk and Reflections
Cross-Talk
Connectors and vias introduce reflections, ‘notches’, crosstalk
0
Bare 20” FR4 Trace
-5
-10
10
-15
S21 (dB)
S
-20
-25
20” Backplane Channel
-30
-35
-40
-45
0.01
0.1
50
> 100 MHz
Flat Channel
no attn
1
10
Frequency (GHz)
0.1 - 1 GHz
Moderate Attn
few dB/octave
> 1 GHz
Strong Attn
> 5 dB/octave
Chip-to-chip channels more like bare boards
boards, and generally less than 20”
20
©2011, Dr. Lei Luo
36
ECE 733 Class Notes
Outlines
Jitter and BER



Voltage
V
lt
margin
i and
d Time
Ti
margin
i
Jitter components
BER(Jitter)
Jitter sources in High Speed Links





Phase-Locked Loop (PLL)
Duty
y Cycle
y Distortion ((DCD))
Power Supply Induced Jitter (PSIJ)
Inter-Symbol-Interference (ISI)
Cross-Talk
Cross
Talk and reflections
Clocking architectures




Clock-Data-Recovery (CDR)
Source Synchronous Link
Clock embedded with Data
Asynchronous link
A case study
d for
f jitter
ji
tracking
ki
©2011, Dr. Lei Luo
37
ECE 733 Class Notes
Revisit - Jitter is Phase Wander
The main goal of study Jitter is for BER improvement.
Jitter is undesired phase wander of a clock signal or data signal, comparing
with an ideal periodic time reference.
j
affects the BER. Actually,
y, it is the jitter
j
of TD-to-Clk
Both data jjitter and clock jitter
D to Clk
really matters.
©2011, Dr. Lei Luo
38
ECE 733 Class Notes
Clock Recoveryy
1.5GbpsX4
NRZ
ERROR
Report
1.5GHz
CLK IN
LFSR PRBS
BERT
Sampler
1.5GHzX4
CLK
Multi-Phase
DLL
1.5GHzX8
CLK
6Gbps
NRZ
RX
1.5GbpsX4
NRZ
Clock
Recovery
 Have
 6Gbps NRZ data, 1.5GHz 8-phase clks, but clk&NRZ are not aligned
 Need
 Align clk at center of NRZ, use this clk to sample NRZ
 Demux 6Gbps NRZ to 1.5Gbps NRZ
©2011, Dr. Lei Luo
39
ECE 733 Class Notes
How to align clk to data?
 First, need to judge if current clk is early or late
If Edge(I)
Edge(I)=Edge(I+1)
Edge(I+1) then No information
If Data(I)
Data(I)=Data(I+1)
Data(I 1) then No information
else if Edge(I)=Data(I-1) then Early
else if Data(I)=Edge(I) then Late
else if Edge(I)=Data(I) then Late
else if Data(I)=Edge(I+1) then Early
6Gbps NRZ
D0
D1
D2
D3
1.5GHz CLK
Rising edge
CLK is aligned
Edge0
g Data0 Edge1
g Data1 Edge2
g Data2 Edge3
g
Data3
CLK comes early
Edge0 Data0 Edge1 Data1 Edge2 Data2 Edge3 Data3
CLK comes late
Edge0 Data0 Edge1 Data1 Edge2 Data2 Edge3 Data3
©2011, Dr. Lei Luo
40
ECE 733 Class Notes
How to align clk to data?





Second, do majority vote on the 8 Early/Late/No_info judgments
Third, do LPF to prevent continuous Early or Late
Forth, generate control signal for Selector and Interpolator
Fifth, selector clk phases from 8-phase DLL
Si th iinterpolate
Sixth,
t
l t and
d regenerate
t RX side
id clock
l k which
hi h iis aligned
li
d tto NRZ
Clk From 8-phase DLL 0 2 4 6
Sel X8
1 3 5 7
SelectorX4
SelectorX4
ClkX4
LPF
InterpolatorX8
FSM
E/L/N
W X8
E/L/N
Vote
Judge
E/L/N X8
©2011, Dr. Lei Luo
ClkX4
ClkX8
SamplerX8
Data X8
From RX
Demuxed
out X4
41
ECE 733 Class Notes
Selector Design
g
18
18
8
8
8
8
A simple 2 to 1 differential selector structure
Actually circuit is a 4 to1 selector
©2011, Dr. Lei Luo
42
ECE 733 Class Notes
Interpolator
p
Design
g
20
1
1
1
1
A simple 2-bit weight differential interpolator structure
IIn actually
t ll d
design,
i
8
8-bit
bit weight
i ht is
i needed
d d tto keep
k
phase
h
controlling step within 10ps
©2011, Dr. Lei Luo
43
ECE 733 Class Notes
Link with CDR (Clock
(Clock-Data-Recovery)
Data Recovery)
The one matters BER is JitterD-to-Clk
Jitt sources are from:
Jitter
f
TX side: RefClktx noise; TXPLL; Transmitter; Channel
RX side: RefClkrx noise; RXPLL; phase rotator;
Jitter reduced:
PLL filters out high frequency noise of RefClktx/RefClkrx
CDR loop tracks low frequency JitterD-to-Clk noise within its BW
Key jitter still matters:
JitterD-to-Clk beyond of CDR BW (usually <10MHz)
©2011, Dr. Lei Luo
44
ECE 733 Class Notes
Source Synchronous Link
Sampler
Transmitter
Dtx
Drx
Channel
Clktx
JitterD-to-Clk
PLL
Crx
RefClktx
Channel
Clkrx
Delay
Low and mid frequency jitter sources go away because Drx and Crx jitter
moves together,
t th assuming
i Dtx
Dt is
i triggered
ti
d precisely
i l b
by Clktx.
Clkt
Remained jitter sources:
Any high-frequency
high frequency jitter (next slide)
Others: Transmitter jitter; Channel difference; PSIJ difference between
two chips
Constraints: dedicated pin/channel for clock, need to match channels
©2011, Dr. Lei Luo
45
ECE 733 Class Notes
Source Synchronous Link
• Dtx edge trigged directly from Clktx edge, their jitter dance together.
• If “Dtx-Drx delay cycles” matches with “Clktx-Clkrx delay cycles”, JitterD-to-Clk
will be perfect.
perfect (note,
(note clk at center of data is ignored here for simple explain)
• The more clk-to-data delay, the more JitterD-to-Clk.
©2011, Dr. Lei Luo
46
ECE 733 Class Notes
Source Synchronous Link (N:1)
Sampler
D3
Transmitter
Data Channel 3
Clktx3
JitterD3-to-Clk3
Clkrx3
N data channel share one
clock
l k channel
h
l (N=3)
(N 3)
Challenges

Sampler
D2
Transmitter
Data Channel 2


Clkt 2
Clktx2
JitterD2-to-Clk2
Channel matching
Phase adjustment
PSIJ difference between
TX and RX side
Clkrx2
Sampler
D1
Transmitter
Data Channel 1
Clktx1
JitterD1-to-Clk1
Clk 1
Clkrx1
CLK
CLK Channel
PLL
RefClktx
©2011, Dr. Lei Luo
47
ECE 733 Class Notes
Embed the clock with the data
Self-Clocked Two-Level Differential signaling methods and apparatus :
US patent 7535964
• Source synchronous Link (1:1)
• Clock edges embedded with each data channel, no
channel matching needed
• Compromise voltage margin to improve timing
margin
©2011, Dr. Lei Luo
48
ECE 733 Class Notes
Outlines
Jitter and BER



Voltage
V
lt
margin
i and
d Time
Ti
margin
i
Jitter components
BER(Jitter)
Jitter sources in High Speed Links





Phase-Locked Loop (PLL)
Duty
y Cycle
y Distortion ((DCD))
Power Supply Induced Jitter (PSIJ)
Inter-Symbol-Interference (ISI)
Cross-Talk
Cross
Talk and reflections
Clocking architectures




Clock-Data-Recovery (CDR)
Source Synchronous Link
Clock embedded with Data
Asynchronous link
A case study
d for
f jitter
ji
tracking
ki
©2011, Dr. Lei Luo
49
A 5Gb/s Link with Clock Edge Matching
and Embedded Common Mode Clock for Low Power Interfaces
A 5Gb/s Link with Clock Edge Matching
and Embedded Common Mode Clock for
Low Power Interfaces
Jared Zerbe
Zerbe, Barry Daly
Daly, Lei Luo,
Luo Bill
Stonecypher, Wayne Dettloff, John Eble,
Teva Stone, Jihong Ren, Brian Leibowitz,
Michael Bucher, Patrick Satarzadeh, Qi Lin1
Rambus Inc., Los Altos, CA
1Presently
with NVIDIA Corporation
2010 VLSI Circuit Symposium
June 16, 2010
Outline
•
•
•
•
Introduction & motivation
Matched source
source-synchronous
synchronous clocking
Common-mode clocking
D i
Design
•
•
•
•
Clock distribution & DCDL
T
Transmitter
itt
Integrating receiver
Noise generation
• Measured results
• Conclusions
C
l i
2010 VLSI Circuits Symposium
Slide 51
Zerbe : C7P5
Power Beyond
y
Dynamic:
y
Turn-on/off
Server
CPU
Utilization
• Rarely
y at 100%
•
• Non-zero y
y-intercept
p power
p
Building a low-power link system goes
beyond low active power link
Servers and mobile devices spend a lot of
time idle
• Proportional power not easy to make 0,0
 Fast duty-cycling of burst transactions a good
approach
pp
Speed
•
2010 VLSI Circuits Symposium
Slide 52
52
Slide
time
Source: Luiz Barroso & Urs Hölzle, IEEE Computer, December 2007
Zerbe : C7P5
Source-Synchronous
y
Clocking
g
D t
Data
D
OMUX
CK
“1”/”0"
CK
OMUX
D
TX
Sampler
AMP
Clock
TX
AMP
• Large Data : Clock ratio for busses
• Receive clock buffer gets longer
• Often multiple UI of path mismatch
• Can lead to anti-tracking in source-jitter & PSIJ
• Adds latency to power on/off
• Requires
R
i
additional
dditi
l edges
d
on state
t t transitions
t
iti
2010 VLSI Circuits Symposium
Slide 53
Zerbe : C7P5
Matched Source Synchronous
y
Clocking-MSSC
g
• Transmit data DCDL to match Rx clock buffer delay
• Switched receiver to windowed integrating sampler
2010 VLSI Circuits Symposium
Slide 54
Zerbe : C7P5
MSSC Edge-Timing
2010 VLSI Circuits Symposium
Slide 55
Zerbe : C7P5
MSSC Edge-Timing
2010 VLSI Circuits Symposium
Slide 56
Zerbe : C7P5
MSSC & PSIJ
Source
Destination
DATA
3
5
DCDL
PLL/CLK
BUFFER
1
CLK
BUFFER
4
FWD CLK
2
• MSSC helps cancel source jitter at (2) but
potentially adds PSIJ in DCDL (23)
• Low-frequency
L
f
PSIJ will
ill track
t
k with
ith destination
d ti ti
clock buffer (45)
• Relative impacts must be studied
2010 VLSI Circuits Symposium
Slide 57
Zerbe : C7P5
Common Mode Clocking
g (CMC)
(
)
• C
Can b
break
k1
1:N
N problem
bl
if we send
d clock
l k
along with data in different mode
• Potentially minimizes/eliminates receive
distribution clock buffer
• Results in multi
multi-level
level waveform
2010 VLSI Circuits Symposium
Slide 58
Zerbe : C7P5
Common-Mode Clocking
g Doublets
D0 & Clk
Dp
Dn
Dp - Dn
= Data
D1 & ClkB
Dp + Dn
= Clock
• Complementary clocks over adjacent pairs
• Makes CM clock extraction simple
• Makes CM noise – in-common
• Any CM termination mismatch
 static phase error (removed during calibration)
• Design includes both CM clock and forwarded
clock to allow for performance comparisons
2010 VLSI Circuits Symposium
Slide 59
Zerbe : C7P5
Outline
•
•
•
•
Introduction & motivation
Matched source
source-synchronous
synchronous clocking
Common-mode clocking
D i
Design
•
•
•
•
Clock distribution & DCDL
T
Transmitter
itt
Integrating receiver
Noise generation
• Measured results
• Conclusions
C
l i
2010 VLSI Circuits Symposium
Slide 60
Zerbe : C7P5
DCDL : Trellis LSB < 1 Gate Delay
y
Input
d1 = 4 LSB, d2 = 5 LSB
0
4, 5
d1, d2
d1, d2
8, 9, 10
d1 d2
d1,
12 13
12,
13, 14
14,15
15
20-24
d1
24-28
2010 VLSI Circuits Symposium
Slide 61
6,7, and 11
 Start at 12
16, 17, 18, 19, 20
d1,, d2
d1
Missing 1-3,
• Use difference in delay to
get resolution < 1 gate dly
• CMOS for short / currentstarved inverters for long
d l
delays
Zerbe : C7P5
Transmitter Incorporating
g CM Clock
• Segmented design
• Adjustable between
clock and data
2010 VLSI Circuits Symposium
Slide 62
Zerbe : C7P5
CMC Transmitter Example
DATA
CLOCK
DATA
CLOCK
P
N
P
N
2010 VLSI Circuits Symposium
Slide 63
Zerbe : C7P5
Integrating/Sampling
g
g
g Receiver Design
g
Amp/Int
switch
A
Common-gate
amplifier
dataout
Linear
Equalizer
Integrate
or
Amplify
p y
Sampler
• Integrator load switchable
between cap/resistor
• All
Allows comparison
i
off
integration/sampling
modes
• CTLE power efficient way
of equalizing for ISI at
high-speeds
g p
2010 VLSI Circuits Symposium
Slide 64
Zerbe : C7P5
PS Noise Generation
Effect on output eye
Noise measured at sense
off
on
•
Adjustable Vdd-Vss Clamp
•
•
‘noiseclk’ driving shorting clamp – observable at sense lines
Set frequency to insight highest PSN from package/pwr
package/pwr-gnd
gnd network
300/305MHz @ 30mV used for testing
2010 VLSI Circuits Symposium
Slide 65
Zerbe : C7P5
Outline
•
•
•
•
Introduction & motivation
Matched source
source-synchronous
synchronous clocking
Common-mode clocking
D i
Design
•
•
•
•
Clock distribution & DCDL
T
Transmitter
itt
Integrating receiver
Noise generation
• Measured results
• Conclusions
C
l i
2010 VLSI Circuits Symposium
Slide 66
Zerbe : C7P5
Two Device MSSC SJ & PSIJ Testing
g
• Configurations
SER
• 6 Bi-directional
links/device
Tx
Freq gen
for PSIJ
Tx VER
16
M
SAMP
Rx
DES
• 3 Bi-directional
clocks/device
BERT w/SJ*
• Devices
fi
d in
i
configured
parallel loopback
for data
generation
2010 VLSI Circuits Symposium
Slide 67
Tx
Clk VER
T
Tx
Tx VER
SERVC
16
Clock With Jitter
PRBS
M
PRBS
*SJ 300MHz Equipment limit
Rx
SAMP
DES
Zerbe : C7P5
DCDL Delay
y vs. Code & Filtering
g
Raw results vs. simulation
Results after code filtering
• Raw DNL/INL not as critical as code filtered INL
• DCDL code only updated during re-calibration
2010 VLSI Circuits Symposium
Slide 68
Zerbe : C7P5
Integrating
g
g / Sampling
g Rx Sensitivity
y
Sampling
Integrating
(variable window)
0.87 UI
0.90 UI
0 93 UI
0.93
BER
PRBS7 Input Eye Opening (mVpp)
• Integrating
I t
ti receiver
i
is
i superior,
i
with
ith proper windowing
i d i
2010 VLSI Circuits Symposium
Slide 69
Zerbe : C7P5
MSSC & SJ Measured Results @ 5Gbps
Integrating
Sampling
1E+00
BER
R
1E-03
1E-06
1E-09
50
100
150
200
250 50
Time (ps)
Sample BER
Integrate BER
No SJ,
no MSSC
(91/110ps)
100
150
200
250 650
Time (ps)
Sample BER
Integrate BER
300MHz 0.3UI SJ,
no MSSC
700
750
800
850
Time (ps)
Sample BER
Integrate BER
300MHz 0.3UI SJ,
with MSSC
(47/45ps)
MSSC shows
~2× improvement
2010 VLSI Circuits Symposium
Slide 70
(94/113ps)
Integrating
g
g shows
20% improvement
Zerbe : C7P5
Sampled Rx : SJ & PSIJ @ 5Gbps
No SJ or PSIJ
1E+0
With SJ
With
SJ
 Big MSSC benefit
(SJ & no SJ ~identical)
(SJ & no SJ identical)
1E‐3
1E‐6
1E‐9
1E+0 0
200
400
600
800
200
400
600
800
200
400
600
800
1E 3
1E‐3
1E‐6
1E‐9
1E+0 0
1E‐3
1E‐6
1E‐9
1E
9
0
2010 VLSI Circuits Symposium
Slide 71
With PSIJ
 DCDL PSIJ sensitivity
is not huge
With both SJ & PSIJ
 With SJ & PSIJ,
With SJ & PSIJ
MSSC is required
to open the eye
to open the eye
Delay (ps)
300mUI SJ, 30mV PSIJ
Zerbe : C7P5
Integrating
g
g Rx : SJ & PSIJ @ 5Gbps
No SJ or PSIJ
1E+0
With SJ
 Strong MSSC
jjitter benefit
1E 3
1E‐3
1E‐6
1E‐9
BER
1E+0 0
200
400
600
800
With PSIJ
 Moderate PSIJ
margin impact
1E 3
1E‐3
1E‐6
1E‐9
1E+0 0
200
400
600
800
200
400
600
800
1E‐3
1E‐6
1E‐9
1E
9
0
2010 VLSI Circuits Symposium
Slide 72
Delay (ps)
With both SJ & PSIJ
MSSC net win w/both
MSSC net win
80ps margin vs. 10ps
IntRx 30% more 30% more
margin than sampled
Zerbe : C7P5
300mUI SJ, 30mV PSIJ
CM Clocking
g : Modal Conversion (3Gbps)
(
)
No CMCLK
With CMCLK
Data Jitter:
only 38mUI
increase
for CM
clocking
No PRBS7
With PRBS7
Clock Jitter*:
only 15.4mUI
15 4mUI
increase in
jitter when
running
PRBS7
2010 VLSI Circuits Symposium
Slide 73
*As measured on looped back clock output
Zerbe : C7P5
CMC Margins @ 4Gbps
1E+00
BER
1E-03
1E-06
1E-09
10
60
110
160
Time (ps)
210
No MSSC,
MSSC
with SJ + PSIJ
54ps
2010 VLSI Circuits Symposium
Slide 74
260
490
540
590
640
Time (ps)
690
740
With MSSC,
MSSC
with SJ + PSIJ
106ps
Zerbe : C7P5
Low Power : Enabling
g Duty-Cycling
y y
g
Improved DCDL vs. PhaseMixer @6.4Gbps
PhMix
•
Neither MSSC or CMC save power themselves… and are not too expensive
•
•
•
•
DCDL vs. Phase-Mixer : similar in many
y ways,
y , can be more efficient at lower
codes
•
•
•
They enable
Th
bl more effective
ff ti architectures
hit t
with
ith power duty-cycling
d t
li in
i low-power
l
li k
links
CM Clocking enables fine-grain (2 bit) enabling / disabling
Current test silicon @ 5mW/Gbps due primarily to many un-optimized test options
With MSSC no longer need phase mixer for pin-pin variation
Opportunities for sharing DCDL delay between pins
CMClk
•
CM-clocking of 2:1 (1 CM doublet vs. forwarded clock) : ~ equal power at 2:1 ratio
2010 VLSI Circuits Symposium
Slide 75
Zerbe : C7P5
Conclusions
• Matched Source-Synchronous Clocking
(MSSC) has promise for high-source-jitter
high source jitter
environments
• Especially with Integrating Receiver
• With PSIJ & SJ only MSSC passes @ 5Gbps
• Common-Mode Clocking shows promise
 Systems
S t
where
h
additional
dditi
l pin
i costt is
i prohibitive
hibiti
• F
Future low-power
l
li k will
links
ill need
d to
consider multiple clocking approaches to
achieve
hi
low
l
power and
d fast
f t duty-cycling
d t
li
2010 VLSI Circuits Symposium
Slide 76
Zerbe : C7P5
ECE 733 Class Notes
References
1.
2
2.
3.
4.
5.
6.
7
7.
8.
9.
10
10.
11.
12.
13.
14.
15.
J. Ren, H. Lee, B. Leibowitz, Q. Lin, R. Ratnayake, K. Kelly, V. Stojanovic, D. Oh, J. Zerbe,
N. Nguyen, “Performance Comparison of Data-based Equalization and Edge-based
Equalization for Transmitter and Receiver”, DesignCon 2007.
Dan Oh,
Oh Sam Chang
Chang, Chris Madden
Madden, Joong-Ho
Joong Ho Kim,
Kim Ralf Schmitt,
Schmitt Ming Li,
Li Chuck Yuan
Yuan,
Fred Ware, Brian Leibowitz, Yohan Frans and Nhat Nguyen, “Design and Characterization
of a 12.8GB/s Low Power Differential Memory System for Mobile Applications”, EPEP
2009.
Dan Oh and Sam Chang, “Clock Jitter Modeling in Statistical Link Simulation”, EPEP 2009
Andy Joy, “Electrical Channel Equalization using Analog to Digital Converters and Digital
Signal Processing”, ISSCC 2008.
Jared Zerbe, “ADC/Binary - Are they really that different?”, ISSCC 2009
John Poulton, “Low power chip-to-chip I/O”, NCSU ECE733 Lecture, 1997.
G
Gary
S
Steven M
Murdock,
d k “Self-Clocked
“S lf Cl k d T
Two-Level
L
l Differential
Diff
i l signaling
i
li methods
h d and
d
apparatus”, US patent 7535964 B2.
Ali Hajimiri and Thomas H. Lee, The Design of Low Noise Oscillators, Springer, 1999.
Behzad Razavi, Design of Analog CMOS Integrated Circuits, MCGrawHill, 2001.
Behzad Razavi,
Razavi Monolithic Phase-Locked
Phase Locked Loops and Clock Recovery Circuits: Theory and
Design, Wiley-IEEE Press, 1996.
Behzad Razavi, Phase-Locking in High-Performance Systems: From Devices to
Architectures, Wiley-IEEE Press, 2003.
Floyd
y M. Gardner,, Phaselock Techniques,
q
, 3rd Edition,, Wiley,
y, 2005.
Introduction to Jitter: http://www.analogzone.com/nett0927.pdf
Jitter center on Agilent: http://www.home.agilent.com/agilent/industry.jspx?nid=536900074.0.00&cc=US&lc=eng
Dual-Dirac Model
htt //
http://www2.tek.com/cmswpt/tidetails.lotr?ct=TI&cs=wpp&ci=12838&lc=EN
2t k
/
t/tid t il l t ? t TI&
& i 12838&l EN
©2011, Dr. Lei Luo
77
Download