PRISM: Enabling Next Generation Wireless Multimedia Architectures

advertisement
Changing Landscape of Multimedia
Applications
Today: Downlink Video
Broadcast
Tomorrow: Uplink Video
Transmission
Contemporary Video Coding Standards
• Motion-Compensated Predictive Coding (MPEG/H.26)
– High compression efficiency
– Rigid complexity partition between encoder (heavy) & decoder (light)
– High fragility to transmission losses
DFD
(Displaced Frame Difference)
+
• Image Coding (Motion JPEG)
– Low complexity
– High robustness to transmission losses
– Low compression efficiency
Rethinking Video Over Wireless
Challenges:
•
•
•
Low bandwidths  high compression efficiency
Limited handheld battery power  low end-device complexity
Lossy wireless medium  robustness to transmission losses
New Architecture: PRISM
(Power-efficient, Robust, hIgh-compression Syndrome-based Multimedia coding)
•
High compression efficiency
•
Flexible partition of complexity
between encoder & decoder
•
Inbuilt robustness to channel loss
•
Backward compatibility with existing
video standards
Puri & Ramchandran,
Allerton ’02
Heavy PRISM
Uplink Decoder
Light PRISM
Uplink Encoder
Heavy PRISM
Downlink Encoder
Trans-coding Proxy
Light PRISM
Downlink Decoder
Background: Distributed Source Coding
Source Coding with side-information (Slepian–Wolf, Wyner-Ziv)
X
Encoder
Decoder
^
X
•
X and Y are correlated sources
•
Y is available only at decoder
Y
• Exploit side-information Y at the decoder while encoding X
• No MSE performance loss over case when Y is available at
both encoder and decoder when innovations is Gaussian
• For the video coding case, X is the block to be coded and
the side-information Y consists of the previously decoded
blocks in the frame memory
Motion-Free Encoding?
Y1
X
...
Predictive
Encoder
Y1
X
...
Y1’
YM
… Motion Vector …
… Quantized …
DFD
PRISM
Encoder
Predictive
Decoder
Y1’
YM
?
...
YM’
...
X
YM’
PRISM
Decoder
X
• The encoder does not have access to Y1’, Y2’, etc
• Neither the encoder nor the decoder knows the correct sideinformation
• Can decoding work?
– Yes!
– A “modified” Wyner-Ziv paradigm is needed
(Ishwar, Prabhakaran, & Ramchandran ICIP ’03.)
PRISM
Y1’
Wyner-Ziv
Decoder
X
Wyner-Ziv
Encoder
bin index
Decoding failure
YT’
Wyner-Ziv
Decoder
YM ’
Wyner-Ziv
Decoder
Robustness Comparisons:
• Predictive Coding: channel errors
lead to prediction mismatch and drift
• PRISM: drift stopped if syndrome
code is “strong enough”:
Targeted noise ≥ Correlation Noise
+ Induced Channel Noise + Quant. Noise
X
Decoding
failure
• Need concept of “motion
compensation at decoder”!
• Need mechanism to detect
decoding failure
• In theory: joint typicality
(statistical consistency)
• In practice: use CRC
Standards-Compliant Auxiliary-Channel
Auxiliary-Channel
Auxiliary-Channel
Encoder
Coset
Index Wireless
Channel
Auxiliary-Channel
Decoder
Final
reconstruction
Xmain
X
^
Wireless
MPEG/H.26x X
Encoder
MPEG/H.26x Channel
bit-stream
MPEG/H.26X
Decoder
• Secondary description of video sent over auxiliary-channel.
• Need to find statistics of correlation noise Z = X – Xmain.
– Can leverage algorithm of Zhang, Regunathan and Rose (Asilomar ’99) to
develop recursive correlation estimation algorithm.
(Wang, Majumdar, Ramchandran, and Garudadri: PCS ’04.)
• Auxiliary channel allows drift correction without intra-refresh.
Results
•
Channel simulator provided by Qualcomm Inc. conforming to a CDMA 2000 1x
standard.
•
Performance comparison among 3 systems:
– H.263+ bitstream with 20% extra rate for FEC (RS codes)
– H.263+ bitstream with 20% extra rate for standard-compliant auxiliary channel
– PRISM
•
Standard-compliant auxiliary channel version outperforms H.263+FEC by 2.5-4
dB between error rates of 2-10%.
•
PRISM outperforms H.263+FEC by 6-8 dB between error rates of 2-10%.
H.263+ with FEC
H.263+ with Auxiliary Channel
Stefan, 352x240, 15fps, 2200 kbps, 8% error rate
PRISM
PRISM for Wireless Video Broadcast
Yb (“bad” side-information)
Rate = R
X
Decoder Bad
Xb
Decoder Good
Xg
Encoder
Rate = ∆R
Yg (“good” side-information)
• Broadcast source coding studied in information theory literature.
(Heegard & Berger, IT’85, Steinberg & Merhav IT’04)
• Lossy channel: need broadcast source-channel coding view.
– Can use PRISM constructions.
(Majumdar & Ramchandran, ICIP ’04)
• No need to deterministically track Yb and Yg at encoder.
• No need for multiple prediction loops  complexity savings.
• Multiple side-informations at each decoder  motion search at each decoder.
– Standards-compliant implementations possibly using the auxiliary channel
setup.
(Wang, Majumdar, & Ramchandran, ICASSP ’05)
Download