INF5300 JPEG2000 Components in an image encoder

advertisement
Lecture, 7.april 2005
Lecture, 7.april 2005
JPEG2000
Components in an image encoder
INF5300
Splitting into tiles and components
Color transform (exploits redundancy between color
components)
Image transform
Quantization
Coding
JPEG2000
Øyvind Ryan
April 2005
Øyvind Ryan
These components dier highly between dierent image standards.
INF5300 JPEG2000
Lecture, 7.april 2005
Øyvind Ryan
JPEG2000
Lecture, 7.april 2005
Signal x = {x [n]}n∈Z ,
x [n] vector of dimension m, the number of channels).
JPEG has 8 channels, JPEG2000 has 2.
JPEG image transform is block transform of type y [n] = A∗ x [n].
A is given by the Discrete Cosine Transform (DCT):
A=
cq cos
JPEG2000
Denition (Analysis subband transform)
JPEG Image transform
INF5300 JPEG2000
2π fq
1
p+
2
q ,p
,
q
fq =
.
2m
Closely related to the Discrete Fourier Transform (DFT):
2π
A = e j m pq
q ,p
.
Blocks treated as independent units. Results in blocking artifacts.
Can perhaps remove blocking eects by generalizing transform?
Øyvind Ryan
INF5300 JPEG2000
A : 2 (Z ) → 2 (Z )
is given by
y [n] =
m×m
i
matrices
Z
:
Z
:
∈Z
is given by
x [n] =
i
A∗ [i ]x [n − i ].
Denition (Synthesis subband transform)
S : 2 (Z ) → 2 (Z )
{A[i ]} ∈
m×m
i
matrices
{S [i ]} ∈
i
S [i ]y [n − i ].
∈Z
Denition (Perfect reconstruction)
Synthesis transform is inverse of analysis transform. Also called
biorthogonal subband transforms.
Biorthogonal subband transforms are applied in JPEG2000.
Øyvind Ryan
INF5300 JPEG2000
Lecture, 7.april 2005
JPEG2000
Lecture, 7.april 2005
Generalized transforms
JPEG2000
Subband transforms and lter banks
Denition (Orthonormal block transform)
One can write
S [0] = A[0]. All other S [i ], A[i ] are 0.
Examples: DCT , DWT .
Denition (Orthonormal subband transform)
S [i ] = A[−i ]∀i .
yq [n] = (x hq )[mn],
Expresses analysis through lter banks. Also,
Example: Haar wavelet.
Denition (Polyphase components of y ∈ (Z ))
x =
2
{yq }0≤q<m dened by yq [n] = y [nm + q ].
Denition (Analysis lters of a subband transform A)
{hq }0≤q<m dened by hq [mi − j ] = (A∗ [i ])q,j .
Denition (Synthesis lters of a subband transform S )
0 ≤ q < m,
m
−1
(y˜q gq ),
q =0
if m divides i
0
otherwise
Expresses synthesis through lter banks.
y˜q [i ] =
yq [ mi ]
{gq }0≤q<m dened by gq [mi + j ] = (S [i ])j ,q .
Øyvind Ryan
Lecture, 7.april 2005
Multi Resolution Analysis
INF5300 JPEG2000
JPEG2000
JPEG2000
Associate scaling function with lter g0 through two-scale equation
(from (MR-3)):
A set of sub-spaces
· · · ⊂ V (1) ⊂ V (0) ⊂ V (−1) ⊂ · · ·
∪m∈Z V (m) = L2 (R ).
∩m∈Z V (m) = {0}.
x (t ) ∈ V (0) ⇐⇒ x (2−m t ) ∈ V (m) .
x (t ) ∈ V (0) ⇐⇒ x (t − n) ∈ V (0) .
Exists orthonormal basis {φn }n∈Z , for V (0) such that
φn (t ) = φ(t − n).
φ(t )
Lecture, 7.april 2005
INF5300 JPEG2000
Associated lters and mother wavelet
Denition (An MRA on L2(R ))
satisfying
(MR-1)
(MR-2)
(MR-3)
(MR-4)
(MR-5)
Øyvind Ryan
is called the scaling function.
Øyvind Ryan
φ(t ) =
√
2
∞
n=−∞
g0 [n]φ(2t − n)
Associate lter g1 orthogonal to g0 through
(1)
g1 [n] = (−1)n+1 g0 [−(n − 1)],
We associate g0 , g1 with two-channel synthesis subband transform
S.
Associate g1 √withmother wavelet ψ through
ψ(t ) = 2 ∞
g1 [n]φ(2t − n)
√ n=−∞
(m )
ψn (t ) = 2−m ψ(2−m t − n)
√
(m )
φn (t ) = 2−m φ(2−m t − n)
(m )
W (m) = span(ψn )
INF5300 JPEG2000
Øyvind Ryan
INF5300 JPEG2000
Lecture, 7.april 2005
JPEG2000
Lecture, 7.april 2005
CWT versus CFT
Subspace expansions
Can show from MR conditions, two-scale equation that:
S has an inverse (analysis) transform A, associated with lters
h0 , h1
A is orthonormal.
orthonormal basis of L2 (R )
= V (m+1) ⊕ W (m+1)
= V (m+2) ⊕ W (m+2) ⊕ W (m+1)
= V (m+3) ⊕ W (m+3) ⊕ W (m+2) ⊕ W (m+1)
=
CFT f (ω) =
CFT
W (m )
V (m )
JPEG2000
···
Lecture, 7.april 2005
−∞
ω ).
function of only frequency (
CWT f (m, n) =
CWT
(t )e −j ωt dt
Expands in basis
∞
f
−∞
{e j ωt }.
(m )
(t )ψn (t )dt
function of both time (n) and frequency (m). Expands in
(m )
{ψn (t )}.
INF5300 JPEG2000
Øyvind Ryan
JPEG2000
Haar wavelet
f
basis
Øyvind Ryan
∞
INF5300 JPEG2000
Lecture, 7.april 2005
JPEG2000
Subspace expansions using lters
φ(t ) =
1
0
⎧
⎨ −1
ψ(t ) =
1
⎩
0
if
if
t
t
if
if
if
∈ (0, 1)
∈ (0, 1)
t
t
t
∈ (0, 12 )
∈ ( 12 , 1)
∈ (0, 1)
V (m) (the resolution space) consists of functions which are
piecewise constant on intervals of length 2m .
This is a 'bad' wavelet basis, since many basis functions are needed
to provide good approximations.
Note that the Haar wavelet is not continous. More regular wavelets
means faster convergence of the wavelet coecients. This results
in good compression. Wavelet in lossy JPEG2000 is more regular
then the one in lossless JPEG2000.
Øyvind Ryan
INF5300 JPEG2000
Can expand a signal in the W (m) subspaces using analysis lters
and the two-scale equation:
x =
n ∈Z
(0)
y0 [n]φ(t − n) =
n ∈Z
n ∈Z
∈V (0)
↓h0 ↓h1
(1)
(1)
y0 [n]φn (t ) +
∈V (1)
↓h0 ↓h1
(2)
(2)
y0 [n]φn (t ) +
∈V (2)
↓h0 ↓h1
Øyvind Ryan
n ∈Z
n ∈Z
(1)
(1)
y1 [n]ψn (t )
∈W (1)
(2)
(2)
y1 [n]ψn (t )
∈W (2)
INF5300 JPEG2000
Lecture, 7.april 2005
JPEG2000
Explanation
Lecture, 7.april 2005
JPEG2000
Separable extension
Dierent lines correspond to dierent resolutions.
Highest resolution (most detail) corresponds to the rst lines.
Resolution 0, 1, 2 only listed
Apply (one-dimensional transform) to rows, then to columns. Do
this for all combinations of low-pass/high-pass.
Example (Image with 4 resolutions)
Gives bands LL1 , HL1 , LH1 , HH1 after rst applications of
transforms. LL1 band decomposed further.
R0 = LL3 . Lowest resolution
R1 = HL3 ∪ LH3 ∪ HH3
R2 = HL2 ∪ LH2 ∪ HH2
R3 = HL1 ∪ LH1 ∪ HH1 .
Highest resolution
JPEG2000 Resolution scalability: Contributions from lower
resolutions (the last lines) rst in code-stream. The order of the
resolutions in the codestream is thus R0 , R1 , R2 , R3 .
Figure shown on next foil.
Øyvind Ryan
Lecture, 7.april 2005
INF5300 JPEG2000
JPEG2000
Visualization of subband decomposition
LL3 HL3
LH3 HH3
HL2
HL1
LH2
HH2
LH1
HH1
Can also decompose HL, LH, HH further, not done in baseline
JPEG2000. Drawback of orthonormal wavelets: Exist no interesting
such with linear phase.
Øyvind Ryan
Øyvind Ryan
INF5300 JPEG2000
Lecture, 7.april 2005
INF5300 JPEG2000
JPEG2000
Linear phase
A linear phase lter h satises one of the following:
Symmetry h[d + n] = h[d − n]∀n
Anti-symmetry h[d + n] = −h[d − n]∀n
d is called the centre of symmetry.
Even length lters Symmetry is about odd multiple of 12
Odd length lters Symmetry is about d ∈ Z .
Odd length lters have nicest behaviour w.r.t. boundaries of nite
sequences. Used in JPEG2000.
The term linear phase somes from the fact that the phase response
is linear in the frequency domain. Importance of linear phase:
Can reduce total number of multiplications
Finite sequences can be symmetrically extended across the
boundaries. The lter will preserve the symmetry and the
boundary
The linear phase property is important in many DSP applications
due to this.
Øyvind Ryan
INF5300 JPEG2000
Lecture, 7.april 2005
JPEG2000
Lecture, 7.april 2005
Constructing biorthogonal subband transforms from wavelets
Replace (MR-5) with linear independence. g0 , g1 , ψ as before.
Can construct scaling function so that S is invertible.
(φ, ψ) is now called a biorthogonal wavelet basis.
A = S in this case.
Can actually give useful MRA's with linear phase lters.
{W (m) }m not orthogonal in this case.
The wavelet basis (φ̃, ψ̃) we obtain by switching the role of the
analysis and synthesis lters is called a dual wavelet basis. For dual
wavelet bases we can show that
(m )
(m̃)
ψ , ψ̃ = δ[n − ñ]δ[m − m̃],
n
ñ
also called biorthogonality. JPEG2000 uses two biorthogonal
wavelets:
Lossless mode Spline 5/3 transform
Lossy mode CDF 9/7 transform
Øyvind Ryan
Lecture, 7.april 2005
INF5300 JPEG2000
JPEG2000
Spline 5/3 transform
√
1
− 14
A[−1] = 2 − 80 −
1
8
3
√
− 14
4
A[0] = 2 1 3
A[1] =
√
2
4
− 81
1
4
0
− 18
Lossless since only dyadic fractions involved. Computer can perform
exact operations using bitshifts and addition. Floating point
arithmetic not needed.
Implemented with 2 lifting steps
CDF 9/7 transform: Only the A[−2], A[−1], A[0], A[1], A[2] are
nonzero. Irrational coecients.
Implemented with 4 lifting steps
Øyvind Ryan
JPEG2000
4
Lecture, 7.april 2005
INF5300 JPEG2000
JPEG2000
Example resolution scalability
Lifting
Way to factor a lter into shorter lters. i.e. lters with less
nonzero coecients. Utilized in the two modes of JPEG2000:
Lossless mode Filters of length 3 and 5 reduced to two lter
applications of length 2
Lossy mode Filters of length 7 and 9 reduced to four lter
applications of length 2
Factoring reduces number of multiplications, number of needed
register variables. In short, a lter is factored into steps of the form
{l }
{l }
y1−p (l ) [n]
= y 1−
yp (l ) [n]
=
{l }
File with no loss, i.e. all wavelet subband spaces included. It's size
is 105kb
( ) [n]
{l −1}
yp (l ) [n] +
p l
i
{l −1}
[n
p (l )
λ [i ]y1−
l
− i ],
where y0 are the even-indexed part of the sequence, y1 ,
odd-indexed part.
{l }
{l }
Øyvind Ryan
INF5300 JPEG2000
Øyvind Ryan
INF5300 JPEG2000
Lecture, 7.april 2005
JPEG2000
Example resolution scalability
( )
,
Lecture, 7.april 2005
INF5300 JPEG2000
JPEG2000
Example resolution scalability
We then remove the W110 -coecients also. It's size is 84kb.
( )
,
Øyvind Ryan
Lecture, 7.april 2005
INF5300 JPEG2000
JPEG2000
Example resolution scalability
We then remove the W011 -coecients also. It's size is 73kb.
( )
,
Øyvind Ryan
JPEG2000
Example resolution scalability
We then remove the W111 -coecients. It's size is 94kb.
Øyvind Ryan
Lecture, 7.april 2005
INF5300 JPEG2000
We then remove the W121 -coecients also. It's size is 57kb.
( )
,
Øyvind Ryan
INF5300 JPEG2000
Lecture, 7.april 2005
JPEG2000
Example resolution scalability
( )
,
Lecture, 7.april 2005
JPEG2000
Example resolution scalability
We then remove the W110 -coecients also. It's size is 37kb.
Øyvind Ryan
Lecture, 7.april 2005
INF5300 JPEG2000
JPEG2000
Finally, we remove the W0(1,1) -coecients also. It's size is 17kb.
Øyvind Ryan
Lecture, 7.april 2005
INF5300 JPEG2000
JPEG2000
Other features in the JPEG2000 standard
Remarks
JPEG2000 oers the possibility of skipping the wavelet
transform altogether. Image on previous foils is smaller when
compressed with no wavelet transform. Therefore, wavelet
transform compresses information only for certain types of
images.
Compressed images on previous foils may also have been
smaller if antialiasing had been applied prior to compression.
Wavelet transform compresses more on images with less sharp
edges (these are removed with antialiasing)
Supports general bitdepths. JPEG restricted to 8 and 12.
Several progression orders supported.
Unied framework for lossy and lossless cosing
No restriction on number of image components.
Bad quality on these images if lossy mode is applied. Due to
the quantization eect on sharp edges. Blurring
Øyvind Ryan
INF5300 JPEG2000
Øyvind Ryan
INF5300 JPEG2000
Lecture, 7.april 2005
1
2
JPEG2000
Lecture, 7.april 2005
JPEG2000
Quantization and Coding
3
JPEG
4
5
Quantization tables in the codestream
Human coding. Zig zag scan order within each
block
6
6
8 7
8
9
8 ?
JPEG: Blocks traversed in order shown.
Progressive JPEG: Traversal may be split in several passes:
Spectral selection Each pass contains a portion of the
spectral components for each block
Successive approximation Each pass contains lower precision
versions of the block values
8 × 8 blocks only tiling with JPEG.
Øyvind Ryan
Lecture, 7.april 2005
INF5300 JPEG2000
JPEG2000
JPEG2000 standard extensions
No quantization with lossless JPEG2000.
Deadzone scalar quantization with lossy
JPEG2000.
MQ coding. Adaptive arithmetic coding of the
bitplanes of the wavelet coecients in each
block. Adapts very well to parallelism. Lots ow
work done on hardware architectures. Simpler
hardware architecture for DWT. Highly
congurable.
Øyvind Ryan
Lecture, 7.april 2005
INF5300 JPEG2000
JPEG2000
JPEG2000 applications
Part 1: contents in basic JPEG2000 system.
Part 2 Extensions to Part 1
Extended le format
Further transformation of subbands
User denable wavelet kernels
Other means of quantization. TCQ (Trellis Coded
Quantization)
Part 3 Motion JPEG2000, video format utilizing JPEG2000.
Similar to MPEG4, but with no inter-frame coding.
Part 4 Conformance testing
Part 5 Reference software
JasPer C reference implementation
JJ2000 Java reference implementation
Part 9 JPIP, JPEG2000 network transport protocol.
Scalable delivery of JPEG2000 image data
Metadata interrogation at the le level
Øyvind Ryan
JPEG2000
INF5300 JPEG2000
National Museum of Science and Industry (NMSI) Three major
gallery spaces in England, making images, image and
item descriptiosn available on web. JPEG2000 taking
over as format for images, eliminating need for image
database with dierent scaled versions of same image.
Image descriptions and linked data included in
JPEG2000 le as image metadata. JPIP may
eliminate need for Image Database further.
Police Scientic Development Branch (PSDB) Application of
Motion JPEG2000 to crime scene investigations.
Image File metadata used to record and verify
recompression details as part of the audit trail for
crime scenes.
Øyvind Ryan
INF5300 JPEG2000
Download