Video codec standards review - The University of Texas at Arlington

advertisement
Current Video Coding Standards:
H.264/AVC, Dirac, AVS China and VC-1
K. R. Rao, IEEE Fellow, and Do Nyeon Kim
Dept. of Electrical Engineering
University of Texas at Arlington
Arlington, Texas
rao@uta.edu; dnkim@uta.edu
Abstract—Video coding standards: H.264/AVC,
DIRAC, AVS China and VC-1 are presented. These
are the latest standards and are adopted by ITUT/ISO-IEC, BBC, China standards organization and
SMPTE respectively.
Besides presenting these
standards, research potential and as well projects
(both at UG and grad levels) are emphasized. These
are available by accessing the database for research
and projects in [18]. Web/ftp sites for accessing
standards documents, software, test sequences,
conformance bit streams, industry activities etc are
provided.
HEVC
Keywords- H.264/AVC; Dirac; AVS China; VC-1
I.
INTRODUCTION
Residual image data is that which is obtained
through taking the pixel by pixel differences
between the original data and the image
reconstructed after lossy compression. For lossless
compression, the residual from compression are
separately compressed using an appropriate lossless
compression approach [50]. Work has been done on
optimizing the codec, either by reducing the
complexity,
encoding
time,
improving the quality, or improving
the robustness of the standard using
algorithms for error concealment and
error correction (Fig. 1).
MPEG-4
AVC/H.264
is
developed
for
multimedia
applications [1, 3, 5-13, 18a]. It
adopted advanced coding techniques
such as multiple-reference frame
prediction,
and
context-based
adaptive binary arithmetic coding
(CABAC).
It
provides
high
compression efficiency. Thus it
enables to compress video to 1.5~2
Mbps for standard definition (SD),
and 6~8 Mbps for HD. It can save
storage space, channel bandwidth,
and frequency spectrum.
Figure 1. Optimizing the codec in terms of complexity and robustness.
We suggest that you use a text box to insert a graphic (which is ideally
a 300 dpi TIFF or EPS file, with all fonts embedded) because, in an MSW
document, this method is somewhat more stable than directly inserting a
picture.
To have non-visible rules on your frame, use the MSWord “Format”
pull-down menu, select Text Box > Colors and Lines to choose No Fill
and No Line.
Figure 2. Profiles in H.264/AVC [1].
II.
H.264/AVC
A. H.264 intra-frame encoding
H.264 (Figs. 2, 3 and 4) uses the methods of adaptive
prediction of intra-coded macroblocks to reduce the high
amount of bits coded by original input signal itself. For
encoding a block or macroblock in intra-coded mode, a
prediction block is formed based on previously reconstructed
blocks. For the luma samples, the prediction block may be
formed for each 4 × 4 subblock, each 8 × 8 block, or for a 16
× 16 macroblock. One mode is selected from a total of 9
prediction modes for each 4 × 4 (similar to Fig. 7) and 8 × 8
luma blocks; 4 modes for a 16 × 16 luma block; and 4 modes
for each chroma block. The residuals generated from the
difference between the current block and the best mode are
further processed by the transform and quantization unit, and
reconstructed by their inverse operations to be the reference
for the next macro block. The coefficients after quantization
are encoded by entropy coding for final bit stream output.
The best prediction mode(s) are chosen utilizing the R-D
optimization which is described as:
J(s, c, MODE |QP)
= D(s, c, MODE |QP) + MODE R(s, c, MODE |QP)
(1)
The distortion D(s, c, MODE |QP) is measured as sum of
squared differences (SSD) between the original block s and
the reconstructed block c, and QP is the
quantization parameter, MODE is the
prediction mode. R(s, c, MODE |QP) is a
number of bits for coding the block. The
modes(s) with the minimum J(s, c, MODE
|QP) are chosen as the prediction mode(s)
of the macroblock.
Figure 3. Coding structure for H.264/AVC encoder for a macroblock [7].
B. Profiles
Jizhun Profile (base profile or main profile) is defined in
AVS Part 2 and is targeted mainly at digital video applications
like commercial broadcasting and storage media. It has
moderate computational complexity. Jiben Profile (basic
profile or baseline profile) is defined in AVS Part 7 for mobile
applications. Shenzan and Jiaqiang profiles are defined in AVS
Part 2 for video surveillance and multimedia entertainment
respectively.
Adpative seven block size ME/MC
prediction for
inter-frame prediction is
shown in Fig. 5.
III.
AVS CHINA [42-48]
A. Standards
AVS Part 1 System comprises a set of
standards that converts single/multi
channel audio and video bit streams into a
single multiplexed stream for transmission
and storage and also defines an encoding
syntax which is necessary for synchronous
de-multiplexing of audio and video bit streams.
AVS System basically comprises of two data streams
namely the program stream and transport stream where each
one has its own applications. AVS Part1 complies with AVS
Part 2 or AVS Part 7 video, AVS Part 3 audio as its elementary
bit stream [41].
While H.264 specifies only video, it is meaningful to
encode and multiplex audio with the video bitstream. Hence
this is a viable research area where the best audio codec can be
multiplexed with the latest video codecs such as AVS China,
H.264/AVC, VC-1 and Dirac (Fig. 6). Ten parts of AVS china
are listed in Table I.
Figure 4. H.264/MPEG-4 AVC decoder block diagram [1].
8
16
16
8
MB
16
0
16
0
0
8
8
8
8
0
8
4
0 1
1
2
3
1
1
Sub MB
0
4
8 0
1
4
4 0
1
2
3
Figure 5. MB and sub MB partitions for adpative ME/MC prediction
(seven block sizes). The coded blocks with motion vectors are ordered in a
raster-scan order.
Nine adaptive directional intra prediction modes
including the DC mode for luminance in AVS-China
Part 7 is show in Fig. 7 [48].
C. Inter-frame prediction (Part 7)
Similar to Fig. 5, seven sizes of the blocks in
inter-frame adaptive ME/MC prediction are 16×16,
16×8, 8×16, 8×8, 8×4, 4×8 and 4×4 depending on the
amount of information present within the macro-block.
Motion is predicted up to ¼ pixel accuracy. If the
half_pixel_mv_flag is 1 then it is up to ½ pixel
accuracy.
Eight-tap filter F1 = (−1, 4, −12, 41, 41, −12, 4,
−1) and four-tap filter F2 = (−1, 5, 5, −1) are used for
horizontal and vertical interpolations respectively for
½ pixel MV search and averaging (liner interpolation)
is used for ¼ pixel accuracy.
Figure 8. Comparison of H.264 and VC-1.
IV.
SMPTE VC-1 (WINDOWS MEDIA VIDEO 9)
VC-1 [19-22] is an informal name of the SMPTE 421M
video codec. This standard initially has been developed by
Microsoft – Window Media Video 9. WMV-9 supports
progressive video and is mainly used for online video services.
VC-1 extends WMV-9 and adds features necessary for
broadcast services such as interlace support. It is a supported
standard for Blu-ray Discs and Windows Media Video. The
high definition DVD format Blue ray has mandated MPEG-2,
H.264 and VC-1 as the video compression formats. VC-1 is
compared with H.264 in Fig. 8.
Figure 6. Multiplexing of audio/video and lip sync.
V.
Figure 7. Nine adaptive directional intra prediction modes
including the DC mode for luminance in AVS-China Part 7
[48].
TABLE I.
TEN PARTS OF AVS CHINA STANDARD FAMILY [41]
AVS
Contents
Part 1
Part 2
Part 3
Part 4
Part 5
Part 6
Part 7
Part 8
Part 9
Part 10
System for broadcasting
SD/HD video
Audio
Conformance test
Reference software
Digital rights management
Mobility video
System over IP
File format
Mobile speech and audio coding
DIRAC
Dirac [23-40] is a family of video codecs spanning mobile
to UHDTV and film video post production. For low bit rate
applications such as the Internet, we can think of Dirac as
functionally similar to H.264 (Fig. 8) and offering similar
compression performance. For high quality compression in
production, Dirac is functionally similar to JPEG 2000 [49].
Dirac is royalty free open technology. Dirac is simple, low
cost. Dirac is a hybrid motion-compensated video coding,
whereas Dirac Pro (standardized as SMPTE VC-2) is only intra
frame coding for professional or production applications.
In the Dirac codec, image motion is tracked and the motion
information is used to make a prediction of a later frame. A
transform is applied to the prediction error between the current
frame and the previous frame aided by motion compensation
and the transform coefficients are quantized and entropy coded
(Figs. 9 and 10). Temporal and spatial redundancies are
removed by motion estimation, motion compensation and
discrete wavelet transform respectively. Dirac uses a more
flexible and efficient form of entropy coding called arithmetic
coding which packs the bits efficiently into the bit stream [39,
48]. The two-dimensional discrete wavelet transform provides
Dirac with the flexibility to operate at a range of resolutions.
This is because wavelets operate on the entire picture at once,
rather than focusing on small areas at a time. In Dirac, the
discrete wavelet transform plays the same role as the DCT in
MPEG-2 in de-correlating data in a roughly frequencysensitive way, whilst having the advantage of preserving fine
details better than block based transforms [37]. An experiment
showed the difference in the encoding time taken by Dirac and
H.264 / MPEG-4 for QCIF, CIF and SD sequences. The
simplicity of the Dirac encoder is evident, as its encoding speed
was much higher compared to the H.264 AVC [37].
evaluation (Figs. 11, 12 and 13). The two methods are very
close and comparable in compression, PSNR and SSIM. Also,
a significant improvement in encoding time is achieved by
Dirac, compared to H.264 for all the test sequences [37].
Figure 11. Compression ratio comparison of Dirac and H.264 for “MissAmerica” QCIF sequence [37].
Figure 9. Dirac encoder architecture [38, 39].
Figure 10. Dirac decoder architecture [37].
VI.
Figure 12. SSIM comparison of Dirac and H.264 for “Miss-America” QCIF
sequence [37].
SIMULATION RESULTS
The comparison between H.264 and AVS-China’s
performance was produced by encoding several test sequences
at different bit rates and shown in Figs. 14 thru 17. Test
sequences with HD (1280×720) and standard-definition (SD)
(720×480) are used for evaluation. The two methods are very
close and comparable in peak-to-peak signal-to-noise ratio
(PSNR).
Objective test methods attempt to quantify the error
between a reference and an encoded bit stream. To ensure the
accuracy of the tests, each codec must be encoded using the
same bit rate. Since the latest version of Dirac does include a
constant bit rate (CBR) mode, the comparison between Dirac
and H.264’s performance was produced by encoding several
test sequences at different bit rates. By utilizing the CBR mode
within H.264, we can ensure that H.264 is being encoded at the
same bit rate as that of Dirac.
Objective tests are divided into three sections, namely (i)
compression, (ii) structural similarity index (SSIM), and (iii)
peak-to-peak signal-to-noise ratio (PSNR). The test sequences
“Miss-America” QCIF (176×144), “Stefan” CIF (352×288)
and “Susie" standard-definition (SD) (720×480) are used for
Figure 13. PSNR (peak-to-peak signal-to-noise ratio) comparison of Dirac
and H.264 for “Miss-America” QCIF sequence [37].
VII. CONCLUSIONS
Video coding standards: H.264/AVC, DIRAC, AVS China
and VC-1 are presented. Performance comparison of these
standards using different test sequences is presented. Their
functionalities are summarized in Tables II and III. In general
H.264 performs better compared to Dirac, AVS China and VC1, but at the cost of additional complexity.
[2]
[3]
Figure 14. Bitrate vs. MSE for Harbour – HDTV sequence (1280 
720p). AVS Jizhun Profile is a main profile.
Figure 15. Bitrate vs. MSE for Harbour – HDTV sequence (1280 
720p).
Figure 16. Bitrate vs. PSNR for Bus – SDTV sequence (720  480i).
H.264 AVC JM software: http://iphome.hhi.de/suehring/tml/
D. Kumar, P. Shastry and A. Basu, “Overview of the H.264 / AVC”, 8th
Texas Instruments Developer Conference India, 30 Nov – 1 Dec 2005,
Bangalore.
[4] H.264 encoder and decoder:
http://www.adalta.it/Pages/407/266881_266881.jpg
[5] “H.264 video compression standard”, White paper, Axis
communications.
[6] R. Schäfer, T. Wiegand and H. Schwarz, “The emerging H.264/AVC
standard”, EBU Technical Review, Jan. 2003.
[7] T.Wiegand, et al “Overview of the H.264/AVC video coding standard”,
IEEE Trans. CSVT, vol.13, pp 560-576, July 2003.
[8] M. Fieldler, “Implementation of basic H.264/AVC decoder”, seminar
paper at Chemnitz University of Technology, June 2004
[9] MPEG-4: ISO/IEC JTC1/SC29 14496-10: Information technology –
Coding of audio-visual objects - Part 10: Advanced Video Coding,
ISO/IEC, 2005.
[10] Advanced Video Coding for Generic Audiovisual Services, ITU-T Rec.
H.264/ISO/IEC 14496-10, Mar. 2005.
[11] S.K.Kwon, A.Tamhankar and K.R.Rao, “Overview of H.264 / MPEG-4
Part 10” J. Visual Communication and Image Representation, Vol 17,
pp.186-216, April 2006.
[12] D. Marpe, T. Wiegand and G. J. Sullivan, “The H.264/MPEG-4 AVC
standard and its applications”, IEEE Communications Magazine, vol. 44,
pp. 134-143, Aug. 2006.
[13] T. Wiegand and G. J. Sullivan, “The H.264 video coding standard”,
IEEE Signal Processing Magazine, vol. 24, pp. 148-153, March 2007.
[14] Z. Wang, et al “Image quality assessment: From error visibility to
structural similarity”, IEEE Trans. on Image Processing, vol. 13, pp.
600-612, Apr. 2004. http://www.ece.uwaterloo.ca/~z70wang/
[15] H. Jia and L. Zhang, “Directional diamond search pattern for fast block
motion estimation”, IEE Electronics Letters, vol. 39, No. 22, pp. 15811583, 30th Oct. 2003.
[16] Video test sequences (YUV 4:2:0):
http://trace.eas.asu.edu/yuv/index.html
[17] Video test sequences ITU601:
http://www.cipr.rpi.edu/resource/sequences/itu601.html
[18] K.R. Rao, Mutimedia Processing, Course Website, UT Arlington:
http://ee.uta.edu/Dip/Courses/EE5359/index.html
[18a] I. Richardson, H.264 Advanced Video Compression Standard, II
Edition, Hoboken, NJ: Wiley, 2010.
[18b] Y.Q. Shi and H. Sun, “ Image and video compression for multimedia
engineering”, Boca Raton: CRC Press, II Edition, (Chapter on H. 264),
2008.
[18c] B. Furht and S.A. Ahson, “Handbook of mobile
broadcasting, DVB-H, DMB, ISDB-T and MEDIAFLO”, Boca
Raton, FL: CRC Press, 2008 (H.264 related chapters).
MPEG
and
<http://en.wikipedia.org/wiki/MPEG>
Figure 17. Bitrate vs. MSE for Bus – SDTV sequence (720  480i).
REFERENCES
H.264/AVC
[1]
A. Puri, X. Chen and A. Luthra, “Video coding using the H.264/MPEG4 AVC compression standard”, Signal Processing: Image
Communication, vol. 19, pp. 793-849, Oct. 2004
H.26x
series
HEVC
[1] G.J. Sullivan and J.-R. Ohm, “ Recent developments in
standardization of high efficiency video coding”, SPIE Optics
+ Photonics, vol. 7798, paper 7798-3, San Diego, CA, Aug.
2010.
[2] IEEE Trans. on CSVT, vol. 20, Special section on high efficiency
video coding (several papers), Dec. 2010.
DIRAC
VC-1
[19] VC-1 Compressed Video Bitstream Format and Decoding Process,
SMPTE 421M-2006, SMPTE Standard, 2006.
[20] S. Srinivasan and S. L. Regunathan, “An overview of VC-1,” Proc.
SPIE, vol. 5950, pp. 720–728, 2005.
[21] Microsoft Windows Media:
http://www.microsoft.com/windows/windowsmedia
[22] H. Kalva and J.-B. Lee, The VC-1 and H.264 video compression
standards for broadband video services, Springer, 2008.
TABLE II.
Algorithmic
Element
Intra
Prediction
Picture coding
type
Motion
compensation
block size
Motion vector
Precision
P frame type
B frame type
In loop filters
SMPTE VC-1
(Windows Media
Video 9)
Frequency
domain
coefficient
Frame
Field
Picture AFF
MB AFF
16×16, 16×8,
8×16, 8×8,
8×4, 4×8,
4×4 (seven
variable sizes)
Full pel
Half pel
Quarter pel
Single
reference
Multiple
reference
Entropy coding
CAVLC,
CABAC
Transform
Main: 4×4
integer DCT,
High:
4×4 & 8×8
integer DCTs
Quantization
scaling
matrices
Other
COMPARISON OF VARIOUS VIDEO COMPRESSION STANDARDS
MPEG-4
AVC
(H.264)
4×4 spatial
16×16 spatial
I-PCM
One reference
each way,
Multiple
reference,
Direct &
spatial direct
weighted
prediction
De-blocking
[23] K. Onthriar, K. K. Loo and Z. Xue, “Performance comparison of
emerging Dirac video codec with H.264/AVC,” IEEE Int’l Conf. on
Digital Telecommunications, ICDT 2006, vol. 6, Page: 22, Issue: 29-31,
Aug. 2006.
[24] T. Davies, “The Dirac Algorithm”:
http://dirac.sourceforge.net/documentation/algorithm/, 2008.
[25] M. Tun and W. A. C. Fernando, “An error-resilient algorithm based on
partitioning of the wavelet transform coefficients for a Dirac video
codec” , Tenth International Conference on Information Visualization,
2006, IV, Vol. 5-7 , pp.: 615 – 620, Issue : July 2006.
[26] Daubechies wavelet: http://en.wikipedia.org/wiki/Daubechies_wavelet
Dirac
Dirac PRO
(SMPTE VC-2)
AVS China
Part 2
4×4 spatial
4×4 spatial
(forward,
backward)
8×8 block based
Intra Prediction
Frame
Field
Picture AFF
MB AFF
16×16, 8×8
Frame
Intra – Frame,
Field (Interlace,
Progressive)
Frame
4×4
N/A
16×16, 16×8,
8×16, 8×8
16×16, 16×8,
8×16, 8×8, 8×4,
4×8
Full pel
Half pel
Quarter pel
Single reference,
Intensity
compensation
1/8 pel
N/A
1/4 pel
1/4 pel
Single
reference,
Multiple
reference
No P frames
One reference
each way
One reference
each way,
Multiple
reference
No B frames
Single and
multiple
reference
(maximum of 2
reference
frames)
One reference
each way,
Multiple
reference.
Direct and
symmetrical
mode
Single and
multiple
reference
(maximum of 2
reference
frames)
No B frames
De-blocking
Overlap
transform
Adaptive VLC
None
None
De-blocking
filter
De-blocking
filter
Arithmetic
coding
Context based
adaptive binary
arithmetic coding,
Exponential
Golomb coding
4×4
wavelet transform
2D variable
length coding.
Context based
adaptive 2D
variable length
coding
8×8 integer
DCT
4×4 integer
DCT
Quantization
scaling matrices
Quantization
scaling matrices
Quantization
scaling matrices
4×4, 8×8
8×4 & 4×8
integer DCTs
4×4 wavelet
transform
Range reduction.
In stream-post
processing
control
Quantization
scaling
matrices
AVS China
Part 7
(AVS-Mobile)
Intra_4×4
(4×4 spatial).
Direct Intra
Prediction
Frame
[27] Daubechies wavelet filter design: http://cnx.org/content/m11159/latest/
[28] Vorbis: http://www.vorbis.com/
[29] T. Borer, “Dirac coding: Tutorial & Implementation”, EBU Networked
Media Exchange seminar, June 2009.
TABLE III.
Standard
H.264/MPE
G-4 Part 10
[31] Dirac video codec - A programmer's guide:
http://dirac.sourceforge.net/documentation/code/programmers_guide/toc
.htm
[32] Dirac Pro: http://www.bbc.co.uk/rd/projects/dirac/diracpro.shtml
STANDARD
Main Compression Technologies
Standardization body
JVT (ISO/IEC & ITU-T)
Main Target Bitrate
8 kb/s up to about 150
Mb/s
– Integer DCT
– Adaptive quantization
– Zigzag reordering
– Alternate Scan ordering
– Predictive motion compensation
– Bi-directional motion compensation
– Variable block size motion compensation
with small block sizes
– Quarter pixel motion compensation
– Motion vector over picture boundaries
– Multiple reference picture motion
compensation
– Adaptive intra directional prediction
– In-loop deblocking filter
AVS Part 2
Standardization body
AVS workgroup
Main Target Bitrate
1 Mb/s up to about 20
Mb/s
AVS Part 7
Standardization body
AVS workgroup
Main Target Bitrate
1 Mb/s up to about 20
Mb/s
– Arithmetic coding
– Variable length coding
– Error resilient coding
– Interlace handling: Picture-level adaptive
frame/field coding (PAFF)
– Macroblock-level adaptive frame/field
coding (MBAFF)
– Intra prediction: 5 modes for luma and 4
modes for chroma
– Motion compensation: 16×16, 16×8,
8×16, 8×8 block size
– Resolution of MV: 1/4-pel, 4-tap
interpolation filter
– Transform: 16 bit-implemented 8×8
integer cosine transform
– Quantization and scaling: scaling only in
encoder
– Entropy coding: 2D-VLC and Arithmetic
Coding
– In-loop deblocking filter
– Motion vector prediction
–Adaptive scan
– Intra prediction: 9 modes for luma and 3
modes for chroma
– Motion compensation: 16×16, 16×8,
8×16, 8×8, 8×4, 4×8 block size
– Resolution of MV: 1/4-pel
– Transform: 16 bit-implemented 4×4
integer cosine transform
– Quantization and scaling: scaling only in
encoder
– Entropy coding: Context based adaptive
2D variable length coding
– In-loop deblocking filter
[30] Dirac software and source code: http://diracvideo.org/download/diracresearch/
Main Target Applications
– Broadcast over cable, terrestrial and
satellite
– Interactive or serial storage on optical
and magnetic devices, DVD, etc
– Conversational services
– Video on demand
– MMS over ISDN, DSL, Ethernet, LAN,
wireless and mobile networks
– HDTV
– Digital camera
– HD broadcasting
– High density storage media
– Video surveillances
– Video on demand
– Record and local playback on mobile
devices
– Multimedia Message Service (MMS)
– Streaming and broadcasting
– Real-time video conversation
[33] T. Davies, “A modified rate-distortion optimization strategy for hybrid
wavelet video coding,” ICASSP 2006 vol. 2, pp. , May 2006.
[34] M. Tun, K. K. Loo and J. Cosmas, “Semi-hierarchical motion estimation
for the Dirac video codec,” 2008 IEEE International Symposium on
Broadband Multimedia Systems and Broadcasting, pp. 1-6, March 31April 2, 2008.
TABLE III.
Standard
Dirac
Main Target Bitrate
Few hundred kbps up to
about 15 Mbps
DiracPRO
Standardization body
BBC R&D
SMPTE
Main Target Bitrate
Lossless HD to < 50 Mb/s
Compression ratio 20:1
SMPTE VC-1
(WMV-9)
STANDARD (CONTINUED)
Main Compression Technologies
Standardization body
BBC R&D
Mozilla Public License
(MPL)
(SMPTE VC-2)
SPIE-IS & T Electronic Imaging, SPIE vol. 6508, pp. 650822, Jan. 29,
2007.
[44] W. Gao et al., “AVS - The Chinese next-generation video coding
Standardization body
SMPTE 421M
Main Target Bitrate
10 kbps – 8 Mbps
Main Target Applications
– 4×4 wavelet transform
– Dead-zone quantization and scaling
– Entropy coding: Arithmetic coding
– Hierarchical motion estimation
– Intra, Inter prediction
– Single and multiple reference P, B
frames
– 1/8 pel motion vector precision
– 4×4 overlapped block based motion
compensation (OBMC)
– Daubechies wavelet filters
– Broadcasting
– Live streaming video
– Pod casting
– Peer to peer transfers
– HDTV with SD (standard definition)
simulcast capability
– Desktop production
– News links
– Archive storage
– PVRs (personal video recorders)
– Multilevel Mezzanine coding
– 4×4 wavelet transform
– Dead-zone quantization and scaling
– Entropy coding: Context based adaptive
binary arithmetic coding (CABAC),
exponential Golomb coding
– Intra-frame only (forward, backward
prediction modes also available)
– Frame, Field coding (Interlaced and
progressive)
– Daubechies wavelet filters
– Integer DCT
– Adaptive block size transform: (8×8),
(8×4), (4×8) and (4×4)
– Motion estimation for (16×16) and (8×8)
blocks
– ½ pixel and ¼ pixel motion vector
resolution
– Dead zone and uniform quantization
– Multiple VLCs
– In-loop deblock filtering, fading
compensation
– Professional (high quality, low latency)
applications (not for end user
distribution)
– Lossless or visually lossless
compression for archives
– Mezzanine compression for re-use of
existing equipment
– Low delay compression for live video
links
[35] H. Eeckhaut et al., “Speeding up Dirac’s entropy coder”, Proc. 5th
WSEAS Intl. Conf. on Multimedia, Internet and Video Technologies,
pp. 120-125, Greece, Aug. 2005.
[36] The Dirac web page and developer support: http://dirac.sourceforge.net
[37] A. Ravi and K.R. Rao, “Performance analysis and comparison of the
Dirac video codec with H.264 / MPEG-4 Part 10 AVC,” IJWMIP.
(accepted)
[38] BBC Research on Dirac:
http://www.bbc.co.uk/rd/projects/dirac/index.shtml
[39] T. Borer and T. Davies, “Dirac video compression using open
technology,” BBC EBU Technical Review, July 2005.
[40] C. Gargour et al., “A short introduction to wavelets and their
applications,” IEEE Circuits and Systems Magazine, vol. 9, pp. 57-68, II
Quarter, 2009.
AVS China
[41] GB/T 20090.1 Information technology - Advanced coding of audio and
video – Part 1: System, Chinese AVS standard.
[42] L. Yu et al., “An Overview of AVS-Video: tools, performance and
complexity”, Visual Communications and Image Processing 2005, Proc.
of SPIE, vol. 5960, pp. 596021, July 31, 2006.
[43] L. Yu et al., “An area-efficient VLSI architecture for AVS intra frame
encoder” Visual Communications and Image Processing 2007, Proc. of
– Media delivery over the Internet
– Broadcast TV
– HD DVD
– Digital projection in theaters, mobile
phones
– DVB-T, DVB-S
standard” NAB, Las Vegas, 2004.
[45] J. Wang et al., “An AVS-to-MPEG2 transcoding system” Proceedings of
2004 International Symposium on Intelligent Multimedia, Video and
Speech Processing , Hong Kong, pp. 302-305, Oct. 20-22, 2004.
[46] X. Wang et al., “Performance comparison of AVS and H.264/AVC
video coding standards” J. Comput. Sci. & Technol., vol. 21, No. 3, pp.
310-314, May 2006.
[47] B. Tang et al., “AVS encoder performance and complexity analysis
based on mobile video communication”, WRI International conference
on Communications and Mobile Computing, CMC ‘09, vol. 3, pp. 102107, 6-8 Jan. 2009.
[48] L. Yu et al., “Overview of AVS video coding standards,” Signal
Processing: Image Communication, vol. 24, pp. 263-276, April 2009.
JPEG 2000
[49] D. T. Lee, “JPEG 2000: Retrospective and new developments,” Proc.
IEEE, vol. 93, pp. 32-41, Jan. 2005
[50] P. Schelkens, A. Skodras and T. Ebrahimi, “The JPEG 2000 suite”,
Hoboken, NJ: Wiley, 2009.
[51] M. S. Zhong and Z. M. Ma, “JPEG 2000 based scalable reconstruction of
image local regions”, IEEE ISIMP 2001, Hong Kong, May 2001.
[52] C. Christopoulous, A. Skodras and T. Ebrahimi, “The JPEG 2000 still
image coding system: An overview,” IEEE Trans. on Consumer Electronics,
vol. 46, pp. 1103-1127, Nov. 2000.
[53] M. D. Adams, “The JPEG-2000 still image compression standard,” JPEG
Tutorial download from http://www.ece.uvic.ca/~mdadams/jasper
(also
software)
Symposium, Digital Image Processing, applications of digital
image processing XXIX, vol. 6321, San Diego, Aug. 2006.
JPEG-2000 open source software
http://www.ece.uvic.ca/~mdadams/jasper/
[54] A. Skodras, C. Christopoulous, and T. Ebrahimi, “JPEG-2000: The
upcoming still image compression standard,” Pattern Recognition Letters, vol.
25, pp. 1337 – 1345, 2001.
[55] T. Fukuhara et al, “Motion-JPEG2000 standardization and target
market,” IEEE ICIP . vol. 2, pp. 57-60, 2000.
[56] M. Rabbani and R. Joshi, “ An overview of the JPEG 2000 still image
compression standard, “ Signal Processing: Image Communication, vol. 17,
pp. 3-48, Jan. 2002.
[57] J. Hunter and M. Wylie, “JPEG2000 Image Compression: A real time
processing challenge,” Advanced Imaging, vol. 18, pp.14-17, and 43, April
2003.
[58] D.S. Taubman and M.W. Marcellin, “JPEG 2000: Image
compression fundamentals, standards and practice”, Kluwer,
2001.
[59] D. Marpe, V. George, and T. Weigand, “Performance
comparison of intra-only H.264/AVC HP and JPEG 2000 for
a set of monochrome ISO/IEC test images”, JVT-M014,
pp.18-22, Oct. 2004.
[60] D. Marpe et al, “Performance evaluation of motion
JPEG2000 in comparison with H.264 / operated in intracoding mode”, Proc. SPIE, vol. 5266, pp. 129-137, Feb. 2004.
[61] P. Topiwala, “Comparative study of JPEG2000 and
H.264/AVC FRExt I-frame coding on high definition video
sequences,” Proc. SPIE Int’l Symposium, Digital Image
Processing, San Diego, Aug. 2005.
[62] P. Topiwala, T. Tran and W. Dai, “Performance
comparison of JPEG2000 and H.264/AVC high profile intraframe coding on HD video sequences,” Proc. SPIE Int’l
JPEG XR (HD photo of Microsoft)
[63] T. Tran, L. Liu and P. Topiwala, “Performance
comparison of leading image codecs: H.264/AVC intra,
JPEG 2000, and Microsoft HD photo,” Proc. SPIE Int’l
Symposium, Digital Image Processing, San Diego, Sept.
2007.
JPEG <http://en.wikipedia.org/wiki/JPEG>
MJPEG<http://en.wikipedia.org/wiki/MJPEG>,
GENERAL
[64] D. A. Novik, J. C. Tilton and M. Manohar, "Compression through
decomposition into browse and residual images" Space and Earth
Science Data Compression Workshop, NASA CP-3191, edited by James
C. Tilton, Washington D.C., 1993.
Digital Video
DV <http://en.wikipedia.org/wiki/DV>
ATSC (advanced televiosn and systems committee) www.atsc.org
Special issue on global digital television technology and services, Proc
IEEE, vol. 94, Jan. 2006.
[65] M.S. Richer et al, “ The ATSC digital television sytem”, Proc IEEE,
vol. 94, pp. 37-43, Jan. 2006.
Download