29n117801

advertisement
INTERNATIONAL ORGANISATION FOR STANDARDISATION
ORGANISATION INTERNATIONALE DE NORMALISATION
ISO/IEC JTC 1/SC 29/WG 11
CODING OF MOVING PICTURES AND AUDIO
N11555
ISO/IEC JTC 1/SC 29/WG 11
Guangzhou, CN – October 2010
Source: Leonardo Chiariglione
Title:
Report of 94th meeting
Status
Report of 94th meeting .......................................................................................................................... 1
Annex A – Attendance list .................................................................................................................. 15
Annex B – Agenda .............................................................................................................................. 25
Annex C – Input contributions ............................................................................................................ 28
Annex D – Output documents ............................................................................................................. 53
Annex E – Requirements report .......................................................................................................... 57
Annex F – Systems report ................................................................................................................... 63
Annex G – Video report .................................................................................................................... 109
Annex H – Audio report.................................................................................................................... 266
Annex I – 3DG report ....................................................................................................................... 300
Report of 94th meeting
1
Opening
The 94th MPEG Meeting will take place from 11th to 15th October 2010 at Guangzhou Baiyun
International Convention Center, Oriental International Convention Hotel.
2
Roll call of participants
Annex 1 provides the list of participants
3
Approval of agenda
Annex 1 provides the adopted agenda
4
Allocation of contributions
Annex 1 provides the list of input contributions
1
5
Communications from Convenor
There was no specific communication.
6
Report of previous meeting
This could not be approved because of its unavailability due to some missing components.
7
Processing of NB Position Papers
These were presented and discussed. When appropriate responses were provided
8
Work plan management
8.1 Media coding
8.1.1 Frame Packing Arrangement Signalling in MPEG-2 Video
The following documents were approved
11612
11613
Request for ISO/IEC 13818-2:2000/Amd.4
Text of ISO/IEC 13818-2:2000/PDAM4 Frame Packing Arrangement Signalling in
MPEG-2 Video
8.1.2 Progressive High Profile
The following document was approved
11616
Text of ISO/IEC 14496-10:201x/PDAM1 Progressive High Profile
8.1.3 MPEG Frame-Compatible Enhancement (MFC)
The following documents were approved
11681
11682
Requirements for MPEG Frame-Compatible Enhancement (MFC)
Test Conditions and Evaluation Process for MFC
8.1.4 AFX 3rd edition
The following document was approved
11665
Description of 3DG Core Experiments
8.1.5 Multiresolution 3D mesh Coding
The following document was approved
11666
Working Draft 3.0 on Multiresolution Mesh Compression
8.1.6 Reconfigurable Graphics Coding
The following documents were approved
2
11667
11668
Working Draft 3.0 on Reconfigurable Graphics Coding
Reconfigurable graphics coding tutorial v2.0
8.1.7 Composite Font in Open Font Format
The following documents were approved
11571
11572
Evaluation Report
WD of ISO/IEC 14496-28 Composite Font
8.1.8 Contract Expression Language
The following documents were approved
11573
11664
WD of ISO/IEC 21000-20 Contract Expression Language
Description of CE on Contract Expression Language
8.1.9 Codec Configuration Representation
The following document was approved
11623
Working Draft of ISO/IEC 23001-4/Amd.1
8.1.10 Video Tool Library
The following documents were approved
11624
11625
11627
11628
Disposition of Comments on ISO/IEC 23002-4/FPDAM1
Text of ISO/IEC 23002-4/FDAM1 Software and Conformance Testing for Video
Tool Library
WD of ISO/IEC 23002-4/Amd.3 Functional Units for SVC Profiles
Description of Core Experiments in RVC
8.1.11 Spatial Audio Object Coding
The following document was approved
11657
Report on Spatial Audio Object Coding Verification Tests
8.1.12 Unified Speech and Audio Coding
The following documents were approved
11659
11660
Study on ISO/IEC 23003-3:201x/CD of Unified Speech and Audio Coding
Status and Workplan for USAC CEs
8.1.13 High-Efficiency Video Coding
The following documents were approved
3
11642
11629
11643
11645
Process of Test Model Development for HEVC
High Efficiency Video Coding (HEVC) Test Model 1 (HM 1) Encoder Description
High Efficiency Video Coding (HEVC) text specification Working Draft 1
Status of HEVC Reference Software Copyright Management in JCT-VC
8.1.14 Media Context and Control – Control Information
The following document was approved
11589
WD of ISO/IEC 23005-2 AMD 1 Additional Control Information and binary
representation
8.1.15 Media Context and Control – Sensory Information
The following document was approved
11590
WD of ISO/IEC FDIS 23005-3 AMD 1 Binary representation of sensory
information
8.1.16 Media Context and Control – Virtual World Object Characteristics
The following document was approved
11591
WD of ISO/IEC FDIS 23005-4 AMD 1 Binary representation of Virtual World
Object Characteristics
8.1.17 Media Context and Control – Data Formats for Interaction Devices
The following document was approved
11592
WD of ISO/IEC FDIS 23005-5 AMD 1 Additional sensors, interaction devices and
binary representation
8.1.18 Media Context and Control – Common Types and Tools
The following document was approved
11593
WD of ISO/IEC FDIS 23005-6 AMD 1 Binary representation of Common types and
tools
8.1.19 3D Video Coding
The following documents were approved
11678
11679
11630
11631
Applications and Requirements on 3D Video Coding
Draft Call for Proposals on 3D Video Coding Technology
Description of Exploration Experiments in 3D Video Coding
Report on Experimental Framework for 3D Video Coding
4
8.1.20 Low-Complexity Video Coding
The following document was approved
11673
Low-Complexity Video Coding – Draft Problem Statement
8.1.21 Future Audio
The following document was approved
11661
Audio for High Efficiency Video Coding
8.1.22 Option-1 Licensable Video Coding
The following document was approved
11676
Call for Evidence on Option-1 Video Coding Technology
8.2 Composition coding
8.2.1 Collaborative Applications by using MPEG Scene Description Standards
The following document was approved
11569
Technologies under Considerations for Collaborative Applications by using MPEG
Scene Description Standards
8.3 Description coding
8.3.1 Compact Descriptors for Visual Search
The following documents were approved
11675
11674
Compact Descriptors for Visual Search: Evaluation Framework (draft)
Compact Descriptors for Visual Search: Draft Call for Proposals
8.3.2 Extraction and Matching of Video Signature Tools
The following document was approved
11621
Study Text of ISO/IEC 15938-8/PDAM6 Extraction and Matching of Video
Signature Tools
8.3.3 AudioVisual Description Profile
The following document was approved
11677
ISO/IEC 15938-9:2005/PDAM 1
5
8.4 Transport and File formats
8.4.1 File formats
The following documents were approved
11570
11586
Request for ISO/IEC 14496-12:2008 AMD 3 DASH support and RTP reception hint
track processing
Text of ISO/IEC 14496-12:2008 PDAM 3 DASH support and RTP reception hint
track processing
8.4.2 MVC operation point descriptor
The following documents were approved
11560
11562
Study of ISO/IEC 13818-1:2007/FPDAM 6 Extension to AVC video descriptor and
signaling of operation points for MVC
WD of ISO/IEC 13818-1:2007/AMD 7 Signalling of stereoscopic video in MPEG-2
Systems
8.4.3 Dynamic Adaptive Streaming over HTTP
The following documents were approved
11577
11578
11579
Request for the subdivision of ISO/IEC 23001-6
Text of ISO/IEC CD 23001-6 Dynamic Adaptive Streaming over HTTP
Description of Evaluation Experiments on Dynamic Adaptive Streaming over
HTTP
8.5 Multimedia architecture
8.5.1 SC3DMC for 3DGCM
The following document was approved
11669
Text of 2nd Edition of ISO/IEC 14496-25 FDIS 3DG compression model
8.5.2 MPEG-V Use cases
The following document was approved
11588
WD of ISO/IEC 23005-1 AMD 1 Additional Use Cases
8.5.3 MPST Architecture and Technologies
The following document was approved
11581
Text of ISO/IEC CD 23001-1 2nd edition Multimedia Service Platform
Technologies – Architecture
6
8.5.4 MXM API
The following document was approved
11582
Text of 2nd edition of ISO/IEC CD 23006-2 MPEG Extensible Middleware API
8.5.5 Elementary Services
The following document was approved
11583
Text of 2nd edition of ISO/IEC CD 23006-4 Elementary Services
8.5.6 Service Aggregation
The following documents were approved
11692
11584
Request of subdivision of ISO/IEC 23006-5 Service Aggregation
Text of ISO/IEC CD 23006-5 Service Aggregation
8.5.7 Widget Extensions
The following documents were approved
11598
11585
Request for ISO/IEC 23007-1 AMD 1 Widget Extensions
Text of ISO/IEC 23007-1 PDAM 1 Widget Extensions
8.6 Application formats
8.6.1 Interactive Music AF dynamic volume change
The following document was approved
11599
Description of CE on dynamic volume change
8.7 Reference implementation
8.7.1 BIFS ExtendedCore2D Profile Reference Software
The following documents were approved
11567
11568
Request for ISO/IEC 14496-5:2001/AMD 30 Reference software for BIFS
ExtendedCore2D Profile
Text of ISO/IEC 14496-5:2001/PDAM 30 Reference software for BIFS
ExtendedCore2D Profile
8.7.2 LASeR adaptation tools Reference Software
The following documents were approved
11565
DoC on ISO/IEC 14496-5:2001/FPDAM 28 Reference software for LASeR
adaptation tools
7
11566
Text of ISO/IEC 14496-5:2001/FDAM 28 Reference software for LASeR adaptation
tools
8.7.3 Video Signature Tools Reference Software
The following documents were approved
11617
11618
DoC on ISO/IEC 15938-6:2001/PDAM4
Text of ISO/IEC 15938-6:2001/FPDAM4 Reference Software for Video Signature
Tools
8.7.4 DASH Reference Software
The following document was approved
11580
Workplan of Reference Software for DASH
8.7.5 Stereoscopic video AF Reference Software
The following document was approved
11575
Text of ISO/IEC 23000-11 FDAM 1 Conformance and Reference Software
8.7.6 MPEG Rich Media UI Reference Software
The following document was approved
11587
Study of ISO/IEC FCD 23007-3 Conformance and Reference Software
8.7.7 Media Context and Control – Reference Software
The following document was approved
11594
Study of ISO/IEC FCD 23005-7 Conformance and Reference Software
8.8 Conformance
8.8.1 BIFS ExtendedCore2D Profile Conformance
The following documents were approved
11563
11564
Request for ISO/IEC 14496-4:2004/AMD 40 Conformance for BIFS
ExtendedCore2D Profile
Text of ISO/IEC 14496-4:2004/PDAM 40 Conformance for BIFS ExtendedCore2D
Profile
8.8.2 Video Signature Tools Conformance
The following documents were approved
8
11619
11620
DoC on ISO/IEC 15938-7:2001/PDAM6
Text of ISO/IEC 15938-7:2001/FPDAM6 Conformance for Video Signature Tools
8.8.3 Stereoscopic video AF Conformance
The following document was approved
11575
Text of ISO/IEC 23000-11 FDAM 1 Conformance and Reference Software
8.8.4 Interactive Music AF Conformance
The following documents were approved
11575
11576
Study of ISO/IEC 23000-12 FPDAM 1 Conformance and Reference Software
Workplan of ISO/IEC 23000-12 Conformance and Reference Software
8.8.5 MPEG Rich Media UI Conformance
The following document was approved
11587
Study of ISO/IEC FCD 23007-3 Conformance and Reference Software
8.8.6 Media Context and Control – Conformance
The following document was approved
11594
Study of ISO/IEC FCD 23005-7 Conformance and Reference Software
8.9 Maintenance
8.9.1 Systems coding standards
The following document was approved
11561
List of errata items of ISO/IEC 13818-1:2007
8.9.2 Video coding standards
The following documents were approved
11614
11615
11622
Text of ISO/IEC 14496-5:2001/Amd.5 DCOR1
Defect report on ISO/IEC 14496-10:201x
Text of ISO/IEC 23001-4/DCOR1
8.9.3 Audio coding standards
The following documents were approved
11654
11655
ISO/IEC 13818-4:2004 DCOR 2, AAC Intensity Stereo
ISO/IEC14496-26:2010/Cor 2 BSAC Corrections
9
11656
11658
9
ISO/IEC 14496-26:2010/DCOR 3, Intensity Stereo
Defect Report of ISO/IEC 23003-2:2010, Spatial Audio Object Coding
Organisation of this meeting
9.1 Tasks for subgroups
The following tasks were assigned
Requirements Std
Systems
Pt Amd
2
4
2
10
28
7
3
Std
2
4
7
21
A
B
M
V
Pt Amd
1 5
7
10 6
5 ?
22 1
28
12 Amd2
2 Amd1
4 Amd4
8 Amd1
6 Cor
11 1
2
12 1
6 1
1
2
3
4
5
1
2
3
4
5
6
Low complexity video codec
Stereo video
Stereo video
Composite Font Format
3DV
MMT – CfP, Requirements, Context & Objectives, Use Cases
Compact Descriptors for Visual Search
MPEG-V extensions
MPEG-V Profiles
Type 1 licensing codec
New standard areas
 Audio for HVC
Carriage of JPEG2000
Signaling of Stereoscopic Video
MVC operation point descriptors
SVC FF RS
Open Font Format
Composite Font Format
Semantic enhancement
PDI
Protection of presentation element
MVCO RS
Professional Archival AF
Stereoscopic video AF RS & C
Additional composition type
Interactive music AF RS & C
DASH
Architecture
MXM API
RS & C
Elementary Services
Aggregated Services
Architecture
Control Information
Sensory information
Virtual world object characteristics
Data representation for interaction devices
Common types and tools
10
7
U
2
3
Conformance and reference software
MPEG-V Profiles
Advanced input device interface
Reference Software and Conformance
Video
2
4
4
4
7
2
10
10
10 Cor 1
3 ?
6
7
8
4 1
2
Stereo video
Stereo video
AVC profiles
Miscellanea
Compact Descriptors for Visual Search
Video Signature Tools RS
Video Signature Tools Conf
Extraction and Use of MPEG-7 Descriptions
Video Tool Library Conformance & RS
Video Tool Library extensions
3DV/FTV
HEVC
26 Cor
BSAC for broadcasting
2
Spatial Audio Object Coding
2 Amd1 Spatial Audio Object Coding C & RS
3
USAC
New audio issues (HEVC)
5 27
Scalability complexity 3DMC RS
16 2
Mesh representation supporting multiple attributes
3
Multiresolution 3DMC
27 2
Scalable complexity 3DMC Conformance
3
Scalable complexity 3DMC in part 25 Conformance
Reconfigurable Graphics Coding
C
H
Audio
D
3DG
4
9.2 Joint meetings
The following joint meetings were held
Groups
S, R
R, V
S, R, V, A
E, V
S, 3
R, V
R, V
V, VCEG
R, 3, S
R, A
3, V
S, 3
R, V
R, V
V, VCEG
What
Font Format
Type 1 licensing
MP7 Profiles
RGC
MPEG-V
Stereo Video Enhancement
Visual search CfP
AVC issues
User description in MPEG-V
Audio for HEVC
RVC & RGC
MPEG-V/U
Frame compatible stereo enhancement
Visual search CfP
AVC issues
11
Day
Mon
Mon
Tue
Tue
Tue
Tue
Tue
Tue
Tue
Wed
Wed
Thu
Thu
Thu
Thu
Time1
16:00
16:30
09:00
09:00
14:00
14:00
16:00
17:00
17:30
14:00
15:00
09:00
11:00
12:00
14:00
Time2
16:30
18:00
10:00
10:00
17:30
15:00
17:00
18:00
18:00
15:00
16:00
10:00
12:00
13:00
15:00
Where
R
R
R
3
3
V
R
J
3
A
3
3
R
R
V
10 WG management
10.1 Terms of reference
The following document was approved
11600
Terms of reference
10.2 Liaisons
10.2.1 Letters
The following documents were approved
11680
11635
11636
11637
11638
11639
11640
11641
11663
11694
11695
11696
Liaison letter to ITU-T SG 16 on Telepresence Systems
Liaison statement to SC 6 on MMT
Liaison statement to 3GPP on DASH
Liaison statement to JTC 1/WG 7 on Sensor Networks
Liaison statement to IEC TC 100 on the definition of “trick mode”
Liaison statement to JTC 1/SC 24 on “zip” format
Liaison statement to IEC TC 100 on IEC CDV 62516-2, BIFS in T-DMB receiver
Liaison statement to IEC TC 100 on Richmedia for e-Book
Statement of benefits from establishing a Category C liaison with DTG
Liaison Statement to 3GPP TSG SA4
Liaison Statement to Wireless Gigabit Alliance
Liaison Statement to ISO/IEC JTC 1/SC 37
10.2.2 Templates
The following documents were approved
11683
11632
11633
11634
Liaison statement template on MPEG frame-compatible activities
Liaison statement template on DASH
Liaison statement template on MSPT
Liaison statement template on signalling of stereoscopic video in MPEG-2 System
10.2.3 Statement of benefits
The following document was approved
11697
Statement of benefits from establishing a Category C liaison with Wireless Gigabit
Alliance
10.2.4 List of liaisons
The following document was approved
11611
List of Organisations with which MPEG entertains liaisons
12
10.3 Ad hoc groups
The following document was approved
11554
Ad hoc groups established at Guangzhou
10.4 Asset management
The following documents were approved
11605
11606
11607
11608
11609
Schema assets
Software assets
Conformance assets
Content assets
URI assets
10.5 Facilities fees
The following document was approved
11672
MPEG and facilities fees
10.6 IPR management
The following document was approved
11610
Standards under development for which a call for patent statements is issued
10.7 Work plan and time line
The following documents were approved
11601
11602
11603
MPEG Standards
Table of unpublished FDISs
Work plan and time line
11 Administrative matters
11.1 Schedule of future MPEG meetings
The following schedule of future MPEG meetings was approved
#
94
95
96
97
98
99
100
City
Guangzhou
Daegu
Geneva
Torino
Geneva
San José, CA
Geneva
Country
CN
KR
CH
IT
CH
US
CH
13
yy
10
11
11
11
11
12
12
mm
10
01
03
07
11-12
02
04-05
dd-dd
11-15
24-28
21-25
18-22
28-02
06-10
30-04
101 Stockholm
102 Suzhou?
103 Geneva
SE
CN
CH
12 07
12 10
13 01
16-20
15-19
??-??
11.2 Promotional activities
The following documents were approved
11691
11595
11596
11557
MPEG-V awareness day Call for Participation
Storyboard for MPEG-U promotion video
Short descriptions
Guangzhou press release
12 Resolutions of this meeting
These were approved
13 A.O.B.
There was no other business
14 Closing
The meeting closed at 2010/10/15T20:20 with thanks to the hosting organisation
14
Annex A – Attendance list
Name
Michael Grafl
Christian Timmerer
Mohamad Raad
Kenneth Vermeirsch
Jan De Cock
Marc Jacobs
Joeri Barbarien
Louis Zhang
Jian Wang
Gaelle Martin-Cocher
Dake He
Panos Nasiopoulos
Mahsa Pourazad
Roch Lefebvre
Feng Pan
Philippe Gournay
Bruno Bessette
Nikolce Stefanoski
Marco Mattavelli
Touradj Ebrahimi
Jean-Pierre EVAIN
Zongbiao Niu
Carmen Cheng
Qiao Shujuan
Wenpeng Ding
Hui Liu
Sijia Chen
Lai-Man Po
Xiaoran Cao
Yun He
Hongzhang Yang
Jinahua Zheng
Yu Liu
Xing WEN
Oscar Au
Hongbo Zhu
JING WANG
Shanfu Lee
Wei Xiao
Yang Mingyuan
Qiu Shen
Sixin Lin
Changcai Lai
Yongbing Lin
Peiyu YUE
Affiliation
Klagenfurt University
lagenfurt University
RaadTech Consulting
IBBT - Ghent University
IBBT - Ghent University
IBBT - VUB
IBBT - VUB
AMD
Polycom Inc.
Reasearch In Motion Limited
Research In Motion Ltd.
University of British Columbia
University of British Columbia
University of Sherbrooke
ViXS Systems Inc.
VoiceAge Corporation
VoiceAge Corporation
Disney Research, Zurich
EPFL
EPFL
European Broadcasting Union
ASTRI
Beihang University
Beijing University of Technology
Cisco
Cisco
City University of Hong Kong
Dept. of Electronic Engineering, Tsinghua University
Dept. of Electronic Engineering, Tsinghua University
Freescale Semiconductor China
HiSilicon Technologies Co. Ltd,
HK-ASTRI
HONG KONG UNIV. OF SCIENCE & TECHNOLOGY
HONG KONG UNIV. OF SCIENCE & TECHNOLOGY
Hongbo Zhu
HUAWEI
Huawei
Huawei Technologies
HUAWEI TECHNOLOGIES CO., LTD
Huawei Technologies Co., Ltd
Huawei Technologies Co., Ltd
Huawei Technologies Co., Ltd
Huawei Technologies Co., Ltd
Huawei Technologies Co., Ltd
15
country
AT
AT
AU
BE
BE
BE
BR
CA
CA
CA
CA
CA
CA
CA
CA
CA
CA
CH
CH
CH
CH
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
Pulin Wang
Jin Song
Haitao Yang
weizhong Chen
Yunxuan Zhao
Xiaozhen Zheng
Zhang Shaobo
Lidong Xu
Weimin(Wilma) Su
Shawmin Lei
Yu-Wen Huang
Jicheng An
Ching-Yeh Chen
Chih-Wei Hsu
Yu-Lin Chang
Xun Guo
Jizheng Xu
Jianxin Yan
Feng Shao
Tiejun Huang
Siwei MA
Li Zhang
Lingyu Duang
Wen Gao
An Ping
Song Li
ZHAO Haiwu
Chenchen Gu
Xunan Mao
Qingyi Zhao
Xuemei Yan
Feng Cen
Min Lu
Yuan Yuan
Qifei Wang
Yongbing Zhang
Xiangyang Ji
Junyan Huo
Yanzhuo Ma
Wei Li
Fuzheng Yang
Lu YU
Zhang Wen
Dong Wang
Yingjie Hong
Ming Li
Zhuping Li
Fang Wang
HUAWEI TECHNOLOGIES CO., LTD.
Huawei Technologies Co., Ltd.
Huawei Technologies Co., Ltd.
Huawei technologies CO.,LTD
HUAWEI TECHNOLOGIES CO.,LTD
Huawei Technologies Ltd., Co
Huawei Techonologies Co. Ltd
Intel China Research Center
Intellectual Property & Standards China, Philips
MediaTek
MediaTek
MediaTek
MediaTek
MediaTek
MediaTek
MediaTek (Beijing) Inc.
Microsoft Research Asia
National Engineering Laboratory for audio Coding
Technology
Ningbo University
Peking University
Peking University
Peking University
Peking University
Peking Universtiy
School of Communication and Information Engineering
Shanghai jiaotong university
Shanghai University
Shenzhen Tencent Computer Systems Conpany Limited
Tencent Technology (Beijing) Company Limited
Tencent Technology (Shenzhen) Co.,Ltd
Tencent Technology (Shenzhen) Co.,Ltd
Tongji University
Tsinghua University
Tsinghua University
Tsinghua University
Tsinghua University
Tsinghua University
Xidian University
Xidian University
Xidian University
Xidian University
Zhejiang Unviersity
ZTE
ZTE Coporation
ZTE Corporation
ZTE Corporation
ZTE Corporation
ZTE Corporation
16
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
CN
Yanzi Yang
Ferenc Kraemer
Gerhard Tech
Benjamin Bross
Martin Winken
Karsten Suehring
Karsten Mueller
Detlev Marpe
Karsten Grueneberg
Haricharan Lakshman
Max Neuendorf
Markus Multrus
Julien Robilliard
Bernhard Grill
Ingo Hofmann
Terentiev Leon
Oliver Hellmuth
David VIRETTE
Joern Ostermann
Zhijie Zhao
Sven Klomp
Thomas Wedi
Matthias Narroschke
Virginie Drugeon
Thomas Stockhammer
Jens-Rainer Ohm
Gero Baese
Peter Amon
Aritz Sanchez de la Fuente
Thomas Wiegand
Oliver Wuebbolt
Alexander Glantz
Andreas Krutz
Shujun Li
Victor Rodriguez Doncel
Pablo Carballeira
Francisco Moran Burgos
Jaime Delgado
M. Oguz Bici
Jani Lainema
Dmytro Rusanovskyy
Kemal Ugur
Imed Bouazizi
Helmut Burklin
Guillaume LAROCHE
Patrice ONNO
Gordon Clare
Mickael Raulet
Laurent Guillo
ZTE Corporation
Dolby Germany GmbH
Fraunhofer Heinrich Hertz Institute
Fraunhofer HHI
Fraunhofer HHI
Fraunhofer HHI
Fraunhofer HHI
Fraunhofer HHI
Fraunhofer HHI
Fraunhofer HHI
Fraunhofer IIS
Fraunhofer IIS
Fraunhofer IIS
Fraunhofer IIS
Fraunhofer IIS
Fraunhofer Institut IIS
Fraunhofer Institut IIS
HUAWEI TECHNOLOGIES CO.,LTD.
Institut fiuer Informationsverarbeitung
Leibniz Universitaet Hannover
Leibniz University Hannover
Panasonic
Panasonic R&D Center Gemany
Panasonic R&D Center Germany
Qualcomm Incorporated
RWTH Aachen University
Siemens
Siemens AG
Technical University of Berlin
Technical University of Berlin
Technicolor
Technische University Berlin
Technische University Berlin
University of Konstanz
Universitat Politecnica de Catalunya
Universidad Politecnica de Madrid
Universidad Politécnica de Madrid
Universitat Politecnica de Catalunya
Nokia
Nokia
Nokia
Nokia
Nokia Research Center
AFNOR
CANON RESEARCH CENTRE FRANCE SAS
CANON RESEARCH CENTRE FRANCE SAS
France Telecom Orange Labs
IETR/INSA Rennes
INRIA Rennes
17
CN
DE
DE
DE
DE
DE
DE
DE
DE
DE
DE
DE
DE
DE
DE
DE
DE
DE
DE
DE
DE
DE
DE
DE
DE
DE
DE
DE
DE
DE
DE
DE
DE
DE
E
ES
ES
ES
FI
FI
FI
FI
FI
FR
FR
FR
FR
FR
FR
Ronan BOITARD
Marius Preda
Mihai Mitrea
Pierrick Philippe
Joel Jung
Sylvain Kervadec
Patrick Lopez
Edouard Francois
Mary-Luc Champel
Jean-Claude Dufourd
Jean LE FEUVRE
Avraham Shimor
Sergio Matone
Leonardo Chiariglione
Alessandra Mosca
Giovanni Cordara
Masato Shima
Kimihiko Kazui
Jumpei Koyama
Tomokazu Murakami
Yukiko Ogura
Shuichi Aoki
Atsuro Ichigaya
Shinichi Sakaida
Shigeru Fukushima
Ueda Motoharu
Tomonobu Yongbing
Kazuo Sugimoto
Tokumichi Murakami
Kohtaro Asai
Shun-ichi Sekiguchi
Masayuki Tanimoto
Meindert Wildeboer
Taka Senoh
Hirofumi Aoki
Chono Keiichi
Kenta SENZAKI
Kota Iwamoto
Masanori Sano
Yukihiro Bandoh
Takehiro Moriya
Yoshihide Tonomura
Shinya Shimizu
Shohei Matsuo
Seishi Takamura
Tsutsumi Kimitaka
Thiow Keng Tan
INRIA Rennes
Institut TELECOM
Institut Télécom; Télécom Sudparis, ARTEMIS
Department
Orange Labs
Orange Labs
Orange Labs
Technicolor
technicolor
Technicolor
Telecom ParisTECH
Telecom ParisTech
SanDisk Corporation
CEDEO.net
CEDEO.net
CSP s.c.a r.l.
Telecom Italia Lab
Canon Inc
FUJITSU LABORATORIES LTD.
FUJITSU LABORATORIES LTD.
Hitachi, Ltd
IPSJ/ITSCJ
Japan Broadcasting Corporation
Japan Broadcasting Corporation (NHK)
Japan Broadcasting Corporation (NHK)
JVC KENWOOD Holdings, Inc
JVC KENWOOD Holdings, Inc
KDDI R&D Laboratories Inc.
Mitsubishi Electric Corporation
Mitsubishi Electric Corporation
Mitsubishi Electric Corporation
Mitsubishi Electric Corporation
Nagoya University
Nagoya University
National Institute of Information and Communications
Technology
NEC Corporation
NEC Corporation
NEC Corporation
NEC Corporation
NHK
NTT
NTT
NTT
NTT Corporation
NTT Corporation
NTT Cyber Space Laboratories
NTT DOCOMO
NTT DOCOMO, Inc
18
FR
FR
FR
FR
FR
FR
FR
FR
FR
FR
FR
IL
IT
IT
IT
IT
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
Akira Fujibayashi
Kei Kikuiri
Zhixiong Wu
Takuyo Kogure
TAKAHIRO NISHI
Hisao Sasai
Youji Shibahara
Takeshi Norimatsu
Sugio Toshiyasu
Seiji Mochizuki
Tadashi Uchiumi
Yasuaki Tokumo
Tomoyuki Yamamoto
Jun Yonemitsu
Shinobu Hattori
Kazushi Sato
Teruhiko SUZUKI
Yasuaki Yamagishi
Kenji Kondo
Toru Chinen
Masayuki Nishiguchi
Ikuo Tsukagoshi
Toshiaki Fujii
Itaru Kaneko
AKIYUKI TANIZAWA
Tomoo Yamakage
Takeshi Chujoh
Shiodera Taichiro
Jin Xin
Kim Daeyoon
Sungmoon Chun
Seyoon Jeong
JIN SOO CHOI
Bang Gun
Miran Choi
Jin Young Lee
Inseon Jang
Kyeongok KANG
WEON GEUN OH
Jung Won Kang
Taejin Lee
Seungkwon Beack
Sang-il Na
Kugjin Yun
YOUNGKWON LIM
Sung-Chang Lim
Wonsuk Lee
Won-Sik CHEONG
NTT DOCOMO, Inc
NTT DOCOMO, INC.
Oki Electric Industry Co., Ltd.
Panasonic
Panasonic
Panasonic Corporation
Panasonic Corporation
Panasonic Corporation
Panasonic Corporation
Renesas Electronics Corporation
Sharp corporation
Sharp corporation
Sharp corporation
Sony
Sony Corp
Sony Corp
Sony Corp.
Sony Corp.
Sony corporation
Sony Corporation
Sony Corporation
Sony Corporation, HE Business Group, HE Development
Division
Tokyo Institute of Technology
Tokyo Polytechnic University
TOSHIBA Corporation
Toshiba Corporation
Toshiba Corporation
Toshiba Corporation
Waseda Univ.
ECT Inc.
Electronics and Telecommunications Research Institute
Electronics and Telecommunications Research Institute
Electronics and Telecommunications Research Institute
ETRI
ETRI
ETRI
ETRI
ETRI
ETRI
ETRI
ETRI
ETRI
ETRI
ETRI
ETRI
ETRI
ETRI
19
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
JP
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
Hokyom Kim
Seung Wook Lee
Eun Seo Lee
Kwang Roh Park
Seong Yong Lim
YONGJU CHO
Cong Thang Truong
Bumsuk Choi
SangHyun Joo
Hui Yong KIM
Chang Yul Kim
KIHUM HAN
Seonghoon Kim
Inkwon Kim
YO-SUNG HO
Jeha Ryu
Haechul Choi
Kiho Choi
Minsoo Park
Euee S. Jang
Hyunok Oh
Tae-young Jung
Kibaek Kim
Jechang Jeong
Junwoo Lee
Dong Seok Jeong
WEN-HSIAO PENG
Munchurl Kim
Bumshik Lee
IN JOON CHO
Min Uk Kim
CHEN YINGYING
Yungho Choi
KYOUNGRO YOON
Kwangki Kim
Euy-Doc Jang
Un Ki Park
Jae Keun Kwak
Kyung Won Kim
Xiaochao Qu
Soomin Lim
Yong-Goo KIM
Eunbin Oh
Woong Lim
Hochong Park
NeungJoo Hwang
Donggyu Sim
SEOUNG-JUN OH
Chulkeun Kim
ETRI
ETRI
ETRI
ETRI
ETRI
ETRI
ETRI
ETRI
ETRI
ETRI
Feelingk
Feelingk
Galaxia Communications Co., Ltd
Galaxia Communications Co., Ltd.
Gwangju Institute of Science and Technology
Gwangju Institute of Science and Technology (GIST)
Hanbat National University
Hanyang University
Hanyang University
Hanyang University
Hanyang University
Hanyang University
Hanyang university
Hanyang University
Inha Univ.
Inha University
ITRI International/NCTU
KAIST
KAIST
KBS Technical Research Institute
Konkuk University
KONKUK UNIVERSITY
Konkuk University
KONKUK UNIVERSITY
Korea Advanced Institute of Science And Technology
Korea Aerospace University
Korea Aerospace University
Korea Electronics Association
Korea Electronics Technology Institute
Korea University MSL
Korea University MSL
Korean German Institute of Technology (KGIT)
Korean Standards Association
Kwangwoon Univ.
Kwangwoon University
Kwangwoon University
Kwangwoon University
Kwangwoon University
Kyuanhee UNIV
20
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
Yong-Chul Park
KyuHeon Kim
JangWon Lee
JeonHo Kang
GilBok Lee
Heo Youngsu
Kim Kyungyong
Park Gwanghoon
Lee Yoonjin
Yongjoon Jeon
SUNGYONG YOON
Seungwook Park
Jaehyun Lim
Jaewon Sung
Sangoh Jeong
Byeong Moon Jeon
Ji Wook Jung
Sehoon Yea
Hyeon Jae Lee
Suh Doug Young
Lee Yonghun
Kyoung-Ho Choi
Sang-Kyun Kim
Jae-Gon Kim
Lai-Tee Cheok
Bo Gyeong Kang
Kihyun Choo
Ha Hojin
il koo Kim
Rhyu Sungryeul
Jaejoon Lee
Seungju Han
Elena Alshina
Woo-Jin Han
Song Jaeyeon
Kyong-Sok Seo
Hee Jean Kim
Jae Joon Han
Jianle Chen
Seok Lee
Minsu Ahn
Hwang Seo-Young
JeongHoon Park
JUNG SUK SUH
Kyungmo Park
Chanwon SEO
Jong-Ki Han
Hae Kwang Kim
Daeil Yoon
Kyung Hee University
Kyung Hee University
Kyung Hee University
Kyung Hee University
Kyung Hee University
Kyunghee University
Kyunghee University
Kyunghee University
Kyunghee University
LG Electronics
LG Electronics
LG Electronics
LG Electronics
LG Electronics
LG Electronics
LG Electronics
LG Electronics
LG Electronics
LG Electronics
Kyunghee university
Kyunghee university
Mokpo National University
Myongji University
Korea Aerospace University
Samsung
SAMSUNG
Samsung Advanced Institute of Technology
Samsung elec.
Samsung Electonics, Co., Ltd
Samsung Electonics, Co., Ltd
Samsung Electronics
Samsung Electronics
Samsung Electronics
Samsung Electronics
Samsung electronics
Samsung Electronics
Samsung Electronics
Samsung Electronics Advanced Institute of technology
Samsung Electronics Co.
Samsung Electronics Co. Ltd.
Samsung Electronics Co., Ltd
Samsung Electronics Co., LTD.
Samsung Electronics Co., Ltd.
Samsung Electronics Co.,Ltd.
Samsung Electronics Co.,Ltd.
Sejong University
Sejong University
Sejong University
Sejong University
21
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
Joo Hee Moon
Ju Ock Lee
Hyoungmee Park
Yung Lyul Lee
Hong Sung-Wook
Kim Jeong-Pil
CHO JAEHEE
KIM HYUNDONG
Jeongyeon Lim
Min Cheol Hong
Yang Jungyoup
Kwanghyun Won
Byeungwoo Jeon
Yong Han Kim
Song Jeongook
SANGYOUN Lee
Seongwan Kim
Lee Jaeseong
Soongi Hong
Jongho Kim
Henney Oh
Delphine Roussy
Fons Bruls
Werner Oomen
Jean H.A Gelissen
Arild Fuldseth
Marek Domanski
Krzysztof Wegner
Kristofer Kjoerling
Heiko Purnhagen
Kenneth Andersson
Rickard Sjoberg
Thomas Rusert
Per Frojdh
Yongwei Zhu
Zhengguo Li
Chuo Hao Yeo
Yih Han Tan
Haiyan Shu
Haishan Zhong
Viktor Wahadaniah
DAN ZHAO
ChongSoon Lim
Gerard Fernando
Ying Weng
Nikola Sprljan
Stavros Paschalakis
Ping Wu
Miroslaw Bober
Sejong University
Sejong University
Sejong University
Sejong University
Sejong University
Sejong University
Sejong University/DMS
Sejong University/DMS
SK Telecom
Soongsil University
Sungkyunkwa University
Sungkyunkwa University
Sungkyunkwa University
University of Seoul
Yonsei Univ.
Yonsei University
Yonsei University
Yonsei University
Yonsei University
Yonsei University
Yonsei University
Koninklijke Philips Electronics N.V.
Philips
Philips Applied Technologies
Philips Research Lifestyle Program
Cisco Systems Norway
Poznan University of Technology
Poznan University of Technology
Dolby R&D Sweden AB
Dolby R&D Sweden AB
Ericsson
Ericsson
Ericsson AB
Ericsson Research
Institute for Infocomm Research
Institute for Infocomm Research
Institute for Infocomm Research
Institute for Infocomm Research
Institute for Infocomm Research
Panasonic Singapore Laboratories
Panasonic Singapore Laboratories
Panasonic Singapore Laboratories Pte Ltd
Panasonic Singapore Laboratories Pte Ltd
British Broadcasting Corporation
Mitsubishi Electric R&D Centre Europe
Mitsubishi Electric R&D Centre Europe BV
Mitsubishi Electric R&D Centre Europe BV
Mitsubishi Electric R&D Centre Europe BV
22
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
KR
NL
NL
NL
NL
NO
PL
PL
SE
SE
SE
SE
SE
SE
SG
SG
SG
SG
SG
SG
SG
SG
SG
UK
UK
UK
UK
UK
UK
Ken McCann
Chad Fogg
Arianne Hinds
Walt Husak
Nenad Rijavec
David Singer
David Pope
Schuyler Quackenbush
Alexander Giladi
Flynn David
Marta Mrak
Thomas Davies
Yang Zhijie
BZ (Bazhong) Shen
Lei Zhang
Yong Yu
Wade Wan
Panos Kudumakis
Yian Xu
Huipin Zhang
Junlin Li
Frank Bossen
Alexandros Tourapis
Michael Biber
Yan Ye
John Ralston
Bill Zou
Pankaj Topiwala
Nam Ling
Ye-Kui Wang
Lingzhi Liu
Xin Wang
Yi-Jen Chiu
RAJENDRA
BOPARDIKAR
Yi-Shin Tung
HAOPING YU
Iraj Sodagar
Gary Sullivan
Andrew Tescher
Kilroy Hughes
Huifang Sun
Dong Tian
Robert Cohen
Anthony VETRO
Vladimir Levantovsky
Mandayam Narasimhan
Krit Panusopone
ZetaCast representing Samsung
ANSI
ANSI
ANSI
Apple
Aptina, LLC
Audio Research Labs
Avail-TVN
British Broadcasting Corporation
British Broadcasting Corporation
British Broadcasting Corporation
Broadcom
Broadcom Corp
Broadcom Corp
Broadcom Corp
BROADCOM CORPORATION
Centre for Digital Music, Queen Mary University of
London
Cisco Systems
Cisco Systems, Inc
Cisco Systems, Inc.
DOCOMO USA Labs
Dolby Laboratories
Dolby Laboratories
Dolby Laboratories Inc
Droplet Technology, Inc
DTS
FastVDO
Huawei (China) - Santa Clara University
Huawei Technologies
Huawei Technologies USA
Huawei Technologies, USA
Intel Corp
Intel Corp
ITRI USA/MStar Semiconductor
Media Technology Research Department, Corporate
Research
Microsoft
Microsoft Corp.
Microsoft Corporation
Microsoft Corporation
Mitsubishi Electic Research Labs
Mitsubishi Electic Research Labs
Mitsubishi Electric
Mitsubishi Electric
Monotype Imaging Inc
Motorola
Motorola
23
UK
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
Ajay Luthra
Faisal Ishtiaq
Shih-Ta Hsiang
Mark Watson
Justin Ridge
Lulin Chen
Jesus Sampedro
Marta Karczewicz
Ying Chen
Muhammed Coban
Xianglin Wang
Pengjun Huang
Peisong Chen
Yuriy REZNIK
In Suk Chong
Wei-Jung Chien
Daniel O'Loughlin
Felix Fernandes
Nhut Nguyen
Sanjeev Verma
WANG Lai
Ankur Saxena
Guichun Li
Andrew Segall
Lazar Bivolarski
Ali Tabatabai
Cheung Auyeung
Vivienne Sze
Minhua Zhou
Madhukar Budagavi
Jill Boyce
Michael Horowitz
Dzung Hoang
Motorola
Motorola Inc.
Motorola, Inc.
Netflix Inc.
Nokia
Omneon Video Networks
POLYCOM
Qualcomm
Qualcomm
Qualcomm
Qualcomm Inc
Qualcomm Inc.
Qualcomm Incorporated
Qualcomm Incorporated
Qualcomm Incorporated
Qualcomm Incorporated
Research in Motion Ltd
Samsung
Samsung
Samsung Electronics
Samsung Telecom America
Samsung Telecommunications America
Santa Clara University
Sharp Labs of America
Skype
Sony Electronics
Sony Elenctronics Inc
Texas Instruments
Texas Instruments Inc
Texas Instruments Inc.
Vidyo
Vidyo, Inc.
Zenverge, Inc.
24
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
US
Annex B – Agenda
#
#
#
Item
1
2
3
4
5
6
7
8
1
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
2
1
3
1
2
3
4
4
5
1
6
1
7
1
2
3
Opening
Roll call of participants
Approval of agenda
Allocation of contributions
Communications from Convenor
Report of previous meeting
Processing of NB Position Papers
Work plan management
Media coding
Scalable resolution enhancement of frame-compatible stereoscopic 3D video
AFX 3rd edition
Multiresolution 3D mesh Coding
Scalable-complexity 3D mesh compression
Efficient representation of 3D meshes with multiple attributes
Many-to-one range mappings in Open Font Format
Composite Font in Open Font Format
Video Tool Library
Spatial Audio Object Coding
Unified Speech and Audio Coding
High-Efficiency Video Coding
Media Context and Control – Control Information
Media Context and Control – Sensory Information
Media Context and Control – Virtual World Object Characteristics
Media Context and Control – Data Formats for Interaction Devices
Media Context and Control – Common Types and Tools
3D Video Coding
Contract Expression Language
Low-Complexity Video Coding
Composition coding
Advanced User Interaction
Description coding
Video Signature Descriptors
Compact Descriptors for Visual Search
Extraction and Matching of Video Signature Tools
MPQF semantic enhancement
Systems support
IPMP
Protection of Presentation Element
Digital Item
Presentation of Digital Item
Transport and File formats
File formats
SAOC transport
MVC operation point descriptor
25
4
5
6
7
8
9
10
11
12
9
1
2
10
Carriage of JPEG 2000 in MPEG-2 Systems
Sub-track selection & switching
Dynamic Adaptive Streaming over HTTP
MPEG Media Transport
Multimedia architecture
1 SC3DMC for 3DGCM
2 Architecture and Technologies
3 MXM API
4 Elementary Services
5 Service Aggregation
Application formats
1 Stereoscopic Video AF composition type
Reference implementation
1 SC3DMC Reference Software
2 Scalable Complexity 3D Mesh Coding in 3DG Compression Model Reference
Software
3 Video Signature Tools Reference Software
4 Stereoscopic video AF Reference Software
5 Interactive Music AF Reference Software
6 Video Tool Library Reference Software
7 MXM and AIT Reference Software
8 MPEG Rich Media UI Reference Software
9 Media Context and Control – Reference Software
Conformance
1 MPEG-4 Audio Conformance
2 3DG Conformance
3 Scalable Complexity 3DMC Conformance
4 3D Graphics Compression Model Conformance
5 Video Signature Tools Conformance
6 Stereoscopic video AF Conformance
7 Interactive Music AF Conformance
8 Video Tool Library Conformance
9 MXM and AIT Conformance
10 MPEG Rich Media UI Conformance
11 Media Context and Control – Conformance
Maintenance
1 Systems coding standards
2 Video coding standards
3 Audio coding standards
4 3DG coding standards
5 Systems description coding standards
6 Visual description coding standards
7 Audio description coding standards
8 MPEG-21 standards
9 MPEG-A standards
Organisation of this meeting
Tasks for subgroups
Joint meetings
WG management
26
1
2
3
4
5
6
7
8
1
2
3
4
9
10
11
1
2
12
13
14
Terms of reference
Officers
Editors
Liaisons
Responses to National Bodies
Work item assignment
Ad hoc groups
Asset management
Reference software
Conformance
Test material
URI
IPR management
Work plan and time line
Administrative matters
Schedule of future MPEG meetings
Promotional activities
Resolutions of this meeting
A.O.B.
Closing
27
Annex C – Input contributions
No.
Source
Title
17964 Webmaster
Guangzhou document register
17965 Yi-Shin Tung, Jens-Rainer Ohm
Ad Hoc on Maintenance of MPEG-4 Visual
related Documents, Reference Software and
Conformance
17966 Marco Mattavelli, Kazuo Sugimoto
Ad Hoc on Reconfigurable Video Coding
17967 Miroslaw Bober, Ryoma Oami
Ad Hoc on MPEG-7 Visual
17968 Karsten Müller, Anthony Vetro
Ad Hoc on 3D Video Coding
17969 Francisco Morán Burgos, Seung Wook Lee
Ad Hoc on 3DG documents, software
maintenance and core experiments
17970 Khaled Mamou, Minsu Ahn, Xiangyang Ji
Ad Hoc on Multi-Resolution 3D mesh Coding
17971 S. Quackenbush
Ad Hoc on Audio Standards Maintenance
17972 S. Quackenbush
Ad Hoc on SAOC, USAC, and Audio and
Systems Interactions
17973 Jaeyeon Song, Cyril Concolato
Ad Hoc on Scene Representation
17974 David Singer, Per Fröjdh
Ad Hoc on MPEG File Formats
17975 Kyuheon Kim, Kugjin Yun
Ad Hoc on Application Format
17976 Vladimir Levantovsky
Ad Hoc on Font Format Representation
17977 Xin Wang, Young Kwon Lim
Ad Hoc on Advanced IPTV Terminal
17978 Marius Preda
Ad Hoc on MPEG-V
17979 Xin Wang, Jaime Delgado
Ad Hoc on Contract Expression Language
17980 Iraj Sodagar, David Singer, Young Kwon Lim
Ad Hoc on Dynamic Adaptive Streaming over
HTTP
17981 Walt Husak, Ajay Luthra, Gary Sullivan
Ad Hoc on scalable resolution enhancement of
frame-compatible stereoscopic 3D video
17982 Thomas Davies, John Ralston
Ad Hoc on Lightweight Video Coding
17983 Yuriy Reznik, Giovanni Cordara, Miroslaw Bober
Ad Hoc on Compact Descriptors for Visual
Search
17984 Christian Timmerer
Ad Hoc on MPEG Media Transport (MMT)
17985 ITU-T SG 16 via SC 29 Secretariat
Liaison Statement from ITU-T SG 16 on
Approval of a New Question on Telepresence
systems [SC 29 N 11508]
17986 ITU-T SG 16 via SC 29 Secretariat
Liaison Statement from ITU-T SG 16 to SC 29
on Video coding collaboration [SC 29 N 11525]
17987 SC 29 Secretariat
Summary of Voting on ISO/IEC 1449626:2010/DCOR 2 [SC 29 N 11526]
17988 SC 37 via SC 29 Secretariat
ISO/IEC JTC 1/SC 37's CD and FCDs: FCD
19794-6.2, FCD 19794-9, FCD 19794-5.3, FCD
29109-6.2, FCD 19794-8.3 and CD 19794-11.4
[SC 29 N 11531]
17989 IEC TC 100 via SC 29 Secretariat
IEC CDV 62516-2 [SC 29 N 11538]
17990 WGA via SC 29 Secretariat
Liaison Statement from Wireless Gigabit
Alliance [SC 29 N 11539]
17991 3GPP via SC 29 Secretariat
Liaison Statement from 3GPP [SC 29 N 11540]
17992 SC 29 Secretariat
Summary of Voting on ISO/IEC 15938-
28
6:2003/PDAM 4 [SC 29 N 11543]
17993 DTG via SC 29 Secretariat
Liaison Statement from Digital TV Group [SC
29 N 11551]
17994 ITTF via SC 29 Secretariat
Table of Replies on ISO/IEC 1449626:2010/FDAM 2
17995 SC 29 Secretariat
Summary of Voting on ISO/IEC 1449625:2009/FPDAM 1
17996 SC 29 Secretariat
Summary of Voting on ISO/IEC 159387:2003/PDAM 6
17997 ITTF via SC 29 Secretariat
Table of Replies on ISO/IEC FDIS 23003-2
17998 ITTF via SC 29 Secretariat
Table of Replies on ISO/IEC 1449611:2005/FDAM 7
17999 ITTF via SC 29 Secretariat
Table of Replies on ISO/IEC FDIS 23007-1
18000 ITTF via SC 29 Secretariat
Table of Replies on ISO/IEC 144963:2009/FDAM 2
18001 ITTF via SC 29 Secretariat
Table of Replies on ISO/IEC 144961:2010/FDAM 1
18002 ITTF via SC 29 Secretariat
Table of Replies on ISO/IEC 159383:2002/FDAM 4
18003 SC 29 Secretariat
Summary of Voting on ISO/IEC 144965:2001/FPDAM 26
18004 SC 29 Secretariat
Summary of Voting on ISO/IEC 1449627:2009/FPDAM 3
18005 SC 29 Secretariat
Summary of Voting on ISO/IEC 144965:2001/FPDAM 28
18006 3GPP via SC 29 Secretariat
Liaison Statement from 3GPP
18007 Waqar Zia
DASH Evaluation Experiment 5: Delivery
Format of MPEG-2 TS
18008
Report of DASH Evaluation Experiment 9,
Delivery Format to File Format Conversion
(DFC)
Ye-Kui Wang, Sungryeul Rhyu, Miska M.
Hannuksela
18009 Jin Young Lee, Harry Pyle
DASH Evaluation Experiment #1: Compositions
of Media Presentation (CMP) Proposal
Comparison
18010 Jin Young Lee
DASH Evaluation Experiment #2: MPD
Modification, Corrections, and Extension
18011 Nhut Nguyen, Harry Pyle
DASH Evaluation Experiment #8, Clock Drift
Control
18012 Gerard Fernando, Kyungmo Park
DASH Evaluation Experiment #4: Delivery
Format Addressing
18013 Waqar Zia, Ying Chen, Yago Sanchez
DASH Evaluation Experiment #3:
Enhancement to Representation Description
(ERD)
18014 Ying Chen, Ingo Hofmann
DASH Evaluation Experiment #7: Seamless
Switching (SES)
18015 Thomas Stockhammer
Updated Information on Reference Software for
DASH
18016 Ye-Kui Wang, Thomas Stockhammer
EE#6: Trick Mode and Random Access
Signaling (TRA)
18017 Mark Watson
Some comments on DASH EE#7 – Seamless
Switching
29
18018 Kilroy Hughes, Quintin Burns
Proposed Amendment: Support for Common
Encryption
18019 Kilroy Hughes, Quintin Burns
Proposed Amendment: Independently
Decodable Fragments
18020 JTC 1/WG 7 via SC 29 Secretariat
Liaison Statement from ISO/IEC JTC 1/WG 7
[SC 29 N 11560]
18021 IEC TC 100 via SC 29 Secretariat
Liaison Statement from IEC TC 100 [SC 29 N
11561]
18022 JTC 1/SC 34 via SC 29 Secretariat
Liaison Statement from JTC 1/SC 34 [SC 29 N
11563]
18023
Bum Suk Choi, Eunseo Lee, Jong Hyun Jang,
Kyoungro Yoon, Jonghyung Lee, Ying Ying Chen
Update of Reference SW for MPEG-V Part 2
18024
Eunseo Lee, Bum Seok Choi, Kwang Roh Park,
Kyoungro Yoon, Jonghyung Lee, Ying Ying Chen
Update of Reference SW for MPEG-V Part 5
Proposal of GPS Sensor Capability Type,
Altitude Sensor Capability Type, and Global
Position Capability Type
18025 Kyoungro Yoon, Doohyun Kim, Min-Uk Kim
18026
3DV/FTV EE results of depth estimation and
Gun Bang, Neungjoo Hwang, Donggyu Sim, Woong
coding experiment on ‘Lovebird1’
Lim, Gi-Mun Um, Won-Sik Cheong, Namho Hur
sequence
18027
Kyoungro Yoon, Soohan Kim, Woo Chool Park, Hae Proposal of Header information for binary
Moon Seo
transmission of MPEG-V information
18028 K. Kazui, J. Koyama, A. Nakagawa (Fujitsu)
Evaluation result of JCTVC-B031
18029 A. G. Tescher for USNB
USNB Contribution: WG11 resolution 2.1.4
consideration
18030 A. G. Tescher for USNB
USNB Contribution: WG11 resolution 2.2.2
consideration
18031 A. G. Tescher for USNB
USNB Contribution: WG 11 resolution 3.3.2
consideration
18032 A. G. Tescher for USNB
USNB Contribution: Suggested practices for
the HEVC project in the JCT-VC
18033 A. G. Tescher for USNB
USNB Contribution: HEVC standardization
timeline
18034 A. G. Tescher for USNB
USNB Contribution: Option-1 video codec
consideration
18035 T. Murakami, T. Yokoyama, K. Nakamura (Hitachi)
TE1: Cross-check result of DMVD proposal
JCTVC-C124 (Mitsubishi)
18036 Fuzheng Yang, Wei Li, Shuai Wan, Yilin Chang
CTB splitting on frame boundary for arbitrary
resolution video
18037 F. Yang, W. Li, S. Wan, Y. Chang (Xi'dian Univ.)
Proposal of frame parameters FLCTB and
FSCTB for video content fitting
18038 Jungsun Kim
TE7: Verification results of Peking Univ.
proposal (mode-dependent residual reordering
for intra prediction residual)
18039 J. Kim, B. Jeon (LG Electronics)
TE6: Verification results of Huawei and
Hisilicon proposals (short distance intra
prediction using the correlation between lines
and pixels)
18040 Purvin Pandit
TE12: Report on 64x64 versus 32x32
maximum coding unit size
18041 G. J. Sullivan, J.-R. Ohm (co-chairs)
JCT-VC AHG report: JCT-VC project
management
30
18042
K. McCann (chair), M. Karczewicz, J. Ridge, S.
Sekiguchi, T. Wedi, T. Wiegand (vice-chairs)
JCT-VC AHG report: Test Model under
Consideration (TMuC) editing
18043
F. Bossen, P. Chen, D. Flynn, W.-J. Han, K.
Sühring, H. Schwarz, K. Ugur
JCT-VC AHG report: Software development
and TMuC software technical evaluation
18044 R. Cohen, R. Joshi
18045
JCT-VC AHG report: Alternative transforms
T. Yamakage (chair), Y. J. Chiu, M. Karczewicz, M.
Narroschke (co-chairs)
JCT-VC AHG report: In-loop and postprocessing filtering
18046 K. Panusopone, M. Budagavi, W.-J. Han, D. He
JCT-VC AHG report: Large block structures
18047 K. Chono (chair), T. Chujoh, C. S. Lim (vice-chairs)
JCT-VC AHG report: Memory compression
18048 M. Budagavi, A. Segall
JCT-VC AHG report: Parallel entropy coding
18049 J. Xu, W. Ding
JCT-VC AHG report: Screen content coding
18050 D. Alfonso, J. Ridge, X. Wen
JCT-VC AHG report: Complexity assessment
18051
E. Francois (Technicolor), L. Guillo (INRIA), A.
Ichigaya (NHK), H. Yu (Huawei)
18052
E. Francois, D. Thoreau, P. Bordes (Technicolor), L. TE3 – Subtest 2: Report on Simplified
Guillo (INRIA),
Geometry Block Partitioning
18053
Yoshiaki Shishikui, Yasutaka Matsuo, Atsuro
Ichigaya, Kazuhisa Iguchi, Shinichi Sakaida (NHK)
TE12: Report on AMP evaluation
Characteristics of Super Hi-Vision test
sequences
18054 S. Park, J. Park, B. Jeon
TE3: Motion compensation with adaptive
warped reference
18055 E. Francois, P. Bordes (Technicolor)
TE3 – Subtest 2: Cross-check of results from
Huawei
18056 R. Cohen, A. Vetro, H. Sun (Mitsubishi)
TE7: Cross-verification of the 1D Directional
Unified Transform in TMuC
18057 R. Cohen, A. Vetro, H. Sun (Mitsubishi)
Alternative performance measurement of
MDDT and ROT TMuC
18058 C. Yeo, Y. H. Tan, Z. Li, S. Rahardja (I2R)
TE7: Results for mode-dependent fast
separable KLT for block-based intra coding
18059 C. Yeo, Y. H. Tan, Z. Li, S. Rahardja (I2R)
TE7: Cross-check results of MDDT
simplification proposal from Huawei
18060 C. Yeo, Y. H. Tan, Z. Li, S. Rahardja (I2R)
Choice of transforms in MDDT for unified intra
prediction
18061 C. Yeo, Y. H. Tan, Z. Li, S. Rahardja (I2R)
Chroma intra prediction using reconstructed
luma
18062
A. Fujibayashi, S. Kanumuri, F. Bossen, T. K. Tan
(NTT DoCoMo)
TE12: Performance of partition based
illumination compensation (PBIC)
18063 T. K. Tan, Frank Bossen (NTT DoCoMo)
TE5: Results for simplification of unified intra
prediction
18064 Y. Suzuki, F. Bossen (NTT DoCoMo)
TE11: Simulation results of merge/skip (3.2b
and 3.2d)
18065 F. Bossen, T. K. Tan (NTT DoCoMo)
TE12: Results for experiments on max CU size,
RDOQ and AIS
18066 T. K. Tan, Junya Takiue (NTT DoCoMo)
TE6.a: Cross verification for line based intra
prediction
18067
T. K. Tan (NTT DoCoMo), M. Budagavi (TI), J.
Lainema (Nokia)
TE5: Summary report on simplification of
unified intra prediction
18068 B. Bross (Fraunhofer HHI)
TE12.4: Transform skip flag proposed by
Fraunhofer HHI (off vs. on)
18069 B. Bross (Fraunhofer HHI)
TE12.6: PIPE and LCEC tested against
CABAC by Fraunhofer HHI
31
18070 B. Bross
TE12.4: Transform coding HHI tested against
Samsung proposal by Fraunhofer HHI
18071 B. Bross (Fraunhofer HHI)
TE12.2: Interpolation filter SIFO and DIF tested
against MOMS by Fraunhofer HHI
18072 B. Bross (Fraunhofer HHI)
TE12.6: PIPE tested against TENVLC by
Fraunhofer HHI
18073 B. Bross (Fraunhofer HHI)
TE12.1: Block merging (MRG) test (off vs. on)
by Fraunhofer HHI
18074 B. Bross (Fraunhofer HHI)
TE12.3: Adaptive intra smoothing (AIS) test
(slow vs. fast) by Fraunhofer HHI
18075 B. Bross (Fraunhofer HHI)
TE12.3: Adaptive intra smoothing (AIS) test (off
vs. on) by Fraunhofer HHI
18076 S. Quackenbush
94th MPEG Audio Subgroup Report
18077 S. Quackenbush
Draft USAC CE Status and Workplan
18078
K. Iguchi, A. Ichigaya, Y. Shishikui(NHK), S.
Sekiguchi, A. Minezawa (Mitsubishi Electric)
Performance report of TMuC for Super HiVision
18079 M. Zhou, V. Sze (TI)
TE 12: Evaluation of transform unit (TU) size
18080 M. Zhou (TI)
TE12: Evaluation of IBDI and TPE (transform
precision extension)
18081 V. Sze, M. Budagavi (TI)
TE12: Evaluation of entropy coders: PIPE vs.
CABAC
18082 V. Sze, M. Budagavi
TE12: Evaluation of transform coefficient
coding (HHI_TRANSFORM_CODING) with tool
breakdown
18083 M. Zhou, M. Budagavi (TI)
TE2: TI reference frame compression proposal
18084 M. Budagavi (TI)
TE5: TI evaluation of unified intra prediction
simplifications
18085 M. Budagavi (TI)
TE8: TI Parallel context processing (PCP)
proposal
18086 V. Sze, M. Budagavi (TI)
TE8: Evaluation of RIM-PCP
18087 V. Sze, M. Budagavi (TI)
TE8: Evaluation of RIM-V2V entropy coding
18088 M. Zhou (TI)
Testing results on the simplified TMuC
configurations
18089 M. Zhou (TI)
Cross verification of Mitsubishi 0.7.4
configuration
J. Kim, M. Kim (KAIST), H.-Y. Kim (ETRI), K. Sato
(SONY), X. Shen, L. Yu (Zhejiang Univ.), K. Choi, E.
18090 S. Jang (Hanyang Univ.), B. Bross (HHI), W.-J. Han TE9: Report on large block structure
(Samsung), J.-K. Jo, S.-N. Park, D. G. Sim, S.-J. Oh
(Kwangwoon U
18091 B. Lee, M. Kim (KAIST), J. Kim, H.-Y. Kim (ETRI)
Improved side information signaling for QVBT
in TMuC
18092 S. Li, L. Yu (Zhejiang Univ.)
TE3: Test results of second order prediction
18093 X. Zhu, L. Yu (Zhejiang Univ.)
TE7: Cross-check for I2R proposal on modedependent fast separable KLT for block-based
intra coding
18094 L. Wang, L. Yu (Zhejiang Univ.)
TE10 subset 2: Performance on Wiener-based
in-loop filters
18095 J. Chen, L. Yu (Zhejiang Univ.)
TE2: A memory bandwidth measurement
algorithm for reference frame compression
18096 C. S. Lim, V. Wahadaniah, S. Naing, H. W. Sun
TE2: Reference frame compression using
32
18097
(Panasonic)
image coder
C. S. Lim, V. Wahadaniah, S. Naing, H. W. Sun
(Panasonic)
TE2: Cross-check of memory compression
results from NEC (JCTVC-B057/JCTVC-Cxxx)
18098 T. Chujoh, T. Yamakage (Toshiba)
TE2: Adaptive scaling for bit depth
compression on IBDI
18099 T. Chujoh (TE coordinator)
TE2: Summary of TE2 on IBDI and memory
compression
18100 T. Chujoh, T. Yamakage (Toshiba)
A framework for standardization of memory
compression
18101 T. Chujoh, K. Kanou, T. Yamakage (Toshiba)
TE3 subtest 4: High accuracy interpolation filter
(HAIF)
18102
T. Shiodera, A. Tanizawa, T. Chujoh, T. Yamakage
(Toshiba)
18103
A. Tanizawa, J. Yamaguchi, T. Shiodera, T. Chujoh, TE7: 1-dimensional directional unified
T. Yamakage (Toshiba)
transform
TE6 subset a: Bidirectional intra prediction
18104 A. Tanizawa, J. Yamaguchi (Toshiba)
TE7: Cross-verification of mode dependent
residual reordering (Peking Univ.)
18105 T. Yamakage, T. Chujoh, T. Watanabe (Toshiba)
TE10 subtest 2: Reduction of number of
encoding passes for quadtree adaptive loop
filter (QALF)
18106 K. Chono, T. Yamakage (TE coordinators)
TE10: Summary of TE10 on in-loop filtering
18107 T. Yamakage, T. Chujoh, T. Watanabe (Toshiba)
Coding efficiency report of modification by
TMuC draft005
18108 T. Yamakage, T. Chujoh, T. Watanabe (Toshiba)
Comparison of loop & post filtering for in-loop
and post-processing filtering AHG
18109 T. Chujoh, T. Yamakage (Toshiba)
Experimental results of ALF on low complexity
18110 T. Chujoh, K. Kanou, T. Yamakage (Toshiba)
TE12.2: Experimental results of interpolation
filter
18111 T. Chujoh, T. Yamakage (Toshiba)
TE12.7: Experimental results of IBDI
18112 X. Zhao, L. Zhang, S. Ma, W. Gao (Peking Univ.)
TE7: Results for mode-dependent residual
reordering for intra prediction residual
18113 X. Zhao, L. Zhang, S. Ma, W. Gao (Peking Univ.)
TE7: Cross-verification results of I2R proposal
18114
K. Chono, K. Senzaki, H. Aoki, J. Tajime, Y. Senda
(NEC)
TE10: Conditional joint deblocking-debanding
filter
18115
K. Chono, K. Senzaki, H. Aoki, J. Tajime, Y. Senda
(NEC)
TE12: Report on combined intra prediction
evaluation
18116
H. Aoki, K. Chono, K. Senzaki, J. Tajime, Y. Senda
(NEC)
TE2: 1-D DPCM-based memory compression
18117
H. Aoki, K. Chono, K. Senzaki, J. Tajime, Y. Senda
(NEC)
An extension of DPCM-based memory
compression to 2-D structure with ADPCM
18118
H. Aoki, K. Chono, K. Senzaki, J. Tajime, Y. Senda
(NEC)
Performance improvement of DPCM-based
memory compression by adaptive quantization
18119 F. C. A. Fernandes
Low complexity rotational transform
18120 M. Yang, S. Lin, D. Wang, J. Zhou, H. Yu, (Huawei)
TE1:Huawei report on DMVD improvements
in TMuC
18121 S. Lin, D. Wang, M. Yang, H. Yu, (Huawei)
Cross-checking of DMVD results from
Mitsubishi and Intel
18122 X. Zheng (HiSilicon), H. Yu, (Huawei)
TE3.2: Huawei & HiSilicon report on flexible
motion partitioning coding
18123 L. Liu (HiSilicon), J. Zhou, H. Yu (Huawei)
TE5: Huawei & HiSilicon report on verification
test results
33
18124 C. Lai, Y. Lin (HiSilicon)
TE6.a: HiSilicon report on short distance intra
prediction
18125 H. Yang, J. Zhou, H. Yu(Huawei)
TE7: Symmetry-based simplification of MDDT
18126 H. Yang, J. Zhou, H. Yu (Huawei),
TE7: Report of cross-checking results for
Toshiba proposal
18127 X. Zheng (HiSilicon), J. Zhou, H. Yu (Huawei),
TE12: Huawei & HiSilicon report on AMP
evaluation
18128 J. Zhou, D. Wang, H. Yu (Huawei),
TE12: Huawei report on max transform unit
size evaluation
18129 H. Yang, J. Zhou, H. Yu (Huawei),
Symmetry-based scan order sharing scheme
for MDDT
18130 H. J. Kim
Fast CABAC
18131 A. Saxena, F. C. A. Fernandes (Samsung)
Jointly optimal intra prediction and adaptive
primary transform
18132 B. Bross (Fraunhofer HHI)
TE12.1: Results for PU-based merging by
Fraunhofer HHI
18133 B. Bross (Fraunhofer HHI)
TE12.4: Transform unit quadtree tested against
2-level by Fraunhofer HHI
18134
G. Li (Santa Clara Univ.), L. Liu, N. Ling (Santa
Clara Univ.), J. Zheng, P. Zhang (Hisilicon)
Integration of plane mode in unified intra
prediction
18135
W. Dai, M. Krishnan, P. Jesudhas, P. Topiwala
(FastVDO)
Fast integer transforms for the HEVC test
model
18136 I.S. Chong, W. Chien, M. Karczewicz (Qualcomm)
Adaptive loop filter with low encoder complexity
18137 J. Lou, K. Panusopone, L. Wang (Motorola)
Zigzag scan for CABAC/PIPE
18138
Y.-J. Chiu (Intel), Y.-W. Huang (MediaTek), M. Wien
TE1: Summary report of TE1 activity
(RWTH Aachen Univ.), H. Yu (Huawei)
18139 Y.-J. Jeon, B.-M. Jeon (LG Electronics)
TE11: Cross verification of Sony proposal on
motion vector coding
18140 Y. Yu (Broadcom)
Implementation analysis of transform block size
18141
K. Sugimoto, A. Minezawa, S. Sekiguchi (Mitsubishi TE12.1 cross-check report on block merging
Electric)
(MRG) on/off
18142
K. Sugimoto, A. Minezawa, S. Sekiguchi (Mitsubishi
TE12.2 report on MV prediction AMVP/IMVP
Electric)
18143
K. Sugimoto, A. Minezawa, S. Sekiguchi (Mitsubishi
TE12.3 cross-check report on AIS on/off
Electric)
18144 S. Sekiguchi, K. Sugimoto (Mitsubishi Electric)
Suggested approach toward HEVC test model
creation
18145 K. Sugimoto, S. Sekiguchi (Mitsubishi Electric)
A suggested starting point for HEVC test model
18146
K. Sugimoto, A. Minezawa, S. Sekiguchi (Mitsubishi
On selecting coding tools for HEVC test model
Electric)
18147 Y. Itani, S. Sekiguchi (Mitsubishi Electric)
TE1: Implicit direct vector derivation
18148 S. Sekiguchi, Y. Itani (Mitsubishi Electric)
TE1: Spatio-temporal direct mode
18149
D. Y. Kim, J. P. Kim, Y. L. Lee (Sejong University), J. Chroma interpolation filtering using high
Lim, J. Song (SKT)
precision filter
Alberto Messina, Robbie De Sutter, Werner Bailer,
18150 Masanori Sano, Jean-Pierre Evain, Patrick NdjikiNya,
Some changes to WD on MPEG-7 AudioVisual
Description Profile (AVDP)
18151 Y.-J. Chiu, L. Xu, Wenhao Zhang, Hong Jiang (Intel)
TE1: Report of self derivation of motion
estimation improvement in TMuC
18152 Y.-J. Chiu, L. Xu, Wenhao Zhang, Hong Jiang (Intel) TE1: Cross-checking of DMVD result from
34
Huawei
TE12: Cross verification for adaptive loop filter
(ALF) (on vs. off)
18153 Y.-J. Chiu, L. Xu, W. Zhang, H. Jiang (Intel)
18154
J. Yang, K. Won, B. Jeon (SKKU), J. Lim, J. Song
(SKT)
TE10 subtest 1: Results of intra deblocking
filter testing by SKKU/SKT
18155
J. Yang, K. Won, B. Jeon (SKKU), J. Lim, J. Song
(SKT)
TE10 subtest 1: Cross-verification result of
Microsoft deblocking filter proposal by
SKKU/SKT
18156
J. Yang, K. Won, B. Jeon (SKKU), J. Lim, J. Song
(SKT)
TE11: Cross-verification result of Orange Labs
motion vector competition proposal (3.1a) by
SKKU
18157
J. Yang, K. Won, B. Jeon (SKKU), J. Lim, J. Song
(SKT)
Motion vector competition method with reduced
candidate sets
18158
D. He, G. Korodi, P. Imthurn, J. Jamias, D.
O’Loughlin, G. Martin-Cocher (RIM)
Comments on V2V coding for TM/TMuC
18159 D. Hoang (Zenverge)
Flexible scaling of quantization parameter
18160 X. Zhang (HKUST)
TE12.4: Mode dependent direnctional
transform (MDDT) test (off vs. on) by HKUST
18161 S. Fukushima (JVC)
Decoder-side block boundary decision (DBBD)
with OBMC
18162 M.Ueda (JVC)
TE1.a: Implementation report of Refinement
Motion Compensation using DMVD on TMuC
18163 C. Lai, Y. Lin (HiSilicon), J. Zhou (Huawei)
TE6.a: Huawei & HiSilicon report on cross
verification of MSRA's LIC
18164 B. Bross (Fraunhofer HHI)
TE9-2: Report on performance tests for
different sets of PU modes by Fraunhofer HHI
18165 Yongwei Zhu, Rongshan Yu, Susanto Rahardja
An experimental study of different dynamic
volume change encoding methods for IMAF
18166 Yongwei Zhu, Rongshan Yu, Susanto Rahardja
A study on the computational complexity of EQ
implementation in IMAF
18167 Y.-L. Chang, Y.-W. Huang, S. Lei (MediaTek)
TE8: Crosscheck on TI’s proposal of
parallel context processing by MediaTek
18168
J. An, K. Zhang, Y. Gao, X. Guo, C.-M. Fu, Y.-W.
Huang, S. Lei (MediaTek)
TE10 subtest 1: Improved deblocking filter
18169
C.-Y. Tsai, C.-M. Fu, C.-Y. Chen, Y.-W. Huang, S.
Lei (MediaTek)
TE10 subtest 2: Coding unit synchronous
picture quadtree-based adaptive loop filter
(QALF)
18170 C.-M. Fu, Y.-W. Huang, S. Lei (MediaTek)
TE10 subtest 2: Crosscheck on
TOSHIBA’s proposal of adaptive loop filter
by MediaTek
18171 C.-Y. Tsai, Y.-W. Huang, S. Lei (MediaTek)
TE10 subtest 2: Crosscheck on SHARP’s
proposal of adaptive loop filter by MediaTek
Y.-L. Chang, C.-M. Fu (MediaTek), A. Segall, Y. Su
18172 (Sharp), C.-Y. Chen, Y.-W. Huang, S. Lei
(MediaTek)
TE10 subtest 3: Controlled clipping
18173
C.-Ming Fu, C.-Y. Chen, Y.-W. Huang, S. Lei
(MediaTek)
TE10 subtest 3: Quadtree-based adaptive
offset
18174 Y.-L. Chang, Y.-W. Huang, S. Lei (MediaTek)
TE12: Crosscheck on deblocking filter in
TMuC0.7 by MediaTek
18175 A. Ichigaya, K. Iguchi (NHK)
TE12.2: Report on asymmetric motion
prediction unit (AMP) on/off
18176 Y. Choe, S. Hong (Yonsei Univ.), Y-G. Kim (KGIT)
TE2: Cross-check of memory compression
35
results from TOSHIBA (JCTVC-C075)
18177 P. Amon, A. Hutter, E. Wige, A. Kaup (Siemens)
Reference frame denoising
18178 Y. Xu, J. Li, W. H. Chen, D. Tian (Cisco Systems)
Context-adaptive hybrid variable length coding
18179 P. Chen, M. Karczewicz (Qualcomm)
TE12: Report on deblocking filter
18180 R. Sjöberg, P. Wennersten (Ericsson)
Fine granularity slices
18181 K. Andersson, R. Sjöberg (Ericsson)
TMuC 0.7 results with sequence dependent
QPs
18182 A. Norkin, R. Sjöberg, K. Andersson (Ericsson)
TE10: Cross verification of Mediatek’s
deblocking filter by Ericsson
18183
J. Samuelsson, R. Sjöberg, K. Andersson
(Ericsson)
TE6.a: Cross verification of line-based intra
prediction
18184 K. Andersson (Ericsson)
TE7: Cross verification of MDDT simplification
JCTVC-B024
18185 K. Andersson (Ericsson)
TE12: Cross verification of RDOQ
18186 P. Chen, M. Karczewicz (Qualcomm)
TE12: Crosscheck on LCEC phase 2
18187
K. Chono, K. Senzaki, H. Aoki, J. Tajime, Y. Senda
(NEC)
TE10: Cross-verification result of SKKU/SKT
deblocking filter
18188 K. Kondo, T. Suzuki (Sony)
TE12.2: Results on MC interpolation filters
18189 K. Sato, T. Suzuki (Sony)
TE12.2: Results on MV prediction
18190 K. Kondo, T. Suzuki (Sony)
TE3 subset 4: Results on bi/single filter
switching in FIF
18191 K. Kondo, T. Suzuki (Sony)
TE3 subset 4: Cross verification on high
accuracy interpolation filter
18192 K. Sato (Sony)
TE11: Study on motion vector coding
(experiment 3.3.a & 3.3.c)
18193 K. Sato (Sony)
Proposal on large block structure and
quantization
18194 K. Kondo, K. Sato, T. Suzuki (Sony)
Comments on common test conditions
18195 Japan National Body
JNB comments on Resolution 2.1.4: Methods
for signaling frame packing arrangement in
MPEG-2 video
18196 Japan National Body
JNB comments on Resolution 2.2.2 (3D
Signaling in MPEG-2 video)
18197 Japan National Body
JNB comments on N11465 about frame
packing arrangement SEI message
18198 E. Maani, L. Dong, W. Liu (Sony)
TE6.b: Experiment results of DCIM
18199 X. Zheng (HiSilicon), H. Yu (Huawei)
TE3.2: Report on cross-verification of simplified
geometry block partitioning from Technicolor
18200 A. Tabatabai, C. Auyeung (Sony)
TE12.3: Experimental results of edge based
prediction
18201 A. Tabatabai (Sony)
TE6: Summary report of intra prediction
improvements tool experiments
18202 T. Ikai, T. Yamamoto (SHARP)
TE10 subtest 2: Parallel adaptive loop filter
18203 T. Yamamoto, A. Segall (SHARP)
TE12.3: Report on combined intra prediction
18204 T. Yamamoto, Y. Yasugi (SHARP)
TE6.b: Cross verification of differential coding
of intra modes (DCIM)
18205 T. Yamamoto, Y. Yasugi (Sharp)
Analysis and improvement of differential coding
of intra modes
18206 P. Tao, D. Li, W. Wu, J. Wen (Tsinghua Univ.)
Horizontal spatial prediction for intra coding
36
18207 A. Segall (Sharp)
TE2: Cross-check of memory compression
results from Panasonic (JCTVC-C074)
18208 A. Segall (Sharp)
TE6: Cross-check of bi-directional intrapredition results from Toshiba
18209 A. Segall (Sharp)
TE10.3: Cross check of MediaTek's proposal
on quadtree-based adaptive offset
18210 K. Misra, A. Segall (Sharp)
TE12.4: Cross-check of rotational transform
(ROT)
18211 A. Segall (Sharp)
TE12.2: Cross-check of adaptive motion vector
resolution (AMVRES) technology in the TMuC
software
18212 T. Yoshino, S. Naito, S. Sakazawa (KDDI)
Enhanced switching of interpolation filter for
HEVC
18213
Background for Proposing Scalable Application
Kyoungro Yoon, Min-Uk Kim, Tae-Beom Lim, Kyung
Markup Language for AIT’s Present
Won Kim, Yun Ju Lee, Jae Won Moon
Content Elementary Service
18214
Kyung Won Kim, Tae-Beom Lim, Yun Ju Lee, Jae
Won Moon, Kyoungro Yoon, Min-Uk Kim
18215 A. Fuldseth (Cisco)
18216
Proposal of Scalable Application Markup
Language (SAML) for AIT’s Present
Content Elementary Service
TE12.6: PIPE tested against LCEC by Cisco
A. Fuldseth, G. Bjøntegaard (Cisco), A. Hallapuro,
K. Ugur, J. Lainema (Nokia)
Recent improvements of the low complexity
entropy coder (LCEC) in TMuC
18217 J. Lainema, K. Ugur (Nokia)
TE5: Results for simplified intra prediction tests
by Nokia
18218 K. Iwata (Renesas)
TE6.a: Cross-check of bidirectional intra
prediction (JCTVC-B042)
18219 J. Lainema, K. Ugur, O. Bici (Nokia)
TE12.3: Results for planar prediction tests by
Nokia
18220 S. Mochizuki, K. Iwata (Renesas)
Evaluation of intra prediction based on
repetitive pixel replenishment
Takanori Senoh (senoh@nict.go.jp), Kenji
Yamamoto (k.yamamoto@nict.go.jp), Ryutaro Oi
18221 (oi@nict.go.jp), Yasuyuki Ichihashi (yichihashi@nict.go.jp), Taiichiro Kurita (tkurita@nict.go.jp),
Depth Estimation Experiment with Poznan Hall
18222
Minsoo Park, Seungwook Lee, Bon Ki Koo, Hyunok
Oh, Taehee Lim, Euee S. Jang
Bug report in discrepancy between the textual
specification in VTL and RVC-CAL
implementation in RSM
18223
J. Kim, S.-C. Lim, H. Y. Kim, H. Lee, J. S. Choi
(ETRI)
TE7: Cross-check result of Peking Univ.’s
proposal (JCTVC-C089)
Minsoo Park, Seungwook Lee, Bon Ki Koo, Hyunok
18224 Oh, Sinwook Lee, Sowon Kim, Taehee Lim,
Hyungyu Kim, Euee S. Jang
Contribution to RGC tutorial
18225
Seungwook Lee, Bon Ki Koo, Sinwook Lee, Minsoo
Park, Sowon Kim, Taehee Lim, Euee S. Jang
Contribution of 3DMC FUs for RGC framework
18226
S. Jeong, J. Lee, H. Y. Kim, S.-C. Lim (ETRI), K.
Kim, H. Lee, G. Park (KHU)
TE11: Cross-check result of merge/skip (3.2c)
18227
Jae Won Moon, Kyung Won Kim, Tae-Beom Lim,
Yun Ju Lee, Kyoungro Yoon, Min-Uk Kim
Proposal of Stack Operators for AIT’s
Present Content Elementary Service
18228 Y. Choe, J. Kim (Yonsei Univ.), Y-G. Kim (KGIT)
37
TE7: Cross-verification results of Huawei
proposal on simplified MDDT for intra
prediction residual
18229
Tae-Beom Lim, Kyung Won Kim, Yun Ju Lee, Jae
Won Moon, Kyoungro Yoon, Min-Uk Kim
Initial Proposal of DCFunctions for AIT’s
Present Content Elementary Service
18230
K. Zhang, X. Guo, J. An, M. Guo, Y.-W. Huang, S.
Lei (MediaTek)
Overlapped block intra prediction
P. Wu, S. Paschalakis, N. Sprljan (Mitsubishi
18231
Electric)
TE10 subtest 2: Cross-check results of JCTVCC082 [Toshiba] Reduction of number of
encoding passes for quadtree-based adaptive
loop filter (QALF)
P. Wu, S. Paschalakis, N. Sprljan (Mitsubishi
Electric)
An enhanced block-based adaptive loop filter in
TMuC0.7.0 platform
18232
TE12.6: Results for LCEC_PHASE2 tests by
Nokia
18233 K. Ugur, A. Hallapuro, J. Lainema (Nokia)
18234
Taejin Lee, Seungkwon Beack, Minje Kim,
Kyeongok Kang
Progress report on the TCX windowing CE for
USAC
18235
K. Ugur, J. Lainema (Nokia), R. Panchal, M.
Karczewicz (Qualcomm)
TE12.2: Results for SIFO-6Tap with DIF tests
by Nokia and Qualcomm
18236 J. Chen, T. Lee, W.-J. Han (Samsung)
TE9: Simulation results for various max.
number of transform quadtree depth
18237 I.-K. Kim, T. Lee, W.-J. Han (Samsung)
TE11: Report on experiment 3.2.c: Check skip
and merge together
18238 T. Lee, J. Chen, W.-J. Han (Samsung)
TE12.1: Experimental results of transform unit
quadtree/2-level test
18239
E. Alshina, J. Chen, N. Shlyakhov, A. Alshin, W.-J.
Han (Samsung)
TE12.2: Experimental results of SIFO (DIF) /
DCT-IF
18240
E. Alshina, A. Alshin, W.-J. Han (Samsung), R.
Joshi, M. Coban (Qualcomm)
TE12.4: Experimental results of MDDT and
ROT by Samsung and Qualcomm
18241 J. Chen, V. Seregin, W.-J. Han (Samsung)
TE12.4: Experimental results of transform
coefficient coding
18242 A. Alshin, E. Alshina (Samsung)
Bi-directional optical flow
18243 V. Seregin, J. Chen (Samsung)
Low-complexity adaptive coefficient scanning
18244 J. Chen, V. Seregin (Samsung)
Chroma intra prediction by scaled luma
samples using integer operations
18245 Y. Piao, J.-H. Min, J. Chen (Samsung)
Encoder improvement of unified intra prediction
18246 I.-K. Kim, T. Lee (Samsung)
Simulation results of motion partition, block
merging and motion competition schemes
18247 Y.-M. Hong, M.-S. Cheon, I.-K. Kim (Samsung)
Low-complexity 16x16 and 32x32 transforms
and partial frequency transform
18248 S. Lee, M.-S. Cheon, I.-K. Kim (Samsung)
Efficient coefficient coding method for large
transform in VLC mode
18249 Y. Sohn, K.-H. Lee, B.-K. Lee, I.-K. Kim (Samsung)
Inner block oriented ALF processing for
memory & bandwidth reduction
18250 W.-J. Han (Samsung)
Cross-verification of JCTVC-C086:
Experimental results of ALF on low complexity
18251 M. Mrak, T.Davies, D. Flynn, A. Gabriellini (BBC)
Additional results for combined intra prediction
18252 acordova@tue.nl, jean.gelissen@philips.com
Robot Integration
18253 jean.gelissen@philips.com
Additional Use Cases for MPEG-V
18254 M. Narroschke (Panasonic)
TE12.5: Results for adaptive loop filter using
prediction and residual
18255 K. Ugur, J. Lainema (Nokia)
Adaptive MV resolution with directional filters
18256 V. Drugeon (Panasonic)
TE12.3: Results for edge based prediction
38
18257 V. Drugeon (Panasonic)
TE06.b: Cross verification of differential coding
of intra modes (DCIM)
18258 J. Kim, Y.Jeon, B. Jeon (LG Electronics)
Encoding complexity reduction for intra
prediction by disabling NxN partition
18259 P. Amon, A. Hutter, E. Wige, A. Kaup (Siemens)
In-loop reference frame denoising
18260 A. Krutz, T. Sikora (Tech. Univ. Berlin)
TE3: Summary report for inter prediction in
HEVC
18261 A. Glantz, A. Krutz, T. Sikora (Tech. Univ. Berlin)
TE3: Adaptive global motion temporal
prediction for HEVC
18262
M. Esche, A. Krutz, A. Glantz, T. Sikora (Tech. Univ.
In-loop filtering using temporal pixel trajectories
Berlin)
18263 G. Martin-Cocher (RIM), M. Budagavi (TI)
TE8 report
18264 D.-K. Kwon, H. Kim (TI)
Frame coding in vertical raster scan order
18265 K. McCann (Samsung/Zetacast)
TE12: Summary of evaluation of TMuC tools in
TE12
18266 M. Sadafale, M. Budagavi (TI)
Low-complexity configurable transform
architecture for HEVC
18267 V. Sze, M. Budagavi (TI)
Parallelization of HHI_TRANSFORM_CODING
18268 M. Budagavi (TI)
A simple cache model for measuring motion
compensation bandwidth
18269 I. S. Chong, M. Karczewicz (Qualcomm)
TE10 subset 2: Cross check result of MediaTek
ALF
18270 I. S. Chong, M. Karczewicz (Qualcomm)
TE12: Cross check result of Panasonic's 3input-ALF
18271 Y. Suzuki, T.K. Tan, F. Bossen (NTT DOCOMO)
TE12.1: Results for PU based merging
18272
N. Sprljan, S. Paschalakis, P. Wu (Mitsubishi
Electric)
TE12.7: Verification of JCTVC-C041 (partition
based illumination compensation - PBIC)
18273
N. Sprljan, S. Paschalakis, P. Wu (Mitsubishi
Electric)
TE3 subtest 3: Local intensity compensation
(LIC) for inter prediction
18274 Y. Zheng, M. Coban, M. Karczewicz (Qualcomm)
Simplified intra smoothing
18275 Dong Tian, Anthony Vetro
Comments on Draft Call for Proposals on 3D
Video Coding Technology
18276 I. S. Chong, M. Karczewicz (Qualcomm)
TE12: Evaluation of adptive in-loop filter
18277
K. Vermeirsch, J. De Cock, R. Van de Walle (Ghent
Univ. - IBBT)
Report of complexity analysis of geometric
partitioning
18278 J. Sole, R. Joshi, M. Karczewicz (Qualcomm)
Reduced complexity 32×32 transform by
coefficient zero-out
18279 J. Zan, J. Meng, M. T. Islam, D. He (RIM)
TE12.8: Results on RDOQ in high efficiency
settings
18280 R. Panchal, M. Karczewicz (Qualcomm)
TE12: Report on interpolation filter: SIFO/DIF
18281 D. Karwowski (Poznan Univ. Tech.)
TE3: Cross-check results of local intensity
compensation tool from Mitsubishi Electric
18282 R. Panchal, M. Karczewicz (Qualcomm)
TE12: Crosscheck on interpolation filter:
SIFO(DIF)/MOMS
18283
Document registration withdrawn
18284 I. S. Chong, M. Karczewicz (Qualcomm)
Cross check result of JCTVC-C084
18285 D. Flynn (BBC)
TE12.3: Report on planar intra prediction
18286 J. Zan, J. Meng, M. T. Islam, D. He (RIM)
TE 8: Cross verification of TI-PCP proposal for
significance map
39
18287 P. Onno (Canon)
TE12.8: Cross-verification of partition-based
illumination compensation (PBIC)
18288 T. Chujoh, K. Kanou, T. Yamakage (Toshiba)
TE3 subset 4: Cross-verification on bi/single
filter switching in FIF
18289 T. Chujoh, K. Kanou, T. Yamakage (Toshiba)
TE12.2: Cross verification on DCT-IF for
chroma
18290 J. Zan, G. Korodi, J. Meng, M. T. Islam, D. He (RIM)
TE 8: Reports on V2V coding and context
modeling by RIM
18291 M. Coban, R. Joshi, M. Karczewicz (Qualcomm)
Low complexity adaptive coefficient scanning
18292
P. Chen, R. Panchal, W.-J. Chien, M. Karczewicz
(Qualcomm)
Overlapped block motion compensation in
TMuC
18293 R. Joshi, M. Karczewicz (Qualcomm)
TE 12: Report on evaluation of internal bit
depth increase and transform precision
extension
18294 R. Panchal, M. Karczewicz, P. Chen (Qualcomm)
Encoder speedup for bidirectional averaging
with rounding control
18295 R. Panchal, M. Karczewicz (Qualcomm)
Test result of SIFO for sequencs with
illumination change
18296 M. Budagavi, A. Gupte (TI)
DCT+Hadamard low complexity large
transform for Inter coding
18297 K. Misra, J. Zhao, A. Segall (SHARP)
New results for entropy slices for highly parallel
coding
18298 Y. Su, A. Segall (SHARP)
On motion vector competition
18299 W.-J. Chien, P. Chen, M. Karczewicz (Qualcomm)
TE 12: Evaluation of transform skip flag
(RQT_ROOT)
18300
Y. Yu, K. Panusopone, L. Wang, S.-T. Hsiang, F.
Ishtiaq (Motorola)
TE12: Results for experiments on Max CU size
18301 K. Panusopone, X. Fang, L. Wang (Motorola)
Flexible picture partitioning
18302 X. Wang (Qualcomm)
TE4: Summary of TE4 on variable length
coding
18303 X. Wang, M. Karczewicz, W.-J. Chien (Qualcomm)
TE4: Report on VLC for coded block flag
18304 M. Karczewicz, W.-J. Chien, X. Wang (Qualcomm)
Improvements on VLC
18305 W.-J. Chien, P. Chen, M. Karczewicz (Qualcomm)
TE 12: Evaluation of block merging (MRG)
18306 W.-J. Chien, P. Chen, M. Karczewicz (Qualcomm)
TE 12: Evaluation of adaptive motion vector
resolution (AMVRES)
18307 W.-J. Chien, P. Chen, M. Karczewicz (Qualcomm)
TE1: Cross-check result of DMVD proposal
from JCTVC-C138 (JVC/Kenwood)
18308
W.-J. Chien, P. Chen, X. Wang, M. Karczewicz
(Qualcomm)
Test results of transform skip flag and phase 2
VLC integration
18309 Ye-Kui Wang, Zhenyu Wu
DASH related clarifications to ISO base media
file format
18310 R. Joshi, M. Coban, M. Karczewicz (Qualcomm)
TE12: Report on evaluation of MDDT and ROT
Ye-Kui Wang, Per Fröjdh, Thomas Stockhammer,
18311
David Furbeck, Imed Bouazizi
Base DASH on updated 3GPP adaptive HTTP
streaming specification
18312
A. Tabatabai, C. Auyeung, E. Maani, T. Suzuki
(Sony)
A study on the impact of intra smoothing
18313 X. Peng, J. Xu, F. Wu (Microsoft)
TE6.a: Report of line-based coding
18314 X. Peng, J. Xu (Microsoft)
TE6: Cross-verification of HiSilicon’s short
distance intra prediction by Microsoft
18315 X. Peng, J. Xu (Microsoft)
TE6: Cross-verification of MediaTek's
40
overlapped block intra prediction by Microsoft
18316 Z. Xiong, X. Sun, J. Xu (Microsoft)
TE10 subset 1: Report of content-adaptive deblocking
18317 X. Xiong, J. Xu (Microsoft)
TE10 subset 1: Cross- verification of
NEC’s joint deblocking-debanding filter by
Microsoft
18318 X. Peng, J. Xu, F. Wu (Microsoft)
Improve intra frame coding by PU/TU
reordering
18319
C. Lan, J. Xu, F. Wu (Microsoft), G. Shi (Xidian
Univ.)
Screen content coding results using TMuC
18320
B. Li, J. Xu, F. Wu, G. J. Sullivan (Microsoft), H. Li
(Univ. Sci. Tech China)
Redundancy reduction in Cbf and merge
coding
18321
B. Li, J. Xu, G. J. Sullivan, F. Wu (Microsoft), H. Li
(Univ. Sci. Tech China)
Redundancy reduction in B-frame coding at
temporal level zero
18322
D. He, G. Korodi, E.-h. Yang, G. Martin-Cocher
(RIM)
Opportunistic parallel V2V decoding
18323 Y.-L. Chang, Y.-W. Huang, S. Lei (MediaTek)
TE8: Crosscheck result of the transcoder for
JCTVC-B034 source selection for V2V entropy
coding in HEVC
18324 M. Coban, Y. Zheng, M. Karczewicz (Qualcomm)
TE12: Evaluation of fast adaptive intra
smoothing
18325
M. Tok, A. Glantz, A. Krutz, T. Sikora (Tech. Univ.
Berlin)
TE3 subtest 1: Cross-check of results from LG
18326 A. Paul (Hanyang Univ. / NCKU)
Region of block based dynamic video
processing for HEVC
18327 A. Segall (Sharp), J. Xu (Microsoft)
TE9-2.1 Report on forced RQT split according
to PUs (Sharp, Microsoft)
18328
W.-J. Chien, P. Chen, X. Wang, M. Karczewicz
(Qualcomm)
Modified uni-directional inter prediction in
generalized P and B pictures
18329 E. Francois (Technicolor), L. Guillo (INRIA)
TE12.2: Technicolor & INRIA report on AMP
evaluation
18330 B. Bross (Fraunhofer HHI)
Cross-check of encoder speedup for
bidirectional averaging with rounding control
18331 T. Yamakage (Toshiba)
TE4: Verification results of JCTVC-C262
18332 J. Jung, G. Clare, S. Pateux (Orange Labs)
TE11: Report on experiment 3.1.a: Disabling
MV competition
18333 J. Jung, G. Clare, S. Pateux (Orange Labs)
TE11: Report on experiment 3.2.b: MVCompetition on Inter, Skip and Direct modes
18334 J. Jung, G. Clare, S. Pateux (Orange Labs)
TE11: Report on experiment 3.3.b:
‘temporally oriented’ set of predictors for
MV-Competition
18335 J. Jung, G. Clare, S. Pateux (Orange Labs)
TE11: Verification of experiment 3.2.d by NTT
DOCOMO
18336 J. Jung (Orange Labs)
TE11: Summary report for TE11 on motion
vector coding
18337 A. Wells (Ambarella), C. Fogg (Harmonic)
Level limits for Progressive High Profile
18338 G. J. Sullivan (Microsoft)
On the AVC frame packing arrangement SEI
message
18339
G. J. Sullivan (Microsoft), H. Schwarz (Fraunhofer
HHI)
18340 H. Zhu (Zhu)
Miscellaneous AVC (ITU-T H.264 | ISO/IEC
14496-10) errata issues
Re-compensation based on partial coefficients
41
18341 H. Zhu (Zhu)
Decoding improvement on the PA-Coder
18342 Deliang Fu, Lu Yu
3DV EE4 Report on Book_arrival Sequence
18343 Deliang Fu, Lu Yu
3DV EE4 Report on Poznan_Street Sequence
18344 W.-J. Han (Samsung)
Cross-verification of JCTVC-C213: additional
results for combined intra prediction
18345
Mickael Raulet, Matthieu Wipliez, Jérôme Gorin,
Nicolas Siret
Addendum: Specification of typing rules for
RVC-CAL
18346 Mickaël Raulet, Matthieu Wipliez
Automatic classification of the FUs
18347 Ye-Kui Wang
DSAH - support of rewind trick mode
18348 Ye-Kui Wang
On random access point signaling in file
formats
18349 Jaejoon Lee, Seok Lee, HoCheon Wey, Du Sik Park 3DV EE4 Results on Balloons sequence
18350
Jaejoon Lee, Seungsin Lee, Seok Lee, HoCheon
Wey, Du Sik Park
Comments on Test Scenario for Optional Case
of Call for Proposal on 3D Video Coding
Technology
18351 Korean NB via SC 29 Secretariat
Korean NB comment on 3DTV signaling in
MPEG-2 System
18352 jean.gelissen@philips.com, roland@cs.uu.nl,
Path Finding
18353
Yasuaki Tokumo, Maki Takahashi, Shuichi
Watanabe, Norio Ito,
Consideration of Clock Drift Control for DASH
18354
Masayuki Tanimoto, Toshiaki Fujii, Mehrdad
Panahpour Tehrani, Menno Wildeboer,
3DV/FTV EE1 and EE4 report on Champagne
Tower sequence
18355
Masayuki Tanimoto, Toshiaki Fujii, Mehrdad
Panahpour Tehrani, Menno Wildeboer,
3DV/FTV EE4 report on Kendo sequence
18356
Masayuki Tanimoto, Toshiaki Fujii, Mehrdad
Panahpour Tehrani, Menno Wildeboer
3DV/FTV EE2 report on VSRS extrapolation
Yasuaki Tokumo, Maki Takahashi, Shuichi
18357 Watanabe, Yoshiaki Ogisawa, Takashi Kaneko,
Norio Ito,
Modification of Enhancement to Representation
Description on DASH for signaling of 3D
content
Menno Wildeboer (Nagoya Univ.), Patrick Lopez
18358 (Technicolor), Dong Tian (Mitsubishi), Yin Zhao
(Zhejiang Univ.), Cheon Lee (GIST)
Update of the Draft Report on Experimental
Framework for 3D Video Coding
18359 DVB via SC 29 Secretariat
Liaison Statement from DVB
18360
Takeshi Norimatsu, Tomokazu Ishikawa, Haishan
Zhong, Dan Zhao, Kok Seng Chong
Panasonic cross check report on complexity
reduction for time warping in USAC
18361
Takeshi Norimatsu, Tomokazu Ishikawa, Haishan
Zhong, Dan Zhao, Kok Seng Chong
Panasonic cross check report on PVC for SBR
envelope coding in USAC
18362 Inseon Jang, Jeongil Seo, Laurent Primaux
Updated text and reference SW of ISO/IEC
23000-12/FPDAM1 IM AF
Pablo Carballeira, Julián Cabrera, Fernando
18363 Jaureguizar, Gianluca Cernigliaro, Juan Casal,
Narciso GarcÃa
Results on 3DVC EE4 for Newspaper
18364
Tadashi Uchiumi, Makoto Ohtsu, Junsei Sato,
Yoshiya Yamamoto, Atsutoshi Shimeno
Comments on Requirements and Draft Call for
Proposal on 3D Video Coding
18365
SeungJu Han, Jae Joon Han, Won-Chul Bang,
James D.K. Kim, Sang-Kyun Kim
Data format for sensed information on
Electrograph sensor
Proposal for new box to support random
access for DASH
18366 Gerard Fernando, Yang Yanzi, Wang Fang
18367
Yoshiya Yamamoto, Tadashi Uchiumi, Makoto
Ohtsu, Junsei Sato, Atsutoshi Shimeno
18368
A method of disparity vector prediction using
depth map
3DV EE4 Resulr
42
18369 Patrick Lopez
3DV EE4 Results on Beergarden
18370 Gerard Fernando, Yang Yanzi, Wang Fang
Carriage of MPEG-2 TS support information in
Index file for DASH
18371 Kihyun Choo, Miyoung Kim, Eunmi Oh
Crosscheck report on adaptive T/F domain
post-processing for USAC
18372 Kihyun Choo, Miyoung Kim, Eunmi Oh
Crosscheck report on harmonic transposer CEs
18373 Heiko Purnhagen, Kristofer Kjörling
Dolby listening test results for CE on T/F postprocessing in USAC
18374 Heiko Purnhagen, Kristofer Kjörling
Dolby listening test results for CE on PVC for
SBR in USAC
18375 Heiko Purnhagen, Kristofer Kjörling
Dolby listening test results for CE on
complexity reduction for time warping in USAC
18376
Jean-Claude Dufourd, Jean Lefeuvre, Cyril
Concolato, Kyungmo Park, Jaeyeon Song
Proposed amendment for MPEG-U widgets
18377 jean.gelissen@philips.com, esko.dijk@philips.com
Tactile Information
18378 Kristofer Kjörling, Leif Sehlström
Finalization of CE on improved SBR
Barbara Resch, Leif Sehlström, Heiko Purnhagen,
18379 Lars Villemoes, Kristofer Kjörling, Bruno Bessette,
Philippe Gournay
Finalization of CE on an improved bass- post
filter operation for the ACELP of USAC
18380 Cyril Concolato, Jean Le Feuvre, Romain Bouqueau, Comments on DASH WD
18381 Cyril Concolato, Jean Le Feuvre, Romain Bouqueau
Report on an Open Source implementation of
MPEG-DASH
18382 Jean Le Feuvre
Conformance Sequences for BIFS
ExtendedCore2D
18383 Jean Le Feuvre
Reference Software for BIFS ExtendedCore2D
18384
Lars Villemoes, Per Ekstrand, Sascha Disch,
Frederik Nagel
Finalization of CE on improved harmonic
transposer in USAC
18385
Sungryeul Rhyu, Seo-Young Hwang, Kyungmo park, Annotation and Attributes for Media
Jaeyeon Song, Bogyeong Kang, Nhut Nguyen
Compositions
Haishan Zhong, Kok Seng Chong, Takeshi
Norimatsu, Tomokazu Ishikawa, Lars Villemoes, Per Finalization of CE on QMF based harmonic
18386
Ekstrand, Kristofer Kjörling, Stephan Wilde,
transposer
Sascha Disch, Frederik Nagel, Max Neuendorf,
18387
Yonghun Lee, Doug Young Suh, Jaeyeon Song,
kyungmo Park,
Compelling use-case for M17770(DFA-EE)
18388 Sungryeul Rhyu, Seo-Young Hwang, Nhut Nguyen,
Timed metadata Support in DASH
Kristofer Kjörling, Haishan Zhong, Kok Seng
Chong, Takeshi Norimatsu, Tomokazu Ishikawa,
18389
Lars Villemoes, Per Ekstrand, Stephan Wilde,
Sascha Disch, Frederik Nagel, Max Neuendorf,
Overview of performance of transposer
proposals, and suggested decoding modes
18390
Chulkeun Kim, Doug Young Suh, Jaeyeon Song,
Kyungmo Park
Kyuheon Kim, Jangwon Lee, Jung-Han Kim, Hee
18391 Jean Kim, Gahyun Ryu, Gilyoon Kim, Do Young
Joung, Namguk Kim
Benefits of TRA representation
File Format for 3D Support of DASH
18392
Hojin Ha, Sungryeul Rhyu, Seo-Young Hwang,
Jaeyeon Song
Random Access Point (RAP) Indication
18393
Kiho Cho, Nam Soo Kim, Sungyong Yoon, Sangoh
Jeong
Core Experiment on Enhanced Mode
Transitions in USAC
18394 Seo-Young Hwang, Sungryeul Rhyu, Hojin Ha
Content Insertion on DASH
18395 fons bruls
3DV EE4 Results on Mobile
43
18396 Toru Chinen, Masayuki Nishiguchi
Sony listening test report on pulse indexing in
USAC
18397 fons bruls
Comments on Requirements and Draft Call for
Proposal on 3D Video Coding
18398 Toru Chinen, Masayuki Nishiguchi
Sony listening test report on improved SBR in
USAC
18399
Toru Chinen, Yuki Yamamoto, Mitsuyuki Hatanaka,
Masayuki Nishiguchi
Report on PVC CE for SBR in USAC
18400 Thomas Stockhammer
Editor's Proposed Updates on DASH Working
Draft
18401 Thomas Stockhammer
Adding Minimum RAP Frequency to MPD
18402
Reference Software for DASH
18403 Thomas Stockhammer
Reference Software for DASH
18404
Junaid Jameel Ahmad, Shujun Li, Marco Mattavelli,
Matthieu Wipliez, Mickaël Raulet
Crypto Tools Library (CTL): Applying RVC-CAL
for Multimedia Security Applications
18405
Helmut Bürklin, Stéphane Gouache, Truong
Cong Thang, Jin Young Lee
Extensions to Media Presentation Description
to support HTTP Streaming from multiple
servers
18406 Ferenc Kraemer, Heiko Purnhagen
Proposed addition to Intensity Stereo in
ISO/IEC 14496-26:2009, Audio conformance
18407 Kim Pelsor, Jamie Marshall,
Use Case for Collaborative BiFS Scenes :
Distance Learning
18408 Dr Michael McGannon, Jamie Marshall,
Use Case for Collaborative BiFS Scenes :
Proactive Medicine
18409
Fritz-Joachim Westphal, Mario Kind, Jamie Marshall, Use Case for Collaborative BiFS Scenes :
Mihai Mitrea,
Consultations without Walls
18410
Jae Joon Han, SeungJu Han, Won-Chul Bang,
James D.K. Kim, Sang-kyun Kim, YongSoo Joo
18411 T. Davies, D. Flynn (BBC)
Sensed Information for gas and dust sensors
TE12.4 Cross-check of unified MDDT/ROT
18412
Mihai Mitrea, Jamie Marshall, Françoise Preteux,
Pieter Simoens, Bart Dhoedt, ,
Use Case for Collaborative BiFS Scenes : A
day in a Professor's life
18413
Achim Kuntz, Sascha Disch, Erik Schuijers, Werner
Oomen
CE proposal on improved applause coding in
USAC
Jin Young Lee, Kwang-deok Seo, Hyung Jung Kim,
18414 Truong Cong Thang, Jung Won Kang, Seong-Jun
Bae, Soon-heung Jung, Sang Taick Park
Scalable Random Access for SVC based
DASH
Jin Young Lee, Euy-doc Jang, Jae-Gon Kim, Truong
Virtual Segmentation of TS Packetized Video
18415 Cong Thang, Jung Won Kang, Seong-Jun Bae,
using Key-frame Information
Soon-heung Jung, Sang Taick Park
Jin Young Lee, Euy-doc Jang, Jae-Gon Kim, Truong
TS Header Extension for Efficient Adaptation in
18416 Cong Thang, Jung Won Kang, Seong-Jun Bae,
the HTTP Streaming
Soon-heung Jung, Sang Taick Park
Jamie Marshall, Mihai Mitrea, Bojan Joveski,
Ludovico Gardenghi, Françoise Preteux,
Use Cases for Collaborative BiFS Scenes :
Overview
Kwang-deok Seo, Sang-woo Shim, Jin Young Lee,
18418 Truong Cong Thang, Jung Won Kang, Seong-Jun
Bae, Soon-heung Jung, Sang Taick Park
File Format Adaptation Based on HTTP Hint
Track for HTTP Streaming
Zhao Dan, Zhong Haishan, Chong Kok Seng,
18419 Takeshi Norimatsu, Tomokazu Ishikawa, Neo Sua
Hong
Status report of time warping CE in USAC
18420 Dmytro Rusanovskyy, Miska M. Hannuksela
3DV EE4 Results on Pantomime sequence
18417
44
3D Video Coding Results of Exploration
Experiments (Book Arrival)
18421 Gerhard Tech, Karsten Müller
18422
Bojan Joveski, Ludovico Gardenghi, Jamie Marshall, Image Transmission for Mobile Thin Clients : A
Mihai Mitrea, Françoise Preteux
Comparative Study
UK National Body comments on HEVC
Timescales
18423 UKNB of WG11
18424 David Fuentes Sánchez, Francisco Morán Burgos Initial ideas for texturing support in MR3DMC
18425 fons bruls
Philips (in coop with 3D4YOU) response to
request for HQ stereo content for MFC AHG
Andreas Hölzer, Christian Ertel, Markus
18426 Lohwasser, Michael Härtl, Ferenc Kraemer, Frans
de Bont
Additional information on MPEG Surround
conformance testing
18427 fons bruls
MFC initial results & findings
18428 Jeongook Song, Eunwoo Song, Henney Oh
Yonsei Crosscheck listening test report on
additional bandwidth extension CE for USAC
18429 Jeongook Song, Henney Oh, Hong-Goo Kang
Status report on USAC Reference Software
JAME
18430 Jeongook Song, Henney Oh, Hong-Goo Kang
Updated CE on enhanced long term predictor
(eLTP) for USAC
18431 Stephan Wilde, Max Neuendorf
FhG Listening Test Report – improved SBR
18432
Stephan Wilde, Markus Multrus, Max Neuendorf,
Kristofer Kjörling, Heiko Purnhagen
Status update of CE proposal on increased
structural flexibility in SBR
Julien Robilliard, Matthias Neusinger, Johannes
18433 Hilpert, Erik Schuijers, Bert den Brinker, Werner
Oomen
Proposed decorrelator improvements in USAC
18434 Gregory Pallone, Pierrick Philippe
Report on USAC performance
18435 Philippe Gournay, Roch Lefebvre
VoiceAge listening test results for the
enhanced bass-postfilter CE
Jonas Engdegård, Heiko Purnhagen, Oliver
Hellmuth, Jürgen Herre, Cornelia Falch, Leon
18436
Terentiv, Maria Luis Valero, Johannes Hilpert,
Andreas Hölzer, Werner Oomen
Report on corrections for MPEG SAOC
Jonas Engdegård, Heiko Purnhagen, Oliver
Hellmuth, Jürgen Herre, Cornelia Falch, Leon
18437
Terentiv, Maria Luis Valero, Johannes Hilpert,
Andreas Hölzer, Werner Oomen
Revised draft of SAOC verification test report
18438
Jean-Claude Dufourd, Jean Lefeuvre, Cyril
Concolato, Kyungmo Park, Jaeyeon Song
Study of FCD of 23007-3
Truong Cong Thang, Jin Young Lee, Jung Won
18439 Kang, Seong-Jun Bae, Soon-heung Jung, Sang
Taick Park
Contribution to DASH EE#2
Truong Cong Thang, Jin Young Lee, Jung Won
18440 Kang, Seong-Jun Bae, Soon-heung Jung, Sang
Taick Park
Use Cases of Multiple Byte Ranges for DASH
Truong Cong Thang, Jin Young Lee, Jung Won
18441 Kang, Seong-Jun Bae, Soon-heung Jung, Sang
Taick Park,
Comments on the Use of RAP in Segments
18442 Thorsten Lohmar, Per Fröjdh
Test sequences in DASH format
18443 Per Fröjdh, Torbjörn Einarsson
DASH File Format and MPD Considerations
18444 Christopher Müller, Christian Timmerer,
Proposal for using MPEG-21 Digital Item
Declaration and W3C XInclude/XLink for Media
Presentation Description
45
Truong Cong Thang, Jin Young Lee, Jung Won
18445 Kang, Seong-Jun Bae, Soon-heung Jung, Sang
Taick Park,
Proposal on Signaling for DASH
18446 Thomas Rusert, Per Fröjdh
Compression Efficiency Evaluation of FrameCompatible Stereoscopic Video Coding with
Resolution Enhancement
18447 Thomas Rusert, Per Fröjdh
Comments on Coding Conditions in Call for
Proposals on 3D Video Coding Technology
18448
Sanbao Xu, Martin Pettersson, Thomas Rusert,
Mats Folkesson
Evaluation of Subjective Test Methodologies
for 3D Video
18449 Markus Waltl, Christian Timmerer
Proposed changes for validation rules of
MPEG-V Part 7
18450 Guillaume Fuchs, Markus Multrus, Max Neuendorf
Proposed unified global gain syntax element in
USAC
18451 Guillaume Fuchs, Max Neuendorf
FhG listening test report on CE on improving
the USAC bass post-filter
18452 Stefan Bayer
Completion of the Core Experiment on
Reducing the Complexity of the USAC Time
Warping
Jaime Delgado, VÃctor RodrÃguez-Doncel,
18453 Francesco Gallo, Elisa Todarello, Annarita Di Carlo,
Walter Allasia, Eva RodrÃguez
Results of the Core Experiment for Contracts
Mapping
18454
Vignesh Subbaraman, Markus Multrus, Kihyun
Choo,
Informative Encoder Description for USAC
Improved Noiseless Coding CE
18455 Frederik Nagel
FhG Listening Test Report – PVC for SBR
envelope coding
18456 Max Neuendorf
Corrections to Reference Software and CD of
USAC
18457 Zhijie Zhao, Marco Munderloh, Joern Ostermann
MPEG-2 TS RAP indexing for DASH
Jorn Janneck, Marco Mattavelli, Shujun Li, Johan
Eker, Carl Von Platen, Mickael Raulet, Ghislain
18458
Roquier, Matthieu Wipliez, Ihab Amer, Christophe
Lucarz, Junaid Jameel Ahmad
Writing dataflow networks components (FUs)
with different models of computations using
RVC-CAL: a tutorial
18459 Jeff Huang
Crosscheck listening test report for USAC on
FFT and QMF harmonic transposers
Jorn Janneck, Marco Mattavelli, Shujun Li, Mickael
Raulet, Matthieu Wipliez, Johan Eker, Carl Von
18460
Platen, Ghislain Roquier, Ihab Amer, Christophe
Lucarz, Pascal Faure, Junaid Jameel Ahmad
A proposal of RVC-CAL extensions for
improved support of I/O processing
18461 Jeff Huang
Crosscheck listening test report for USAC on
time frequency domain post-processing
18462 Miska M. Hannuksela
Contribution to DASH EE#5: MPEG2-TS
Reception Hint Track
18463 Miska M. Hannuksela, Imed Bouazizi
Contribution to DASH EE#7:
bitstreamStructureId
18464 Miska M. Hannuksela, Imed Bouazizi
Contribution to DASH EE#9: MPD Indication for
File Conversion of Scalable Streams
18465 Miska M. Hannuksela
Contribution to DASH EE#9: File Construction
Instruction Format
18466 Kimitaka Tsutsumi, Kei Kikuiri, Nobuhiko Naka
NTT DOCOMO Cross-check Report on
Improved Harmonic Transposer in USAC
18467 David Virette
Report on cross-check listening test for the CE
46
on improved Bass-post filter for USAC
18468 David Virette
Report on cross-check listening test for the
CEs on QMF based harmonic transposer and
improved harmonic transposer in USAC
18469 David Virette, Dejun Zhang, Fuwei Ma
Finalization of Enhanced Pulse Indexing CE for
ACELP in USAC
18470 David Virette, Lijing Xu, Wei Xiao
A new signal classifier for USAC reference
encoder
18471 David Virette, Wei Xiao
Finalization of CE on adaptive T/F domain
post-processing for USAC
18472 David Virette, Wei Xiao
Progress report on additional bandwidth
extension CE for USAC at low bit rates
18473 Kei Kikuiri, Atsushi Yamaguchi, Nobuhiko Naka
Educational Information on Encoder
Implementation of inter-TES tool in USAC
18474 David Virette
CE proposal on improved stereo coding at low
bit rates
18475 Ying Chen, Marta Karczewicz
Comments on Scalable Resolution
Enhancement of Frame-Compatible
Stereoscopic 3D Video
18476 Michael Grafl, Christian Timmerer
Input on w11413 (WD of ISO/IEC 23006-4 2nd
edition Elementary Service Protocols)
18477 Michael Grafl, Christian Timmerer
Input on w11414 (WD of ISO/IEC 23006-5
Service Aggregation)
18478 Walt Husak, Ajay Luthra, Gary Sullivan
Use cases and requirements for scalable
resolution enhancement of frame-compatible
stereoscopic 3D video
18479
Markus Multrus, Philippe Gournay, Nikolaus
Rettelbach, Bruno Bessette, Bernhard Grill
Proposed Core-Experiment on Enhancing the
USAC Codec at Mid Bitrates
18480
Max Neuendorf, Markus Multrus, Stefan Doehla,
Werner Oomen, Heiko Purnhagen
Comments on further USAC investigation
18481 Bruno Bessette, Philippe Gournay, Roch Lefebvre
Proposed CE for extending the LPD mode in
USAC
18482 David Furbeck
DASH MPD Delta Files
18483 Xin Wang, Shaobo Zhang
Content Protection and DRM in DASH
18484 Zhang Shaobo, Xin Wang
Signaling for Seamless Switching for DASH
18485 Yuriy Reznik
thoughts on next steps for CDVS project
18486
Koohyar Minoo, Vivian Kung, David Baylon, Krit
Panusopone, Ajay Luthra, Jae Hoon Kim
On scalable resolution enhancement of framecompatible stereoscopic 3D video
18487
Seong Yong Lim, Jihun Cha, Injae Lee, Young-kwon Definition and application domains of new
Lim, Joong Yun Lee
sensors
18488
Seong Yong Lim, Jihun Cha, Injae Lee, Young-kwon Sensor capability and preference of new
Lim, Joong Yun Lee
sensors
18489
Seong Yong Lim, Jihun Cha, Injae Lee, Young-kwon
Binary representation of new sensors
Lim, Joong Yun Lee
18490
Seong Yong Lim, Jihun Cha, Injae Lee, Young-kwon
Proposed mechanism to support AUI interface
Lim
18491 V. Sze, M. Budagavi (TI)
High-efficiency entropy coding simplifications
18492 Jaewon Sung, Byeong-Moon Jeon
Comments on depth data of 3DV test sequence
18493 David Singer
On decode times in movie fragments
18494 Mark Watson, Thomas Stockhammer
DASH: Considerations on Overlap during
switching
47
18495 Mark Watson
DASH: URL selection
18496 Mark Watson, Thomas Stockhammer
DASH EE#7 – Segment Index
18497 watsonm@netflix.com
DASH: Considerations on Segment Duration
18498 Mark Watson
Input for DASH EE#1 (CMP): Pixel Aspect
Ratio
18499 Shinobu Hattori, Teruhiko Suzuki,
Comments on Requirements and Call for
Proposal on 3D Video Coding Technology
18500 Teruhiko Suzuki, Ikuo Tsukagoshi, Takahiro Nishi
Comments on scalable enhancement of frame
compatible stereo 3D
Zhong Haishan, Chong Kok Seng, Zhao Dan,
18501 Takeshi Norimatsu, Tomokazu Ishikawa, Neo Sua
Hong
Panasonic crosscheck report on improved
harmonic transposer
18502 Rob Glidden, Cliff Reader, Gerard Fernando
Option-1 (royalty-free) video codec industry
considerations
18503 Next Generation Broadcasting Forum (Korea)
Proposed Text of ISO/IEC 23000-11/FDAM1
Stereoscopic Video AF Conformance and
Reference Software
18504 Doug Young Suh, Kyungmo Park, Sungryeul Rhyu
DASH: Field data for quantitative evaluation
18505 Wonsuk Lee, Seungyun Lee
Java implementation for generic metadata APIs
on MXM
18506 Olgierd Stankiewicz, Krzystof Wegner
Frame range extension of Poznan Street and
Poznan Carpark sequences (3DV/EE1)
18507 Sanghyun Joo (joos@etri.re.kr)
Additional type for Virtual Object
18508 Sanghyun Joo (joos@etri.re.kr)
Descriptions for social network services
18509 Sanghyun Joo (joos@etri.re.kr)
Minor changes in MPEG-V Part 4
18510
Jae Joon Han, SeungJu Han, Won-Chul Bang,
James D.K. Kim, Sang-Kyun Kim, YongSoo Joo
sensor capability for gas and dust sensors
18511 Cheon Lee, Yo-Sung Ho
3DV EE1 Results on Newspaper
18512 Cheon Lee, Yo-Sung Ho
EE1 Results on Café
18513 Min-Koo Kang, Cheon Lee, Yo-Sung Ho
3DV EE4 Results on Newspaper
18514 Cheon Lee, Yo-Sung Ho
Common-hole Filling for Boundary Noise
Removal in VSRS
18515 Min-Koo Kang, Cheon Lee, Yo-Sung Ho
3DV EE4 Results on Cafe
18516 P. Chen, L. Guo, M. Karczewicz (Qualcomm)
Geometry motion partition
18517 T. Nguyen (Fraunhofer HHI)
Improved intra smoothing for UDI and new AIS
fast mode
18518
Qiuwen Zhang, Ping An, Yan Zhang, Ran Ma, Qian
Zhang, Zhaoyang Zhang
A Depth Compression for MVD View Synthesis
18519
Qiuwen Zhang, Ping An, Qian Zhang, Yan Zhang,
Zhaoyang Zhang
Enhancement of Depth Estimation
18520 Feng Shao, Gangyi Jiang, Mei Yu, Xu Wang
Stereoscopic video coding with asymmetric
luminance and chrominance qualities
18521 Kyong-Sok Seo, Byeong-Doo Choi, Dae-Sung Cho
Comments on Scalable resolution
enhancement of frame-compatible stereoscopic
3D video
18522 E. Francois, P. Bordes (Technicolor)
Geometry adaptive block partitioning (GEO)
cross-check
18523 Gaëlle Martin-Cocher
Requirements for LCVC
18524 Zhijie Zhao, Marco Munderloh, Joern Ostermann
Contribution to EE#2 MCE: Extension and
Modification
48
18525
Depth map enhancement for view synthesis
18526 sergio matone
Comments on WorkingDraft 23006-4
18527 sergio matone
Comments on WorkingDraft of 23006-5
18528 sergio matone
Contribution for 23006-1
18529 Mattia Donna Bianco
Proposal of an hierarchical representation of a
LASeR scene
18530 sergio matone
Comments on Negotiate ES
18531 Vijay Chandrasekhar, Bernd Girod
The Stanford Mobile Visual Search Data set
Tomasz Zernicki, Maciej Bartkowiak, Marek
Domanski
Telcordia and PUT listening test results for CE
on improved tonal component coding in eSBR
(USAC)
18535
Qian Zhang, Ping An, Qiuwen Zhang, Ran Ma,
Zhaoyang Zhang
Depth map enhancement for view synthesis
18536
Jae Joon Han, SeungJu Han, Won-Chul Bang,
James D.K. Kim, Sang-Kyun Kim, Yong Soo Joo
Update of Reference SW for MPEG-V Part 4
18537
H.-Joong Kim, X. Qu (Korea Univ.), W.-J. Han
(Samsung)
Showing the possibility of fast CABAC
18538
Miran Choi, MyungGil Jang, Chang-ki Lee, HyunKi
Kim,
AIT Elementary Service - Search License
18539
Miran Choi, Hyojeong Oh, Jeong Hur, Soojong Lim,
Yeochan Yun
AIT Elementary Service - Search Contract
18532
18533
18534
18540 Giovanni Cordara
Summary of discussions on CDVS databases
and software
18541 Tao Chen, Yoshiichiro Kashiwagi, Takahiro Nishi
Subjective Picture Quality Evaluation of MVC
Stereo High Profile for Full-Resolution
Stereoscopic HD Video Applications
18542
Rongrong Ji, Lingyu Duan, Tiejun Huang, Hongxun
Yao, Wen Gao
A Case Study for Compact Descriptors for
Visual Search - Location Discriminative Mobile
Landmark Search
18543
Qifei Wang, Kun Xu, Qian Ma, Xiangyang Ji,
Qionghai Dai
3DV EE1 Results on “Poznan_Carpark―
Test Sequence
TE7: Results for Simplification of MDDT
Transform
18544 W. Ding, Y. Shi, B. Yin (Beijing Univ. Tech)
18545 Cyril Concolato, Romain Bouqueau, Jean Le Feuvre Seamless switching for TS in DASH
18546 H. Schwarz, D. Marpe, T. Wiegand (Fraunhofer HHI)
18547
Mariam Saleh, Mickaël Raulet, Ihab Amer, Marco
Mattavelli
Investigations for representing rectangular
blocks using the merging concept
RVC-CAL non BTYPE Version
18548 J. Zan, J. Meng, M. T. Islam, D. He (RIM)
Cross Verification of Low Complexity Adaptive
Coefficient Scanning
18549 Mark Watson
DASH – Quality of Experience and
performance metrics
18550 TTA (Korea) via SC 29 Secretariat
Liaison Statement from TTA (Korea)
18551
Mark Watson, Thomas Stockhammer, Kilroy
Hughes,
DASH: MPD Representation Groups and MPD
Fragmentatio
Experimental Results for Inter-View Direct
Mode with Decompressed Disparity Maps for
Multiview Video Coding
18552 Jacek Konieczny, Marek Domański
49
18553 danny.cohen@oracle.com Danny Cohen
Evaluation of the OMS Video Encoding
18554 Gary J. Sullivan, Jens-Rainer Ohm,
Meeting report of the 2nd meeting of the Joint
Collaborative Team on Video Coding (JCTVC), Geneva, CH, 21-28 July, 2010
18555
Seung Wook Lee, ChangHwa Lyou, SangKwon
Jeong
Notification of the change of Forum Name of
Category C liaison partner
18556 K. Sato (Sony)
Cross verification of MV Coding Proposal by
JCTVC-C293
18557 T. Suzuki (Sony)
Cross verification of DCT-IF for chroma
18558 David Singer, Alex Giladi
Usage of MPEG-2 Transport Streams
18559 Alex Giladi
Segment Index Syntax for MPEG-2 TS
18560 alex.giladi@gmail.com
Comments on MPEG-2 Transport Streams in
DASH
18561 Jaime Delgado, Victor Rodriguez-Doncel
Proposal of initial CEL Working Draft
18562 hj08.lee@lge.com, w.zia@lglab.eu
LGE proposal for PIP attributes for CMP
18563
Sang-il Na, Weon-Geun Oh, Kyung-Ho Choi, DongSeok Jeong
Database contribution for CDVS
18564
Weon-Geun Oh, Sang-il Na, Jun-Woo Lee, DongSeok Jeong
Service scenario of CDVS
18565 Zhijie Zhao, Yanzi Yang
Support of virtual/non- segmentation in DASH
18566 H J Lee, w.zia@lglab.eu
DASH DTS M2TS Compression gzip
comparison
18567 H J Lee, w.zia@lglab.eu
A few comments on LGE proposal about
delivery of MPEG-2 TS
18568 R. Cohen (Merl), C. Yeo (I2R), R. Joshi (Qualcomm) TE7: Summary report for MDDT Simplification
18569
B. Bross, H. Kirchhoffer, H. Schwarz, T. Wiegand
(Fraunhofer HHI)
Fast intra encoding for fixed maximum depth of
transform quadtree
Jianwen Chen, Yaocheng Rong, Feng Xu, Yan Xu,
18570 Yun He, Xingguo Zhu, Lu Yu, Zhihang Wang, Li
Zhang, Cliff Reader, Siwei Ma, Wen Gao
Response to Call for Evidence on Option-1
Video Coding Technology
18571 W.-J. Han (Samsung)
Cross-verification of JCTVC-C311: Fast intra
encoding for fixed maximum depth of transform
quadtree
18572
Y. Zheng, R. Joshi, M. Coban, M. Karczewicz
(Qualcomm)
TE8: Crosscheck Result of the Transcoder of
JCTVC-B034 for V2V Entropy Coding in HEVC
18573 J. Kim, J. Park, B. Jeon (LG Electronics)
Verification results of Samsung's proposal
JCTVC-C206
18574 Yiping Chen
Response to the AIT 2nd CfP: Authenticate
User and Identify User
18575
Dr. Ken Lunde (Adobe Systems Inc.), Tony Tseung
(Apple Inc.), Julio Gonzalez (Apple Inc.)
Response to the Call for Proposals on
Composite Font Standard
18576 Walt Husak
3D Test Shoot
18577 Alexandros Tourapis
Comments on m17925
18578 Khaled Mamou, Minsu Ahn, Xiangyang Ji
Ad Hoc Group on Multi-Resolution 3D Mesh
Coding
18579 Tiejun Huang for CNNB
Comments on HEVC Test Model
18580 Tiejun Huang for CNNB
Comments on Option-1 Video coding for
Internet Application
18581 D. Hoang (Zenverge)
On the computation of PSNR and BD-Rate
18582 Shehrzad Qureshi, Steven Saunders, John Ralston
Dynamic computational complexity metric for
50
video codec evaluation
18583 T. Yamakage (Toshiba)
Verification Results of Simplified Intra
Smoothing (JCTVC-C234)
18584 K. Ugur (BoG coordinator)
Breakout Report on Interpolation Filters
18585 Jean H.A. Gelissen, Herman Tuininga
Additional input for Sensor / Actuator
Integration use case
18586 R. Sjöberg (Ericsson)
TMuC text on max CU size seems broken
18587
18588
Sang-il Na, Weon-Geun Oh, Ju-Kyong Jin, DongSeok Jeong
The process of Test Model development for the
HEVC initiative
18589 G. Sullivan, J.-R. Ohm (chairs)
18590
Fast Matching for the MPEG-7 Video Signature
Tools
T. Wiegand, H. Schwarz, B. Bross, A. Fuldseth, X.
Wang, W.-J. Han (BoG coordinators)
BoG report: residual quadtree structure
18591 Sangyoun Lee, Seongwan Kim, Jaeho Lee
Cross verification for ETRI proposal
18592 Helmut Bürklin, Mark Watson
Update on Media Presentation Description to
support HTTP Streaming from multiple servers
18593 JTC 1/SC 6 via SC 29 Secretariat
Liaison Statement from JTC 1/SC 6
18594
K. Sühring, D. Flynn, F. Bossen (on behalf of JCTHEVC software guidelines
VC)
18595 Miska Hannuksela, Imed Bouazizi
Clock Drift Compensation
18596 Per Fröjdh, Thomas Stockhammer
Support for trick modes in DASH
18597 K. McCann (TE coordinator)
TE12: Summary of results of evaluation of
TMuC tools in TE12
18598
Karsten Grüneberg, Ying Chen, Yasuaki Tokumo,
3D Video Support in DASH
Thomas Schierl
18599 Mark Watson
Evaluation Experiment on HTTP Streaming and
Quality of Experience metrics
18600 J. Jung, B. Bross (CE coordinators)
Description of Core Experiment 9: Motion
Vector Coding
18601 R. Cohen, C. Yeo, R. Joshi (CE coordinators)
Description of Core Experiment 7: Alternative
Transforms
18602 E. Francois, P. Chen, X. Zheng (CE coordinators)
Description of Core Experiment 2: Flexible
Motion Partitioning
18603
E. Alshina, Y.-L. Lee, P. Chen, H. Lakshman (CE
coordinators)
Description of Core Experiment 4: Interpolation
for MC (Chroma)
18604
Y.-J. Chiu, H. Yu, Y.-W. Huang, S. Sekiguchi (CE
coordinators)
Description of Core Experiment 1: DecoderSide Motion Vector Derivation
18605 W.-J. Chien (CE coordinator)
Description of Core Experiment 12: Adaptive
Motion Vector Resolution
18606 K. Sugimoto (CE coordinator)
Description of Core Experiment 10: Number of
Intra Prediction Directions
18607 F. Bossen (on behalf of JCT-VC)
Common test conditions and software
reference configurations
18608 A Tabatabai (CE coordinator)
Description of Core Experiment 6: Intra
Prediction Improvement
18609 X. Wang (CE coordinator)
Description of Core Experiment 5: Low
Complexity Entropy Coding Improvements
18610
V. Sze, K. Panusopone, J. Chen, T. Nguyen, M.
Coban (CE coordinators)
51
Description of Core Experiment 11: Coefficient
Scanning and Coding
18611 M.Coban, B. Bross, J. Chen (CE coordinators)
Description of Core Experiment 13: Intra
Smoothing
T. Yamakage, T. Chujoh, Y. W. Huang, K. Chono,
Description of Core Experiment 8: In-loop
18612 I.S. Chong, T. Yamamoto, J. Lim, B. Jeon, J. Xu, M.
filtering
Narroschke (CE coordinators)
Description of Core Experiment 3: Interpolation
Filtering for MC (Luma)
18613 T. Chujoh (CE coordinator)
18614
Marco Mattavelli, Mickael Raulet, Shujun Li, Jorn
Janneck
Proposed requirements for RVC and RGC
specification languages and solution
18615
Iain-James MARSHALL, Mihai MITREA, Bojan
JOVESKI
Technologies under Consideration for
Collaborative Applications by using MPEG
scene description standards
18616
T. K. Tan, G. J. Sullivan, J.-R. Ohm (on behalf of
JCT-VC)
Summary of HEVC working draft 1 and HEVC
test model (HM)
18617 G. J. Sullivan, J.-R. Ohm (chairs)
Meeting report of the third meeting of the Joint
Collaborative Team on Video Coding (JCTVC), Guangzhou, CN, 7–28 October, 2010
18618 K. McCann, B. Bross, S. Sekiguchi (editors)
High Efficiency Video Coding (HEVC) Test
Model 1 (HM 1) Encoder Description
18619
T. Wiegand, W.-J. Han, J.-R. Ohm, G. J. Sullivan
(editors)
Thomas Stockhammer, Per Fröjdh, Iraj Sodagar,
18620 Sungryeul Rhyu
52
High Efficiency Video Coding (HEVC) text
specification Working Draft 1
Draft CD as submitted to Secretary of ISO/IEC
JTC 1/SC 29: Text of ISO/IEC 23001-6:
Dynamic adaptive streaming over HTTP
(DASH)
Annex D – Output documents
No.
Title
11552 List of Documents from the 94th Meeting in Guangzhou, China
11553 Resolutions of the 94th Meeting in Guangzhou, China
11554 List of AHGs Established at the 94th Meeting in Guangzhou, China
11555 Report of the 94th Meeting in Guangzhou, China
11556 Guidelines for Electronic Distribution of MPEG M and N Documents
11557 Press Release of the 94th Meeting in Guangzhou, China
11558 Meeting Notice of the 95th Meeting in Daegu, Korea
11559 Guide for WG 11 Meeting Hosts
11560
Study of ISO/IEC 13818-1:2007/FPDAM 6 Extension to AVC video descriptor and signaling of
operation points for MVC
11561 List of errata items of ISO/IEC 13818-1:2007
11562
WD of ISO/IEC 13818-1:2007/AMD 7 Signaling of frame compatible and service compatible
stereoscopic video
11563 Request for ISO/IEC 14496-4:200x/AMD XX Conformance for BIFS ExtendedCore2D Profile
11564 Text of ISO/IEC 14496-4:200x/PDAM XX Conformance for BIFS ExtendedCore2D Profile
11565 DoC on ISO/IEC 14496-5:2001/FPDAM 28 Reference software for LASeR adaptation tools
11566 Text of ISO/IEC 14496-5:2001/FDAM 28 Reference software for LASeR adaptation tools
11567 Request for ISO/IEC 14496-5:200x/AMD XX Reference software for BIFS ExtendedCore2D Profile
11568 Text of ISO/IEC 14496-5:200x/PDAM XX Reference software for BIFS ExtendedCore2D Profile
11569 Technologies under Consideration for use of BIFS on Collaborative Application
11570 Request for ISO/IEC 14496-12:2008 AMD 3 DASH Support and RTP reception track processing
11571 Evaluation Report
11572 WD of ISO/IEC 14496-28 Composite Font
11573 WD of ISO/IEC 21000-20 Contract Expression Language
11574 Text of ISO/IEC 23000-11/FDAM 1 Stereoscopic Video AF Ref. Soft and Conf.
11575 Study of ISO/IEC 23000-12 FPDAM 1 Conformance and Reference Software
11576 Workplan of ISO/IEC 23000-12 Conformance and Reference Software
11577 Request for the subdivision of ISO/IEC 23001-6
11578 Text of ISO/IEC CD 23001-6 Dynamic Adaptive Streaming over HTTP
11579 Description of Evaluation Experiments on Dynamic Adaptive Streaming over HTTP
11580 Workplan of Reference Software for DASH
11581 Text of ISO/IEC CD 23001-1 2nd edition Multimedia Service Platform Technologies - Architecture
11582 Text of ISO/IEC CD 23006-2 2nd edition MPEG Extensible Middleware API
11583 Text of ISO/IEC CD 23006-4 2nd edition Elementary Services
11584 Text of ISO/IEC CD 23006-5 Service Aggregation
11585 Text of ISO/IEC 23007-1 PDAM 1 Widget Extensions
11586 Text of ISO/IEC 14496-12:2008 PDAM 3 DASH Support and RTP reception track processing
11587 Study of ISO/IEC FCD 23007-3 Conformance and Reference Software
11588 WD of ISO/IEC 23005-1 AMD 1 Additional Use Cases
11589 WD of ISO/IEC 23005-2 AMD 1 Additional Control Information and binary representation
53
11590 WD of ISO/IEC FDIS 23005-3 AMD 1 Binary representation of sensory information
11591 WD of ISO/IEC FDIS 23005-4 AMD 1 Binary representation of Virtual World Object Characteristics
11592
WD of ISO/IEC FDIS 23005-5 AMD 1 Additional sensors, interaction devices and binary
representation
11593 WD of ISO/IEC FDIS 23005-6 AMD 1 Binary representation of common types and tools
11594 Study of ISO/IEC FCD 23005-7 Conformance and Reference Software
11595 Storyboard for MPEG-U promotion video
11596 Short descriptions
11597 Guide to the MPEG Subversion Repository
11598 Request for ISO/IEC 23007-1 AMD 1 Widget Extensions
11599 Description of CE on dynamic volume change
11600 Terms of Reference
11601 MPEG Standards
11602 Unpublished standards at FDIS level
11603 MPEG work plan and time line
11604 MPEG Standard Editors
11605 Schema assets
11606 Software assets
11607 Conformance assets
11608 Content assets
11609 URI assets
11610 Standards under development for which a call for patent statements is issued
11611 List of Organisations with which MPEG entertains liaisons
11612 Request for ISO/IEC 13818-2:2000/Amd.4
11613 Text of ISO/IEC 13818-2:2000/PDAM4 Frame Packing Arrangement Signalling in MPEG-2 Video
11614 Text of 14496-5:2001/Amd.5 DCOR1
11615 Defect report on ISO/IEC 14496-10:201x
11616 Text of ISO/IEC 14496-10:201x/PDAM1 Progressive High Profile
11617 DoC on ISO/IEC 15938-6:2001/PDAM4
11618 Text of ISO/IEC 15938-6:2001/FPDAM4 Reference Software for Video Signature Tools
11619 DoC on ISO/IEC 15938-7:2001/PDAM6
11620 Text of ISO/IEC 15938-7:2001/FPDAM6 Conformance for Video Signature Tools
11621 Study Text of ISO/IEC 15938-8/PDAM6 Extraction and Matching of Video Signature Tools?
11622 Text of ISO/IEC 23001-4/DCOR1
11623 Working Draft of ISO/IEC 23001-4/Amd.1
11624 Disposition of Comments on ISO/IEC 23002-4/FPDAM1
11625 Text of ISO/IEC 23002-4/FDAM1 Software and Conformance Testing for Video Tool Library
11626 AHG on MPEG-7 Visual
11627 WD of ISO/IEC 23002-4/Amd.3 Functional Units for SVC Profiles
11628 Description of Core Experiments in RVC
11629 Test Model 1 of High Efficiency Video Coding
11630 Description of Exploration Experiments in 3D Video Coding
11631 Report on Experimental Framework for 3D Video Coding
11632 Liaison letter template on DASH
54
11633
Liaison letter template regarding Multimedia Service Platform Technologies (Formerly Advanced
IPTV Terminal)
11634 Liaison statement template on signaling of stereoscopic video in MPEG-2 Systems
11635 Liaison statement to SC 6 on MMT
11636 Liaison statement to 3GPP TSG-SA4 on DASH
11637 Liaison statement to JTC 1/WG 7 on Sensor Networks
11638 Liaison statement to IEC TC 100 on the definition of “trick mode―
11639 Liaison statement to JTC 1/SC 24 on “zip― format
11640 Liaison statement to IEC TC 100 on IEC CDV 62516-2, using BIFS in T-DMB receiver
11641 Liaison statement to IDPF ePub WG on Richmedia technologies
11642 Process of Test Model Development for HEVC
11643 Working Draft 1 of High Efficiency Video Coding
11644 AHG on Reconfigurable Video Coding
11645 Status of HEVC reference software copyright management in JCT-VC
11646 AHG on Scene Representation
11647 AHG on MPEG File Formats
11648 AHG on Application Format
11649 AHG on Font Format Representation
11650 AHG on Advanced IPTV Terminal
11651 AHG on MPEG-V
11652 AHG on Contract Expression Language
11653 AHG on Dynamic Adaptive Streaming over HTTP
11654 ISO/IEC 13818-4:2004 DCOR 2, AAC Intensity Stereo
11655 ISO/IEC14496-26:2010/Cor 2 BSAC Corrections
11656 ISO/IEC 14496-26:2010/DCOR 3, Intensity Stereo
11657 Report on Spatial Audio Object Coding Verification Tests
11658 Defect Report of ISO/IEC 23003-2:2010, Spatial Audio Object Coding
11659 Study on ISO/IEC 23003-3:201x/CD of Unified Speech and Audio Coding
11660 Status and Workplan for USAC CEs
11661 Audio for High Efficiency Video Coding
11662 AHG on SAOC, USAC, and Audio and Systems Interactions
11663 Statement of benefits from establishing a Category C liaison with DTG
11664 Description of CE on Contract Expression Language
11665 Description of 3DG Core Experiments
11666 Working Draft 3.0 on Multiresolution Mesh Compression
11667 Working Draft 3.0 on Reconfigurable Graphics Coding
11668 Reconfigurable graphics coding tutorial v2.0
11669 Text of 2nd Edition of ISO/IEC 14496-25 FDIS 3DG compression model
11670 AHG on 3DG documents, software maintenance and core experiments
11671 AHG on Multi-Resolution 3D mesh Coding
11672 Standardisation projects and facilities fees
11673 Low-Complexity Video Coding – Draft Problem Statement
11674 Compact Descriptors for Visual Search: Draft Call for Proposals
11675 Compact Descriptors for Visual Search: Evaluation Framework (draft)
55
11676 Call for Evidence on Option-1 Video Coding Technology
11677 ISO/IEC 15938-9:2005/PDAM 1
11678 Applications and Requirements on 3D Video Coding
11679 Draft Call for Proposals on 3D Video Coding Technology
11680 Liaison letter to ITU-T SG 16 on Telepresence Systems
11681 Requirements for MPEG Frame-Compatible Enhancement (MFC)
11682 Test Conditions and Evaluation Process for MFC
11683 Liaison statement template on MPEG frame-compatible activities
11684 AHG on scalable resolution enhancement of frame-compatible stereoscopic 3D video
11685 AHG on Evaluation of Responses to the Call for Evidence
11686 AHG on Lightweight Video Coding
11687 AHG on 3D Video Coding
11688 AHG on Compact Descriptors for Visual Search
11689 AHG on MPEG Media Transport (MMT)
11690 AHG on MPEG-7 Profiles
11691 MPEG-V Awareness Day: Call for Participation
11692 Request fof subdivision of ISO ISO/IEC 23006-5 Service Aggregation
11693 VOID
11694 Liaison Statement to 3GPP
11695 Liaison Statement to Wireless Gigabit Alliance
11696 Liaison Statement to ISO/IEC JTC 1/SC 37
11697 Statement of benefits from establishing a Category C liaison with Wireless Gigabit Alliance
56
Annex E – Requirements report
Source: Jörn Ostermann (Leibniz Universität Hannover)
1
Requirements documents approved at this meeting
No.
11630
11631
11661
11673
11674
11675
11676
11677
11678
11679
11680
11681
11682
11683
11684
11685
11686
11687
Title
Description of Exploration Experiments in 3D Video Coding
Report on Experimental Framework for 3D Video Coding
Audio for High Efficiency Video Coding
Low-Complexity Video Coding – Draft Problem Statement
Compact Descriptors for Visual Search: Draft Call for Proposals
Compact Descriptors for Visual Search: Evaluation Framework (draft)
Call for Evidence on Option-1 Video Coding Technology
ISO/IEC 15938-9:2005/PDAM 1
Applications and Requirements on 3D Video Coding
Draft Call for Proposals on 3D Video Coding Technology
Liaison letter to ITU-T SG 16 on Telepresence Systems
Requirements for MPEG Frame-Compatible Enhancement (MFC)
Test Conditions and Evaluation Process for MFC
Liaison statement template on MPEG frame-compatible activities
AHG on scalable resolution enhancement of frame-compatible stereoscopic
3D video
AHG on Evaluation of Responses to the Call for Evidence
AHG on Lightweight Video Coding
AHG on 3D Video Coding
11688
AHG on Compact Descriptors for Visual Search
11689
11690
AHG on MPEG Media Transport (MMT)
AHG on MPEG-7 Profile
2
Systems
2.1
MPEG Font Formats
rd
At the 93 meeting, MPEG issued the Call for Proposals N11538 Composite Font Standard: Call
for Proposals and the associated document N11537 Requirements and Use Cases for the Composite
Font Standard. The Requirements group thanks Adobe and Apple for their responses to the call.
MPEG received sufficient technology in order to start the standardization process. The work is
carried out in the Systems Subgroup.
57
3
Video
3.1
MPEG-7 AVDP
Several broadcasters requested a new profile for MPEG-7 covering several aspects of audio, video
and systems. The main functionality of this audio visual description profile (AVDP) is describing
the results of automatic media analysis with audio/video low-level features for audio-visual content.
They are to be used for the archives of the broadcast industry. Changes to the WD were approved
resulting in the current PDAM N11677 ISO/IEC 15938-9:2005/PDAM 1. The current work plan
foresees the amendment to be finalized in 11/2011.
In order to assure that the profile covers the relevant applications foreseen in the broadcast industry,
EBU will organize an evaluation of the tools (Figure 1) listed as part of the new AVDP.
Work is progressed within the adhoc group N11690 AHG on MPEG-7 Profile.
Figure 1 - Tools of AVDP.
3.2
Miscellaneous
nd
At the 92 MPEG meeting, a proposal for a new profile for surveillance videos (m17461) was
entertained. At this meeting, no support for the proposal was presented.
4
Explorations
4.1
Compact Descriptors for Visual Search (CDVS)
58
Images are foreseen as input for visual search. The temporal extension of images also known as
video will enable life interaction. ETRI and Stanford University contributed databases for use in
this potential standardization activity. While this increases the amount of test data, more
contributions of databases is required. Additional input on destructors, high quality reference and
low-quality test data is required. It appears that MPEG needs a database with text images. More
video would also be beneficial. Furthermore, the annotations of the databases need to be unified. A
CfP will only be meaningful if the before mentioned issues get resolved. The adhoc group N11688
AHG on Compact Descriptors for Visual Search will work on these issues.
MPEG met with JPEG on CDVS. The activities JPSEC and JPSearch may benefit from the CDVS
activity. Future meetings are envisioned when MPEG and JPEG have collocated meetings.
In order to prepare the CfP and evaluation, MPEG issued the documents N11675 Compact
Descriptors for Visual Search: Evaluation Framework (draft) and N11674 Compact Descriptors for
Visual Search: Draft Call for Proposals.
Given the open issues, the earliest time for issuing a call will be the 95th meeting with an evaluation
planned prior to the 97th meeting.
4.2
MPEG Frame-Compatible Stereo Enhancement for AVC (MFC)
The current popularity of stereoscopic video makes distribution using a frame-compatible
stereoscopic 3D video service over AVC desirable. However, these solutions provide only half of
the full spatial resolution for each channel. Transmission of an enhancement layer to achieve full
resolution 3D video might be a solution to this issue. While there was a significant push and effort
during the meeting to issue a CfP at the next meeting, even more significant issues are still to be
solved: At this point, there are no test sequences available. As anchor, SVC bit streams need to be
created. Furthermore, there is currently no evidence that there exists a solution superior to SVC.
From a formal point of view, MPEG progressed this matter by issuing N11681 Requirements for
MPEG Frame-Compatible Enhancement (MFC) and N11682 Test Conditions and Evaluation
Process for MFC. MPEG hopes that members provide test sequences at the 95th meeting such that
evidence may be evaluated at the 96th meeting. The adhoc group N11684 AHG on scalable
resolution enhancement of frame-compatible stereoscopic 3D video works on this activity.
Several members of the group assume that a similar solution based on MPEG-2 will not be required
by the market.
A liaison statement N11683 Liaison statement template on MPEG frame-compatible activities will
be sent to several interested organisations.
4.3
Audio
As in several previous meetings, MPEG had a brainstorming session on future audio activities. At
this meeting, the potential work areas were driven by potential HEVC application scenarios. The
main aspects are large screen, small viewing distances and hence near field audio perception. They
include home cinema requiring to render a multi-channel audio program onto a different number of
speakers or to define wavefield synthesis. Other applications include real-time communications,
camcorders, storage (random access issues), 3D video and telepresence as well as personal 3D
video. Requirements might focus on resolution of sound source localization and enhanced spatial
audio for stereo displays. Results of the current discussion are summarized in N11661 Audio for
High Efficiency Video Coding. Members are encouraged to further elaborate their ideas.
59
The audio group is encouraged to enlist more universities in their activities.
4.4
Leightweight Video Coding
th
At the 90 meeting, MPEG reviewed requirements on Low Complexity Video Coding as described
in N17005 What can we do about Interlace?. Following input from the US National Body at the
92nd meeting, the Requirements Group issued a resolution encouraging its members to provide
further input on this topic. In response to this resolution, one document was submitted describing
the low complexity video coding algorithm VC2 which is already standardized. Despite this rather
limited response and due to the potential relevance of the topic especially for mobile and UHDTV
devices, MPEG updated the output document N11673 Low-Complexity Video Coding – Draft
Problem Statement. Spatial random access is a functionality currently not provided by standardized
solutions. The adhoc group N11686 AHG on Lightweight Video Coding will progress this matter.
In order to decide whether there is room for another standard, MPEG encourages its members to
provide high-quality, high-bit depth and high-rate 4:4:4 video signals.
JPEG might issue a call on this leightweight video coding.
4.5
MMT
The area of work for MMT (Figure 2) might include adaptive progressive transport
(download/stream), cross layer optimization, hybrid delivery and conversational services. It is
foreseen that MMT will also provide adaptive streaming support for mp4 files. Naturally, MMT
will provide some kind of interworking with DASH.
Since the CfP was issued at the 93rd meeting and responses are due prior to the 95th meeting, no
activity took place at this meeting. However, N11689 AHG on MPEG Media Transport (MMT) was
established in order to evaluate the responses.
The Systems subgroup and its members working on MMT are encouraged to define test beds and
software environments such that the development of the MMT standard can be guided by the
outcome of core experiments.
Figure 2 - Networks of interest in the context of MPEG Media Transport.
60
4.6
3D Video Coding
Dolby Labs and Sun-Yat Sen University provided new test material. The selection of test sequences
for the envisioned CfP is now finalized. The next steps related to test material include definition of
rates for anchors based on MVC. The test procedure including instructions of subjects, evaluation
of subjects and selection of displays needs to be defined. The current plan foresees the use of
consumer 3D displays.
A CfP will be issued at the 95th or 96th meeting. Evaluation is planned to be carried out prior to the
98th meeting. The call will be open to all technologies including AVC, MVC, HEVC and others.
MPEG will decide on the use of the proposed techniques, i.e. technology proposed based on AVC
might be used in the standard in order to adapt HEVC to 3DV. The following documents were
updated: N11678 Applications and Requirements on 3D Video Coding, N11679 Draft Call for
Proposals on 3D Video Coding Technology, N11630 Description of Exploration Experiments in 3D
Video Coding. The adhoc group working on 3DV is N11687 AHG on 3D Video Coding.
MPEG made publicly available N11631 Report on Experimental Framework for 3D Video Coding.
The purpose of the document is to provide a high-level overview of the experimental framework
that is developed by MPEG to support the envisioned 3D video applications (Figure 3). This
document also aims to indicate assumptions on the input data as well as supplementary data derived
from the input data.
Left
Output to
Auto-Stereo
N-view
Display
3D Video
Codec
Center
Right
Depth
Estimation
Multiview
Rendering
virtual
views
Figure 3 – Auto-stereoscopic output with 3-view configuration as an example for 3DV.
4.7
Option-1 Licensable Codecs
For several meetings, MPEG had discussions on Option-1 licensable codecs. Especially small
companies consider the per-stream licensing fees as cumbersome.
At its 91st meeting, MPEG issued document N11221 Possible future actions on standardization
with Type 1 licensing where the legal issues are summarized and discussed. Type 1 licensing refers
to option 1 of the joint patent declaration form, where an intellectual property holder can indicate
that he will not charge for his IP. Laymen refer to this type of licensing as royalty-free.
In order to finalize the discussion MPEG issued a resolution at the 92nd meeting related to royaltyfree video codecs. Until this 93rd meeting, the CH, CN, IT, KR, and UK NB indicated support. No
input document on this support was received. The FR and US NB indicated that they have no
resources available. Given that the minimum resources for a new work item is available, MPEG
decided to further explore this type of standard.
In response to N11533 Call for Evidence on Option-1 Video Coding Technology, MPEG received 2
late inputs from Oracle (US NB), Peking University, Tsinghua University, and Zhejiang University
(CN NB). One proposal included the full set of coded video sequences. During this meeting, these
sequences were evaluated in an informal subjective test. Compared to MPEG-1, a noticeable
improvement in subjective quality and an objective improvement of 1db to 4 db was observed.
61
Comparisons against other standards like VP8 were not available. Given that the proponents
envision further improvements of their codecs, a second call for evidence N11676 Call for Evidence
on Option-1 Video Coding Technology was made. The due date is the 95th meeting. The adhoc
group N11685 AHG on Evaluation of Responses to the Call for Evidence prepares the evaluation.
No progress was made with respect to the working guidelines for developing an Option-1 licensable
codec.
Between the 93rd and 94th MPEG meeting, an indefinite moratorium on airtime fees for Internet
video that is free for end users was announced by MPEG LA. Therefore, MPEG asks its members
whether there is still a need to develop a type-1 licensable codec and what its performance
requirements are.
4.8
User Identification
In many applications relevant to MPEG standardization, an identification of individual users or a
description of users may be desirable. Currently, a user description in the context of MPEG-V is
under investigation. A more focused and all-embracing effort related to user identification might be
required. This identification of users may be useful for content adaptation and for exploitation of
user preferences. Obviously, such an user identification bears some risks related to privacy. The
Requirements subgroup kindly encourages organizations to provide input on user
identification/description.
62
Annex F – Systems report
Source: Young-Kwon Lim, Chair
1
No.
X
11560
11561
11562
X
11563
11564
X
11565
Output of the Meeting
Title
TBP Available
Editor
No
10/10/15
Sam
Yes
Yes
10/10/15
10/10/29
Sam
Sam
No
10/10/15
Jean L.
No
10/10/15
Jean L.
No
10/10/15
Text of ISO/IEC 14496-5:2001/FDAM 28 Reference software for LASeR
No
10/10/15
Request for ISO/IEC 14496-5:2001/AMD 30 Reference software for BIFS
No
10/10/15
SeoYoung
SeoYoung
Jean
Text of ISO/IEC 14496-5:2001/PDAM 30 Reference software for BIFS
No
10/10/22
Jean
No
10/10/15
Jamie
No
10/10/15
Dave
No
10/10/29
Dave
No
No
10/10/15
10/10/15
Vladimir
Vladimir
No
10/10/15
10/10/15
Xin
No
No
10/10/29
Kugjin
No
10/10/15
Inseon
No
10/10/15
Inseon
13818-1 Systems
Study of ISO/IEC 13818-1:2007/FPDAM 6 Extension to AVC video
descriptor and signaling of operation points for MVC
List of errata items of ISO/IEC 13818-1:2007
WD of ISO/IEC 13818-1:2007/AMD 7 Signaling of frame compatible and
service compatible stereoscopic video
14496-4 Conformance
Request for ISO/IEC 14496-4:2004/AMD 39 Conformance for BIFS
ExtendedCore2D Profile
Text of ISO/IEC 14496-4:2004/PDAM 39 Conformance for BIFS
ExtendedCore2D Profile
14496-5 Reference Softwares
DoC on ISO/IEC 14496-5:2001/FPDAM 28 Reference software for LASeR
adaptation tools
11566 adaptation tools
11567 ExtendedCore2D Profile
11568 ExtendedCore2D Profile
X
11569
X
11570
11586
X
14496-11 Scene Description and Application Engine
Technologies under Consideration for use of BIFS on Collaborative
Application
14496-12 ISO Base Media File Format
Request for ISO/IEC 14496-12:2008 AMD 3 DASH support and RTP
reception hint track processing
Text of ISO/IEC 14496-12:2008 PDAM 3 DASH support and RTP
reception hint track processing
14496-28 Composite Font
11571 Evaluation Report
11572 WD of ISO/IEC 14496-28 Composite Font
X
21000-20 Contract Expression Language
WD of ISO/IEC 21000-20 Contract Expression Language
11573
11664 Description of CE on Contract Expression Language
X
11574
X
11575
11576
23000-11 Stereoscopic Video Application Format
Text of ISO/IEC 23000-11/FDAM 1 Stereoscopic Video AF Ref. Soft and
Conf.
23000-12 Interactive Music AF
Study of ISO/IEC 23000-12 FPDAM 1 Conformance and Reference
Software
Workplan of ISO/IEC 23000-12 Conformance and Reference Software
63
11599 Description of CE on dynamic volume change
No
10/10/15
Yongwei
23001-6 Dynamic Adaptive Streaming over HTTP
Request for the subdivision of ISO/IEC 23001-6
No
10/10/15
Text of ISO/IEC CD 23001-6 Dynamic Adaptive Streaming over HTTP
No
10/10/20
Description of Evaluation Experiments on Dynamic Adaptive Streaming
No
10/10/15
Thomas
St.
Thomas
St.
Jin
Workplan of Reference Software for DASH
No
10/10/15
Thomas
St.
10/10/19
Xin
No
10/10/19
Sergio
No
10/10/19
Michael
No
10/10/15
No
10/10/21
Michael
No
No
10/10/15
10/10/15
JC
JC
No
10/10/15
Jean L.
Y
10/10/30
Joo
No
10/10/22
Jean G.
No
10/11/12
Seungju
11590 WD of ISO/IEC FDIS 23005-3 AMD 1 Binary representation of sensory No
10/11/12
Kyoungro
10/11/12
Marius
No
10/11/12
Kyoungro
No
10/11/12
Kyoungro
No
10/10/22
Christian
No
10/10/15
Kyungmo
X
11577
11578
11579 over HTTP
11580
X
23006-1 Architecture and Technologies
11581 Text of ISO/IEC CD 23001-1 2nd edition Multimedia Service Platform Yes
Technologies - Architecture
X
23006-2 Application Programing Interface
11582 Text of 2nd edition of ISO/IEC CD 23006-2 MXM APIs
X
23006-4 Elementary Services
11583 Text of 2nd edition ISO/IEC CD 23006-4 Elementary Services
X
23006-5 Service Aggregation
Request of subdivision of ISO/IEC 23006-5 Service
11692
Aggregation
11584 Text of ISO/IEC CD 23006-5 Service Aggregation
X
23007-1 Widgets
11598 Request for ISO/IEC 23007-1 AMD 1 Widget Extensions
11585 Text of ISO/IEC 23007-1 PDAM 1 Widget Extensions
X
23007-3 Conformance and Reference Software
11587 Study of ISO/IEC FCD 23007-3 Conformance and Reference Software
X
23005 Media context and control
11691 MPEG-V awareness day Call for Participation
X
23005-1 Media context and control – Architecture
11588 WD of ISO/IEC 23005-1 AMD 1 Additional Use Cases
X
23005-2 Media context and control – Control Information
11589 WD of ISO/IEC 23005-2 AMD 1 Additional Control Information and
binary representation
X
23005-3 Media context and control – Sensory Information
information
X
23005-4 Media context and control – Virtual World Object
Characteristics
11591 WD of ISO/IEC FDIS 23005-4 AMD 1 Binary representation of virtual No
X
world object characteristics
23005-5 Media context and control – Data Formats for Interaction Devices
11592 WD of ISO/IEC FDIS 23005-5 AMD 1 Additional sensors,
interaction devices and binary representation
X
23005-6 Media context and control – Common Data Format
11593 WD of ISO/IEC FDIS 23005-6 AMD 1 Binary representation
of common data format
X
23005-7 Media context and control – Conformance and
Reference Software
11594 Study of ISO/IEC FCD 23005-7 Conformance and Reference
Software
X
Promotion
11595 Storyboard for MPEG-U promotion video
64
11596 Short descriptions
X
Assets and Standing Documents
11605 MPEG Schema Assets Updates
11609 MPEG URIs and MIME Types
X
11632
11633
11634
11635
11636
11637
11638
11639
11640
11641
11663
10/10/15
10/10/15
Christian
Christian
No
10/10/15
Christian
No
10/10/15
Iraj
No
10/10/15
Xin
No
10/10/15
Sam
No
No
10/10/15
10/10/15
Young
Iraj
No
No
10/10/15
10/10/15
Young
Young
No
No
10/10/15
10/10/15
Young
JC
No
10/10/15
No
10/10/15
Jean L.
Young
SVN support
11597 Guide to the MPEG Subversion Repository
X
No
No
Liaison
Liaison statement template on Dynamic Adapative Streaming
over HTTP
Liaison statement template on Multimedia Service Platform
Technology
Liaison statement template on signalling of stereoscopic video
in MPEG-2 System
Liaison statement to SC 6 on MPEG Media Transport
Liaison statement to 3GPP on Dynamic Adapative Streaming
over HTTP
Liaison statement to JTC 1/WG 7 on Sensor Networks
Liaison statement to IEC TC 100 on the definition of “trick
mode”
Liaison statement to JTC 1/SC 24 on “zip” format
Liaison statement to IEC TC 100 on IEC CDV 62516-2, BIFS
in T-DMB receiver
Liaison statement to IEC TC 100 on Richmedia for e-Book
Statement of benefits from establishing a Category C liaison
with DTG
65
2
General Input Documents
2.1
AHG reports
Session Number
Title
Plenary m17973 Ad Hoc on Scene
Representation
Plenary m17974 Ad Hoc on MPEG File
Formats
Plenary m17975 Ad Hoc on Application
Format
Plenary m17976 Ad Hoc on Font Format
Representation
Plenary m17977 Ad Hoc on Advanced
IPTV Terminal
Plenary m17978 Ad Hoc on MPEG-V
Plenary m17979 Ad Hoc on Contract
Expression Language
Plenary m17980 Ad Hoc on Dynamic
Adaptive Streaming over
HTTP
Plenary m17984 Ad Hoc on MPEG Media
Transport (MMT)
2.2
Dispositions
Accepted
Xin Wang , Young
Kwon Lim
Marius Preda
Xin Wang , Jaime
Delgado
Iraj Sodagar , David
Singer , Young Kwon
Lim
Christian Timmerer
Accepted
Accepted
Accepted
Accepted
Accepted
Accepted
Accepted
Accepted
General technical contributions
Session Number
Title
Plenary m18001 Table of Replies on
ISO/IEC 144961:2010/FDAM 1
2.3
Demo
2.4
FAQ
Source
ITTF via SC 29
Secretariat
.
2.5
Source
Jaeyeon Song , Cyril
Concolato
David Singer , Per
Frojdh
Kyuheon Kim , Kugjin
Yun
Vladimir Levantovsky
AOB
None.
66
Dispositions
Noted
3
MPEG-2 Systems (13818-1)
3.1
Topics
3.1.1
ISO/IEC 13818-1:2007 AMD 5 Carriage of JPEG2000 over MPEG-2
3.1.2 ISO/IEC 13818-1:2007 AMD 6 Extension to AVC video descriptor and
operation points for MVC
3.2
Contributions
Session
MPEG2
MPEG2
MPEG2
MPEG2
MPEG2
3.3
Number
Title
m18029 USNB Contribution:
WG11 resolution 2.1.4
consideration
m18195 JNB comments on
Resolution 2.1.4: Methods
for signaling frame
packing arrangement in
MPEG-2 video
m18351 Korean NB comment on
3DTV signaling in MPEG2 System
m18359 Liaison Statement from
DVB
m18550 Liaison Statement from
TTA (Korea)
Source
A. G. Tescher for
USNB
Dispositions
Noted
(N1562)
Japan National Body
Noted
(N1562)
Korean NB via SC 29
Secretariat
Noted
(N1562)
DVB via SC 29
Secretariat
TTA (Korea) via SC
29 Secretariat
Noted
(N1562)
Noted
(N1562)
Summary of discussions
•
3.4
ignalling of
We received 5 input contributions related to system level signaling of 3D content coded
MPEG-2 video:
– USNB and KNB support option to use new stream_type and a new descriptor.
– JNB did not support a new stream_type due to compatibility issues with deployed
systems.
– KNB and TTA requested adding support for service compatible mode in addition to
frame compatible mode.
- DVB indicated that they do have any plans currently to use MPEG-2 video for 3D
services.
Action Points
 ITU-T Rec. H.222.0 (05/2006)/Amd.5|ISO/IEC 13818-1:2007/FPDAM 5 ballot until 201012-03
 ITU-T Rec. H.222.0 (05/2006)/Amd.5|ISO/IEC 13818-1:2007/FPDAM 6 ballot until 201012-03
67
4
MPEG-4 Reference Software (14496-5)
4.1
Topics
4.1.1
ISO/IEC 14496-5:2001 AMD 28 Reference software for LASeR adaptation tools
4.1.2
ISO/IEC 14496-5:2001 AMD 29 PMSI for LASeR
4.2
Contributions
Session
Scene
4.3
Number
Title
m18005 Summary of Voting on
ISO/IEC 144965:2001/FPDAM 28
Source
SC 29 Secretariat
Dispositions
Accepted
(N11565,
N11566)
Summary of discussions
m18005 One disapproval with comment from France. Disposition is provided appropriately.
4.4
Action Points
 ISO/IEC 14496-5:2001/PDAM 29 ballot until 2010-11-03
5
5.1
MPEG-4 BIFS (14496-11)
Topics
5.1.1 Exploration
- Remote and Collaborative work
- Haptic interaction
5.2
Contributions
Session
Scene
Scene
Scene
Scene
Number
Title
m17998 Table of Replies on
ISO/IEC 1449611:2005/FDAM 7
m18382 Conformance Sequences
for BIFS ExtendedCore2D
m18383 Reference Software for
BIFS ExtendedCore2D
m18407 Use Case for
Collaborative BiFS
Scenes : Distance
Learning
68
Source
ITTF via SC 29
Secretariat
Dispositions
Noted
Jean Le Feuvre
Accepted
(N11564)
Accepted
(N11566)
Accepted
(N11569)
Jean Le Feuvre
Kim Pelsor , Jamie
Marshall ,
Scene
Scene
m18408 Use Case for
Collaborative BiFS
Scenes : Proactive
Medicine
m18409 Use Case for
Collaborative BiFS
Scenes : Consultations
without Walls
Scene
m18412 Use Case for
Collaborative BiFS
Scenes : A day in a
Professor's life
Scene
m18417 Use Cases for
Collaborative BiFS
Scenes : Overview
Scene
m18422 Image Transmission for
Mobile Thin Clients : A
Comparative Study
5.3
Dr Michael
McGannon , Jamie
Marshall ,
Accepted
(N11569)
Fritz-Joachim
Westphal , Mario
Kind , Jamie
Marshall , Mihai
Mitrea ,
Mihai Mitrea , Jamie
Marshall , Françoise
Preteux , Pieter
Simoens , Bart
Dhoedt , ,
Jamie Marshall ,
Mihai Mitrea , Bojan
Joveski , Ludovico
Gardenghi ,
Françoise Preteux ,
Bojan Joveski ,
Ludovico Gardenghi ,
Jamie Marshall ,
Mihai Mitrea ,
Françoise Preteux
Accepted
(N11569)
Accepted
(N11569)
Accepted
(N11569)
Noted
Summary of discussions
M18382
A set of conformance sequences produced by GPAC have been uploaded in this contribution (mp4
and xmt). Another set of files in the used by bifsenc/mux in IM1 is also provided.
There is no repository for sequences, so where to put them ?
This should go to the WD on BIFS ExtendedCore2D profile conformance
M18383
IM1-2D has been updated to reflect the latest versions of BIFS. The project files were updated to
Visual Studio 8 (and should be upgradable easily now). MPEG-J support is not activated because it
could not be rebuilt. Some 3DGC code needs some help.
Parsing, decoding and encoding of the new tools is functional for nodes and commands. There is
utility software to check the behaviour of the nodes (GPAC). So there are two interoperable open
implementations of the standard.
This should go into a WD of BIFS ExtendedCore2D Reference Software.
This raises the question of free readonly access to the MPEG SVN.
The five contributions are presented together through m18417.
m18407: question about the virtual lab, sharing a lab software, clarification that if there is software,
it is on the server, and all communication with the software is managed by the collaborative
environment.
m18408: what is the specificity of this use case ? possibly the security/privacy issues, and how to
establish and communicate trust.
m18409: specificity is the mobility, the sharing of high quality material and the variety of usage
environment, but not so collaborative.
69
m18412: session migration, collaborate on external software, remote upgrades
There is a missing tool on allowing collaboration on external software, a sort of extended VNC.
Should it be connected to BIFS? No! We should specify what are the needs: a scene description
format, possibly XML-base, with events, maybe an update mechanism (but could be in the
collaborative layer).
M18422
This describes using mosaic of images to share an application within a BIFS environment. The
criticism expressed in the contribution was clarified to be criticism of the GPAC implementation,
not of the standard. Guidelines were provided to the author about the proper use of CacheTexture,
once the implementation in GPAC has been improved.
5.4
Action Points
6
MPEG-4 ISO Base File Format (14496-12)
6.1
Topics
6.1.1
ISO/IEC 14496-12:2008/PDAM 2 Support for sub-track selection & switching
 Subtrack selection and switching
 Adaptive progressive download
6.2
Contributions
6.3
Summary of discussions
6.4
Action Points
 ISO/IEC 14496-12:2008/DCOR 4 ballot until 2010-11-10
 ISO/IEC 14496-12:2008/FPDAM 2 ballot until 2011-02-03
7
LASeR & SAF (14496-20)
7.1
Topics
7.1.1
Exploration
 Advanced User Interaction Interface
7.2
Contributions
70
Session
Scene
7.3
Number
Title
m18529 Proposal of an
hierarchical
representation of a
LASeR scene
Source
Mattia Donna Bianco
Dispositions
Rejected
Summary of discussions
m18529:
the contribution was presented, together with further discussions on the reflector. Some of
it comes from a misunderstanding, and further discussions on the reflector may be needed.
However, the contribution cannot be accepted.
7.4
Action Points
8
Open Font Format (14496-22)
8.1
Topics
8.1.1
Composite Fonts
8.2
Contributions
.
Session
8.3
Number
Title
Source
Dispositions
Summary of discussions
M18575
8.4
Action Points
9
Contract Expression Language (21000-20)
9.1
Topics
9.1.1
ISO/IEC 21000-20 Contract Expression Language
9.2
Contributions
Session
Number
Title
Source
71
Dispositions
MSP
m18453 Results of the Core
Experiment for Contracts
Mapping
MSP
m18561 Proposal of initial CEL
Working Draft
9.3
Jaime Delgado , VÃctor RodrÃguezDoncel , Francesco
Gallo , Elisa
Todarello , Annarita
Di Carlo , Walter
Allasia , Eva RodrÃguez
Jaime Delgado ,
Victor RodriguezDoncel
Accepted
N11664
Accepted
N11573
Summary of discussions
M18453
A lot of work done for mapping 4 contracts
M18561
Good to be a starting point. More discussions needed to finalize a core structure of CEL.
9.4
Action Points
10
Stereoscopic Video AF (23000-11)
10.1
Topics
10.1.1 ISO/IEC 23000-11 AMD1 Conformance and Reference Software
10.1.2 ISO/IEC 23000-11 AMD2 Additional composition type
10.2
MAF
10.3
Contributions
m18503 Proposed Text of ISO/IEC
23000-11/FDAM1
Stereoscopic Video AF
Conformance and
Reference Software
Next Generation
Broadcasting Forum
(Korea)
Accepted
N11574
Summary of discussions
M18503 has been reviewed with minor changes in conformance points, conforming files, and
Annex B, and then is finally approved. The reference SW is to support all the three conformance
points, and the four conforming files are provided as well. In order to finalize the FDAM doc, it is
required to have two weeks editing period.
72
10.4
Action Points
 ISO/IEC 23000-11:2009/PDAM 2 ballot until 2010-11-03
11
Interactive Music AF (23000-12)
11.1
Topics
11.1.1 ISO/IEC 23000-12 AMD1 Conformance and Reference Software
11.2
Contributions
Session
MAF
MAF
MAF
11.3
Number
Title
m18165 An experimental study of
different dynamic volume
change encoding
methods for IMAF
m18166 A study on the
computational complexity
of EQ implementation in
IMAF
m18362 Updated text and
reference SW of ISO/IEC
23000-12/FPDAM1 IM AF
Source
Yongwei Zhu ,
Rongshan Yu ,
Susanto Rahardja
Dispositions
Accepted
N11599
Yongwei Zhu ,
Rongshan Yu ,
Susanto Rahardja
Noted
Inseon Jang , Jeongil
Seo , Laurent
Primaux
Accepted
N11575
Summary of discussions
M18165: proposed the duration_update, which seems to have a less parsing complexity and
improve bit savings. However, it is not so sure whether it is affect the quality of the audio sequence.
It is recommended to conduct the experiment to find out the saving of complexity and bits without
any degradation of the quality in audio sequence.
m18166: EQ has been requested as a function in the requirement document, however, it is not
supported in the current IMAF. Thus, it is requested to bring a syntax and semantics for satisfying
the requirement.
M18362: Approved it.
11.4
Action Points
 ISO/IEC 23000-12:2010/PDAM 1 ballot until 2010-12-05
73
12
Dynamic Adaptive Streaming over HTTP (23001-6)
12.1
Topics
•
•
•
12.2
Manifest
Delivery format for ISO BM FF
Delivery format for MPEG-2 TS
Contributions
Session Number
DASH
m18007
DASH
m18008
DASH
m18009
DASH
m18010
DASH
m18011
DASH
m18012
DASH
m18013
DASH
m18014
DASH
m18015
DASH
m18016
DASH
m18017
DASH
m18018
Title
DASH Evaluation Experiment
5: Delivery Format of MPEG2 TS
Report of DASH Evaluation
Experiment 9, Delivery
Format to File Format
Conversion (DFC)
DASH Evaluation Experiment
#1: Compositions of Media
Presentation (CMP) Proposal
Comparison
DASH Evaluation Experiment
#2: MPD Modification,
Corrections, and Extension
DASH Evaluation Experiment
#8, Clock Drift Control
DASH Evaluation Experiment
#4: Delivery Format
Addressing
DASH Evaluation Experiment
#3: Enhancement to
Representation Description
(ERD)
DASH Evaluation Experiment
#7: Seamless Switching (SES)
Updated Information on
Reference Software for
DASH
EE#6: Trick Mode and
Random Access Signaling
(TRA)
Some comments on DASH
EE#7 – Seamless Switching
Proposed Amendment:
Support for Common
74
Source
Waqar Zia
Dispositions
Refer
m17980
Ye-Kui Wang , Sungryeul
Rhyu , Miska M.
Hannuksela
Refer
m17980
Jin Young Lee , Harry Pyle Refer
m17980
Jin Young Lee
Refer
m17980
Nhut Nguyen , Harry Pyle
Gerard Fernando ,
Kyungmo Park
Refer
m17980
Refer
m17980
Waqar Zia , Ying Chen ,
Yago Sanchez
Refer
m17980
Ying Chen , Ingo
Hofmann
Thomas Stockhammer
Refer
m17980
Refer
m17980
Ye-Kui Wang , Thomas
Stockhammer
Refer
m17980
Mark Watson
Refer
m17980
Accepted
N11578
Kilroy Hughes , Quintin
Burns
DASH
m18019
DASH
m18309
DASH
m18311
DASH
m18347
DASH
m18348
DASH
m18353
DASH
m18357
DASH
m18366
DASH
m18370
DASH
m18380
DASH
m18381
DASH
m18385
DASH
m18387
DASH
m18388
DASH
m18390
Encryption
Proposed Amendment:
Independently Decodable
Fragments
DASH related clarifications
to ISO base media file
format
Base DASH on updated 3GPP
adaptive HTTP streaming
specification
DSAH - support of rewind
trick mode
On random access point
signaling in file formats
Consideration of Clock Drift
Control for DASH
Kilroy Hughes , Quintin
Burns
Accepted
N11578
Ye-Kui Wang , Zhenyu
Wu
Accepted
N11578
Ye-Kui Wang , Per
Fröjdh , Thomas
Stockhammer , David
Furbeck , Imed Bouazizi
Ye-Kui Wang
Accepted
N11578
Ye-Kui Wang
Yasuaki Tokumo , Maki
Takahashi , Shuichi
Watanabe , Norio Ito ,
Yasuaki Tokumo , Maki
Takahashi , Shuichi
Watanabe , Yoshiaki
Ogisawa , Takashi
Kaneko , Norio Ito ,
Gerard Fernando , Yang
Yanzi , Wang Fang
Modification of
Enhancement to
Representation Description
on DASH for signaling of 3D
content
Proposal for new box to
support random access for
DASH
Carriage of MPEG-2 TS
Gerard Fernando , Yang
support information in Index Yanzi , Wang Fang
file for DASH
Comments on DASH WD
Cyril Concolato , Jean Le
Feuvre , Romain
Bouqueau ,
Report on an Open Source
Cyril Concolato , Jean Le
implementation of MPEGFeuvre , Romain
DASH
Bouqueau
Annotation and Attributes
Sungryeul Rhyu , Seofor Media Compositions
Young Hwang , Kyungmo
park , Jaeyeon Song ,
Bogyeong Kang , Nhut
Nguyen
Compelling use-case for
Yonghun Lee , Doug
M17770(DFA-EE)
Young Suh , Jaeyeon
Song , kyungmo Park ,
Timed metadata Support in
Sungryeul Rhyu , SeoDASH
Young Hwang , Nhut
Nguyen ,
Benefits of TRA
Chulkeun Kim , Doug
75
Accepted
N11579
Accepted
N11578
Noted
Noted
Accepted
N11579
Accepted
N11579
Accepted
N11578
Noted
Accepted
N11579
Accepted
N11578
Accepted
N11578
Accepted
representation
DASH
m18391
File Format for 3D Support
of DASH
DASH
m18392
Random Access Point (RAP)
Indication
DASH
m18394
Content Insertion on DASH
DASH
m18400
DASH
m18401
DASH
m18403
DASH
m18405
Editor's Proposed Updates
on DASH Working Draft
Adding Minimum RAP
Frequency to MPD
Reference Software for
DASH
Extensions to Media
Presentation Description to
support HTTP Streaming
from multiple servers
Scalable Random Access for
SVC based DASH
DASH
DASH
DASH
DASH
Young Suh , Jaeyeon
Song , Kyungmo Park
Kyuheon Kim , Jangwon
Lee , Jung-Han Kim , Hee
Jean Kim , Gahyun Ryu ,
Gilyoon Kim , Do Young
Joung , Namguk Kim
Hojin Ha , Sungryeul
Rhyu , Seo-Young
Hwang , Jaeyeon Song
Seo-Young Hwang ,
Sungryeul Rhyu , Hojin
Ha
Thomas Stockhammer
Thomas Stockhammer
Thomas Stockhammer
Helmut Bürklin ,
Stéphane Gouache ,
Truong Cong Thang , Jin
Young Lee
m18414
Jin Young Lee , Kwangdeok Seo , Hyung Jung
Kim , Truong Cong
Thang , Jung Won Kang ,
Seong-Jun Bae , Soonheung Jung , Sang Taick
Park
m18415 Virtual Segmentation of TS
Jin Young Lee , Euy-doc
Packetized Video using Key- Jang , Jae-Gon Kim ,
frame Information
Truong Cong Thang ,
Jung Won Kang , SeongJun Bae , Soon-heung
Jung , Sang Taick Park
m18418 File Format Adaptation
Kwang-deok Seo ,
Based on HTTP Hint
Sang-woo Shim , Jin
Track for HTTP Streaming Young Lee , Truong
Cong Thang , Jung
Won Kang , Seong-Jun
Bae , Soon-heung
Jung , Sang Taick Park
m18439 Contribution to DASH EE#2
Truong Cong Thang , Jin
Young Lee , Jung Won
Kang , Seong-Jun Bae ,
Soon-heung Jung , Sang
Taick Park
76
N11578
Accepted
N11578
Accepted
N11579
Accepted
N11578
Accepted
N11578
Accepted
N11579
Accepted
N11580
Accepted
N11578
Accepted
N11579
Accepted
N11579
Accepted
N11579
Accepted
N11579
DASH
m18440
Use Cases of Multiple Byte
Ranges for DASH
DASH
m18441
Comments on the Use of
RAP in Segments
DASH
m18442
DASH
m18443
DASH
m18444
DASH
m18445
Test sequences in DASH
format
DASH File Format and MPD
Considerations
Proposal for using MPEG-21
Digital Item Declaration and
W3C XInclude/XLink for
Media Presentation
Description
Proposal on Signaling for
DASH
DASH
m18457
MPEG-2 TS RAP indexing for
DASH
DASH
m18462
DASH
m18463
DASH
m18464
DASH
m18465
DASH
DASH
m18482
m18483
DASH
m18484
DASH
m18493
DASH
m18494
Contribution to DASH EE#5:
MPEG2-TS Reception Hint
Track
Contribution to DASH EE#7:
bitstreamStructureId
Contribution to DASH EE#9:
MPD Indication for File
Conversion of Scalable
Streams
Contribution to DASH EE#9:
File Construction Instruction
Format
DASH MPD Delta Files
Content Protection and
DRM in DASH
Signaling for Seamless
Switching for DASH
On decode times in movie
fragments
DASH: Considerations on
77
Truong Cong Thang , Jin
Young Lee , Jung Won
Kang , Seong-Jun Bae ,
Soon-heung Jung , Sang
Taick Park
Truong Cong Thang , Jin
Young Lee , Jung Won
Kang , Seong-Jun Bae ,
Soon-heung Jung , Sang
Taick Park ,
Thorsten Lohmar , Per
Fröjdh
Per Fröjdh , Torbjörn
Einarsson
Christopher Müller ,
Christian Timmerer ,
Accepted
N11579
Truong Cong Thang , Jin
Young Lee , Jung Won
Kang , Seong-Jun Bae ,
Soon-heung Jung , Sang
Taick Park ,
Zhijie Zhao , Marco
Munderloh , Joern
Ostermann
Miska M. Hannuksela
Noted
Miska M. Hannuksela ,
Imed Bouazizi
Miska M. Hannuksela ,
Imed Bouazizi
Accepted
N11579
Accepted
N11579
Miska M. Hannuksela
Accepted
N11579
Accepted
N11579
Accepted
N11579
Accepted
N11578
Accepted
N11579
Accepted
N11579
Accepted
N11579
David Furbeck
Xin Wang , Shaobo Zhang Accepted
N11579
Zhang Shaobo , Xin Wang Accepted
N11579
David Singer
Accepted
N11579
Mark Watson , Thomas
Accepted
Overlap during switching
DASH: URL selection
Stockhammer
Mark Watson
DASH EE#7 – Segment
Index
DASH: Considerations on
Segment Duration
Input for DASH EE#1 (CMP):
Pixel Aspect Ratio
DASH: Field data for
quantitative evaluation
Mark Watson , Thomas
Stockhammer
watsonm@netflix.com
DASH
m18495
DASH
m18496
DASH
m18497
DASH
m18498
DASH
m18504
DASH
m18524
Contribution to EE#2 MCE:
Extension and Modification
DASH
m18545
Seamless switching for TS in
DASH
DASH
m18549
DASH
m18551
DASH
m18558
DASH
m18559
DASH
m18560
DASH
m18562
DASH
m18565
DASH
m18566
DASH
m18392
DASH – Quality of
Experience and performance
metrics
DASH: MPD Representation
Groups and MPD
Fragmentatio
Usage of MPEG-2 Transport
Streams
Segment Index Syntax for
MPEG-2 TS
Comments on MPEG-2
Transport Streams in DASH
LGE proposal for PIP
attributes for CMP
Support of virtual/nonsegmentation in DASH
DASH DTS M2TS
Compression gzip
comparison
Random Access Point (RAP)
Indication
DASH
m18416
TS Header Extension for
Efficient Adaptation in the
HTTP Streaming
78
Mark Watson
Doug Young Suh ,
Kyungmo Park ,
Sungryeul Rhyu
Zhijie Zhao , Marco
Munderloh , Joern
Ostermann
Cyril Concolato , Romain
Bouqueau , Jean Le
Feuvre
Mark Watson
Mark Watson , Thomas
Stockhammer , Kilroy
Hughes ,
David Singer , Alex Giladi
Alex Giladi
alex.giladi@gmail.com
hj08.lee@lge.com ,
w.zia@lglab.eu
Zhijie Zhao , Yanzi Yang
H J Lee , w.zia@lglab.eu
Hojin Ha , Sungryeul
Rhyu , Seo-Young
Hwang , Jaeyeon Song
Jin Young Lee , Euy-doc
Jang , Jae-Gon Kim ,
Truong Cong Thang ,
Jung Won Kang , SeongJun Bae , Soon-heung
Jung , Sang Taick Park
N11578
Accepted
N11579
Accepted
N11578
Noted.
Accepted
N11578
Accepted
N11579
Noted
Accepted
N11579
Accepted
N11579
Accepted
N11578
Accepted
N11579
Accepted
N11579
Accepted
N11579
Accepted
N11579
Accepted
N11579
Accepted
N11579
Accepted
N11579
Accepted
N11579
12.3
Summary of discussions
m18464: the flag will be considered as part of Representation ‘annotation” in more generalized case
and will be discussed there.
m18465: will bring back an enhanced version of proposal that also includes less on MPD signaling.
Show the evidence of how often the client can create a valid MP4 file, using the existing players
(whether file with missing track in mdat can be played by legacy player?)
-“If satisfied, proposal 1 (one initialization segment for all reps) get added as client/service
guidelines (informative)” no input.
ExpriationTimeUTC: Samsung will have contribution at the Jan meeting.
m18353: seems not directly related to EE8
m18562 moved to EE1 (CMP).
m18370: supporting information + index file+ MPD additions for index file
m18462: dual approach using sidx and MP4 or separate file
m18457: two solutions: 1) index file similar to m18370 2) include RAP index in PID stream, prefer
PID approach. Need to transcode new TS.
:m18558 & m18560: fixes and improvements on the baseline.
m18392: extensions 2 M2TS, goes to M2TS extension work, outside of DASH
m18559:similar to m18392 but use private data
m18545: the audio glitch during switching of video streams
m18566: - map PTS to MPD timeline, gzip vs LG proposal: not completed.
m18439: ETRI, proposes “periodref”, Xlink and XInclude are not appropriate.
m18551: section 3. Propose a new element “Include” in MPD inside any period or representation.
Both 18439 and 18551 do not recommend XInclude or Xlink.
Some people think Xlink may able to do same. Will bring evidence tomorrow.
Will decide on the solution after CMP review.
M18388: timemetadata, no consensus yet.
m18366: msra box for accessing RAP. No major advantage to WD is shown yet.
m18387: new proposal to extend sidx box, will be reviewed with TRA proposals.
m18440: no strong support
m18565: no new decision
m18385: mediacomposition group
79
m18444: DID, MPD selection
m18498: aspect ratio. New attributes for signaling the aspect ratio.
m18551: RepresenationGroup
The concept of RepresentationGroup in m18551 was originally introduced by ETRI in section 5.4
of m18009 as componentRepresentations
Following attributes in RepresentatonGroup element (m18551) was originally introduced by ETRI
in section 5.4 of m18009;
lang
mimeType
minBandwidth
maxBandwidth
minWidth
maxWidth
minHeight
maxHeight
minFramerate
maxFrame
m18562: view priority and PIP location are provided as representation attributes in MPD. But since
MPD is not designed to convey presentation/layout information, therefore the proposed attributes
are not added to MPD.
m18385 will provide a use case and detail example of its proposal compared with WD by Tues.
DRM part of CMP will be reviewed during the week.
18390: adding RAPperiod to MPD. Accepted the maxmiumRAPperiod into MPD.
Keep the constraint of one minimum RAP.
18401: minimum RAP frequency.
m18347: rewind trick mode: alternatePlayoutRate have negative value + additional text to clarify
the meaning of negative/positive. MP4FF needs also to signal for reverse sample order in reverse
order.
m18441: remove the constraint of having minimum one RAP. Propose RandAcess element in MPD
to signal segments with RAPs.
m18443: temporal ordering of tick mode. Not to use one trun. Sample grouping using “tele” box in
MP4. Use segment type bx to brand the segment as temporal scalability. A new attribute in trick
mode element of MPD for temporal ordering, also number of levels.
Sample grouping functionality might exist in AVC FF. Accept m18443 with the refinement on
AVC extension box, resolve the disagreement on index box after sidx. Collaboration with 18387.
Update on m18016:
m18387: extend the sidx box.
Decision:
m18441: bring back the proposal with more details.
A basic question to answer generally: if the values of attributes are out of range, what would be the
behavior of the client? (contributions are welcomed to answer this).
80
m18463: need two decoded 2 I frames sometimes to switch seamlessly. To avoid overlapping,
proposing bitstreamStrcutureId attribute for representation meaning that they can be switched and
the old reference frame used for new ones.
m18493: sidx is not easy to create. MPD gives the accurate time for segments. Add the tfdt box
after Track Fragment Header box. Tfad express the clean start.
m18494: switching while overlapping representation. encodedSampleNooverlapFlag at
represenationgroup
m18496: sidx box has two purposes: timing recovery and RAP indexing.
TrackFragmentDecodeTime can do it better.
Sidx Define decoding time, should have defined the composition time.
Accept it to WD. Ask 3gpp to align the 3GPP release 9.
Come back with fragmentAlignmentFlag to propose.
m18497: for information. Will be presented later.
M18405
Multiple URl with simple priority.
M18495
. If there is any issues with CDN, it will brought up. We will look at the exact syntax and accept it if
there is no issue.
M18497
Add to WD: add “duration” to period.
Make period start time optional
Only for VOD case, if it doesn’t break anything else in MPD.
M18445: - supp. Can be delivered as content, no need to be part of MPD.
Average bandwidth: not average. If use the same definition of Rep’s bandwidth. No
consensus on this. Bring modified proposal.
M18524: - no strong support for flag.
Group: WD will be clarified that this is the case.
M18357: Add stereoVideo to represenation and generic FramePackingType
M18484: deferred to later.
M18549
Propose QoE metrics for informative part. Exact parameters to be further defined and refined.
Approve an EE on this: Mark to provide the description.
M18483
On, UsageRights, UsageRightUri
MPDIntegrity: Need further studies. Possibly start an EE.
M18385
On downloadableDRMInfo
81
M18015 was presented in Paris and reviewed here again. DASH Media and DASH Client
imple,ments the major feature of the software. Built on Mac, Linux.
m18402: duplicate registration.
m18403: the software plan
m18442: Test sequences and a site that provide VOD and live content for DASH.
m18444: Demoed a player which parses MPD and plays on representation.
M18381: implementation based on GPAC open source project. Parsing MPD. It doesn’t support
switching
- m18309:
•
clalify spec that track fragment boxes covering the same time period: corrigenda issue
•
restriction on the composition time of following samples
•
Movie fragments covering the same time period should be time-aligned. -> this should be
addressed in DASH spec.
•
Allow gaps in movie fragment box: corrigenda issue.
No additional FF amendment required. Either corrigenda issues or better DASH.
M18348: signaling rap in FF. 1) Define RAP to include all 3 types of RAPs. 2) Sync sample table
box should have only IDR RAPs. 3) Add 2 fields to rap sample group entry. 4) Add one filed to
syntax of roll sample group entry. Discuss with dave. May be already addressed. 5) Clarify raps in
mfra ox can be any raps.
Give a definition of sync sample. Part of corrigenda. Editors need to go look at Part 12 and 15 and
see what is missing about RAPs and propose for next corrigenda.
m18391: 3 new boxes for 3-D signaling. Svmi, scdi and mfsi
see part 15 extension and if it can be addressed there.
m18018 and m18019:
-Trik box: start work with RAP sample group box. Integrated sample group to be defined to address
this. Will be addressed as part of editing of the PDAM.
-sample encryption box -> study if it can be implemented with time metadata track,
- track encryption box add to PDAM,
- pssh: draft a metada item, instread of using uuid, use a 4cc register id. Goes to PDAM.
m18418: HTTPHintSampleEntry, not part of delivery.
M180415: signaling the RAP indexes in MPD.
There are 3 different methods: inside M2TS signaling, sidx based signaling and signaling in MPD.
12.4
Action Points
 Detailed schedule for DASH sessions.
Day
Time
Topic
16:00 –
Review of EE results
Mon.
??:??
09:00 –
New contributions
Tue.
??:??
12:00 –
Reference Software
Wed.
82
13:00
14:00 –
16:00
09:00 –
13:00
Wed.
Thu.
MPEG-2 TS, File Format
Issues
Remaining issues
13
Multimedia Service Platform Architecture (23006-1)
13.1
Topics
13.1.1 ISO/IEC 23006-1 MXM Architecture and technologies
13.2
Contributions
Session
Number
13.3
Summary of discussions
13.4
Action Points
14
MSP APIs (23006-2)
14.1
Topics
Title
Source
Disposition
14.1.1 ISO/IEC 23006-2 MXM APIs
14.2
Contributions
Session
MSP
Number
Title
m18528 Contribution for 23006-2
Source
sergio matone
MSP
m18505 Java implementation for
generic metadata APIs on
MXM
Wonsuk Lee ,
Seungyun Lee
14.3
Summary of discussions
M18528
This is an Editor’s input to improve the text.
83
Dispositions
Accepted
N11582
Accepted
N11582
M18505
This provides software for generic metadata API for MXM. It should be integrated into the existing
package. The authors will provide an input document to provide the API specification for generic
metadata.
14.4
Action Points
15
MSP Elementary Service Protocol (23006-4)
15.1
Topics
15.1.1 ISO/IEC 23006-4 MXM Protocols
15.2
Contributions
Session
MSP
MSP
MSP
MSP
MSP
MSP
MSP
Number
Title
m18526 Comments on
WorkingDraft 23006-4
m18476 Input on w11413 (WD of
ISO/IEC 23006-4 2nd
edition Elementary
Service Protocols)
m18213 Background for Proposing
Scalable Application
Markup Language for
AIT’s Present
Content Elementary
Service
m18214 Proposal of Scalable
Application Markup
Language (SAML) for
AIT’s Present
Content Elementary
Service
m18227 Proposal of Stack
Operators for AIT’s
Present Content
Elementary Service
m18229 Initial Proposal of
DCFunctions for
AIT’s Present
Content Elementary
Service
m18538 AIT Elementary Service Search License
84
Source
sergio matone
Michael Grafl ,
Christian Timmerer
Dispositions
Accepted
N11583
Accepted
N11583
Kyoungro Yoon , MinUk Kim , Tae-Beom
Lim , Kyung Won
Kim , Yun Ju Lee ,
Jae Won Moon
Noted
Kyung Won Kim ,
Tae-Beom Lim , Yun
Ju Lee , Jae Won
Moon , Kyoungro
Yoon , Min-Uk Kim
Noted
Jae Won Moon ,
Kyung Won Kim ,
Tae-Beom Lim , Yun
Ju Lee , Kyoungro
Yoon , Min-Uk Kim
Tae-Beom Lim ,
Kyung Won Kim , Yun
Ju Lee , Jae Won
Moon , Kyoungro
Yoon , Min-Uk Kim
Miran Choi , MyungGil
Jang , Chang-ki Lee ,
Noted
Noted
Accepted
N11583
MSP
15.3
m18539 AIT Elementary Service Search Contract
HyunKi Kim ,
Miran Choi , Hyojeong Accepted
N11583
Oh , Jeong Hur ,
Soojong Lim ,
Yeochan Yun
Summary of discussions
M18526 & m18476
This is an Editor input to improve the text
The contributions m18213, m18214, m18227, m18229, and a demo of the use of Scalable
Application Markup Language (SAML) were presented. It was agreed that the contributors will try
to re-implement the demo using
MPEG-21 DIA. They will evaluate, whether an extension needs to be specified in order to include
and use MPEG-21 DIA information for the choice/selection mechanism in MPEG-21 DID. The
result will be presented at the 95th MPEG meeting.
M18538
This is an input on Search License ES. Accepted to be part of Part 4 ES.
M18539
This is an input on Search Contract ES. Accepted to be part of Part 4 ES.
15.4
Action Points
16
MSP Service Aggregation (23006-5)
16.1
Topics
16.1.1 ISO/IEC 23006-5 MSP Service Aggregation
16.2
Contributions
Session
MSP
MSP
16.3
Number
Title
m18527 Comments on
WorkingDraft of 23006-5
m18477 Input on w11414 (WD of
ISO/IEC 23006-5 Service
Aggregation)
Summary of discussions
85
Source
sergio matone
Michael Grafl ,
Christian Timmerer
Dispositions
Accepted
N11584
Accepted
N11584
M18527 & m18477
Editors input to improve the text
16.4
Action Points
17
Richmedia UI Widgets (23007-1)
17.1
Topics
17.1.1 Package, Delivery and Presentation of Widgets
17.1.2 Communication
17.2
Contributions
Session
Scene
Scene
17.3
Number
Title
m17999 Table of Replies on
ISO/IEC FDIS 23007-1
m18376 Proposed amendment for
MPEG-U widgets
Source
ITTF via SC 29
Secretariat
Jean-Claude
Dufourd , Jean
Lefeuvre , Cyril
Concolato , Kyungmo
Park , Jaeyeon Song
Dispositions
Noted
Accepted
(N11585)
Summary of discussions
m18376:
The contribution is presented. A comment from MPEG-V is that numberarray may also be
needed. An answer is that there is then a need to provide mapping of the array types to a
type acceptable to all protocols (such as mapping to/from string type). Question on
whether the stringarray implementation caused any problem, and the answer was no,
beyond the mapping, hence the proposed restriction in the contribution. Question on the
implementation of security restrictions, which led to a discussion to be continued.
17.4
Action Points
18
Advanced User Interaction Interface (23007-2)
18.1
Topics
18.1.1 Advanced User Interaction Interface
86
18.2
Contributions
Session
Scene
18.3
Number
Title
m18490 Proposed mechanism to
support AUI interface
Source
Seong Yong Lim ,
Jihun Cha , Injae
Lee , Young-kwon
Lim
Dispositions
Noted
Summary of discussions
m18490 :
Presentation of the contribution. The presented interface seems more like a higher level of MPEG-V
than an MPEG-U interface.
Request to do a state of the art on mouse gestures, same as was done on the iphone SDK.
Question about what language to use to describe the gestures, and then what language to use to pass
the information about a recognized gesture into a scene in a particular format: the answer for both is
that the proposal is flexible.
Once the gesture is recognized, it is important to adapt the information to the target scene format,
even if the actual effect in the scene is out of scope.
There seems to be a need for a database of recognized gestures, an entity which matches the output
of the semantic generator to one recognized gesture (independent from the scene format), and then a
module to translate the recognized gesture into some scene description specific stuff.
Suggestion that MPEG-U part 1 can be used for some of the communications between MPEG-V
and the scene.
Request for a walkthrough of the system, a concrete example of a flow
18.4
Action Points
19
MPEG-U Conformance and Ref. SW (23007-3)
19.1
Topics
19.1.1 Conformance and Reference Software
19.2
Contributions
Session
Scene
19.3
Number
Title
m18438 Study of FCD of 23007-3
Summary of discussions
m18438:
87
Source
Jean-Claude
Dufourd , Jean
Lefeuvre , Cyril
Concolato , Kyungmo
Park , Jaeyeon Song
Dispositions
Accepted
(N11587)
the contribution was presented. The conformance sequences are not completed, mostly
negative sequences are still missing. These will be discussed and provided or removed
until the next meeting.
19.4
Action Points
 ISO/IEC FCD 23007-3 ballot until 2010-12-05
20
MPEG-V Architecture (23005-1)
20.1
Topics
20.1.1 ISO/IEC 23005-1 MPEG-V Architecture
20.1.2 Exploration on Haptics
20.2
Contributions
Session
MPEGV
MPEGV
MPEGV
MPEGV
MPEGV
Number
Title
m18027 Proposal of Header
information for
binary transmission
of MPEG-V
information
m18252 Robot Integration
m18253 Additional Use
Cases for MPEG-V
m18508 Descriptions for
social network
services
m18352 Path Finding
20.3
Summary of discussions
20.4
Action Points
Source
Kyoungro Yoon , Soohan
Kim , Woo Chool Park ,
Hae Moon Seo
Dispositions
Refer 3DGC
report
acordova@tue.nl ,
jean.gelissen@philips.com
jean.gelissen@philips.com
Refer 3DGC
report
Refer 3DGC
report
Refer 3DGC
report
Sanghyun Joo
(joos@etri.re.kr)
jean.gelissen@philips.com , Refer 3DGC
report
roland@cs.uu.nl ,
 ISO/IEC FCD 23005-1 ballot until 2010-03-02
88
21
Control Information (23005-2)
21.1
Topics
21.1.1 ISO/IEC 23005-2 Control Information
21.2
Contributions
21.3
Summary of discussions
21.4
Action Points
 ISO/IEC FCD 23005-2 ballot until 2010-04-14
22
Sensory Information (23005-3)
22.1
Topics
22.1.1 ISO/IEC 23005-3:201x/AMD 1 Binary representation
22.2
Contributions
Session
MPEGV
MPEGV
MPEGV
MPEGV
MPEGV
MPEG-
Number
Title
m18025 Proposal of GPS
Sensor Capability
Type, Altitude
Sensor Capability
Type, and Global
Position Capability
Type
m18365 Data format for
sensed information
on Electrograph
sensor
m18377 Tactile Information
m18487 Definition and
application domains
of new sensors
m18488 Sensor capability
and preference of
new sensors
m18489 Binary
Source
Kyoungro Yoon , Doohyun
Kim , Min-Uk Kim
Dispositions
Refer 3DGC
report
SeungJu Han , Jae Joon
Han , Won-Chul Bang ,
James D.K. Kim , SangKyun Kim
jean.gelissen@philips.com ,
esko.dijk@philips.com
Seong Yong Lim , Jihun
Cha , Injae Lee , Youngkwon Lim , Joong Yun Lee
Seong Yong Lim , Jihun
Cha , Injae Lee , Youngkwon Lim , Joong Yun Lee
Seong Yong Lim , Jihun
Refer 3DGC
report
89
Refer 3DGC
report
Refer 3DGC
report
Refer 3DGC
report
Refer 3DGC
V
MPEGV
MPEGV
representation of
new sensors
m18410 Sensed Information
for gas and dust
sensors
m18510 sensor capability for
gas and dust
sensors
22.3
Summary of discussions
22.4
Action Points
Cha , Injae Lee , Youngkwon Lim , Joong Yun Lee
Jae Joon Han , SeungJu
Han , Won-Chul Bang ,
James D.K. Kim , Sangkyun Kim , YongSoo Joo
Jae Joon Han , SeungJu
Han , Won-Chul Bang ,
James D.K. Kim , SangKyun Kim , YongSoo Joo
report
Refer 3DGC
report
Refer 3DGC
report
 ISO/IEC FCD 23005-3 ballot until 2010-04-14
23
Virtual World Object Characteristics (23005-4)
23.1
Topics
23.1.1 ISO/IEC 23005-4:201x/AMD 1 Binary representation
23.2
Contributions
Session
MPEGV
MPEGV
MPEGV
Number
Title
m18507 Additional type for Virtual
Object
m18509 Minor changes in MPEGV Part 4
m18536 Update of Reference SW
for MPEG-V Part 4
23.3
Summary of discussions
23.4
Action Points
Source
Sanghyun Joo
(joos@etri.re.kr)
Sanghyun Joo
(joos@etri.re.kr)
Jae Joon Han ,
SeungJu Han , WonChul Bang , James
D.K. Kim , Sang-Kyun
Kim , Yong Soo Joo
 ISO/IEC FCD 23005-4 ballot until 2010-04-14
90
Dispositions
Refer 3DGC
report
Refer 3DGC
report
Refer 3DGC
report
24
Data formats for interaction devices (23005-5)
24.1
Topics
24.1.1 ISO/IEC 23005-5:201x/AMD 1 Additional sensors, interaction devices and binary
representation
24.2
Contributions
Session
MPEGV
Number
Title
m18024 Update of Reference SW
for MPEG-V Part 5
24.3
Summary of discussions
24.4
Action Points
Source
Eunseo Lee , Bum
Seok Choi, Kwang
Roh Park, Kyoungro
Yoon , Jonghyung
Lee , Ying Ying Chen
Dispositions
Refer 3DGC
report
 ISO/IEC FCD 23005-5 ballot until 2010-04-14
25
Conformance and reference software (23005-7)
25.1
Topics
25.1.1 ISO/IEC 23005-5 Reference Software
25.2
Contributions
Session
MPEGV
Number
Title
m18023 Update of Reference SW
for MPEG-V Part 2
MPEGV
m18449 Proposed changes for
validation rules of MPEGV Part 7
25.3
Summary of discussions
91
Source
Bum Suk Choi ,
Eunseo Lee , Jong
Hyun Jang , Kyoungro
Yoon , Jonghyung
Lee , Ying Ying Chen
Markus Waltl ,
Christian Timmerer
Dispositions
Refer 3DGC
report
Refer 3DGC
report
25.4
Action Points
 ISO/IEC FCD 23005-7 ballot is note started yet.
25.5
Summary of discussions
25.6
Action Points
26
Exploration – MMT
26.1
Topics
•
•
•
•
•
Transport- and file format friendly stream format
Cross layer optimization between video and transport layer
Error resilience for MPEG streams
Conversion between transport mechanisms
Content adaptation to different networks
26.2
Contributions
26.3
Summary of discussions
26.4
Action Points

CfP
27
Liaison
27.1
List of input liaison letters
Session Number
Title
Plenary m17985 Liaison Statement from
ITU-T SG 16 on Approval
92
Source
ITU-T SG 16 via SC
29 Secretariat
Dispositions
Jean G.
Plenary m17989
Plenary m17993
Plenary m18006
Plenary m18020
Plenary m18021
Plenary m18022
of a New Question on
Telepresence systems
[SC 29 N 11508]
IEC CDV 62516-2 [SC 29
N 11538]
Liaison Statement from
Digital TV Group [SC 29
N 11551]
Liaison Statement from
3GPP
Liaison Statement from
ISO/IEC JTC 1/WG 7 [SC
29 N 11560]
Liaison Statement from
IEC TC 100 [SC 29 N
11561]
Liaison Statement from
JTC 1/SC 34 [SC 29 N
11563]
Plenary m18359
Plenary m18550
IEC TC 100 via SC 29 JC
Secretariat
DTG via SC 29
Iraj
Secretariat
3GPP via SC 29
Secretariat
JTC 1/WG 7 via SC
29 Secretariat
Iraj
Young
IEC TC 100 via SC 29 Young
Secretariat
JTC 1/SC 34 via SC
29 Secretariat
Young
Sam
Sam
93
28
References
28.1
Timeline of standards under development
P
r
P
t
Edit
.
Projec
t
2
1
AMD5 Transport of JPEG2000
2
1
4
5
4
5
4
5
4
5
4
5
4
1
2
1
2
1
5
1
5
2
0
2
2
2
2
2
8
1
2
1
2
2
200
7
200
7
200
1
200
1
200
1
200
x
200
x
200
x
200
8
201
0
201
0
200
9
200
8
200
8
200
8
200
8
200
8
200
2
200
6
200
4
4
4
4
4
4
4
7
7
2
1
2
1
2
4
8
CfP
Description
WD
CD
DCO
R
PDA
M
09/10
DIS
FPDA
M
DAM
FDIS
FDA
M
COR
10/07
11/01
09/10
10/07
11/01
AMDx
x
AMDx
x
AMDx
x
AMD2
8
AMD2
9
AMD2
AVC File Format Ref. Soft
09/0
7
09/0
7
TBS
SVC File Format Ref. Soft
TBS
MVC File Format Ref. Soft
TBS
Reference software for LASeR
adaptation tools
Reference software for PMSI for
LASeR
Sub-track selection & switching
10/07
11/01
11/07
10/01
10/07
11/01
COR1
Minor corrections
10/07
11/01
COR1
Minor corrections
10/07
11/01
AMD6 MVC operation point descriptor
10/10
AMD1 Sub-track definition
10/07
11/01
COR1
10/07
AMD1 Support for many-to-one range
mappings
COR1
1st
Composite Font Format
09/04
Presentation element
Protection of Presentation
Information
MVCO Reference Software
94
09/10
10/01
10/0
7
AMD1 MPQF reference software
.
AMD2 Semantic enhancement
AMD1
.
AMD2
.
AMD2
11/07
10/1
0
09/0
7
08/1
0
08/1
0
10/07
10/07
11/01
11/07
12/01
09/07
09/10
10/07
09/10
10/07
11/01
09/04
10/07
11/01
09/04
10/07
11/01
10/07
11/01
11/07
1
2
1
A
2
0
5
A
6
A
9
A
B
1
1
1
1
1
2
6
M
1
M
2
M
3
M
4
M
5
U
2
U
3
V
1
V
1
V
2
V
2
V
3
V
3
V
4
V
4
V
5
A
A
8
200
x
200
x
200
9
200
8
200
x
200
x
201
0
200
x
200
x
200
x
200
x
200
x
200
x
200
x
200
x
200
x
200
x
200
x
200
x
200
x
200
x
200
x
200
x
200
x
1st ed.
Contract Expression Language
10/0
7
08/0
1
2nd Ed. MS AF
COR1
PA AF
AMD1 Minor correction
/COR1
AMD1 SVAF Ref. Soft. And Conf.
09/0
2
09/0
7
AMD2 Additional composition type
AMD1 IM AF Ref. SW and
Conformance
st
1 ed. Dynamic Adaptive Streaming
over HTTP
nd
2 ed. Architecture
10/0
4
10/0
7
10/10
11/01
11/07
09/02
10/01
10/07
10/01
10/07
10/07
11/01
09/07
09/10
10/10
10/07
11/01
11/07
10/01
10/07
11/01
10/10
11/01
11/07
10/10
11/01
11/07
2nd ed.
MXM APIs
10/10
11/01
11/07
2nd ed.
Reference SW & Conformance
10/10
11/01
11/07
2nd ed.
Elementary Service Protocol
10/10
11/01
11/07
2nd ed.
Service Aggregation
10/10
11/01
11/07
1st ed.
10/07
11/01
11/07
1st ed.
Advanced User Interaction
Interface
Conf. & Ref. SW
10/01
10/07
11/01
200x
Architecture
09/07
09/10
10/07
11/01
11/07
12/01
09/07
09/10
10/07
11/01
11/07
12/01
09/07
09/10
10/07
11/01
11/07
12/01
09/07
09/10
10/07
11/01
11/07
12/01
09/07
09/10
10/07
AMD1 Streaming Support
200x
Control Information
AMD1 Additional Control Information
200x
Sensory Information
AMD1 Binary representation
200x
Virtual world object
characteristics
AMD1 Binary representation
200x
Data formats for interaction
devices
95
10/0
7
10/0
7
08/1
0
09/0
4
09/0
2
10/0
7
09/0
2
10/0
7
09/0
2
10/0
7
09/0
2
10/0
7
09/0
2
V
5
V
6
V
7
200
x
200
x
200
x
28.2
Pr
AMD1 Additional sensors and
interaction devices
200x Common types and tools
200x
Conformance testing and
reference software
10/0
7
09/0
2
09/0
2
11/01
11/07
12/01
09/07
09/10
10/07
10/01
10/07
11/01
Standing Documents
1
1
1
Pt
1
1
1
No.
N7675
N7676
N7677
Meeting
05/07 Nice
05/07 Nice
05/07 Nice
N7678
N7679
N7680
05/07 Nice
05/07 Nice
05/07 Nice
11
1
1
1
1
6
11
12
14
15
13
13
17
18
20
Documents
MPEG-1 White Paper – Multiplex Format
MPEG-1 White Paper – Terminal Architecture
MPEG-1 White Paper – Multiplexing and
Synchronization
MPEG-2 White Paper – Multiplex Format
MPEG-2 White Paper – Terminal Architecture
MPEG-2 White Paper – Multiplexing and
Synchronization
MPEG-2 White Paper – MPEG-2 IPMP
MPEG-4 White Paper – MPEG-4 Systems
MPEG-4 White Paper – Terminal Architecture
MPEG-4 White Paper – M4MuX
MPEG-4 White Paper – OCI
MPEG-4 White Paper – DMIF
MPEG-4 White Paper – BIFS
MPEG-4 White Paper – ISO File Format
MPEG-4 White Paper – MP4 File Format
MPEG-4 White Paper – AVC FF
White Paper on MPEG-4 IPMP
MPEG IPMP Extensions Overview
White Paper on Streaming Text
White Paper on Font Compression and Streaming
Presentation Material on LASER
2
2
2
1
1
1
2
4
4
4
4
4
4
4
4
4
4
4
4
4
4
N7503
N7504
N7610
N7921
N8148
N8149
N7608
N8150
N7923
N7924
N7505
N6338
N7515
N7508
N6969
4
4
7
7
21
A
A
20
22
1
1
9
X
X
White Paper on LASeR
White Paper on Open Font Format
MPEG-7 White Paper - MPEG-7 Systems
MPEG-7 White Paper – Terminal Architecture
MPEG-21 White Paper – MPEG-21 File Format
MPEG Application Format Overview
MAF Overview Document
N7507
N7519
N7509
N8151
N7925
N9421
N9840
A
X
MAF Overview Presentation
N9841
B
E
X
X
N7922
N6335
E
X
MPEG-B White Paper – BinXML
MPEG Multimedia Middleware Context and
Objectives
1rst M3W White paper
05/07 Poznan
05/07 Poznan
05/10 Nice
06/01 Bangkok
06/04 Montreux
06/04 Montreux
05/10 Nice
06/04 Montreux
06/01 Bangkok
06/01 Bangkok
05/07 Poznan
04/03 München
05/07 Poznan
05/07 Poznan
05/01 HongKong
05/07 Poznan
05/07 Poznan
05/07 Poznan
06/04 Montreux
06/01 Bangkok
07/10 Shenzhen
08/04
Archamps
08/04
Archamps
06/01 Bangkok
04/03 München
N7510
05/07 Poznan
96
E
E
E
X
X
X
E
E
E
X
X
X
E
E
E
X
X
X
2nd M3W White Paper : Architecture
Tutorial on M3W
M3W White Paper : Multimedia Middleware
Architecture
M3W White Paper : Multimedia API
M3W White Paper : Component Model
M3W White Paper : Resource and Quality
Management
M3W White Paper : Component Download
M3W White Paper : Fault Management
M3W White Paper : System Integrity
Management
97
N8152
N8153
N8687
06/04 Montreux
06/04 Monreux
06/10 Hanzhou
N8688
N8689
N8690
06/10 Hanzhou
06/10 Hanzhou
06/10 Hanzhou
N8691
N8692
N8693
06/10 Hanzhou
06/10 Hanzhou
06/10 Hanzhou
28.3
Mailing Lists Reminder
Topic
General
Systems List
File Format
Scene
Representati
on
AF
AIT
MPEG-V
MXM
MPEG
Media
Transport
HTTP
Delivery
Information
Reflector : gen-sys@lists.uni-klu.ac.at
Subscribe: http://lists.uni-klu.ac.at/mailman/listinfo/gen-sys
Archive: http://lists.uni-klu.ac.at/mailman/private/gen-sys/
Reflector : mp4-sys@lists.uni-klu.ac.at
Subscribe: http://lists.uni-klu.ac.at/mailman/listinfo/mp4-sys
Archive: http://lists.uni-klu.ac.at/mailman/private/mp4-sys/
Reflector : mpeg-laser@lists.uni-klu.ac.at
Subscribe: http://lists.uni-klu.ac.at/mailman/listinfo/mpeglaser
Archive: http://lists.uni-klu.ac.at/pipermail/mpeg-laser/
Reflector : maf-sys@lists.uni-klu.ac.at
Subscribe: http://lists.uni-klu.ac.at/mailman/listinfo/maf-sys
Archive: http://lists.uni-klu.ac.at/mailman/private/maf-sys/
Reflector: jiptv@lists.uni-klu.ac.at
Subscribe: http://lists.uni-klu.ac.at/mailman/listinfo/jiptv
Archive: http://lists.uni-klu.ac.at/mailman/private/jiptv/
Reflector: metaverse@lists.uni-klu.ac.at
Subscribe: http://lists.uniklu.ac.at/mailman/listinfo/metaverse
Archive: http://lists.uni-klu.ac.at/mailman/private/metaverse/
Reflector: mxm@lists.uni-klu.ac.at
Subscribe: http://lists.uni-klu.ac.at/mailman/listinfo/mxm
Archive: http://lists.uni-klu.ac.at/mailman/listinfo/mxm
Reflector: mmt@tnt.uni-hannover.de
Subscribe: https://mailhost.tnt.unihannover.de/mailman/listinfo/mmt
Reflector: mmt@tnt.uni-hannover.de
Subscribe: https://mailhost.tnt.unihannover.de/mailman/listinfo/mmt
98
Kindly
Hosted
by
Klagenfur
t
University
Klagenfur
t
University
Klagenfur
t
University
Klagenfur
t
University
Klagenfur
t
University
Klagenfur
t
University
Klagenfur
t
University
University
of
Hannover
University
of
Hannover
28.4
Latest References and Publication Status
Reference on the ISO Web Site : http://www.itscj.ipsj.or.jp/sc29/open/29view/29n9270c.htm
Pr
Pt
Standard
No.
2
2
1
1
ISO/IEC 13818-1:2000 (MPEG-2 Systems 2nd Edition)
ISO/IEC 13818-1:2000/COR1 (FlexMux Descr.)
N3844
01/01 Pisa
2
1
ISO/IEC 13818-1:2000/COR2 (FlexMuxTiming_ descriptor)
N4404
01/12 Pattaya
2
1
ISO/IEC 13818-1:2000/Amd.1 (Metadata on 2) & COR1 on Amd.1
N5867
2
1
ISO/IEC 13818-1:2000/Amd.2 (Support for IPMP on 2)
N5604
03/07
Trondheim
03/03 Pattaya
2
1
ISO/IEC 13818-1:2000/Amd.3 (AVC Carriage on MPEG-2)
N5771
2
1
ISO/IEC 13818-1:2000/Amd.4 (Metadata Application CP)
N6847
2
1
ISO/IEC 13818-1:2000/Amd.5 (New Audio P&L Sig.)
N6585
2
1
ISO/IEC 13818-1:2000/COR3 (Correction for Field Picture)
N6845
04/07
Redmond
04/10 Palma
2
1
ISO/IEC 13818-1:2000/COR4 (M4MUX Code Point)
N7469
05/07 Poznan
2
1
ISO/IEC 13818-1:2000/COR5 (Corrections related to 3rd Ed.)
N7895
06/01
Bangkok
2
2
1
1
ISO/IEC 13818-1:2007 (MPEG-2 Systems 3rd Edition)
ISO/IEC 13818-1:2007/Amd.1 (Transport of Streaming text)
N8369
2
1
ISO/IEC 13818-1:2007/Amd.2 (Carriage of Auxialiry Video Data)
N8798
06/07
Klagenfurt
07/01
Marrakech
99
Date
Status
Doc. With
00/12
Published
Consolidate
d
Consolidate
d
Consolidate
d
Consolidate
d
Consolidate
d
Consolidate
d
Consolidate
d
Consolidate
d
Consolidate
d
Consolidate
d
Published
Published
N/A
N/A
ISO
Award
Proposed
N/A
N/A
N/A
N/A
Proposed
N/A
N/A
N/A
Proposed
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
TBP
TBP
Published
N/A
TBP
03/07
Trondheim
04/10 Palma
Purpose
2
1
ISO/IEC 13818-1:2007/Cor.1.2 (Reference to AVC Specification)
N9365
2
1
ISO/IEC 13818-1:2007/Cor.3
2
1
ISO/IEC 13818-1:2007/Amd.3 (SVC in MPEG-2 Systems)
2
1
ISO/IEC 13818-1:2007/Amd.3/Cor.1
2
1
ISO/IEC 13818-1:2007/Amd.4 (Transport of Multiview Video)
2
4
11
1
ISO/IEC 13818-1:2003 (IPMP on 2)
ISO/IEC 14496-1 (MPEG-4 Systems 1st Ed.)
N1093
7
N1005
8
N1093
8
N1074
5
N5607
N2501
4
1
ISO/IEC 14496-1/Amd.1 (MP4, MPEG-J)
N3054
99/12 Hawaii
4
1
ISO/IEC 14496-1/Cor.1
N3278
4
1
ISO/IEC 14496-1:2001 (MPEG-4 Systems 2nd Ed.)
N3850
00/03
Noordwijk.
01/01 Pisa
4
1
ISO/IEC 14496-1:2001/Amd.1 (Flextime)
4
1
ISO/IEC 14496-1:2001/Cor.1
N4264
01/07 Sydney
4
1
ISO/IEC 14496-1:2001/Cor.2
N5275
02/10 Shangai
4
1
ISO/IEC 14496-1:2001/Cor.3
N6587
4
1
ISO/IEC 14496-1:2001/Amd.2 (Textual Format)
N4698
4
1
ISO/IEC 14496-1:2001/Amd.3 (IPMP Extensions)
N5282
4
1
ISO/IEC 14496-1:2001/Amd.4 (SL Extension)
N5471
04/07
Redmond
02/03 Jeju
Island
02/10
Shanghai
02/12 Awaji
100
07/10
Shenzhen
09/10 Xian
Published
N/A
TBP
COR
ITTF
08/07
Hannover
09/10 Xian
Published
N/A
COR
ITTF
to be published
09/07 London
FDAM
ITTF
to be published
03/03 Pattaya
98/10 Atl. City
Published
Consolidate
d
Consolidate
d
Consolidate
d
Consolidate
d
Consolidate
d
Consolidate
d
Consolidate
d
Consolidate
d
Consolidate
d
Consolidate
d
Consolidate
N/A
N/A
Proposed
Done
N/A
Done
N/A
N/A
N/A
N/A
N/A
Done
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
to be published
4
1
ISO/IEC 14496-1:2001/Amd.7 (AVC on 4)
N5976
4
1
ISO/IEC 14496-1:2001/Amd.8 (ObjectType Code Points)
N6202
03/10
Brisbanne
03/12 Hawaii
4
1
ISO/IEC 14496-1:200x/Amd.1 (Text Profile Descriptors)
N7229
05/04 Busan
4
1
ISO/IEC 14496-1:200x/Cor4 (Node Coding Table)
N7473
05/07 Poznan
4
1
N5277
4
1
ISO/IEC 14496-1:200x/Amd.1 (Text Profile Descriptors)
N7229
02/10
Shanghai
05/04 Busan
4
1
ISO/IEC 14496-1:200x/Cor.1 (Clarif. On audio codec behavior)
N8117
4
1
ISO/IEC 14496-1:200x/Amd.2 (3D Profile Descriptor Extensions)
N8372
4
1
ISO/IEC 14496-1:200x/Cor.2 (OD Dependencies)
N8646
4
1
ISO/IEC 14496-1:200x/Amd.3 (JPEG 2000 support in Systems)
N8860
4
1
ISO/IEC 14496-1 (MPEG-4 Systems 4th Ed.)
4
1
ISO/IEC 14496-1:2010/Amd.1 (Usage of LASeR in MPEG-4 systems
and Registration Authority for MPEG-4 descriptors)
4
4
ISO/IEC 14496-1:200x/Amd.17 (ATG Conformance)
N1094
3
N1124
8
N8861
4
4
ISO/IEC 14496-1:200x/Amd.22 (AudioBIFS v3 conformance)
N9295
4
4
ISO/IEC 14496-1:200x/Amd.23 (Synthesized Texture conformance)
N9369
4
4
ISO/IEC 14496-1:200x/Amd.24 (File Format Conformance)
N9370
ISO/IEC 14496-1 (MPEG-4 Systems 3rd Ed.)
101
06/04
Montreux
06/07
Klagenfurt
06/10
Hangzhou
07/01
Marrakech
09/10 Xian
d
Consolidate
d
Consolidate
d
Consolidate
d
Consolidate
d
Consolidate
d
Consolidate
d
Consolidate
d
Consolidate
d
Consolidate
d
Consolidate
d
Published
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
Proposed
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
10/04 Dresden
Published
N/A
N/A
07/01
Marrakech
07/07
Lausanne
07/10
Shenzhen
07/10
Published
N/A
N/A
Published
N/A
N/A
Published
N/A
N/A
Published
N/A
N/A
4
Shenzhen
07/10
Shenzhen
08/04
Archamps
08/04
Archamps
09/07 London
4
ISO/IEC 14496-1:200x/Amd.25 (LASeR V1 Conformance)
N9372
4
4
ISO/IEC 14496-1:200x/Amd.26 (Open Font Format Conf.)
N9815
4
4
ISO/IEC 14496-1:200x/Amd.27 (LASeR Amd.1 Conformance)
N9816
4
4
ISO/IEC 14496-1:200x/Amd.37 (Additional File Format
Conformance)
4
4
4
4
4
4
5
5
5
6
8
11
N1075
0
N9020
N9672
N9674
4
11
ISO/IEC 14496-11/Amd.1 (AFX)
N5480
02/03 Jeju
05/01
HongKong
02/12 Awaji
4
11
ISO/IEC 14496-11/Amd.2 (Advanced Text and Graphics)
N6205
03/12 Hawaii
4
11
ISO/IEC 14496-11/Cor.1
N6203
03/12 Hawaii
4
11
ISO/IEC 14496-11/Cor.3 Valuator/AFX related correction N6594
4
11
ISO/IEC 14496-11/Amd.3 Audio BIFS Extensions
N6591
4
11
ISO/IEC 14496-11/Amd.4 XMT and MPEG-J Extensions
N6959
4
11
ISO/IEC 14496-11/Cor.3 (Audio BIFS Integrated in 3 rd Edition)
N7230
4
11
ISO/IEC 14496-11/Cor.5 (Misc Corrigendum)
N8383
ISO/IEC 14496-1:200x/Amd.12 (File Format)
ISO/IEC 14496-1:200x/Amd.16 (SMR Ref. Soft)
ISO/IEC 14496-1:200x/Amd.17 (LASeR Ref. Soft)
ISO/IEC 14496-6:2000
ISO/IEC 14496-8 (MPEG-4 on IP Framework)
ISO/IEC 14496-11 (MPEG-4 Scene Description 1st
Edition)
N4712
N6960
102
07/04 San Jose
08/01 Antalya
08/01 Antalya
04/07
Redmond
04/07
Redmond
05/01
HongKong
05/04 Busan
06/07
Klagenfurt
Published
N/A
N/A
Published
N/A
N/A
Published
N/A
N/A
Published
N/A
PDAM
PDAM
PDAM
Published
Published
Published
ITTF
ITTF
ITTF
N/A
N/A
N/A
Consolidate
d
Consolidate
d
Consolidate
d
Consolidate
d
Consolidate
d
Consolidate
d
Consolidate
d
Published
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
Proposed
N/A
N/A
N/A
N/A
N/A
N/A
to be published
to be published
to be published
N/A
N/A
N/A
N/A
Proposed
Proposed
4
11
4
4
11
11
ISO/IEC 14496-11/Amd.6 Scene Partitioning
4
11
ISO/IEC 14496-11/Amd.7 ExtendedCore2D Profile
4
12
ISO/IEC 14496-12 (ISO Base Media File Format)
N9021
N1024
7
N1125
1
N5295
4
12
ISO/IEC 14496-12/Amd.1 ISO FF Extension
N6596
4
12
ISO/IEC 14496-12/Cor.1 (Correction on File Type Box)
N7232
4
12
ISO/IEC 14496-12/Cor.2 (Miscellanea)
N7901
4
12
N8659
4
12
ISO/IEC 14496-12/Amd.1 (Description of timed
metadata)
ISO/IEC 14496-12/Cor.3 (Miscellanea)
N9024
06/01
Bangkok
06/10
Hangzhou
07/04 San Jose
4
12
ISO/IEC 14496-12/Amd.2 (Flute Hint Track)
N9023
07/04 San Jose
4
12
N
4
12
ISO/IEC 14496-12 (ISO Base Media File Format 3rd
edition)
ISO/IEC 14496-12/Cor.1
4
12
ISO/IEC 14496-12/Cor.2
4
12
4
12
ISO/IEC 14496-12/Amd.1 General improvements
including hint tracks, metadata support, and sample
groups
ISO/IEC 14496-12/Cor.3
ISO/IEC 14496-11/Amd.5 Symbolic Music
Representation
ISO/IEC 14496-11/Cor.6 (AudioFx Correction)
N8657
06/10
Hangzhou
07/04 San Jose
08/10 Busan
Published
N/A
TBP
Published
Published
N/A
N/A
N/A
TBP
10/04 Dresden
FDAM
SC29
Published
N/A
Proposed
Consolidate
d
Consolidate
d
Consolidate
d
Consolidate
d
Consolidate
d
Consolidate
d
Published
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
02/10
Shanghai
04/07
Redmond
05/04 Busan
N1025
0
N1044
1
N1058
0
08/10 Busan
COR
N/A
N/A
09/02
Lausanne
09/04 Maui
COR
N/A
N/A
FDAM
N/A
N/A
N1075
3
09/07 London
COR
N/A
N/A
103
4
13
ISO/IEC 14496-13 (IPMP-X)
N5284
4
14
ISO/IEC 14496-14:2003 (MP4 File Format)
N5298
4
14
ISO/IEC 14496-14:2003/Cor.1 (Audio P&L Indication)
N7903
4
14
4
15
ISO/IEC 14496-14:2003/Amd.1 Handling of MPEG-4
audio enhancement layers
ISO/IEC 14496-15 (AVC File Format)
N1113
8
N5780
4
15
ISO/IEC 14496-15/Amd.1 (Support for FREXT)
N7585
4
4
15
15
ISO/IEC 14496-15/Cor.1
ISO/IEC 14496-15/Cor.2 (NAL Unit Restriction)
N7575
N8387
4
4
15
15
ISO/IEC 14496-15/Amd.2 (SVC File Format Extension)
ISO/IEC 14496-15 (AVC File Format 2nd edition)
4
4
4
17
18
18
4
4
19
20
ISO/IEC 14496-17 (Streaming Text)
ISO/IEC 14496-18 (Font Compression and Streaming)
ISO/IEC 14496-18/Cor.1 (Misc. corrigenda and
clarification)
ISO/IEC 14496-19 (Synthesized Texture Stream)
ISO/IEC 14496-20 (LASeR & SAF)
N9682
N1113
9
N7479
N6215
N8664
4
20
4
N6217
N7588
N8666
20
ISO/IEC 14496-20/Cor.1 (Misc. corrigenda and
clarification)
ISO/IEC 14496-20/Amd.1 (LASeR Extension)
4
20
ISO/IEC 14496-20/Cor.2 (Profile Removal)
N9381
4
20
ISO/IEC 14496-20/Amd.2 (SVGT1.2 Support)
N9384
N9029
104
02/10
Shanghai
02/10
Shanghai
06/01
Bangkok
10/01 Kyoto
03/07
Trondheim
05/10 Nice
05/10 Nice
06/07
Klagenfurt
08/01 Antalya
10/01 Kyoto
05/07 Poznan
03/12 Hawaii
06/10
Hangzhou
03/12 Hawaii
05/10 Nice
06/10
Hangzhou
07/04 San Jose
07/10
Shenzhen
07/10
to be published
Proposed
IS
ITTF
Published
N/A
Proposed
Published
N/A
N/A
Published
N/A
Published
N/A
FDAM
ITTF
COR
COR
ITTF
ITTF
N/A
N/A
FDAM
Published
ITTF
N/A
N/A
Proposed
FDAM
Published
COR
ITTF
N/A
ITTF
TBP
Proposed
N/A
Published
Consolidate
d
Consolidate
d
Consolidate
d
Consolidate
d
Consolidate
N/A
N/A
Proposed
TBP
N/A
N/A
N/A
N/A
N/A
N/A
N/A
N/A
Proposed
Final Text
Editing
N/A
Shenzhen
2nd
4
4
4
20
20
20
ISO/IEC 14496-20 (LASeR & SAF
edition)
ISO/IEC 14496-20/Amd.1 SVGT1.2 support
ISO/IEC 14496-20/Amd.2 Adaptation
4
20
ISO/IEC 14496-20/Amd.3 PMSI
4
20
ISO/IEC 14496-20/Cor. 1
4
22
ISO/IEC 14496-22 (Open Font Format)
4
4
22
22
ISO/IEC 14496-22 (Open Font Format 2nd edition)
ISO/IEC 14496-20/Amd.3 PMSI
7
7
7
7
7
7
7
1
1
1
1
1
2
7
ISO/IEC 15938-1 (MPEG-7 Systems)
7
N
N
N1075
9
N1095
4
N1137
6
N8395
ISO/IEC 15938-7/Amd.2 (Fast Access Ext. Conformance)
N
N1095
4
N4285
N6326
N6328
N7490
N7532
N4288
N8672
12
ISO/IEC 15938-12 MPEG Query Format
N9830
7
12
ISO/IEC 15938-12/Cor.1
7
12
ISO/IEC 15938-12/Cor.2
21
21
5
9
ISO/IEC 21000-5 (Open Release Content Profile)
ISO/IEC 21000-9 (MPEG-21 File Format)
N1045
2
N1095
9
N9687
N6975
21
9
ISO/IEC 21000-9/Amd.1 (MPEG-21 Mime Type)
N9837
ISO/IEC 15938-1/Amd.1 (MPEG-7 Systems Extensions)
ISO/IEC 15938-1/Cor.1 (MPEG-7 Systems Corrigendum)
ISO/IEC 15938-1/Cor.2 (MPEG-7 Systems Corrigendum)
ISO/IEC 15938-1/Amd.2 (BiM extension)
ISO/IEC 15938-2 (MPEG-7 DDL)
105
09/07 London
d
Published
Published
Published
N/A
N/A
N/A
TBP
N/A
N/A
09/10 Xian
Published
N/A
N/A
COR
ITTF
N/A
Published
N/A
TBP
Published
Published
N/A
N/A
TBP
N/A
01/07 Sydney
04/03 Munich
04/03 Munich
05/07 Poznan
05/10 Nice
01/07 Sydney
06/10
Hangzhou
08/04
Archamps
09/02
Lausanne
09/10 Xian
Published
Published
Published
Published
Published
Published
Published
N/A
N/A
N/A
N/A
N/A
N/A
N/A
Done
N/A
N/A
N/A
N/A
Done
N/A
Published
N/A
N/A
Published
N/A
N/A
Published
N/A
N/A
08/01 Antalya
05/01
HongKong
08/04
Published
Published
N/A
N/A
TBP
Done
Published
N/A
Done
10/07 Geneva
06/07
Klagenfurt
09/10 Xian
21
15
ISO/IEC 21000-15 (Security in Event Reporting)
N9839
21
21
16
19
ISO/IEC 21000-16 (MPEG-21 Binary Format)
ISO/IEC 21000-19 (Media Value Chain Ontology)
A
4
ISO/IEC 23000-4 (Musical Slide Show MAF)
N7247
N1114
6
N9037
A
4
ISO/IEC 23000-4 (Musical Slide Show MAF 2nd Ed.)
N9843
A
4
A
4
A
A
6
6
A
A
7
7
A
8
ISO/IEC 23000-4 Amd.1 Conformance & Reference
Software
ISO/IEC 23000-4 Amd.2 Conformance & Reference
Software for Protected MSS MAF
ISO/IEC 23000-6 (Professional Archival MAF)
ISO/IEC 23000-6 Amd.1 Conformance and Reference
Software
ISO/IEC 23000-7 (Open Access MAF)
ISO/IEC 23000-7 Amd.1 Conformance and Reference
Software
ISO/IEC 23000-8 (Portabe Video AF)
A
9
ISO/IEC 23000-9 (Digital Multi. Broadcasting MAF)
N9397
A
9
N9854
A
9
A
10
ISO/IEC 23000-9/Cor.1 (Digital Multi. Broadcasting
MAF)
ISO/IEC 23000-9/Amd.1 (Conformance & Reference
SW)
ISO/IEC 23000-10 (Video Surveillance AF)
A
A
10
11
ISO/IEC 23000-11/Amd.1 Conformance & Reference SW N
ISO/IEC 23000-11 (Stereoscopic video MAF)
N9397
Archamps
08/04
Archamps
05/04 Busan
10/01 Kyoto
Published
N/A
TBP
Published
Publsihed
N/A
N/A
TBP
TBP
Consolidate
d
FDIS
N/A
TBP
ITTF
TBP
N
N/A
TBP
N
N/A
TBP
07/04 San Jose
08/04
Archamps
TBP
TBP
N
N
N9698
N
08/01 Antalya
FDIS
ITTF
TBP
TBP
N9853
08/04
Archamps
07/10
Shenzhen
08/04
Archamps
FDIS
ITTF
TBP
FDIS
ITTF
TBP
N
N9397
106
07/10
Shenzhen
07/10
Published
TBP
FDIS
SC29
TBP
FDIS
ITTF
TBP
Published
FDIS
ITTF
TBP
TBP
Shenzhen
A
A
B
B
11
12
1
1
B
1
B
1
B
1
B
B
E
ISO/IEC 23000-11/Cor.1
ISO/IEC 23000-11 (Interactive Music AF)
ISO/IEC 23001-1 (XML Binary Format)
ISO/IEC 23001-1/Cor.1 (Misc. Editorial and technical
clar.)
ISO/IEC 23001-1/Cor.2 (Misc. Editorial and technical
clar.)
ISO/IEC 23001-1/Amd.1 (Reference Soft. & Conf.)
N
N
N7597
N8680
N9296
2
3
1
ISO/IEC 23001-1/Amd.1 (Exten. On encoding of wild
cards)
ISO/IEC 23001-2 (Fragment Request Unit)
ISO/IEC 23001-3 (IPMP XML Messages)
ISO/IEC 23008-1 Architecture
E
2
ISO/IEC 23008-2 Multimedia API
N8893
E
3
ISO/IEC 23008-3 Component Model
N8894
E
4
ISO/IEC 23008-4 Ressource & Quality Management
N8895
E
E
E
E
M
5
6
7
8
1
ISO/IEC 23008-5 Component Download
ISO/IEC 23008-6 Fault Management
ISO/IEC 23008-7 System Integrity Management
ISO/IEC 23008-7 Reference Software
ISO/IEC 23006-1 Architecture and Technologies
M
2
ISO/IEC 23006-2 MXM API
M
3
ISO/IEC 23006-3 Reference Software
N9053
N9054
N9055
N
N1116
3
N1116
5
N1116
8
N9049
N8886
N9051
N9416
N8892
107
Published
Published
FDIS
COR
ITTF
ITTF
TBP
TBP
TBP
N/A
COR
ITTF
N/A
FDAM
ITTF
N/A
PDAM
ITTF
FDIS
FDIS
FDAM
ITTF
ITTF
ITTF
TBP
TBP
N/A
FDAM
ITTF
N/A
FDAM
ITTF
N/A
FDAM
ITTF
N/A
FDAM
FDAM
FDAM
ITTF
ITTF
ITTF
10/01 Kyoto
FDIS
Editor
N/A
N/A
N/A
N/A
N/A
10/01 Kyoto
FDIS
Editor
N/A
10/01 Kyoto
FDIS
Editor
N/A
05/10 Nice
06/10
Hangzhou
07/04 San Jose
07/01
Marrakech
07/07
Lausanne
07/04 San Jose
07/04 San Jose
07/01
Marrakech
07/01
Marrakech
07/01
Marrakech
07/01
Marrakech
07/04 San Jose
07/04 San Jose
07/04 San Jose
to be published
N/A
M
4
ISO/IEC 23006-4 MXM Protocols
U
1
ISO/IEC 23007-1 Widgets
N1117
0
N1125
6
108
10/01 Kyoto
FDIS
SC29
N/A
10/04 Dresden
FDIS
SC29
N/A
Annex G– Video report
Source: Jens Ohm and Gary Sullivan, Chairs
1 MPEG-2 video
During the July 2010 meeting, it had been requested to define a similar approach for MPEG-2
Video as was defined for AVC using an SEI message supporting "frame compatible" stereo video
with various frame packing arrangements. It was suggested to enable this capability as a user data
extension and to put a 4-byte ID after the user_data_start_code to avoid conflicts with existing
systems. To initiate further study on this, document N11462 was issued, and NBs were requested to
study and comment on the viability of this solution or to propose better solutions. Two comments
by NBs were received. From these, the concept of the solution of N11462 appears to be a viable
way, keeping compatibility with existing devices and simplifying the launch of new emerging
services in that area. To proceed to the next step, PDAM4 was issued. The plan is to proceed to
FPDAM 11/01, FDAM 11/07. The method of signaling allows left/right or top/bottom placement of
the two stereo views, and switching between mono and stereo within one stream.
It is planned to provide a new edition of MPEG-2 part 2 after finalization of this amendment.
Document(s) reviewed:
m18196
m18030
JNB comments on Resolution 2.2.2 (3D Signaling in MPEG-2
video)
USNB Contribution: WG11 resolution 2.2.2 consideration
Document(s) approved:
No.
Title
11612 Request for ISO/IEC 13818-2:2000/Amd.4
11613 Text of ISO/IEC 13818-2:2000/PDAM4 Frame Packing
Arrangement Signalling in MPEG-2 Video
Japan National Body
A. G. Tescher for USNB
TBP Available
N
10/10/15
N
10/10/29
2 MPEG-4 part 2 Software
During several periods of AHG work, various corrections had been provided to the MPEG-4 part 2
reference software, which is located in the MPEG SVN server. As more recently no new bugs had
been detected, it can be assumed that the software has reached a mature status. Therefore, it was
decided to modify the latest published version by an appropriate corrigendum.
Document(s) reviewed:
m17965
Ad Hoc on Maintenance of MPEG-4 Visual related Documents,
Reference Software and Conformance
Document(s) approved:
No.
Title
11614 Text of ISO/IEC 14496-5:2001/Amd.5 DCOR1
109
Yi-Shin Tung, Jens-Rainer Ohm
TBP Available
N
10/10/15
3 Development of AVC
The video subgroup held two discussion sessions together with VCEG (ITU-T Q6/SG16) to review
input documents related to AVC.
The main topics of discussion were
a) Potential corrigendum issues (follow-up of the defect report N11465 issued during the July
2010 meeting).
b) Establishment of a “Progressive High Profile”.
Regarding the corrigendum work, the most critical issue was the clarification of the positioning
indicators for the video data grid related to the AVC frame packing arrangement SEI message. A
solution was found that appears to be compatible with existing implementations, which was to
designate the (0,0) coordinate position as a special case to be treated as indicating the same
positioning as would be indicated by the positioning that had been shown in the example figures.
A need for clarification of the intended buffering capacity requirements for MVC was also
identified, discussed and agreed.
Regarding 2D content in 3D application deployments – it was remarked that this should be
considered for an amendment/extension activity rather than a corrigendum activity.
A new version of the defect report was issued, which was made publicly available. More editorial
work is needed on various items before it can be issued as DCOR.
Following the plan established at the Geneva meeting, a PDAM was issued for defining a new
Progressive High Profile. NB comments were requested on issues of level 4.2 restrictions and a
potential Progressive Main Profile. No technical changes were made relative to the previous WD.
Instead of defining a new conformance point via a new value of profile_idc, the specification uses
the constraint_set4_flag in a manner similar to its use for multiview/interlaced Stereo High profile
cases.
One contribution (m18337) requested for the Progressive High Profile to impose further restrictions
on level 4.2 and higher levels. Specifically, it was proposed that a maximum number of slices per
row of macroblocks be established, with a maximum value of two slices per row, and that no
predictions blocks smaller than 8x8 should be allowed. These issues require further study on their
possible impact and desirability; therefore NBs were requested by a resolution to comment. The
specification of a new, similarly-constructed “Progressive Main Profile” was also suggested in the
same contribution.
Document(s) reviewed:
m18197
m18031
m18337
JNB comments on N11465 about frame packing arrangement SEI message

Request: (0,0) should be compatible with figures
USNB Contribution: WG 11 resolution 3.3.2 consideration

Request: (0,0) should be "undefined" (in conflict with JNB) – a
compromise was drafted in which (0,0) would indicate "default"
positions as illustrated.

Various issues on MVC
Level Limits for Progressive High Profile

Restrict number of slices (2) in rows of macroblocks for level 4.2 and
higher

Restrict level 4.2 and higher to only 8x8 prediction blocks

Specify Progressive main profile as well
110
Japan National Body
A. G. Tescher for USNB
Aaron Wells
m18338
On the AVC frame packing arrangement SEI message

Sample aspect ratio

2D content in 3D signaling (comment: this should be an amendment
rather than corrigendum)

Grid position indicator

Semantics sentence placement
G. J. Sullivan (Microsoft)
m18339
Miscellaneous AVC (ITU-T H.264 | ISO/IEC 14496-10) errata issues
G. J. Sullivan (Microsoft),
H. Schwarz (Fraunhofer
HHI)
Document(s) approved:
No.
Title
11615 Defect report on ISO/IEC 14496-10:201x
11616 Text of ISO/IEC 14496-10:201x/PDAM1 Progressive High Profile
TBP Available
Y
10/10/31
N
10/10/15
4 MPEG-7 Visual
The main activities in MPEG-7 Visual technical work were related to the proceeding of software
and conformance amendments for video signature tools into FPDAM status, and issuing a Study of
PDAM 6 of 15938-8 (extraction and matching) related to video signature tools. As expressed by a
resolution of the Geneva meeting, software related to indexing for fast matching that had been
included in XM 39 was still to be provided. After this was done, the related technology was
included in the description of the Study document.
An AHG was appointed (N11626, as recorded in N11554) to further the work until the 95th
meeting. The mandates relate to maintenance of the various MPEG-7 Visual related parts.
Document(s) approved:
No.
Title
11617 DoC on ISO/IEC 15938-6:2001/PDAM4
11618 Text of ISO/IEC 15938-6:2001/FPDAM4 Reference Software for
Video Signature Tools
11619 DoC on ISO/IEC 15938-7:2001/PDAM6
11620 Text of ISO/IEC 15938-7:2001/FPDAM6 Conformance for
Video Signature Tools
11621 Study Text of ISO/IEC 15938-8/PDAM6 Extraction and
Matching of Video Signature Tools
TBP Available
N
10/10/15
N
10/11/07
N
N
10/10/15
10/11/07
N
10/11/07
5 Reconfigurable Video Coding (RVC)
The main activities in RVC were as follows:
Regarding MPEG-B part 4, a DCOR on ISO/IEC 23001-4 (N11622) was issued which contains
various editorial corrections and clarifications of mathematical operations. In addition, a first
Working Draft of ISO/IEC 23001-4/Amd.1 (N11623) was approved, which contains a more precise
specification of the computation model, RVC-CAL I/O extended support, definition of typing rules
for large number arithmetic and (non-normative) definition of FU dataflow classes, classification
rules and methods.
111
Regarding MPEG-C part 4, FDAM1 (software and conformance) was released. Besides Amd.2
Tools for AVC High Profile (progressive only) that is currently under PDAM ballot, WD of
ISO/IEC 23002-4/Amd.3 was issued which defines FUs for SVC Profiles (N11627).
Report of RVC Breakout Activity
5.1 RVC-related activity scheduling at the meeting
Room allocation for Video (SHAOGUAN) and RVC Breakout (3416) were arranged as shown:
Date
Topic
Room Time
Video Plenary (planning of this week)
Video
1230
Monday
RVC agenda time allocation approval
RVC
1530 – 1630
Joint meeting REQ on Type1 Licensing
Reqs
1630 - 1830
Reconfigurable Graphic Framework joint
3DG
9 – 1130
Tuesday
meeting
Status of Conformance Testing FPDAM1
RVC
1130 - 12
Review of input contributions
RVC
14-16
Review of input contributions
RVC
16-18
MPEG Plenary
Yellow
9-11
Wednesday
Review and approval of output documents RVC
17-18
Video Plenary
Video
17-19
Thursday
Friday
MPEG Plenary
5.2 Review of Input contributions
Scheduling and notes recorded for particular input contributions were as shown below:
Time
Document
No.
T 12 – 13
Title
Notes & Recommendations
Mickael Raulet ,
Matthieu Wipliez,
Jarome Gorin,
Nicolas Siret
Typing rules should be included in the WD of AMD1 of
ISO/IEC 23001-4
A new core experiment should be added studying the big
integers supports and their typing rules.
Junaid Jameel Ahmad
Crypto Tools Library (CTL): Applying
Shujun Li , Marco
RVC-CAL for Multimedia Security
Mattavelli , Matthieu
Applications
Wipliez , Mickael
Raulet
Contribution providing a first step towards the possibility to
embed encryption in codecs and generate encrypted bitstreams
that maintains conformance and syntax. Future contributions
will be provided to demonstrate it. The need of big integers in
RVC-CAL has been demonstrated.
A proposal of RVC-CAL extensions for
Jorn Janneck, Marco
Addendum: Specification of typing rules
for RVC-CAL
m18345
T 15 – 18
m18404
m18460
Authors
112
improved support of I/O processing
m18458
m18346
18547
Mattavelli, Shujun Li,
Mickael Raulet,
Matthieu Wipliez,
Johan Eker, Carl Von
Platen, Ghislain
Roquier, Ihab Amer,
Christophe Lucarz,
Pascal Faure, Junaid
Jameel Ahmad
Proposed extensions of RVC-CAL I/O support should be
included in the WD of AMD1 of ISO/IEC 23001-4.
Supporting tools should be updated so as to support the I/O
defined.
Writing dataflow networks components Jorn Janneck, Marco
(FUs) with different models of
Mattavelli, Shujun Li,
computations using RVC-CAL: a
Mickael Raulet,
tutorial
Matthieu Wipliez,
Johan Eker, Carl Von
Platen, Ghislain
Roquier, Ihab Amer,
Christophe Lucarz,
Pascal Faure, Junaid
Jameel Ahmad
The contribution presents theory and example of FU dataflow
classes using RVC-CAL. It proposes a core experiment that
aims at re-writing FUs that can be classified as belonging to a
more specialized class.
Mickael Raulet ,
Matthieu Wipliez
The contribution reports the classification of FUs in the RVC
toolbox. The classification is made automatically by a tool
embedded in Orcc environment. It is recommended to use it in
the core experiment proposed in m18458
Mariam Saleh,
Mickael Raulet , Ihab
RVC-CAL non BTYPE Version
Amer, Marco
Mattavelli
The contribution reports the results of conformance testing of
the non BType version of the decoder
Automatic classification of the FUs
An AHG was appointed (N11644, as recorded in N11554) to further the work until the 95th
meeting.
Document(s) approved:
No.
Title
11622 Text of ISO/IEC 23001-4/DCOR1
11623 Working Draft of ISO/IEC 23001-4/Amd.1
11624 Disposition of Comments on ISO/IEC 23002-4/FPDAM1
113
TBP
N
N
N
Available
10/10/15
10/10/15
10/10/15
11625
11627
11628
Text of ISO/IEC 23002-4/FDAM1 Software and Conformance
Testing for Video Tool Library
WD of ISO/IEC 23002-4/Amd.3 Functional Units for SVC
Profiles
Description of Core Experiments in RVC
N
10/10/22
N
10/10/15
N
10/10/15
6 Explorations – 3D Video
In preparation of a Call for Proposals, the 3D Video activity and associated breakout sessions
operated under the management of the Requirements subgroup.
Technical Document(s) reviewed (see AHG report M17968)
m18026
3DV/FTV EE result of coding experiment on Lovebird1
sequence
m18221
Depth Estimation Experiment with Poznan Hall
m18342
m18343
3DV EE4 Report on Book_arrival Sequence
3DV EE4 Report on Poznan_Street Sequence
m18349
3DV EE4 Results on Balloons sequence
m18354
3DV/FTV EE1 and EE4 report on Champagne Tower sequence
m18355
3DV/FTV EE4 report on Kendo sequence
m18356
3DV/FTV EE2 report on VSRS extrapolation
m18358
Update of the Draft Report on Experimental Framework for 3D
Video Coding
m18363
Results on 3DVC EE4 for Newspaper
m18367
A method of disparity vector prediction using depth map
m18369
m18395
3DV EE4 Results on Beergarden
3DV EE4 Results on Mobile
m18420
3DV EE4 Results on Pantomime sequence
m18511
m18512
3D Video Coding Results of Exploration Experiments (Book
Arrival)
Frame range extension of Poznan Street and Poznan Carpark
sequences (3DV/EE1)
3DV EE1 Results on Newspaper
EE1 Results on Cafe
m18513
3DV EE4 Results on Newspaper
m18514
m18515
Common-hole Filling for Boundary Noise Removal in VSRS
3DV EE4 Results on Cafe
m18421
m18506
114
Donggyu Sim, Woong Lim, GiMun Um, Won-Sik Cheong,
Namho Hur
Takanori Senoh, Kenji Yamamoto,
Ryutaro Oi, Yasuyuki Ichihashi,
Taiichiro Kurita,
Deliang Fu, Lu Yu
Deliang Fu, Lu Yu
Jaejoon Lee, Seok Lee, HoCheon
Wey, Du Sik Park
Masayuki Tanimoto, Toshiaki
Fujii, Mehrdad Panahpour Tehrani,
Menno Wildeboer
Masayuki Tanimoto, Toshiaki
Fujii, Mehrdad Panahpour Tehrani,
Menno Wildeboer
Masayuki Tanimoto, Toshiaki
Fujii, Mehrdad Panahpour Tehrani,
Menno Wildeboer
Menno Wildeboer (Nagoya Univ.),
Patrick Lopez (Technicolor), Dong
Tian (Mitsubishi), Yin Zhao
(Zhejiang Univ.), Cheon Lee
(GIST)
Pablo Carballeira , Julián
Cabrera, Fernando Jaureguizar,
Gianluca Cernigliaro, Juan Casal,
Narciso Garcia
Yoshiya Yamamoto, Tadashi
Uchiumi, Makoto Ohtsu, Junsei
Sato, Atsutoshi Shimeno
Patrick Lopez
fons bruls
Dmytro Rusanovskyy, Miska M.
Hannuksela
Gerhard Tech, Karsten Müller
Olgierd Stankiewicz , Krzystof
Wegner
Cheon Lee, Yo-Sung Ho
Cheon Lee, Yo-Sung Ho
Min-Koo Kang, Cheon Lee, YoSung Ho
Cheon Lee, Yo-Sung Ho
Min-Koo Kang, Cheon Lee, Yo-
m18518
A Depth Compression for MVD View Synthesis
m18519
Enhancement of Depth Estimation
m18520
Stereoscopic video coding with asymmetric luminance and
chrominance qualities
m18535
Depth map enhancement for view synthesis
m18543
3DV EE1 Results on Poznan_Carpark• Test Sequence
m18552
Experimental Results for Inter-View Direct Mode with
Decompressed Disparity Maps for Multiview Video Coding
Sung Ho
Qiuwen Zhang, Ping An, Yan
Zhang, Ran Ma, Qian Zhang,
Zhaoyang Zhang
Qiuwen Zhang, Ping An, Qian
Zhang, Yan Zhang, Zhaoyang
Zhang
Feng Shao, Gangyi Jiang, Mei Yu,
Xu Wang
Qian Zhang, Ping An, Qiuwen
Zhang, Ran Ma, Zhaoyang Zhang
Qifei Wang , Kun Xu , Qian Ma ,
Xiangyang Ji , Qionghai Dai
Jacek Konieczny , Marek
Domanski
Document(s) approved:
No.
Title
11630 Description of Exploration Experiments in 3D Video Coding
11631 Report on Experimental Framework in 3D Video Coding
TBP Available
N
10/10/29
Y
10/10/15
7 Explorations – Scalable resolution enhancement of frame-compatible stereo
video
In continuation of the work begun at the previous meeting for evaluating the potential need for
scalable resolution enhancement of frame-compatible stereo video, a number of activities took place
on this subject in Guangzhou at this meeting. This work should primarily be considered to fall
under the Requirements subgroup at this stage, and although a summary of the work on this topic is
also provided here, the Requirements report should be reviewed for further information on this
activity.
An ad hoc group had met on the day preceding the WG 11 meeting. The AHG had recommended
for WG 11 to
 Acquire more common set of test sequences. The "Beer garden" sequence is available but
has asymmetric source quality due to precompression. Dolby is expected to provide
sequences to MPEG by the end of this year.
 Further clarify and fine tune the test conditions (e.g. bit rates) that can be used with the
common test sequences to evaluate the performance of existing as well as possible new
coding tools that can provide reasonably higher coding efficiency than the existing tools in
MPEG-4 Part 10,
 Create anchors with the target of issuing CfP in the next WG11 meeting if the established
test conditions show that the performance of existing tools can be improved significantly.
It was agreed that AVC base layer compatibility is required – at least it must support that. It is not
yet known whether frame-compatible MPEG-2 will be deployed in large numbers.
It was remarked regarding m18478 that the wording "The new codec shall not introduce significant
degradation" may not be fully appropriate, and it was suggested to reword this to something more
describing having a "visually acceptable quality base layer quality".
A break-out activity was coordinated by W. Husak & A. Tourapis during the meeting to:
 discuss test conditions
 prepare use case & requirements output doc
115


prepare toward fulfilling the recommendations of the ahg
plan for establishing new ahg
In M18541, Panasonic reported on Subjective Picture Quality Evaluation of MVC Stereo High
Profile for Full-Resolution Stereoscopic HD Video Applications. Panasonic tested the use of the
Stereo High profile with a 12 Mbps first view and a varying bit rate for the 2nd view, relative to the
base layer with a 1920x1080 shutter-glass frame-sequential 103" display. Confidence intervals
overlapped for 25-50% supplemental bit rate, and 50% supplemental bit rate scored slightly higher
than 100% added bit rate simulcast. For animation, a lower second-view relative bit rate seemed
acceptable. For equal total bit rate, it seemed better to allocate 1/3 of those bits (50% relative to the
main view) than to allocate substantially fewer (15% relative to the main view). Interlaced video
appeared to need a higher bit rate than progressive scan video. These subjective results seemed
better than the objective results reported in JVT-AE022, in terms of indicating the visual coding
efficiency of the MVC Stereo High profile design.
The results of the work and plans for future activity were agreed to be reflected in two output
documents, one describing requirements for MPEG Frame-Compatible Enhancement (MFC), and
one describing plans for the test conditions and evaluation process for MFC. And editing period was
authorized for preparation of the latter document.
An Ad Hoc group was established to further the work until the Daegu meeting.
Relevant input documents reviewed:
m17981
m18425
m18427
m18446
m18475
m18478
AHG report on scalable resolution enhancement of frame-compatible
stereo video
Philips (in coop with 3D4YOU) response to request for HQ stereo content
for MFC AHG
MFC initial results & findings
Compression Efficiency Evaluation of Frame-Compatible Stereoscopic
Video Coding with Resolution Enhancement
Comments on Scalable Resolution Enhancement of Frame-Compatible
Stereoscopic 3D Video
Use cases and requirements for scalable resolution enhancement of framecompatible stereoscopic 3D video
m18486
On scalable resolution enhancement of frame-compatible stereoscopic 3D
video
m18500
Comments on scalable enhancement of frame compatible stereo 3D
m18521
Comments on Scalable resolution enhancement of frame-compatible
stereoscopic 3D video
m18541
Subjective Picture Quality Evaluation of MVC Stereo High Profile for
Full-Resolution Stereoscopic HD Video Applications
Fons Bruls
Fons Bruls
Thomas Rusert , Per
Fröjdh
Ying Chen , Marta
Karczewicz
Walt Husak, Ajay
Luthra , Gary Sullivan
Koohyar Minoo , Vivian
Kung, David Baylon, Krit
Panusopone, Ajay Luthra
Teruhiko Suzuki , Ikuo
Tsukagoshi
Kyong-Sok Seo, ByeongDoo Choi, Dae-Sung Cho
Tao Chen, Yoshiichiro
Kashiwagi, Takahiro
Nishi
Document(s) approved:
No.
Title
TBP Available
11681 Requirements for MPEG Frame-Compatible Enhancement (MFC) N
10/10/15
11682 Test Conditions and Evaluation Process for MFC
N
10/10/29
116
Annex H– JCT report
Source: Jens Ohm and Gary Sullivan, Chairs
Summary
The Joint Collaborative Team on Video Coding (JCT-VC) of ITU-T WP3/16 and ISO/IEC JTC
1/SC 29/WG 11 held its third meeting during 7-15 October, 2010 at the Guangzhou Baiyun
International Convention Center – Oriental International Convention Hotel, in Guangzhou, China.
The JCT-VC meeting was held under the chairmanship of Dr. Gary Sullivan (Microsoft/USA) and
Dr. Jens-Rainer Ohm (RWTH Aachen/Germany).
The JCT-VC meeting sessions began at approximately 9:00 a.m. on Thursday 7 October. Meeting
sessions were held on all days (including weekend days) until the meeting was closed at
approximately 1:30 p.m. on Friday 15 October. Approximately 244 people attended the JCT-VC
meeting, and approximately 300 input documents were discussed. The meeting took place in a colocated fashion with a meeting of ISO/IEC JTC 1/SC 29/WG 11 (MPEG) – one of the two parent
bodies of the JCT-VC. The subject matter of the JCT-VC meeting activities consisted of work on
the new next-generation video coding standardization project now referred to as High Efficiency
Video Coding (HEVC).
The primary goals of the meeting were to review the work that was performed in the interim period
since the second JCT-VC meeting in implementing the Test Model under Consideration (TMuC),
review results from Tool Experiments (TE), review technical input documents, establish a first
Working Draft and HEVC Test Model (HM), and plan a set of Core Experiments (CE) for further
investigation of proposed technology.
The JCT-VC produced five important output documents from the meeting: A document describing
the process of test model development for HEVC, the HEVC Test Model 1 (HM1), the HEVC
specification Working Draft 1 (WD1), a document describing software guidelines and a document
providing an overview about the HM1 and WD1 adoptions. Furthermore, 13 documents describing
the planning of CEs were drafted.
For the organization and planning of its future work, the JCT-VC established thirteen "Ad Hoc
Groups" (AHGs) to progress the work on particular subject areas. The next JCT-VC meetings will
be held during 20-28 January 2011 in Daegu, Korea under the auspices of JTC 1/SC 29/WG 11, and
during 15-23 March 2011 in Geneva, Switzerland under the auspices of ITU-T Q6/16.
A new document distribution site http://phenix.it-sudparis.eu/jct/ became operational shortly before
the Guangzhou meeting and was exclusively used for distribution of all documents. The new site
allows a more automated process of document registration and download. Further improvements are
expected in simplified bulk download and particularly for selective download of updated documents.
The reflector to be used for discussions by the JCT-VC and all of its AHGs is the JCT-VC reflector:
jct-vc@lists.rwth-aachen.de. For subscription to this list, see
http://mailman.rwth-aachen.de/mailman/listinfo/jct-vc.
1
1.1
Administrative topics
Organization
The ITU-T/ISO/IEC Joint Collaborative Team on Video Coding (JCT-VC) is a group of video
coding experts from the ITU-T Study Group 16 Visual Coding Experts Group (VCEG) and the
ISO/IEC JTC 1/ SC 29/ WG 11 Moving Picture Experts Group (MPEG). The parent bodies of the
JCT-VC are ITU-T WP3/16 and ISO/IEC JTC 1/SC 29/WG 11.
117
The Joint Collaborative Team on Video Coding (JCT-VC) of ITU-T WP3/16 and ISO/IEC JTC
1/SC 29/WG 11 held its third meeting during 7-15 October, 2010 at the Guangzhou Baiyun
International Convention Center – Oriental International Convention Hotel, in Guangzhou, China.
The JCT-VC meeting was held under the chairmanship of Dr. Gary Sullivan (Microsoft/USA) and
Dr. Jens-Rainer Ohm (RWTH Aachen/Germany).
1.2
Meeting logistics
The JCT-VC meeting sessions began at approximately 9:00 a.m. on Thursday 7 October. Meeting
sessions were held on all days (including weekend days) until the meeting was closed at
approximately 1:30 p.m. on Friday 15 October. Approximately 244 people attended the JCT-VC
meeting, and approximately 300 input documents were discussed. The meeting took place in a colocated fashion with a meeting of ISO/IEC JTC 1/SC 29/WG 11 (MPEG) – one of the two parent
bodies of the JCT-VC. The subject matter of the JCT-VC meeting activities consisted of work on
the new next-generation video coding standardization project now referred to as High Efficiency
Video Coding (HEVC).
Information regarding logistics arrangements for the meeting had been provided at
http://avslab.org/news/mpeg.
1.3
Primary goals
The primary goals of the meeting were to review the work that was performed in the interim period
since the second JCT-VC meeting in implementing the Test Model under Consideration (TMuC),
review results from Tool Experiments (TE), review technical input documents, and establish a first
Working Draft and HEVC Test Model (HM).
1.4 Documents
The documents of the JCT-VC meeting are listed in Annex A of this report. The documents can be
found at http://phenix.it-sudparis.eu/jct/.
The formal deadline for registering and uploading non-administrative contributions was October 1,
2010. Some confusion over document handling occurred as the result of the transition to a new
electronic document archive site just prior to the meeting.
Documents with numbers JCTVC-C295 through JCTVC-C320 were all registered after October 2
and uploaded after October 3 (based on the timestamps of the document archive site, which appear
to correspond to the local time zone of Paris, France), of which only JCTVC-C296, JCTVC-C297,
JCTVC-C299, JCTVC-C301, JCTVC-C302 and JCTVC-C303 were available by the first meeting
day. Four of these (JCTVC-C310, JCTVC-C317, JCTVC-C319, and JCTVC-C320) were summary
reports or side-activity reports generated during the meeting and thus should not necessarily be
considered late. The majority of the others were cross-verification reports.
Among the other documents, JCTVC-C030, JCTVC-C068, JCTVC-C114, JCTVC-C165, JCTVCC174, JCTVC-C175, JCTVC-C178, JCTVC-C179, JCTVC-C180, JCTVC-C181, JCTVC-C182,
JCTVC-C225, JCTVC-C231, JCTVC-C244, JCTVC-C247, JCTVC-C248, JCTVC-C287, JCTVCC288 and JCTVC-C292 had been uploaded after October 3 (based on the timestamps of the
document archive site, which appear to correspond to the local time zone of Paris, France), but had
been uploaded in some form by the first day of the meeting. Documents JCTVC-C272 and JCTVCC284 were missing prior to the meeting and were provided by the third meeting day during the
meeting. Documents JCTVC-C243, JCTVC-C308, and JCTVC-C309, all of which were crossverification experiment reports, were only uploaded substantially after the meeting had ended.
Registration timestamps, initial upload timestamps, and final upload timestamps are listed in Annex
A of this report. These timestamps reflect activity on the new document site, and some documents
had been uploaded (and in some cases, made available for distribution) on the prior document site
before being placed on the new document site. The timestamps of the document archive site appear
to correspond to the local time zone of Paris, France.
118
Documents JCTVC-C289, JCTVC-290 and JCTVC-291 were available, but were initially missing
an IPR statement.
As a general policy, missing documents were not to be presented, and late documents (and
substantial revisions) could only be presented when sufficient time for studying was given after the
upload. This policy was not strictly enforced at this meeting in a few cases – partly because of the
transition to the use of a new document archive site and partly because there were network access
difficulties during the meeting – although some late documents were not presented due to lack of
time for their presentation. Otherwise, there were no objections raised by the group regarding
presentation of late contributions. The situation about missing input documents must be
significantly improved in upcoming meetings.
An issue discussed at the meeting was what to do in the case of a submission of a "placeholder"
contribution document that is basically empty of content, with perhaps only a brief abstract and
some indication of an intent to provide a more complete submission as a revision. It was agreed that
such submissions do not seem to be constructive, and should be rejected if there is a way to do this
in the document handling interface; as such submissions seem to be an attempt to avoid a
submission being identified as late while not providing adequate content for study by the
participants.
Contribution document registrations JCTVC-C069, JCTVC-C107, JCTVC-C133, JCTVC-C151,
JCTVC-C177, JCTVC-C221, JCTVC-C222, JCTVC-C242, and JCTVC-C294 were withdrawn (no
documents had been provided, or an incomplete document had been provided, or the registration to
JCT-VC had been made in error when intending to submit a document to the WG11 parent body
instead).
The report documents of the previous meeting, particularly the meeting report JCTVC-B200, the
draft TMuC encoder description JCTVC-B204 and decoder specification JCTVC-B205 were
approved. Draft versions 6 and 7 of JCTVC-B205 were made available by Saturday.
1.5 Attendance
The list of participants in the JCT-VC meeting can be found in Annex B of this report.
The meeting was open to those qualified to participate either in ITU-T WP3/16 or ISO/IEC
JCT1/SC29/WG11 (including experts who had been personally invited by the Chairs as permitted
by ITU-T or ISO/IEC policies).
Participants had been reminded of the need to be properly qualified to attend. Those seeking further
information regarding qualifications to attend future meetings may contact the Chairs.
1.6 Agenda
The agenda for the meeting was as follows:

IPR policy reminder and declarations

Contribution document allocation

Reports of Ad Hoc group activities

Reports of Tool Experiment activities

Review of results of previous meeting

Consideration of contributions and communications on HEVC project guidance

Consideration of HEVC technology proposal contributions

Consideration of information contributions

Coordination activities
119

Future planning: Determination of next steps, discussion of working methods,
communication practices, establishment of coordinated experiments, establishment of
AHGs, meeting planning, refinement of expected standardization timeline, other planning
issues

Other business as appropriate for consideration
1.7 IPR policy reminder
Participants were reminded of the IPR policy established by the parent organizations of the JCT-VC
and were referred to the parent body web sites for further information. The IPR policy was
summarized for the participants.
The ITU-T/ITU-R/ISO/IEC common patent policy shall apply. Participants were particularly
reminded that contributions proposing normative technical content shall contain a non-binding
informal notice of whether the submitter may have patent rights that would be necessary for
implementation of the resulting standard. The notice shall indicate the category of anticipated
licensing terms according to the ITU-T/ITU-R/ISO/IEC patent statement and licensing declaration
form. Contributions of software source code for incorporation into the Reference Software for the
standard shall be provided with a suitable copyright disclaimer header text in a form acceptable to
the parent bodies to enable publication of the source code and to enable users of the software to
copy the software and use it for research and standardization purposes and as a basis for the
development of products.(while the submitter separately retains any associated patent rights for
licensing to be conducted outside of ITU-T/ITU-R/ISO/IEC).
This obligation is supplemental to, and does not replace, any existing obligations of parties to
submit formal IPR declarations to ITU-T/ITU-R/ISO/IEC.
Participants were also reminded of the need to formally report patent rights to the top-level parent
bodies (using the common reporting form found on the database listed below) and to make verbal
and/or document IPR reports within the JCT-VC as necessary in the event that they are aware of
unreported patents that are essential to implementation of a standard or of a draft standard under
development.
Some relevant links for organizational and IPR policy information are provided below:

http://www.itu.int/ITU-T/ipr/index.html (common patent policy for ITU-T, ITU-R, ISO,
IEC and guidelines and forms for formal reporting to the parent bodies)

http://ftp3.itu.int/av-arch/jctvc-site (JCT-VC contribution templates)

http://www.itu.int/ITU-T/studygroups/com16/jct-vc/index.html (JCT-VC founding charter)

http://www.itu.int/ITU-T/dbase/patent/index.html (ITU-T IPR database)
 http://www.itscj.ipsj.or.jp/sc29/29w7proc.htm (SC29 Procedures)
The chairs invited participants to make any necessary verbal reports of previously-unreported IPR
in draft standards under preparation, and opened the floor for such reports: No such verbal reports
were made.
It was mentioned that further discussions about an appropriate software disclaimer header text are
currently conducted between JCT-VC and parent body management, as discussed in JCTVC-C001.
To proceed on this, JCT-VC issued the following resolution to the WG11 parent body: "The JCTVC has requested review of the reference software copyright handling status reported from the JCTVC chairs as found in N11645 [a document produced for WG11 corresponding to the content of
JCTVC-C001]. NBs are requested to comment on the suitability of the proposed approaches."
1.8 Communication practices
The documents for the meeting can be found at http://phenix.it-sudparis.eu/jct/. This is a new site
that the group transitioned to using just prior to the meeting. Previously, JCT-VC documents had
120
been made available at http://ftp3.itu.int/av-arch/jctvc-site. The earlier uploaded documents for the
meeting were put on the prior site until the transition to using the new site was announced.
JCT-VC email lists are managed through the site http://mailman.rwthaachen.de/mailman/options/jct-vc, and to send email to the reflector, the email address is jctvc@lists.rwth-aachen.de. Only members of the reflector can send email to the list.
It was emphasized that reflector subscriptions and email sent to the reflector must use their real
names when subscribing and sending messages and must respond to inquiries regarding their type
of interest in the work.
For the case of TE/CE documents and AHGs, email addresses of participants and contributors may
be obscured or absent (and will be on request), although these will be available (in human readable
format – possibly with some "obscurification") for primary TE/CE coordinators and AHG chairs.
1.9 Terminology
Some terminology used in this report is explained below:

AHG: Ad hoc group

AIS: Adaptive intra smoothing.

ALF: Adaptive loop filter

AVC: Advanced video coding – the video coding standard formally published as ITU-T
Recommendation H.264 and ISO/IEC 14496-10.

BD: Bjøntegaard-delta – a method for measuring percentage bit rate savings at equal PSNR
or decibels of PSNR benefit at equal bit rate (e.g., as described in document VCEG-M33 of
April 2001).

BR: Bit rate.

CABAC: Context-adaptive binary arithmetic coding.

CBF: Coded block flags.

CE: Core experiment – a coordinated experiment conducted after the 3rd meeting.

DT: Decoding time.

ET: Encoding time.

HE: High efficiency – a set of coding capabilities designed for enhanced compression
performance (contrast with LC). Often loosely associated with RA.

HEVC: High Efficiency Video Coding – the video coding standardization initiative under
way in the JCT-VC.

HM: HEVC Test Model – a video coding design containing selected coding tools that
constitutes our draft standard design – now also used especially in reference to the (nonnormative) encoder algorithms (see WD and TM).

IBDI: Internal bit-depth increase – a technique by which lower bit depth (8 bits per sample)
source video is encoded using higher bit depth signal processing, ordinarily including higher
bit depth reference picture storage (ordinarily 12 bits per sample).

JM: Joint model – the primary software codebase developed for the AVC standard.

LC: Low complexity – a set of coding capabilities designed for reduced implementation
complexity (contrast with HE). Often loosely associated with LD.

LCEC: Low-complexity entropy coding.
121

LD: Low delay – a set of coding conditions designed to enable interactive real-time
communication, with less emphasis on ease of random access (contrast with RA). Often
loosely associated with LC.

LUT: Look-up table.

MC: Motion compensation.

MDDT: Mode-dependent directional transform.

MRG: block merging mode for CUs.

MV: Motion vector.

PCP: Parallelization of context processing.

PIPE: Probability interval partitioning entropy coding (roughly synonymous with V2V for
most discussion purposes, although the term PIPE tends to be more closely associated with
proposals from Fraunhofer HHI while the term V2V tends to be more closely associated
with proposals from RIM).

QP: Quantization parameter.

RA: Random access – a set of coding conditions designed to enable relatively-frequent
random access points in the coded video data, with less emphasis on minimization of delay
(contrast with LD). Often loosely associated with HE.

R-D: Rate-distortion.

RDO: Rate-distortion optimization.

RDOQ: Rate-distortion optimized quantization.

ROT: Rotation operation for low-frequency transform coefficients.

RQT: Residual quadtree.

TE: Tool Experiment – a coordinated experiment conducted after the 1st or 2nd JCT-VC
meeting.

TM: Test Model – a video coding design containing selected coding tools; as contrasted
with the TMuC, see HM.

TMuC: Test Model under Consideration – a video coding design containing selected
proposed coding tools that are under study by the JCT-VC for potential inclusion in the
HEVC standard.

TPE: Transform precision extension.

UDI: Unified directional intra.

Unit types:
o CU: coding unit.
o LCTU: largest coding tree unit.
o PU: prediction unit.
o TU: transform unit.

V2V: variable-length to variable-length prefix coding (roughly synonymous with PIPE for
most discussion purposes, although the term PIPE tends to be more closely associated with
122
proposals from Fraunhofer HHI while the term V2V tends to be more closely associated
with proposals from RIM).

WD: Working draft – the draft HEVC standard corresponding to the HM.
1.10 Liaison activity
The JCT-VC did not send or receive formal liaison communications at this meeting.
1.11
Contribution topic overview
The approximate subject categories and quantity of contributions per category for the meeting was
summarized and is provided as follows.

AHG reports and TE summary reports (10)

TE1: Decoder-side motion derivation (10)

TE2: IBDI and memory compression (14)

TE3: Inter prediction (14)

TE4: Variable length coding (3)

TE5: Simplified TMuC intra prediction (5)

TE6: Intra prediction (18)

TE7: Alternative transforms / MDDT Simplification (19)

TE8: Parallel entropy coding (10)

TE9: Large block structures (4)

TE10: In-loop filtering (20)

TE11: Motion vector coding (12)

TE12: Evaluation of TMuC tools (79)

Project planning and test model establishment (14)

TMuC settings and common test conditions (4)

Application-specific topics (4)

Loop filtering (5)

Block structures and partitioning (12)

Motion compensation and interpolation filters (10)

Motion vector coding (2)

B picture reference list redundancy (2)

Quantization control (1)

Entropy coding (13)

Intra prediction (24)

Transforms and residual coding (10)

IBDI and memory compression (3)
123

1.12
Complexity analysis (2)
Video sequence viewing during the meeting
The following informal viewing sessions were held on Saturday in Room 3410 (4th floor, building
3) during the meeting:

From 10 am to 2 pm, viewing of TMuC-encoded video with Super High Vision resolution
(cropped areas).
 From 2 pm to 5 pm, viewing of in-loop filtering TE video.
In a resolution conveyed to the WG11 parent body, the JCT-VC thanked Prof. Liang Fan of Sun
Yat-sen University, Mr. Taichiro Shiodera of Toshiba, and Mr. Kenta Senzaki of NEC for
assistance with video viewing experiment during the 3rd meeting of the JCT-VC.
2 AHG reports
The activities of ad hoc groups that had been established at the prior meeting are discussed in this
section.
2.1.1.1.1.1.1.1.1
JCTVC-C001 AHG report: JCT-VC project management [G. J. Sullivan, J.-R.
Ohm (co-chairs)] (missing prior, available first day)
This document reported on the work of the JCT-VC ad hoc group on Project Management.
The work of the JCT-VC overall had proceeded well in the interim period. A large amount of
discussion was carried out on the group email reflector. All report documents from the preceding
meeting had been made available. The various ad hoc groups and tool experiments had made
progress and various reports from those activities had been submitted.
One key topic discussed at the management level in the interim period was the need to establish an
appropriate copyright status for the test model and reference software being developed by the JCTVC, as noted in the JCT-VC Terms of Reference. The intent is for the software to be developed as
part of the work to develop the HEVC standard and also for it to be published as reference software
by ITU-T and ISO/IEC.
In regard to the software copyright issue, it was suggested to conclude as follows:

That the software copyright management should be established in a manner that protects the
patent rights of contributors (which are subject to the ITU-T/ITU-R/ISO/IEC Common
Patent Policy) and minimizes any concerns regarding avoiding any other liabilities for
contributors and users.

In particular, it has been expressed that the scope of copyright rights to use of the software
should not be substantially constrained.

The MXM form of the BSD license as quoted above from WG11 N10791 may be a good
candidate language to consider, and nearly all of those who have contributed to the software
thus far have agreed to allow its use. The only reservation that has been expressed by any
contributors regarding the use of the MXM form of the BSD license is a further desire to
ensure full clarity that no patent rights are granted by the availability of the software.

If the MXM form of the BSD license cannot be used for some reason, an alternative
candidate language has been provided herein for consideration.

As it does not currently seem feasible to immediately establish a particular language
declaring the copyright status of the software, it must be understood by all contributors that
the JCT-VC and its parent bodies plan to establish such a statement and that the act of
contributing to the software involves agreeing to allow the group to do so (in a manner
consistent with addressing the concerns stated above to the extent feasible).
124
These recommendations were reviewed and the intent was agreed. Consideration by the parent
bodies was suggested, and the topic was addressed in a resolution produced as an output of the
meeting.
2.1.1.1.1.1.1.1.2
JCTVC-C002 AHG report: Test Model under Consideration (TMuC) editing
[K. McCann (chair), M. Karczewicz, J. Ridge, S. Sekiguchi, T. Wedi, T. Wiegand
(vice-chairs)] (missing prior, uploaded on first day)
Following the Geneva meeting, this AHG began the editorial work on the TMuC decoder
description using JCTVC-B205 draft000 as the initial draft. This was identical to JCTVC-A205
draft007, the final draft from the Dresden meeting, but with all changes accepted. Seven additional
drafts of this output document were produced in the interim period. The last two of these drafts
were produced just as the meeting approached, and had not yet been made available for download
until after the 3rd meeting began.
There was no initial draft from the Geneva meeting for JCTVC-B204, the corresponding encoding
document, which would also provide some tutorial information to explain how TMuC works. Shunichi Sekiguchi led the production of this document.
The recommendations of the TMuC Editing AHG were to:

Approve the edited JCTVC-B204 (TMuC encoder) and JCTVC-B205 (TMuC decoder)
documents as JCT-VC outputs

Continue to edit the TMuC encoder and decoder documents to ensure that all agreed
elements of the TMuC are fully described

Compare the TMuC documents with the TMuC software and resolve any discrepancies that
may exist

Improve the editorial consistency by ensuring that all “normative-style” text is in the
decoder document, which may be referred to by the encoder document

Encourage more people to volunteer to contribute text to the TMuC encoder document

Consider generating a new pair of documents to describe the future Test Model, assuming
that the TMuC continues to exist and evolve in parallel with the Test Model
The further discussion of this report included the following remarks:

We need to work to resolve any discrepancies between document and software

There should be no normative phrasing style in the encoder description document (and not
an excessive amount of informal informative remarks in the decoder specification either)

The establishment of a TM may be an opportunity to increase the level of editorial
discipline.
2.1.1.1.1.1.1.1.3
JCTVC-C003 AHG report: Software development and TMuC software
technical evaluation [F. Bossen (chair), P. Chen, D. Flynn, W.-J. Han, K. Sühring,
H. Schwarz, K. Ugur (vice-chairs)] (missing prior, uploaded during meeting)
Initially, a non-final draft of this AHG report was discussed. The final report was submitted later
during the meeting.
The activities of the AHG included integration of missing tools into a common code base,
resolution of compatibility issues between tools, bug fixes, etc. A near-complete implementation of
TMuC had become available.
A brief summary of activities related to each mandate is given below.

Development of the software was coordinated with the parties needing to integrate changes.
Several tracks were typically pursued in parallel to speed up development. The distribution
125
of the software was made through the SVN servers set up at HHI and BBC, as announced on
the jct-vc reflector. No documentation was produced at this time.

Version 0.6 of the software was delivered according to schedule. Due to the presence of a
number of known bugs in version 0.6, a version 0.7 was delivered a few days later with a
number of fixes and was recommended as the base version to be used for experiments.
Reference encodings were generated using version 0.7 by several parties and distributed
through an ftp server set up at the BBC.

As of version 0.8.1, almost all tools included in TMuC had been integrated into a common
code base. Missing tools were:


Motion vector scaling (Qualcomm). Qualcomm indicated that they no longer wished
to proceed with this integration.

Geometric partitions (Qualcomm). This tool had been integrated into the 0.8qualcomm branch but has not yet been merged into a release version of the software.

3-input adaptive loop filter (Panasonic). This tool had been integrated into the 0.8panasonic branch but had not yet been merged into a release version of the software.
In addition to these missing tools, the following code contributions had not been merged
into a release version of the software due to lack of time:

Bug fixes related to the PU-based merging tool which are present in the 0.8-hhibugfix branch

Unification of PIPE and load balancing which is present in the 0.8-rim-bugfix branch

A number of additional macros were added to the software to improve its configurability.
Requests for new control switches were handled through the public issue tracker set up by
the BBC.
Multiple versions of the TMuC software were produced and announced on the jct-vc email reflector.
A detailed history of changes made to the software can be viewed at
http://hevc.kw.bbc.co.uk/trac/timeline
Released versions of the software are available on the SVN server set up at the following URL:
https://hevc.hhi.fraunhofer.de/svn/svn_TMuCSoftware/tags/{version number} where {version
number} corresponds to one of the versions described below (e.g., 0.7). Intermediate code
submissions can be found on a variety of branches available at
https://hevc.hhi.fraunhofer.de/svn/svn_TMuCSoftware/branches/{branch name} where {branch
name} corresponds to a branch (e.g., 0.8-qualcomm).
The AhG recommended to the JCT-VC to:

Use the TMuC software for the development of TM software

Clean-up the TMuC software to remove parts that are not relevant anymore
 Address remaining critical issues filed on the issue tracker, if any
The discussion of this report included the following remarks:

Improving configurability is desirable

There had been some parallel software development; after which merging was then
necessary – David Flynn particularly helped with that.

Documentation of software configuration was not adequately advanced.
126

It is especially important to keep track of intended normative behavior during the software
development process, and ensure that normative behavior cannot be changed casually.

0.6 version was produced on the planned date, but it did not contain all planned elements
(two missing at that time – one not pursued and the other rescheduled a month later).

Various bugs had been identified within a couple of days and version 0.7 was issued and
recommended for use in testing, and was used to generate the common conditions anchors
made available on BBC hosted ftp site.

Some extremely minor platform dependencies were noticed on Win32 systems – possibly
compiler optimization (e.g., up to 0.05 dB PSNR differences).

Some tools were still missing in 0.8.1 (motion vector scaling not pursued, geometric
partitions just submitted and not yet merged, 3-input loop filter due to branch divergence
issues – now seems to be available).

Substantial new kinds of configurability were added.

It is important to identify and preserve the distinction between what should be considered as
a new proposal and what is just an improvement of configurability, and what should be
considered as a bug fix.

It is possible to see the software integration timeline in the software tracking system.

For the "load balancing" aspect in V2V entropy coding, software had been submitted but not
yet merged.

PU-based merging fixes remained to be finalized.
2.1.1.1.1.1.1.1.4
JCTVC-C004 AHG report: Alternative transforms [R. Cohen, R. Joshi (cochairs)]
Discussions for this AHG were carried out over the main JCT-VC reflector (jct-vc@lists.rwthaachen.de), usually with the term “AHG – Transforms” in the subject line. The discussions were
mainly related to how to test alternative transform-related tools in TE12. An example of this was
testing of MDDT and ROT. After some discussions, it was agreed to test MDDT and ROT under 3
different conditions, namely, MDDT=0 and ROT=0, MDDT=1 and ROT=0, MDDT=0, ROT=1.
There was also some discussion about EDGE_BASED_PREDICTION and MDDT not being
combined in an optimal fashion. A related TE (TE7) explored alternatives to the MDDT transform
currently in the TMuC. A synopsis of TE7-related contributions was also included in this AHG
report. Several cross-verifiers found that that the encoder produced slightly different results
depending upon the platform, in part due to known issues with TMuC 0.7.
The contributions related to alternative transforms that had been registered for this meeting were
noted.
Tool Experiments TE7 and TE12 related to the work of this AHG, and various contributions had
been submitted in relation to these TEs. The AHG report included a summary of TE7 related
contributions.
2.1.1.1.1.1.1.1.5
JCTVC-C005 AHG report: In-loop and post-processing filtering [T.
Yamakage (chair), Y. J. Chiu, M. Karczewicz, M. Narroschke (co-chairs)]
This document summarized the In-loop and post-processing filtering Ad hoc activities between the
2nd JCT-VC meeting in Geneva, Switzerland (21 to 28 July, 2010) and the current 3rd JCT-VC
meeting in Guangzhou, China (7 to 15 October, 2010).
There were a few email exchanges for this period. A related TE (TE10) covered most of mandates.
127
Other than the work on TE10, there were two contributions JCTVC-C085 and JCTVC-C113 that
compared loop and post filtering. Although the examined post filtering methods in JCTVC-C085
may not be the best, the contribution made a comparison (in a fair manner in terms of coding
conditions) for three methods. JCTVC-C113 presents experimental results using a different
platform (i.e. JMKTA 2.7).
There is also a report of signaling the position of an ALF flag in JCTVC-C084 and some
verification results were supposed reported as JCTVC-C243. However, JCTVC-C243 was not
uploaded until substantially after the meeting had ended.
The related contributions were reviewed in the AHG report.
Related to Mandate 1, 2, 3 and 4:
TE10 has conducted experiments for three technical categories (deblocking/debanding filters,
Wiener-based in-loop filters and Image clipping and offset). A summary of experimental results is
reported in JCTVC-C083.
Subtest 2 (Wiener-based in-loop filters) covered the complexity assessment at encoder side as well
as coding efficiency assessment. There are complexity related contributions outside of TE10
activities, JCTVC-C113, JCTVC-C086 and its verification JCTVC-C212. In addition, a
contribution (JCTVC-C173) proposes a coupled control of deblocking filter and Wiener-based filter,
and a contribution JCTVC-C214 for TE12 to study various inputs to Wiener-based filters. There are
contributions JCTVC-C195, JCTVC-C219 and JCTVC-C222 to enhance Wiener-based in-loop
filtering.
Informal subjective viewing for TE10 was conducted during this meeting.
Related to Mandate 5:
There are two contributions JCTVC-C085 and JCTVC-C113 that compare loop and post filtering.
Although the examined post filtering methods in JCTVC-C085 may not be the best, the contribution
made a comparison (in a fair manner in terms of coding conditions) for three methods. JCTVCC113 presents experimental results using a different platform (i.e. JMKTA 2.7).
There was a report of signaling position of ALF_flag JCTVC-C084 and verification results in
JCTVC-C243. However, JCTVC-C243 was uploaded substantially after the meeting had ended.
Subjective viewing was conducted at the current meeting for some proposals (in room 3410 on 4th
floor of building 3). T. Yamakage assisted in coordinating the scheduling of subjective viewing. It
was generally agreed that we should avoid spending substantial time on viewing of cases where no
substantial difference in quality will be perceived.
2.1.1.1.1.1.1.1.6
JCTVC-C006 AHG report: Large block structures [K. Panusopone (chair), M.
Budagavi, W.-J. Han, D. He (vice-chairs)] (missing prior, available first day)
This report summarized the large block structures ad hoc activities between the 2nd JCT-VC
meeting in Geneva, Switzerland (21 to 28 July, 2010) and the current 3rd JCT-VC meeting in
Guangzhou, China (7 to 15 October, 2010).
There were some discussions relating to this ad hoc group on the main JCT-VC reflector during the
interim period. The discussions included reporting of encoder crashes with some CU/PU/TU size
combinations, and how to handle these issues. For example, the horizontal padding and vertical
padding (or --PAD option) should be used to ensure that both frame width and height are a multiple
of LCTU. There were also a lot of activities relating to large block structure occurred in TE 9 and
TE 12; namely, Unit Defintions tests in TE 12 and performance tests for different CTB and TU
sizes in TE 9.
There are some input documents proposing solutions either to handle boundary issues, to allow
flexible block partitioning, or to improve signaling efficiency. Additionally, TE 9 activities related
to the mandates of this AHG. There also had been some initial discussions with the AhG on in-loop
and post-processing filtering for a potential collaboration on evaluating performance of the current
ALF filtering control information at the CU level.
128
2.1.1.1.1.1.1.1.7
JCTVC-C007 AHG report: Memory compression [K. Chono (chair), T.
Chujoh, C. S. Lim (vice-chairs)]
The ad hoc group (AHG) on memory compression was formed during the July 2010 JCT VC
meeting in Geneva. The bulk of this AHG activity has been devoted to discuss and develop an MC
read memory access bandwidth measure module for TMuC software in order to access the results of
TE2: IBDI and Memory Compression. Approximately 70 e-mail messages were exchanged between
the members from 4 parties/companies.
The following items are listed as the benefits of memory compression for video encoders and
decoders:

Reduction in MC read memory access bandwidth;

Reduction in DPB write memory access bandwidth; and
 Graceful power degradation by enabling the low resolution decoding of a bit-stream.
The second item can be reasonably assessed by comparing the compression ratios of memory
compression schemes since the DPB write memory access is not random access. Similarly, the third
item can be reasonably assessed by counting the number of low resolution samples. Therefore,
AHG activity has been devoted to discussing the first item. Indeed, it was mentioned by JCT-VC
chairs during the last JCV-VC meeting that the compression ratios of memory compression
schemes are not a comparable measure since overheads that come from the overlaps of the size of
memory compression unit access in MC read random access are unclear.
Through the discussion on the first item as described in detail below, the AHG developed a module
that computes run-time memory access bandwidth measure for TMuC decoder.
In order to have a comparable measure, some TE2 members suggested developing a module that
computes run-time memory access bandwidth for given the combination of MC block size, the taplength of MC interpolation filter, Motion Vector (MV), sample-bits per pixel, etc. AHG members
agreed to the suggestion and discussed a YUV storage structure in reference picture buffer. By
considering the TMuC design in which the same MC interpolation filter is applied to UV
components and a different MC interpolation filter is applied to Y component than UV's, the
member decided to employ a YUV format comprising of a plane Y component and horizontallyinterleaved UV components.
AHG members developed the module and updated it to account for the overlaps of a memory
compression unit size. Given a combination of MC block size, the tap-length of MC interpolation
filter, MV, sample-bits per pixel, memory compression unit size, the updated module computes the
memory access samples.
Then AHG members discussed a model of compressed reference storage and decided to use a 1-D
storage model. In the 1-D storage model, a memory compressor compresses each of compression
units and stores its compressed versions, whose size is byte-aligned, into 1-D storage in the raster
scan order. A memory decompressor accesses a set of compressed reference data units stored in the
1-D storage and outputs the decompressed compression units (MC reference samples) to a MC
module. Since the 1-D storage access is restricted by memory alignment and memory burst sizes,
the actual memory access size (referred to as run-time memory access bandwidth) is determined on
the basis of the associated memory read overheads as well as the number of compressed reference
data units.
Although run-time memory access bandwidth is varied by memory allocation methods, AHG
members agreed that the raster scan is used as a default setting and that TE2 proponents can report
run-time memory access bandwidth measures obtained by their specific memory allocation methods
in addition to those obtained by the default setting. The reason is that the optimal memory
allocation method depends on a memory compression unit size as well as memory alignment and
memory burst sizes.
After having the 1-D storage model, NEC offered their resources to implement the module and to
integrate it into the top of TMuC0.7 decoder. The module can be turn on and off by setting a macro
129
"#define MC_MEMORY_ACCESS_CALC 1" in TypeDef.h. Panasonic and TI inspected it prior to
the module distribution to TE2 participants. (Note that current implementation supports the runtime
memory access only for Anchor encoding settings, i.e., SIFO with 12tap and AMV res., and DIF
without AMV res. Furthermore there are two restrictions: for a SIFO with 12tap and AMV res. case,
the number of tap-lengths is not switched as the TMuC document specifies; for a DIF case, the
shapes of memory access on diagonal fractional pixel positions are assumed to be rectangular.
However, the above restrictions presumably do not affect the assessment of MC read memory
access bandwidth reduction obtained by memory compression schemes since Anchor and Proposal
MC read memory access bandwidths are computed by the same way.)
A template excel sheet containing the memory access bandwidth measurement of the anchor
streams generated based on JCTVC-B300.doc was created and distributed to TE2 participants along
with the software. Using the module created by NEC, the total MC read memory bandwidth of the
anchor streams for high efficiency and low complexity settings are computed based on
combinations of DDR address and burst alignment bits, (8,8), (32, 64), (64, 128), (128,128), and
(256, 256). Using the same memory bandwidth computation tool provided by NEC, the TE2
participants were requested to provide the total MC read memory bandwidth measurement of the
same anchor streams based on each proponent’s proposed memory compression unit size and
compression ratio.
It was also discussed that having a simple cache model to measure memory bandwidth after the use
of cache will be desirable since many video coding solutions in the marketplace use cache memory.
Video coding solutions that do not use cache are also widely used in the marketplace, so the cache
model needs to have the option of being switched on and off. TI offered their resources to
implement a simple cache model on top of NEC’s memory bandwidth measurement module in
TMuC0.7 decoder. After a quick informal survey of existing L1 and L2 cache architectures on
recent microprocessors and DSPs, a single level cache with following parameters was implemented
in TMuC-0.7 decoder with capacity (cache size): 128 KB, line size: 64 bytes, organization: 4-way
set-associative, replacement strategy: FIFO (a least Recently Used (LRU) strategy is industry
standard, but FIFO is easier to implement), DDR alignment: 64 bits. The aim of the cache model is
to have a simple cache simulation that captures the most relevant parameters for comparing
bandwidth consumed, with and without memory compression, and that is fair to all block structures
used in memory compression. The goal was not to develop a very sophisticated cache model. The
software cache model was released TE2 members after it was inspected by NEC and others.
The memory bandwidth usage is printed in the same format used by NEC's memory bandwidth
measurement module. The code changes are isolated to be in #if
MC_MEMORY_ACCESS_CACHE. MC_MEMORY_ACCESS_CALC needs to be enabled before
cache can be used.
It was noted that lossless aspects of memory compression (e.g entropy coding) may not need to be
standardized.
The recommendations of the AHG were to:

Adopt the developed run-time memory access modules to the main branch of the software
with a macro to be turned on or off by compilation

Study MC memory access bandwidths of the combination of different minimum inter PU
partition sizes and several TMuC MC interpolation schemes

Study essential functionalities of memory compression schemes that need to be standardized
in order to keep the interoperability of encoders and decoders
In the discussion of the report, it was remarked that there may potentially be subjective impairments
associated with memory compression that should be considered.
It was suggested that a reduction of memory access bandwidth on the order of 50% may be a
realistic goal.
130
2.1.1.1.1.1.1.1.8
JCTVC-C008 AHG report: Parallel entropy coding [M. Budagavi, A. Segall
(co-chairs)] (missing prior, available first day)
Parallel entropy coding tools were proposed in several contributions at the April 2010 JCTVC
meeting in response to HEVC CfP. The parallelism proposed in those contributions can be broadly
classified into three categories:

Bin-level parallelism, which parallelizes the binary arithmetic coder (BAC), as in:
o N-bin BAC in JCTVC-A101
o PIPE/V2V in JCTVC-A116 and JCTVC-A120

Syntax element-level parallelism, which parallelizes the BAC, the context modeler, and the
binarizer, as found in syntax element partitioning JCTVC-A101.

Slice-level parallelism, as in:
o Entropy slices, JCTVC-A105
o Interleaved entropy slices, JCTVC-A101
o Sub-streams coding, JCTVC-A114
In the July JCTVC meeting, context processing was identified as an additional bottleneck. The
following context parallelization tools were proposed in the July meeting and a Tool Experiment
TE8 was established to further study this is more detail.

Parallel context processing (PCP)
o Coefficient Sign PCP (JCTVC-B088 Section 3.2)
o Coeff Level BinIdx 0 PCP (JCTVC-B088 Section 3.3)
o Significance map PCP (JCTVC-B088 Section 3.4)
o Coding order for significance map on bin decoding throughput (JCTVC B036
Section 2)
Results on entropy slices were also presented in the July JCTVC meeting, as in entropy slices for
parallel entropy coding (JCTVC-B111).
There was no email activity related to this AhG on the JCTVC reflector. However, several AhG
members interested in this area were actively participating in proposal/cross-verification of parallel
entropy coding within TE8 and TE12.
Thirteen related contributions were noted.
The AhG recommended to review all relevant input contributions.
A question that was discussed was how to measure the amount of parallelism that is achieved.
There are some metrics that have been proposed in contributions, and we should study these to
determine their acceptability.
2.1.1.1.1.1.1.1.9
JCTVC-C009 AHG report: Screen content coding [J. Xu, W. Ding (cochairs)] (missing prior, uploaded first meeting day)
The document summarized the activities of the screen content coding AHG between the 2nd and the
3rd JCT-VC meetings. The mandates of the AHG were to identify and describe use case scenarios
including non-camera-view content; investigate and collect appropriate test material including
computer screen captures, mixed video and graphics, animated graphics, game content, CAD video,
news content, text overlays, etc.; and identify potential needs for action in HEVC standardization.
There was very little activity for this AHG during the interim period, and only one related input
document was submitted for the 2rd meeting (JCTVC-C276).
The topic was discussed and further input was encouraged. MPEG and VCEG also discussed the
topic at their co-located meetings. To further progress on this, JCT-VC issued the following
131
resolution to WG11: "The JCT-VC has received input contributions that indicate a potential need to
consider the coding of "screen content" (text and graphics mixed into a video source, computer
desktop or mobile device display content, scrolling text over video, etc.), which may have different
characteristics than the camera-captured content as used in the HEVC CfP. NBs are requested to
comment about the relevance of such content to the development of the HEVC standard."
2.1.1.1.1.1.1.1.10
JCTVC-C010 AHG report: Complexity assessment [D. Alfonso (chair), J.
Ridge, X. Wen (vice-chairs)]
This report summarized the activities of the Ad Hoc Group on Complexity Assessment between the
2nd JCT-VC meeting held in Geneva in July 2010 and the current meeting in Guangzhou.
There was moderate activity on the e-mail reflector related to Complexity Assessment since the July
meeting. A total of 18 e-mails on the subject were exchanged.

It had been pointed out that the concept of "complexity" still lacks a fully satisfactory
definition.

It had been suggested that computational complexity could be assessed by measuring the
CPU clock cycles using the "rdtsc" instruction, but there was no consensus on the reliability
of this kind of measure.

It had been suggested to use profile tools, e.g. Vtune or oprofile, although no results had
been proposed so far.

Some simulation results showed a discrepancy between computational complexity measures
obtained with the TMuC 0.7 decoder on Linux machines and those from a Windows XP
machine running Cygwin. The reason for that had not been determined yet.
Several Tool Experiments ran between the last meeting and the current meeting that were aimed at
providing both coding efficiency and complexity measurement results. The AHG recommended the
JCT-VC experts to review the complexity results considering possible issues with precision,
accuracy and reproducibility on different systems.
The AHG encouraged further simulation experiments and discussions among JCT-VC experts
toward reaching a consensus on suitable complexity definitions and methods to measure it.
3 TE1: Decoder-side motion vector derivation
3.1.1.1.1.1.1.1.1
JCTVC-C115 TE1: Summary report of TE1 activity [Y.-J. Chiu (Intel), Y.-W.
Huang (MediaTek), M. Wien (RWTH Aachen Univ.), H. Yu (Huawei)]
This contribution summarized the results and various contributions relating to TE1. The
contributions and results in this area are further discussed below in this section.
3.1.1.1.1.1.1.1.2
JCTVC-C097 TE1: Huawei report on DMVD improvements in TMuC [M.
Yang, S. Lin, D. Wang, J. Zhou, H. Yu (Huawei)]
The goal of this Tool Experiment (TE1) was to measure the performance of Decoder-side Motion
Vector Derivation (DMVD) techniques in TMuC software. There are two categories in the TE1, one
is TE.1a (DMVD Inter Prediction) and the other is TE.1b (DMVD Direct Mode). This contribution
continued the work on the Template Matching based DMVD (TM-DMVD) and Spatial and
Temporal Direct Mode (STDM) techniques. STDM and TM-DMVD work together in the TE.1a,
and STDM works alone in the TE.1b. Both techniques had been implemented into the TMuC 0.7
software and tested following the common test conditions in JCTVC-B300 and JCTVC-B301. This
document described these techniques in detail, which included algorithm description, software
implementation, coding performance, and complexity evaluation and analysis. According to the test
results, the average bit rate savings for the random access and low delay configurations were 3.90%
and 2.43%, respectively in TE1.a, and 2.9% and 0.5%, respectively in TE1.b.
132
It was noted that some of the tests for Class A and Class B sequences in the TE1.a random access
configuration were not finished yet, and the authors indicated that final results would be updated
when these became available. As of the time of writing this report, the latest uploaded version of the
contribution still said that the results were not yet finished.
TE1.a

Results for Random Access were not finished yet when reviewed

Current decoder runtime increase to 230% for RA, 300% for LD.

"low complexity" method presented by last meeting not implemented in TMuC yet –
estimate to bring complexity increase to 160%.
TE1.b

Current decoder runtime increase to 118% for RA, 129% for LD.

Encoder runtime increase negligible

AMVP was disabled in all TE1
3.1.1.1.1.1.1.1.3
JCTVC-C128 TE1: Cross-checking of DMVD result from Huawei [Y.-J. Chiu,
L. Xu, W. Zhang, H. Jiang (Intel)]
This contribution was a cross check of the results of JCTVC-C097 on TE1.a, reporting matching
results as far as the results that were available.
3.1.1.1.1.1.1.1.4
JCTVC-C125 TE1: Spatio-temporal direct mode [S. Sekiguchi, Y. Itani
(Mitsubishi Electric)]
This contribution was a cross check of the results of JCTVC-C097 on TE1.b, reporting matching
results.
3.1.1.1.1.1.1.1.5
JCTVC-C138 TE1.a: Implementation report of refinement motion
compensation using DMVD on TMuC [M. Ueda (JVC)]
In TE1 activity, the tool experiment targets at exploring the performance and value of Decoder-side
Motion Vector Derivation (DMVD) and at analyzing the interaction of DMVD with other tools
under consideration for HEVC.
This proposal focused on the tool "Refinement Motion Compensation (RMC) using DMVD" which
belongs to TE1.a: DMVD Inter Prediction. The feature in RMC is that bi-predictive MC blocks can
be obtained by transmitting only one motion vector using inter-reference DMVD.
In this contribution, the implementation of the basic RMC tool on TMuC0.7 software and result of
evaluating the performance were reported. The simulation results reportedly showed that the
proposed technique provides 1.8% BD-rate gain for the random access case and 0.0% for the low
delay case as tested against the TE1 common anchor under TE1 test conditions.

Decoder runtime increased to 138%, encoder runtime to 146% for RA case

The method seems not to be working for bi-predictive extrapolation currently

Results were relative to TMuC anchors without AMVP
3.1.1.1.1.1.1.1.6
JCTVC-C266 TE1: Cross-check result of DMVD proposal from JCTVC-C138
(JVC/Kenwood) [W.-J. Chien, P. Chen, M. Karczewicz (Qualcomm)]
The results of JCTVC-C138 were reportedly confirmed in terms of coding performance and speed.
3.1.1.1.1.1.1.1.7
JCTVC-C124 TE1: Implicit direct vector derivation [Y. Itani, S. Sekiguchi
(Mitsubishi Electric)]
133
This contribution reported results of performance verification on implicit direct vector derivation
technique that was approved as a part of TE1 at the last (Geneva) meeting. The evaluated technique
provides a new design of B-skip and B-direct modes with locally adaptive direct MV derivation
relying on a decoder-side decision. A performance evaluation using TMuC software was conducted
according to the test conditions defined in TE1, and the results reportedly showed 1.8% coding gain
on average for RA case (and up to 3.5% at maximum gain). The increase of decoding time was
about 10% relative to the TE1 anchor, and the encoding time was reportedly almost same as that of
the anchor. This contribution also provided notes on normative text in the form of modification of
the TMuC specification.

For LD case, 0.4% BR reduction on average

Decoder time increases 13% for RA case and 5% for LC case; no significant increase in
encoder run time

Comparison against anchors without motion vector competition and without CU merging
(skip/direct on)
3.1.1.1.1.1.1.1.8
JCTVC-C024 TE1: Cross-check result of DMVD proposal JCTVC-C124
(Mitsubishi) [T. Yokoyama, K. Nakamura (Hitachi)]
This contribution confirmed the results of JCTVC-C124 w.r.t. performance and encoding/decoding
time.
3.1.1.1.1.1.1.1.9
JCTVC-C127 TE1: Report of self derivation of motion estimation
improvement in TMuC [Y.-J. Chiu, L. Xu, W. Zhang, H. Jiang (Intel)]
This contribution presented the techniques of decoder-side motion vector derivation (DMVD) side
to increase the coding efficiency of B pictures. With the motion vector (MV) information self
derived at the video decoder side, the transmission of these MVs from the video encoder side to the
video decoder side is skipped. The proposed self derivation of motion estimation (SDME)
techniques were reported to be friendly to parallel implementation by utilizing on the temporal
correlation among the available samples in the previously-decoded reference pictures, instead of
operating on the previously reconstructed samples of the neighboring area of the current picture,
which poses an inherent decoding ordering causal relationship among decoding blocks of current
picture. Experiments reportedly demonstrated an overall 4.8% BD bit rate improvement for the test
of random access (high efficiency) category under the TE1 test conditions with 73% increase in
encoding time and 150% increase in decoding time increase on TMuC 0.7 Reference Software
platform. A fast version of the direct-candidate simplified SDME reportedly demonstrated an
overall 3.1% BD bit rate improvement with a 26% reduction in encoding time and 18% increase in
decoding time when compared to the TE1 anchor. Another version of the rounded-candidate
simplified SDME had reportedly demonstrated an overall 3.7% BD bit rate improvement with 10%
increase in encoding time and 13% increase in decoding time when compared to the TE1 anchor.
Also, the proposed SDME reportedly demonstrated 2.7% BD bit rate improvement over the anchor
of TE12 under the random access test category (high efficiency) on a shorter version of the common
test video sequences defined for TE12. A description of the normative syntax change in the form of
TMuC specification was included.
In the discussion, it was remarked that the reduction of encoding time is not specific to DMVD, but
could be achieved similarly by appropriate encoder optimization (avoid unnecessary checking of
intra modes).
3.1.1.1.1.1.1.1.10
JCTVC-C098 TE1: Cross-checking of DMVD results from Mitsubishi and
Intel [S. Lin, D. Wang, M. Yang, H. Yu (Huawei)]
The results of JCTVC-C124 w.r.t. performance and encoding/decoding time were reportedly
confirmed.
134
The results of JCTVC-C127 w.r.t. performance and encoding/decoding time (x-check for the fast
modes not done) were reportedly confirmed.
3.1.1.1.1.1.1.1.11
TE1 summary discussion
Aspects remarked upon in the discussion of TE1 included the following:

Current TE1 settings were with AMVP (motion competition) off and CU merging off.

Gains were reported of 4.8% BR reduction average.

New methods were proposed with significantly less decoder complexity than before.

Results should be compared against the default TMuC settings (which probably brings the
gains down by 2-2.5%).

Some of the methods seem to reach a region of more moderate increase in decoding
complexity.
It was suggested to continue investigation and experiment work on this topic.
4 TE2: IBDI and memory compression
4.1
Summary
4.1.1.1.1.1.1.1.1
JCTVC-C076 TE2: Summary of TE2 on IBDI and memory compression [T.
Chujoh (TE coordinator)]
This contribution is a summary of activies and results relating to tool experiment 2: IBDI and
memory compression as described in JCTVC-B302r2. Nine companies and one university had been
registered in TE2 and four proposed tools had been evaluated on the common condition with crossverification. There are two test conditions. One is the case of keeping the bit depth of reference
memory to the input bit depth on IBDI and the other is the case of reduction of reference memory
access bandwidth and size.
A substantial amount of information was in this summary report.
The proposed techniques were summarized as shown in the table below:
Tool Proponent Cross-checker
Method
1
NEC
Panasonic, JVC
DPCM of 8 samples
2
Panasonic TOSHIBA
Hadamard transform and bit-plane coding
3
TI
TOSHIBA
Wavelet transform and DC prediction by 8x8 block
4
TOSHIBA NEC, Yonsei Unv.
Adaptive scaling by 4x4 block and de-scaling by
sample
The summary report included tables illustrating the impact on measures relating to coding
efficiency, complexity, and memory access for HE and LC configurations
It was remarked that for the various memory compression schemes in general, the behavior on static
regions may be a particular concern.
It was also noted that more complex techniques need to be significantly better than 8 bit rounding of
IBDI results in order to to be useful.
4.2
1-D DPCM-based memory compression
4.2.1.1.1.1.1.1.1
JCTVC-C093 TE2: 1-D DPCM-based memory compression [H. Aoki, K.
Chono, K. Senzaki, J. Tajime, Y. Senda (NEC)]
In this contribution, a performance report on the 1-D DPCM-based memory compression method
described in JCTVC-B057 was presented. The method was designed to reduce reference frame
memory size and memory access bandwidth for motion compensation, with relatively low
135
complexity. The method employs 1-D DPCM. Experimental results reportedly showed that average
coding losses are 10.7% for test cases without IBDI and 2.0% for test cases with 12-bit IBDI.
Subjective quality degradation caused by such coding loss was reported to be invisible for most test
cases. Average increase of decoding time is as small as 3.0% and memory bandwidth reduction
ratios are 40.6% for test cases without IBDI and 49.0% for test cases with 12-bit IBDI.
Experimental results on bandwidth reportedly showed that bandwidth reduction depends on
memory architecture and implementation on mapping of compressed image data onto frame
memory.
It was remarked that the 2% loss with IBDI seems to be most of the gain that IBDI provides.
It was remarked that such a proposal should be compared against just rounding to 8 bits when
storing.
JCTVC-C094 and JCTVC-C095 are related proposal documents for a new proposal.
It was asked whether the decoder is required to do clipping after reconstruction of each sample. The
proponent indicated that this is not necessary. Instead, the encoder must perform quantization in a
way that avoids reconstruction overflow.
4.2.1.1.1.1.1.1.2
JCTVC-C074 TE2: Cross-check of memory compression results from NEC
(JCTVC-B057/JCTVC-C093) [C. S. Lim, V. Wahadaniah, S. Naing, H. W. Sun
(Panasonic)] (had been uploaded on the ITU ftp site and was only later put on the
new site)
The purpose of this document was to present a cross-check of memory compression results from
NEC (JCTVC-B057/JCTVC-C093) for Tool Experiment 2 (IBDI and memory compression). The
cross-check was reportedly completed successfully and the results from NEC were reportedly
reproduced without any problem.
4.3
Reference frame compression using image coder
4.3.1.1.1.1.1.1.1
JCTVC-C073 TE2: Reference frame compression using image coder [C. S.
Lim, V. Wahadaniah, S. Naing, H. W. Sun (Panasonic)]
This contribution presented results of Tool Experiment 2 (IBDI and memory compression) from the
reference frame compression scheme proposed by Panasonic (JCTVC-B103). The presented
reference frame compression scheme uses an image coder comprising a transform, scanning, and
bit-plane coding. The presented scheme was implemented on TMuC version 0.7.3 revision 188 and
experiments were conducted using the common test conditions defined in the Tool Experiment 2
document (JCTVC-B302).
The memory compression rate used in the experiments for High Efficiency setting was 12-bit to
5.33-bit (44.4%). In comparison with the TMuC software without reference frame compression,
experiment results reportedly show an average coding efficiency drop of 2.9% for the High
Efficiency, Random Access setting and an average coding efficiency drop of 4.7% for the High
Efficiency, Low Delay setting.
The memory compression rate used in the experiments for Low Complexity setting was 8-bit to
5.33-bit (66.7%). In comparison with the TMuC software without reference frame compression,
experiment results reportedly show an average coding efficiency drop of 2.9% for the Low
Complexity, Random Access setting and an average coding efficiency drop of 4.0% for the Low
Complexity, Low Delay setting. It was noted from the results in all test settings that the larger drop
in performance occurs in the lower resolution images.
It was remarked that the complexity of the proposed reference frame compression method seems
relatively high, and the coding efficiency impact is not negligible.
4.3.1.1.1.1.1.1.2
JCTVC-C178 TE2: Cross-check of memory compression results from
Panasonic (JCTVC-C074) [A. Segall (Sharp)] (missing prior, available first day)
136
This contribution provided a cross-check of memory compression as proposed by NEC and results
provided by Panasonic. This cross-check was performed within the context of TE2. The
experimental results reportedly closely matched the rate-distortion results provided by Panasonic.
The reported maximum Y BD Rate differed from the provided results by at most 0.1%. The crosschecker indicated that he had read through the software and found it to match the proposal
algorithm.
4.4
TI reference frame compression proposal
4.4.1.1.1.1.1.1.1
JCTVC-C060 TE2: TI reference frame compression proposal [M. Zhou, M.
Budagavi (TI)] (late IPR declaration prior)
This document reported testing results of the reference frame compression (RFC) algorithm
proposed by Texas Instruments at the Geneva JCT-VC meeting. The algorithm was integrated into
the TMuC-0.7 reference software and tested with the JCT-VC common testing conditions. For the
High Efficiency (HE) configurations in which IBDI was enabled, the proposed algorithm reportedly
provided 12 bits to 8 bits compression at the cost of an average BD-rate increase of 0.3% for
random access configurations and 1.7% for low-delay configurations, respectively. The average
motion compensation memory bandwidth was reportedly reduced by amounts ranging from 14.7%
to 45.7%. For the low complexity (LC) configurations in which the IBDI is off, the proposed
algorithm provided 8 bits to 4 bits compression, with a reported average BD-rate increase of 2.5%
for random access LC and 3.4% for low-delay LC, respectively. The memory bandwidth saving is
50% if the growing window is employed.
A cross check was reportedly done by Toshiba, but no document was submitted for that, other than
a summary in JCTVC-C076.
Further study was encouraged.
4.5
Adaptive scaling for bit depth compression on IBDI
4.5.1.1.1.1.1.1.1
JCTVC-C075 TE2: Adaptive scaling for bit depth compression on IBDI [T.
Chujoh, T. Yamakage (Toshiba)]
In this contribution, results of an adaptive scaling for bit depth compression on IBDI were reported.
This was one of the proposals in Tool Experiment 2 on IBDI and memory compression. The
purpose of this tool experiment was to improve coding efficiency by increasing internal process of
video codec while minimizing reference frame memory access bandwidth and reducing reference
frame memory access bandwidth and reference frame memory size. This contribution reported an
average 0.8% loss for bit depth compression on IBDI.
By design, it was reported that the method should provide a benefit relative to fixed rounding.
There was reportedly a substantial impact of fixed rounding to 8 bits for the Class E sequences.
A substantial memory access bandwidth reduction was reported.
It was remarked that performance may depend on the applied QP / fidelity.
Another proposal JCTVC-C077 also related to this, as described elsewhere in this report.
4.5.1.1.1.1.1.1.2
JCTVC-C150 TE2: Cross-check of memory compression results from
TOSHIBA (JCTVC-C075) [Y. Choe, S. Hong (Yonsei Univ.), Y-G. Kim (KGIT)]
This document reported cross-checking results of adaptive scaling of bit depth compression for
IBDI (Internal Bit Depth Increase) coding, which was proposed by Toshiba as described in JCTVCC075. The proposal was evaluated based on the TMuC 0.7 with BUGFIX50TMP software and
configuration files which were based on JCTVC-B302r1. The software package was obtained from
Toshiba.
The verification was reported to have been carried out by encoding the test sequences and decoding
the generated bitstreams, collecting PSNR/bit rate numbers and encoding/decoding times and
verifying that the numbers generated matched those in the proponent's spreadsheets.
137
The results were reported to be consistent with the data provided by Toshiba. The contribution did
not indicate whether the algorithm or software were carefully studied.
4.6
Summary conclusions on TE2
It was agreed that further study of the techniques involved in TE2 was desirable. However, it was
agreed not to include these proposed techniques in the test model.
5 TE3: Inter prediction
5.1.1.1.1.1.1.1.1
JCTVC-C220 TE3: Summary report for inter prediction in HEVC [A. Krutz, T.
Sikora (Tech. Univ. Berlin)]
This document summarized the interim activity relating to TE3 – Inter prediction in HEVC
(JCTVC-B303). The TE was separated into four subtests that were Subtest 1 – Warped motion
compensated and second order prediction, Subtest 2 – Flexible motion partitioning, Subtest 3 –
Multi hypothesis inter prediction, and Subtest 4 – Improved inter prediction with enhanced MC
filter. The results that had been generated prior to the writing of this document were summarized.
Due to the short period of time, integration in the new software and running the experiments was
reportedly a big task. Therefore, not all participants who intended to actively contribute to this TE at
the last meeting had shown results for their proposed tools.
5.1.1.1.1.1.1.1.2
JCTVC-C033 TE3: Motion compensation with adaptive warped reference [S.
Park, J. Park, B. Jeon (LG Electronics)]
This document presented experimental results of TE3 subtest 1, which aimed to improve temporal
prediction performance by using warping motion. This tool uses an additional reference picture
which is warped version of the original reference picture based on homography relation between the
current picture and the reference picture. Using a test set different than the typical common
conditions, the experimental results reportedly showed 3.1% overall bit rate savings under ‘low
delay & high efficiency’ condition and 3.5% overall bit rate savings under the ‘random access &
high efficiency’ condition as compared with TMuC 0.7.3.
The results were achieved by adding additional sequences into the common test set. Using only the
test set would have resulted in very small measured gain or even loss. The highest gain for the TS
was Cactus, 3.1% BR savings. The maximum gain reported gain was for "station 2" LD at 41.1%.
The following remarks and observations were made in the discussion of this contribution:

Sequences that are added usually do not have much texture and are easy to code.

With the current results there may be no reason to take action. The sequences where gain is
shown are mostly easy to code (blur, low amount of texture) and were not selected as group
test sequences before because of this reason.

Further study is recommended, but more evidence should be shown with appropriate test
material.
5.1.1.1.1.1.1.1.3
JCTVC-C282 TE3 subtest 1: Cross-check of results from LG [M. Tok, A.
Glantz, A. Krutz, T. Sikora (Tech. Univ. Berlin)]
The purpose of this document was to cross-check JCTVC-C033 on motion compensation with
adaptive warped reference picture as submitted by LG Electronics. The verification task was
reported to have been done successfully, with results that matched those proposed in JCTVC-C033.
A verification of the source code had not been done. LG Electronics had provided Windows 64
binary executable files and encoded bitstreams for the cross check activity.
This sort of "black box" testing does not appear to be a satisfactory cross-check for JCT-VC
purposes.
138
5.1.1.1.1.1.1.1.4
JCTVC-C031 TE3 subtest 2: Report on simplified geometry block
partitioning [E. Francois, D. Thoreau, P. Bordes (Technicolor), L. Guillo (INRIA)]
This document relates to TE3 subset2 "flexible block partitioning". It reported results of Simplified
Geometry adaptive block partitioning (SGEO) as proposed initially in document JCTVC-B085. The
reported BD-rate results were as follows. Compared to the TE03 reference, SGEO average Y BDrate gainswere, respectively, 3.1% for Random Access High Efficiency, and 2.9% for Low Delay
High Efficiency. Compared to GEO, SGEO average BD-rate loss is less than 1% for both Random
Access High Efficiency and Low Delay High Efficiency. SGEO average encoding time reduction
was reported to be around 20-25% compared to GEO. The decoding time was reported to not be
significantly impacted.
The following remarks and observations were made regarding this contribution:

Current implementation does not support CU merge

Encoding time was reported to be around 300% compared to the case of TMuC reference
with AMP off with SGEO.

The 4% gain appears attractive, but the current complexity is too high.
5.1.1.1.1.1.1.1.5
JCTVC-C170 TE3.2: Report on cross-verification of simplified geometry
block partitioning from Technicolor [X. Zheng (HiSilicon), H. Yu (Huawei)]
This contribution confirmed the results of JCTVC-C031 to the extent of the cross-check testing that
was reported (all of class D were said to have been completed and some other sequences). Only
very minute differences were reported, likely due to platform compiler issues. It was reported that
the data of Technicolor’s proposal were matched with the cross-verification data. For
random_access case, when turning off MRG for the anchor, the ratio of encoding time between
Technicolor’s proposal and anchor decrease from 3.2 times to 2.8 times, while the BD-rate saving
has slight decrease. For low delay case, when turning off MRG for anchor, the ratio of encoding
time between Technicolor’s proposal and the anchor decreased from 2.8 times to 2.6 times, while
BD-rate saving had slight increase.
5.1.1.1.1.1.1.1.6
JCTVC-C099 TE3.2: Huawei & HiSilicon report on flexible motion
partitioning coding [X. Zheng (HiSilicon), H. Yu (Huawei)]
This contribution reported Huawei & HiSilicon's implementation status of flexible motion
partitioning on the TMuC platform. Based on the test results achieved so far, the proposed
partitioning method can reportedly bring about 1.6% bit rate saving over the TE3.2 anchor for
random access high efficiency configuration, and 2% bit rate saving for low delay high efficiency
configuration compared against anchors with AMP off. The coding performance can reportedly be
further improved with the addition of more flexible partitioning cases.
The following remarks and observations were noted in discussion:

Encoding time compared to TMuC default with AMP off increased to about 200%.

The current complexity is too high. This needs to be further studied.

Subjective effects should be studied.
5.1.1.1.1.1.1.1.7
JCTVC-C034 TE3 subtest 2: Cross-check of results from Huawei [E.
Francois, P. Bordes (Technicolor)]
The purpose of this document was to report cross-checking of the Flexible Block Partitioning tool
proposed by Huawei/Hisilicon for TE03 subset 2. The source code had been analyzed and the
conformance of the code with the tool description had been verified. The source code had been
compiled successfully using Windows32 with VC 2005. The verification task had reportedly been
done successfully and the results matched those presented in JCTVC-C099.
139
The cross-check of JCTVC-C099 was performed for classes C, D and E. Results were confirmed
with minor mismatches (due to different platforms).
Regarding encoding and decoding time comparisons, the test had also been performed on a limited
set of cases. The time ratios between FMPHV encoding and default TMuC encoding measured were
approximately of 140-150% for both Random Access and Low Delay cases. The time ratios
between FMPHV encoding and “TmuC without AMP” encoding measured were approximately of
230-240% for both Random Access and Low Delay cases. Decoding time was almost not impacted.
5.1.1.1.1.1.1.1.8
JCTVC-C236 Report of complexity analysis of geometric partitioning [K.
Vermeirsch, J. De Cock, R. Van de Walle (Ghent Univ. - IBBT)]
This contribution was a report on a complexity analysis which was carried out at the Multimedia
Lab of Ghent University. In this analysis a geometric partition mode is defined in function of a
partition boundary line having a certain orientation (angle) and offset from the block center. First,
the complexity and compression gains of geometric partitioning were determined as a function of
the number of allowed orientations. Secondly, the analysis investigated the histograms of the
location of the partition boundary within blocks w.r.t. the block center. Based on these observations,
a strategy for complexity reduction can reportedly be put forward. By only testing a subset of
partition modes, the complexity can reportedly be reduced by a factor of 40 while retaining over
50% of the gains of the more exhaustive approach.
The contribution was considered useful to give hints in reducing encoder complexity in geometric
partitioning.
5.1.1.1.1.1.1.1.9
JCTVC-C233 TE3 subtest 3: Local intensity compensation (LIC) for inter
prediction [N. Sprljan, S. Paschalakis, P. Wu (Mitsubishi Electric)]
This contribution presented the status of implementation of the Local Intensity Compensation (LIC)
tool, as a successor to the previously reported Macroblock Weighted Prediction (MBWP), in TMuC
0.7.1. Intensity compensation is done in a form of a locally adapted weighted prediction where
parameters are selected for partitions within a prediction unit. The implementation had only been
completed partially, and the work on its additional features and fine tuning was ongoing. Although
the first results with low complexity configurations show loss of up to 0.7%, it was also observed
that LIC can be beneficial for some sequences. This document also described future work which
addresses improvements to this method.
The contribution was noted – further study of this technique is needed to reach a conclusion about it.
5.1.1.1.1.1.1.1.10
JCTVC-C240 TE3: Cross-check results of local intensity compensation tool
from Mitsubishi Electric [D. Karwowski (Poznan Univ. Tech.)]
The contribution presented experimental results on coding efficiency and computational complexity
for the Local Intensity Compensation (LIC) tool proposed by Mitsubishi Electric. Experiments were
done in the context of Subtest 3: Multi-hypothesis inter prediction defined in Tool Experiment 3:
Inter Prediction in HEVC. The aim of the experiments was to cross check the results obtained by
authors of the method (Mitsubishi Electric).
When reviewed, no presenter was available for this document – it was reviewed and seemed to
confirm results of JCTVC-C233.
5.1.1.1.1.1.1.1.11
JCTVC-C078 TE3 subtest 4: High accuracy interpolation filter (HAIF) [T.
Chujoh, K. Kanou, T. Yamakage (Toshiba)]
In this contribution, experimental results of high accuracy interpolation filters were reported. This
was one of the proposals in subtest 4 of tool experiment 3 on inter prediction. The purpose of this
tool experiment was to improve the performance of inter prediction with enhanced motion
compensation filters. The experimental results reportedly showed that the interpolation filter which
derives each fractional sample position directly and has more steep frequency characteristics with a
long tap length can achieve good performance, and the interpolation filter and the in-loop filter have
140
strong relationship as a filter processing technology. At the nest stage, computational reduction for
the interpolation filter should be discussed in this subtest.
The following remarks were made:

Uses an asymmetric 6-tap filter for ¼ and ¾ positions, and an 8-tap filter for ½ positions and
boundaries.

This also shows again the dependency of interpolation filter type and ALF gains.

Complexity investigation not fully conclusive, e.g. 6/8 filter in some cases has longer
runtime than 8-tap filter.

Further study is needed.
5.1.1.1.1.1.1.1.12
JCTVC-C165 TE3 subset 4: Cross verification on high accuracy interpolation
filter [K. Kondo, T. Suzuki (Sony)] (missing prior, uploaded by first meeting day)
This contribution reported cross check results on an enhanced MC filter, called the High Accuracy
Interpolation Filter (HAIF), in Tool Experiment3 (TE3) subtest4. In the test, the coding
performance and complexity were reportedly measured under common test conditions, which were
defined in JCTVC-B300. The proposed tools had been evaluated using the common conditions.
Detailed results were summarized in an attached Excel sheet.
The contributor received source code of JCTVC-C078 implementation; and confirmed the results
with minor deviations (different platforms).
For low complexity case, HAIF 8 tap reportedly improved BD rate about 4 % in luma. When it is
used together with ALF, the gain is increased to 8 %. It was confirmed that HAIF can improve
coding efficiency using together with ALF. Therefore there is a reported benefit to use two
techniques together.
Encoding time increased about 11 % by HAIF only. Decoding time was increased about 7 %. When
HAIF and ALF are used together, encoding time is increased about 15% and decoding time
increased about 16 %.
5.1.1.1.1.1.1.1.13
JCTVC-C164 TE3 subset 4: Results on bi/single filter switching in FIF [K.
Kondo, T. Suzuki (Sony)]
This contribution reported results on enhanced MC filter, Bi/Single filter switching in fixed
interpolation filter, in Tool Experiment3 (TE3) subtest4. In the test, the coding performance and
complexity were measured under common test conditions, which were defined in JCTVC-B300.
Proposed tools had been evaluated on the common conditions. Detailed results were summarized in
the attached Excel sheet.
The following observations were reported:

Decoder run time is reported not to change

Encoder run time changes to 60% in case of HE

Encoder runtime reduction – certainly due to single-pass operation.

For LC, 6-tap filter shown to give approximately 1% BR reduction compared to DCT-IF 6tap.
Further study was encouraged.
5.1.1.1.1.1.1.1.14
JCTVC-C247 TE3 subset 4: Cross-verification on bi/single filter switching in
FIF [T. Chujoh, K. Kanou, T. Yamakage (Toshiba)] (missing prior, available first
day)
141
Small differences were reported in cross-checking JCTVC-C164. However, the results
werenearlythe same. Different OS platform usage seemed to explain the differences. The
contribution did not describe wether the algorithm and software were carefully studied.
5.1.1.1.1.1.1.1.15
Conclusion on TE3:
The following remarks and suggestions were recorded for TE3.
TE 3.1 (warping) – further study (no experiment establishment needed)
TE 3.2 (geometric partitioning) – make a separate experiment (2 methods plus AMP)
TE 3.3 (illumination) – further study.
TE 3.4 (interpolation filters) – an experiment on interpolation filters will be needed anyway
6 TE4: Variable length coding
6.1.1.1.1.1.1.1.1
JCTVC-C261 TE4: Summary of TE4 on variable length coding [X. Wang
(Qualcomm)]
This document summarized the activities in the Tool Experiment TE4 on Variable Length Coding
(VLC). A group of five companies and universities had registered for participation in TE4. One TE
report, JCTVC-C262, had been submitted on VLC coding for coded block flags.
6.1.1.1.1.1.1.1.2
JCTVC-C262 TE4: Report on VLC for coded block flag [X. Wang, M.
Karczewicz, W.-J. Chien (Qualcomm)]
Document JCTVC-C262 reported TE results based on a coding scheme presented in a previous
proposal JCTVC-B098. In JCTVC-B098, coding schemes for coded block flag both at CU level and
at block level were proposed. Due to the fact that the proposed coded block flag coding at CU level
is very close to the one introduced in "phase 2" VLC integration, JCTVC-C262 further reported
results of coded block flag coding only at block level.
According to the proposal, if a coded block flag for a certain video component at the coding unit
level is not zero and the transform block size is smaller than the coding unit size, the coded block
flags from the four quarter-sized blocks are grouped and coded together. Due to coded block flag at
the coding unit level, the coded block flags for the four quarter-sized blocks cannot all be zero. As a
result there are 15 different combinations in total for these flags. A new VLC table was designed
and tested. It was reported that on average around 0.2% coding gain can be obtained using the three
low complexity coding configurations.
It was noted that the proposal has an interaction with the use of a residual quadtree. The proposal
does not apply if residual quadtree encoding with more than two levels is used.
The proposal only applies to the LCEC entropy coding scheme.
The coding gain is rather minor, but the proposal is a small change.
It was suggested that if the proposal is adopted, it should be possible to switch it off in the software,
to ease future testing.
Decision: It was agreed to adopt the proposal into the TMuC / TM (provided it does not conflict
with other adoptions).
See also the discussion of JCTVC-C277 and JCTVC-C319.
6.1.1.1.1.1.1.1.3
JCTVC-C288 TE4: Verification results of JCTVC-C262 [T. Yamakage
(Toshiba)] (missing prior, available first day)
This contribution shows verification results of JCTVC-C262 (Report on VLC for coded block flag)
for TE4. The contributor started with source code and compiled it themselves to produce
executables, but did not study the software algorithmically. Results were reported to be slightly
different but not significantly, relative to JCTVC-C262. No mismatch between the encoder and
decoder was observed. Encoding and decoding time comparison with the TMuC 0.7.4 anchor
(including macro change) showed approximately no impact.
142
7 TE5: Simplified TMuC intra prediction
7.1.1.1.1.1.1.1.1
JCTVC-C046 TE5: Summary report on simplification of unified intra
prediction [T. K. Tan (NTT DoCoMo), M. Budagavi (TI), J. Lainema (Nokia)]
This is the summary report for the TE5 on Simplification of Unified Intra Prediction. The
description of the experiment can be found in JCTVC-B305r1. This tool experiment was
successfully conducted and verified the simplification proposed in JCTVC-B093 and JCTVC-B118
within the context of the unified intra prediction method proposed in JCTVC-B100. Simulation
results reportedly show that both methods do not result in a significant loss in coding efficiency
compared to the TMuC 0.7.
In the angular intra prediction method, two arrays of reference samples are used, corresponding to a
row of samples lying above the current block to be predicted, and a column of samples lying to the
left of the same block. Given a dominant prediction direction (horizontal or vertical), one of the
reference arrays is defined to be the "main" array and the other array the "side" array. In the
prediction process, only samples from the main array are used for prediction when the angle value
is positive. When the angle value is negative a per-sample test is performed to determine whether
samples from the main or the side array should be used for prediction. Additionally when the side
array is used, the index into the array is obtained by a mapping operation that requires either a
division operation or a table look-up into a sizable table.
JCTVC-B093 simplifies the prediction process when the angle value is negative. The main array is
extended by projecting samples from the side array onto it according to the prediction direction.
This projection entails copying a subset of the samples from the side array into the main array. Note
that in the copy process, the inverse angle may be replaced with a lookup table. During the
prediction process, only the extended main array is used and the same simple linear interpolation
formula is used to predict all samples in the block.
JCTVC-B118 replaces divisions in angular intra prediction with a lookup table and additions.
All six coding conditions cases were tested.
Decision: After discussion of the relevant contributions, it was agreed to adopt these two
simplifications into the TMuC and TM.
7.1.1.1.1.1.1.1.2
JCTVC-C042 TE5: Results for simplification of unified intra prediction [T. K.
Tan, Frank Bossen (NTT DoCoMo)]
This contribution reported simulation results for Tool Experiment 5 as described in JCTVC-B305r1.
Two tools were compared. The first tool, based on JCTVC-B093, simplifies the angular intra
prediction by eliminating the sample-based checking and division steps. The second tool, based on
JCTVC-B118, replaces the division in angular intra prediction with a lookup table and additions.
The results reportedly confirmed that both tools do not change the objective performance of the
angular prediction process in any significant way. However, the simplified angular prediction
(JCTVC-B093) provides the additional advantage of removing bottlenecks in the reference TMuC
algorithm to enable a lower-complexity method for intra prediction for which parallel processing
can be achieved for all prediction angles on typical SIMD architectures.
7.1.1.1.1.1.1.1.3
JCTVC-C061 TE5: TI evaluation of unified intra prediction simplifications [M.
Budagavi (TI)]
In HEVC TMuC-0.7 Intra prediction, pixels in current block are predicted from their neighboring
left (side reference) and top pixels (main reference) using angular intra prediction. Prediction of
pixels from side reference involves division of angle which is computationally complex to
implement. Two proposals were made at the last JCT-VC meeting to simplify intra prediction from
side reference – JCTVC-B118 and JCTVC-B093. This contribution presents results of evaluation of
these two proposals for simplification of intra prediction. The simplifications are achieved using the
following two tools: division elimination (JCTVC-B118, JCTVC-B093) and fractional pixel
143
position calculation optimization (JCTVC-B093). Based on the results presented in this contribution,
the contributor recommended that these two tools be adopted into TMuC and Test Model (TM).
7.1.1.1.1.1.1.1.4
JCTVC-C100 TE5: Huawei & HiSilicon report on verification test results [L.
Liu (HiSilicon), J. Zhou, H. Yu (Huawei)]
From the results it was confirmed that the proposed methods do not result in any significant
difference in coding efficiency compared to the TMuC 0.7.
Both simplified angular prediction (JCTVC-B093) and JCTVC-B118 successfully eliminate the
division using a lookup table.
Simplified angular prediction also has the additional benefit of being more suited for typical SIMD
architectures due to the single reference array and a single equation for the creation of all predicted
samples in the block.
It was recommended that the division elimination and sample creation method of simplified angular
prediction be added to the TMuC and Test Model (TM).
7.1.1.1.1.1.1.1.5
JCTVC-C186 TE5: Results for simplified intra prediction tests by Nokia [J.
Lainema, K. Ugur (Nokia)]
This document reported Nokia results for the TE5 tests and verifies the correctness of the results
submitted by other TE participants.
This contribution was based on a separate independent implementation of the proposal.
8 TE6: Intra prediction
8.1.1.1.1.1.1.1.1
JCTVC-C172 TE6: Summary report of intra prediction improvement tool
experiments [A. Tabatabai (Sony)]
This contribution was a summary of Tool Experiment 6, Intra Prediction Improvements. From a
total of 6 proposed toolsets, 4 had been evaluated by at least one organization. For cross checking,
the recommended test conditions of intra-only were used for both high compression efficiency and
low complexity as defined in the document JCTVC-B306_r3.
The tools tested in this tool experiment were as shown in the table below:
144
Tool
Technology
TE6.a.1 Weighted
Bidirectional
Prediction
(WBP) and
Adaptive Subblock Coding
Order (ASCO)
TE6.a.2 Short distance
intra prediction
Proponent
Toshiba
JCTVC-B042
JCTVC-C079
Cross-checker(s)
Sharp JCTVC-C179
Renesas JCTVC-C187
Huawei & Hisilicon
JCTVC-B040
JCTVC-C101
TE6.a.3 Overlapped
block intra
prediction
TE6.a.4 Line-based intra
prediction
MediaTek
JCTVC-A109
JCTVC-C193
Microsoft
JCTVC-A118
JCTVC-C270
Sony
JCTVC-B109
JCTVC-C169
LGE JCTVC-C028
DOCOMO JCTVC-C045
Ericsson JCTVC-C157
Microsoft JCTVC-C271
Microsoft JCTVC-C272
TE6.b.1 Differential
coding of intra
modes (DCIM)
8.2
Huawei JCTVC-C139
DOCOMO JCTVC-C045
Sharp JCTVC-C175
Panasonic JCTVC-C217
TE6.a.1 Weighted Bidirectional Prediction (WBP) and Adaptive Sub-block
Coding Order (ASCO)
8.2.1.1.1.1.1.1.1
JCTVC-C079 TE6 subset a: Bidirectional intra prediction [T. Shiodera, A.
Tanizawa, T. Chujoh, T. Yamakage (Toshiba)]
This contribution presents the detailed experimental results of Bidirectional Intra Prediction (BIP)
for Tool Experiment 6 on intra prediction improvement. BIP was included in the CfP submission of
JCTVC-A117 and was proposed in the contribution of JCTVC-B042 at JCT-VC Geneva meeting.
In this document, experimental results using TMuC software under the common test conditions
defined by JCTVC-B300 were reported. For the I slice only coding structure, the average BD-rate
gain was 2.1% on low complexity conditions and 2.3% on high efficiency conditions without
increase of the encoding time and the decoding time.
The reason that the encoding time was not increased was noted to be fast mode decision and turning
off some tools that were enabled in the reference configuration – adaptive intra smoothing (AIS)
and edge-based prediction. This feature would ordinarily be expected to increase encoding
complexity.
It was remarked that JCTVC-C207 encoding optimization improves efficiency by 0.8% with only a
fast mode decision technique without adding new tools.
Further study was encouraged.
8.2.1.1.1.1.1.1.2
JCTVC-C179 TE6: Cross-check of bi-directional intra-predition results from
Toshiba [A. Segall (Sharp)] (missing prior, available first day)
This contribution cross-checks the bi-directional intra-prediction results provided by Toshiba. This
cross-check was performed within the context of TE6 for "Intra high-performance" and "Intra lowcomplexity" configuration settings. The experimental results reportedly closely matched the ratedistortion results provided by Toshiba. The maximum Y BD Rate differs from the provided results
by at most 0.1%.
The Toshiba software source code was used, and the software was reportedly reviewed in detail.
145
8.2.1.1.1.1.1.1.3
JCTVC-C187 TE6.a: Cross-check of bidirectional intra prediction (JCTVCB042) [K. Iwata (Renesas)]
This document described results of cross-checking the software containing the tool of bidirectional
intra prediction (BIP) proposed by Toshiba Corporation. The verification task was reported to have
been done successfully and the results matched with proponent’s data. The Toshiba software source
code was used (without studying the software algorithm).
8.3
TE6.a.2 Short distance intra prediction
8.3.1.1.1.1.1.1.1
JCTVC-C101 TE6.a: HiSilicon report on short distance intra prediction [C.
Lai, Y. Lin (HiSilicon)]
This document reported the Tool Experiment TE6.a activities related to the evaluation of short
distance intra prediction (SDP). Based on the test results achieved so far, line based SDP was
reported to provide about 0.9% of bit rate reduction over TMuC anchors for the Intra LC
configuration with comparable encoding and decoding time.
The proposed tool includes two kinds of 1x16, 16x1, 2x8, 8x2 and "resample based" prediction.
The "resample based" prediction was not tested, and the HE configuration context was not tested
due to software development difficulties.
It was remarked that the LC configuration context may not be the best in which to evaluate such a
technique, since it seems to have substantial complexity impact.
Further investigation was encouraged.
8.3.1.1.1.1.1.1.2
JCTVC-C028 TE6: Verification results of Huawei and Hisilicon proposals
(short distance intra prediction using the correlation between lines and pixels) [J.
Kim , B. Jeon (LG Electronics)]
This document reported verification results of JCTVC-C101 on short distance intra prediction. The
proponents provided their source coded which was compared under the condition of intra low
complexity condition, and the results reportedly matched those of JCTVC-C101.
8.3.1.1.1.1.1.1.3
JCTVC-C045 TE6.a: Cross verification for line based intra prediction [T. K.
Tan, Junya Takiue (NTT DoCoMo)]
This contribution reported simulation results and cross verification of subset Tool Experiment 6.a as
described in JCTVC-B306. This contribution documents the findings based on the study and
simulation conducted using the software provided by Huawei and Microsoft.
This contribution was a cross-check verification for both JCTVC-C101 and JCTVC-C270.
Software was received from the proponents, and was somewhat studied to identify the algorithmic
differences. In the case of JCTVC-C101 (JCTVC-B040) verification, the software changes were
more difficult to identify and study, due to not being controlled by a macro switch.
The results reportedly closely matched those reported for JCTVC-C101 and JCTVC-C270.
The contributor remarked that deeper analysis of the changes made in these proposals in regard to
the prediction process and to the difference coding process was desirable.
The contributor also remarked that it would be beneficial to study harmonization of JCTVC-C101
and JCTVC-C270 as well as refinement of each scheme itself.
8.3.1.1.1.1.1.1.4
JCTVC-C157 TE6.a: Cross verification of line-based intra prediction [J.
Samuelsson, R. Sjöberg, K. Andersson (Ericsson)]
This contribution contained cross-checking results of Line-based Intra Prediction (LIP) presented
by Huawei in JCTVC-B040 and later implemented in TMuC v7.0. Simulations run for the Intra
Only Low Delay test case reportedly showed an average BD-rate reduction of 0.9%, matching what
was reported in JTCVC-C101.
The software was studied to determine that it matched the described algorithm.
146
8.3.1.1.1.1.1.1.5
JCTVC-C271 TE6: Cross-verification of HiSilicon's short distance intra
prediction by Microsoft [X. Peng, J. Xu (Microsoft)]
This document summarized the Tool Experiment TE6 activities related to the cross verification of
short distance intra prediction by HiSilicon.
The source code provided by HiSilicon was reported to match with their document description,
except that the resample-based intra prediction was disabled. Only intra prediction by exploring the
correlations between lines was enabled. In prediction, four modes were allowed on 16x16 and 8x8
blocks, which are 1x16, 16x1, 2x8 and 8x2. Each line had its own prediction mode among nine,
which needs to be coded in the bit stream. After prediction, a DCT is applied on the line before it is
quantized.
The results reportedly essentially matched those reported in JTCVC-C101.
8.4
TE6.a.3 Overlapped block intra prediction
8.4.1.1.1.1.1.1.1
JCTVC-C193 Overlapped block intra prediction [K. Zhang, X. Guo, J. An, M.
Guo, Y.-W. Huang, S. Lei (MediaTek)]
This contribution presented the results of overlapped block intra prediction (OBIP) in Tool
Experiment TE6.a. In the proposed OBIP method, a position-dependent weighted sum of several
possible predictors is treated as the final prediction. The weighting values are obtained by offline
training and are fixed during the encoding/decoding procedure. The OBIP method had been
implemented into TMuC software. Experimental results reportedly showed that OBIP can attain an
average bit-rate reduction of 1% and 1.7% for all-intra high efficiency and low complexity,
respectively.
The technique uses a position-dependent weighted combination of three predictors. The weighting
combination is selected from one of nine weighting matrices. Only 4x4 and 8x8 blocks use this (as
implemented).
Edge-based prediction and planar prediction were turned off (which reportedly has essentially
negligible PSNR impact) for reasons of tool incompatibility.
The non-overlapped modes were retained in the scheme as well as the additional OBIP modes.
Further study was proposed, and encouraged.
8.4.1.1.1.1.1.1.2
JCTVC-C272 TE6: Cross-verification of MediaTek's overlapped block intra
prediction by Microsoft [X. Peng, J. Xu (Microsoft)] (missing prior – later provided)
This contribution reported cross-verification test results for OBIP as proposed in JCTVC-C193. The
results reportedly matched those reported in JCTVC-C193.
The software was studied as well as compiled and run.
8.5
TE6.a.4 Line-based intra prediction
8.5.1.1.1.1.1.1.1
JCTVC-C270 TE6: Report of line-based coding [X. Peng, J. Xu, F. Wu
(Microsoft)]
This document reported experimental results of the line-based coding scheme, which was first
presented in JCTVC-A118. The line-based coding takes a line as the reconstruction unit and utilizes
content-adaptive prediction filters for accurate predictions. When integrated into the TMuC0.7
software, it reportedly showed 2.1% and 2.4% bit rate saving on average, under all intra high
efficiency and low complexity conditions, respectively. Up to 5.1% bit rate saving was reportedly
achieved on sequences with rich edges.
Line lengths of 4, 8, and 16 were tested. Syntax is not used to indicate the prediction parameters –
rather, they are derived based on the reconstructed area.
The reported encoding and decoding times were approximately doubled or tripled.
It was remarked that the complexity impact of the proposed scheme is substantial.
Further study was encouraged.
147
Also see remarks above regarding JCTVC-C101.
8.5.1.1.1.1.1.1.2
JCTVC-C045 TE6.a: Cross verification for line based intra prediction [T. K.
Tan, Junya Takiue (NTT DoCoMo)]
This contribution was also described above. It contained cross verification reporting for both
JCTVC-C101 (JCTVC-B040) and JCTVC-C270.
8.5.1.1.1.1.1.1.3
JCTVC-C139 TE6.a: Huawei & HiSilicon report on cross verification of
MSRA's LIC [C. Lai, Y. Lin (HiSilicon), J. Zhou (Huawei)]
This contribution provided a summary of the cross-checks performed by Huawei Technologies and
Hisilicon Technologies on MSRA’s LIC (Line Intra Coding) proposal. The simulation results
reportedly exactly matched the results in terms of bit rate and PSNR as provided by Microsoft.
8.6
TE6.b.1 Differential coding of intra modes (DCIM)
8.6.1.1.1.1.1.1.1
JCTVC-C169 TE6.b: Experiment results of DCIM [E. Maani, L. Dong, W. Liu
(Sony)]
This document presented experimental results of Differential Coding of Intra Modes (DCIM) under
the test conditions defined for Tool Experiment 6.
DCIM uses neighborhood edge estimation to predict Intra prediction direction and differentially
encodes the modes with respect to the predicted direction. This enables a higher precision in the
Intra prediction directions without substantially increasing the mode signaling overhead. An
additional bit is transmitted per block (i.e., PU) to signal to the decoder weather DCIM is used.
MDDT was disabled due to tool implementation compatibility issues, although ROT was used.
Approximately 1.9% overall gain was reported.
For the LC configuration, the work was affected by a bug (bug fix 79) in the underlying TMuC
software. The cross verifications with the bug fix had not yet been completed at the time of
presentation.
Encoding time is approximately doubled (possibly somewhat slowed by increased use of ROT due
to disabling of MDDT); decoding time is reportedly decreased by about 40% (due to disabling of
MDDT).
The contributor recommended further study, which was encouraged.
8.6.1.1.1.1.1.1.2
JCTVC-C175 TE6.b: Cross verification of differential coding of intra modes
(DCIM) [T. Yamamoto, Y. Yasugi (SHARP)] (missing prior, available first day)
This contribution provided cross-check results of the method called Differential Coding of Intra
Modes (DCIM) proposed by Sony in the context of TE6 (TE6.b: Edge based intra prediction). The
experiments defined in TE6 were carried out using the software provided by Sony. The
experimental results reportedly showed that the proposed method provides 1.9% gain in High
Efficiency / Intra Only operation. It was reportedly confirmed that these results matched the results
provided by the proponent.
Testing for the LC configuration context had encountered difficulties as described above (bug fix
79).
It was suggested that perhaps, in general, we should not use the "LC" configurations to test
techniques that require a substantial complexity increase.
8.6.1.1.1.1.1.1.3
JCTVC-C217 TE06.b: Cross verification of differential coding of intra modes
(DCIM) [V. Drugeon (Panasonic)]
This contribution was a cross-check for differential coding of intra modes (DCIM). Partial results
were provided. The results reportedly closely matched those reported by Sony. The software was
studied and determined to match the algorithm description as proposed in July.
148
8.6.1.1.1.1.1.1.4
JCTVC-C176 Analysis and improvement of differential coding of intra modes
[T. Yamamoto, Y. Yasugi (Sharp)]
This contribution related to the DCIM technology tested in TE6.
Edge based intra prediction is a method to utilize the gradient of the neighboring samples to decide
the direction to predict the target partition. Since one of its variants has been incorporated in the
TMuC and the other variant (DCIM: Differential Coding of Intra Modes) was being evaluated in a
TE, the Edge based intra prediction was one of the actively researched areas. In this contribution,
some analysis of DCIM was presented for providing a better understanding of the technology. Also,
some modification and (limited) experimental results were reported – with reported benefits of
approximately 0.2% in HE and 0.7% in LC configurations.
It was asked whether the parsing of the bitstream is dependent on the values of the reconstructed
samples. The contributor said that it does not.
9 TE7: MDDT Simplification
9.1.1.1.1.1.1.1.1
JCTVC-C027 TE7: Summary report for MDDT Simplification [R. Cohen
(Mitsubishi Electric Research Laboratories), C. Yeo (I2R), R. Joshi (Qualcomm)
The purpose of Tool Experiment 7 (TE7) was to explore the performance and value of simplifying
Mode-Dependent Directional Transforms (MDDT) within the HEVC Test Model under
Consideration (TMuC). The five proposals submitted for TE7 along with their corresponding crossverification documents are summarized in the table below.
Proponent Proposal TE7 Cross-Checks
Other
Remarks
Doc.
Doc.
related
if no
contributions match
I2R
B024
C037 C070
C090
C158
C039
(Zhejiang (Peking
(Ericsson)
Univ.)
Univ.)
Huawei
B039
C102 C038
C192
C106
(I2R)
(Yonsei
Univ.,
KGIT)
Toshiba
B042
C080 C035
C103
(MERL) (Huawei)
Beijing
B073
C305
Not
Univ.
available
Tech.
prior to
meeting,
so crosscheckers
were
reassigned.
Peking
B102
C089 C027
C081
C190
Some
Univ.
(LGE)
(Toshiba) (ETRI)
crosscheckers
reported
crashes,
but
otherwise
results
match.
149
The proposed schemes generally use a combination of the following techniques to simplify MDDT:

Exploit symmetry of Intra prediction directions

Reduce the number of different transforms used

Use of lower-complexity transform architectures

Reordering of residuals and reduction of scan orders
 Training of new transforms for use with the above techniques
Abstracts for each of the proposals are as tabulated below:
150
Propone Doc.
nt
I2R
JCTV
CB024
Huawei
JCTV
CB039
Toshiba
JCTV
CB042
Beijing
Univ.
Tech.
JCTV
CB073
Peking
Univ.
JCTV
CB102
Tool Abstract
Mode-Dependent Directional Transform (MDDT) was previously
introduced to improve transform coding of intra-predicted blocks, and is
now a component of the Test Model under Consideration (TMuC). This
proposal presents a simplification of the MDDT scheme that requires
only two transform matrices: a DCT and a derived KLT. The derived 4x4
KLT also has a structure that can be exploited to reduce the operation
count of the transform operation. Derivation of the KLT is based on an
assumed image correlation model. Experimental results reportedly show
that the proposed technique matches the performance of MDDT even
though the approach requires no training, and has significantly lower
computation and storage costs.
Directional textures exist in the residuals after directional intra
predictions. Based on the symmetry of directions, the residual blocks of
all directional prediction modes are classified into several groups, with
the group number being much less than the number of directional
prediction modes. Then, separable KLT is obtained and applied to each
group. Simulation results reportedly show that the proposed SMDDT
(Simplified MDDT) can use fewer transform matrices to achieve the
same or even slightly better coding performance than MDDT in KTA
software.
1 Dimensional Directional Unified Transform (1DDUT) is a spatial
transform scheme for intra coding. 1DDUT has additional two 1D
directional transform matrices (type A and B) compared with AVC. The
transform matrices are pre-determined. Three combinations of these two
1D directional transform matrices are also pre-determined each intra
prediction mode. After intra prediction, the residual signal is transformed
using pre-determined set of 1D directional transform matrices each
prediction mode. And transform coefficients are converted into 1D
signals from 2D signals according to the adaptive coding scan, which is a
same as the scheme of MDDT.
Mode Dependent Directional Transform (MDDT) can improve the
coding efficiency of AVC but it also brings high computation
complexity. In this contribution, a new design for implementing fast
MDDT transform through integer lifting steps is presented. First, MDDT
is approximated by a proper transform matrix that can be implemented
with butterfly-style operation. Then the butterfly-style transform is
factored into a series of integer lifting steps to eliminate the need of
multiplications. Experimental results reportedly show that the proposed
fast MDDT can significantly reduce the computation complexity while
introducing negligible loss in the coding efficiency. Due to the merit of
integer lifting steps, the proposed fast MDDT is reversible and can
reportedly be implemented on hardware easily.
To further simplify the mode-dependent directional transform (MDDT),
a mode-dependent residual reordering (MDRR) method is proposed. In
the proposed MDRR, between the prediction and transform stages, a
certain kind of reordering is implemented on the residual samples for
each mode in spatial. The reordering is manipulated in a way that the
distribution statistics of reordered residual samples present less modedependent characteristic. After the reordering, the nine Intra modes in
151
I4MB and I8MB modes will be assigned into three groups, and only one
transform matrix is assigned for each group. With the proposed MDRR,
the number of transform matrices in MDDT is significantly reduced,
while negligible coding performance difference is observed between the
MDDT and the proposed scheme.
9.1.1.1.1.1.1.1.2
JCTVC-C310 TE7: Summary report for MDDT Simplification [Robert Cohen,
Chuohao Yeo , Rajan Joshi] (late reg.)
This summary report document provided a brief summary and table of documents related to TE7 on
MDDT simplification.
9.2
I2R proposed simplification of MDDT
9.2.1.1.1.1.1.1.1
JCTVC-C037 TE7: Results for mode-dependent fast separable KLT for
block-based intra coding [C. Yeo, Y. H. Tan, Z. Li, S. Rahardja (I2R)]
To simplify the operation of Mode Dependent Directional Transform (MDDT), a Mode-Dependent
Fast Separable KLT for Block-based Intra Coding was previously introduced that requires only two
transform matrices: a "DCT" and a "derived KLT". The derived 4x4 KLT also reportedly has a
structure that can be exploited to reduce the operation count of the transform operation. This
contribution provides test results as stipulated in TE7. Experimental results reportedly show that the
proposed technique matches the performance of MDDT even though the approach requires no
training, and has lower operation count and storage costs.
It was remarked that the "derived KLT" is essentially the same as a form of DST.
The proposal uses two transforms per block size instead of 18. The scheme was applied to two
block sizes: 4x4 and 8x8.
A coding loss of 0.1% was reported overall for HE intra-only conditions relative to the anchor
(which used MDDT).
It was remarked that an adaptive scan order is part of the scheme.
It was asked how much the coding loss would be if using just one transform. We don't know the
answer to that. Roughly 1-2% was suggested as a reasonable guess.
9.2.1.1.1.1.1.1.2
JCTVC-C070 TE7: Cross-check for I2R proposal on mode-dependent fast
separable KLT for block-based intra coding [X. Zhu, L. Yu (Zhejiang Univ.)]
This contribution presented cross check results of I2R proposal on mode-dependent fast separable
KLT for block-based intra coding. Due to lack of time, not all data was checked – only some
randomly selected points had been checked. The sequences checked reportedly had the same PSNR
and bit rate with the corresponding data given by proponent. The source code was studied and
determined to match the proposed algorithm.
9.2.1.1.1.1.1.1.3
JCTVC-C090 TE7: Cross-verification results of mode-dependent fast
separable KLT for block-based intra coding from I2R [X. Zhao, L. Zhang, S. Ma, W.
Gao (Peking Univ.)]
This contribution reported Peking Univ. cross-verification results of the I2R proposal on modedependent fast separable KLT for block-based intra coding in response to the Tool Experiment TE7.
Some minor differences were observed between Peking Univ. and I2R results, which could be
induced by the different platforms used (Peking Univ.: WindowXP, I2R: Linux). However, the
average rate-distortion performance results of I2R and Peking Univ. were reported to be equal.
9.2.1.1.1.1.1.1.4
JCTVC-C158 TE7: Cross verification of MDDT simplification JCTVC-B024 [K.
Andersson (Ericsson)]
This contribution was a cross-verification of the TE7 MDDT Simplification that was proposed in
JCTVC-B024. The PSNRs, bit rates and BD-rate measurements provided by IR2 were matched.
152
The result reportedly shows that the BD bit rate loss is 0.1% and 0.0% for high efficiency
configurations all intra and random access respectively compared to the anchor TMuC 0.7 reference
configuration. The encoding and decoding time were approximately unaffected.
9.2.1.1.1.1.1.1.5
JCTVC-C039 Choice of transforms in MDDT for unified intra prediction [C.
Yeo, Y. H. Tan, Z. Li, S. Rahardja (I2R)]
A simplification of Mode-Dependent Directional Transform (MDDT) was previously proposed in
JCTVC-B024 that requires only two transform matrices: a "DCT" and a "derived KLT". However,
the choice of transforms in JCTVC-B024 was only specified for the 9 intra prediction modes
present in AVC. In this proposal, a choice of transforms is presented for the 34 intra prediction
modes possible in the unified intra prediction in TMuC. As in the earlier proposal, only two
transform matrices are necessary, and no training is required.
This document provided information on how the proposed scheme was harmonized with Unified
Intra Prediction (adding more prediction direction types to the transform selection table) for use in
TE7. The testing performed in TE7 used this scheme.
9.3
Huawei proposed simplification of MDDT
9.3.1.1.1.1.1.1.1
JCTVC-C102 TE7: Symmetry-based simplification of MDDT [H. Yang, J.
Zhou, H. Yu (Huawei)]
In the last meeting, symmetry-based simplification of MDDT (SMDDT) was proposed to reduce the
number of transform matrices in the current MDDT implementation. And the SMDDT technique
was included in TE 7 for further investigation. According to the TE7 specification, SMDDT has
been implemented in the TMuC software and evaluated under the common test conditions since the
last meeting. This document reported the performance of the proposed SMDDT technique and
discussed the complexity.
Five transforms, with flipping and transposing, were used. Fewer scan orders were used in this
proposal than in the I2R proposals. It was claimed that this proposal requires less data storage.
Overall, a 0.1% bit rate increase was reported.
9.3.1.1.1.1.1.1.2
JCTVC-C038 TE7: Cross-check results of MDDT simplification proposal
from Huawei [C. Yeo, Y. H. Tan, Z. Li, S. Rahardja (I2R)]
This contribution provided a summary of the cross-checks performed by the Institute for Infocomm
Research on the Huawei MDDT simplification proposal. The simulation results reportedly closely
match the results provided by Huawei. The software source code was studied algorithmically as
well as executed.
9.3.1.1.1.1.1.1.3
JCTVC-C192 TE7: Cross-verification results of Huawei proposal on
simplified MDDT for intra prediction residual [Y. Choe, J. Kim (Yonsei Univ.), Y-G.
Kim (KGIT)]
This document reported cross-verification results of simplified MDDT (SMDDT) for intra
prediction residual proposed by Huawei, according to the description of TE7. The verification task
had been done successfully and the results reportedly match almost exactly those provided by
Huawei. The algorithm was studied as well as executing the software.
9.3.1.1.1.1.1.1.4
JCTVC-C106 Symmetry-based scan order sharing scheme for MDDT [H.
Yang, J. Zhou, H. Yu (Huawei)]
This proposal was used in the TE for the scheme proposed in JCTVC-C102. However, it can
reportedly also apply in the context of other schemes and was therefore proposed separately in this
contribution.
In this proposal, a symmetry-based scan order (SSO) sharing scheme was proposed to further
simplify the mode-dependent directional transform (MDDT). Based on the symmetry among all
153
intra prediction directions, the number of scan maps can reportedly be reduced by the proposed SSO
while the coding performance is kept unchanged. The proposal suggested that the proposed SSO
scheme be adopted as an improvement of the current MDDT implementation in TMuC.
When implemented in the current MDDT implementation in TMuC, it was reported that this scan
order method results in a memory savings with no performance loss.
Disabling scan order adaptivity reportedly resulted in very little compression performance loss
(0.0% to 0.1%). It was remarked that if the gain from scan order adaptivity is that small, perhaps we
shouldn't use it (in the HHI coefficient coding scheme as well).
The contributor suggested that a harmonization of simplified MDDT schemes may also be feasible.
Further study was encouraged.
9.4
Toshiba proposed simplification of MDDT
9.4.1.1.1.1.1.1.1
JCTVC-C080 TE7: One-dimensional directional unified transform [A.
Tanizawa, J. Yamaguchi, T. Shiodera, T. Chujoh, T. Yamakage (Toshiba)]
This contribution presented experimental results of 1 Dimensional Directional Unified Transform
(1DDUT) for Tool Experiment 7 (MDDT simplification in HEVC). 1DDUT is a method of
reducing the number of transform matrices of MDDT by using a set of pre-defined 1 D transform
matrices depending on the intra prediction direction. 1DDUT was included in the CfP submission of
JCTVC-A117 and the improved method was proposed in the contribution of JCTVC-B042 at JCTVC Geneva meeting.
In this document, the experimental results in TMuC software version 0.7.1 on the high efficiency
conditions based on the common test conditions for both I slice only coding structure and the
random access coding structure defined by TE7 were reported. For I slice only coding structure, the
BD-Bit rate gain compared with the anchor (MDDT on) was reported to be 0.1% on average, and
for the random access coding structure, the BD-Bit rate gain was reported as 0.0% on average.
9.4.1.1.1.1.1.1.2
JCTVC-C035 TE7: Cross-verification of the 1D directional unified transform
in TMuC [R. Cohen, A. Vetro, H. Sun (Mitsubishi)]
This document reported the outcome of Mitsubishi Electric Research Laboratory’s (MERL) crossverification of Toshiba’s 1D Directional Unified Transform (1DDUT) in TMuC. This activity is
part of Tool Experiment 7 on MDDT simplification. A brief analysis of the architecture and
software was followed by a summary of experimental results. Differences (negligible, averaging
0.0%) in bit rates and PSNR values can be accounted for by the difference in simulation platforms.
The conclusion is that the MERL cross-verification results match those produced by Toshiba.
The software was studied and indicated to be straightforward and to reflect what is described in
JCTVC-C080, although a bit different (simpler) than what was proposed in Geneva.
9.4.1.1.1.1.1.1.3
JCTVC-C103 TE7: Report of cross-checking results for Toshiba proposal [H.
Yang, J. Zhou, H. Yu (Huawei)]
This document reported cross checking results for Toshiba’s proposal in TE 7 activities. The
decoded sequences matched the reconstructed sequences. The RD data provided by Toshiba were
reportedly the same as the results obtained from the crosschecking (with only very minor
differences likely due to compiler and platform issues).
9.5
Beijing Univ. Tech. proposed simplification of MDDT
9.5.1.1.1.1.1.1.1
JCTVC-C305 TE7: Results for Simplification of MDDT Transform [W. Ding,
Y. Shi, B. Yin (Beijing Univ. Tech.)] (late)
MDDT has been implemented as a direct matrix multiplication, which results in higher computation
cost than that of an ordinary integer transform. This contribution proposed two symmetric transform
matrices to simplify the MDDT matrices. The proposed symmetric transform matrices can be
154
implemented using a lifting structure. Experimental results reportedly demonstrate that this proposal
reduced the run-time complexity of MDDT with approximately 0.5% coding performance loss.
The operation count was asserted to be lower in this contribution than in the others, by enforcing
symmetric transform matrices, which may be why the loss is a bit larger.
The contribution was submitted late, and no cross check was available.
9.6
Peking Univ. proposed simplification of MDDT
9.6.1.1.1.1.1.1.1
JCTVC-C089 TE7: Results for mode-dependent residual reordering for intra
prediction residual [X. Zhao, L. Zhang, S. Ma, W. Gao (Peking Univ.)]
In this contribution, the Peking Univ. results in response to the Tool Experiment TE7 on MDDT
simplification were presented, with the tool description and simulation results. To simplify the
mode-dependent directional transform (MDDT), a mode-dependent residual reordering (MDRR)
method was proposed. In the proposed MDRR, between the prediction and transform stages, a
mode-dependent reordering is implemented on the residual samples for each mode in the spatial
domain. After the reordering, a single transform matrix is applied. In all-intra HE configuration, a
0.3% average loss was reported.
9.6.1.1.1.1.1.1.2
JCTVC-C027 TE7: Verification results of Peking Univ. proposal (modedependent residual reordering for intra prediction residual) [J. Kim, B. Jeon (LG
Electronics)]
This document reported verification results of JCTVC-C089 on mode-dependent residual reordering
for intra prediction residual, submitted by Peking University. The proponents provide their source
code for cross-checking.
For HE intra, the simulation succeeded on all platforms of Window7 32bit and WindowXP 64bit
and reportedly produced the same results as what the proponent provided. For HE random access,
the simulation succeeded and reportedly produced the same results as what the proponent provided
when the same platform was used as what the proponent used.
An accompanying spreadsheet shows different results for V-PSNR for
BasketballDrive_1920x1080_QP22 but it was reported to seem to be caused by minor rounding
issues. The results of HE random access coding on Windows 7 32bit and on WindowXP 64bit have
differences for some sequences such as BQsquare_416x240. An encoder crash occurred for some
sequences such as Racehorse_832x480_QP37 on Windows XP 64bit.
The software was not studied – just compiled and run.
9.6.1.1.1.1.1.1.3
JCTVC-C081 TE7: Cross-verification of mode dependent residual reordering
(Peking Univ.) [A. Tanizawa, J. Yamaguchi (Toshiba)]
This contribution contained cross-checking of software containing the tool "Mode-dependent
residual reordering (MDRR)" of JCTVC-B102 proposed by Peking University. The verification
task was reported to have been completed successfully and the results reportedly matched those
reported in JCTVC-C089. The encoded images matched those of the decoded ones, and the PSNR
and bit rate values were reportedly identical to the results provided by Peking University except for
one case. In the case of RaceHorses (QP=37), an encoder crash happened. However, the contributor
suggested that this might be caused by a bug in the memory access in TMuC version 0.7.
The software was not studied – just compiled and run.
9.6.1.1.1.1.1.1.4
JCTVC-C190 TE7: Cross-check result of Peking Univ.'s proposal (JCTVCC089) [J. Kim, S.-C. Lim, H. Y. Kim, H. Lee, J. S. Choi (ETRI)]
This document presented verification results of JCTVC-C089 proposed by Peking University. Some
encoder crashes were experienced. When the encoder did not crash, matching results were obtained.
The cross-checker verified that no unrelated areas of the software that had been modified, but did
not closely study exactly what the proposal part of the software was doing algorithmically.
155
10 TE8: Parallel entropy coding
10.1.1.1.1.1.1.1.1
JCTVC-C223 TE8 report [G. Martin-Cocher (RIM), M. Budagavi (TI)]
This contribution was a summary of tool experiment 8, Parallel Entropy Coding. Five proposed
tools had been evaluated using the conditions defined in document JCTVC-B308r1. One of the test
conditions defined in JVCTV-B308 was an optional hardware test. The hardware cross-verifications
had not been performed.
In this TE, five tools had been evaluated according to the conditions defined in JCVTC-B308r1 and
their performances had reportedly been verified.
The TE participants, as a group, recommended that the following two tools be adopted into TM and
TMuC:

Coefficient Sign PCP (JCTVC-B088 Section 3.2)
 Coeff Level BinIdx 0 PCP (JCTVC-B088 Section 3.3)
Decision: This was Agreed.
They also recommended that a core/tool experiment be started on the following two tools for
improving context processing of significance map, since interaction with
HHI_TRANSFORM_CODING needs to be studied more carefully:

Significance map PCP (JCTVC-B088 Section 3.4)

Coding order for significance map on bin decoding throughput( JCTVC B036 Section 2)
10.2
Coefficient Sign PCP, Coeff Level BinIdx 0 PCP, and significance map PCP
10.2.1.1.1.1.1.1.1
JCTVC-C062 TE8: TI parallel context processing (PCP) proposal [M.
Budagavi (TI)]
Context-Adaptive Binary Arithmetic Coding (CABAC) is one of two entropy engines used by the
AVC video coding standard. The processing in the CABAC engine is highly serial in nature.
Consequently, in order to decode high bit rate video bit-streams in real-time, the CABAC engine
needs to be run at extremely high frequencies which consumes a significant amount of power and in
the worst case may not be feasible. Techniques to parallelize CABAC can be broadly classified into
three categories: bin-level parallelism, syntax element-level parallelism, and slice-level parallelism.
Bin-level parallelism techniques such as NBAC/PIPE/V2V parallelize binary arithmetic coder
(BAC) of CABAC. However, due to serial bottlenecks in context processing, there is limited overall
throughput improvement in the entropy coder. To address this issue, several techniques that
parallelize context processing were advocated. The following three techniques for parallelization of
context processing (PCP) were presented at the last JCTVC meeting in JCTVC-B088:

Coefficient Sign PCP (JCTVC-B088 Section 3.2)

Coeff Level BinIdx 0 PCP (JCTVC-B088 Section 3.3)
 Significance map PCP (JCTVC-B088 Section 3.4)
The first two of these design elements were reported to have no impact on coding efficiency.
Further study was suggested for the third topic.
10.2.1.1.1.1.1.1.2
JCTVC-C141 TE8: Crosscheck on TI's proposal of parallel context
processing by MediaTek [Y.-L. Chang, Y.-W. Huang, S. Lei (MediaTek)]
This document reported cross-check results for JCTVC-B088 on parallelization of context level
processing submitted by Texas Instruments (TI). The first two tools listed above were checked. The
software source code for the cross checking was obtained from TI, and was studied and it was
concluded that it algorithmically matched the proposed technology design. The verification task
156
was reportedly completed successfully and the results reportedly matched those provided by TI
exactly.
10.2.1.1.1.1.1.1.3
JCTVC-C245 TE8: Cross verification of TI-PCP proposal for significance
map [J. Zan, J. Meng, M. T. Islam, D. He (RIM)]
This contribution reported the results of cross-checking activities performed by RIM on TI parallel
context model proposal on significance map (described in JCTVC-B088 Section 3.4). It was
reported that the software was studied and determined to match the TI proposal, and the results of
the experiments matched those reported by TI.
10.3
Coding order for significance map on bin decoding throughput (JCTVC B036
Section 2) and V2V coding tree (JCTVC-B034)
10.3.1.1.1.1.1.1.1
JCTVC-C249 TE8: Reports on V2V coding and context modeling by RIM [J.
Zan, G. Korodi, J. Meng, M. T. Islam, D. He (RIM)]
This contribution discussed the coding order for the significance map on bin decoding throughput
(JCTVC B036 Section 2) and the V2V coding tree (JCTVC-B034)
This contribution reported experiment results on V2V codes and modified context processing as
described in JCTVC-B308 and proposed by the contributor.
According to the plans established in JCTVC-B308, the following tools were tested:

V2V codes proposed by RIM
 Improved context processing in HEVC
Specifically, the following three tests were run.

Bernoulli tests of RIM V2V codes against PIPE V2V codes

Modified context processing in TMuC0.7 against the high efficiency anchors as defined in
JCTVC-B300.
 RIM V2V codes in TMuC0.7 against high efficiency anchors as defined in JCTVC-B300.
It was remarked that Huffman coding can be sped up by various techniques – when the VLC table is
short, the codewords can be concatenated to create a larger table that requires fewer data fetches to
process.
Generally speaking, the RIM proposed code tables are larger than the HHI proposed code tables and
they capture more source bits per codeword. It was noted that this causes extra latency/buffering
and affects the overhead for flushing out the partially filled codewords when flushing is necessary.
In terms of overall coding efficiency for coding video data, the HHI tables seem very slightly better
for predictive coding cases (due to needing fewer bits for flushing at the end of the codestream) and
the RIM table seem very slightly better for intra (due to better fitting to entropy due to using larger
tables).
Significant overlap with JCTVC-C134 was noted.
In regard to the proposed modified context processing, the motivation is to increase how often the
same context is used for the decoding of consecutive bins, so that multiple bins could be processed
at once through the finite state machine. For the modified proposal, a 0.0% to 0.1% degradation of
coding efficiency was reported with a 1.4x to 2.1x effect on the "throughput" for the part of the
bitstream that is devoted to these symbols. (See the contribution and JCTVC-C063 for the definition
of throughput.)
10.3.1.1.1.1.1.1.2
JCTVC-C134 Comments on V2V coding for TM/TMuC [D. He, G. Korodi, P.
Imthurn, J. Jamias, D. O'Loughlin, G. Martin-Cocher (RIM)]
This document discussed V2V coding. It was asserted that the design of a V2V tree directly impacts,
not only its compression performance, but its throughput, in both software and hardware
157
implementations. The contributor recommended further evaluation of V2V codes by measuring
their compression performance, encoding and decoding throughput, sizes, and power consumption
in hardware.
The contribution discussed the "Codeset 1" and "Codeset 2" code tables currently in the TMuC.
In Bernoulli tests using the 12 probabilities used in the design of the V2V codes, CABAC and the
two codesets were compared. It was remarked that this penalizes CABAC in the comparison, since
CABAC was not customized for those specific probabilities.
It was noted that the Bernoulli tests do not include the work needed to select which V2V code to
apply in the V2V cases, which is another type of apparent bias against CABAC in the test.
The "codeset 1" tables are generally somewhat smaller, and (perhaps as a result) have slightly less
coding efficiency than the "codeset 2" tables (on the specific tested probabilities) – although both
have very little excess entropy (0.15% versus 0.12%).
It was remarked that some assumptions may be built into this analysis about how the VLC would be
implemented that may not reflect the various ways that optimized encoder and decoders would
perform.
It was suggested that some measure such as average bits per look-up operation may be beneficial to
collect in experiments.
It was noted that some of the bit rates discussed in the contribution were very high.
It was suggested that some aspects of power consumption may not be captured in the provided
power analysis.
It was remarked that area versus power, for example, has a different priority balance in different
applications.
The contributor asserted that using a code that has a small number of distinct codeword lengths is
desirable.
A participant remarked that the "codeset 2" tables have reduced coding efficiency (approx 0.5%)
under some coding conditions in video test classes that result in the use of small slices, due to
needing increased overhead bits for flushing the codewords at the end of the slice.
10.3.1.1.1.1.1.1.3
JCTVC-C063 TE8: Evaluation of RIM parallel context processing (PCP)
proposal [V. Sze, M. Budagavi (TI)]
The coding efficiency loss of RIM PCP was reported to have been evaluated to be between 0.0 to
0.1% using TMuC-0.7 with QC_MDDT disabled. The implemented RIM PCP includes reordering
of significant_coeff_flag and last_significant_coeff_flag by context. The update skipping mentioned
in the original proposal was not included in the code provided for evaluation; the context updates
are still being done serially. The software reports RIM PCP throughput impact to be between 1.4 to
1.9 for significant_coeff_flag amd 1.5 to 2.1 for last_significant_coeff_flag. Test results were
reported to have been verified to mostly match with results provided by RIM.
The contributor indicated that using the HHI transform coefficient coding proposal may make it
more difficult to take advantage of the throughput increase opportunity.
The contributor indicated that they had studied the software algorithm and that it functioned
according to the proposal. It was remarked that some of the computed throughput increase might be
difficult to achieve in practice.
Further study of the context modeling aspect was encouraged.
10.3.1.1.1.1.1.1.4
JCTVC-C064 TE8: Evaluation of RIM-V2V entropy coding [V. Sze, M.
Budagavi (TI)]
This document reported on the Bernoulli test used to compare the coding efficiency, encoding and
decoding throughput of various proposed entropy coding approaches – namely RIM-V2V, HHIPIPE and BAC. It also provided additional complexity comparison of the RIM-V2V tables versus
the HHI-PIPE tables. To measure the coding efficiency, Bernoulli sequences of one billion samples
were generated for 12 distinct probabilities. Each of the entropy coding approaches was used to
158
encode the same bin sequence, and the total encoded bits were measured. While the RIM and HHI
proposed tables were designed with the specific 12 probabilities in mind, the BAC used existing
states, nearest to the 12 probabilities (i.e. BAC was not customized for the probabilities).
Furthermore, the tests were not performed in the TMuC context with actual video data. Thus, it was
reported to be difficult to draw a conclusion on coding efficiency. The encoding and decoding
throughput were estimated based on simulation time. However, since software code for each
approach was written in a very different style/structure, it was reported that no conclusion can be
drawn on throughput based on the proposed use of simulation time. Further investigation was
recommended. Test results given by the Bernoulli test software had been verified to match with
results provided by RIM.
The TMuC software has 24 tables from RIM and 12 tables from HHI, and the TMuC document has
the 12 tables from HHI, and the 12 tables now proposed by RIM were first documented in a TE plan
document.
Similar measurements from the Bernoulli testing were reported in this contribution as what was
reported by RIM.
The tested software was reported to have had differences in coding style and degree of optimization.
The contributor indicated that it was difficult to determine whether this affected the analysis results.
The contribution indicated that the extra buffering needed by PIPE (relative to CABAC) was an
issue.
Area estimates for PIPE were 15x higher than for CABAC.
Further study of the various issues was suggested.
10.3.1.1.1.1.1.1.5
JCTVC-C280 TE8: Crosscheck result of the transcoder for JCTVC-B034
source selection for V2V entropy coding in HEVC [Y.-L. Chang, Y.-W. Huang, S.
Lei (MediaTek)]
This document reported cross-check activity for the transcoder part of JCTVC-B034 for V2V
entropy coding as proposed by Research In Motion (RIM). The verification task was reported to
have been done successfully and the results reportedly matched those provided by RIM. There were
some small variations (up to -0.12%) reported in random access conditions.
The source code provided by RIM was studied to verify the proposed algorithmic functionality.
10.3.1.1.1.1.1.1.6
JCTVC-C313 TE8: Crosscheck Result of the Transcoder of JCTVC-B034 for
V2V Entropy Coding in HEVC [Y. Zheng , R. Joshi , M. Coban , M. Karczewicz
(Qualcomm)] (late reg.)
The purpose of this late information document was to crosscheck the transcoder implementation of
JCTVC-B034 on source selection for V2V entropy coding previoiusly submitted by Research In
Motion (RIM). The verification task had reportedly been completed successfully and the results
were reported to closely match those provided by RIM.
11 TE9: Large block structures
11.1.1.1.1.1.1.1.1
JCTVC-C067 TE9: Report on large block structure testing [J. Kim, M. Kim
(KAIST), J. Kim, H.-Y. Kim (ETRI), K. Sato (SONY), X. Shen, L. Yu (Zhejiang
Univ.), K. Choi, E. S. Jang (Hanyang Univ.), B. Bross (HHI), W.-J. Han (Samsung),
J.-K. Jo, S.-N. Park, D. G. Sim, S.-J. Oh (Kwangwoon Univ.)]
Summary, remarks, and observations:

Confirms that restrictions CU 64 and TU 32 are reasonable (for current test set)

Extensive statistic analysis on usage of various block sizes vs. QP/rates
11.1.1.1.1.1.1.1.2
JCTVC-C140 TE9-2: Report on performance tests for different sets of PU
modes by Fraunhofer HHI [B. Bross (Fraunhofer HHI)]
159
Summary, remarks, and observations:

Relates to JCTVC-C200 – TUs spanning PU boundaries has a little advantage

Using merge syntax for rectangular shapes does not produce loss
11.1.1.1.1.1.1.1.3
JCTVC-C198 TE9: Simulation results for various max. number of transform
quadtree depth [J. Chen, T. Lee, W.-J. Han (Samsung)]
(The version of JCTVC-C198 prior to Saturday had no results in it.)
Summary, remarks, and observations:
Advantage of RQT in intra comes by allowing more prediction modes
New suggestion is to augment quadtreetulog2maxsize & -minsize by parameter tumaxdepth – none
of these parameters is a syntax element to be conveyed to the decoder – would be fixed in the
standard (or in a profile)

Advantage of the new restriction is saving of quadtree signaling bits

Complexity in intra encoding is decreased to 75% compared to RQT but still increased to
approx. 180% compared to the less flexible 2-level method (RQT off in TMuC) vs. BR gain
1.2%

Decoder complexity is only marginally affected

Encoders could decide to restrict the number of levels (with penalty on performance) and
run faster

New version of the document that would contain information about using various numbers
of levels was not available yet when the document was first discussed

Reported in revisit:
o Results relative to RQT on (which is about 1.2% BR decrease and 250% encoder
time)
o Depth 3 HE +0.1% , 75% encoder time; depth 2 +0.3, 60% encoder time
o Depth 3 LC +0.3%, 82% encoder time, depth 2 0.3, 58% encoder time

Refer to JCTVC-C311/JCTVC-C312 which implements a fast encoder
11.1.1.1.1.1.1.1.4
JCTVC-C284 TE9-2.1 Report on forced RQT split according to PUs [A.
Segall (Sharp), J. Xu (Microsoft)] (missing prior, provided by third meeting day)
This contribution confirmed the results of JCTVC-C198 (without an exact match due to different
platform usage).
12 TE10: In-loop filtering
12.1.1.1.1.1.1.1.1
JCTVC-C083 TE10: Summary of TE10 on in-loop filtering [K. Chono, T.
Yamakage (TE coordinators)]
In this tool experiment, in-loop filtering had been tested for the following functionalities:

Deblocking/debanding filters (four proposals)

Wiener-based in-loop filters (four proposals)
 Image clipping and offset (two proposals)
As for the details of each proposal, please refer the documents described below.
160
Seven companies and one university participated as proponents and three companies and one
university participated as cross checkers.
Three post filters were investigated. The best of these (in terms of RD perf.) was selected. The
difference of the anchor against post filter from JCTVC-C113 was reported as 0.9/0.3 on average.
12.1.1.1.1.1.1.1.2
JCTVC-C130 TE10 subtest 1: Results of intra deblocking filter testing by
SKKU/SKT [J. Yang, K. Won, B. Jeon (SKKU), J. Lim, J. Song (SKT)]
This document reported TE 10 test results of intra deblocking filter (DF) which was proposed in
JCTVC-B075. The proposed method employs exactly the same DF scheme as that in the TMuC,
except filter strength control for intra blocks. It is implemented on TMuC 0.7 S/W, and its
experimental results under the TE10 subtest 1 procedure reportedly show BDBR gain of 0.6% (intra
only), 0.3% (random access), and 0.1% (low delay). Encoding time is reportedly increased less than
1% and decoding time is reportedly the same as the TMuC0.7. The proposed method reportedly
provides improved visual quality.
It was remarked that this proposed modification compromises preservation of edges vs. removal of
artifacts, and it is difficult to say whether one or the other is better.
12.1.1.1.1.1.1.1.3
JCTVC-C161 TE10: Cross-verification result of SKKU/SKT deblocking filter
[K. Chono, K. Senzaki, H. Aoki, J. Tajime, Y. Senda (NEC)]
Summary, remarks, and observations:

The cross-check found no problem with the software, results were reported to be accurate.

Visual inspection did not unveil relevant improvements or degradations.

Method requires to store the directional mode (slight additional complexity).
12.1.1.1.1.1.1.1.4
JCTVC-C142 TE10 subtest 1: Improved deblocking filter [J. An, K. Zhang, Y.
Gao, X. Guo, C.-M. Fu, Y.-W. Huang, S. Lei (MediaTek)]
This contribution describes MediaTek’s proposal of a modified deblocking filter based on the AVC
deblocking filter (DF). The modified deblocking was asserted to improve intra block boundaries by
using intra mode dependent deblocking filter (MDDF) and modified boundary strength and
thresholds (MBST). The modified deblocking filter reportedly outperforms the TMuC0.7 anchor in
the objective measure RD sense in all six test configurations. Bit rate reductions were reported as
1.6%, 1.2%, 1.3%, 1.4%, 0.6%, and 0.5% for intra, random access, low delay, intra LC, random
access LC, and low delay LC, respectively. Though the visual differences between the IDF and
TMuC0.7 anchor are small, some improvements could reportedly be observed, especially in the
image areas with obvious texture directions. The encoding time reportedly decreases by 1.8%, and
the decoding time increases by 6%, compared to TMuC0.7.
It was remarked that the report about computation time may not be completely reliable; 3% would
mean doubling the complexity of deblocking (according to JCTVC-C147). See also cross-check
below.
Further work was encouraged, but it was remarked that complexity should not increase significantly.
12.1.1.1.1.1.1.1.5
JCTVC-C156 TE10: Cross verification of Mediatek's deblocking filter by
Ericsson [A. Norkin, R. Sjöberg, K. Andersson (Ericsson)]
This contribution reported cross-check activity summarized as follows:

The PSNR and BD rate match

The source code does not exactly match the description of JCTVC-B077

The decoder runtime is approx. 7.5x that of current TMuC deblocking
161

Directional filtering seemed to help on the RaceHorses sequence, but it is not as clear
whether it is better than the TMuC deblocking on other sequences
12.1.1.1.1.1.1.1.6
JCTVC-C273 TE10 subset 1: Report of content-adaptive de-blocking [Z.
Xiong, X. Sun, J. Xu (Microsoft)]
This document presented a content-adaptive deblocking scheme to improve the visual quality of
block-based compressed video. For large smooth regions with small variation, an extra smoothing
deblocking mode was introduced to suppress the blocking artifacts. Experimental results reportedly
demonstrate that the proposed method can improve the visual quality while maintaining the
objective fidelity of heavily compressed video.
The following discussion remarks were recorded:

Does not affect complexity compared to current deblocking

From subjective viewing, flickering was observed (in the all-intra case) which may be
caused by the stronger filtering; no significant difference overall.
12.1.1.1.1.1.1.1.7
JCTVC-C131 TE10 subtest 1: Cross-verification result of Microsoft
deblocking filter proposal by SKKU/SKT [J. Yang, K. Won, B. Jeon (SKKU), J. Lim,
J. Song (SKT)]
This contribution confirmed PSNR and bit rate results of JCTVC-C273, reporting essentially the
same encoding and decoding time.
12.1.1.1.1.1.1.1.8
JCTVC-C091 TE10: Conditional joint deblocking-debanding filter [K. Chono,
K. Senzaki, H. Aoki, J. Tajime, Y. Senda (NEC)]
This contribution presented a performance report on a conditional joint deblocking-debanding filter
described in JCTVC-B056. The conditional joint deblocking-debanding filter injects small pseudo
noise into images around an intra-block boundary which is supposed to be a part of areas of low
detail with subtle changes in pixel intensity. The injected small pseudo noise masks banding noise
around intra-block boundaries while keeping the rate-distortion performance with the help of the
subsequent Wiener filter. Furthermore, when IBDI is used, it also introduces a random quantization
effect on the IBDI output image. The conditional joint deblocking-debanding filter was asserted to
significantly improve visual quality of decoded video, especially when IBDI is used. Simulation
results reportedly verify that the conditional joint deblocking-debanding filter reduces blocking and
banding noise with a negligible impact on the video coding efficiency of the TMuC. It was
proposed that the conditional joint deblocking-debanding filter be adopted in the TMuC software
and that its performance be further evaluated with toolsets specified in HEVC Test Model.
The following discussion remarks were recorded:

For low-complexity, no Wiener filter is operated; therefore PSNR decreases, affecting BD
rate increase by 1.2% in all-intra (less in inter)

Does the deterministic pseudo noise introduce visible temporal artifacts? Could it produce
artifacts like e.g. mosquito noise in certain test sequences?

Is it not possible to do the dithering as postfiltering?

Should it be better after the Wiener filter?

Various solutions should be studied.

This would have quite significant implication on the amount of standard text (e.g. would be
necessary to describe tables of pseudo noise, exactly describe the rules of insertion, etc.)
Further experimentation was recommended in particular on a) the best position in the loop b)
possible implications on producing artifacts, and c) alternative as post processing.
162
12.1.1.1.1.1.1.1.9
JCTVC-C274 TE10 subset 1: Cross- verification of NEC's joint deblockingdebanding filter by Microsoft [X. Xiong, J. Xu (Microsoft)]
This cross-check report confirmed the RD values of JCTVC-C091, reporting similar
encoding/decoding times. No subjective tests were reported in the contribution.
12.1.1.1.1.1.1.1.10 JCTVC-C143 TE10 subtest 2: Coding unit synchronous picture quadtreebased adaptive loop filter (QALF) [C.-Y. Tsai, C.-M. Fu, C.-Y. Chen, Y.-W. Huang,
S. Lei (MediaTek)]
This contribution described MediaTek’s work on adaptive loop filter (ALF), namely coding unit
synchronous picture quadtree-based adaptive loop filter (CS-PQALF). The proposed method uses
multi-level quadtree partitions to allow local adaptivity for Wiener loop filtering. The partitions
boundaries are aligned with the boundaries of the largest coding units (LCUs). Each partition is a
basic filter unit (FU) and can be enhanced by different Wiener filters to reduce mean square error
between the reconstructed picture and the original picture. In this proposal, the number of encoding
passes is reduced from 16, for the ALF in TMuC, to 2, for the CS-PQALF, while reportedly
maintaining the good coding efficiency performance of the anchor. Simulation results were
compared with the anchor adopted in the subtest 2 of tool experiment 10 (TE 10). The proposed CSPQALF increases 0.1% and 0.3% BD-rates for high efficiency random access (HE-RA)
configuration and high efficiency low delay (HE-LD) configuration, respectively. The encoder
execution times are increased by 1% for both HE-RA and HE-LD. The decoder execution times are
increased by 3% and 1% for HE-RA and HE-LD, respectively. Note that the software execution
time cannot really reflect the reduction of hardware external memory access overhead. Additional
experiments on combining four techniques including controlled clipping (JCTVC-C146), improved
deblocking filter (JCTVC-C142), quadtree-based adaptive offset (JCTVC-147), and CS-PQALF
sreportedly how 2.2% and 2.5% BD-rate reductions for HE-RA and HE-LD, respectively.
The main advantage claimed is reduction of number of coding passes; run time does not really
reflect that (encoding/decoding time very slightly increased), memory bandwidth is the more
critical issue than computation power.
Adaptive filter sizes include square 3x3 ... 9x9, rhombus 5x5 ... 9x9.
In terms of performance (subjective & objective), a negligible difference was reported compared to
TMuC anchors.
Similar reductions appear possible with other methods (see e.g. JCTVC-C082 and JCTVC-C113)
More coefficients must be stored & encoded.
It was asked where would be the best place to encode the filter on/off flag: slice or CU level.
Adaptation of filters was controlled by a quadtree transmitted in slice header (at most 16 partitions
per frame), boundary aligned with CU quadtree.
12.1.1.1.1.1.1.1.11 JCTVC-C229 TE10 subset 2: Cross check result of MediaTek ALF [I. S.
Chong, M. Karczewicz (Qualcomm)]
This contribution reportedly confirmed the results of JCTVC-C143.
12.1.1.1.1.1.1.1.12 JCTVC-C173 TE10 subtest 2: Parallel adaptive loop filter [T. Ikai, T.
Yamamoto (SHARP)]
In this contribution, the Parallel adaptive loop filter technique from JCTVC-B064, which was
implemented into TMuC0.7, was evaluated according to the TE10 common condition. The
proposed technique uses both pre-DF (De-blocking filter) signal and post-DF signal as inputs,
where both inputs’ weights are optimized with Wiener-filter technique. So the whole system can be
seen as one in-loop filter where De-blocking filter and Wiener-based filter are combined. This
combination provides coding efficiency improvement as well as the functionality to process the two
filters in parallel. The experimental results reportedly showed that the proposed technique provides
0.7% / 0.4% / 0.4% bit rate reduction (IntraOnly / RandamAccess / LowDelay) compared to the
anchor, where Sum-modified Laplacian based ALF is used. The complexity of the proposed method
163
was also evaluated and the results reportedly showed that the decoding time is 93% to 96% of the
anchor and the encoding time is almost the same as the anchor (100%).
The following remarks and observations were recorded:

Method: Wiener filter applied to deblock filter input, weighted superposition of de-blocking
and loop filter output.

Filter on/off signaled at CU level, filter parameters (incl. weighting params) signaled at slice
header.

Uses 5x5, 7x7 and 9x9 rhombic filters. Loop filter itself basically identical to TMuC. Also
uses 16 passes.

Slight bit rate decrease compared to anchors.

Encoding time similar to anchor, decoding time slightly less. Avoids pixel-wise operation.

Information: It was tried to apply filtering to de-blocking output as well, but that did not
result in significant gain.

Subjectively, there seemed to be a tendency to look sharper, but sometimes this could
reproduce more coding artifacts.
12.1.1.1.1.1.1.1.13 JCTVC-C145 TE10 subtest 2: Crosscheck on SHARP's proposal of adaptive
loop filter by MediaTek [C.-Y. Tsai, Y.-W. Huang, S. Lei (MediaTek)]
This cross-check report confirmed the reported reduction of JCTVC-C173 in BD rates and enc/dec
time measurements.
12.1.1.1.1.1.1.1.14 JCTVC-C082 TE10 subtest 2: Reduction of number of encoding passes for
quadtree adaptive loop filter (QALF) [T. Yamakage, T. Chujoh, T. Watanabe
(Toshiba)]
This contribution described an encoding technique to reduce the number of encoder passes for
Wiener-based filter design, and detailed experimental results of Quadtree-based Adaptive Loop
Filter (QALF) were reported. This is one of the proposals for Subtest 2 (Wiener-based in-loop
filters) in Tool Experiment 10 (Loop filtering). The number of additional encoder passes compared
to no adaptive loop filter encoding is two by this technique, which is drastically reduced from
QC_ALF adopted in TMuC v0.7, while the loss of the coding efficiency from the TMuC 0.7.1
anchor is 0.5%. A supplemental information to show the coding efficiency loss (0.1%) is also
provided that compares multi-pass QALF and the proposed 2-pass QALF. Note that the additional
passes are not encoding a picture, but filtering a picture that requires less complexity compared to
encoding a picture. This technique is also applicable to the methods that adopt block-based filtering
control.
The following remarks and observations were recorded:

Argument for loop filter vs. post filter: Assurance of the (minimum) quality of the output is
desired by industry.

A slight decrease in decoding time may be caused by the less frequent usage of the loop
filter.

There was discussion about relation between loop filtering and interpolation filtering.

Obviously the best choice of interpolation filter depends on the loop filter used. However
the design of the filter coefficients is not normative.

The contribution shows that it is possible to reduce the number of encoder passes in ALF
coefficient derivation without a large penalty in coding efficiency
164
12.1.1.1.1.1.1.1.15 JCTVC-C144 TE10 subtest 2: Crosscheck on TOSHIBA's proposal of
adaptive loop filter by MediaTek [C.-M. Fu, Y.-W. Huang, S. Lei (MediaTek)]
This cross-check report confirmed BD rate values of JCTVC-C082, but encoding/decoding runtime
measured may be unreliable.
12.1.1.1.1.1.1.1.16 JCTVC-C194 TE10 subtest 2: Cross-check results of JCTVC-C082
[Toshiba] Reduction of number of encoding passes for quadtree-based adaptive
loop filter (QALF) [P. Wu, S. Paschalakis, N. Sprljan (Mitsubishi Electric)]
This cross-check report confirmed BD rate values within a small margin (using a different platform),
with encoding/decoding runtime slightly different.
12.1.1.1.1.1.1.1.17 JCTVC-C071 TE10 subset 2: Complexity analysis on Wiener-based in-loop
filters [L. Wang, L. Yu (Zhejiang Univ.)]
In this contribution, the complexity of Wiener-based adaptive in-loop filter (ALF) algorithms (A121,
B064, B077, A117) in the subset2 of TE10 was analyzed along with their coding efficiencies. The
contribution analyzeed three aspects: i) filter features, ii) encoding time, and iii) decoding time.
The following remarks and observations were recorded:

Only a few frames of the sequences were used to determine encoding and decoding time, so
the numbers may not be reliable. New versions of software which are faster exist in most
cases.

It is not clear which versions of software were used (not distributed by the CE coordinator).
12.1.1.1.1.1.1.1.18
Conclusions on TE10 subtest 1&2:
The following overall remarks and observations were recorded:

Alternative de-blocking proposals do not seem especially interesting at the moment (either
no real difference, or too complex)

Comfort noise insertion for de-banding could be interesting, but needs further study.

Neither subjective evaluation nor objective measures unveiled substantial differences
between the various in-loop filters.

Reduction of number of passes in optimization in ALF appears possible.

There is a clear difference between no filtering and filtering (roughly 5% BR gain, with a
subjectively noticeable benefit)

Post filtering suffers from temporal discontinuity.

As the loop filter has an influence on other elements of the design, it is important to have
one in the TM, but it seems to be less important which one this is. Further experimentation
on improvements should go on anyway.

Regarding the 3-input approach investigated in TE 12 which is another option, subjective
viewing was performed. Otherwise, the level of encoder runtime increase of 25% currently
seems not to be justified vs. 1% additional BR reduction.

After some (informal) subjective viewing, a report was given Monday evening: For one case
(Park Scene QP37) a majority of test persons indicated that they had observed slight
improvement (though there was 2% bit rate reduction by the 3-input method compared to
anchor ALF)
165

In general, those visual tests may have the problem that they compare sequences encoded at
(slightly) different bit rates. This puts a slight disadvantage on methods that have a tendency
decrease the bit rate.
12.1.1.1.1.1.1.1.19 JCTVC-C146 TE10 subtest 3: Controlled clipping [Y.-L. Chang, C.-M. Fu
(MediaTek), A. Segall, Y. Su (Sharp), C.-Y. Chen, Y.-W. Huang, S. Lei
(MediaTek)]
This proposal reported MediaTek and Sharp joint work on controlled clipping. Controlled clipping
describes a process that clips predicted or reconstructed pixel values to minimum and maximum
values that are signaled in the bitstream. The clipping process is applied at four stages, namely postprediction, post-reconstruction, post-deblocking, and post-adaptive loop filter (post-ALF). Results
reportedly show that the proposed controlled clipping achieves average 0.6% and 0.4% BD-rate
reductions for high efficiency random access (HE-RA) and high efficiency low delay (HE-LD)
configurations, respectively. The encoding time measures were reportedly increased by 2% and 0%
for HE-RA and HE-LD, respectively, and the decoder times were reportedly increased by 6% and
2%, respectively.
The following remarks and observations were recorded:

Clipping parameters are sent in picture level and/or slice level.

Comparison against post clipping was intended, but no gain was found compared to that. In
previous implementation (not joined proposal of two companies), gain was found.

Would there be a guarantee that clipping at various places in the loop is better than postclipping? Probably not.

Further study was encouraged.
12.1.1.1.1.1.1.1.20 JCTVC-C147 TE10 subtest 3: Quadtree-based adaptive offset [C.-Ming Fu,
C.-Y. Chen, Y.-W. Huang, S. Lei (MediaTek)]
This contribution described a MediaTek proposal of quadtree-based adaptive offset (QAO). QAO
consists of two parts, quadtree partitioning and offset compensation. The former splits a picture into
multi-level quadtree partitions, and each partition is compensated by one offset compensation
method. The latter reduces errors between reconstructed pixels and original samples of a current
picture by using four kinds of offset compensation methods, including uniform band offset, nonuniform band offset, cross pattern edge offset, and diagonal cross pattern edge offset. Simulation
results reportedly show that in comparison with the TE10 anchor, which enables adaptive loop filter
(ALF) and many high efficiency tools of TMuC0.7 as required in JCTVC-B300, this proposal can
achieve 1.2% and 2.0% bit rate reductions for high efficiency random access (HE-RA)
configurations and high efficiency low delay (HE-LD) configurations, respectively. The encoding
time was reportedly increased by 2% and 1% for HE-RA and HE-LD, respectively, and the decoder
time was reportedly increased by 9% and 9%, respectively.
The following remarks and observations were recorded:

Each quadtree partition can switch the mode: Wiener, one of the four offset methods
(similar to original Samsung proposal A125)Subjective viewing did not unveil differences

Complexity increase would be too high to be justified compared to the small gain

Further study was encouraged.
12.1.1.1.1.1.1.1.21 JCTVC-C180 TE10.3: Cross check of MediaTek's proposal on quadtreebased adaptive offset [A. Segall (Sharp)] (missing prior, available first day)
This contribution reported cross-check activity for the MediaTek proposal on quadtree-based
adaptive offset. This cross-check was performed within the context of TE10.
166
MediaTek sent software, and the experiments were run, and closely matching PSNR results were
reported. No visual inspection was done.
13 TE11: Motion vector coding
13.1.1.1.1.1.1.1.1
JCTVC-C293 TE11: Summary report for TE11 on motion vector coding [J.
Jung (Orange Labs)]
This document summarized the activities in Tool Experiment TE11 on motion vector coding. A
total of 9 companies or universities had registered to the TE: Sony, Sungkyunkwan University,
Samsung, NTT DOCOMO, LGE, HHI, Sharps Labs of America, ETRI / Kyung Hee University and
Orange Labs. A few emails were exchanged mainly to coordinate the cross-check activity and
exchange the modified versions of the TMuC software. Also it can be noticed that for some topics,
either the contribution was cancelled by the proponent (lack of time running the experiment, or
insufficient results), or the cross-check was not fully completed (all configurations, or sequences).
Several experiments on motion vector coding had been performed for TE11 with various
configurations of the software.
For the Inter mode, it was reported that the MV Competition scheme provides significant gains.
For the Skip, Direct, Merge modes, both Skip + Direct (with MV Competition) and Merge were
reported to bring significant gains. The following options were reported to seem possible for the
TM:

Keep Merge only at CU and PU level

Keep Skip + Direct only at CU level
 Keep Merge at PU level, Skip and Direct at CU level
When MV Competition is concerned (for either Inter, Skip and Direct modes) it was reported that:

Having a temporal predictor brings gains

Having the ability to signal the set of predictors (including the ordering of the predictors)
brings gains
At this stage, experiments combining IMVP and MV Competition were reported to have failed to
achieve noticeable improvements.
13.1.1.1.1.1.1.1.2
JCTVC-C289 TE11: Report on experiment 3.1.a: Disabling MV competition
[J. Jung, G. Clare, S. Pateux (Orange Labs)] (missing IPR statement prior)
The following remarks and observations were recorded:

Removing the MV competition on Inter, and Skip/Direct modes reportedly provides an
average coding loss of 2.6% to 4.1%. (by tendency higher for RA and LC).

Decoder runtime reportedly increases by MV competition (not measured on the full set, only
QVGA class D) 0.1-2.4%; with the highest increase for the lowest bit rate.
Conclusion: Keep MV competition for inter and skip, considering its compression gain.
13.1.1.1.1.1.1.1.3
JCTVC-C132 TE11: Cross-verification result of Orange Labs motion vector
competition proposal (3.1a) by SKKU [J. Yang, K. Won, B. Jeon (SKKU)]
This cross-check report confirmed the results of JCTVC-C289.
13.1.1.1.1.1.1.1.4
JCTVC-C290 TE11: Report on experiment 3.2.b: MV-Competition on Inter,
Skip and Direct modes [J. Jung, G. Clare, S. Pateux (Orange Labs)] (missing IPR
statement prior)
167
The TMuC reference applies MV-Competition for Inter modes, and Merge. Skip and Direct modes
are disabled. This experiment evaluated a configuration with MV-Competition for Inter, Skip and
Direct modes, with Merge disabled.
The document reported that the scheme with MV-Competition enabled on Inter, Skip and Direct
mode provides on average better results than the current TMuC reference.
Average BR gain was reported as 1.5%, gain on LC was higher – around 2.5-3%.
13.1.1.1.1.1.1.1.5
JCTVC-C043 TE11: Simulation results of merge/skip (3.2b and 3.2d) [Y.
Suzuki, F. Bossen (NTT DoCoMo)]
The following remarks and observations were recorded:
-
Confirmed the results of JCTVC-C290 for the HE case (LC not done).
-
Reported increased encoder run time by 10%, with a marginal decoder time impact.
-
Experiment 3.2d : Switching off skip, direct, CU merge increases BR by roughly 5%, no impact
on encoding/decoding time (only done for HE case).
13.1.1.1.1.1.1.1.6
JCTVC-C292 TE11: Verification of experiment 3.2.d by NTT DOCOMO [J.
Jung, G. Clare, S. Pateux (Orange Labs)] (missing prior, available first day)
The following remarks and observations were recorded:
-
Confirmed the results of JCTVC-C043 3.2d HE cases.
-
Additional results on LC were: RA: Skip 8% / Merge 6%, LD: Skip 10% / Merge 6%
-
For LC, no results on encoder/decoder run times were provided.
13.1.1.1.1.1.1.1.7
JCTVC-C199 TE11: Report on experiment 3.2.c: Check skip and merge
together [I.-K. Kim , T. Lee , W.-J. Han (Samsung)]
In this document, the coding efficiency of a block merging process (MRG) and Skip/Direct mode of
TMuC software was presented. The bit rate impact of Skip/Direct mode was reported as -0.7%
(random access), -2.6% (random access low complexity), 0.1% (low delay) and -2.9% (low delay
low complexity) compared to MRG which is turned on by default in the TMuC. This contribution
also presented a combination of MRG and Skip/Direct where Skip/Direct was enabled on top of
MRG. The reported bit rate impact was -1.2% (random access condition), -2.4% (random access
low complexity condition), -0.5% (low delay condition) and -3.3% (low delay low complexity
condition).
The following remarks and observations were recorded:
-
Compared to configuration skip+direct on CU merge off, 0.5% BR gain on average if both
enabled. No significant increase in encoder run time.
-
Skip+Direct vs. CU based merging seems not always to be identical in effect (as was assumed
before).
13.1.1.1.1.1.1.1.8
JCTVC-C191 TE11: Cross-check result of merge/skip (3.2c) S. Jeong, J.
Lee, H. Y. Kim, S.-C. Lim (ETRI), K. Kim, H. Lee, G. Park (KHU)
This cross-check report confirmed the results of JCTVC-C199. The decoder run time was only
marginally increased, encoder run time increased around 10+%.
It was remarked that the high gains reported in the LC case may be caused by deficiencies of LCEC
vs. PIPE, and some of this may be resolved in "LCEC phase II".
13.1.1.1.1.1.1.1.9
JCTVC-C166 TE11: Study on motion vector coding (experiment 3.3.a &
3.3.c) [K. Sato (Sony)]
This contribution reported results of two experiments as follows:
168

Experiment -1: Comparison of with & without Temporal Predictor
 Experiment -2: Code Number Swapping of Motion Predictors
The following remarks and observations were recorded:

Experiments were performed only for 50 frames.

Average numbers were missing; for experiment 2 the gain seems to be around 0 overall.
13.1.1.1.1.1.1.1.10 JCTVC-C116 TE11: Cross verification of Sony proposal on motion vector
coding [Y.-J. Jeon, B.-M. Jeon (LG Electronics)]
This cross verification contribution confirmed the results of JCTVC-C166 on experiment 2 (with a
reported gain of approximately 0); no cross-check was provided on experiment 1 in the contribution.
13.1.1.1.1.1.1.1.11 JCTVC-C291 TE11: Report on experiment 3.3.b: "temporally oriented" set of
predictors for MV-Competition [J. Jung, G. Clare, S. Pateux (Orange Labs)]
(missing IPR statement prior)
This contribution reported that by changing the order of predictors (temporal first), additional gain
is possible – in particular for static background scenes. The contribution included a suggestion to
signal the set of predictors in MV competition (potentially different for skip as well) at the slice
level. No results on that were available yet.
It was reported that this would come with practically no cost at the decoder, but the encoder would
need to do some extra optimization work.
Further work was encouraged
13.1.1.1.1.1.1.1.12 JCTVC-C308 Cross verification of MV Coding Proposal by JCTVC-C293 [K.
Sato (Sony)] (late reg.) (verbally presented in TE11, uploaded after the meeting
ended)
This cross-check contribution was registered very late. When brought up for discussion, it had not
yet been uploaded and the contributor verbally reported that the work was not yet complete, but that
he results achieved so far had confirmed JCTVC-C291. The contribution was eventually uploaded
significantly after the meeting had ended.
14 TE12
14.1
TE12: Overall TE12 summary
14.1.1.1.1.1.1.1.1
JCTVC-C225 TE12: Summary of evaluation of TMuC tools in TE12 [K.
McCann (Samsung/Zetacast)] (missing prior, uploaded first day)
This contribution was a summary of tool experiment 12 (TE12) on exploring the performance of
individual tools in the Test Model under Consideration (TMuC). Detailed results were reported by
the TE12 participants in individual input documents.
The work of TE12 was very substantial, and it was a very important effort.
The incremental contribution of each individual tool in the TMuC is highly dependent on the
settings of the other tools in use; in effect, it is a multi-dimensional non-linear system. The starting
points used in TE12 were the default configurations of the TMuC specified in JCTVC-B300, since
these represent the current best estimate of realistic use cases.
Each tool in the TMuC was tested individually as follows.

Some tools are on/off in nature (e.g. ROT). This type of tool was tested by being turned on
(if off in the default configuration) or off (if on in the default configuration) unless it has a
direct alternative (see below).
169

Some tools have direct alternatives either within the TMuC itself or else added to the
software for comparison purposes (e.g. PIPE and the LCEC entropy coding within the
TMuC, with CABAC for comparison). This type of tool was tested by being replaced by
each direct alternative in turn (e.g. PIPE was tested against both LCEC and CABAC).

Some tools can have minimum or maximum limits defined (e.g. Large Transform block
size). This type of tool was tested by having either the minimum or maximum limit made
stricter as appropriate (e.g. maximum transform size being reduced from 64x64 to 32x32).
Tools for which there is no alternative in the TMuC (e.g. inter transforms) were not tested.
In most cases, only one tool at a time was exercised by giving it an alternative setting from that of
the default configuration. However, some limited testing of combinations of tools was also
performed, in order to improve understanding of the interaction between tools (e.g. between ROT
and MDDT).
All of the tools listed in the Tool Experiment 12 test plan were evaluated, with the exception of one
encoder-only test on CABAC-based RDO on/off. In most cases the tests used TMuC version 0.7
and subsequent versions of the TMuC released by the Software AHG. However, three of the tests
used company-specific branches: PU-Based Merging (tested with TMuC 0.8-hhi-bugfix), Enhanced
CIP (tested with 0.7-BBC branch) and ALF default/3-input ALF (tested with 0.7-panasonic branch).
Detailed results were reported by the TE12 participants in individual input documents.
It was anticipated that the results of this work would be used to guide improvements in default
operating points. In addition, the TE work provides insight on how the various tools contribute to
the overall performance of the TMuC and hence helped inform the process of forming the first Test
Model. However, it should be noted that the various tools in TMuC interact in a complex non-linear
manner, so that careful interpretation of the results is required.
A total of 70 documents directly relating to TE12 were registered. The categories were as follows:
1. Unit definitions
2. Motion representation
3. Intra-frame prediction
4. Transforms
5. Filtering
6. Entropy coding
7. Other decoder tools
8. Encoder-only tools
Most subtests used TMuC 0.7, while some used 0.7.x (which would include some bug fixes that do
not affect the default configuration). All of the 0.7.x versions should be able to be well compared to
the distributed anchors. Some variants are noted below:

PU merging testing used "0.8 HHI bug fix" software;

Note that a special patch was used in Enhanced CIP on/off testing as discussed in JCTVCC213 with cross-check JCTVC-C297, which are not classified here as TE12.

"0.7 Panasonic branch" was used for 3-input ALF tests (e.g., JCTVC-C214).

LCEC and PBIC tests used 0.7.4.

CABAC based RDO on/off encoder-only evaluation was not tested.
14.1.1.1.1.1.1.1.2
JCTVC-C320 TE12: Summary of results of evaluation of TMuC tools in TE12
[K. McCann (TE coordinator)] (late registration)
170
Due to the large effort and large number of contributions involved in TE12, the original summary
report for the TE (JCTVC-C225) did not contain a summary of the results. Such a report of results
was prepared during the meeting and was reported in this contribution. Although the effort involved
in the preparation of this contribution was greatly appreciated, there was insufficient time to
properly review the document at the meeting due to the late arrival of the document.
14.2
TE12 TMuC coding unit definitions
14.2.1.1.1.1.1.1.1
JCTVC-C029 TE12: Report on 64x64 versus 32x32 maximum coding unit
size for low complexity configuration [P. Pandit (Harmonic)]
This contribution presented results for the Max CU size experiment under TE12 for the low
complexity configuration. Simulations reportedly showed that when using CU32 size, for Intra only
case the average BD-Bit rate gain is about 0.2%, for Random access case the average BD-Bit rate
loss is about 7% and for low delay case the average BD-Bit rate loss is about 8.8%. Some
comments on the visual quality were also made. Cross verification of the results was reportedly
done with Motorola.

The tested cases included LC configuration for intra, low delay & random access

Visual comparison was done for QP 37
14.2.1.1.1.1.1.1.2
JCTVC-C044 TE12: Results for experiments on max CU size, RDOQ and
AIS [F. Bossen, T. K. Tan (NTT DoCoMo)]
Other aspects of this contribution are discussed in another section below.
This contribution reported the simulation results for Tool Experiment 12 as described in JCTVCB312r5. Three sets of experiments were simulated. The experiments are the comparison of the
maximum coding unit (Max CU) size of 32 and 64, the comparison of rate distortion optimized
quantization (RDOQ) on and off, and the disabling of the adaptive intra smoothing (AIS) with the
default set to always off.

The tested cases included HE configuration for intra, low-delay & random access

Decoder run time was 2.5x longer without 64x64 (the reason being related to adaptive
scanning in MDDT)

With 32x32 and smaller only, there was marginal loss in intra-only, around 3% average for
the inter cases

Most of the gain was in classes B and E (as was the case in JCTVC-C029)
14.2.1.1.1.1.1.1.3
JCTVC-C259 TE12: Results for experiments on Max CU size [Y. Yu, K.
Panusopone, L. Wang, S.-T. Hsiang, F. Ishtiaq (Motorola)]
Test Experiment 12 was set up at the second JCT-VC meeting. TE12 aimed to explore the
performance of individual tools in the Test Model under Consideration (TMuC). As part of TE12,
the CU size was tested to decide the best maximum CU size between 32 vs 64. The simulation
results were cross checked with NTT Docomo and Harmonic. From the simulation results, it was
found that CU size 64 is consistently better than CU size 32. Therefore, the contributor
recommended that a maximum CU size 64 should be used in the TMuC.
The results mostly confirmed the findings of JCTVC-C029 and JCTVC-C044 (a different TMuC
version was used, with small deviations in PSNR)
It was suggested to possibly restrict the intra case to 32x32 (note: there were currently only 5
prediction directions used in intra 64x64).
171
14.3
TE12 TMuC prediction units
14.3.1.1.1.1.1.1.1
JCTVC-C052 TE12.1: Block merging (MRG) test (off vs. on) by Fraunhofer
HHI [B. Bross (Fraunhofer HHI)]
This document reported the results of testing the block merging mode for CUs (MRG) within the
scope of tool experiment 12 evaluating TMuC Tools. For the test, MRG was disabled and tested
against the MRG-enabled default. In a high efficiency scenario, average bit rate savings up to 0.7%
were reported when disabling MRG. In a low complexity scenario, disabling MRG yields average
bit rate savings up to 2.9%. MRG disabled encoder runtimes range from 106% to 127% and decoder
runtimes from 96% to 116% of the default MRG enabled runtimes have been measured.
Disabling CU based merging (reinvoking skip & direct) gives roughly 1% BR reduction overall.
14.3.1.1.1.1.1.1.2
JCTVC-C109 TE12.1: Results for PU-based merging by Fraunhofer HHI [B.
Bross (Fraunhofer HHI)]
This document reported the results of testing the block merging mode for PUs (MRG_PU) within
the scope of tool experiment 12 evaluating TMuC Tools. For the test, MRG_PU was disabled and
tested against the block merging mode for CUs (MRG) default. In a high efficiency scenario,
average bit rate savings up to 1.9% were reported when disabling MRG. In a low complexity
scenario, disabling MRG yielded average bit rate savings up to 2.4%.
The collection of experiment results was not yet complete. A "Bug fix" done in TMuC 0.8 on the
PU based merging that was there before but was switched off by default (wrong RD decision at the
encoder – ticket #95 and coding of PU based merging ticket #97 avoiding some unnecessary
signaling). The results reportedly showed that PU based merging gives additional gain for the cases
tested so far. Implicitly, this replaces CU based merging.
It was commented that proposals with similar goals are in TE11.
14.3.1.1.1.1.1.1.3
JCTVC-C118 TE12.1 Cross-check report on block merging (MRG) on/off [K.
Sugimoto, A. Minezawa, S. Sekiguchi (Mitsubishi Electric)]
This contribution confirmed the results of JCTVC-C052. Brief checking of parameter settings and
software was performed.
14.3.1.1.1.1.1.1.4
JCTVC-C231 TE12.1: Results for PU based merging [Y. Suzuki, T.K. Tan, F.
Bossen (NTT DoCoMo)] (missing prior, available first day)
This contribution provided a cross check with more results than JCTVC-C109, with results reported
as follows. For HE LD, 1.5% BR gain, 2% for HE RA. Also results were reported on case-by-case
basis which show that using only skip/direct (as in JCTVC-C052) BQSquare, BQTerrace and
Vidyo3 provide losses.
14.3.1.1.1.1.1.1.5
JCTVC-C264 TE12: Evaluation of block merging (MRG) [W.-J. Chien, P.
Chen, M. Karczewicz (Qualcomm)]
This contribution confirmed the results of JCTVC-C052. Brief checking of parameter settings and
software was done.
It was remark that from the results, CU based merging does not seem to make sense. However,
investigation of software related to CU based merging may not have been performed to the detailed
extent that could definitely exclude the presence of bugs that would improve the performance if
fixed.
14.4
TE12 TMuC transform units
14.4.1.1.1.1.1.1.1
JCTVC-C056 TE12: Evaluation of transform unit (TU) size [M. Zhou, V. Sze
(TI)]
172
This document reported the evaluation results of the Transform Unit (TU) size for TE12, and
provided a brief complexity analysis of transforms. Experimental results reportedly show that the
average coding loss caused by disabling the 64x64 transform is around 0.1% to 0.6%; disabling
both 32x32 and 64x64 transforms lead to an average coding loss of 1.1% to 4%. The results also
reportedly show that disabling the 4x4 transform leads to huge coding loss. When both the 4x4 and
64x64 transforms are disabled, the average coding loss is increased to 3.9% to 8.3%, and disabling
4x4, 32x32 and 64x64 transforms (i.e. only 8x8 and 16x16 transforms are enabled) can lead to
quality degradation of 5.2% to 10.6% on average. Due to the high complexity and limited coding
efficiency gain of the 64x64 transform, it was recommended that the 64x64 transform be removed
from reference configurations. The contribution also recommended that the quality issues of
disabling the 4x4 transform be investigated in the future.
The following remarks and observations were recorded:

There is relatively small loss when 64x64 transform is disabled (for LD LC case which is
worst of all 0.6% on average over all classes, 1.3% average for class E LD LC).

32x32 disabling causes larger loss (1 to 3.4% depending on case).

Disabling 4x4 causes larger loss compared to AVC.

Almost 4x gate count when 64x64 needs to be implemented compared to 32x32.

This could be dealt with as a profiling issue; the current test material seems not to benefit
too much from using 64x64 (32x32 could be default in the TM, and further study of 64x64
could be performed in the future).
14.4.1.1.1.1.1.1.2
JCTVC-C105 TE12: Huawei report on max transform unit size evaluation [J.
Zhou, D. Wang, H. Yu (Huawei)]
This document summarized the Tool Experiment TE12 activities related to the evaluation of Max
TU Size. This evaluation was performed by Huawei and TI.

Confirms the results by TI, with a small difference due to the use of Windows vs. Linux.

The contributor received the parameter files from TI and ran on newly compiled software.
14.4.1.1.1.1.1.1.3
JCTVC-C200 TE12.1: Experimental results of transform unit quadtree/2level test [T. Lee, J. Chen, W.-J. Han (Samsung)]
This contribution presented the experimental results of TE12 Residual quadtree transform (RQT),
which was to explore the performance of RQT in the Test Model under Consideration (TMuC).
Experimental results of RQT off compared to TMuC showed 1.2%, 0.5% and 0.2% bit rate savings
under intra only, random access and low delay conditions, respectively, in high efficiency settings.
It shows 0.6%, -0.2% and 0.8% bit rate savings under the same conditions, respectively, in low
complexity settings. This contribution also analyzed the performance and complexity of the current
TMuC transform unit design by testing several variants of it: no transform unit across partition
boundary and no transform unit splitting inside a partition.
The following remarks and observations were recorded:

Turning RQT off gives loss in luma PSNR but also has less encoder complexity in the intra
case (where the BR increase is 1.2%), in inter cases there is a smaller BR increase but no
complexity advantage by turning RQT off.

Restricting TU to maximum of PU size (disallowing TU across PU partitions gives loss of
around 1%.

Restricting TU not becoming smaller than TU size gives loss as well.
173

Conclusion: RQT on would only make sense provided that that the suggestion below would
compensate for the complexity.

Intra prediction mode flag could also be used

Gain of TU across PU partition boundaries may be due to more efficient CBP

It could be that gain also comes mainly from asymmetric partitioning

A new suggestion (related to TE9_C198) was to limit the number of TU quadtree levels by
using signaling.
 JCTVC-C276 is also related as well.
These issues are further discussed elsewhere in this report.
14.4.1.1.1.1.1.1.4
JCTVC-C110 TE12.4: Transform unit quadtree tested against 2-level by
Fraunhofer HHI [B. Bross (Fraunhofer HHI)]
This document reported the results of testing the transform unit quadtree (RQT) against the 2-level
representation within the scope of tool experiment 12 evaluating TMuC Tools. The 2-level TU
representation was tested against the RQT default. BD-rates from -0.2% up to 0.7% were reported.
The encoder runtimes of the 2-level representation vary from 31% of the RQT encoder runtimes for
the intra only coding constraint set to 102% of the RQT encoder runtimes for the random access
coding constraint sets.
This confirms the results of JCTVC-C200.
14.5
TE12 Interpolation filters
14.5.1.1.1.1.1.1.1
JCTVC-C050 TE12.2: Interpolation filter SIFO and DIF tested against
MOMS by Fraunhofer HHI [B. Bross (Fraunhofer HHI)]
This document reported the results of testing the 12-tap SIFO and 6-tap DIF interpolation filters
against 6-tap MOMS filtering within the scope of tool experiment 12 evaluating TMuC Tools. In a
high efficiency configuration scenario, MOMS was tested against the SIFO default. Bit rate savings
are highly sequence dependent. Thus, average BD-rates of -4.6% are shown as well as BD-rates of
12% when comparing MOMS to the SIFO reference. The average encoder runtime of MOMS is
62%-68% of the SIFO encoder runtime. On the other hand, the average decoder runtime of MOMS
is 111% to 122% of the SIFO decoder runtime. In a low complexity scenario, MOMS was tested
against the DIF default. MOMS performs better than DIF in terms of coding efficiency with average
bit rate savings around 5% while encoder runtimes up to 139% and decoder runtimes up to 133% of
the DIF runtimes have been measured.
The following remarks and observations were recorded:

Results show slight increase of BR (0.3%) for HE case, but reduction of encoding time by
roughly 35%, increase of decoding time by roughly 15%

For LC (6-tap), rougly 6% BR decrease, increase encoding time by roughly 25%, decoding
time by >30%

Note: It is reported that the results may be wrong due to a bug in the fixed-point
implementation which is effective in class E (static background)

Sequence dependent, significant gain comes from BQSquare, BQTerrace, BlowingBubbles,
PartyScene

Uses 12 bit precision in case of IBDI off, 16 bit otherwise.

Due to usage of IIR filter, also full-pel values are filtered.
174

Potentially this implies that 2 buffers are needed (as in the current TMuC implementation).
Bringing it to one buffer would certainly affect the performance.
14.5.1.1.1.1.1.1.2
JCTVC-C087 TE12.2: Experimental results of interpolation filter [T. Chujoh,
K. Kanou, T. Yamakage (Toshiba)]
In this contribution, detailed results of interpolation filters were reported. This is one of reports in
Tool Experiment 12: Evaluation of TMuC Tools. The interpolation filter of the high efficiency
anchor is SIFO (Switched Interpolation Filter with Offset) and that of the low complexity anchor is
DIF (Directional Interpolation Filter). In this experiment, some DCT-IFs (Discrete Cosine
Transform – Interpolation Filters) are compared to anchor filters. The experimental results show
that the coding gain of DCT-IF 12-tap compared to SIFO in the case of high efficiency anchor is
average of 0.1% and the coding gain of DCT-IF 6-tap compared to DIF in the case of low
complexity anchor is average of 3.6%. The encoding of DCT-IF 12-tap compared to SIFO in the
case of high efficiency anchor is about 2/3 and the decoding time is reportedly decreased by 5%.
The encoding and decoding times of DCT-IF 6-tap compared to DIF in the case of low complexity
anchor were reportedly stable.

For HE cases, little gain of DCT-IF 12 (0.1% on average), reduction of encoding time by
roughly 30%.

For LC cases, 1.7% gain on average for DCT-IF 6-tap, no significant increase in encoding
time, for 8-tap filter 3.5% gain, increase of encoding time by 10%.

It was remarked that the performance seems to be highly variable over different sequences,
e.g. roughly 14% for BQSquare in LD LC.
14.5.1.1.1.1.1.1.3
JCTVC-C241 TE12: Crosscheck on interpolation filter: SIFO(DIF)/MOMS [R.
Panchal, M. Karczewicz (Qualcomm)]
The following remarks and observations were recorded for this contribution:

Confirms results of JCTVC-C050, emphasize different treatment of chroma (higher
precision filters) by MOMS (SIFO compared against 0.7)

Confirm sreport of JCTVC-C087, difference of TMuC 0.7 and 0.8 due to bug fix; 0.8
roughly reduces encoding time by 25%, BR by 0.15% for SIFO

Emphasizes that SIFO has additional advantage of illumination comp (not effective in
current test set).
14.5.1.1.1.1.1.1.4
JCTVC-C239 TE12: Report on interpolation filter: SIFO/DIF [R. Panchal, M.
Karczewicz (Qualcomm)]
The following remarks and observations were recorded for this contribution:

Confirms results of Toshiba (C087)

Gain of SIFO12 vs. DIF6 in RA/LC 5.5%BR, 200%ET, 115% DT, LD/LC 5.9%,
176%/110%

Loss of DIF6 vs. SIFO in RA/HE 3.3%BR/59%ET/96%DT, LD/HE
3.9%BR/65%ET/97%DT
14.5.1.1.1.1.1.1.5
JCTVC-C162 TE12.2: Results on MC interpolation filters [K. Kondo, T.
Suzuki (Sony)]
This document reported the results of MC interpolation filters, especially SIFO/DIF as compared
with DCT-IF for interpolation filtering in TE12. In the test, the coding performance and complexity
were measured under common test conditions, which were defined in JCTVC-B300. The coding
175
performance and complexity of MOMS were reported in addition to DCT-IF. These results were
shown in the second part of the contribution. Detailed results were summarized in an attached Excel
sheet.
The contribution confirmed the results of JCTVC-C087, with a slight difference due to different
platforms.
Additional results:

for LC with 6-tap DCT-IF BR reduction 2.8% RA, 0.7% LD, no difference in ET & DT

for HE with 8-tap DCT-IF RA: BR increase 0.2% / 61% ET, LD: BR 1.1% / 65% ET

for LC with 4-tap DCT-IF 0.8% BR increase RA, 7.4% BR increase LD still ET roughly
100%
The contribution also confirmed the results of JCTVC-C050, with the following aspects noted:

Additional results on 4-tap MOMS for LC cases show BR red. 5.4%/6.1% BR red., 7%/6%
ET inc. 10%/14% DT inc. for RA/LD cases compared to DIF anchors (note: MOMS with
bug)

Confirms significant gain of MOMS in chroma due to use of longer filter (bilinear in
reference cases)

Analysis of MOMS complexity: Due to usage of an IIR filter, the entire frame needs to be
processed first. Memory complexity is significantly increased – higher memory access than
conv. 8-tap.
14.5.1.1.1.1.1.1.6
JCTVC-C197 TE12.2: Results for SIFO-6Tap with DIF tests by Nokia and
Qualcomm [K. Ugur, J. Lainema (Nokia), R. Panchal, M. Karczewicz (Qualcomm)]
This contribution presented the Tool Experiment 12 results on Switched Interpolation Filter with
Offset (SIFO) that switches separable and directional interpolation filters with 6 taps.

BR increase roughly 2% average for HE, BR decrease roughly 3% for LC

ET roughly 50% increase for LC, 10% decrease for HE

DT number not known, probably slight decrease in HE, slight increase in LC cases

Emphasized that switching between directional and 2D filters (at slice level) gives benefit
(mostly for LC case)

Also reports advantage vs. DCT-IF 6-tap, particularly in LD LC gain is achieved over all
classes.
14.5.1.1.1.1.1.1.7
JCTVC-C201 TE12.2: Experimental results of SIFO (DIF) / DCT-IF [E.
Alshina, J. Chen, N. Shlyakhov, A. Alshin, W.-J. Han (Samsung)]
This document reported the results of the chroma DCT-IF extension within the scope of tool
experiment 12 evaluating TMuC Tools. For the test, DCT-IF 6 tap was used to replace the existing
default chroma interpolation (bilinear filter for high efficiency and DIF for low complexity). The
interpolation filter for luma remained as it was for both high efficiency and low delay
configurations, which is SIFO for high efficiency and DIF for low complexity, respectively.
Experiments results reportedly showed that, in the high efficiency test configuration, the average
BD-rate reduction of Y, Cb, Cr components are 0.2%, 5.5% and 6.6%, respectively, for random
access, and 0.5%, 2.5% and 2.9% for low delay. In the low complexity test configuration, the
average BD-rate reduction of Y, Cb, Cr components are 0.4, 12.7 and 14.1%, respectively, for
random access and 1.4, 12.4 and 15.3% for low delay. Encoding and decoding time was similar to
that of anchor software, and deviations exist from measurement inaccuracies.
176

Encoding/decoding time increased slightly (2%Enc/7%Dec for LC case) – concern was
expressed for this LC case.

BD BR saving in chroma may not be the best criterion for judgment, as the effect of better
chroma interpolation is more about improving the PSNR of the chroma components, and the
percentage of total bit rate for chroma is low.

The current default using bilinear interpolation seems to have a drift or rounding problem in
chroma interpolation, and the gains reported may partially relate to the fact that this is
resolved by higher-order interpolation.

The suggestion was made in the contribution to replace the 2-tap filter by DCT-IF 6-tap
filter.
14.5.1.1.1.1.1.1.8
JCTVC-C248 TE12.2: Cross verification on DCT-IF for chroma [T. Chujoh, K.
Kanou, T. Yamakage (Toshiba)] (missing prior, available first day)
This contribution confirmed the results of JCTVC-C201.
JCTVC-C309 Cross verification of DCT-IF for chroma [T. Suzuki (Sony)] (late reg., uploaded
after meeting had ended)
This late document was reported verbally during the meeting: It referred to JCTVC-C201, and was
not finished yet when reviewed. However, the results reportedly confirmed the results appropriately
for classes C and D. The actual document contribution was uploaded significantly after the meeting
had ended.
14.5.1.1.1.1.1.1.9
Overall summary of interpolation filter investigations:

It would be preferable to have the same chroma filter when doing an experiment where
different luma interpolation filters are compared.

The issue of chroma interpolation should be further investigated and must be better
understood.

MOMS: Good performance, but has open issues. Bug reported, claimed to potentially
improve performance for class E. Bidirectional IIR filter requires doubling the memory, and
resolving that may reduce the performance.

SIFO has highest objective gain typically, but higher encoder complexity.

For DCT-IF, DIF, SIFO in the various configurations (4..12tap) the group decided to
provide a table about coding performance and enc./decoder complexity. Kemal Ugur was
asked to coordinate a breakout to collect and organized the data to take as a basis for
decision.
14.6
TE12 TMuC motion representation
14.6.1.1.1.1.1.1.1
JCTVC-C030 TE12: Report on AMP evaluation [E. Francois (Technicolor), L.
Guillo (INRIA), A. Ichigaya (NHK), H. Yu (Huawei)] (missing prior, available first
day)
The following remarks and observations were recorded for this contribution:

Turning AMP off results in 1-2% BR increase for the 4 inter configurations, no large
variations over the classes (slightly more gain in class E makes the LD cases a bit better).

Slight deviations of results were observed on Windows vs. Linux platforms.

Encoding time was reduced by roughly 33% by turning off AMP.
177

The relationship with other methods investigated was in TE3. Block partitioning appears
useful, but not necessarily by AMP.
14.6.1.1.1.1.1.1.2
JCTVC-C104 TE12: Huawei & HiSilicon report on AMP evaluation [X. Zheng
(HiSilicon), J. Zhou, H. Yu (Huawei)]
This report confirmed the results of JCTVC-C030.
14.6.1.1.1.1.1.1.3
JCTVC-C149 TE12.2: Report on asymmetric motion prediction unit (AMP)
on/off [A. Ichigaya, K. Iguchi (NHK)]
This report confirmed the results of JCTVC-C030.
14.6.1.1.1.1.1.1.4
JCTVC-C286 TE12.2: Technicolor & INRIA report on AMP evaluation [E.
Francois (Technicolor), L. Guillo (INRIA)]
The report confirmed results of JCTVC-C030.
14.6.1.1.1.1.1.1.5
JCTVC-C119 TE12.2 report on MV prediction AMVP/IMVP [K. Sugimoto, A.
Minezawa, S. Sekiguchi (Mitsubishi Electric)]
The following remarks and observations were recorded for this contribution:

Switching IMVP on instead of AMVP produces BR increase (<0.5% on average, 1% for
class A), and encoding time also increases slightly.

IMVP seems not to give value.

Further investigation of AMVP was performed in TE11.
14.6.1.1.1.1.1.1.6
JCTVC-C163 TE12.2: Results on MV prediction [K. Sato, T. Suzuki (Sony)]
This contribution confirmed the results of JCTVC-C119, with identical results at the bitstream level
(both run under Linux).
14.6.1.1.1.1.1.1.7
JCTVC-C265 TE12: Evaluation of adaptive motion vector resolution
(AMVRES) [W.-J. Chien, P. Chen, M. Karczewicz (Qualcomm)]
The following remarks and observations were recorded for this contribution:

Result were provided only for HE cases

TE12 result: the BR increase by turning AMVRES off is roughly 0.5%, the effect on
encoding time was roughly 10%.

The new proposal of "unified AMVRES" that adapts to prediction direction results in –
1.4% difference on low delay, 0.6% for RA.

Additional results on GPB=1+ AMVRES=0 / GPB=0 + AMVRES=1 shows 4.7% / 3.5%
decrease in BR and encoding time increase 215% / 112% compared to LD HE anchor.
14.6.1.1.1.1.1.1.8
JCTVC-C182 TE12.2: Cross-check of adaptive motion vector resolution
(AMVRES) technology in the TMuC software [A. Segall (Sharp)] (missing prior,
available first day)
This contribution confirmed the results of JCTVC-C265.
14.7
TE12 TMuC partition-based illumination compensation
14.7.1.1.1.1.1.1.1
JCTVC-C041 TE12: Performance of partition based illumination
compensation (PBIC) [A. Fujibayashi, S. Kanumuri, F. Bossen, T. K. Tan (NTT
DoCoMo)]
178
This contribution was a TE12 contribution presenting the performance of Partition Based
Illumination Compensation (PBIC) tool integrated into the TMuC software. In addition to the test
conditions of TE12, the contribution compared PBIC to two other tools, namely SIFO and
GRefMode, present in the TMuC software that can handle illumination changes. This comparison
was also done when a linear fade was applied to the test sequences.
The results reportedly showed that PBIC has no impact on coding efficiency for the regular test
sequences and a large gain (11-20% depending on the config) for the test sequences with linear fade.
PBIC reportedly has a minimal impact (2-5% increase) on encoder/decoder run times. The authors
said that this comparison indicated that PBIC is the top performing tool in terms of coding
efficiency and has the least increase in encoder run time.
The following remarks and observations were recorded for this contribution:

For the common conditions test sequences, PBIC gives no gain or even produces a small
loss (1%) in low complexity modes (could be due to the entropy coding).

For the same test sequences (first 2 seconds) with artificial fade (1 s fade out/in each), PBIC
was better in terms BR reduction (12% on average) than SIFO (5%, TMuC) and GRefMode
(10%, from A124), and less complex.

For more natural sequences with illumination variations (Nuts, Shuttlestart, Flower), a 2.6%
benefit was reported on average.
 It was remarked that cross fades may be more relevant than fade-in/-out
The PBIC proposal seems to be somewhat better than other potential methods of illumination
compensation, but shows no gains on the current test set, and it is probably not a high priority to
include dedicated tools for illumination variations.
14.7.1.1.1.1.1.1.2
JCTVC-C232 TE12.7: Verification of JCTVC-C041 (partition based
illumination compensation - PBIC) [N. Sprljan, S. Paschalakis, P. Wu (Mitsubishi
Electric)]
This contribution confirmed the results of JCTVC-C041 on the common conditions set, and with
some of the artificial fades, although there were some small deviations due to rounding errors. It
was reported that the author had investigated the source code.
14.7.1.1.1.1.1.1.3
JCTVC-C246 TE12.8: Cross-verification of partition-based illumination
compensation (PBIC) [P. Onno (Canon)]
This contribution confirmed the results of JCTVC-C041 on the common conditions set.
14.8
TE12 TMuC filtering
14.8.1.1.1.1.1.1.1
JCTVC-C235 TE12: Evaluation of adaptive in-loop filter [I. S. Chong, M.
Karczewicz (Qualcomm)]
The effect of the adaptive in-loop filter on BD measures was reported as follows:


High complexity

Intra : 2.2% (max 4.2% for class A)

Random access :3.7% (max 5.8% for class B)

Low delay: 5.2% (max 8.5% for class E)
Low complexity

Intra :-2.2%

Random access:-4.8%
179
 Low delay: -4.3%
Decoding time increased by around 15% (10-20 depending on cases).
The report on encoding time (no significant difference) seems to contradict with the X-check,
probably caused the the cluster environment which may not deliver reasonable numbers.
14.8.1.1.1.1.1.1.2
JCTVC-C129 TE12: Cross verification for adaptive loop filter (ALF) (on vs.
off) [Y.-J. Chiu, L. Xu, W. Zhang, H. Jiang (Intel)]
This contribution confirmed the results of JCTVC-C235 in terms of BR and DT; Encoding time
seems also not to be consistent (increase by 30% switching ALF off?) due to usage of a computing
cluster.
14.8.1.1.1.1.1.1.3
JCTVC-C153 TE12: Report on deblocking filter [P. Chen, M. Karczewicz
(Qualcomm)]
The following remarks and observations were recorded for this contribution:

Marginal changes in BD rate were observed.

The deblocking filter does not have significant impact on PSNR
14.8.1.1.1.1.1.1.4
JCTVC-C148 TE12: Crosscheck on deblocking filter in TMuC0.7 by
MediaTek [Y.-L. Chang, Y.-W. Huang, S. Lei (MediaTek)]
This contribution confirmed the results of JCTVC-C153 in BD rate.
The encoding/decoding time was roughly 3% lower when deblocking was switched off.
14.8.1.1.1.1.1.1.5
JCTVC-C214 TE12.5: Results for adaptive loop filter using prediction and
residual [M. Narroschke (Panasonic)]
This contribution was a part of the TE12: It presented the results of the Adaptive loop filter using
the prediction and residual as filter inputs. An average Y-BD-rate gain was reported of 1.4% for
Intra High Efficiency, 1.3% for Random Access High Efficiency, and 0.8% for Low Delay High
Efficiency compared to the anchors. It was proposed to include it in the TM.
The test results were not complete yet due to recent bug fix work (the results were more complete in
the cross check).
A gain of roughly 1% average (higher on intra) compared to ALF was reported.
Encoder complexity was roughly 25-30% higher than with ALF.
Decoder complexity was also increased by roughly 10%.
14.8.1.1.1.1.1.1.6
JCTVC-C230 TE12: Cross check result of Panasonic's 3-input-ALF [I. S.
Chong, M. Karczewicz (Qualcomm)]
This contribution confirmed the results of JCTVC-C214.
In comparison against ALF, the BD BR improvement was 1.2% for Intra Low Complexity, 1.1%
for Random Access Low Complexity, and 0.9% for Low Delay Low Complexity
14.8.1.1.1.1.1.1.7
JCTVC-C086 Experimental results of ALF on low complexity [T. Chujoh, T.
Yamakage (Toshiba)]
The BR reduction in LC configurations was:

RA 4.83%/2.77%/7.56% for ALF alone / DCT-IF 6-tap alone / combination
 LD 4.27%/0.67%/7.11% for ALF alone / DCT-IF 6-tap alone / combination
The suggestion of the authors was to adopt ALF in the test model.
It was remarked that the relationship of ALF and interpolation filters still needs to be better
understood
180
JCTVC-C212 Cross-verification of JCTVC-C086: Experimental results of ALF on low
complexity [W.-J. Han (Samsung)]
This contribution confirmed the BR reduction results reported in other contributions for this feature.
14.8.1.1.1.1.1.1.8
Overall conclusions on filtering:

ALF results were consistent, with average rate gain slightly less than 5%, decoder runtime
increased around 15%.

Encoder runtime reports were not fully consistent due to usage of clusters.

The relation between interpolation and loop filtering needs to be better understood.

Comparison against post filter (TE 10) should be considered.
14.9
TE12 TMuC intra-frame prediction
14.9.1 Combined intra prediction
14.9.1.1.1.1.1.1.1
JCTVC-C092 TE12: Report on combined intra prediction evaluation [K.
Chono, K. Senzaki, H. Aoki, J. Tajime, Y. Senda (NEC)]
This contribution presented a report on Combined Intra Prediction (CIP) evaluation. Simulation
results reportedly showed that:

BD-Rate increases caused by disabling CIP are 0.3%, 0.2%, and 0.1% for the High Coding
Efficiency (HCE) All Intra, Random Access, Low Delay coding conditions;

BD-Rate increases caused by disabling CIP are 0.2%, 0.1%, and 0% for the Low
Complexity (LC) All Intra, Random Access, Low Delay coding conditions;

Encoding time decrease observed by disabling CIP is about 20% for the HCE and LC AllIntra coding conditions; significant encoding time decrease was not observed by disabling
CIP in other coding conditions; and
 Significant decoding time decrease was not observed in all coding conditions.
It was proposed that CIP is not used in default coding configurations in the TMuC software due to
its reported small coding gain and that CIP should be re-evaluated on the top of an HEVC test
model based coding tool set established as determined by the TE12 results.
It was further noted that when CIP is used, increased encoder complexity (e.g., by 20% for all-intra
runtime measure) is needed to determine whether to use CIP or not.
It was suggested that storing some information could be used in the encoder rather than
recomputing it, somewhat reducing the encoding runtime increase.
It was remarked that the recursive nature of the processing in the decoder may make it difficult to
implement this mode in a highly-optimized fast implementation.
14.9.1.1.1.1.1.1.2
JCTVC-C174 TE12.3: Report on combined intra prediction [T. Yamamoto, A.
Segall (SHARP)] (missing prior, available first day)
The resultsin this contribution were reported to be consistent with that reported in JCTVC-C092.
14.9.1.1.1.1.1.1.3
JCTVC-C213 Additional results for combined intra prediction [M. Mrak, T.
Davies, D. Flynn, A. Gabriellini (BBC)]
This document is not actually directly within the planned scope of TE12, but is closely related and
is thus discussed in this section.
This document provides further simulation results for the Combined Intra Prediction (CIP) tool. As
implemented in TMuC 0.7, the configuration of CIP was reported to be suboptimal. This document
181
proposed improvements to the CIP implementation, and the results reportedly showed that the gain
of CIP can be two to three times greater than in the original implementation.
The asserted suboptimality was asserted to be a result of several factors, which include:

RD search with CIP for one PU splitting type only, as determined by ANG search;

Suboptimal selection of constants used for CIP prediction and signaling;

CIP's open-loop component computed without taking into account AIS;
 RD search without CIP when ROT is used.
Some results were shown for a modified CIP, with roughly 0.5% reported improvement beyond that
of the prior CIP design proposal on intra conditions. Work on further improvements was reported to
be ongoing. It was remarked that some of the new runtime measurement results seem difficult to
justify.
After discussion, it was concluded that, for now, CIP should not be in the TM. Further study was
encouraged.
14.9.1.1.1.1.1.1.4
JCTVC-C297 Cross-verification of JCTVC-C213: additional results for
combined intra prediction [W.-J. Han (Samsung)] (late registration, missing prior,
available first day)
This contribution was a cross-verification report of JCTVC-C213: additional results for combined
intra prediction. Provided source code was investigated. Both encoder and decoder running were
done.
TMuC 0.7 software had been used and the provided patch file (several lines of changes) was
applied to the software. The provided patch included 1) adjustment of weights between open-loop
and close-loop predictors 2) removal of encoder-side penalty due to the fast estimation process of
the combined intra prediction. Both changes seem not relevant to the coding tool itself.
All results were reported to be perfectly matched to those of JCTVC-C213.
14.9.2 Adaptive intra smoothing
14.9.2.1.1.1.1.1.1
JCTVC-C054 TE12.3: Adaptive intra smoothing (AIS) test (off vs. on) by
Fraunhofer HHI [B. Bross (Fraunhofer HHI)]
This document reported results of testing adaptive intra smoothing (AIS) within the scope of tool
experiment 12 evaluating TMuC Tools. For the test, "AIS fast" was disabled and tested against the
AIS enabled default.
The test was thus testing "adaptive smoothing" versus "always smoothing" which was reported to
perform better than using no filter, but the selection of which non-adaptive case works better
depends on the configuration of other related coding tools, in particular including UDI.
For an intra coding only constraint set, average bit rate increases between 2.4% and 4% were
reported for the fast implementation. Encoder runtimes for this constraint set are 61% of the default
AIS enabled implementation.
For the inter-predictive coding constraint sets, the average bit rate increases between 0.4% and
1.5% are smaller. Encoder and decoder runtimes are 92% and 97% of the default AIS enabled
runtimes.
It was remarked that the setting of the default operation is part of the proper analysis of the
technique. A good analysis should include smoothing always off, smoothing always on, and
adaptive smoothing selection.
14.9.2.1.1.1.1.1.2
JCTVC-C120 TE12.3: Cross-check report on AIS on/off [K. Sugimoto, A.
Minezawa, S. Sekiguchi (Mitsubishi Electric)]
The purpose of this document is to report the result of a cross-check on Adaptive Intra Smoothing
ON/OFF conducted by HHI according to the conditions specified in JCTVC-B312. The verification
182
task was reported to have been done successfully and the results reported exactly matched those
provided by HHI.
However, some difference in measured encoding time on the same platform was reported for nonAIS operation. The contributor reported that the non-adaptive case was always faster than the
adaptive case. This was suggested to be some problem in the random-access case anchor time
measures used by the contributor of JCTVC-C054.
The benefit relative to the always off case measured in all intra coding was measured as 0.5-0.6%
for intra-only configurations.
14.9.2.1.1.1.1.1.3
JCTVC-C044 TE12: Results for experiments on max CU size, RDOQ and
AIS [F. Bossen, T. K. Tan (NTT DoCoMo)]
This section discusses the AIS part of this contribution – other parts of the same document were
addressed in other categories.
Similar behavior was reported as in the other contributions. Disabling AIS with disabling smoothing
always loses about 0.5% relative to using AIS, but makes the encoder run substantially slower (e.g.,
40%).
Fast decision encoding reduces the encoding time of AIS, but has an R-D penalty that is actually
more severe than just always disabling smoothing.
Decision: As suggested by the contributor, it was agreed to disable AIS and set DEFAULT_IS to 0
in the TM and reference configurations.
This may depend on various factors that could be affected by further study.
This does not affect consideration of proposed modified techniques as suggested in some other
related input contributions such as JCTVC-C302, JCTVC-C234 and JCTVC-C269.
14.9.2.1.1.1.1.1.4
JCTVC-C053 TE12.3: Adaptive intra smoothing (AIS) test (slow vs. fast) by
Fraunhofer HHI [B. Bross (Fraunhofer HHI)]
This document reported results of testing slow and fast encoder estimations for AIS within the
scope of tool experiment 12 evaluating TMuC Tools. For the test, "AIS fast" was enabled and tested
against the "AIS slow" default. For an intra coding only constraint set, average bit rate increases
between 0.9% and 1.2% were reported for the fast implementation. Encoder runtimes for this
constraint set reportedly ranged from 76% to 72% of the default slow implementation. For the interpredictive coding constraint sets, the average bit rate increases between 0.1% and 0.5% were
smaller. Encoder and decoder runtimes are reportedly similar for these cases and deviations cannot
be clearly distinguished from measurement inaccuracies.
These results seemed consistent with that reported in JCTVC-C044, specifically consistent with the
assertion that "fast AIS" operation performs worse (as tested) than always disabling the smoothing.
14.9.2.1.1.1.1.1.5
JCTVC-C281 TE12: Evaluation of fast adaptive intra smoothing [M. Coban,
Y. Zheng, M. Karczewicz (Qualcomm)]
This contribution presented test results for the fast adaptive intra smoothing listed in Tool
Experiment 12: Evaluation of TMuC Tools.
The related presentation material was requested to be uploaded.
Similar results were reported as described for other contributions on this topic (JCTVC-C044 and
JCTVC-C053).
14.9.3 Edge-based prediction
14.9.3.1.1.1.1.1.1
JCTVC-C171 TE12.3: Experimental results of edge based prediction [A.
Tabatabai, C. Auyeung (Sony)]
Contribution JCTVC-C171 presented results of edge based prediction under TE12.3 for the high
and the low complexity intra configuration. Simulations reportedly showed that when edge based
prediction is disabled, for both high and low complexity intra configuration, the average BD bit rate
183
is reduced by about 0.1%. Cross verification of the results was done with Panasonic. The measured
average BD bit rate was the same as measured by Panasonic. However, the results in this
contribution have some small differences from the results of Panasonic.
The related presentation material was requested to be uploaded.
The experiments were conducted with TMuC version 0.7.1 with "bug fix 51". Further details
regarding macro settings were described in the document.
Visual comparison was not performed.
The minor benefit measured for this coding tool led to the suggestion in the contribution that edge
prediction should not be included in the first test model of HEVC. This was agreed.
There are some new related variations discussed in the TE6 context that remained under
consideration at the time of discussion of this contribution.
14.9.3.1.1.1.1.1.2
JCTVC-C216 TE12.3: Results for edge based prediction [V. Drugeon
(Panasonic)]
Similar results were reported as above for JCTVC-C171. The current implementation of edge based
prediction in the current TMuC software using the test configuration proposed by TE12 did not
show any gain.
Interaction with other coding tools was suggested as the reason that the tested edge-based prediction
scheme did not show gain. To test this hypothesis, testing was performed with CIP, ROT, AIS,
MDDT, adaptive scan, and unified intra prediction all disabled in the software. In that context the
edge-based prediction scheme was reported to provide an average gain of 1.8% (with maximum
gain of 4.8%) for intra test conditions.
The question was asked how much coding performance was reduced by the disabling of the other
intra coding tools – and the contributor indicated that they did not know.
The related presentation material was requested to be uploaded.
A relationship with technology in TE6 was also noted by the contributor.
Further study of similar techniques was suggested by the contributor, toward identifying how such
technology may ultimately prove beneficial, depending on the testing context in regard to other
coding tools. Such further study was encouraged.
It was agreed that edge based prediction should not be in TM.
14.9.4 Planar prediction
14.9.4.1.1.1.1.1.1
JCTVC-C188 TE12.3: Results for planar prediction tests by Nokia [J.
Lainema, K. Ugur, O. Bici (Nokia)]
This document reported Nokia results for the TE12.3 experiment on disabling planar intra
prediction in the TMuC 0.7 environment. Since planar prediction is reported to be a tool to improve
subjective quality, rather than objective quality, visual examples demonstrating the claimed benefit
of planar prediction were also included.
The contribution also described an asserted inefficiency in indication of the planar intra prediction
mode in the low complexity entropy coding configurations, and proposed a fix for that.
In the high efficiency intra coding configuration case, basically no difference in objective
performance was measured.
In the low complexity intra coding configuration case, an objective loss of 0.9% in bit rate was
reported.
Random access and low delay testing resulted in the expected results – diminished impact in
objective terms.
Joint coding of the planar mode flag and the intra split flag was tried for the low complexity
configuration case. With this alteration, the objective loss of 0.9% was reduced to a loss of 0.3%.
Artifact examples were shown in the document with QP=32 and 37, showing a substantial decrease
in visual artifacts for smooth gradually-changing regions (which are essentially what is modeled by
this mode), such as sky areas.
184
The contributor was asked about the complexity of the technique – who responded that there may
actually be a net complexity reduction from the technique, since no residual difference coding is
performed when this prediction mode is used (in the way it is currently found in the TMuC).
It was suggested that adapting the quantizer step size in smooth regions may be able to mitigate the
illustrated artifacts. The contributor indicated that such techniques may increase encoding
complexity and that the handling of block edges in this technique helps avoid blocking artifacts that
would be produced in other techniques.
Some interaction with other proposed techniques such as MDDT was also suggested to potentially
exist.
The question was asked how often the mode gets selected – this suggested to typically be 10-15% in
all intra coding.
A TENTM co-submitter expressed the opinion that the technique substantially helps subjectively.
14.9.4.1.1.1.1.1.2
JCTVC-C244 TE12.3: Report on planar intra prediction [D. Flynn (BBC)]
(missing prior)
This contribution reviewed before it was available for download, due to network issues.
This report presented the configuration and results from testing the intra planar prediction mode.
When this mode is enabled, there is, for most configurations, no significant change in BD-Rate,
with low-complexity configurations, there is a loss of between 0.9% (Intra) and 0.2% (Random
Access) BD-Rate. With all configurations, there is a reduction in encoder runtime associated with
enabling the intra planar prediction mode.
These results are identical to those produced by Nokia in JCTVC-C188 within tolerance of
±1kbit/sec, ±0.005dB PSNR.
The root cause of the loss in compression performance had not been analyzed. It was reportedly
suspected that it is brought about by the combination of the following:

The planar mode, as implemented, effects an implicit skip=true flag, and it was suspected
that the reductions in runtime that the planar mode brings is through the use of the skip flag.

It was suspected that the error for planar blocks is not distributed uniformly across the block,
rather, the error is greatest at the lower and right edges. Such an error can potentially affect
subsequent prediction units since no weighting is applied to the distortion calculation.
It was suggested that further study and potential improved harmonization with related coding
elements may show some promise for the technique. Further refinement of other elements may also
show benefits (e.g., regarding residual skip handling).
Subjectively, the technique was suggested to have a nice matching of adjacent regions that are
predicted the same way. But the benefits seem region-dependent (e.g., better for sky than for faces)
and dependent on non-normative encoding techniques (e.g., well optimized AVC intra coding looks
substantially better than naïve decision encoding).
As currently used, it does actually reduce encoding and decoding runtimes (by roughly 5-8%) –
although it was remarked that we should not read too much into small differences in speed in the
software at this stage.
A decision on what is to be done in other related areas may affect what we would decide to do with
this technology, at least at this stage of study and integration. Somewhat mixed feelings were
expressed, but the strength of evidence on this feature at this time did not seem to justify immediate
inclusion in the first TM. Further study was encouraged.
14.10 TE12 TMuC transforms and coefficient coding
14.10.1Transform skip flag
14.10.1.1.1.1.1.1.1 JCTVC-C047 TE12.4: Transform skip flag test (off vs. on) by Fraunhofer HHI
[B. Bross (Fraunhofer HHI)]
185
This document reported the results of testing the transform skip flag for the transform unit quadtree
(RQT_ROOT) within the scope of tool experiment 12 evaluating TMuC Tools. For the test,
RQT_ROOT was enabled and tested against the RQT_ROOT disabled default. In a high efficiency
scenario, similar performance was reported when enabling RQT_ROOT – with bit rate savings were
reported to be around 0.1%. In a low complexity scenario (using LCEC entropy coder), enabling
RQT_ROOT reportedly performs better than the default in terms of coding efficiency – with
average bit rate savings around 3.6%. Encoder runtimes ranging from 93% to 106% and decoder
runtimes from 91% to 103% of the default runtimes had been measured.
It was remarked that this experiment used version 0.7.2, which was modified for "LCEC Phase 2"
in 0.7.4. The question was asked about the potential interaction of this proposal with that one.
The transform skip flag handling was asserted to be essentially the same in this proposal as in the
Dresden HHI proposal.
The benefit was found primarily when using LCEC entropy coding – will we be keeping that?
It seemed that if we keep LCEC entropy coding, then we should also use this scheme (pending any
conflicting later decisions at the meeting).
14.10.1.1.1.1.1.1.2 JCTVC-C258 TE12: Evaluation of transform skip flag (RQT_ROOT) [W.-J.
Chien, P. Chen, M. Karczewicz (Qualcomm)]
This contribution was a cross-verification report with similar results as reported in JCTVC-C047.
The algorithm and code was studied in detail – which resulted in a related contribution discussed
below.
All test cases were performed.
14.10.1.1.1.1.1.1.3 JCTVC-C267 Test results of transform skip flag and phase 2 VLC integration
[W.-J. Chien, P. Chen, X. Wang, M. Karczewicz (Qualcomm)]
This contribution was not directly part of the planned TE12, but is sufficiently closely related to
merit being considered together with it.
This contribution presented test results for Transform Skip flag (HHI_RQT_ROOT) under "Phase 2
LCEC VLC integration".
According to the simulation results, further investigations and testing should be conducted to
indentify the interaction of the transform skip flag and the phase 2 VLC.
If we don't use the residual quadtree scheme, the proposed Phase 2 scheme was reported to provide
a similar capability.
If we do use the residual quadtree scheme and the LCEC entropy coding, the proposed skip flag
provides a benefit.
And the two schemes have roughly similar actual signaling.
LCEC phase 2 was something planned for integration in Geneva, but was integrated a bit late, and
was asserted to have been intended as original Dresden output (based on JCTVC-A119). Similarly,
the RQT_ROOT scheme with the skip flag was asserted to have been intended as original Dresden
output as well (based on JCTVC-A116).
If we use the residual quadtree approach, it was suggested to use the transform skip flag with the
RQT_ROOT scheme. This was agreed.
If we have LCEC entropy coding and don't use the residual quadtree approach, it was suggested to
use the "Phase 2" scheme (not yet reviewed). – further discussion of this avenue did not appear
necessary.
14.10.1.1.1.1.1.1.4 JCTVC-C185 Recent improvements of the low complexity entropy coder
(LCEC) in TMuC [A. Fuldseth (Cisco), A. Hallapuro, K. Ugur, J. Lainema (Nokia)]
This contribution was not directly part of the planned TE12, but is sufficiently closely related to
merit being considered together with it.
This contribution reports on recent improvements of the low complexity entropy coder (LCEC) in
the Test Model under Consideration (TMuC) context.
186
The low complexity entropy coder (LCEC), based on the JCTVC-A119 HEVC CfP proposal has
been adopted into TMuC as described in JCTVC-B205. This has reportedly proved to be a
challenging task, especially since the coding structures of the two algorithms are quite different. In
particular, the side information parameters of JCTVC-A119 could not be directly mapped to the
side information parameters of TMuC and the strategy of JCTVC-A119 had to be modified as
appropriate.
LCEC has been integrated into the TMuC software in four phases:

Phase 0:

Phase 1:
TMuC v0.7, used as anchor in TE12 low complexity configurations,
(including the PIPE vs. LCEC experiment).

Phase 2:
TMuC v0.1, preliminary version provided by Samsung/BBC.
TMuC v0.7.4, disabled by default, but evaluated as part of TE12.
 Phase 3:
Recent improvements not integrated in the official TMuC software.
In this document, BD-Rate results for LCEC phase 2 and phase 3 relative to phase 1 were provided.
Also, results with and without residual quad tree (RQT) and the transform skip flag
(HHI_RQT_ROOT) were reported. The reason is that significant side information parameters such
as coded block flag (cbf) are treated differently dependent on the values of these RQT parameters.
Finally, a comparison between PIPE and LCEC was given.
The Class E sequences reportedly benefit substantially from the proposed scheme.
The gap between LCEC (as proposed) and PIPE was reported to be approximately 10%.
No cross verification of "phase 3" had been done at this point – see TE12 entropy coding testing.
14.10.2MDDT and ROT
14.10.2.1.1.1.1.1.1 JCTVC-C136 TE12.4: Mode dependent directional transform (MDDT) test
(off vs. on) by HKUST [X. Zhang (HKUST)]
This contribution reported coding performance tests of MDDT according to Tool Experiment 12
JCTVC-B312. Some difficulties were reported in completing the testing process. Incomplete test
results were presented for this reason.
0.7.0 software was used.
When MDDT is disabled in the software, ROT is enabled for all block sizes.
When MDDT is enabled in the software, ROT is enabled on some block sizes but not others.
(ROT was integrated first – when MDDT was later integrated, since it applied only to smaller block
sizes, the MDDT enabling switch was designed to revert to the prior ROT behavior when MDDT is
off.)
The results seem somewhat difficult to interpret because of this behavior. This is not really a simple
test of "off" versus "on" for some tool, since the ROT behavior changes when the MDDT behavior
changes.
In the HE test cases, "MDDT on with ROT on for large block sizes" versus "MDDT off with ROT
on for all block sizes" showed no significant performance difference.
It was remarked that for LC configuration, some bug fix to the entropy coding behavior was made
after version 0.7.0 of the software (bug fix 79) – this may call into question the LC results in this
contribution.
14.10.2.1.1.1.1.1.2 JCTVC-C181 TE12.4: Cross-check of rotational transform (ROT) [K. Misra,
A. Segall (Sharp)] (missing prior, available first day)
This contribution cross-checked the performance of the rotational transform in the context of TE12.
The experimental results reportedly showed that, with MDDT enabled, using ROT for the large
block sizes introduces an average loss (Y BD-rate for each condition) ranging from 0.0% to 0.6%.
This closely matched the rate-distortion results provided by Qualcomm.
187
The software was studied, and was consistent with the description. The 4x4 ROT variant was not
tested.
Encoding and decoding time measures were too unstable to provide confident measures.
14.10.2.1.1.1.1.1.3 JCTVC-C202 TE12.4: Experimental results of MDDT and ROT by Samsung
and Qualcomm [E. Alshina, A. Alshin, W.-J. Han (Samsung), R. Joshi, M. Coban
(Qualcomm)]
The presentation for this contribution was requested to be uploaded
This document described results of three tool evaluation tests on alternative transform field. These
experiments were carried out and cross-verified by Samsung and Qualcomm in a framework of
TE12 activity.
A modified combination of MDDT and ROT was proposed in the contribution.
The current combination of MDDT and ROT in the TMuC when enabled together, reportedly
provides about 2% benefit for HE intra only (and 1% benefit for RA), with about 45% increase in
encoding time. The contributor indicated that this degree of coding time increase is not really
necessary – and contribution JCTVC-C250 discusses that topic.
The new combination proposed in the contribution reportedly provides about 3% benefit for HE
intra only (and 1% benefit for RA).
It was remarked that TE7 also has some other transform modification.
It was noted that there is also a lower-complexity ROT proposal in JCTVC-C096.
A contributor asked if it was possible to estimate the gain from the transform aspects of this and the
adaptive scanning that is part of it. There are really two features being tested together here, in terms
of scan and transform changes. The opinion was offered that most of the gain is from the scan rather
than the transform aspects. Contributions JCTVC-C106, JCTVC-C114, and JCTVC-C263 also
discuss adaptive scanning.
Considering the complexity of these features, their inclusion in the TM at this time did not appear
justified for the benefit shown.
14.10.2.1.1.1.1.1.4 JCTVC-C268 TE12: Report on evaluation of MDDT and ROT [R. Joshi, M.
Coban, M. Karczewicz (Qualcomm)]
Roughly consistent results were presented in this contribution as in the others JCTVC-C136,
JCTVC-C181, JCTVC-C202.
14.10.2.1.1.1.1.1.5 JCTVC-C298 TE12.4 Cross-check of unified MDDT/ROT [T. Davies , D.
Flynn (BBC)] (late registration, missing prior)
This contribution document contained cross-check results of JCTVC-C202, specifically on
providing a unified framework for MDDT and ROT mode selection. The cross-check had only
partially been completed, although the results collected so far matched those proposed in JCTVCC202.
14.10.2.1.1.1.1.1.6 JCTVC-C036 Alternative performance measurement of MDDT and ROT in
TMuC [R. Cohen, A. Vetro, H. Sun (Mitsubishi)]
This contribution was not part of the TE, but is included here since it contains closely related
information.
This document evaluated the performance of ROT and MDDT both individually and combined in
TMuC, using a reference in which both ROT and MDDT were disabled. The results reportedly
show that for all-Intra high efficiency conditions in TMuC 0.7, these tools produce average BDRate improvements from 1.6% to 2.1%, where ROT slightly outperforms both MDDT and their
combination. For random access high efficiency, the gains were reportedly between 0.7% and 0.9%,
with a similar relationship among individual and combined tests. Encoder run times approximately
doubled for ROT, whereas decoder run times doubled or tripled with MDDT, depending upon the
configuration. Possible future improvements were addressed as well.
188
The need for subjective evaluation was noted.
14.10.3Transform coefficient coding
14.10.3.1.1.1.1.1.1 JCTVC-C049 TE12.4: Transform coding HHI tested against Samsung
proposal by Fraunhofer HHI [B. Bross (Fraunhofer HHI)]
This document reported results of testing the transform coefficient coding proposed by Fraunhofer
HHI against the transform coefficient coding proposed by Samsung within the scope of tool
experiment 12 evaluating TMuC Tools. For the test, the Samsung coding was tested against the
HHI coding default. It was reported that the Samsung coding results in 1.3% to 2.0% luma bit rate
increase compared the HHI coding. The Samsung coding runtime of the encoder is between 92%
and 110% of the HHI coding runtime and the Samsung coding runtime of the decoder is between
98% and 120% of the HHI coding runtime.
The HHI scheme has adaptive scan order, context modeling for the significance map, and context
modeling for the absolute coefficient levels.
The Samsung scheme uses fixed scan order and different context modeling.
Contributions JCTVC-C106, JCTVC-C114, and JCTVC-C263 also discussed adaptive scanning.
14.10.3.1.1.1.1.1.2 JCTVC-C059 TE12: Evaluation of transform coefficient coding
(HHI_TRANSFORM_CODING) with tool breakdown [V. Sze, M. Budagavi, M.
Zhou (TI)]
The coding efficiency gain of HHI_TRANSFORM_CODING was evaluated with TMuC-0.7.3 to
be between 1.3 to 2.0%. A breakdown of this gain across the three main tools was also studied and
reported as follows: adaptive scanning order of significance map (0.2 to 0.4%), proposed context
modeling of significance map (0.9 to 1.4%) and a proposed context modeling of coefficient level
(0.1 to 0.2%).
The coding efficiency gain of HHI_TRANSFORM_CODING was also evaluated with TMuC-0.7
by the contributor and was reported to have been verified to be a match with results provided by
Samsung.
However, the three tools increase the complexity of context modeling – making it more difficult to
parallelize for throughput.
Decision: The conclusion was to put the HHI scheme in TM, and further study the complexity
issues.
14.10.3.1.1.1.1.1.3 JCTVC-C203 TE12.4: Experimental results of transform coefficient coding [J.
Chen, V. Seregin, W.-J. Han (Samsung)]
This contribution reported similar results as in JCTVC-C049.
14.11 TE12 TMuC entropy coding
14.11.1PIPE vs. CABAC vs. LCEC
14.11.1.1.1.1.1.1.1 JCTVC-C048 TE12.6: PIPE and LCEC tested against CABAC by Fraunhofer
HHI [B. Bross (Fraunhofer HHI)]
This document reported results of testing the PIPE and LCEC entropy coders against CABAC
within the scope of tool experiment 12 evaluating TMuC Tools. In a high efficiency scenario,
CABAC was tested against the PIPE default. Bit rate savings up to 0.3% are shown when using
CABAC. In a low complexity scenario, CABAC was tested against the LCEC default. CABAC
performed better than LCEC in terms of coding efficiency, with average bit rate savings up to
15.5% while encoder runtimes up to 170% and decoder runtimes up to 113% of the LCEC runtimes
had been measured.
189

Intra CABAC vs. PIPE 0.2%, PIPE vs. LCEC 12.5%.

Random access CABAC vs. PIPE 0.3%, PIPE vs. LCEC 15.2%.
 Low delay CABAC vs. PIPE 0.3%, PIPE vs. LCEC 15.5%.
It was remarked that the RDO encoder decision-making uses CABAC. It seems like this could make
a difference.
14.11.1.1.1.1.1.1.2 JCTVC-C051 TE12.6: PIPE tested against LCEC by Fraunhofer HHI [B.
Bross (Fraunhofer HHI)]
This contribution presented test results that were essentially consistent with those in JCTVC-C048.
14.11.1.1.1.1.1.1.3 JCTVC-C058 TE12: Evaluation of entropy coders: PIPE tested against
CABAC [V. Sze, M. Budagavi (TI)]
This document reported performance evaluation results of PIPE (Probability Interval Partitioning)
in the Test Model under Consideration (TMuC) as part of TE12.
The coding efficiency loss of PIPE over CABAC was reported to be between 0.2 to 0.3% using
TMuC-0.7.
These results were reported to have been verified to be a match with results provided by HHI.
Several outstanding issues pertaining to the throughput and complexity regarding the PIPE were
discussed in the contribution. The contribution suggested that the difficulty of parallelizing context
modeling indicated that more investigation is required to determine how PIPE impacts the overall
throughput of the entropy coder. The reported estimated memory size requirement for PIPE is 5x
larger than CABAC. The cost of the additional buffers required by PIPE were suggested to also
merit additional analysis. More investigation was also requested on how the parallel bitstreams will
be handled (i.e. interleaved or separate); if they are interleaved, the complexity and latency
implications of the various approaches was requested to be analyzed. The contributor advocated that
further study is required on PIPE.
Regarding interleaved versus separate modes of PIPE operation, at the moment the TMuC encoder
makes a decision after buffering the bits for the picture, and the decoder follows what the encoder
selected.
It was remarked that reduced power requirements may be more important than reduced area
requirements.
It was remarked that TE8 studied some related implementation aspects.
14.11.1.1.1.1.1.1.4 JCTVC-C184 TE12.6: PIPE tested against LCEC by Cisco [A. Fuldseth
(Cisco)]
This document presented TE12 results on PIPE vs. LCEC comparisons. Depending on the
configuration (low delay/random access/intra and low complexity/high efficiency), average BDRate figures between 12.3% and 18.0% were reported.
It was noted that the quantity of this compression difference may be affected by other proposed
modifications.
It was remarked that some degree of encoding optimization (e.g., whether the encoder resets the
statistical measures at the beginning of each picture or in some R-D decision-making) may also
affect the coding efficiency difference.
14.11.2LCEC "phase 2" testing
14.11.2.1.1.1.1.1.1 JCTVC-C196 TE12.6: Results for LCEC_PHASE2 tests by Nokia [K. Ugur, A.
Hallapuro, J. Lainema (Nokia)]
This contribution presented Tool Experiment 12 results on LCEC_PHASE2. LCEC_PHASE2
represents the second integration phase of the low complexity entropy coding (LCEC).
A bit rate improvement was reported as follows: Intra 0.3%, random access 0.4%, low delay 1.6%.
190
The contributor recommended to adopt Phase 2 into the TMuC.
14.11.2.1.1.1.1.1.2 JCTVC-C160 TE12: Crosscheck on LCEC phase 2 [P. Chen, M. Karczewicz
(Qualcomm)]
This contribution presents the test results for LCEC tool (phase 2) listed in Tool Experiment 12:
Evaluation of TMuC Tools. Similar gains were reported as described in JCTVC-C196.
14.12 TE12 TMuC IBDI and transform precision extension
14.12.1.1.1.1.1.1.1 JCTVC-C057 TE12: Evaluation of IBDI and TPE (transform precision
extension) [M. Zhou (TI)]
This document reported evaluation results of the IBDI and TPE (Transform Precision Extension)
for TE12. Experimental results on TMuC-0.7 reportedly show that the total gain with both the IBDI
and TEP turned on is 0.9% to 5.0% on average; the average IBDI gain when TPE is off is 0.8% to
4.7%, and it is reduced to -0.1% to 3.3% when TPE is turned on; the average TPE gain when IBDI
is off is 0.8% to 1.4%, but it totally vanishes when IBDI is on; The results also reportedly show that
the additional left shift bits in the TPE (1 bit shift in T32x32, 2 bits shift in T16x16, and 3 bits shift
in T8x8, respectively) do not provide additional gain compared to the 4 bits left shift across the
transforms.
IBDI (Internal Bit-Depth Increase)

Input data is left shifted by 4 bits before passed to video codec

Internal data precision of each video coding processing stage increased by 4 bits

Reference frame has higher precison (8 bits -> 12 bits) which leads to higher memory
bandwidth and increased off-chip storage requirements
 IBDI is on in the reference HE configuration, off in LC configurations
TPE (Transform Precision Extension)

Expands the intermediate data precision after the first-stage of transform.

The final transform output data precison reamains unchanged.

Number of TPE extension bits LC configurations (IBDI off): 7 for T8x8, 6 for T16x16, 5 for
T32x32 and 4 for T64x46

Number of TPE extension bits HE configurations (IBDI on): for T8x8, 2 for T16x16, 1 for
T32x32 and - for T64x64
It was suggested to study the use of 8 bit reference pictures with 12 bit prediction signals and 12 bit
residual signals. It should be easy to modify the software to support it if it is not already
configurable in this fashion.
Results are shown in the table below (with rounding control enabled), indicating the range of
benefit averaged across all test sequences within a particular configuration, out of 6 configurations
total – HE & LC for all-intra, low delay, and random access. The italicized boldface entries are
suggested as the particularly important ones:
191
Anchor combination
Combination under Test
Average Gain (BD-rate)
IBDI off, TPE off
IBDI on, TPE (5,6,7) on
0.8% to 5.0%
IBDI off, TPE off
IBDI on, TPE off
0.8% to 4.7%
IBDI off, TPE (5,6,7) on
IBDI on, TPE (5,6,7) on
-0.1% to 3.3%
IBDI off, TPE off
IBDI off, TPE (5,6,7) on
0.8% to 1.4%
IBDI on, TPE off
IBDI on, TPE (5,6,7) on
No gain
IBDI off, TPE (4,4,4) on
IBDI off, TPE (5,6,7) on
No gain
Recommendations from the proponent:


TPE be turned on in HE and LC configurations by replacing
o const int g_iShift8x8
= 7;
o const int g_iShift16x16
= 6;
o const int g_iShift32x32
= 5;
o const int g_iShift64x64
= 4;
In TlibCommon\Source\CommonDef.h with
o const int g_iShift8x8
= 4;
o const int g_iShift16x16
= 4;
o const int g_iShift32x32
= 4;
o const int g_iShift64x64 = 4;
In the transform, it was remarked that increased bit depth between the horizontal and vertical
transform stages is somewhat burdensome for implementation.
It was also suggested to consider IBDI with 2 bit expansion rather than 4 bit expansion.
The rounding control feature for bidirectional averaging (adopted into TMuC in Geneva) was noted
to affect the degree of benefit for these techniques. It was remarked that when the rounding control
feature was integrated into the software, an additional encoder distortion measurement improvement
for biprediction was also included – which resulted in making the encoder substantially slower
while improving its R-D behavior). JCTVC-C253 was suggested to be related to this topic.
It was concluded that the TM should include 12 bit support (i.e., effectively IBDI) and 4-bit TPE.
For LC configuration; use TPE, for HE configuration, use IBDI (without additional TPE).
14.12.1.1.1.1.1.1.2 JCTVC-C088 TE12.7: Experimental results of IBDI [T. Chujoh, T. Yamakage
(Toshiba)]
In this contribution, results of IBDI (Internal Bit-Depth Increase) and TPE (Transform Precision
Extension) testing were reported. This is one of reports in Tool Experiment 12: Evaluation of
TMuC Tools. The experimental results reportedly show that the coding gain of TPE in the case of
intra coding averages about 1% and the coding gain of IBDI in the case of inter is a total of 3%
benefit. The encoding and decoding times were reportedly stable.
14.12.1.1.1.1.1.1.3 JCTVC-C252 TE12: Report on evaluation of internal bit depth increase and
transform precision extension [R. Joshi, M. Karczewicz (Qualcomm)]
Similar results were presented in this contribution as in JCTVC-C057 and JCTVC-C088.
192
14.13 TE12 TMuC encoder-only aspects
14.13.1.1.1.1.1.1.1 JCTVC-C044 TE12: Results for experiments on max CU size, RDOQ and
AIS [F. Bossen, T. K. Tan (NTT DoCoMo)]
This section discusses the RDOQ part of this contribution – other parts of the same document were
addressed in other categories.
Rate-distortion optimized quantization (RDOQ) is a well-known technique for improving the ratedistortion performance associated with the encoding of transform coefficients. In the following
experiments this optimization is disabled to quantify its contribution to coding efficiency and to
encoding time. Only low-complexity configurations were tested. The reduction of encoding time
observed when disabling RDOQ ranges from 8 to 17%. While this difference is significant, the drop
in BD-rate performance can be signigicant, up to 3.2% on average for the random-access
configuration.
It should be noted that the experiments were run on version 0.7 of the software and thus do not
include updates to the low-complexity entropy coder (LCEC_PHASE2 macro).
Considering the reported results, the contributor suggested to keep RDOQ enabled by default for
future work.
14.13.1.1.1.1.1.1.2 JCTVC-C238 TE12.8: Results on RDOQ in high efficiency settings [J. Zan, J.
Meng, M. T. Islam, D. He (RIM)]
This contribution reported results of cross-checking activities performed by RIM on RDOQ
(described in JCTVC-B312) for all three high-efficiency settings.
Test results had been compared to those generated by Ericsson. For "intra" and "low delay" modes,
RIM test results reportedly matched Ericsson’s; for the "random access" mode, there is minor (less
than 0.1%) differences on four sequences: 1) the PeopleOnStreet sequence on the V component; 2)
the Kimono sequence on the U component; 3) the ParkScene sequence on the U and V component;
and 4)the RaceHorsesC sequence on the V component. Overall, the difference in results between
the RIM and Ericsson tests was reported to be very minor and all the data on the summary tab of the
results spreadsheet matched each other.
The test results were reported to have shown that RDOQ is an effective technique to enhance the
coding performance (Rate and PSNR) of the video coding algorithm. Specifically, for intra only
setting, the gain by using RDOQ on Y-component was reported to be on average 4.7% (in BD-rate);
for random access setting, the gain on Y-component was reported to be on average 4.4%; and for
low-delay setting, the gain on Y-component was reported to be on average 2.5%. It was observed
that RDOQ offers the largest coding performance gain on Class B sequences.
Considering the reported results, the contributor suggested to keep RDOQ enabled by default for
future work.
14.13.1.1.1.1.1.1.3
JCTVC-C159 TE12: Cross verification of RDOQ [K. Andersson (Ericsson)]
Both low complexity and high efficiency test cases were tested by Ericsson.
Similar results were reported by Ericsson as in the other RDOQ contributions.
Considering the reported results, the contributor suggested to keep RDOQ enabled by default for
future work.
15 Project planning and Test Model establishment
15.1
Project timeline
15.1.1.1.1.1.1.1.1
JCTVC-C022 WG11 USNB contribution: Suggested practices for the HEVC
project in the JCT-VC [A. G. Tescher (for USNB to WG11)]
193
The USNB to WG11 requested to establish the following practices for the HEVC project in the
JCT-VC:

The group-adopted "Test Model" (which is expected to be produced in version 1 form at the
October 2010 meeting) should contain only the minimum set of well-tested tools that
together form a coherent design that has been confirmed to show good capability.

This "minimum set" may include such variations as would be appropriate for forming
multiple profiles of the design (but should not otherwise contain significant duplication of
functionality).

In the event that insufficient analysis is available to establish a consensus on the appropriate
minimum set of well-tested tools by the October 2010 meeting, it was suggested that
obtaining this consensus should be a higher priority than the goal of adopting a version 1 of
the Test Model in October 2010.

An encoder text description, a decoder text specification, and software matching the encoder
and decoder algorithms should be provided for the tools in the Test Model within one
meeting cycle of the adoption of a tool into the Test Model. (It may be appropriate for the
encoder text description to be in a separate document from the decoder specification.)

Appropriate software coding guidelines should be established and followed to ensure high
quality software development of the Test Model.
These aspects were tentatively agreed on Sunday a.m.
15.1.1.1.1.1.1.1.2
JCTVC-C023 WG11 USNB contribution: HEVC standardization timeline [A.
G. Tescher (for USNB to WG11)]
Considering the timeline plan for the HEVC standardization approval process and the substantial
need for maturation of the design and text drafting efforts for this new standard, the USNB to
WG11 proposed that the timeline plan should be adjusted to:

CD ballot to be issued from the April/May 2012 meeting

DIS (formerly FCD) ballot to be issued from the October 2012 or January 2013 meeting

FDIS to be issued from the July 2013 meeting.
15.1.1.1.1.1.1.1.3
JCTVC-C299 UK National Body comments on HEVC Timescales [UKNB to
WG11] (late registration, missing prior, available first day)
The UKNB welcomed the progress that had been made so far in the development of the new High
Efficiency Video Coding (HEVC) standard. They said that the creation of the "Test Model under
Consideration" has been a useful first step in making the transition from a competitive to a
collaborative method of working, and that the momentum that has built up around this approach
should be maintained.
They indicated that there is a need to deliver this important new standard to the market in a timely
manner, and requested for MPEG to maintain the timescales that were defined in the Call for
Proposals, as follows:

Test model selection by 2010/10
 Final standard approval by 2012/07
The UKNB further requested MPEG to provide details of the planned intermediate stages of this
process, bearing in mind the revised procedural rules to be followed by ISO/IEC JTC 1 and the need
to align key steps with the ITU approval process.
Further discussion was then postponed to Wednesday (see below)
194
15.1.1.1.1.1.1.1.4
m18579 Comments on HEVC Test Model [CNNB to WG11, late registration,
missing prior, available later]
The Chinese NB to WG11 submitted this late document, which was not registered as a JCT-VC
document. Since it was brief, it is quoted below for the record:
"According to the tentative timeline described in Joint Call for Proposals on Video Compression
Technology (W11113), the Test Model of HEVC is expected to be created at the October 2010
meeting.
CNNB believes that a stable version of Test Model with good quality is essential to the further
development of the HEVC standard. Each tool in the Test Model shall be well-tested and be
confirmed with good performance. The group should give quality and stability as highest
priority in the development of the Test Model.
Considering the frequent updates and inadequate testing to the TMuC software during the
interim period, CNNB suggests the group adopts only such well-tested tools in the TMuC to
create the first version of TM and expects the group could get consensus at this meeting."
These remarks were taken into consideration in the subsequent discussion, which is reported below.
15.1.1.1.1.1.1.1.5
Schedule plan discussion (held Wednesday)
The current WG 11 plan / UK suggestion was summarized as follows:
 (WD 11/01) CD July 11 FCD/DIS Jan 12 FDIS July 12.
The WG 11 US NB suggestion was summarized as follows:
 (WD 11/01) CD May 12 FCD/DIS Oct 12 or Jan 13 FDIS July 13.
It was remarked that we could establish a WD now (as the TM is a complete solution), rather than
waiting until 2011/01. Later at the meeting, we did agree to issue a WD as an output of this meeting.
Participants remarked that a CD should be

Stable in text

Fully verified by software
 Meeting the goals in terms of coding efficiency (roughly)
Opinions expressed were recorded as follows:

It would take 4-5 meeting cycles to come from current TM to CD.

H.262 / MPEG-2 took 5 cycles; MPEG-4pt2 took more cycles than that.

AVC took 16 meeting cycles from initial H.26L reference to final standard (12 to CD).

"We should keep the meeting schedule to put sufficient pressure on ourselves", vs. "We
could release the pressure and always go faster if things come together more quickly than
anticipated."

The market is changing rapidly (and new applications are coming), and the timeline should
be kept but could be changed later if necessary.

Milestones were met so far (CfP/TM), why should we shift the upcoming milestones
compared to what was said in the CfP?

There is a danger that proprietary codecs may appear and take over if we wait for too long

Stable software is needed to serve as the basis of the work, and stability of text depends on
that (need for verification of the text by the software). And being able to run the software is
one thing, having an opportunity to actually understand it and properly specify it is another.
195

As code is not well documented yet, and a considerable amount of new tools are still needed
to achieve the goal of coding efficiency, it will be difficult to reach adequate maturity in the
3 meeting cycle periods between now and a July 2011 CD milestone.

Dual track idea – make a low complexity variation first? This might counteract the desire to
have most tools in common for LC and HE. We must also make sure that the solution keeps
the complexity reasonable – for both HE and LC. Even for an LC variant, there is an HE
intent.

The current specification (as from TMuC) is still far from being mature.

We still need better understanding about the interaction of the different parts of the TM.

The compromise then suggested was: CD Jan 12, DIS July 12, FDIS Jan 13.

The coding efficiency goal is important and should not be missed. The market will take it up
when HEVC achieves 50% compared to AVC.

The compromise suggested would be agreeable if Jan 13 is really staying the end point (i.e.
unless a substantial instability or failure of reaching the goals is detected in the standard
later).

This compromise was agreed and recorded in a resolution reported to the WG11 parent
body: "The JCT-VC thanks the NBs of UK and US on their valuable comments related to
standardization timelines of HEVC. The situation has been considered, particularly
including the feasibility of proceeding from the current test model and WD to a mature CD
text that fulfils the requirements of the project within the available number of meeting
cycles. It is believed that with a timeline of CD 12/02, DIS 12/07 and FDIS 13/01, the
probability of producing a high-quality standard would be maximized. With such a timeline
being established, only exceptional circumstances should require further delay. Considering
comments about the process for test model development that has been followed by the JCTVC, which were raised by these NB comments and by the NB of China, a document N11642
[a document produced for WG11 corresponding to the content of JCTVC-C401] has been
issued which provides an explanation of this process."
15.2
Test model design: normative aspects
15.2.1.1.1.1.1.1.1
JCTVC-C121 Suggested approach toward HEVC test model creation [S.
Sekiguchi, K. Sugimoto (Mitsubishi Electric), more co-authors added later in a
revision]
This joint contribution proposed a suggested approach toward definition of HEVC Test Model,
which was suggested to be referred to as the "HM" in the contribution, and addressed some
software related issues.
As the result of test model discussion in the last Geneva meeting, the group agreed to define an
HEVC Test Model with only the minimum set of well-tested tools that together form a coherent
design that is confirmed to show good capability. Given this agreement, this joint contribution
proposed a suggested approach toward definition of HM based on contributions available at the
Guangzhou meeting. Significant efforts on TE12 have been discussed at the Guangzhou meeting to
identify how much coding efficiency and complexity can be seen per each TMuC tool. And the
results of other established TEs should have been discussed for evaluation of other tools those are
currently not included into the TMuC. However, these tool basis performance reports will be
assessments of each individual tool relative to B300 condition, and it is still difficult to identify
what is the best minimum combination of available tools. Meanwhile, there are several
contributions each of those reports performance evaluation of candidate tool set (JCTVC-C066,
JCTVC-C122, JCTVC-C123). It can be found that each tool set evaluated in these separate
196
contributions has some sort of overlap on a common core set of coding tools, which means that
those core tools have intensively been validated as tool combination via cross-verification by
multiple parties.
This joint contribution recommended for the group to review these tool set basis performance
evaluations, to identify a base structure of the HM first, and to define the HM by adding other
individual tools to the base structure as those would be identified as efficient.
This contribution also addressed some software related issues.
The contribution suggested the following:

To review all TE reports considering software related issues

To review all tool set basis performance evaluation reports available at this meeting

To define a base structure of HM first that follows "one tool per one functionality"

To define the HM by adding only the currently available tools, which will be evaluated by
TEs and provides sufficient coding gain at agreeable complexity, to the base structure of
HM
15.2.1.1.1.1.1.1.2
JCTVC-C122 A suggested starting point for HEVC test model [K. Sugimoto,
S. Sekiguchi (Mitsubishi Electric), more co-authors added later in a revision]
This contribution proposed a set of TMuC coding tools for both high-efficiency and low-complexity
conditions, as a starting point of HEVC Test Model (HM) with experimental data jointly verified by
multiple companies. The proposed toolset reportedly followed a "one tool per one functionality"
principle and was asserted to achieve reasonable coding efficiency relative to the TMuC anchor
used for TE12 assessment, while reducing encoding and decoding time significantly. The
contributors recommended to construct a base structure of the HM based on the suggested toolset
and consider further additions of other tools via careful assessment of all TE results for creation of
HM.
It was noted that the anchor used in this contribution had LCEC phase 2 enabled, but was otherwise
encoded according to JCTVC-B300 with TMuC 0.7.4.
The HE variation proposed in the contribution is shown as follows, with topics identified as open
during the discussion being shown in italic font:
Tool name
Coding unit
Transform
unit
Large
transform
Angular Intra
Prediction
TMuC default
setting
(JCTVCB300)
Min size: 8x8
Max size:
64x64
Quadtree
Proposed
setting
Parameters for proposed
setting
Same as B300 Same as B300
Max size: 64
Samsung
based
Max size: 32
I64: 5 modes
I32: 34 modes
I16: 34 modes
I8: 34 modes
I4: 17 modes
I2: 3 modes
(non-norm?)
I64: 5 modes
I32: 17 modes
I16: 17 modes
I8: 17 modes
I4: 17 modes
I2: 3 modes
(non-norm?)
197
QuadTreeTUFlag : 0
QuadTreeTULog2MaxSize
:5
MaxTrSize : 32
#if UNIFIED_DIRECTIONAL_INTRA
const UChar
g_aucIntraModeNumAng[7]
=
{3, // 2x2
17, // 4x4
17, // 8x8
17, // 16x16
17, // 32x32
5, // 64x64
5 // 128x128};
Rotational
transform
(ROT)
Mode depend.
directional
transform
(MDDT)
Adaptive Intra
Smoothing
(AIS) on/off
Combined
Intra
Prediction
(CIP)
Planar
prediction
Edge-based
prediction
Deblocking
filter
Adaptive loop
filter (ALF)
Entropy coder
Enabled
Disabled
const UChar
g_aucIntraModeBitsAng[7]
=
{2, // 2x2
5, // 4x4
5, // 8x8
5, // 16x16
5, // 32x32
3, // 64x64
3 // 128x128};
ROT : 0
Enabled
Disabled
#define QC_MDDT 0
Enabled
Disabled
AIS : 0
Enabled
Disabled
CIP : 0
Enabled
Disabled
#define PLANAR_INTRA 0
Enabled
Disabled
EdgePredictionEnable : 0
Enabled
Same as B300 Same as B300
Diamond
Same as B300 Same as B300
PIPE
Transform
coefficient
coding
Internal BitDepth
Increase
(IBDI)
Rate
Distortion
Optimized
Quantization
(RDOQ)
CABACbased RDO
Asymmetric
Motion
HHI
Same as B300 Same as B300
or CABAC
Same as B300 Same as B300
4 bits
Same as B300 Same as B300
On
(non-norm)
Same as B300 Same as B300
(non-norm)
On
(non-norm)
Enabled
Same as B300 Same as B300
(non-norm)
Disabled
AMP: 0
198
Prediction
Unit (AMP)
Block
Merging
(MRG)
Motion vector
prediction
(AMVP/IMVP)
Interpolation
filter
Adaptive
motion vector
resolution
(AMVRES)
Transform
Skip Flag
PU-Based
Merging
Partitionbased
intensity
compensation
Transform
precision
extension
Enabled
Disabled
MRG: 0
AMVP method
Same as B300 Same as B300
SIFO
DCT-IF 12 tap
Enabled
Disabled
Disabled
Same as B300 Same as B300
Disabled
Same as B300 Same as B300
Disabled
Same as B300 Same as B300
Enabled
Disabled
InterpFilterType: 0
DIFTap: 12
AMVRES : 0
undefine
TRANS_PRECISION_EXT
The LC variation proposed in the contribution is shown as follows (showing only the rows that
differ with the HE variation shown above), with topics identified as open during the discussion
being shown in italic font:
Tool name
Adaptive loop
filter (ALF)
Entropy coder
Internal BitDepth
Increase
(IBDI)
CABACbased RDO
Interpolation
filter
Adaptive
motion vector
resolution
(AMVRES)
LCEC Phase 2
Transform
precision
TMuC default
setting
(JCTVCB300)
Disabled
Proposed
setting
LCEC
Off
Same as B300 Same as B300
Same as B300 Same as B300
Off
(non-norm)
DIF
Disabled
Same as B300 Same as B300
(non-norm)
DCT-IF 6 tap
InterpFilterType: 0
DIFTap: 6
Same as B300 Same as B300
Disabled
Enabled
Enabled
Parameters for proposed
setting
Same as B300 Same as B300
#define LCEC_PHASE2
(1)
Same as B300 Same as B300
199
extension
Shift bits for
transform
precision
extension
8x8: 7 bit
16x16: 6 bit
32x32: 5 bit
64x64: 4 bit
8x8: 4 bit
16x16: 4 bit
32x32: 4 bit
64x64: 4 bit
const int g_iShift8x8 = 4;
const int g_iShift16x16 = 4;
const int g_iShift32x32 = 4;
const int g_iShift64x64 = 4;
Some particular other subjects of discussion not shown in the table above were recorded as follows:

CU-based merging / MRG – harmonization with skip/direct

PU-based merging – not included

Bidirectional rounding control
15.2.1.1.1.1.1.1.3
JCTVC-C066 Cross verification of Mitsubishi 0.7.4 configuration [M. Zhou
(TI)]
This document reported cross-check results of the simplified TMuC-0.7.4 configurations proposed
in JCTVC-C122. Independent simulations were reported to have verified that the results that
Mitsubishi provided for JCTVC-C122 were accurate.
15.2.1.1.1.1.1.1.4
JCTVC-C065 Testing results on the simplified TMuC configurations [M. Zhou
(TI)]
This document reported evaluation results of simplified TMuC0.7.4 configurations, the tools
included in the simplified configurations are identical to the ones proposed in JCTVC-C122, with
the exception that the CABAC instead of PIPE is used in the HE configurations and 12-tap instead
of 6-tap DCT-IF is used for the LC configurations. Also tested was the performance of the DCT-IF
with different filter lengths (6, 8, 10, and 12) under the simplified configurations. The contributor
recommended to use CABAC in the HE reference configurations due to an expressed uncertainty of
the PIPE development for throughput improvement purposes, and considering 8-tap DCT-IF for
both the HE and LC reference configurations. In terms of compression performance, using CABAC
rather than PIPE was asserted to provide some slight (0.1-0.3%) improvement, and additional
improvement (3% in random access and 4% in low delay) was reported for LC configurations by
using longer DCT-IF (8 or 12 tap rather than 6 tap).
15.2.1.1.1.1.1.1.5
JCTVC-C269 A study on the impact of intra smoothing [A. Tabatabai, C.
Auyeung, T. Suzuki (Sony)]
This contribution discussed the effect of disabling intra smoothing for the suggested HEVC Test
Model in JCTVC-C122. The "always off" setting was reported to be better than always smoothing
by about 1.4% for Intra HE, and 1.9% for Intra LC, with respect to the results in JCTVC-C122 with
PIPE. Cross verification of the results was reportedly done with NEC.
This was tested in the context of the configuration suggested in JCTVC-C122.
It was noted that a late input contribution JCTVC-C302 was a new proposal with some relationship
to this.
15.2.1.1.1.1.1.1.6
JCTVC-C123 On selecting coding tools for HEVC test model [K. Sugimoto,
A. Minezawa, S. Sekiguchi (Mitsubishi Electric)]
As the result of test model discussion in the last (Geneva) meeting, the group agreed to define an
HEVC Test Model with a minimum set of coding tools consisting of those that can jointly provide
sufficient coding gain. This contribution discussed the selection of tools for the HM to create a
reliable starting point for the the future standard.
Some remarks and observations made during the discussion of the contribution were as follows:

LCEC Phase 1 was suggested to have some problems. (This may have been related to the
issue fixed in TMuC 0.7.4.)
200

Interpolation filter length was suggested to be studied.

Some coding tools seemed to have different effects on different video resolutions.

It was suggested to pay attention to complexity issues.
15.2.1.1.1.1.1.1.7

Further discussion on TM definition
RQT :
o HE: 3-level QT and fast intra encode per JCTVC-C311
Encoder settings:
HHI_RQT_INTRA_SPEEDUP_MOD = 0 with
HHI_RQT_INTRA_SPEEDUP = 1
(slower search should remain configurable in the software)
o LC: 2-level QT, aligned with LCEC Phase 2 (with QCOM CBP VLC), with same
fast intra.
o Same maximum QT depth for luma and chroma.
o Clear description of this was to be provided as a BoG report document JCTVC-C319
by the 1st half of morning of Thursday, with its preparation coordinated by Thomas
Wiegand. This was done and the result was reviewed and adopted as recorded below.

Entropy coder (CA)PIPE or CABAC
o Entropy coding is a core part and should not be exchanged arbitrarily; however, both
of these technologies are using same context modeling and are almost identical in
performance, and the choice does not affect other tools (PIPE 0.1-0.3% BR
increase).
o Complexity advantage of PIPE is not clear.
o It was agreed that the TM would use CABAC rather than PIPE, further investigation
(AHG, CE) was planned on PIPE.

Max. 17 vs. 34 intra directions – it was agreed to keep it for the TM of this meeting as it is
in current configuration of JCTVC-B300 (maximum set of 34 directions), and to set up an
experiment for further investigation.

AIS – settled: this will be turned off in the TM / WD

Planar Prediction:
o There is a slight loss in BD BR, but some experts think that this could improve
subjective quality.
o The current TMuC does not have encoder decisions based on subjective quality.
o Conclusion: Not in TM, further study should be performed.

Interpolation filters
Suggestions made:
a) For LC SIFO 6-tap, for HE SIFO 8/12-tap
b) For LC DCT-IF 6-tap, for HE DCT-IF 12-tap
c) DCT-IF 8-tap for both cases
d) For LC DIF, for HE DCT-IF 8-tap
201
e) For LC SIFO 6-tap, for HE DCT-IF 12-tap
For HE: DCT-IF 12-tap was reported to give the best quality – and its use was agreed.
For LC: DIF
Chroma interpolation will be further investigated in a CE
At the close of the meeting, the decision on chroma interpolation remained somewhat
unclear, and an email discussion was conducted to clarify the situation. Within
approximately two weeks of the meeting, this discussion closed with an agreement to use
bilinear 1/8th sample interpolation for the TM / WD chroma interpolation.
More detailed remarks and notes are provided below to show some background for the
above decisions:
o The number of filter tap operations of 6-tap DIF vs. 6-tap separable is roughly 60% both
for average and worst case.
o The encoder complexity of SIFO appears higher (reported as about 30-70% computation
time increase at encoder). In current optimization, all filter types are checked during
encoding to determine the best filter for the next frame.
o SIFO also requires a larger set of filters.
o As a first conclusion, SIFO was removed from the list of candidates.
o According to encoder/decoder computation times, DIF and DCT/IF 6tap appear quite
close.
o In the number of tap operations, DIF needs less (see above).
o DIF also needs lower bit depth.
o DCT-IF provides 1.8% BR reduction.
o If IF takes 30% of decoder time, and DIF would reduce from 100 to 60, this would come
to 0.6x30 approx. 15% of entire decoder time. Realistic may be 8%.
o Agreement: use DIF – CE/AHG to further investigate.

Bidirectional rounding control should be included.
15.2.1.1.1.1.1.1.8
JCTVC-C319 BoG report: residual quadtree structure [T. Wiegand, H.
Schwarz, B. Bross (Fraunhofer HHI), A. Fuldseth (Cisco), X. Wang (Qualcomm),
W.-J. Han (Samsung)]
This document contained the report of the break-out group on residual quadtree structure.
Some concern was expressed that the document did not contain sufficient informative description,
and backgroun information was explained verbally during the discussion and was then added in a
revision.
Decision: Adopted.
15.3
Test model design: non-normative aspects and test methodology
15.3.1.1.1.1.1.1.1
JCTVC-C253 Encoder speedup for bidirectional averaging with rounding
control [R. Panchal, M. Karczewicz, P. Chen (Qualcomm)]
In the 2nd JCT-VC meeting, bidirectional averaging with rounding control (BiRound) was proposed
and adopted in the TMuC. In this contribution further encoder optimizations were proposed leading
to a reported 50% complexity reduction of the BiRound with no changes in the encoder bitstream.
In TMuC 0.8, a new function with 'BiRound' parameter is always used for bidirectional motion
estimation. To achieve further speedup, the new function is used only when 'BiRound' parameter is
1. Otherwise, the original function is used.
202
Results were provided with LD HE configuration. The old version as implemented in TMuC 0.7
showed an increase in encoding time by 8% and 1.1% BR reduction, a speedup which produces bitexact result is reported with 4% increase, and a sub-optimum speedup gives 0.9% reduction with no
change.
Conclusion:

Decision: Adopted.

Should be on when IBDI is off and vice versa.

The fast bit-exact version shall be used in the encoder.
15.3.1.1.1.1.1.1.2
JCTVC-C287 Cross-check of encoder speedup for bidirectional averaging
with rounding control [B. Bross (Fraunhofer HHI)] (missing prior, available first day)
This contribution confirmed the bit rate results reported in JCTVC-C253, with the reported encoder
speedup being slightly better.
15.3.1.1.1.1.1.1.3
JCTVC-C315 On the computation of PSNR and BD-Rate [D. Hoang
(Zenverge)] (late reg.)
This late contribution was not reviewed in group discussion due to lack of time.
The contribution asserted that the computations of PSNR and BD-Rate by the current TMuC
software and spreadsheet macros were flawed, and recommendations were made on how to improve
these computations.
16 TMuC settings and common test conditions
16.1.1.1.1.1.1.1.1
JCTVC-C084 Coding efficiency report of modification by TMuC draft005 [T.
Yamakage, T. Chujoh, T. Watanabe (Toshiba)]
This contribution showed experimental results of the modification by TMuC draft005 that signals
the ALF filtering control map in the slice header instead of signaling it in the CU header.
Experimental results reportedly showed that the coding efficiency loss is 0.04% on average while
the interleaving process of the ALF filtering control map at encoder side is removed.
This was one of the features of "TMuC low priority" disabled by default (only implemented after
TMuC 0.7)
Several experts pointed out that this proposal simplifies the encoder (no buffering of encoded data).
Decision: There was no objection by the group, and it was agreed to Adopt this proposal.
One expert points out that there might be other ways to achieve this goal.
16.1.1.1.1.1.1.1.2
JCTVC-C243 Cross check result of JCTVC-C084 [I. S. Chong, M.
Karczewicz (Qualcomm)] (missing prior, uploaded substantially after the meeting)
This contribution was verbally presented prior to upload. It reportedly confirmed the results
reported in JCTVC-C084.
16.1.1.1.1.1.1.1.3
JCTVC-C155 TMuC 0.7 results with sequence dependent QPs [K.
Andersson, R. Sjöberg (Ericsson)]
This contribution investigated performance when targeting CfP bit rates using sequence specific
QPs rather than those specified in the common conditions (QP=22, 27, 32 and 37) for low delay and
random access configurations. The sequence specific QPs was asserted to give very similar coding
efficiency as for the common conditions and to give bit rates generally closer to application bit rates
and thus to be better suited for subjective assessment. It was noted that the worst case encoding time
was reduced by about 20% when using the proposed alternative values.
The following remarks were recorded in discussion of this contribution:
203

Typically, the example QP settings suggested have a much lower range of operational points
– would this really be desirable?

Some cases go into the low-quality range particularly.

No action was taken on this, as the current settings seem appropriate for investigation of the
merit of single coding tools.
16.1.1.1.1.1.1.1.4
JCTVC-C168 Comments on common test conditions [K. Kondo, K. Sato, T.
Suzuki (Sony)]
This contribution proposed to change the test conditions as follows.

To reduce the number of frames
 To increase the number of test sequences
The common conditions test configurations were summarized in JCTVC-B300. Currently test
sequences used by CfP are used and all frames must be encoded. Coding performance results with a
smaller number of frames, e.g. 120, were reported to be not so different from the results of using all
frames. Instead of reducing the number of total coded frames, it was proposed to test a larger
variety of test sequences, e.g. VCEG test sequences, 4K sequence, 8K sequences, etc. To test a
variety of test sequences was asserted to be more helpful to achieve an understanding of the the
coding performance rather than encoding a larger number of frames in a smaller number sequences.
Comments recorded during the discussion were as follows:

Subjective tests should also be performed, such that long sequences are needed.

Extending the test set to provide more variety appears useful, but needs careful study.

Having long sequences is also an advantage, as it enforces to not use overly complex
methods –with parallel processing, the length of the sequences is the main bottleneck
anyway.

For extreme resolutions, a shorter length may be considered.

Work on fast motion estimation should be started.

It was suggested to conduct further study in an AHG (chaired by T. Suzuki)
17 Application-specific topics
17.1.1.1.1.1.1.1.1
JCTVC-C021 Evaluation result of JCTVC-B031 [K. Kazui, J. Koyama, A.
Nakagawa (Fujitsu)]
A scheme which reportedly minimizes the loss of coding efficiency in very low delay coding was
proposed at the 2nd JCT-VC meeting. This contribution shows evaluation result of the proposed
scheme using TMuC software version 0.7.3.
Coding gains from the conventional multiple slices scheme in BD-Bit rate were reported as 9.0%
and 15.9% when the sizes of LCTB are 64x64 and 32x32, respectively.
In addition, a new buffer model for a coding delay of less than one frame period was proposed in
this contribution.
Remarks recorded during the discussion included the following:

A very high number of slices was used per frame – is this realistic? Certainly other solutions
would be possible to achieve the same goal.

Constant bit rate was assumed for each slice – is that realistic?

It was suggested to contribute software to the work of AHG on slices.
204

Further study was encouraged.
17.1.1.1.1.1.1.1.2
JCTVC-C032 Characteristics of Super Hi-Vision test sequences [Y. Shishikui,
Y. Matsuo, A. Ichigaya, K. Iguchi, S. Sakaida (NHK)]
This contribution presented the characteristics of Super Hi-Vision (SHV) test sequences, which
reportedly differ from those of the other test sequences. The analysis of test sequences indicated that
SHV test sequences contain an extremely large amount of motion that affects spectrum
characteristics of the signal to a large extent. SHV test signals also contain a larger noise component
due to their high bandwidth, although this does not affect subjective picture quality as much as was
expected by the objective amount.
The Sequences were interpolated by Bayer filter to achieve a full 8Kx4K resolution, and spectral
characteristics are different from the 2K and 4K sequences (high flat-spectrum noise component
appears to be present below quarter of sampling frequency).
17.1.1.1.1.1.1.1.3
JCTVC-C055 Performance report of TMuC for Super Hi-Vision [K. Iguchi, A.
Ichigaya, Y. Shishikui(NHK), S. Sekiguchi, A. Minezawa (Mitsubishi Electric)]
This contribution reported the performance of TMuC 0.7.0 for Super Hi-Vision (SHV) sequences.
While the TMuC achieved substantial coding gain relative to the AVC CfP anchor, it took from 3.1
to 6.1 times encoding time than JM encoding with CfP anchor condition.
The following remarks and observations were recorded:

There was a suggestion that cropped sequences could be used that would give roughly same
results with much lower simulation time.

BD bit rate improvement is approx. 60% for Steam Locomotive, 40% for Nebuta

PSNR curves show larger gap (higher gain) towards higher bit rates: Does the TMuC loop
filter improve prediction under noisy conditions?

Bit rates ranged from 500 Mbps to 1.5 Gbps, PSNR 30-37 dB

10 bit to 8 bit conversion was done by rounding

True 8K sequences (not upsampled) would be desirable to judge without influence of the
upsampling filter.
17.1.1.1.1.1.1.1.4
JCTVC-C276 Screen content coding results using TMuC [C. Lan, J. Xu, F.
Wu (Microsoft), G. Shi (Xidian Univ.)]
New coding tools had been presented for coding screen contents using an AVC framework in
JCTVC-B084. This document presented new coding results using the TMuC, for intra coding and
for random-access cases.
The following remarks and observations were recorded during the discussion:

Recorded at 10 fps

BD rate saving around 20% for random access inter, 28% for all intra.

More information about the characteristics of these sequences was requested – e.g., How
often do frame changes occur, is there subpixel motion?

What would be typical test material? This may be the most difficult issue, as this has high
influence on potential selection of tools.

As this is a new application area, the importance should be reviewed by the parent bodies.

Video (particularly news) includes lot of this type of content nowadays.
205
18 Non-TE Technical Contributions
18.1
Loop filtering
JCTVC-C085 Comparison of loop & post filtering for in-loop and post-processing filtering
AHG [T. Yamakage, T. Chujoh, T. Watanabe (Toshiba)]
This contribution showed experimental results of comparison between in-loop filtering and post
filtering. This was a response to one of the mandates of the In-loop and post-processing filtering
AHG. Coding efficiency of several filtering schemes was compared for in-loop or post filtering and
discussed.
Remarks recorded included the following:

The current QC_ALF of the TMuC performs 4% better than a similar stand-alone post-filter

The method from AVC (SEI) AAPF performs 2.5% worse than QC_ALF (an amount which
slightly decreases when operated in the loop)

Results also reportedly show that a filter must be explicitly designed as a loop filter or post
filter
JCTVC-C113 Adaptive loop filter with low encoder complexity [I.S. Chong, W. Chien, M.
Karczewicz (Qualcomm)]
In this contribution, low complexity versions of the Adaptive Loop Filter (ALF) with at most two
encoding passes were proposed. Simulation results for one-pass and two-pass ALF are presented to
demonstrate that one-pass and two-pass methods significantly lower the complexity of the original
ALF with small performance degradation (less than 0.1% for two-pass and less than 0.7% for onepass).

The one-pass method derives the map by using a filter from the previous frame and then
optimizing coefficients for the current frame based on regions where the filter is switched
on.

A two-pass method with one more iteration was described

A post filter from the VCEG KTA software was also compared (losing 0.9% BR relative to
the anchor)

The method is very similar to the one proposed in JCTVC-C082
JCTVC-C195 An enhanced block-based adaptive loop filter in TMuC0.7.0 platform [P. Wu, S.
Paschalakis, N. Sprljan (Mitsubishi Electric)]
In this contribution an Enhanced Block-based Adaptive Loop Filter (EBALF) algorithm has been
implemented and integrated into TMuC0.7.0 platform as a complete functional block for in-loop
filtering processing. Without altering the current default adaptive loop filter technology, which is
contributed by Qualcomm as QC_ALF in the TMuC software, this EBALF functional block can be
placed before or after QC_ALF, resulting in an in-loop filtering composed of two stages. The
simulation results reportedly confirmed that with QC_ALF "on" performance as the reference point,
the joint filtering can potentially further improve the coding efficiency by up to 1.5% BD rate
reduction in some cases. For Class A tests, the average BD rate reduction were 0.7% and 0.6% in
both test cases. This could reportedly imply that performance of QC_ALF can be improved further,
particularly for high resolution sequences. The simulation results cover encoder_randomaccess and
encoder_lowdelay configuration settings with full length and all classes of the sequences as defined
in the Tool Experiment 10 subtest 2 category. The Excel spreadsheet results were provided. This
contribution was reportedly not intended to propose an alternative technology to replace QC_ALF
206
in TMuC but rather to be aiming to assist the current general effort on evaluating the existing
TMuC video coding tools, such as that in Tool Experiment 12 (TE12) and 10 (TE10). The results
reported independently in this document regarding the adaptive loop filtering technology can be
treated as additional information to all other related test results in TE12 and TE10.
It was remarked that a gain (0.6-0.7% BR reduction) was achieved only for class A and B partially,
and that on average there was practically no gain. Decoder runtime increases by 15-20%.
No action was taken in response.
JCTVC-C211 Inner block oriented ALF processing for memory & bandwidth reduction [Y.
Sohn, K.-H. Lee, B.-K. Lee, I.-K. Kim (Samsung)]
This proposal described an adaptive loop filter that was asserted to have low memory, bandwidth
and computational complexity requirements. ALF in TMuC requires spatial neighboring
reconstructed data for CU based ALF processing. This document proposes applying filtering only to
a restricted inner LCU area where there is no need to use data outside of the LCU for filtering. This
approach was reported to result in marginal performance degradation with memory, bandwidth and
computational complexity reduction. The experimental results reportedly showed 1.0%
performance degradation in low delay configuration and 0.8% performance degradation in random
access configuration, respectively. In expense of this performance degradation, it was asserted to
reduce about 24 Kbytes external DDR memory and 0.77 Kbytes internal SRAM, and to reduce
memory bandwidth about 5.98 MHz in the case of 4Kx2K@30p video sequence and 64 bit BUS. It
was asserted to also reduce computational complexity for ALF processing itself.
Concern was expressed that for two adjacent filtered blocks, boundary effects may occur.
This would need to be a normative decoder issue (making it switchable would not help to reduce
maximum decoder complexity). Most experts believed that not every application case of loop
filtering would benefit from that.
A suggestion by the chair was that if limitation of memory accesses is desired, it might be better to
limit the number of filtered blocks in a profile.
JCTVC-C219 In-loop reference frame denoising [P. Amon, A. Hutter, E. Wige, A. Kaup
(Siemens)]
This proposal presented an algorithm for in-loop denoising of the reference frame. The algorithm
modifies the temporal predictor while the decoded picture is unchanged. Knowledge of the noise
power within the reference frame is used in order to affect the inter frame prediction. For noise
filtering of the reference frame, a denoising algorithm is implemented inside the AVC reference
software JM 15.1. It was reported that the bit rate can be decreased for (high resolution) noisy
image sequences especially for higher qualities at medium to high data-rates.
The following remarks were recorded:

The suggestion was to take into account additive noise in the design of the filter coefficients.

The implementation was in the AVC context.

The algorithm currently does not work on intra.

It is not necessary to transmit filter parameters; as these are estimated at the decoder.

The reported improvement was mostly at high bit rates, and may be hardly visible.

The concept may eventually help for applications that require lossless or near-lossless
coding (e.g. medical, studio).

The algorithm does not remove quantization noise. It is more intended to separate content
that is consistent over time from noise that is uncorrelated between frames and therefore
207
disturbs the prediction. At low rates, it was suggested that this would be removed anyway by
the quantization.

18.2
Further study was suggested.
Block structures and partitioning
JCTVC-C025 CTB splitting on frame boundary for arbitrary resolution video [F. Yang, W.
Li, S. Wan, Y. Chang (Xi'dian Univ.)]
The conception of CTB has been proposed. But the modes of the CTB on the frame boundary may
be limited when the video resolution is not an integral multiple of the LCTB. A method of CTB
splitting on the frame boundary for arbitrary resolution video was proposed in this document, where
the non-square CTB and a new representation of the CTB mode reportedly significantly contribute
to the coding efficiency. The proposed method was asserted to be compatible with the current
TmuC, with no extra syntax needed. Remarks recorded during discussion were:

Largest gain comes from small image sizes, for classes other than D typically around 0.1
percent, but would require considerable change of CB design.

Adds irregularity to the CTB, which makes it necessary to augment the parsing of the tree
by additional conditions that depend on frame size.

No action taken.
JCTVC-C026 Proposal of frame parameters FLCTB and FSCTB for video content fitting [F.
Yang, W. Li, S. Wan , Y. Chang (Xi'dian Univ.)]
The size of CTBs in a frame in the TmuC is limited by the parameters of LCTB and SCTB, which
are set on the sequence level. However, the optimal CTB splitting mode for a frame depends on the
frame content, the frame type, and the quantization parameter, which indicates that different LCTB
and SCTB should be used for different frames. Therefore, two frame parameters of FLCTB and
FSCTB were proposed to adapt CTB splitting to the frame content. Remarks recorded during
discussion were:

No results available yet

There is some doubt that this provides gain.

No action taken.
JCTVC-C154 Fine granularity slices [R. Sjöberg, P. Wennersten (Ericsson)]
In order to increase the granularity of possible slice boundary positions this contribution proposed
to change the definition of slices from being a sequence of largest coding tree blocks (LCTB:s) to a
sequence of the smallest coding tree blocks (SCTBs) in order to increase the control of the slice
sizes in terms of bytes.
This contribution also contained a proposal to replace UVLC for signaling of slice start positions
with a fixed length code (FLC) with a one-bit shortcut for indicating the first block in a picture
since this address is present in all pictures. Remarks were recorded in discussion as follows:

Applications of slice-wise processing are parallelization for UHD (where the current design
is not a problem) and packetization in videoconferencing.

In terms of coding efficiency, the main disadvantage of slice-wise processing is the breaking
of prediction chain and entropy coder adaptation.

The main argument for change of CTB signaling would be a more clean design.
208

Conclusions:
o Regarding the slice processing, defining slices as sequence of largest CU give a
disadvantage in regard to flexibility.
o An estimate of solving the problem by restricting to 32x32 slices (taken from TE12)
points to 5% BR increase.
o A suggested first step would be to start slice implementation (suggested for an AHG
chaired by Rickard Sjöberg) with the current CU structure, and only after that further
more concrete evidence can be collected
Another late "Proposal 3" was presented in the presentation slides (not in the contribution
document). Slice implementation would appear to be more simple if the design would return to
JCTVC-A205 draft 2 (min_coding_unit_size and hierarchy_depth); In fact it was suggested that the
current syntax seems to be broken as it does not support rectangular CU (only square). It was
suggested that this other proposal should be registered as a new document (first discussed with the
TMuC editors).
18.2.1.1.1.1.1.1.1
JCTVC-C318 TMuC text on max CU size seems broken [R. Sjöberg
(Ericsson)] (late reg.)
The issue reported in JCTVC-C318 was partially presented in the slide deck of JCTVC-C154,
where it was called "proposal 4". The slide deck did not give very much information about the
issue; and it was not in the word document and seemed to be somewhat independent, so it was
suggested to register this as a separate input document. This document arrived quite late, and there
was insufficient time for this late contribution to be reviewed.
It may be beneficial for the contribution to be studied by those with detailed knowledge about the
tree implementation. The subject matter of the document at the close of the meeting may (or may
not) be addressed by the breakout activity report in JCTVC-C319.
JCTVC-C167 Proposal on large block structure and quantization [K. Sato (Sony)]
For common test conditions in video codec standardization, the value of quantization parameter is
usually fixed within a picture, or a sophisticated scheme like RDOQ is sometimes employed for
coding efficiency improvement. However in the real applications, a QP-changing scheme with
macroblock-by-macroblock variation of QP is usually employed for visual quality improvement
(and rate buffer control).
Adaptive quantization as defined in MPEG-2 Test Model was suggested to be a very old technique
but still one of the effective techniques for subjective quality optimization. It determines QP
according to the activity calculated from the values of original samples within a macroblock. The
basic concept is that human eyes are sensitive to degradation in flat areas, but insensitive to busy
areas, so smaller QP is allocated for the former areas and bigger QPs are allocated for the latter
areas. This was suggested to work effectively, especially when the size of the image is large (like
HD). One of the key applications of HEVC will be beyond-HD image coding, so techniques like
adaptive quantization seem likely to also be useful for HEVC.
The HEVC TMuC employs the concept of a coding unit (CU). The largest coding unit (LCU)
corresponds to macroblock in the existing video coding standard like MPEG-2 or AVC. The main
difference is that the size of the latter is fixed to 16x16 pixels while the one of the former can be up
to 128x128. In the existing video coding standards, QP can be set at the macroblock-level.
However, as the size of LCU becomes large, it may contain both flat areas and busy areas, and if
QP can only be set at LCU-level, such adaptive quantization may become more difficult.
Therefore it was proposed to define the CU syntax so that QP can be changed not only with LCU
level but with all CU levels. This modification can enable QP changing for subjective quality and
was asserted to be useful for real applications. Remarks recorded during the discussion included the
following:
209

More evidence would be necessary (e.g. demonstrating that the switching of QP at 16x16
blocks gives an advantage over switching at 64x64)

What would be the bit rate increase if no switching is used? At least it would be necessary to
have a syntax element e.g., at the picture or slice level to disable the encoding of
cu_qp_delta.

Further study was suggested.
JCTVC-C224 Frame coding in vertical raster scan order [D.-K. Kwon, H. Kim (TI)]
In HEVC, the CUs (Coding Units) in a frame are encoded in a horizontal-first raster scan order. It
was reported that the frame coding in this raster scan order is not always effective for intra
prediction and requires large on-chip memory to cover large vertical motion for motion estimation
(ME) and motion compensation (MC), which is very challenging especially for UHD (ultra high
definition) video such as 4k x 2k and 8k x 4k. In this contribution, it was proposed to support frame
coding in a vertical-first raster scan order along with the horizontal-first raster scan order. Then the
best coding scan order could be selected adaptively for each frame based on coding efficiency,
horizontal/vertical motion, on-chip memory saving, etc. Remarks recorded during the discussion
included the following:

Changing order (by rotating 90%) improves performance for some sequences by up to 3% ,
but nothing on average (i.e. loss for others).

This introduces a 1 frame delay

How would an encoder decide this?

No action taken.
JCTVC-C260 Flexible picture partitioning [K. Panusopone, X. Fang, L. Wang (Motorola)]
This document proposed use of a 2D coordinate system to partition a picture into largest coding tree
units (LCTU) in a more flexible manner. The 2D coordinate system plane is divided into square
blocks by vertical and horizontal lines. The block size is the same as the selected LTCU size. The
top-left corner of a picture may be aligned with the coordinate system origin, or shifted away from
the coordinate origin by an offset. The picture partition follows the square blocks in the coordinate
systems. Remarks recorded during the discussion were as follows:

Alignment of CU grid by default in top-left; Corner mode allows one of 4 corners; explicit
mode allows positioning of CU block grid with any offset

Results were reported with 30 frame length sequences, coded as IPPP (not common
conditions), with adaptations for each frame; and a gain was reported of 0.63% on average,
the largest gain being in class E.

These results were achieved by running 4 encodings with the 4 corner options and selecting
the best. It was suggested that the 400% encoder complexity increase would certainly not be
justified by the small gain.

No action taken.
JCTVC-C275 Improve intra frame coding by PU/TU reordering [X. Peng, J. Xu, F. Wu
(Microsoft)]
This document presented an adaptive PU/TU reordering scheme for prediction in TMuC intra-frame
coding. Within a PU, the coding order of TUs was rearranged to achieve better prediction for the
210
inner TUs. Within a CU, a different coding order of PU can be selected also to reportedly obtain a
better prediction. Remarks included:
-
The ordering is dependent on prediction direction.
-
An increase in encoder runtime roughly to 170% was reported.
-
The gain in bit rate for HE Intra was roughly 0.4%
-
It is not clear how much of the gain comes from TU or PU re-ordering.
-
No action taken.
JCTVC-C283 Region of block based dynamic video processing for HEVC [A. Paul (Hanyang
Univ. / NCKU)]
A region based adaptive video processing scheme based on SAD of macro blocks was proposed in
this proposal. The average SAD of all the blocks in this region is calculated for computing a
threshold value and thereafter this threshold value is used to adaptively select the search range. The
proposed algorithm reportedly achieves 60% time reduction with negligible PSNR loss.
There was difficulty arranging presentation of this contribution and no action was taken on it.
JCTVC-C306 Investigations for representing rectangular blocks using the merging concept
[H. Schwarz, D. Marpe , T. Wiegand] (late reg.)
This document investigated the impact on coding efficiency when the symmetric rectangular block
modes (2NxN and Nx2N) were removed from the syntax and replaced by corresponding merge
flags. For the same encoder complexity (given an algorithmic comparison and verified by the
measurement of encoding times), an average increase in BD rate of 0.13% was reported for a
removal of the symmetric rectangular blocks. A variation with lower encoding complexity (the
average encoding time is 78% of that for the reference) reportedly yielded an increase in BD rate of
0.8%. If the encoder complexity is increased in comparison to the reference (the measured average
encoding time is 144% of that for the reference), an average decrease in BD rate of 0.06% was
reported. The following remarks were recorded during the discussion:

This is one example about redundancy in the block structure (i.e. the same structure can be
expressed in different ways).

The current implementation of doing a full search on rectangular shapes may not be the best
approach.

It was suggested to establish an AHG to study the block structures (chairs WJ Han, TK Tan,
T Wiegand)

This should include intra & inter.
JCTVC-C311 Fast intra encoding for fixed maximum depth of transform quadtree [B. Bross,
H. Kirchhoffer , H. Schwarz , T. Wiegand (Fraunhofer HHI)] (late reg.)
This document described an encoder-side complexity reduction for intra encoding. The presented
fast encoding method produces bitstreams that are conforming to the specification while reportedly
achieving a similar encoder complexity as for the transform quadtree disabled case.
Two fast modes had been implemented:
In speedup mode 1, the different intra prediction modes are tested only on the maximum allowed
transform block size. The best performing mode is tested with the full transform tree partitioning
and the best performing partitioning is chosen.
Speedup mode 2 does basically the same except testing the full transform tree partitioning with the
two best performing intra modes instead of the best one.
211
Results were provided only for 50 frames of each sequence.
Mode 1: Enc. Time 101%, BR reduction 0.3%.
Mode 2: Enc. Time 120%, BR reduction 0.8%.
It was indicated that a new version with more additional results would be made available which
shows also results with tree depth=2 which reports 0.1% and 0.3% BD BR gains for modes 1 and 2.
In addition, results were reported on disabling MDDT, ROT, edge pred and smoothing, with a
degradation of 0.5% for mode 1 (96%/111% encoder for HE and LC), mode 2: 0.8%/0.2% for
HE/LC with 116%/132% encoder time.
JCTVC-C312 Cross-verification of JCTVC-C311: Fast intra encoding for fixed maximum
depth of transform quadtree [W.-J. Han (Samsung)] (late reg.)
This late contribution reportedly confirmed the results of the original quadtree approach with 3
levels and the scheme described in JCTVC-C311, with computation times that differed slightly.
18.2.1.1.1.1.1.1.2
Discussion on JCTVC-C311/JCTVC-C312
The following remarks were recorded during the discussion of JCTVC-C311 and JCTVC-C312:

Concern was expressed that for LC conf this may not give advantage.

It was suggested to enable RQT only for HE mode, or to restrict it to a different depth for
LC (possibility would be to restrict in LC depth=1 for intra, depth=2 for inter; depth=3 for
both intra and inter in HE case).

Note: depth=1 is said to be the same as RQT off for intra – this should be further verified.

Note2: Using a different depth for inter and intra is currently not supported

It was mentioned that the configurability that RQT provides has implications on other tools’
performance. Therefore it would be important to have it in the TM.

It could be studied in the context of the AHG on block structures + CE

Definitely this needs to be further study in the AHG on block structures

It was said that RQT on/off has implications on the choice of block sizes and modes for intra
prediction (signaling is different and therefore other modes may be selected).

From the software perspective, it would be undesirable to use the previous on/off version as
these are completely independent paths

A breakout group was formed to discuss the subject further, resulting in the BoG report
JCTVC-C319 as discussed elsewhere in this report.
18.3
Motion compensation and interpolation filters
JCTVC-C126 Chroma interpolation filtering using high precision filter [D. Y. Kim, J. P. Kim,
Y. L. Lee (Sejong University), J. Lim, J. Song (SKT)]
In the TMuC, chroma interpolation is performed with the bilinear interpolation to obtain the 1/8pixel vectors in the same way as in AVC. In this contribution, a chroma interpolation method using
the 6-tap AVC luma MC interpolation filter with high precision was proposed. Experimental results
reportedly show that the average BD-rate improvements on chroma U and V components are 12.2%
and 13.4%, respectively, in the random access, low complexity case, and the average BD-rate
improvements on U and V components were reported as 4.1% and 5.0%, respectively, in the
random access, high efficiency case. The average BD-rate improvements on U and V components
was reported as 12.0% and 14.9%, respectively, in the low delay, low complexity case, and the
average BD-rate improvements on U and V components was reported as 2.0% and 2.4%,
212
respectively, in the low delay, high efficiency case. The average BD-rate improvements on Luma
were reported as 1.3% and 0.4%, respectively, in low complexity and high efficiency of the low
delay case, and the average BD-rate improvements on Luma were reported as 0.2% and 0.4%,
respectively, in low complexity and high efficiency of the random access case. Remarks recorded
during the review included:

The results are similar to what has been reported before in TE12 and other TEs.

Establishing a CE on luma and chroma subpel interpolation was recommended

It is still unclear what the real benefit in terms of overall BR gain is.

Complexity is certainly increased compared to bilinear interpolation.

Subjective quality may also be an issue in this context.
JCTVC-C137 Decoder-side block boundary decision (DBBD) with OBMC [S. Fukushima
(JVC)]
This proposal focused on the tool "Decoder-side Block Boundary Decision (DBBD)" which was
proposed in JCTVC-A108 as one of tools for HEVC CfP response. DBBD aims to perform motion
compensation with various block patterns without additional coding bits by deciding the MC
boundary on the decoder-side. In this proposal, DBBD with OBMC was proposed for improving the
quality of MC block in addition to the previous proposal. The simulation results reportedly show
that the proposed technique provides 3.1% BD bit rate gain for random access, and 2.2% for low
delay as against JM16.2 anchor under HEVC common test conditions. Remarks included the
following:

Only used in 16x8 and 8x16 modes

Encoding time increased to roughly 400-500% (Currently ME is performed for every
possible boundary position). Decoder time increased roughly by 10%

AMP does something similar (with signaling), reducing the bit rate by roughly 2% but only
increasing encoding time by 33%, with negligible decoder complexity impact.

No action taken.
JCTVC-C183 Enhanced switching of interpolation filter for HEVC [T. Yoshino, S. Naito, S.
Sakazawa (KDDI)]
Adaptive Interpolation Filter (AIF) technology (e.g., as in prior KTA work) has been proposed to
improve the interpolation filter performance for fractional-pel motion compensation. However, this
scheme requires high computational cost for calculating the filter coefficients on the encoder side.
This contribution proposed an Enhanced Switch Interpolation Filter (ESIF) approach that predefines several interpolation filter coefficients in order to reduce the computational cost for
calculating the coefficients. This scheme allows updating the number of filter coefficient sets and
the filter coefficients slice by slice. The experimental results reportedly showed that the BD-bit rate
against AVC reached 1.9% under CS2 condition. Remarks recorded during the discussion included
the following:

The Powerpoint presentation did not match with the contribution (a set of 8 filters was
proposed instead of 4).

The current scheme has 500% coder complexity.

The "Low complexity scheme" with 130% coder complexity for filter decision gains 0.30.4% on average.
213

No action taken.
JCTVC-C204 Bi-directional optical flow [A. Alshin, E. Alshina (Samsung)] (missing prior,
available first day)
A modified B-slice prediction was proposed. By combining the optical flow concept and high
accuracy gradients evaluation this proposed technique allows sample-wise refinement of motion.
This approach does not require any signaling about motion vector refinement to the decoder.

Sample-wise refinement is performed by solving optical flow equations between two
reference images. For gradient calculation, DCT-based interpolation is used.

Uses an anchor with both MRG and skip/direct and DCT-IF12 used for comparison

The reported gain was roughly 2%, with decoder runtime roughly 150%

It was suggested to include this in an experiment on decoder-side estimation
JCTVC-C215 Adaptive MV resolution with directional filters [K. Ugur, J. Lainema (Nokia)]
This contribution presented an adaptive motion vector resolution concept using 1D directional
filters. The goal of this contribution was to demonstrate the possible coding efficiency gains in
interpolation with low complexity. It also presented a design of adaptive motion vector resolution in
TMuC, with directional filters. Remarks recorded included the following:

Saves computations at decoder through additional 8-pel positions (not a dense grid of all
possible positions) which allows more 1D interpolations.

Encoding time increased to 124% (due to additional search), decoding time decreased to
80% (although the time measurement results may not be reliable).

Bit rate reduction was reported at 1.3% on average; where the gain comes due to additional
8th pel positions; and the gain was mostly in class D.

Bilinear interpolation is used for chroma, whereas the LC version of TMuC may have a
rounding problem; and it was suggested that this could explain part of the gain.

It was suggested to study this further in the context of AMVRES and MC interpolation.
JCTVC-C251 Overlapped block motion compensation in TMuC [P. Chen, R. Panchal, W.-J.
Chien, M. Karczewicz (Qualcomm)]
In this contribution, some results of overlapped block motion compensation (OBMC) in geometry
partition coding were reported. The use of OBMC to asymmetric motion partitions was also
proposed, with simulation results in support being presented. Remarks included the following:

Is combined with geometric partitioning

Geometric partitioning reportedly gives 3.9% BR reduction for RA, 3.6% for LD; and 0.8%
of this is due to OBMC.

If combined with AMP, the gain of OBMC is 0.4% and 0.3% for RA & LD, respectively.

The increase of complexity at the decoder was asserted to be marginal.

The presentation did not match with the available document, and it was indicated that a new
version would be uploaded.
214
JCTVC-C301 Geometry motion partition [P. Chen, L. Guo, M. Karczewicz (Qualcomm)] (late
registration, missing prior, available first day)
Geometry motion partition has been integrated into TMuC. The software implementation is based
on the design proposed in JCTVC-A121. In the software, geometry motion partition is applied to
CU size of 16x16 and 32x32. Geometry mode decision has two stages. First, fast geometry mode
decision pre-selects 16 best modes based on SAD calculation. Then the best mode is selected based
on full RD. Geometry modes are tested only if the best non-geometry mode is a non-skip mode.
Remarks recorded included the following:

For results, see above under JCTVC-C251

Encoder runtime is 200%; geom. Partition test only done after skip decision.

It was suggested to include this in a core experiment on asymmetric & geometric
partitioning (previously in TE3).

Note: checking of the sample membership to the left or right partition may also be of
concern for complexity; if masks are used for that purpose, it should be considered that the
number is low.

The presentation did not match with the available document, and the contributor was
requested to upload a new version.
JCTVC-C303 Geometry adaptive block partitioning (GEO) cross-check [E. Francois, P.
Bordes (Technicolor)] (late registration, missing prior, available first day)
This cross-check contribution reported exactly the same results in BR reduction as JCTVC-C301 (3
class B sequences were missing). No checking of runtime was reported.
JCTVC-C254 Test result of SIFO for sequences with illumination change [R. Panchal, M.
Karczewicz (Qualcomm)]
SIFO was asserted to be not just an interpolation filter but it also to have the capability to
compensate for illumination changes. SIFO results were provided for some sequences which had
illumination changes to highlight that for such sequences, the offset component of SIFO provides
useful gains. The test was done using TMuC 0.8 in the low delay configuration. Remarks recorded
included the following:

The presentation did not match with the available document, and uploading of a new version
was requested.

It was asked why should interpolation and illumination compensation be combined? It was
suggested that these are two different issues (the latter also applying to full-pel positions).
The answer by the proponents was that it would be possible to do it separately.
JCTVC-C295 Re-compensation based on partial coefficients [H. Zhu (Zhu)] (late registration,
missing prior)
The proposal reportedly provided a new idea for video coding. The algorithm divides the quantized
DCT coefficients to two parts, using the first part to get a first reconstruction, and using the first
reconstruction R0 and the compensation signals Cmp0 and Cmp1 to compute a new weight for the
motion compensation. No experiment results were provided, and no action was taken in response.
215
18.4
Motion vector coding
JCTVC-C208 Simulation results of motion partition, block merging and motion competition
schemes [I.-K. Kim, T. Lee (Samsung)]
In the current TMuC design, there are two approaches for frame partitioning: prediction unit (PU)
splitting to make smaller motion partitions and block merging (MRG) to make larger motion
partitions. In this proposal, anchor results are obtained by running TMuC-0.81 using reference
configuration with disabling all motion partitions, motion competition and MRG. By enabling
MRG on top of the anchor, improved BD bit rate is reported to be about 2.0% (random access),
0.2% (random access LC), 2.4% (low delay) and 0.0% (low delay LC). But higher BD bit rate
savings can reportedly be obtained by enabling motion partition: 4.2% (random access), 5.8%
(random access LC), 4.1% (low delay) and 5.1% (low delay LC). By enabling motion competition,
improved BD bit rate savings are reported to be about 4.3% (random access), 5.6% (random access
LC), 3.2% (low delay) and 5.2% (low delay LC). The best performance can reportedly be obtained
by combining motion partitions and motion vector competition which shows 7.2% (random access),
9.2% (random access LC), 6.2% (low delay) and 8.3% (low delay LC) BD bit rate savings.
The results of TE12 were reported to have been confirmed.
JCTVC-C257 On motion vector competition [Y. Su, A. Segall (SHARP)]
A method was proposed to improve the performance of motion vector competition in applications
that may experience errors during transmission. The focus of the contribution was the dependency
between parsing and co-located motion vectors that is incorporated into the motion vector
competition design. This dependency results in coding gain; however, it also results in an inability
to parse a bit-stream if a previous frame is lost. To overcome this problem, it was proposed to send
the motion vector candidate as a flag, which signals if the selected predictor is the temporal
collocated block, and subsequently send an index into the set of spatial predictors when the
temporal collocated block is not selected. Remarks recorded in the consideration of the contribution
were as follows:

This was presented near the end of the meeting, after difficulty coordinating a prior
presentation time

The goal is reduction of the buffer size; the advantage of robustness under loss was not
proven yet.

In the worst case the buffer is around 33% of a picture buffer in size.

This may require computation of a median of 16 values, which adds complexity.

A loss around 0.1% in bit rate was reported.

It was suggested to include further investigation of this in a CE
18.5
B picture reference list redundancy
JCTVC-C278 Redundancy reduction in B-frame coding at temporal level zero [B. Li, J. Xu, G.
J. Sullivan, F. Wu (Microsoft), H. Li (Univ. of Sci. &Tech. China)]
In the current TMuC, the usage called GPB is widely used to replace a P slice with a B slice using
two reference frame lists that contain the same pictures. As list 1 is identical to list 0, there is
redundancy comparing using only list 0 with using only list 1 as references. This contribution
suggested not to use list 1 prediction in this kind of B slice, restricting the available predictors to
using bi-prediction and list 0 prediction. Compared to TMuC0.7 anchors, such a change reportedly
leads to 0.1% bit rate savings for random-access cases and 0.9% bit-saving for low-delay cases, on
average.
216
Encoding time was also decreased compared to the anchor (around 10%). See further notes
regarding the JCTVC-C285.
JCTVC-C285 Modified uni-directional inter prediction in generalized P and B pictures [W.-J.
Chien, P. Chen, X. Wang, M. Karczewicz (Qualcomm)]
This contribution proposed a modification of uni-directional inter-prediction in generalized P and B
pictures (GPB). The modification restricts the reference list selection for uni-directional interprediction. Simulation results reportedly showed that 0.88% BD rate reduction can be achieved. The
same method was proposed in JCTVC-C278.
It was remarked that the encoding time decrease is no argument, as this is implementation specific
At first it was suggested for an AHG to study this further and check the two methods, work out
syntax and provide stable cross-checked software and results, and report about implications on the
MMCO and RPLR
Decision: After further study and discussion, it was agreed that this seems to be quite simple to
implement without complications (as confirmed by SW coordinator), and the proposal was adopted.
18.6
Quantization control
18.6.1.1.1.1.1.1.1
JCTVC-C135 Flexible scaling of quantization parameter [D. Hoang
(Zenverge)]
The current Test Model under Consideration (TMuC) employs a quantization parameter (QP)
scaling that is borrowed from the AVC standard. In AVC, the quantization step size increases by
approximately 12.25% with each increment of QP, so that the quantization step size doubles when
QP is incremented by 6. For the purpose of rate control, the contributor asserted that this 12.25%
increment may be too coarse for certain applications, such as low-delay coding. This contribution
proposed a specification that allows the granularity of the quantization parameter (QP) to be varied
at the slice and picture levels. Backward-compatibility with the AVC approach is maintained at one
of the granularity settings. In addition to varying the granularity, this contribution also proposed a
variable QP offset that can be specified at the slice and picture levels.
It was remarked that there have been prior efforts to address the coarse quantization step size
increment of AVC. In JCTVC-A114, the quantization step size was proposed to double every 16
increments of QP.
In AVC, the QP can vary from 0 to 51, inclusive (for 8 bit per sample coding). A full transition of
QP from 0 to 51 reportedly translates to a 362-fold increase in the quantization step size. The
contributor said that, in practice, only a small range of QP values are used when coding a scene at a
given bit rate. It was also asserted that at scene changes, QP can change due to a change in scene
complexity and then typically varies only slightly within a scene.
It was remarked that the complexity of supporting the proposed inverse quantization process in the
decoder may not be trivial. It was suggested that dynamic range increase may be needed to obtain
the suitable precision for frequency-specific norm adjustment, and that necessary LUT sizes and
perhaps some other memory increase may be needed for the decoder to support this.
The relationship between luma and chroma quantization was also discussed.
It was noted that the stability of the transform design may not be at a stage where such a proposal
would be timely.
Further study was encouraged.
18.7
Entropy coding
18.7.1 Coefficient scanning and context selection
18.7.1.1.1.1.1.1.1
JCTVC-C114 Zigzag scan for CABAC/PIPE [J. Lou, K. Panusopone, L.
Wang (Motorola)] (missing prior, uploaded on first meeting day)
217
In TMuC0.7, the macro "HHI_TRANSFORM_CODING" enables a set of transform coefficient
coding tools. It is switched on by default when the entropy coding option is CABAC/PIPE. The set
of transform coefficient coding tools is based on JCTVC-A116. Among these tools, an adaptive
scan is applied for significance map coding. The contributor reported that experimental results
indicate that this adaptive scan scheme achieves only a negligible performance gain, but introduces
additional memory and computational complexity as comparing to the traditional zigzag scan. The
contributor proposed to use a conventional zigzag scan rather than the adaptive scan for significance
map coding when the macro "HHI_TRANSFORM_CODING" is switched on, or CABAC/PIPE is
selected.
This topic was also discussed in the TE12 context – e.g., see section on JCTVC-C059.
The compression impact was estimated by the contributor was estimated here and in JCTVC-C059
as being in the range of 0.0 to 0.3%, depending on coding condition and test sequence class.
Further investigation of the topic was encouraged.
18.7.1.1.1.1.1.1.2
JCTVC-C227 Parallelization of HHI_TRANSFORM_CODING [V. Sze, M.
Budagavi (TI)]
See also JCTVC-C059 and JCTVC-C114.
While various parallel bin encoding approaches have been proposed, context modeling remains a
parallelism challenge in the entropy coding engine. HHI_TRANSFORM_CODING uses a highly
adaptive context modeling approach for the coefficient significance map, where context selection
depends on coefficients in neighboring positions. While it provides coding gains between 0.8% to
1.1%, it reportedly introduces significant dependencies which are difficult to parallelize when using
zig-zag scan or adaptive scan order. In this document, the contributor proposed a "wavefront"
diagonal scan order to enable parallel processing in conjunction with the use of the highly adaptive
context selection for significance map. A simplification of the context selection was also proposed
in the contribution. These modifications were reportedly implemented in TMuC-0.7.3 and the
coding efficiency impact was reportedly evaluated. The modifications reportedly come at a cost of
0.2% to 0.4% coding loss. Despite this loss, the parallelized HHI_TRANSFORM_CODING with
simplified context selection (reduce dependency by one or two scans) and wavefront scan (down
left direction) was asserted to still provide gains between 0.9 to 1.4% when compared with not
using HHI_TRANSFORM_CODING.
The contributor proposed conducting a CE/TE for further study of the topic.
18.7.1.1.1.1.1.1.3
JCTVC-C205 Low-complexity adaptive coefficient scanning [V. Seregin, J.
Chen (Samsung)]
This contribution proposed a new scanning method for transform coefficients. It was proposed for
the scanning mode for every Transform Unit (TU) to be defined explicitly based on RD estimation
and a corresponding mode index signaled to the decoder. The proposed technique was tested
relative to TMuC 0.7 and a comparison was made to the existing methods in TMuC 0.7. The
performance was evaluated on 1 second duration sequences that were otherwise following based the
common test conditions specified in JCTVC-B300 – the sequences were shortened to reduce the
testing time. The proposed method reportedly provided 0.2-0.9% improvement for luma on average
under HE conditions. In LC configurations the proposed method reportedly had similar
performance as TMuC.
This would increase encoding complexity, to determine the selected scan order to be applied.
It was remarked that, instead of using syntax to indicate the scan order, the scan order could be
inferred based on the intra prediction direction. Some proposals relating to this were also discussed
at the meeting.
The contributor proposed conduction a CE/TE for further study of the topic.
18.7.1.1.1.1.1.1.4
JCTVC-C250 Low complexity adaptive coefficient scanning [M. Coban, R.
Joshi, M. Karczewicz (Qualcomm)]
218
An adaptive coefficient scanning process is used to capture coefficient statistics based on the intra
prediction mode. This contribution proposed to introduce a partial sort at the coefficient level for
determining the scan pattern. The proposed scheme reportedly reduces the complexity of the
existing adaptive coefficient scan algorithm, with a negligible average BD bit rate penalty.
If MDDT is disabled, the adaptive scan described in this contribution is not used. However, it could
hypothetically be used without MDDT.
The contributor indicated that without MDDT, approximately 0.6% improvement under HE all-intra
conditions, and 3% improvement under LC all-intra conditions was observed in some preliminary
testing with short video sequences. The gain discussed for the LC case included another effect
discussed in another contribution JCTVC-C263.
The proposed modification reportedly cuts the decoder run-time approximately in half.
Some of the results presented were not in the original contribution.
Some of the results had reportedly been cross-checked by RIM as reported in JCTVC-C307.
18.7.1.1.1.1.1.1.5
JCTVC-C307 Cross Verification of Low Complexity Adaptive Coefficient
Scanning [J. Zan , J. Meng , M. Towhidul Islam , D. He] (late reg.)
This contribution reported cross verification results on "low complexity adaptive coefficient
scanning" as described in JCTVC-C250. Due to limited time, RIM had only tested 5 sequences
from each class; and for each test sequence, two QP values (22 and 37) for the "intra" and "intraLC" modes.
The test results were compared with those from Qualcomm: 1) for the tests that macro
"FAST_ADAPTIVE_SCAN" were enabled, both results reportedly matched on BD bit rate, and
PSNRs on all test points; 2) for the tests that macro"FAST_ADAPTIVE_SCAN" were disabled,
mismatches were reported between the results and those from Qualcomm except for the Class D
sequences, although the difference was very minor (<0.1%).
On decoding time, for the "intra LC" mode, an average of 43% decoding time was recorded on all
available test points, and the Qualcomm data was an average of 37% on all QPs and all sequences;
for the "intra" mode, an average of 60% decoding time was recorded on all available test points, and
the Qualcomm data was an average of 51% on all QPs and all sequences.
It was noted that the tests could only be done on some selected sequences, with the lowest and
highest QP values, due to time constraints. From the data collected on these test points, it was
reportedly shown that the decoding time was greatly reduced.
This contribution was submitted late during the meeting, with very limited opportunity for review.
It was not presented or discussed in detail.
18.7.2 LCEC modification proposals
18.7.2.1.1.1.1.1.1
JCTVC-C263 Improvements on VLC [M. Karczewicz, W.-J. Chien, X. Wang
(Qualcomm)]
This contribution proposed changes relating to variable length coding (VLC) of transform
coefficients, intra prediction modes, and inter modes. In this contribution, a modification was
proposed to the current transform coefficient coding for intra blocks.
Coding schemes were also proposed for coding the prediction mode of both intra and inter blocks
respectively. With a different VLC table selection method and different symbol combination,
coding performance of the prediction modes was also proposed to be modified.
For LC all-intra, an average gain of 4.3% was reported, with MDDT disabled and ROT enabled.
With ROT turned off, the contributor suggested that the gain would likely be similar.
For LC random access, an average gain of 3.2% was reported.
For LC low delay, an average gain of 2.4% was reported.
The results had not been cross verified.
Some complexity impact was reported due to the use of adaptive scanning.
219
The establishment of a core experiment on the subject was proposed (possibly split into different
subject areas).
The LCEC coding in the TMuC software includes the coding of a terminating bit that is
unnecessary and not described in the corresponding text. We agreed that this is considered a bug,
and should be fixed. This somewhat affects the reported results, as noted by the contributor.
It was remarked that his has overlap with JCTVC-C185, and should be evaluated in the same
CE/TE.
18.7.2.1.1.1.1.1.2
JCTVC-C210 Efficient coefficient coding method for large transform in VLC
mode [S. Lee, M.-S. Cheon, I.-K. Kim (Samsung)]
This contribution presented a coefficient coding method for large transforms in LCEC mode.
Tandberg, Ericsson, and Nokia’s proposed coefficient coding method, which is reportedly currently
implemented in the TMuC, was reportedly extended for the efficient coding of the coefficients from
large transforms including 16x16, 32x32, and 64x64. Experimental results reportedly showed that
the proposed method provides gain for random access and intra only configurations without a
significant complexity increase.
For LC all-intra, an average 1.6% improvement was reported.
The results had not been cross verified.
18.7.2.1.1.1.1.1.3
JCTVC-C152 Context-adaptive hybrid variable length coding [Y. Xu, J. Li, W.
H. Chen, D. Tian (Cisco Systems)]
This contribution presented a context-adaptive hybrid variable length coding method for entropy
coding. This method is based on the reported observation that the non-zero transform coefficients
are more clustered in the low frequency region while being more scattered in the high frequency
region. The code tables are adaptively chosen based on the context information from coded
neighboring blocks or from the coded portion of the current block.
The proposed algorithm was implemented and tested relative to JM 10.1 High Profile with fixed
8x8 transform size, and the simulation results were compared with CAVLC.
Further work would be needed to test the proposal in the TM design context to determine its
applicability to the new design. The contributor indicated that such results may be available by the
time of the next meeting.
18.7.3 V2V coding
18.7.3.1.1.1.1.1.1
JCTVC-C279 Opportunistic parallel V2V decoding D. He, G. Korodi, E.-h.
Yang, G. Martin-Cocher (RIM)
A buffer-based technique that distributes codewords produced by V2V codes into interleaved fixedlength phrases was proposed. On the decoder side, since subsequent phrases can be accessed
independently without waiting for the decoding of the current phrase to complete, an "opportunistic
parallel decoding" can reportedly be achieved.
The proposal described the concept of this modified scheme. Some padding bits are needed at the
end of the bitstream when applying this scheme. However, the scheme avoids the need for the
pointers that are otherwise needed for identifying the location of the PIPE streams, and also avoids
the bit-wise interleaving of the PIPE data.
A participant asked what length might be recommended for the "phrases", and the contributor
suggested such examples as 16, 32, etc. (multiples of 8 bits).
It was suggested that some measurements of figures of merit for the technique should be presented.
Further study was encouraged.
220
18.7.4 Modified CABAC coding
18.7.4.1.1.1.1.1.1
JCTVC-C296 Decoding improvement on the PA-Coder [H. Zhu (Zhu)] (late
registration, missing prior, available first day)
The contribution proposed a fast channel for MPS decoding of the PA-coder previously proposed in
JCTVC-A027. Once a new bit is decoded, a threshold value is computed using a table lookup in the
previous design. The threshold value is compared with the current value to decide the MPS or LPS.
The PA arithmetic coder was described as having reduced computational resource requirements
relative to CABAC and PIPE, with slightly less compression capability.
The PA coder had been integrated into the JM, and the contributor indicated that integration of the
coder with the TMuC software was reportedly expected to be completed soon. No action was taken
in response to the contribution.
18.7.4.1.1.1.1.1.2
JCTVC-C300 High-efficiency entropy coding simplifications [V. Sze , M.
Budagavi (TI)] (late registration, missing prior)
PIPE was described as a bin-level tool aimed to improve throughput by using 12 bin encoders in
parallel. Binary symbols (bins) are assigned to each of the bin encoders depending on their
probabilities; with the bins of different probabilities being processed in parallel. The contributor
asserted that there is a large complexity (i.e. area cost) associated with the PIPE implementation due
to the use of V2V (variable length codes to variable length codes) for the bin encoder, with an
estimated area over 5x larger than CABAC. In this document, a multi-CABAC approach was
proposed using a simplified binary arithmetic coding engine with quantized rLPS tables for the bin
encoder to reduce complexity. The resulting area savings was estimated to be over 7x. The
parallelism across bin encoders is still maintained, thus providing improved throughput at the bin
level. The coding efficiency of TMuC-0.7.3 with the modified multi-CABAC and quantized rLPS
tables was reported to have a 0.0-0.1% coding gain over PIPE.
The contributor indicated that the buffering associated with the probability interval partitioning
remains a challenge in the proposed design.
A participant commented that CABAC power consumption could be an issue in such a design.
The fact that the probability estimate is static for each BAC decoder would substantially simplify
the BAC engine operation.
The compatibility of the scheme in regard to the "load balancing" scheme found in the TMuC
would need to be studied.
It was asked how to properly measure area, power, throughput, the difficulty of implementation in
software, and, when comparing to LCEC, also coding efficiency.
Further study was encouraged.
18.7.4.1.1.1.1.1.3
JCTVC-C304 Showing the possibility of fast CABAC [H.-J. Kim , X. Qu , W.J. Han] (late registration, missing prior, provided later during the meeting)
This late document was initially uploaded as an empty file. It was remarked that it had a confusing
IPR statement, and it was indicated that a new version would be provided. Due to lateness, there
was limited opportunity for its review. It described a proposed entropy coding method that was
asserted to have lower complexity than CABAC. Results were reported on some test sequences. A
2-6% bit rate increase was reported along with a reported complexity reduction by 30-35%. No
action was taken in response.
18.7.5 Entropy slices
18.7.5.1.1.1.1.1.1
JCTVC-C256 New results for entropy slices for highly parallel coding [K.
Misra, J. Zhao, A. Segall (SHARP)]
The concept of an Entropy Slice was previously proposed for the HEVC design. Entropy slices
enable separate definitions for neighborhood in the entropy decoding and reconstruction loops.
221
Motivations for entropy slices were described. As a first benefit, it was asserted that the system
enables parallel decoding with negligible overhead. This includes both the context adaptation and
bin coding stages, and it is compatible with all of the entropy coding engines currently in the TMuC.
As a second benefit, it was asserted that the degree of parallelization is flexible – an encoder may
generate a bitstream that supports a wide range of decoder parallelization factors without
knowledge of the target platform. As a third benefit, it was asserted that the entropy slice concept
enables more meaningful profile and level definitions in terms of the impact on entropy coding.
Specifically, profiles/levels can include limits that are relevant to the operation of the entropy
coding component. This was asserted to be useful for all applications, but with a significant benefit
to emerging, higher rate and higher resolution scenarios. The document provided some experiment
results for entropy slices. Software is also provided in the contribution.
The proposal discussed the ability to initialize the state of the entropy coder either using syntax (e.g.,
cabac_init_idc) or using the end state from some other entropy slice. It was remarked that storing
the end state from a preceding slice has a memory impact.
"Wavefront" processing operation had been suggested – and a straightforward method of providing
this capability was described.
The basic concept of desiring enhanced high-level parallelism of the entropy coding stage to be in
the HEVC design is agreed (as recorded in the Geneva meeting report).
It was remarked that the need to store the parsing results prior to completing some of the decoding
process for the picture will have an impact on memory and memory bandwidth, and that this could
be a serious concern.
The overhead bits for indicating the location of the wavefront parallelism entry points were not
accounted for in the coding efficiency experiment results.
It was remarked that, for coded frames that are relatively small, this overhead could be a substantial
percentage of the data.
We can test coding efficiency impact, including overhead bits, and including comparison to regular
slices.
Further study was encouraged (AHG or CE).
18.8
Intra prediction
18.8.1.1.1.1.1.1.1
JCTVC-C040 Chroma intra prediction using reconstructed luma [C. Yeo, Y.
H. Tan, Z. Li, S. Rahardja (I2R)]
Intra coding in the current TMuC achieves high compression efficiency, in part due to the intra
prediction process that exploits spatial directional correlation. However, intra prediction of chroma
components in YUV 4:2:0 video uses a limited set of possible predictions available for coding of
luma components. Also, the coding of chroma components proceeds somewhat independently of
luma components, and ignores any possible correlation between them. This contribution proposed a
way of using reconstructed luma pixels to help with intra prediction of chroma pixels. Specifically,
it uses the regions with similar luma texture characteristics to identify a location for prediction of
chroma components. By making use of the reconstructed co-located luma block to perform template
matching in the luma plane, co-located chroma blocks of the matched luma blocks are identified for
us as predictors. Simulations results reportedly indicate that, on average, the proposed approach is
able to obtain 2.3% chroma bit-rate reduction and 0.5% luma bit-rate reduction relative to the
anchor for all intra coding.
An approximate 2x encoding time and 3x decoding time impact was reported for intra.
It was remarked that something like this that links together luma and chroma operations would need
to provide a very good gain to be a desirable feature.
The complexity seemed rather high, but further study was encouraged.
18.8.1.1.1.1.1.1.2
JCTVC-C206 Chroma intra prediction by scaled luma samples using integer
operations [J. Chen, V. Seregin (Samsung)]
222
This contribution proposed an integerization of the algorithm presented in JCTVC-B021 in which
chroma samples are predicted from colocated reconstructed luma samples by a linear model
relationship. Only addition and multiplication operations are involved to calculate the model
parameter instead of using divisions as in the proposed approach. Experiments reportedly show that
the average BD-rate reduction of Y, Cb, Cr components are 1.3%, 6.8% and 5.6% for intra coding
conditions and 0.7%, 6.7% and 4.6% for random access conditions. The proponent asserted that the
complexity of the proposed method is similar or even smaller than that of the method used in AVC.
The values of the parameters of the linear prediction model are derived at the decoder side from
neighbouring reconstructed samples.
Reportedly the late document JCTVC-C314 contained a cross-verification.
Further study was encouraged.
18.8.1.1.1.1.1.1.3
JCTVC-C314 Verification results of Samsung's proposal JCTVC-C206 [J.
Kim , J. Park , B. Jeon] (late reg.)
The contents of this late document were verbally reported during the presentation of JCTVC-C206.
The document was not reviewed in detail. The PSNR and bit-rate results reported in this document
were reported to be the same as the results reported in the document JCTVC-C206.
18.8.1.1.1.1.1.1.4
JCTVC-C111 Integration of plane mode in unified intra prediction [G. Li
(Santa Clara Univ.), L. Liu, N. Ling (Santa Clara Univ.), J. Zheng, P. Zhang
(Hisilicon)]
The current default intra prediction method in the TMuC, Unified Intra Prediction, reportedly has
up to 33 directional modes and 1 non-directional mode, which is the intra DC mode. This
contribution proposed adding a plane mode and using an "multiple predictor set" intra mode
prediction. Experimental results reportedly showed that coding efficiency can be improved with the
proposed method.
Partial results for intra HE and LC configurations were provided. An average gain of about 0.6%
was reported.
There seems to be several elements discussed together in this contribution.
The encoder search complexity is increased.
It was noted that the encoder search is limited by using fast search in the current TMuC, and better
compression can be obtained just by making a more effective search.
Further study was encouraged.
18.8.1.1.1.1.1.1.5
JCTVC-C189 Evaluation of intra prediction based on repetitive pixel
replenishment [S. Mochizuki, K. Iwata (Renesas)]
This contribution proposed a method of intra prediction based on repetitive pixel replenishment
(Intra RPR).
The proposal uses a displacement vector with area extrapolation for intra coding.
Preliminary partial simulation results reportedly showed that this proposed technique has an average
of 0.8%, 0.2%, 1.3% BD-rate improvement, respectively at Y, U and V, against TMuC 0.1, with
about double encoding time. Only the encoding the first 20 pictures of each sequence was tested.
The reported maximum savings per sequence was 4.8% (0.22 dB) on the Basketball drill relative to
TMuC 0.1. The contributor indicated that they are working on integration into a newer TMuC
version.
The design, as presented, was somewhat preliminary – some modifications were planned (making
the prediction size adaptive, and harmonization with other intra coding tools).
Further study was encouraged.
18.8.1.1.1.1.1.1.6
JCTVC-C207 Encoder improvement of unified intra prediction [Y. Piao, J.-H.
Min, J. Chen (Samsung)]
223
This contribution proposed an encoding scheme for unified directional intra prediction (UIP) which
was introduced in JCTVC-B100.
This was only a non-normative proposal – only the encoder would be changed – not the decoder.
Bit rate reductions of 0.5% for HE all-intra and 0.3% for LC all-intra were reported relative to the
TMuC method, with lower search complexity.
The results were not cross-verified.
Two variants were proposed.
Decision: The S1 (9-9-4-4-5) scheme was adopted as the HM encoder method, with additional
further study planned, and the S2 scheme will additionally be available in the HM software as a
selectable configuration.
18.8.1.1.1.1.1.1.7
JCTVC-C218 Encoding complexity reduction for intra prediction by disabling
NxN partition [J. Kim, Y. Jeon, B. Jeon (LG Electronics)]
This contribution proposed a simplification of the intra prediction process to reduce encoding
complexity. In the current TMuC specification, two kinds of splitting, 2Nx2N and NxN, are allowed
for intra prediction and it is signaled by intra_split_flag. In the proposed method, NxN is not
allowed for each intra prediction unit (PU) except when the PU has the maximum depth. The
proposed modification reportedly achieves approximately 4.5% encoding time reduction with 0.2%
loss in BD bit rate under the intra HE setting of JCTVC-B300. When the rotational transform
(ROT) and combined intra transform (CIP) are turned off, the proposed method reportedly achieves
about 36% encoding time reduction with 0.4% loss in BD bit rate relative to the anchor.
It was noted that this proposal includes syntax modification.
It was remarked that the software includes a macro switch for a similar treatment in the inter case.
Further study was suggested – perhaps as part of block structure testing rather than intra prediction.
18.8.1.1.1.1.1.1.8
JCTVC-C234 Simplified intra smoothing [Y. Zheng, M. Coban, M.
Karczewicz (Qualcomm)]
In this contribution, a simplified intra smoothing scheme was proposed to improve the current AIS
scheme. In the proposed scheme, the prediction samples are filtered according to the prediction unit
size and the intra prediction mode. The proposed scheme reportedly achieves 0.4% BD bit rate
reduction on average for HE and LC intra configurations, relative to not using any intra smoothing.
The encoding complexity was reportedly reduced by about 50% relative to the prior AIS scheme.
No syntax is used to indicate whether to filter or not.
Visual quality was not studied.
Random access and low delay configurations were not tested.
The late document JCTVC-C316 reportedly contained a cross-verification report.
Further study was encouraged.
18.8.1.1.1.1.1.1.9
JCTVC-C316 Verification Results of Simplified Intra Smoothing (JCTVCC234) [T. Yamakage (Toshiba)]
The contents of this late document were verbally reported during the presentation of JCTVC-C234.
This contribution reported verification results of JCTVC-C234 (Simplified Intra Smoothing) by
Qualcomm. Not all results had not been checked, but the reported results were indicated to be
similar to Qualcomm’s results.
18.8.1.1.1.1.1.1.10 JCTVC-C302 Improved intra smoothing for UDI and new AIS fast mode [T.
Nguyen, B. Bross, D. Marpe, T. Wiegand (HHI)] (late registration, missing prior,
available first day)
In this contribution, an adaption of an intra smoothing scheme for UDI was proposed to improve the
overall coding performance. Furthermore, this contribution presented modified fast mode for AIS.
224
For when AIS is not in use, the contribution proposed a LUT without syntax to determine when
filtering is to be applied. For when AIS is active, the contribution proposed a syntax indicator using
the table as part of the entropy coding, and a fast encoding search method to guide the selection.
Relative to not using filtering, the LUT-only scheme in the proposal was reported to provide about
0.3% for HE and LC all-intra.
Relative to not using filtering, the scheme using syntax in the proposal was reported to provide
about 0.8% for HE and 0.9% for LC in all-intra coding.
It was suggested that visual quality effects would be desirable to assess.
The contributor did not suggest to put this into the TM, but rather to do further study.
18.8.1.1.1.1.1.1.11 JCTVC-C269 A study on the impact of intra smoothing [A. Tabatabai, C.
Auyeung, T. Suzuki (Sony)]
Please see the discussion of this contribution in section 15.2.
18.9
Transforms and residual coding
18.9.1 Residual segmentation
18.9.1.1.1.1.1.1.1
JCTVC-C068 Improved side information signaling for QVBT in TMuC [B. Lee,
M. Kim (KAIST), J. Kim, H.-Y. Kim (ETRI)] (missing prior, available first day)
In this contribution, new syntax was proposed for the quadtree based transform to reduce side
information for the transform in TMuC. The available maximum and minimum transform size in
the current TMuC is 64x64 and 4x4 respectively, which were implemented with the quadtreestructure manner in the current version of TMuC. The structure is reportedly well adapted to the
characteristics of the input signal while the signaling for transform types may cause large amounts
of side information as the depth of the quadtree increases. Moreover, the coded_block_flags (cbf)
for luma and chroma components are another overhead for the quadtree transform structure. The
contribution proposed a modification of the side information including split_transform_unit_flag
and cbf. The encoding and decoding for split_transform_unit_flag was proposed to be skipped when
the cbf for luma (Y) and chroma (UV) are all zero. A new flag with a single bit was proposed to
signal the patterns of the quantized coefficient for luma and chroma components.
The work reported was preliminary and no overall gain was shown.
JCTVC-C277 Redundancy reduction in Cbf and merge coding [B. Li, J. Xu, F. Wu, G. J.
Sullivan (Microsoft), H. Li (Univ. of Sci. &Tech. China)]
In the current TMuC, tree based coding is widely used to provide a hierarchical representation of
information. In some cases, the attribute of one block can be derived from the attributes of the
parent block and other "brother" blocks so that the encoder does not have to code the attribute. This
document presented methods to reduce redundancy in coding Cbf and merge. Remarks recorded
during the review of this contribution included the following:

Decision: The first element in the proposal contribution was regarding saving CBF bits
depending on split decisions gives 0.1-0.2% reduction. It was agreed to adopt this – it adds
one more condition in parsing and seems straightforward.

The second element of the proposal contribution was regarding PU based merging and CU
based merging. Further study of this aspect was suggested, and it was suggested that we
would need a more precise description relative to the upcoming TMuC text and that we may
have other solutions e.g. context modeling.
225
18.9.2 Transform complexity reduction
18.9.2.1.1.1.1.1.1
JCTVC-C096 Low complexity rotational transform [F. C. A. Fernandes
(Samsung)]
The Rotational Transform (ROT) is a secondary transform that improves coding efficiency and is
implemented in the JCT-VC Test Model under Consideration (TMuC). This contribution proposed
a lifting-factorization complexity reduction for the 8x8 ROT. The ROT is split into compound
Given’s rotation matrices which are then each factored into a triplet of integer lifting matrices. The
lifting-matrix coefficients can reportedly be implemented in hardware with fewer elemental adders
than required for a TMuC ROT hardware implementation. With BD-rate of -0.003% this technique
reportedly achieves 46% and 21% reduction in the elemental-adder count of forward and inverse
ROT implementations, respectively.
The contributor recommended that the JCT-VC evaluate this transform in a tool-evaluation
experiment or a core experiment.
18.9.2.1.1.1.1.1.2
JCTVC-C112 Fast integer transforms for the HEVC test model [W. Dai, M.
Krishnan, P. Jesudhas, P. Topiwala (FastVDO)]
Beyond the 4x4 and 8x8 transform sizes already found in AVC, larger transform sizes of 16x16,
32x32 and 64x64 have been included in the Test Model under Consideration (TMuC) for HEVC.
However, since large transform sizes have non-trivial computational complexity, one may ask what
is the cost-benefit analysis of their inclusion in the forthcoming Test Model. The larger transforms
in TMuC are all reportedly based on Chen's fast DCT algorithm because of its regular butterfly
structure and its extensibility to any desired transform sizes of order N=2m with m>=1. For
complexity reasons, integer rather than floating-point transform computation is performed. In this
document, it was reported that the complexity can be reduced with no loss of performance. Fast
transforms of each candidate size were proposed that reportedly not only provide virtually identical
coding performance, but offer useful gains in computational complexity. In this way, the
complexity of the transforms can reportedly be kept to a minimum, even when using large
transform sizes.
It was remarked that lifting techniques tend to convert parallel operations into serial ones. However,
it was noted that pipelining can mitigate that.
A participant asked how to best evaluate complexity – this is more than just counting operations.
18.9.2.1.1.1.1.1.3
JCTVC-C255 DCT+Hadamard low complexity large transform for Inter
coding [M. Budagavi, A. Gupte (TI)]
This contribution proposed a class of transforms for large block sizes which is a combination of
DCT+Hadamard transforms for reducing computational complexity of Inter transforms. The best
performing 32x32 transform in this group reportedly provides most of the coding gains of 32x32
DCT for high-efficiency configurations but with 50% reduction in number of multiplications
assuming direct matrix multiplication DCT. If Chen’s DCT implementation is assumed, then the
number of multiplications is reportedly reduced by 24%.
18.9.2.1.1.1.1.1.4
JCTVC-C209 Low-complexity 16x16 and 32x32 transforms and partial
frequency transform [Y.-M. Hong, M.-S. Cheon, I.-K. Kim (Samsung)]
This contribution proposed a new 32-point fast DCT scheme based on Loeffler’s design principles.
Fast integer realization of the 16-point and 32-point transform were provided based on the proposed
transform. In addition, a partial frequency transform scheme was proposed to reduce the complexity
of the transforms of the current TMuC further. The proposed approaches reportedly reduce the
number of operations significantly with negligible performance loss compared to the current TMuC.
18.9.2.1.1.1.1.1.5
JCTVC-C237 Reduced complexity 32x32 transform by coefficient zero-out [J.
Sole, R. Joshi, M. Karczewicz (Qualcomm)]
226
Large block size transforms (up to 64×64) are being considered for HEVC for improving coding
efficiency. However, such transforms may be difficult and/or costly to implement in hardware. This
proposal presented an approach to simplifying the 32×32 transform by zeroing out the high
frequency coefficients. The simplification reduces the number of transform coefficients by 75℅ and
reportedly attains performance close to the full 32×32 transform. The loss in terms of BD-rate is
0.14℅ for random access, high efficiency configuration and 0.12℅ for low-delay, high efficiency
configuration.
Similar to JCTVC-C209, this contribution suggested computation of only the lower frequency
coefficients of a large block size transform.
18.9.2.1.1.1.1.1.6
JCTVC-C117 Implementation analysis of transform block size [Y. Yu
(Broadcom)]
This contribution analyzed the complexity of large transform block sizes, especially for a generic
hardware implementation. Based on the analysis in this contribution, the hardware cost for a 64x64
transform block was reported to be roughly eight times that of an 8x8 transform block, which is the
maximum transform block size of the AVC coding standard. The hardware cost for a 32x32
transform block was reported to be roughly four times that of an 8x8 transform block. The analysis
also provides a compression gain comparison between different transform block sizes. The
experimental results reportedly show that 80% to 90% of the total compression gain (2-4%) from
large transform block sizes can be captured by 16x16 and 32x32 transform block sizes. Given the
cost and compression gain, and that HEVC is also attempting to reduce overall coding complexity,
this contribution suggests limiting the maximum transform block size to either 32x32 or 16x16.
It was remarked that some of the impact depends on throughput requirements – e.g., whether to
process stages of a transform serially or in parallel.
The primary focus of the contribution was on hardware implementation – software may behave
differently.
It was noted that quantization is a closely related topic together with transforms.
18.9.2.1.1.1.1.1.7
JCTVC-C226 Low-complexity configurable transform architecture for HEVC
[M. Sadafale, M. Budagavi (TI)]
This contribution proposed a matrix multiplication architecture for DCT/IDCT implementation that
is configurable and can be re-used across various transform block sizes for HEVC. Matrix
multiplication implementation reportedly has the advantage that it is friendly to parallel processing
with minimal dependency and control logic. In hardware, matrix multiplication reportedly results in
low-area architecture, while in software it reportedly leads to efficient implementation on SIMD
processors. Another asserted advantage of matrix multiplication architecture is that it is a unifying
architecture in the sense that is flexible enough to support other transforms being considered in
HEVC such as directional and 1D transforms. Also matrix multiplication reportedly has better
fixed-point behavior than Chen’s DCT/IDCT which reportedly allows for elimination of the
existing quantization matrices in TMuC. The memory requirement for storing dequantization
matrices in the TMuC decoder reportedly goes down from 7.5 KB to 12 bytes. There is a similar
reported memory savings in the TMuC encoder. A fixed-point version of matrix multiplication
DCT/IDCT along with reduced size quantization/dequantization matrices optimization was
implemented in TMuC-0.7.3. Simulation results reportedly indicate that there is no significant loss
in coding efficiency (average 0.0 to -0.1%) when compared to the Chen DCT/IDCT factorization in
TMuC-0.7.3.
The proponent acknowledged that integration of this scheme into the TMuC would cause a
substantial increase in encoder and decoder software simulation runtimes.
227
18.9.3 Alternative types of transforms
18.9.3.1.1.1.1.1.1
JCTVC-C108 Jointly optimal intra prediction and adaptive primary transform
[A. Saxena, F. C. A. Fernandes (Samsung)]
This document proposed applying either a conventional discrete cosine transform (DCT) or a
derived discrete sine transform (DST) for intra prediction as a primary transform in TMuC 0.7. For
each of the upto 34 modes in unified intra-prediction in TMuC 0.7, the derived transform (separable
along the horizontal and vertical directions) was asserted to be theoretically optimal with
performance close to KLT. The proposed primary transform is based on the intra-prediction modes
with no additional signaling information, works in a single-pass and reportedly does not incur any
additional computational complexity. No training was required to derive the transform. It requires
the storage of one sine matrix in addition to the conventional DCT at each block size. Comparison
is performed in TMuC 0.7 between the proposed adaptive DCT/DST primary transform and the
conventional DCT for two cases, viz., when the secondary transform in intra prediction, i.e.,
rotational transform is off and on. Simulation results reportedly show a BD Rate improvement of
0.2% and 0.1% averaged over all the video sequences, when the rotational transform is off and on,
respectively. Only 25 frames of each video sequence were initially tested. Data was later collected
for longer sequences, and was verbally reported to be consistent with the smaller test results.
No syntax is used to indicate the selection of transform type.
There was a previous similar proposal from I2R (see JCTVC-C037). This proposal is basically the
same idea, applied to other block sizes. The contributor indicated that applying it to all block sizes
may improve the results.
Further study was encouraged.
18.10 IBDI and memory compression
See also the section on TE2.
18.10.1.1.1.1.1.1.1 JCTVC-C077 A framework for standardization of memory compression [T.
Chujoh, T. Yamakage (Toshiba)]
A framework for standardization of memory compression was proposed. A purpose of the
contribution was to discuss how to standardize memory compression. Memory compression can
reduce the bandwidth of memory access; however, most memory compression techniques depend
on hardware architectures and the process of memory compression causes a degradation of coding
efficiency and an increase of computational complexity for most implementations. This contribution
does not propose a common memory compression algorithm, but rather proposes a framework of
memory compression. Since the essence of a conformance point is the output of the decoding
process, it was suggested that a bitstream format and a bit rate not be specified; rather, a
specification of the distortion and a system to control the distortion would be specified in this
framework. The purpose of this framework is to enable introducing arbitrary memory compression
effects into the encoder and the decoder in a matching fashion.
It was discussed whether encoder/decoder mismatch should be allowed. Perhaps not, but it may be a
question worth discussing.
In the discussion, the creation of a related AHG was suggested.
18.10.1.1.1.1.1.1.2 JCTVC-C094 An extension of DPCM-based memory compression to 2-D
structure with ADPCM [H. Aoki, K. Chono, K. Senzaki, J. Tajime, Y. Senda (NEC)]
In this contribution, a 2-D extension of the previously-proposed 1-D DPCM-based memory
compression method was presented. According to some previous reports, there are conditions where
2-D image mapping to frame memory provides better memory access bandwidth performance. In
such cases, memory compression with a 2-D structure may be more straightforward and may
improve coding efficiency. In the proposed 2-D extension, the top-most and left-most samples in
228
every memory compression unit are coded in the same manner as the 1-D case, and other samples
are coded with ADPCM. The 1-D DPCM-based method can be regarded as a specialized version of
the proposed 2-D ADPCM-based method, where its unit height is one. For each sample coded with
ADPCM, the prediction value is selected in the same manner as the DC and AC prediction of the
MPEG-4 part 2 standard. Experimental results were reported to have shown that the proposed 2-D
ADPCM-based memory compression method has higher coding efficiency than the previouslyproposed 1-D DPCM-based memory compression method. Average coding losses were reported to
be 7.9% for test cases without IBDI and 1.5% for test cases 12-bit IBDI, while those of the 1-D
methods are 10.0% and 1.9%, respectively. The average increase of decoding time was reported to
be 3.8%. Memory bandwidth reduction ratios were reported as 45.1% for test cases without IBDI
and 44.3% for test cases with 12-bit IBDI. It was proposed for the method to be adopted in TMuC
software with the previously-proposed method in JCTVC-C093, and to be further evaluated in
various aspects.
18.10.1.1.1.1.1.1.3 JCTVC-C095 Performance improvement of DPCM-based memory
compression by adaptive quantization [H. Aoki, K. Chono, K. Senzaki, J. Tajime, Y.
Senda (NEC)]
This contribution presented a method for improving coding efficiency of DPCM-based memory
compression. In the proposed method, a nonlinear quantization matrix for each memory
compression unit (MCU) was adaptively selected from multiple quantization matrices so as to fit
quantizer characteristics to image characteristics of each MCU. Each index to the selected
quantization matrix is embedded into the compressed data and signaled to the frame memory
decompressor. The actual frame memory size and memory access bandwidth reportedly does not
increase as the result of this additional signaling, since the indices can reportedly be embedded into
a gap area for memory alignment. Experimental results have reportedly shown that the proposed
adaptive quantization can reduce the coding loss from 10.7% to 6.9% for test cases without IBDI,
and from 2.0% to 1.3% for test cases with 12-bit IBDI, respectively. When applied to the 2-D
version presented in JCTVC-C094, more gains were reported to have been achieved and the coding
loss can reportedly be reduced to 4.8% for test cases without IBDI and to 1.0% for test cases with
12-bit IBDI, respectively. The average increase of decoding times against anchors was reported to
be 14.7% for LC cases and 12.4% for HCE cases when four matrices were used. It can reportedly
be shown that the proposed adaptive quantization provides flexibility on the trade-off between
coding efficiency and complexity to the DPCM-based memory compression scheme.
Anomolous results were reported for Class E coding effects of memory rounding. This should be
investigated.
18.11 Complexity analysis
JCTVC-C228 A simple cache model for measuring motion compensation bandwidth [M.
Budagavi (TI)]
Motion compensation reference data accesses dominate the external memory bandwidth
requirements in video decoder system. External memory access bandwidth requirements are
expected to increase with HEVC since HEVC motion compensation interpolation filters are longer
than those in AVC and since HEVC is expected to be used with larger resolution video such as
4Kx2K and 8Kx4K. External memory bandwidth used directly impacts power consumed and cost;
in fact, power consumed due to external memory accesses in a video decoder can reportedly equal
the power consumed by computational processing in video decoding. Hence techniques that reduce
external memory bandwidth are desirable. Memory compression is one technique that is being
studied in TE2 and Memory compression AhG for reducing external memory accesses. This
contribution proposed a cache model for use in measuring external memory access bandwidth. The
cache model is built on top of NEC’s memory bandwidth measurement module in TMuC-0.7. The
229
main asserted goal of this work was to have a simple cache model that captures the most relevant
parameters for comparing bandwidth consumed with and without memory compression and that is
fair to all block structures used in memory compression. The goal was reportedly not to develop a
very sophisticated cache model. This cache model can also be used to compare memory bandwidth
requirements of various interpolation filters. The cache model was reportedly developed as a part of
Memory Compression AhG.
It was agreed to include this in the reference software.
The AHG on complexity analysis should perform an example study e.g. for multi-pass and singlepass loop filter optimization to assess the validity of the measurement.
JCTVC-C072 TE2: Improved memory bandwidth measurement model for reference frame
compression [J. Chen, X. Shen, L. Yu (Zhejiang Univ.)]
A memory-bandwidth measurement model for reference frame compression algorithm was
proposed. This proposal analyses reference frame decompression processes and divides them into
parsing process and decoding process, in which three new concepts, parsing unit, decoded area and
decoding area, are defined. Based on this, a memory access area conduction method had been
designed to determine the memory access area.
It was asserted that the proposed model can still be efficient when memory compression unit is not
equal to decompressible unit. It expands the scope of application and can measure memory
bandwidth for more general reference frame compression algorithms.
The contribution was noted.
It was remarked that this is based on software that was provided to TE2 participants.
It was suggested to work with TE coordinators to determine its appropriateness for upcoming
experiments.
19 Establishment of ad hoc groups
The ad hoc groups established to progress work on particular subject areas until the next meeting
are described in the table below. The discussion list for all of these ad hoc groups will be the main
JCT-VC reflector (jct-vc@lists.rwth-aachen.de).
Title and Email Reflector
Chairs
Mtg
JCT-VC project management
(jct-vc@lists.rwth-aachen.de)
 Coordinate overall JCT-VC interim efforts
 Report on project status to JCT-VC reflector
 Provide report to next meeting on project
coordination status
G. J. Sullivan, J.-R. Ohm (cochairs)
N
230
HEVC Draft and Test Model editing
(jct-vc@lists.rwth-aachen.de)
 Produce and finalize JCTVC-C402 HEVC Test
Model 1 (HM 1) Encoder Description
 Produce and finalize JCTVC-C403 HEVC text
specification Working Draft 1
 Gather and address comments for refinement of
these documents
 Coordinate with the Software development and HM
software technical evaluation AhG to address issues
relating to mismatches between software and text
T. Wiegand, K. McCann (cochairs), B. Bross, W.-J. Han, J.R. Ohm, J. Ridge, S. Sekiguchi,
G. J. Sullivan (vice chairs)
Software development and HM software technical
F. Bossen (chair),
evaluation
D. Flynn, K. Sühring (vice chairs)
(jct-vc@lists.rwth-aachen.de)
 Coordinate development of the HM software and its
distribution to JCTVC members
 Produce documentation of software usage for
distribution with the software
 Deliver TMuC 0.9 software version and the
reference configuration encodings according to
JCTVC-C500 for use in common conditions
experiments.
 Edit JCTVC-C500 to reflect configurations required
to run the software in "HM mode".
 Deliver HM 1.0 and corresponding reference
configuration encodings according to JCTVC-C500.
(HM 1.0 will be a stripped version of TMuC 0.9
wherein tools not included in HM are removed from
the codebase.)
 Coordinate with HEVC Draft and Test Model
editing AhG to identify any mismatches between
software and text
231
N
N
Slice support development and characterization
R. Sjöberg (chair),
Y. Chen, K. Kazui (vice chairs)
(jct-vc@lists.rwth-aachen.de)
 Implement basic slice support (independentlydecodable sequences of largest coding units in
raster-scan order) into the HM software.
 Identify issues relating to the draft text description
of slice functionality
 Coordinate the integration of slice support with the
software development and HM software technical
evaluation ad hoc group
 Study technical proposals relating to slice structured
coding
 Study the coding efficiency and loss resilience
impact of slice-structured coding
 Identify and discuss additional issues relating to
slice-structured coding
Spatial Transforms
(jct-vc@lists.rwth-aachen.de)
 Study the transforms in the HM design, including
compression performance, computational
complexity, dynamic range, storage requirements,
etc.
 Perform analysis of block transform design and
architecture, including software and hardware
considerations.
 Discuss transform-related Core Experiments, and
identify potential synergies or incompatibilities
related to the tools being tested in the CEs.
 Report the results and conclusions of these
discussions and experiments to the JCT-VC.
P. Topiwala (chair),
M. Budagavi, R. Cohen, R. Joshi
(vice chairs)
In-loop and post-processing filtering
T. Yamakage (chair),
K. Chono, Y. J. Chiu, I. S. Chong,
(jct-vc@lists.rwth-aachen.de)
M. Narroschke (vice chairs)
 Study enhancement schemes of in-loop filtering,
including de-blocking/de-banding/de-noising filters,
and adaptive Wiener-based filters including variants
with various inputs, combination of filters
 Study trade-offs and characteristics of filter designs
including complexity and subjective and objective
performance
 Discuss relationships and evaluation procedures for
the filtering techniques
 Identify possibilities for harmonization of enhanced
in-loop filtering technologies
 Study the relationship between in-loop and postprocessing filtering
232
N
N
Coding block structures
(jct-vc@lists.rwth-aachen.de)
 Study techniques of HM relating to coding block
structure
 Characterize the trade-offs involved in coding block
structure issues, including complexity, redundancy
and compression performance aspects
 Identify opportunities for harmonization and
simplification of coding block structure
K. Panusopone (chair), W.-J. Han,
T. K. Tan, T. Wiegand (vice
chairs)
N
Reference pictures memory compression
(jct-vc@lists.rwth-aachen.de)
 Study motion compensation memory access
bandwidth of HM design
 Study coding efficiency loss caused by fixed
rounding of extended bit depth values for reference
picture storage.
 Study the visual quality impact of reference picture
memory compression
 Study use cases for memory compression
techniques
 Study the appropriate scope of standardization for
memory compression
 Report on conclusions reached
K. Chono (chair),
T. Chujoh, C. S. Lim,
A. Tabatabai, M. Zhou
(vice chairs)
N
Entropy coding
(jct-vc@lists.rwth-aachen.de)
 Study the entropy coding complexity and
compression characteristics of CABAC, LCEC,
PIPE/V2V, and other entropy coding designs
 Characterize throughput, memory, silicon area,
power requirements, etc.
 Study parallel context processing, syntax-element
partitioning, and other parallelism approaches for
entropy coding
 Study and develop approaches for hardware and
software evaluation of entropy coding methods
 Identify and discuss additional issues on entropy
coding
M. Budagavi (chair),
G. Martin-Cocher, A. Segall
(vice chairs)
N
233
Entropy slices
(jct-vc@lists.rwth-aachen.de)
 Study entropy slice coding methods
 Evaluate the compression performance and other
characterstics of entropy slices relative to
conventional slices
 Analyze impacts of throughput, latency, and
memory bandwidth for entropy slice methods for
various architectures
 Develop and perform experiments evaluating
entropy slice methods
 Identify and discuss additional issues relating to
entropy slices
A. Segall (chair)
V. Sze, Y.-W. Huang (vice chairs)
N
Video test material selection
(jct-vc@lists.rwth-aachen.de)
 Identify, collect, and make available a variety of
video sequence test material
 Study the coding performance and characteristics of
test materials
 Identify and recommend appropriate test materials
and corresponding test conditions for use in HEVC
development
T. Suzuki (chair)
N
Complexity assessment
(jct-vc@lists.rwth-aachen.de)
 Investigate measurement criteria to assess the
complexity of encoder and decoder algorithmic
design elements, including computational
complexity, parallelism, memory bandwidth,
memory capacity, dynamic range requirements, etc.
 Identify and recommend tools for algorithmic
complexity assessment
D. Alfonso (chair),
J. Ridge, X. Wen (vice chairs)
N
Motion compensation interpolation
(jct-vc@lists.rwth-aachen.de)
 Study the coding efficiency and complexity
characteristics of proposed interpolation filtering
methods
 Study the potential for using a unified motion
compensation interpolation approach for high
efficiency and low complexity configurations
 Draft proposed core experiments relating to motion
compensation interpolation
 Identify and discuss additional issues relating to
motion compensation filtering
K. Ugur (chair),
E. Alshina, P. Chen, T. Chujoh
(vice chairs)
N
234
20 Core Experiment planning and Common Conditions
20.1
General issues for CEs
Because the first working draft and HEVC test model (referred to as the HM) have now been
selected, group coordinated experiments are now referred to as "core experiments" rather than "tool
experiments".
A preliminary CE description is to be approved at the meeting at which the CE plan is established.
It is possible to define sub-experiments within particular CEs, for example designated as CE X.a,
CE X.b, etc., for a CE X, where X is the basic CE number.
As a general rule, it was agreed that each CE should be run under the same testing conditions using
one software codebase, which should be based on the HM software codebase. The general agreed
common conditions for experiments were described in the output document JCTVC-C500.
A deadline of twp weeks after the meeting was established for organizations to express their interest
in participating in a CE to the CE coordinators and for finalization of the CE descriptions by the CE
coordinator with the assistance and consensus of the CE participants.
Final CEs should clearly describe specific tests to be performed, not describe vague activities.
Activities of a less specific nature are delegated to Ad Hoc Groups rather than designated as CEs.
Experiment descriptions should be written in a way such that it is understood as a JCT-VC output
document (written "third party perspective", not a company proponent perspective). The experiment
descriptions should generally not express opinions or suggest conclusions – rather, they should just
describe what technology will be tested, how it will be tested, who will participate, etc.
CE descriptions should not contain verbose descriptions of a technology (at least not unless the
technology is not adequately documented elsewhere). Instead, the CE descriptions should refer to
the relevant proposal contributions for any necessary further detail. However, the complete detail of
what technology will be tested must be available – either in the CE description itself or in
referenced documents that are also available in the JCT-VC document archive.
Those who proposed technology in the respective context (by this or the previous meeting) can
propose a CE or CE sub-experiment. Harmonizations of multiple such proposals and minor
refinements of proposed technology may also be considered. Other subjects would not be
designated as CEs.
Any technology must have at least one cross-check partner to establish a CE – a single proponent is
not enough. It is highly desirable have more than just one proponent and one cross-checker.
It is strongly recommended to plan resources carefully and not waste time on technology that may
have little benefit – it is also within the responsibility of the CE coordinator to take care of this.
CE participants are obligated to submit a report of the results of their experiment testing and any
further analysis of the subject matter. Each participant is required to submit a report – measures will
be taken if this principle is violated without good reason.
A summary report written by the coordinator (with the assistance of the participants) is also
expected to be provided to the subsequent meeting.
Non-final CE plan documents were reviewed and given tentative approval during the meeting.
The particular CEs planned at this meeting were documented in the documents numbered JCTVCC5xx, with xx = 01, 02, etc.
The CE description for each planned CE is described in an associated output document JCTVCB3xx for CExx, where "xx" is the CE number.
It must be understood that the JCT-VC is not obliged to accept the test methodology or outcome of
a CE as being adequate. Good results from a CE do not impose an obligation on the group to accept
the result (e.g., if the expert judgment of the group is that further data is needed or that the test
methodology was flawed).
235
20.2
Common Conditions
Preferred Common Conditions for experiment testing that are intended to be appropriate for both
CEs and other experiments were selected by the group and described in output document JCTVCC500.
21 Output documents
The following documents were agreed to be produced as outputs of the meeting. Names recorded
below indicate those responsible for document production.
21.1.1.1.1.1.1.1.1
JCTVC-C400 Meeting Report of 3rd JCTVC Meeting [G. J. Sullivan, J.-R.
Ohm]
21.1.1.1.1.1.1.1.2
JCTVC-C401 The process of Test Model development for the HEVC
initiative [G. J. Sullivan, J.-R. Ohm]
21.1.1.1.1.1.1.1.3
JCTVC-C402 High Efficiency Video Coding (HEVC) Test Model 1 (HM 1)
Encoder Description [K. McCann (primary), B. Bross, S. Sekiguchi]
21.1.1.1.1.1.1.1.4
JCTVC-C403 High Efficiency Video Coding (HEVC) text specification
Working Draft 1 [T. Wiegand (primary), W.-J. Han, G. J. Sullivan, J.-R. Ohm]
21.1.1.1.1.1.1.1.5
JCTVC-C404 HEVC Software guidelines [D. Flynn, K. Sühring]
21.1.1.1.1.1.1.1.6
JCTVC-C405 Summary of HEVC Model (HM) and Working Draft 1 [T.K. Tan,
G. J. Sullivan , J.-R. Ohm]
21.1.1.1.1.1.1.1.7
JCTVC-C500 Common HM test conditions and software reference
configurations [F. Bossen]
21.1.1.1.1.1.1.1.8
JCTVC-C501 Core Experiment 1: Decoder-side Motion Vector Derivation
[Y.-J. Chiu (primary), H. Yu, Y.-W. Huang, S. Sekiguchi]
21.1.1.1.1.1.1.1.9
JCTVC-C502 Core Experiment 2: Flexible Motion Partitioning [E. Francois
(primary), X. Zheng, P. Chen]
21.1.1.1.1.1.1.1.10
JCTVC-C503 Core Experiment 3: Interpolation for MC (Luma) [T. Chujoh]
21.1.1.1.1.1.1.1.11 JCTVC-C504 Core Experiment 4: Interpolation for MC (Chroma) [E. Alshina
(primary), Y.-L. Lee, P. Chen, H. Lakshman]
21.1.1.1.1.1.1.1.12 JCTVC-C505 Core Experiment 5: Low Complexity Entropy Coding
Improvement [X. Wang]
21.1.1.1.1.1.1.1.13 JCTVC-C506 Core Experiment 6: Intra prediction improvements [A.
Tabatabai (primary), A. Tanizawa, A. Segall, H. Yu, S. Lei, J. Xu, J. Chen, Y.-J.
Jeon, G. Li, M. Mrak]
21.1.1.1.1.1.1.1.14 JCTVC-C507 Core Experiment 7: Alternative transforms [R. Cohen (primary),
C. Yeo, R. Joshi]
21.1.1.1.1.1.1.1.15 JCTVC-C508 Core Experiment 8: In-Loop filtering [T. Yamakage (primary), T.
Chujoh, Y. W. Huang, K. Chono, I. S. Chong, T. Yamamoto, J. Lim, B. Jeon, J. Xu,
M. Narroschke]
21.1.1.1.1.1.1.1.16
JCTVC-C509 Core Experiment 9: MV coding [J. Jung, B. Bross]
236
21.1.1.1.1.1.1.1.17 JCTVC-C510 Core Experiment 10: Number of Intra Prediction Directions [K.
Sugimoto]
21.1.1.1.1.1.1.1.18 JCTVC-C511 Core Experiment 11: Coefficient Scanning and Coding [V. Sze
(primary), K. Panusopone, J. Chen, T. Nguyen, M. Coban]
21.1.1.1.1.1.1.1.19 JCTVC-C512 Core Experiment 12: Adaptive Motion Vector Resolution [W.-J.
Chien]
21.1.1.1.1.1.1.1.20 JCTVC-C513 Core Experiment 13: Intra Smoothing [M. Coban (primary), B.
Bross, J. Chen]
22 Future meeting plans and closing of the meeting
Future meeting plans were kept as previously planned:

Meeting under ITU-T SG 16 auspices when it meets (starting meetings on the Wednesday of
the first week and closing it on the Wednesday of the second week of such meeting), and

Otherwise meeting under ISO/IEC JTC 1/SC 29/WG 11 auspices when it meets (starting
meetings on the Thursday prior to such meetings and closing it on the last day of the WG 11
meeting).
The JCT-VC chairmen proposed to hold the 4th JCT-VC meeting during 20-28 January 2011 under
WG11 auspices in Daegu, KR. Further JCT-VC meetings are proposed to be held during 16-23
March 2011 under ITU-T auspices in Geneva, CH, and during 14-22 July 2011 under WG 11
auspices in Torino, IT.
The local hosts of the meeting and the WG11 parent body under whose auspices the meeting was
held were thanked for the arrangements provided for the meeting. The local hosts included Sun YatSen University, Peking University, and Guangzhou CHNAVS Digital Technology Co., Ltd.
The meeting was closed at approximately 1:30 p.m. on Friday 15 October.
237
Annex A to JCT-VC report:
List of documents
JCT-VC number
MPEG
number
JCTVC-C001
m18041
2010-09-27
22:20:52
2010-10-07
01:42:48
2010-10-07
01:42:48
JCT-VC AHG report: JCT-VC project management
G. J. Sullivan, J.-R. Ohm
(co-chairs)
JCTVC-C002
2010-09-27
m18042
22:25:08
2010-10-07
04:39:50
2010-10-07
04:39:50
JCT-VC AHG report: Test Model under Consideration
(TMuC) editing
K. McCann (chair), M.
Karczewicz, J. Ridge, S.
Sekiguchi, T. Wedi, T.
Wiegand (vice-chairs)
JCTVC-C003
2010-09-27
m18043
22:27:18
2010-10-13
01:40:29
2010-10-13
01:40:29
JCT-VC AHG report: Software development and TMuC
software technical evaluation
F. Bossen (chair), P. Chen,
D. Flynn, W.-J. Han, K.
Sühring, H. Schwarz, K.
Ugur (vice-chairs)
JCTVC-C004
m18044
2010-09-27
22:29:13
2010-10-02
10:02:13
2010-10-08
10:12:20
JCT-VC AHG report: Alternative transforms
R. Cohen, R. Joshi (cochairs)
JCTVC-C005
m18045
2010-09-27
22:31:47
2010-10-01
15:02:14
2010-10-07
09:04:55
T. Yamakage (chair), Y. J.
JCT-VC AHG report: In-loop and post-processing filtering Chiu, M. Karczewicz, M.
Narroschke (co-chairs)
JCTVC-C006
m18046
2010-09-27
22:34:03
2010-10-04
20:24:30
2010-10-04
20:24:30
JCT-VC AHG report: Large block structures
K. Panusopone (chair), M.
Budagavi, W.-J. Han, D. He
(vice-chairs)
JCTVC-C007
m18047
2010-09-27
22:35:40
2010-10-01
12:46:27
2010-10-05
12:57:57
JCT-VC AHG report: Memory compression
K. Chono (chair), T. Chujoh,
C. S. Lim (vice-chairs)
JCTVC-C008
m18048
2010-09-27
22:40:30
2010-10-05
01:57:23
2010-10-05
01:57:23
JCT-VC AHG report: Parallel entropy coding
M. Budagavi, A. Segall (cochairs)
JCTVC-C009
m18049
2010-09-27
22:47:50
2010-10-07
04:41:43
2010-10-07
04:41:43
JCT-VC AHG report: Screen content coding
J. Xu, W. Ding (co-chairs)
JCTVC-C010
m18050
2010-09-27
22:55:14
2010-10-02
23:17:02
2010-10-02
23:17:02
JCT-VC AHG report: Complexity assessment
D. Alfonso (chair), J. Ridge,
X. Wen (vice-chairs)
JCTVC-C021
m18028
2010-09-25
01:53:37
2010-10-02
23:19:39
2010-10-05
13:03:21
Evaluation result of JCTVC-B031
K. Kazui, J. Koyama, A.
Nakagawa (Fujitsu)
Created
First upload
Last upload
Title
238
Source
JCTVC-C022
m18032
2010-09-25
12:24:14
2010-09-25
12:25:31
2010-09-25
12:25:31
WG11 USNB contribution: Suggested practices for the
HEVC project in the JCT-VC
A. G. Tescher (for USNB to
WG11)
JCTVC-C023
m18033
2010-09-25
12:27:32
2010-09-25
12:28:38
2010-09-25
12:28:38
WG11 USNB contribution: HEVC standardization timeline
A. G. Tescher (for USNB to
WG11)
JCTVC-C024
m18035
2010-09-27
08:40:56
2010-10-01
10:49:44
2010-10-02
11:21:12
TE1: Cross-check result of DMVD proposal JCTVC-C124 T. Murakami, T. Yokoyama,
(Mitsubishi)
K. Nakamura (Hitachi)
JCTVC-C025
m18036
2010-09-27
19:25:11
2010-10-01
11:21:22
2010-10-01
11:21:22
CTB splitting on frame boundary for arbitrary resolution
video
F. Yang, W. Li, S. Wan, Y.
Chang (Xi'dian Univ.)
JCTVC-C026
m18037
2010-10-01
17:46:23
2010-10-02
23:23:44
2010-10-02
23:23:44
Proposal of frame parameters FLCTB and FSCTB for
video content fitting
F. Yang, W. Li, S. Wan, Y.
Chang (Xi'dian Univ.)
JCTVC-C027
m18038
2010-09-27
19:31:40
2010-10-02
23:24:28
2010-10-05
17:05:48
TE7: Verification results of Peking Univ. proposal (modeJ. Kim, B. Jeon (LG
dependent residual reordering for intra prediction
Electronics)
residual)
JCTVC-C028
m18039
2010-09-27
19:34:29
2010-10-02
23:25:17
2010-10-02
23:25:17
TE6: Verification results of Huawei and Hisilicon
proposals (short distance intra prediction using the
correlation between lines and pixels)
J. Kim, B. Jeon (LG
Electronics)
JCTVC-C029
m18040
2010-09-27
19:56:17
2010-10-02
23:25:50
2010-10-02
23:25:50
TE12: Report on 64x64 versus 32x32 maximum coding
unit size for low complexity configuration
P. Pandit (Harmonic)
JCTVC-C030
m18051
2010-09-28
03:15:03
2010-10-04
02:56:39
2010-10-05
08:56:34
TE12: Report on AMP evaluation
E. Francois (Technicolor), L.
Guillo (INRIA), A. Ichigaya
(NHK), H. Yu (Huawei)
JCTVC-C031
m18052
2010-09-28
03:22:04
2010-10-02
10:26:11
2010-10-08
13:16:23
TE3 subtest 2: Report on simplified geometry block
partitioning
E. Francois, D. Thoreau, P.
Bordes (Technicolor), L.
Guillo (INRIA)
JCTVC-C032
m18053
2010-09-28
06:28:12
2010-10-02
23:26:35
2010-10-05
08:34:08
Characteristics of Super Hi-Vision test sequences
Y. Shishikui, Y. Matsuo, A.
Ichigaya, K. Iguchi, S.
Sakaida (NHK)
JCTVC-C033
m18054
2010-09-28
07:07:40
2010-10-02
23:28:06
2010-10-11
11:39:59
TE3: Motion compensation with adaptive warped
reference
S. Park, J. Park, B. Jeon (LG
Electronics)
JCTVC-C034
m18055
2010-09-28
20:36:36
2010-10-02
10:27:44
2010-10-08
03:48:17
TE3 subtest 2: Cross-check of results from Huawei
E. Francois, P. Bordes
(Technicolor)
JCTVC-C035
m18056
2010-09-28
21:04:38
2010-10-02
09:16:00
2010-10-02
09:16:00
TE7: Cross-verification of the 1D directional unified
transform in TMuC
R. Cohen, A. Vetro, H. Sun
(Mitsubishi)
JCTVC-C036
m18057
2010-09-28
21:06:39
2010-10-02
09:16:35
2010-10-09
14:12:55
Alternative performance measurement of MDDT and
ROT in TMuC
R. Cohen, A. Vetro, H. Sun
(Mitsubishi)
239
JCTVC-C037
m18058
2010-09-28
21:22:52
2010-10-02
23:31:56
2010-10-07
17:54:33
TE7: Results for mode-dependent fast separable KLT for C. Yeo, Y. H. Tan, Z. Li, S.
block-based intra coding
Rahardja (I2R)
JCTVC-C038
m18059
2010-09-28
21:23:48
2010-10-02
23:32:29
2010-10-07
17:55:18
TE7: Cross-check results of MDDT simplification
proposal from Huawei
C. Yeo, Y. H. Tan, Z. Li, S.
Rahardja (I2R)
JCTVC-C039
m18060
2010-09-28
21:24:39
2010-10-02
23:32:57
2010-10-07
17:56:13
Choice of transforms in MDDT for unified intra prediction
C. Yeo, Y. H. Tan, Z. Li, S.
Rahardja (I2R)
JCTVC-C040
m18061
2010-09-28
21:25:38
2010-10-02
23:33:57
2010-10-07
17:56:49
Chroma intra prediction using reconstructed luma
C. Yeo, Y. H. Tan, Z. Li, S.
Rahardja (I2R)
JCTVC-C041
m18062
2010-09-28
21:34:51
2010-10-01
09:57:38
2010-10-05
14:01:53
TE12: Performance of partition based illumination
compensation (PBIC)
A. Fujibayashi, S. Kanumuri,
F. Bossen, T. K. Tan (NTT
DoCoMo)
JCTVC-C042
m18063
2010-09-28
21:36:09
2010-10-03
05:22:19
2010-10-03
05:22:19
TE5: Results for simplification of unified intra prediction
T. K. Tan, Frank Bossen
(NTT DoCoMo)
JCTVC-C043
m18064
2010-09-28
21:37:37
2010-10-02
11:02:42
2010-10-02
11:03:26
TE11: Simulation results of merge/skip (3.2b and 3.2d)
Y. Suzuki, F. Bossen (NTT
DoCoMo)
JCTVC-C044
m18065
2010-09-28
21:39:08
2010-10-03
05:23:08
2010-10-03
05:23:08
TE12: Results for experiments on max CU size, RDOQ
and AIS
F. Bossen, T. K. Tan (NTT
DoCoMo)
JCTVC-C045
m18066
2010-09-28
21:42:07
2010-10-02
23:34:53
2010-10-08
05:32:50
TE6.a: Cross verification for line based intra prediction
T. K. Tan, Junya Takiue
(NTT DoCoMo)
JCTVC-C046
m18067
2010-09-28
21:44:15
2010-10-02
23:35:46
2010-10-02
23:35:46
TE5: Summary report on simplification of unified intra
prediction
T. K. Tan (NTT DoCoMo), M.
Budagavi (TI), J. Lainema
(Nokia)
JCTVC-C047
m18068
2010-09-28
21:49:34
2010-10-02
23:36:17
2010-10-02
23:36:17
TE12.4: Transform skip flag test (off vs. on) by
Fraunhofer HHI
B. Bross (Fraunhofer HHI)
JCTVC-C048
m18069
2010-09-28
21:51:55
2010-10-02
23:36:50
2010-10-02
23:36:50
TE12.6: PIPE and LCEC tested against CABAC by
Fraunhofer HHI
B. Bross (Fraunhofer HHI)
JCTVC-C049
m18070
2010-09-28
21:53:13
2010-10-02
23:37:18
2010-10-02
23:37:18
TE12.4: Transform coding HHI tested against Samsung
proposal by Fraunhofer HHI
B. Bross (Fraunhofer HHI)
JCTVC-C050
m18071
2010-09-28
21:54:35
2010-10-02
23:37:53
2010-10-02
23:37:53
TE12.2: Interpolation filter SIFO and DIF tested against
MOMS by Fraunhofer HHI
B. Bross (Fraunhofer HHI)
JCTVC-C051
m18072
2010-09-28
21:55:37
2010-10-02
23:38:24
2010-10-02
23:38:24
TE12.6: PIPE tested against LCEC by Fraunhofer HHI
B. Bross (Fraunhofer HHI)
JCTVC-C052
m18073
2010-09-28
21:57:07
2010-10-02
23:38:53
2010-10-02
23:38:53
TE12.1: Block merging (MRG) test (off vs. on) by
Fraunhofer HHI
B. Bross (Fraunhofer HHI)
JCTVC-C053
m18074 2010-09-28
2010-10-02
2010-10-02
TE12.3: Adaptive intra smoothing (AIS) test (slow vs.
B. Bross (Fraunhofer HHI)
240
21:58:26
23:39:24
23:39:24
fast) by Fraunhofer HHI
TE12.3: Adaptive intra smoothing (AIS) test (off vs. on)
by Fraunhofer HHI
B. Bross (Fraunhofer HHI)
JCTVC-C054
m18075
2010-09-28
21:59:57
2010-10-02
23:39:52
2010-10-02
23:39:52
JCTVC-C055
2010-09-29
m18078
23:45:27
2010-10-02
23:40:24
2010-10-15
10:09:49
Performance report of TMuC for Super Hi-Vision
K. Iguchi, A. Ichigaya, Y.
Shishikui(NHK), S.
Sekiguchi, A. Minezawa
(Mitsubishi Electric)
JCTVC-C056
m18079
2010-09-29
23:57:16
2010-10-02
00:41:39
2010-10-02
00:41:39
TE12: Evaluation of transform unit (TU) size
M. Zhou, V. Sze (TI)
JCTVC-C057
m18080
2010-09-29
23:58:56
2010-10-02
00:42:29
2010-10-02
00:42:29
TE12: Evaluation of IBDI and TPE (transform precision
extension)
M. Zhou (TI)
JCTVC-C058
m18081
2010-09-30
00:01:06
2010-10-02
02:50:32
2010-10-05
11:51:49
TE12: Evaluation of entropy coders: PIPE tested against
V. Sze, M. Budagavi (TI)
CABAC
JCTVC-C059
m18082
2010-09-30
00:03:32
2010-10-02
02:51:15
2010-10-08
06:34:31
TE12: Evaluation of transform coefficient coding
(HHI_TRANSFORM_CODING) with tool breakdown
V. Sze, M. Budagavi, M.
Zhou (TI)
JCTVC-C060
m18083
2010-09-30
00:04:53
2010-10-02
00:43:29
2010-10-04
13:04:42
TE2: TI reference frame compression proposal
M. Zhou, M. Budagavi (TI)
JCTVC-C061
m18084
2010-09-30
00:05:42
2010-10-02
08:38:43
2010-10-09
02:06:09
TE5: TI evaluation of unified intra prediction
simplifications
M. Budagavi (TI)
JCTVC-C062
m18085
2010-09-30
00:06:21
2010-10-02
08:39:45
2010-10-09
09:40:07
TE8: TI parallel context processing (PCP) proposal
M. Budagavi (TI)
JCTVC-C063
m18086
2010-09-30
00:07:15
2010-10-02
02:52:02
2010-10-10
08:13:02
TE8: Evaluation of RIM parallel context processing (PCP)
V. Sze, M. Budagavi (TI)
proposal
JCTVC-C064
m18087
2010-09-30
00:08:17
2010-10-02
02:52:40
2010-10-10
08:19:41
TE8: Evaluation of RIM-V2V entropy coding
V. Sze, M. Budagavi (TI)
JCTVC-C065
m18088
2010-09-30
00:10:37
2010-10-02
00:44:08
2010-10-04
13:05:34
Testing results on the simplified TMuC configurations
M. Zhou (TI)
JCTVC-C066
m18089
2010-09-30
00:11:31
2010-10-02
00:44:35
2010-10-02
00:44:35
Cross verification of Mitsubishi 0.7.4 configuration
M. Zhou (TI)
TE9: Report on large block structure testing
J. Kim, M. Kim (KAIST), J.
Kim, H.-Y. Kim (ETRI), K.
Sato (SONY), X. Shen, L. Yu
(Zhejiang Univ.), K. Choi, E.
S. Jang (Hanyang Univ.), B.
Bross (HHI), W.-J. Han
JCTVC-C067
m18090
2010-09-30
00:24:33
2010-10-03
05:26:12
2010-10-06
19:29:37
241
(Samsung), J.-K. Jo, S.-N.
Park, D. G. Sim, S.-J. Oh
(Kwangwoon Univ.)
JCTVC-C068
m18091
2010-09-30
00:28:31
2010-10-05
15:45:15
2010-10-11
18:16:12
Improved side information signaling for QVBT in TMuC
B. Lee, M. Kim (KAIST), J.
Kim, H.-Y. Kim (ETRI)
JCTVC-C069
m18092
2010-09-30
00:36:29
2010-10-03
05:27:20
2010-10-03
05:27:20
TE3: Test results of second order prediction
[Withdrawn]
S. Li, L. Yu (Zhejiang Univ.)
JCTVC-C070
m18093
2010-09-30
00:38:45
2010-10-02
23:41:19
2010-10-02
23:41:19
TE7: Cross-check for I2R proposal on mode-dependent
fast separable KLT for block-based intra coding
X. Zhu, L. Yu (Zhejiang
Univ.)
JCTVC-C071
m18094
2010-09-30
00:40:05
2010-10-02
23:41:48
2010-10-09
13:12:41
TE10 subset 2: Complexity analysis on Wiener-based in- L. Wang, L. Yu (Zhejiang
loop filters
Univ.)
JCTVC-C072
m18095
2010-09-30
00:41:35
2010-10-02
23:42:24
2010-10-07
09:09:05
TE2: Improved memory bandwidth measurement model
for reference frame compression
J. Chen, X. Shen, L. Yu
(Zhejiang Univ.)
JCTVC-C073
m18096
2010-09-30
00:47:07
2010-10-02
23:42:50
2010-10-08
12:17:08
TE2: Reference frame compression using image coder
C. S. Lim, V. Wahadaniah,
S. Naing, H. W. Sun
(Panasonic)
JCTVC-C074
m18097
2010-09-30
00:48:41
2011-01-15
00:06:27
2011-01-15
00:06:27
TE2: Cross-check of memory compression results from
NEC (JCTVC-B057/JCTVC-C093) [Initially uploaded to
the prior document archive site]
C. S. Lim, V. Wahadaniah,
S. Naing, H. W. Sun
(Panasonic)
JCTVC-C075
m18098
2010-09-30
00:51:25
2010-10-02
05:46:14
2010-10-09
11:53:28
TE2: Adaptive scaling for bit depth compression on IBDI
T. Chujoh, T. Yamakage
(Toshiba)
JCTVC-C076
m18099
2010-09-30
00:53:40
2010-10-01
16:18:45
2010-10-07
04:25:02
TE2: Summary of TE2 on IBDI and memory compression T. Chujoh (TE coordinator)
JCTVC-C077
m18100
2010-09-30
00:54:32
2010-10-02
11:00:51
2010-10-09
11:56:15
A framework for standardization of memory compression
T. Chujoh, T. Yamakage
(Toshiba)
JCTVC-C078
m18101
2010-09-30
00:57:13
2010-10-02
05:48:00
2010-10-09
08:12:21
TE3 subtest 4: High accuracy interpolation filter (HAIF)
T. Chujoh, K. Kanou, T.
Yamakage (Toshiba)
JCTVC-C079
m18102
2010-09-30
00:58:22
2010-10-02
01:23:19
2010-10-09
01:03:09
TE6 subset a: Bidirectional intra prediction
T. Shiodera, A. Tanizawa, T.
Chujoh, T. Yamakage
(Toshiba)
JCTVC-C080
m18103
2010-09-30
00:59:49
2010-10-02
01:35:44
2010-10-08
11:57:55
TE7: One-dimensional directional unified transform
A. Tanizawa, J. Yamaguchi,
T. Shiodera, T. Chujoh, T.
Yamakage (Toshiba)
JCTVC-C081
m18104
2010-09-30
01:01:17
2010-10-02
01:30:04
2010-10-02
01:30:04
TE7: Cross-verification of mode dependent residual
reordering (Peking Univ.)
A. Tanizawa, J. Yamaguchi
(Toshiba)
242
JCTVC-C082
m18105
2010-09-30
01:03:24
2010-10-01
15:03:34
2010-10-10
05:51:17
TE10 subtest 2: Reduction of number of encoding passes T. Yamakage, T. Chujoh, T.
for quadtree adaptive loop filter (QALF)
Watanabe (Toshiba)
JCTVC-C083
m18106
2010-09-30
01:04:51
2010-10-01
15:04:28
2010-10-11
02:34:48
TE10: Summary of TE10 on in-loop filtering
K. Chono, T. Yamakage (TE
coordinators)
JCTVC-C084
m18107
2010-09-30
01:06:25
2010-10-01
15:05:17
2010-10-12
06:38:08
Coding efficiency report of modification by TMuC
draft005
T. Yamakage, T. Chujoh, T.
Watanabe (Toshiba)
JCTVC-C085
m18108
2010-09-30
01:07:49
2010-10-01
15:06:09
2010-10-09
11:33:40
Comparison of loop & post filtering for in-loop and postprocessing filtering AHG
T. Yamakage, T. Chujoh, T.
Watanabe (Toshiba)
JCTVC-C086
m18109
2010-09-30
01:08:53
2010-10-01
15:44:36
2010-10-01
15:44:36
Experimental results of ALF on low complexity
T. Chujoh, T. Yamakage
(Toshiba)
JCTVC-C087
m18110
2010-09-30
01:09:49
2010-10-01
15:46:14
2010-10-01
15:46:14
TE12.2: Experimental results of interpolation filter
T. Chujoh, K. Kanou, T.
Yamakage (Toshiba)
JCTVC-C088
m18111
2010-09-30
01:10:37
2010-10-01
15:47:41
2010-10-01
15:47:41
TE12.7: Experimental results of IBDI
T. Chujoh, T. Yamakage
(Toshiba)
JCTVC-C089
m18112
2010-09-30
01:20:39
2010-10-02
23:43:44
2010-10-08
11:32:37
TE7: Results for mode-dependent residual reordering for X. Zhao, L. Zhang, S. Ma,
intra prediction residual
W. Gao (Peking Univ.)
JCTVC-C090
m18113
2010-09-30
01:30:24
2010-10-02
23:44:30
2010-10-02
23:44:30
TE7: Cross-verification results of mode-dependent fast
separable KLT for block-based intra coding from I2R
X. Zhao, L. Zhang, S. Ma,
W. Gao (Peking Univ.)
JCTVC-C091
m18114
2010-09-30
01:34:44
2010-10-01
21:46:06
2010-10-06
21:58:49
TE10: Conditional joint deblocking-debanding filter
K. Chono, K. Senzaki, H.
Aoki, J. Tajime, Y. Senda
(NEC)
JCTVC-C092
m18115
2010-09-30
01:36:05
2010-10-01
15:49:32
2010-10-04
15:56:10
TE12: Report on combined intra prediction evaluation
K. Chono, K. Senzaki, H.
Aoki, J. Tajime, Y. Senda
(NEC)
JCTVC-C093
m18116
2010-09-30
01:37:18
2010-10-01
20:05:59
2010-10-06
15:55:14
TE2: 1-D DPCM-based memory compression
H. Aoki, K. Chono, K.
Senzaki, J. Tajime, Y. Senda
(NEC)
JCTVC-C094
m18117
2010-09-30
01:38:23
2010-10-01
20:06:55
2010-10-06
18:05:00
H. Aoki, K. Chono, K.
An extension of DPCM-based memory compression to 2Senzaki, J. Tajime, Y. Senda
D structure with ADPCM
(NEC)
JCTVC-C095
m18118
2010-09-30
01:39:11
2010-10-01
20:07:41
2010-10-12
04:12:35
Performance improvement of DPCM-based memory
compression by adaptive quantization
H. Aoki, K. Chono, K.
Senzaki, J. Tajime, Y. Senda
(NEC)
JCTVC-C096
m18119
2010-09-30
01:47:25
2010-10-02
23:45:04
2010-10-07
11:26:21
Low complexity rotational transform
F. C. A. Fernandes
(Samsung)
243
JCTVC-C097
m18120
2010-09-30
01:52:51
2010-10-02
23:45:31
2010-10-08
08:22:26
TE1:Huawei report on DMVD improvements in TMuC
M. Yang, S. Lin, D. Wang, J.
Zhou, H. Yu (Huawei)
JCTVC-C098
m18121
2010-09-30
01:54:56
2010-10-02
23:45:57
2010-10-07
07:58:52
TE1: Cross-checking of DMVD results from Mitsubishi
and Intel
S. Lin, D. Wang, M. Yang, H.
Yu (Huawei)
JCTVC-C099
m18122
2010-09-30
01:58:27
2010-10-02
23:46:25
2010-10-09
08:15:47
TE3.2: Huawei & HiSilicon report on flexible motion
partitioning coding
X. Zheng (HiSilicon), H. Yu
(Huawei)
JCTVC-C100
m18123
2010-09-30
02:02:22
2010-10-02
23:46:56
2010-10-02
23:46:56
TE5: Huawei & HiSilicon report on verification test results
L. Liu (HiSilicon), J. Zhou, H.
Yu (Huawei)
JCTVC-C101
m18124
2010-09-30
02:04:44
2010-10-03
00:00:52
2010-10-03
00:00:52
TE6.a: HiSilicon report on short distance intra prediction
C. Lai, Y. Lin (HiSilicon)
JCTVC-C102
m18125
2010-09-30
02:07:42
2010-10-03
00:01:31
2010-10-09
11:11:12
TE7: Symmetry-based simplification of MDDT
H. Yang, J. Zhou, H. Yu
(Huawei)
JCTVC-C103
m18126
2010-09-30
02:10:17
2010-10-03
00:02:02
2010-10-03
00:02:02
TE7: Report of cross-checking results for Toshiba
proposal
H. Yang, J. Zhou, H. Yu
(Huawei)
JCTVC-C104
m18127
2010-09-30
02:12:38
2010-10-03
00:02:39
2010-10-03
00:02:39
TE12: Huawei & HiSilicon report on AMP evaluation
X. Zheng (HiSilicon), J.
Zhou, H. Yu (Huawei)
JCTVC-C105
m18128
2010-09-30
02:14:42
2010-10-03
00:03:14
2010-10-03
00:03:14
TE12: Huawei report on max transform unit size
evaluation
J. Zhou, D. Wang, H. Yu
(Huawei)
JCTVC-C106
m18129
2010-09-30
02:16:10
2010-10-03
00:03:43
2010-10-09
11:06:00
Symmetry-based scan order sharing scheme for MDDT
H. Yang, J. Zhou, H. Yu
(Huawei)
JCTVC-C107
m18130
2010-09-30
02:21:55
Fast CABAC
[Withdrawn]
H. J. Kim (Korea Univ.)
JCTVC-C108
m18131
2010-09-30
02:26:17
2010-10-03
00:05:08
2010-10-12
07:00:26
Jointly optimal intra prediction and adaptive primary
transform
A. Saxena, F. C. A.
Fernandes (Samsung)
JCTVC-C109
m18132
2010-09-30
02:28:49
2010-10-03
01:34:36
2010-10-15
05:38:13
TE12.1: Results for PU-based merging by Fraunhofer
HHI
B. Bross (Fraunhofer HHI)
JCTVC-C110
m18133
2010-09-30
02:30:17
2010-10-03
03:03:37
2010-10-03
03:03:37
TE12.4: Transform unit quadtree tested against 2-level
by Fraunhofer HHI
B. Bross (Fraunhofer HHI)
JCTVC-C111
2010-09-30
m18134
02:32:25
2010-10-03
03:04:57
2010-10-08
08:26:33
Integration of plane mode in unified intra prediction
G. Li (Santa Clara Univ.), L.
Liu, N. Ling (Santa Clara
Univ.), J. Zheng, P. Zhang
(Hisilicon)
JCTVC-C112
m18135
2010-09-30
02:34:15
2010-10-03
03:10:10
2010-10-11
08:03:54
Fast integer transforms for the HEVC test model
W. Dai, M. Krishnan, P.
Jesudhas, P. Topiwala
(FastVDO)
244
JCTVC-C113
m18136
2010-09-30
02:38:17
2010-10-01
23:49:10
2010-10-05
04:56:56
Adaptive loop filter with low encoder complexity
I.S. Chong, W. Chien, M.
Karczewicz (Qualcomm)
JCTVC-C114
m18137
2010-09-30
02:41:44
2010-10-07
10:37:25
2010-10-07
10:37:25
Zigzag scan for CABAC/PIPE
J. Lou, K. Panusopone, L.
Wang (Motorola)
JCTVC-C115
2010-09-30
m18138
02:54:46
2010-10-03
05:29:32
2010-10-08
17:43:16
TE1: Summary report of TE1 activity
Y.-J. Chiu (Intel), Y.-W.
Huang (MediaTek), M. Wien
(RWTH Aachen Univ.), H.
Yu (Huawei)
JCTVC-C116
m18139
2010-09-30
02:56:51
2010-10-03
03:11:33
2010-10-05
10:55:26
TE11: Cross verification of Sony proposal on motion
vector coding
Y.-J. Jeon, B.-M. Jeon (LG
Electronics)
JCTVC-C117
m18140
2010-09-30
03:23:03
2010-10-03
03:14:16
2010-10-03
03:14:16
Implementation analysis of transform block size
Y. Yu (Broadcom)
JCTVC-C118
m18141
2010-09-30
03:27:12
2010-10-03
03:15:16
2010-10-04
11:16:43
TE12.1 cross-check report on block merging (MRG)
on/off
K. Sugimoto, A. Minezawa,
S. Sekiguchi (Mitsubishi
Electric)
JCTVC-C119
m18142
2010-09-30
03:27:57
2010-10-03
03:18:57
2010-10-03
03:18:57
TE12.2 report on MV prediction AMVP/IMVP
K. Sugimoto, A. Minezawa,
S. Sekiguchi (Mitsubishi
Electric)
JCTVC-C120
m18143
2010-09-30
03:28:41
2010-10-03
03:20:04
2010-10-04
11:25:35
TE12.3 cross-check report on AIS on/off
K. Sugimoto, A. Minezawa,
S. Sekiguchi (Mitsubishi
Electric)
JCTVC-C121
m18144
2010-09-30
03:30:08
2010-10-03
03:21:14
2010-10-14
01:34:45
Suggested approach toward HEVC test model creation
S. Sekiguchi, K. Sugimoto
(Mitsubishi Electric)
JCTVC-C122
m18145
2010-09-30
03:31:08
2010-10-03
03:23:20
2010-10-06
18:27:41
A suggested starting point for HEVC test model
K. Sugimoto, S. Sekiguchi
(Mitsubishi Electric)
JCTVC-C123
m18146
2010-09-30
03:32:06
2010-10-03
03:38:02
2010-10-03
03:38:02
On selecting coding tools for HEVC test model
K. Sugimoto, A. Minezawa,
S. Sekiguchi (Mitsubishi
Electric)
JCTVC-C124
m18147
2010-09-30
03:33:11
2010-10-03
03:39:49
2010-10-15
02:10:32
TE1: Implicit direct vector derivation
Y. Itani, S. Sekiguchi
(Mitsubishi Electric)
JCTVC-C125
m18148
2010-09-30
03:34:14
2010-10-03
03:40:52
2010-10-03
03:40:52
TE1: Spatio-temporal direct mode
S. Sekiguchi, Y. Itani
(Mitsubishi Electric)
JCTVC-C126
m18149
2010-09-30
03:37:21
2010-10-03
05:30:25
2010-10-06
07:34:38
Chroma interpolation filtering using high precision filter
D. Y. Kim, J. P. Kim, Y. L.
Lee (Sejong University), J.
Lim, J. Song (SKT)
245
JCTVC-C127
m18151
2010-09-30
05:39:59
2010-10-03
05:32:53
2010-10-08
10:11:44
TE1: Report of self derivation of motion estimation
improvement in TMuC
Y.-J. Chiu, L. Xu, W. Zhang,
H. Jiang (Intel)
JCTVC-C128
m18152
2010-09-30
05:41:20
2010-10-03
05:33:38
2010-10-03
05:33:38
TE1: Cross-checking of DMVD result from Huawei
Y.-J. Chiu, L. Xu, W. Zhang,
H. Jiang (Intel)
JCTVC-C129
m18153
2010-09-30
05:42:33
2010-10-03
05:34:20
2010-10-08
10:13:51
TE12: Cross verification for adaptive loop filter (ALF) (on Y.-J. Chiu, L. Xu, W. Zhang,
vs. off)
H. Jiang (Intel)
JCTVC-C130
m18154
2010-09-30
08:09:06
2010-10-02
16:56:13
2010-10-08
06:23:27
TE10 subtest 1: Results of intra deblocking filter testing
by SKKU/SKT
J. Yang, K. Won, B. Jeon
(SKKU), J. Lim, J. Song
(SKT)
JCTVC-C131
m18155
2010-09-30
08:11:10
2010-10-02
11:49:32
2010-10-02
11:54:33
TE10 subtest 1: Cross-verification result of Microsoft
deblocking filter proposal by SKKU/SKT
J. Yang, K. Won, B. Jeon
(SKKU), J. Lim, J. Song
(SKT)
JCTVC-C132
m18156
2010-09-30
08:12:22
2010-10-02
11:51:54
2010-10-02
11:51:54
TE11: Cross-verification result of Orange Labs motion
vector competition proposal (3.1a) by SKKU
J. Yang, K. Won, B. Jeon
(SKKU)
JCTVC-C133
m18157
2010-09-30
08:13:24
2010-10-02
14:50:11
2010-10-02
14:50:11
Motion vector competition method with reduced
candidate sets
[Withdrawn]
K. Won, J. Yang, B. Jeon
(SKKU), J. Lim, J. Song
(SKT)
JCTVC-C134
m18158
2010-09-30
08:20:46
2010-10-03
00:56:47
2010-10-09
11:45:55
Comments on V2V coding for TM/TMuC
D. He, G. Korodi, P. Imthurn,
J. Jamias, D. O'Loughlin, G.
Martin-Cocher (RIM)
JCTVC-C135
m18159
2010-09-30
08:26:07
2010-10-02
00:16:34
2010-10-11
07:58:53
Flexible scaling of quantization parameter
D. Hoang (Zenverge)
JCTVC-C136
m18160
2010-09-30
08:35:21
2010-10-01
17:42:26
2010-10-07
10:02:08
TE12.4: Mode dependent direnctional transform (MDDT)
X. Zhang (HKUST)
test (off vs. on) by HKUST
JCTVC-C137
m18161
2010-09-30
08:55:58
2010-10-01
17:59:25
2010-10-01
17:59:25
Decoder-side block boundary decision (DBBD) with
OBMC
S. Fukushima (JVC)
JCTVC-C138
m18162
2010-09-30
09:03:54
2010-10-01
18:08:35
2010-10-12
03:40:33
TE1.a: Implementation report of refinement motion
compensation using DMVD on TMuC
M. Ueda (JVC)
JCTVC-C139
m18163
2010-09-30
09:17:39
2010-10-03
03:42:06
2010-10-10
17:35:36
TE6.a: Huawei & HiSilicon report on cross verification of
MSRA's LIC
C. Lai, Y. Lin (HiSilicon), J.
Zhou (Huawei)
JCTVC-C140
m18164
2010-09-30
14:06:56
2010-10-03
03:43:34
2010-10-07
04:03:00
TE9-2: Report on performance tests for different sets of
PU modes by Fraunhofer HHI
B. Bross (Fraunhofer HHI)
JCTVC-C141
m18167
2010-09-30
16:24:37
2010-10-02
06:39:30
2010-10-02
06:39:30
TE8: Crosscheck on TI's proposal of parallel context
processing by MediaTek
Y.-L. Chang, Y.-W. Huang,
S. Lei (MediaTek)
JCTVC-C142
m18168 2010-09-30
2010-10-02
2010-10-05
TE10 subtest 1: Improved deblocking filter
J. An, K. Zhang, Y. Gao, X.
246
16:29:48
06:41:47
08:56:03
Guo, C.-M. Fu, Y.-W. Huang,
S. Lei (MediaTek)
JCTVC-C143
m18169
2010-09-30
16:31:16
2010-10-02
06:43:40
2010-10-05
17:31:12
TE10 subtest 2: Coding unit synchronous picture
quadtree-based adaptive loop filter (QALF)
C.-Y. Tsai, C.-M. Fu, C.-Y.
Chen, Y.-W. Huang, S. Lei
(MediaTek)
JCTVC-C144
m18170
2010-09-30
16:32:20
2010-10-02
06:44:45
2010-10-02
06:44:45
TE10 subtest 2: Crosscheck on TOSHIBA's proposal of
adaptive loop filter by MediaTek
C.-M. Fu, Y.-W. Huang, S.
Lei (MediaTek)
JCTVC-C145
m18171
2010-09-30
16:35:47
2010-10-02
06:45:31
2010-10-02
06:45:31
TE10 subtest 2: Crosscheck on SHARP's proposal of
adaptive loop filter by MediaTek
C.-Y. Tsai, Y.-W. Huang, S.
Lei (MediaTek)
JCTVC-C146
2010-09-30
m18172
16:38:49
2010-10-02
09:26:54
2010-10-02
09:26:54
TE10 subtest 3: Controlled clipping
Y.-L. Chang, C.-M. Fu
(MediaTek), A. Segall, Y. Su
(Sharp), C.-Y. Chen, Y.-W.
Huang, S. Lei (MediaTek)
JCTVC-C147
m18173
2010-09-30
16:39:44
2010-10-02
06:47:02
2010-10-05
08:57:57
TE10 subtest 3: Quadtree-based adaptive offset
C.-Ming Fu, C.-Y. Chen, Y.W. Huang, S. Lei (MediaTek)
JCTVC-C148
m18174
2010-09-30
16:40:27
2010-10-02
06:47:59
2010-10-02
06:47:59
TE12: Crosscheck on deblocking filter in TMuC0.7 by
MediaTek
Y.-L. Chang, Y.-W. Huang,
S. Lei (MediaTek)
JCTVC-C149
m18175
2010-09-30
17:44:55
2010-10-03
03:45:06
2010-10-03
03:45:06
TE12.2: Report on asymmetric motion prediction unit
(AMP) on/off
A. Ichigaya, K. Iguchi (NHK)
JCTVC-C150
m18176
2010-09-30
17:51:58
2010-10-01
08:03:22
2010-10-07
16:25:23
TE2: Cross-check of memory compression results from
TOSHIBA (JCTVC-C075)
Y. Choe, S. Hong (Yonsei
Univ.), Y-G. Kim (KGIT)
JCTVC-C151
m18177
2010-09-30
17:56:58
Reference frame denoising
[Withdrawn]
P. Amon, A. Hutter, E. Wige,
A. Kaup (Siemens)
JCTVC-C152
m18178
2010-09-30
17:58:55
2010-10-03
03:46:04
2010-10-11
02:12:23
Context-adaptive hybrid variable length coding
Y. Xu, J. Li, W. H. Chen, D.
Tian (Cisco Systems)
JCTVC-C153
m18179
2010-09-30
23:22:00
2010-10-01
22:09:49
2010-10-01
22:09:49
TE12: Report on deblocking filter
P. Chen, M. Karczewicz
(Qualcomm)
JCTVC-C154
m18180
2010-09-30
23:34:07
2010-10-02
23:36:10
2010-10-10
16:49:20
Fine granularity slices
R. Sjöberg, P. Wennersten
(Ericsson)
JCTVC-C155
m18181
2010-09-30
23:38:19
2010-10-02
23:32:09
2010-10-12
18:24:33
TMuC 0.7 results with sequence dependent QPs
K. Andersson, R. Sjöberg
(Ericsson)
JCTVC-C156
m18182
2010-09-30
23:40:22
2010-10-02
23:32:47
2010-10-09
17:33:36
TE10: Cross verification of Mediatek's deblocking filter by A. Norkin, R. Sjöberg, K.
Ericsson
Andersson (Ericsson)
JCTVC-C157
m18183
2010-09-30
23:42:33
2010-10-02
23:33:13
2010-10-02
23:33:13
TE6.a: Cross verification of line-based intra prediction
247
J. Samuelsson, R. Sjöberg,
K. Andersson (Ericsson)
JCTVC-C158
m18184
2010-09-30
23:43:54
2010-10-02
23:33:42
2010-10-02
23:33:42
TE7: Cross verification of MDDT simplification JCTVCB024
K. Andersson (Ericsson)
JCTVC-C159
m18185
2010-09-30
23:45:20
2010-10-02
23:34:19
2010-10-02
23:34:19
TE12: Cross verification of RDOQ
K. Andersson (Ericsson)
JCTVC-C160
m18186
2010-10-01
00:46:38
2010-10-01
22:27:08
2010-10-01
22:27:08
TE12: Crosscheck on LCEC phase 2
P. Chen, M. Karczewicz
(Qualcomm)
JCTVC-C161
m18187
2010-10-01
03:44:41
2010-10-01
17:53:36
2010-10-09
07:52:08
TE10: Cross-verification result of SKKU/SKT deblocking
filter
K. Chono, K. Senzaki, H.
Aoki, J. Tajime, Y. Senda
(NEC)
JCTVC-C162
m18188
2010-10-01
03:50:50
2010-10-01
13:25:57
2010-10-07
11:21:48
TE12.2: Results on MC interpolation filters
K. Kondo, T. Suzuki (Sony)
JCTVC-C163
m18189
2010-10-01
03:52:03
2010-10-01
12:28:00
2010-10-07
11:24:05
TE12.2: Results on MV prediction
K. Sato, T. Suzuki (Sony)
JCTVC-C164
m18190
2010-10-01
03:53:09
2010-10-03
03:47:01
2010-10-06
18:48:25
TE3 subset 4: Results on bi/single filter switching in FIF
K. Kondo, T. Suzuki (Sony)
JCTVC-C165
m18191
2010-10-01
03:55:00
2010-10-07
17:07:35
2010-10-15
12:39:26
TE3 subset 4: Cross verification on high accuracy
interpolation filter
K. Kondo, T. Suzuki (Sony)
JCTVC-C166
m18192
2010-10-01
03:55:32
2010-10-03
03:48:12
2010-10-07
03:26:25
TE11: Study on motion vector coding (experiment 3.3.a &
K. Sato (Sony)
3.3.c)
JCTVC-C167
m18193
2010-10-01
03:56:12
2010-10-03
03:49:10
2010-10-03
03:49:10
Proposal on large block structure and quantization
K. Sato (Sony)
JCTVC-C168
m18194
2010-10-01
03:57:05
2010-10-03
05:36:59
2010-10-07
11:25:57
Comments on common test conditions
K. Kondo, K. Sato, T. Suzuki
(Sony)
JCTVC-C169
m18198
2010-10-01
05:36:07
2010-10-03
05:37:39
2010-10-12
05:24:59
TE6.b: Experiment results of DCIM
E. Maani, L. Dong, W. Liu
(Sony)
JCTVC-C170
m18199
2010-10-01
05:38:00
2010-10-03
03:50:11
2010-10-09
08:18:28
TE3.2: Report on cross-verification of simplified geometry X. Zheng (HiSilicon), H. Yu
block partitioning from Technicolor
(Huawei)
JCTVC-C171
m18200
2010-10-01
05:39:20
2010-10-03
05:39:31
2010-10-05
06:35:24
TE12.3: Experimental results of edge based prediction
A. Tabatabai, C. Auyeung
(Sony)
JCTVC-C172
m18201
2010-10-01
05:42:20
2010-10-03
05:40:18
2010-10-12
05:32:26
TE6: Summary report of intra prediction improvements
tool experiments
A. Tabatabai (Sony)
JCTVC-C173
m18202
2010-10-01
05:45:36
2010-10-01
14:30:06
2010-10-09
11:43:19
TE10 subtest 2: Parallel adaptive loop filter
T. Ikai, T. Yamamoto
(SHARP)
JCTVC-C174
m18203
2010-10-01
05:47:23
2010-10-04
10:56:35
2010-10-04
10:56:35
TE12.3: Report on combined intra prediction
T. Yamamoto, A. Segall
(SHARP)
248
JCTVC-C175
m18204
2010-10-01
05:48:11
2010-10-04
10:57:06
2010-10-20
07:56:10
TE6.b: Cross verification of differential coding of intra
modes (DCIM)
T. Yamamoto, Y. Yasugi
(SHARP)
JCTVC-C176
m18205
2010-10-01
05:53:40
2010-10-01
14:30:50
2010-10-05
10:42:21
Analysis and improvement of differential coding of intra
modes
T. Yamamoto, Y. Yasugi
(Sharp)
JCTVC-C177
m18206
2010-10-01
06:35:49
Horizontal spatial prediction for intra coding
[Withdrawn]
P. Tao, D. Li, W. Wu, J. Wen
(Tsinghua Univ.)
JCTVC-C178
m18207
2010-10-01
07:11:57
2010-10-06
05:02:26
2010-10-06
05:02:26
TE2: Cross-check of memory compression results from
Panasonic (JCTVC-C074)
A. Segall (Sharp)
JCTVC-C179
m18208
2010-10-01
07:14:25
2010-10-06
05:03:24
2010-10-06
05:03:24
TE6: Cross-check of bi-directional intra-predition results
from Toshiba
A. Segall (Sharp)
JCTVC-C180
m18209
2010-10-01
07:18:37
2010-10-06
17:00:44
2010-10-10
02:53:54
TE10.3: Cross check of MediaTek's proposal on
quadtree-based adaptive offset
A. Segall (Sharp)
JCTVC-C181
m18210
2010-10-01
07:21:01
2010-10-06
05:04:13
2010-10-07
08:39:06
TE12.4: Cross-check of rotational transform (ROT)
K. Misra, A. Segall (Sharp)
JCTVC-C182
m18211
2010-10-01
07:24:57
2010-10-06
10:20:39
2010-10-07
08:40:33
TE12.2: Cross-check of adaptive motion vector resolution
A. Segall (Sharp)
(AMVRES) technology in the TMuC software
JCTVC-C183
m18212
2010-10-01
08:05:51
2010-10-02
08:01:01
2010-10-11
14:56:39
Enhanced switching of interpolation filter for HEVC
T. Yoshino, S. Naito, S.
Sakazawa (KDDI)
JCTVC-C184
m18215
2010-10-01
09:06:12
2010-10-03
00:03:20
2010-10-06
18:39:04
TE12.6: PIPE tested against LCEC by Cisco
A. Fuldseth (Cisco)
JCTVC-C185
m18216
2010-10-01
09:08:53
2010-10-03
00:03:56
2010-10-07
08:16:58
Recent improvements of the low complexity entropy
coder (LCEC) in TMuC
A. Fuldseth, G. Bjøntegaard
(Cisco), A. Hallapuro, K.
Ugur, J. Lainema (Nokia)
JCTVC-C186
m18217
2010-10-01
09:13:37
2010-10-03
03:51:56
2010-10-03
03:51:56
TE5: Results for simplified intra prediction tests by Nokia J. Lainema, K. Ugur (Nokia)
JCTVC-C187
m18218
2010-10-01
09:16:59
2010-10-03
03:53:00
2010-10-03
03:53:00
TE6.a: Cross-check of bidirectional intra prediction
(JCTVC-B042)
K. Iwata (Renesas)
JCTVC-C188
m18219
2010-10-01
09:17:39
2010-10-03
03:54:10
2010-10-09
03:43:41
TE12.3: Results for planar prediction tests by Nokia
J. Lainema, K. Ugur, O. Bici
(Nokia)
JCTVC-C189
m18220
2010-10-01
09:18:20
2010-10-03
03:58:20
2010-10-12
03:38:20
Evaluation of intra prediction based on repetitive pixel
replenishment
S. Mochizuki, K. Iwata
(Renesas)
JCTVC-C190
m18223
2010-10-01
11:24:27
2010-10-01
16:05:22
2010-10-02
19:16:05
TE7: Cross-check result of Peking Univ.'s proposal
(JCTVC-C089)
J. Kim, S.-C. Lim, H. Y. Kim,
H. Lee, J. S. Choi (ETRI)
JCTVC-C191
m18226
2010-10-01
11:28:00
2010-10-01
17:07:12
2010-10-04
14:16:38
TE11: Cross-check result of merge/skip (3.2c)
S. Jeong, J. Lee, H. Y. Kim,
S.-C. Lim (ETRI), K. Kim, H.
249
Lee, G. Park (KHU)
JCTVC-C192
m18228
2010-10-01
11:32:07
2010-10-01
20:55:06
2010-10-07
17:31:12
TE7: Cross-verification results of Huawei proposal on
simplified MDDT for intra prediction residual
Y. Choe, J. Kim (Yonsei
Univ.), Y-G. Kim (KGIT)
JCTVC-C193
m18230
2010-10-01
11:37:51
2010-10-02
05:48:06
2010-10-15
02:08:37
Overlapped block intra prediction
K. Zhang, X. Guo, J. An, M.
Guo, Y.-W. Huang, S. Lei
(MediaTek)
JCTVC-C194
m18231
2010-10-01
11:50:13
2010-10-03
03:59:21
2010-10-03
03:59:21
TE10 subtest 2: Cross-check results of JCTVC-C082
[Toshiba] Reduction of number of encoding passes for
quadtree-based adaptive loop filter (QALF)
P. Wu, S. Paschalakis, N.
Sprljan (Mitsubishi Electric)
JCTVC-C195
m18232
2010-10-01
11:52:32
2010-10-03
04:01:29
2010-10-03
04:01:29
An enhanced block-based adaptive loop filter in
TMuC0.7.0 platform
P. Wu, S. Paschalakis, N.
Sprljan (Mitsubishi Electric)
JCTVC-C196
m18233
2010-10-01
13:32:59
2010-10-03
04:02:43
2010-10-03
04:02:43
TE12.6: Results for LCEC_PHASE2 tests by Nokia
K. Ugur, A. Hallapuro, J.
Lainema (Nokia)
JCTVC-C197
m18235
2010-10-01
13:43:57
2010-10-03
04:03:34
2010-10-03
04:03:34
TE12.2: Results for SIFO-6Tap with DIF tests by Nokia
and Qualcomm
K. Ugur, J. Lainema (Nokia),
R. Panchal, M. Karczewicz
(Qualcomm)
JCTVC-C198
m18236
2010-10-01
15:02:40
2010-10-02
12:24:43
2010-10-12
04:25:35
TE9: Simulation results for various max. number of
transform quadtree depth
J. Chen, T. Lee, W.-J. Han
(Samsung)
JCTVC-C199
m18237
2010-10-01
15:06:05
2010-10-02
12:28:22
2010-10-02
12:33:06
TE11: Report on experiment 3.2.c: Check skip and
merge together
I.-K. Kim, T. Lee, W.-J. Han
(Samsung)
JCTVC-C200
m18238
2010-10-01
15:10:38
2010-10-02
12:30:10
2010-10-06
18:44:34
TE12.1: Experimental results of transform unit
quadtree/2-level test
T. Lee, J. Chen, W.-J. Han
(Samsung)
JCTVC-C201
m18239
2010-10-01
15:16:37
2010-10-02
12:38:06
2010-10-06
19:01:07
TE12.2: Experimental results of SIFO (DIF) / DCT-IF
E. Alshina, J. Chen, N.
Shlyakhov, A. Alshin, W.-J.
Han (Samsung)
JCTVC-C202
m18240
2010-10-01
15:19:08
2010-10-02
13:24:16
2010-10-06
19:14:19
TE12.4: Experimental results of MDDT and ROT by
Samsung and Qualcomm
E. Alshina, A. Alshin, W.-J.
Han (Samsung), R. Joshi, M.
Coban (Qualcomm)
JCTVC-C203
m18241
2010-10-01
15:26:15
2010-10-02
12:41:06
2010-10-02
12:41:06
TE12.4: Experimental results of transform coefficient
coding
J. Chen, V. Seregin, W.-J.
Han (Samsung)
JCTVC-C204
m18242
2010-10-01
15:27:44
2010-10-03
12:30:27
2010-10-05
13:52:24
Bi-directional optical flow
A. Alshin, E. Alshina
(Samsung)
JCTVC-C205
m18243
2010-10-01
15:29:50
2010-10-02
12:43:04
2010-10-07
19:03:35
Low-complexity adaptive coefficient scanning
V. Seregin, J. Chen
(Samsung)
JCTVC-C206
m18244 2010-10-01
2010-10-02
2010-10-11
Chroma intra prediction by scaled luma samples using
J. Chen, V. Seregin
250
15:36:38
12:44:14
05:59:20
integer operations
(Samsung)
JCTVC-C207
m18245
2010-10-01
15:39:22
2010-10-02
12:47:53
2010-10-07
19:06:02
Encoder improvement of unified intra prediction
Y. Piao, J.-H. Min, J. Chen
(Samsung)
JCTVC-C208
m18246
2010-10-01
15:40:37
2010-10-02
12:53:45
2010-10-08
19:35:36
Simulation results of motion partition, block merging and
motion competition schemes
I.-K. Kim, T. Lee (Samsung)
JCTVC-C209
m18247
2010-10-01
15:43:38
2010-10-02
12:56:31
2010-10-08
02:30:43
Low-complexity 16x16 and 32x32 transforms and partial
frequency transform
Y.-M. Hong, M.-S. Cheon, I.K. Kim (Samsung)
JCTVC-C210
m18248
2010-10-01
15:45:42
2010-10-02
12:57:59
2010-10-05
14:06:21
Efficient coefficient coding method for large transform in
VLC mode
S. Lee, M.-S. Cheon, I.-K.
Kim (Samsung)
JCTVC-C211
m18249
2010-10-01
15:49:19
2010-10-02
13:00:37
2010-10-05
14:40:31
Inner block oriented ALF processing for memory &
bandwidth reduction
Y. Sohn, K.-H. Lee, B.-K.
Lee, I.-K. Kim (Samsung)
JCTVC-C212
m18250
2010-10-01
15:50:17
2010-10-02
12:25:22
2010-10-04
08:16:35
Cross-verification of JCTVC-C086: Experimental results
of ALF on low complexity
W.-J. Han (Samsung)
JCTVC-C213
m18251
2010-10-01
16:11:41
2010-10-03
05:41:28
2010-10-07
06:56:55
Additional results for combined intra prediction
M. Mrak, T.Davies, D. Flynn,
A. Gabriellini (BBC)
JCTVC-C214
m18254
2010-10-01
16:37:58
2010-10-01
19:03:24
2010-10-07
06:48:05
TE12.5: Results for adaptive loop filter using prediction
and residual
M. Narroschke (Panasonic)
JCTVC-C215
m18255
2010-10-01
17:33:56
2010-10-03
04:05:02
2010-10-09
05:16:40
Adaptive MV resolution with directional filters
K. Ugur, J. Lainema (Nokia)
JCTVC-C216
m18256
2010-10-01
17:36:31
2010-10-01
18:48:44
2010-10-07
08:51:39
TE12.3: Results for edge based prediction
V. Drugeon (Panasonic)
JCTVC-C217
m18257
2010-10-01
17:51:11
2010-10-01
18:52:49
2010-10-08
18:20:18
TE06.b: Cross verification of differential coding of intra
modes (DCIM)
V. Drugeon (Panasonic)
JCTVC-C218
m18258
2010-10-01
18:15:34
2010-10-01
20:36:37
2010-10-11
11:31:48
Encoding complexity reduction for intra prediction by
disabling NxN partition
J. Kim, Y.Jeon, B. Jeon (LG
Electronics)
JCTVC-C219
m18259
2010-10-01
18:37:06
2010-10-01
19:25:33
2010-10-10
09:36:01
In-loop reference frame denoising
P. Amon, A. Hutter, E. Wige,
A. Kaup (Siemens)
JCTVC-C220
m18260
2010-10-01
19:00:04
2010-10-03
04:06:06
2010-10-14
05:41:05
TE3: Summary report for inter prediction in HEVC
A. Krutz, T. Sikora (Tech.
Univ. Berlin)
JCTVC-C221
m18261
2010-10-01
19:03:33
2010-10-03
04:06:56
2010-10-03
04:06:56
TE3: Adaptive global motion temporal prediction for
HEVC
[Withdrawn]
A. Glantz, A. Krutz, T. Sikora
(Tech. Univ. Berlin)
JCTVC-C222
m18262
2010-10-01
19:08:17
2010-10-03
04:08:16
2010-10-03
04:08:16
In-loop filtering using temporal pixel trajectories
[Withdrawn]
M. Esche, A. Krutz, A.
Glantz, T. Sikora (Tech.
Univ. Berlin)
251
JCTVC-C223
m18263
2010-10-01
19:57:24
2010-10-02
08:42:57
2010-10-09
07:19:50
TE8 report
G. Martin-Cocher (RIM), M.
Budagavi (TI)
JCTVC-C224
m18264
2010-10-01
20:18:27
2010-10-02
00:45:40
2010-10-02
00:45:40
Frame coding in vertical raster scan order
D.-K. Kwon, H. Kim (TI)
JCTVC-C225
m18265
2010-10-01
20:20:29
2010-10-07
04:44:52
2010-10-07
04:44:52
TE12: Summary of evaluation of TMuC tools in TE12
K. McCann
(Samsung/Zetacast)
JCTVC-C226
m18266
2010-10-01
20:56:12
2010-10-02
02:53:22
2010-10-12
02:54:47
Low-complexity configurable transform architecture for
HEVC
M. Sadafale, M. Budagavi
(TI)
JCTVC-C227
m18267
2010-10-01
20:58:29
2010-10-02
02:59:47
2010-10-05
11:52:28
Parallelization of HHI_TRANSFORM_CODING
V. Sze, M. Budagavi (TI)
JCTVC-C228
m18268
2010-10-01
20:59:54
2010-10-02
03:00:47
2010-10-02
03:00:47
A simple cache model for measuring motion
compensation bandwidth
M. Budagavi (TI)
JCTVC-C229
m18269
2010-10-01
21:05:44
2010-10-01
23:49:57
2010-10-02
08:18:00
TE10 subset 2: Cross check result of MediaTek ALF
I. S. Chong, M. Karczewicz
(Qualcomm)
JCTVC-C230
m18270
2010-10-01
21:14:28
2010-10-02
08:18:44
2010-10-06
17:07:18
TE12: Cross check result of Panasonic's 3-input-ALF
I. S. Chong, M. Karczewicz
(Qualcomm)
JCTVC-C231
m18271
2010-10-01
21:26:12
2010-10-06
16:05:38
2010-10-12
17:30:27
TE12.1: Results for PU based merging
Y. Suzuki, T.K. Tan, F.
Bossen (NTT DOCOMO)
JCTVC-C232
m18272
2010-10-01
21:39:50
2010-10-01
21:41:43
2010-10-01
21:41:43
TE12.7: Verification of JCTVC-C041 (partition based
illumination compensation - PBIC)
N. Sprljan, S. Paschalakis,
P. Wu (Mitsubishi Electric)
JCTVC-C233
m18273
2010-10-01
21:44:07
2010-10-02
01:51:49
2010-10-02
01:51:49
TE3 subtest 3: Local intensity compensation (LIC) for
inter prediction
N. Sprljan, S. Paschalakis,
P. Wu (Mitsubishi Electric)
JCTVC-C234
m18274
2010-10-01
22:10:09
2010-10-02
10:26:57
2010-10-07
07:11:51
Simplified intra smoothing
Y. Zheng, M. Coban, M.
Karczewicz (Qualcomm)
JCTVC-C235
m18276
2010-10-01
22:40:05
2010-10-01
22:59:24
2010-10-02
08:19:17
TE12: Evaluation of adptive in-loop filter
I. S. Chong, M. Karczewicz
(Qualcomm)
JCTVC-C236
m18277
2010-10-01
22:49:09
2010-10-01
22:51:19
2010-10-04
10:31:49
Report of complexity analysis of geometric partitioning
K. Vermeirsch, J. De Cock,
R. Van de Walle (Ghent
Univ. - IBBT)
JCTVC-C237
m18278
2010-10-01
22:51:51
2010-10-02
07:54:20
2010-10-02
07:54:20
Reduced complexity 32×32 transform by coefficient
zero-out
J. Sole, R. Joshi, M.
Karczewicz (Qualcomm)
JCTVC-C238
m18279
2010-10-01
22:53:59
2010-10-02
01:19:02
2010-10-03
00:49:23
TE12.8: Results on RDOQ in high efficiency settings
J. Zan, J. Meng, M. T. Islam,
D. He (RIM)
JCTVC-C239
m18280
2010-10-01
22:57:22
2010-10-02
01:58:44
2010-10-02
01:58:44
TE12: Report on interpolation filter: SIFO/DIF
R. Panchal, M. Karczewicz
(Qualcomm)
252
JCTVC-C240
m18281
2010-10-01
22:57:27
2010-10-03
05:43:21
2010-10-09
14:40:22
TE3: Cross-check results of local intensity compensation D. Karwowski (Poznan Univ.
tool from Mitsubishi Electric
Tech.)
JCTVC-C241
m18282
2010-10-01
22:58:59
2010-10-02
01:56:58
2010-10-02
01:56:58
TE12: Crosscheck on interpolation filter:
SIFO(DIF)/MOMS
JCTVC-C242
m18283
2010-10-01
23:01:38
JCTVC-C243
2010-10-01
m18284
23:03:08
2011-01-01
23:30:18
(post
meeting)
2011-01-01
23:30:18
(post
meeting)
Cross check result of JCTVC-C084
I. S. Chong, M. Karczewicz
(Qualcomm)
JCTVC-C244
m18285
2010-10-01
23:06:40
2010-10-07
07:06:47
2010-10-07
07:10:22
TE12.3: Report on planar intra prediction
D. Flynn (BBC)
JCTVC-C245
m18286
2010-10-01
23:07:35
2010-10-02
01:19:38
2010-10-09
05:55:24
TE 8: Cross verification of TI-PCP proposal for
significance map
J. Zan, J. Meng, M. T. Islam,
D. He (RIM)
JCTVC-C246
m18287
2010-10-01
23:25:04
2010-10-02
10:22:51
2010-10-02
10:22:51
TE12.8: Cross-verification of partition-based illumination
compensation (PBIC)
P. Onno (Canon)
JCTVC-C247
m18288
2010-10-01
23:27:35
2010-10-05
13:23:28
2010-10-05
13:23:28
TE3 subset 4: Cross-verification on bi/single filter
switching in FIF
T. Chujoh, K. Kanou, T.
Yamakage (Toshiba)
JCTVC-C248
m18289
2010-10-01
23:29:03
2010-10-05
03:00:20
2010-10-05
03:00:20
TE12.2: Cross verification on DCT-IF for chroma
T. Chujoh, K. Kanou, T.
Yamakage (Toshiba)
JCTVC-C249
m18290
2010-10-01
23:35:31
2010-10-03
00:51:58
2010-10-09
11:58:04
TE 8: Reports on V2V coding and context modeling by
RIM
J. Zan, G. Korodi, J. Meng,
M. T. Islam, D. He (RIM)
JCTVC-C250
m18291
2010-10-02
00:30:27
2010-10-02
09:50:47
2010-10-08
09:16:04
Low complexity adaptive coefficient scanning
M. Coban, R. Joshi, M.
Karczewicz (Qualcomm)
JCTVC-C251
m18292
2010-10-02
00:37:22
2010-10-02
10:53:45
2010-10-12
03:39:56
Overlapped block motion compensation in TMuC
P. Chen, R. Panchal, W.-J.
Chien, M. Karczewicz
(Qualcomm)
JCTVC-C252
m18293
2010-10-02
00:47:24
2010-10-03
03:54:18
2010-10-03
03:54:18
TE 12: Report on evaluation of internal bit depth increase R. Joshi, M. Karczewicz
and transform precision extension
(Qualcomm)
JCTVC-C253
m18294
2010-10-02
01:39:31
2010-10-02
09:24:39
2010-10-06
22:38:38
Encoder speedup for bidirectional averaging with
rounding control
R. Panchal, M. Karczewicz,
P. Chen (Qualcomm)
JCTVC-C254
m18295
2010-10-02
02:00:27
2010-10-02
09:25:30
2010-10-02
09:25:30
Test result of SIFO for sequencs with illumination change
R. Panchal, M. Karczewicz
(Qualcomm)
JCTVC-C255
m18296
2010-10-02
02:28:42
2010-10-02
03:02:13
2010-10-12
04:50:35
DCT+Hadamard low complexity large transform for Inter
coding
M. Budagavi, A. Gupte (TI)
R. Panchal, M. Karczewicz
(Qualcomm)
Document registration withdrawn
[Withdrawn]
253
JCTVC-C256
m18297
2010-10-02
02:45:47
2010-10-02
09:51:24
2010-10-02
09:51:24
New results for entropy slices for highly parallel coding
K. Misra, J. Zhao, A. Segall
(SHARP)
JCTVC-C257
m18298
2010-10-02
02:46:40
2010-10-02
10:57:46
2010-10-11
06:11:14
On motion vector competition
Y. Su, A. Segall (SHARP)
JCTVC-C258
m18299
2010-10-02
03:04:08
2010-10-02
03:31:04
2010-10-06
11:36:01
TE 12: Evaluation of transform skip flag (RQT_ROOT)
W.-J. Chien, P. Chen, M.
Karczewicz (Qualcomm)
JCTVC-C259
m18300
2010-10-02
03:05:04
2010-10-02
03:13:09
2010-10-02
03:13:09
TE12: Results for experiments on Max CU size
Y. Yu, K. Panusopone, L.
Wang, S.-T. Hsiang, F.
Ishtiaq (Motorola)
JCTVC-C260
m18301
2010-10-02
03:06:18
2010-10-02
03:14:03
2010-10-10
11:29:44
Flexible picture partitioning
K. Panusopone, X. Fang, L.
Wang (Motorola)
JCTVC-C261
m18302
2010-10-02
03:06:32
2010-10-03
05:44:33
2010-10-03
05:44:33
TE4: Summary of TE4 on variable length coding
X. Wang (Qualcomm)
JCTVC-C262
m18303
2010-10-02
03:08:46
2010-10-03
05:45:53
2010-10-09
04:54:19
TE4: Report on VLC for coded block flag
X. Wang, M. Karczewicz,
W.-J. Chien (Qualcomm)
JCTVC-C263
m18304
2010-10-02
03:09:58
2010-10-03
05:47:05
2010-10-06
17:21:25
Improvements on VLC
M. Karczewicz, W.-J. Chien,
X. Wang (Qualcomm)
JCTVC-C264
m18305
2010-10-02
03:16:48
2010-10-02
03:29:46
2010-10-02
10:34:33
TE 12: Evaluation of block merging (MRG)
W.-J. Chien, P. Chen, M.
Karczewicz (Qualcomm)
JCTVC-C265
m18306
2010-10-02
03:19:13
2010-10-02
03:29:23
2010-10-02
10:34:04
TE 12: Evaluation of adaptive motion vector resolution
(AMVRES)
W.-J. Chien, P. Chen, M.
Karczewicz (Qualcomm)
JCTVC-C266
m18307
2010-10-02
03:21:29
2010-10-02
03:31:41
2010-10-06
10:47:35
TE1: Cross-check result of DMVD proposal from JCTVC- W.-J. Chien, P. Chen, M.
C138 (JVC/Kenwood)
Karczewicz (Qualcomm)
JCTVC-C267
m18308
2010-10-02
03:24:29
2010-10-02
03:28:08
2010-10-02
03:28:08
Test results of transform skip flag and phase 2 VLC
integration
W.-J. Chien, P. Chen, X.
Wang, M. Karczewicz
(Qualcomm)
JCTVC-C268
m18310
2010-10-02
04:32:25
2010-10-02
07:50:50
2010-10-02
07:50:50
TE12: Report on evaluation of MDDT and ROT
R. Joshi, M. Coban, M.
Karczewicz (Qualcomm)
JCTVC-C269
m18312
2010-10-02
04:45:57
2010-10-03
05:48:12
2010-10-10
11:34:29
A study on the impact of intra smoothing
A. Tabatabai, C. Auyeung,
E. Maani, T. Suzuki (Sony)
JCTVC-C270
m18313
2010-10-02
04:46:40
2010-10-02
04:54:43
2010-10-13
18:24:09
TE6.a: Report of line-based coding
X. Peng, J. Xu, F. Wu
(Microsoft)
JCTVC-C271
m18314
2010-10-02
04:47:48
2010-10-02
06:15:55
2010-10-02
06:15:55
TE6: Cross-verification of HiSilicon's short distance intra
prediction by Microsoft
X. Peng, J. Xu (Microsoft)
JCTVC-C272
m18315 2010-10-02
2010-10-09
2010-10-09
TE6: Cross-verification of MediaTek's overlapped block
X. Peng, J. Xu (Microsoft)
254
04:48:39
03:24:49
03:24:49
intra prediction by Microsoft
JCTVC-C273
m18316
2010-10-02
04:49:20
2010-10-02
05:11:08
2010-10-13
18:19:13
TE10 subset 1: Report of content-adaptive de-blocking
Z. Xiong, X. Sun, J. Xu
(Microsoft)
JCTVC-C274
m18317
2010-10-02
04:49:37
2010-10-02
05:17:32
2010-10-09
03:37:03
TE10 subset 1: Cross- verification of NEC's joint
deblocking-debanding filter by Microsoft
X. Xiong, J. Xu (Microsoft)
JCTVC-C275
m18318
2010-10-02
04:49:53
2010-10-02
05:20:47
2010-10-13
19:28:30
Improve intra frame coding by PU/TU reordering
X. Peng, J. Xu, F. Wu
(Microsoft)
JCTVC-C276
m18319
2010-10-02
04:50:06
2010-10-02
08:12:20
2010-10-13
18:15:04
Screen content coding results using TMuC
C. Lan, J. Xu, F. Wu
(Microsoft), G. Shi (Xidian
Univ.)
JCTVC-C277
m18320
2010-10-02
04:50:20
2010-10-02
05:34:44
2010-10-13
19:33:57
Redundancy reduction in Cbf and merge coding
B. Li, J. Xu, F. Wu, G. J.
Sullivan (Microsoft), H. Li
(Univ. Sci. Tech China)
JCTVC-C278
m18321
2010-10-02
04:50:32
2010-10-02
05:38:36
2010-10-13
19:34:27
Redundancy reduction in B-frame coding at temporal
level zero
B. Li, J. Xu, G. J. Sullivan, F.
Wu (Microsoft), H. Li (Univ.
Sci. Tech China)
JCTVC-C279
m18322
2010-10-02
05:48:31
2010-10-03
00:53:15
2010-10-03
00:53:15
Opportunistic parallel V2V decoding
D. He, G. Korodi, E.-h. Yang,
G. Martin-Cocher (RIM)
JCTVC-C280
m18323
2010-10-02
06:52:21
2010-10-02
07:00:55
2010-10-05
08:52:33
TE8: Crosscheck result of the transcoder for JCTVCB034 source selection for V2V entropy coding in HEVC
Y.-L. Chang, Y.-W. Huang,
S. Lei (MediaTek)
JCTVC-C281
m18324
2010-10-02
07:32:11
2010-10-02
10:29:00
2010-10-07
01:38:00
TE12: Evaluation of fast adaptive intra smoothing
M. Coban, Y. Zheng, M.
Karczewicz (Qualcomm)
JCTVC-C282
m18325
2010-10-02
08:15:22
2010-10-03
05:49:29
2010-10-08
05:51:11
TE3 subtest 1: Cross-check of results from LG
M. Tok, A. Glantz, A. Krutz,
T. Sikora (Tech. Univ. Berlin)
JCTVC-C283
m18326
2010-10-02
09:14:33
2010-10-02
10:03:32
2010-10-02
10:03:32
Region of block based dynamic video processing for
HEVC
A. Paul (Hanyang Univ. /
NCKU)
JCTVC-C284
m18327
2010-10-02
09:18:13
2010-10-08
14:05:26
2010-10-08
14:05:26
TE9-2.1 Report on forced RQT split according to PUs
(Sharp, Microsoft)
A. Segall (Sharp), J. Xu
(Microsoft)
JCTVC-C285
m18328
2010-10-02
09:32:20
2010-10-02
10:31:28
2010-10-02
10:31:28
Modified uni-directional inter prediction in generalized P
and B pictures
W.-J. Chien, P. Chen, X.
Wang, M. Karczewicz
(Qualcomm)
JCTVC-C286
m18329
2010-10-02
10:20:23
2010-10-02
10:23:03
2010-10-02
10:23:03
TE12.2: Technicolor & INRIA report on AMP evaluation
E. Francois (Technicolor), L.
Guillo (INRIA)
JCTVC-C287
m18330
2010-10-02
12:56:27
2010-10-04
12:56:12
2010-10-08
08:37:15
Cross-check of encoder speedup for bidirectional
averaging with rounding control
B. Bross (Fraunhofer HHI)
255
JCTVC-C288
m18331
2010-10-02
13:14:36
2010-10-04
10:50:06
2010-10-04
10:50:06
TE4: Verification results of JCTVC-C262
T. Yamakage (Toshiba)
JCTVC-C289
m18332
2010-10-02
13:40:03
2010-10-02
13:43:12
2010-10-07
03:38:46
TE11: Report on experiment 3.1.a: Disabling MV
competition
J. Jung, G. Clare, S. Pateux
(Orange Labs)
JCTVC-C290
m18333
2010-10-02
13:48:00
2010-10-02
13:49:40
2010-10-07
03:41:06
TE11: Report on experiment 3.2.b: MV-Competition on
Inter, Skip and Direct modes
J. Jung, G. Clare, S. Pateux
(Orange Labs)
JCTVC-C291
m18334
2010-10-02
13:53:14
2010-10-02
13:55:12
2010-10-07
07:52:39
TE11: Report on experiment 3.3.b: ‘temporally
oriented' set of predictors for MV-Competition
J. Jung, G. Clare, S. Pateux
(Orange Labs)
JCTVC-C292
m18335
2010-10-02
13:57:10
2010-10-05
15:20:38
2010-10-05
15:20:38
TE11: Verification of experiment 3.2.d by NTT DOCOMO
J. Jung, G. Clare, S. Pateux
(Orange Labs)
JCTVC-C293
m18336
2010-10-02
13:59:55
2010-10-02
14:08:42
2010-10-08
13:25:52
TE11: Summary report for TE11 on motion vector coding J. Jung (Orange Labs)
JCTVC-C294
m18337
2010-10-03
03:59:52
JCTVC-C295
m18340
2010-10-03
11:46:12
2010-10-11
17:43:31
JCTVC-C296
m18341
2010-10-03
11:47:37
JCTVC-C297
m18344
JCTVC-C298
Level limits for Progressive High Profile
[Withdrawn from JCT-VC]
A. Wells (Ambarella), C.
Fogg (Harmonic)
2010-10-11
17:44:37
Re-compensation based on partial coefficients
H. Zhu (Zhu)
2010-10-05
14:10:07
2010-10-07
20:00:59
Decoding improvement on the PA-Coder
H. Zhu (Zhu)
2010-10-03
14:49:12
2010-10-06
19:26:48
2010-10-11
05:21:20
Cross-verification of JCTVC-C213: additional results for
combined intra prediction
W.-J. Han (Samsung)
m18411
2010-10-04
16:37:29
2010-10-07
17:04:02
2010-10-07
17:04:02
TE12.4 Cross-check of unified MDDT/ROT
T. Davies, D. Flynn (BBC)
JCTVC-C299
m18423
2010-10-04
17:11:53
2010-10-04
17:17:59
2010-10-04
17:17:59
UK National Body comments on HEVC Timescales
UKNB of WG11
JCTVC-C300
m18491
2010-10-05
01:59:03
2010-10-07
02:13:29
2010-10-08
22:39:09
High-efficiency entropy coding simplifications
V. Sze, M. Budagavi (TI)
JCTVC-C301
m18516
2010-10-05
11:55:43
2010-10-05
13:14:49
2010-10-05
13:14:49
Geometry motion partition
P. Chen, L. Guo, M.
Karczewicz (Qualcomm)
JCTVC-C302
m18517
2010-10-05
12:12:15
2010-10-05
18:35:46
2010-10-12
03:33:48
Improved intra smoothing for UDI and new AIS fast mode T. Nguyen (Fraunhofer HHI)
JCTVC-C303
m18522
2010-10-05
15:04:43
2010-10-06
03:23:58
2010-10-06
03:23:58
Geometry adaptive block partitioning (GEO) cross-check
E. Francois, P. Bordes
(Technicolor)
JCTVC-C304
m18537
2010-10-06
10:03:34
2010-10-06
10:41:23
2010-10-12
04:52:10
Showing the possibility of fast CABAC
H.-Joong Kim, X. Qu (Korea
Univ.), W.-J. Han (Samsung)
256
JCTVC-C305
m18544
2010-10-06
16:54:17
2010-10-08
09:09:59
2010-10-08
09:09:59
TE7: Results for Simplification of MDDT Transform
W. Ding, Y. Shi, B. Yin
(Beijing Univ. Tech)
JCTVC-C306
m18546
2010-10-06
22:48:03
2010-10-06
22:53:07
2010-10-06
22:53:07
Investigations for representing rectangular blocks using
the merging concept
H. Schwarz, D. Marpe, T.
Wiegand (Fraunhofer HHI)
JCTVC-C307
m18548
2010-10-07
00:59:51
2010-10-07
01:05:51
2010-10-10
11:18:03
Cross Verification of Low Complexity Adaptive Coefficient J. Zan, J. Meng, M. T. Islam,
Scanning
D. He (RIM)
JCTVC-C308
2010-10-08
m18556
10:46:20
2010-12-03
09:28:17
(post
meeting)
2010-12-03
09:28:17
(post
meeting)
Cross verification of MV Coding Proposal by JCTVCC293
K. Sato (Sony)
JCTVC-C309
2010-10-08
m18557
10:49:19
2010-12-03
09:28:59
(post
meeting)
2010-12-03
09:28:59
(post
meeting)
Cross verification of DCT-IF for chroma
T. Suzuki (Sony)
JCTVC-C310
m18568
2010-10-09
05:01:43
2010-10-09
05:19:21
2010-10-09
05:19:21
TE7: Summary report for MDDT Simplification
R. Cohen (Merl), C. Yeo
(I2R), R. Joshi (Qualcomm)
JCTVC-C311
m18569
2010-10-09
06:24:29
2010-10-09
07:20:20
2010-10-12
03:11:10
B. Bross, H. Kirchhoffer, H.
Fast intra encoding for fixed maximum depth of transform
Schwarz, T. Wiegand
quadtree
(Fraunhofer HHI)
JCTVC-C312
m18571
2010-10-09
12:04:06
2010-10-09
13:07:43
2010-10-13
11:16:05
Cross-verification of JCTVC-C311: Fast intra encoding
for fixed maximum depth of transform quadtree
W.-J. Han (Samsung)
JCTVC-C313
m18572
2010-10-09
19:30:20
2010-10-09
21:10:33
2010-10-09
21:10:33
TE8: Crosscheck Result of the Transcoder of JCTVCB034 for V2V Entropy Coding in HEVC
Y. Zheng, R. Joshi, M.
Coban, M. Karczewicz
(Qualcomm)
JCTVC-C314
m18573
2010-10-10
02:39:08
2010-10-11
09:51:21
2010-10-11
11:00:22
Verification results of Samsung's proposal JCTVC-C206
J. Kim, J. Park, B. Jeon (LG
Electronics)
JCTVC-C315
m18581
2010-10-11
06:39:54
2010-10-11
06:43:38
2010-10-11
06:43:38
On the computation of PSNR and BD-Rate
D. Hoang (Zenverge)
JCTVC-C316
m18583
2010-10-11
12:55:37
2010-10-11
18:26:13
2010-10-11
18:26:13
Verification Results of Simplified Intra Smoothing
(JCTVC-C234)
T. Yamakage (Toshiba)
JCTVC-C317
m18584
2010-10-12
02:53:18
2010-10-12
03:05:50
2010-10-12
03:06:10
Breakout Report on Interpolation Filters
K. Ugur (BoG coordinator)
JCTVC-C318
m18586
2010-10-12
15:15:33
2010-10-12
15:19:57
2010-10-12
15:19:57
TMuC text on max CU size seems broken
R. Sjöberg (Ericsson)
JCTVC-C319
m18590 2010-10-14
2010-10-14
2010-10-14
BoG report: Residual quadtree structure
T. Wiegand, H. Schwarz, B.
257
03:28:32
03:41:23
19:05:29
2010-10-14
10:42:10
2010-10-14
10:42:10
JCTVC-C320
m18597
2010-10-14
10:34:43
JCTVC-C400
m18617
2010-10-21
07:41:14
JCTVC-C401
m18589
2010-10-14
02:42:48
JCTVC-C402
m18618
JCTVC-C403
Bross, A. Fuldseth, X. Wang,
W.-J. Han (BoG
coordinators)
TE12: Summary of results of evaluation of TMuC tools in
K. McCann (TE coordinator)
TE12
Meeting report of the third meeting of the Joint
Collaborative Team on Video Coding (JCT-VC),
Guangzhou, CN, 7-28 October, 2010
G. J. Sullivan, J.-R. Ohm
(chairs)
The process of Test Model development for the HEVC
initiative
G. Sullivan, J.-R. Ohm
(chairs)
2010-10-21
07:42:48
High Efficiency Video Coding (HEVC) Test Model 1 (HM
1) Encoder Description
K. McCann, B. Bross, S.
Sekiguchi (editors)
m18619
2010-10-21
07:44:39
High Efficiency Video Coding (HEVC) text specification
Working Draft 1
T. Wiegand, W.-J. Han, J.-R.
Ohm, G. J. Sullivan (editors)
JCTVC-C404
m18594
2010-10-14
06:58:49
2010-10-15
06:12:59
2010-10-15
17:10:22
HEVC software guidelines
K. Sühring, D. Flynn, F.
Bossen (on behalf of JCTVC)
JCTVC-C405
m18616
2010-10-15
08:08:05
2010-10-15
08:35:05
2010-11-01
09:34:49
T. K. Tan, G. J. Sullivan, J.Summary of HEVC working draft 1 and HEVC test model
R. Ohm (on behalf of JCT(HM)
VC)
JCTVC-C500
m18607
2010-10-15
02:59:12
2010-10-15
03:08:44
2010-10-15
03:08:44
Common test conditions and software reference
configurations
F. Bossen (on behalf of JCTVC)
JCTVC-C501
m18604
2010-10-15
02:36:59
2010-10-15
02:38:30
2010-10-30
06:18:41
Description of Core Experiment 1: Decoder-Side Motion
Vector Derivation
Y.-J. Chiu, H. Yu, Y.-W.
Huang, S. Sekiguchi (CE
coordinators)
JCTVC-C502
m18602
2010-10-15
01:57:36
2010-10-15
01:59:01
2010-10-30
16:03:26
Description of Core Experiment 2: Flexible Motion
Partitioning
E. Francois, P. Chen, X.
Zheng (CE coordinators)
JCTVC-C503
m18613
2010-10-15
03:43:20
2010-10-15
03:45:12
2010-10-30
04:44:05
Description of Core Experiment 3: Interpolation Filtering
for MC (Luma)
T. Chujoh (CE coordinator)
JCTVC-C504
m18603
2010-10-15
02:20:32
2010-10-15
02:20:46
2010-11-12
06:03:36
Description of Core Experiment 4: Interpolation for MC
(Chroma)
E. Alshina, Y.-L. Lee, P.
Chen, H. Lakshman (CE
coordinators)
JCTVC-C505
m18609
2010-10-15
03:21:14
2010-10-15
03:25:06
2010-11-02
00:25:34
Description of Core Experiment 5: Low Complexity
Entropy Coding Improvements
X. Wang (CE coordinator)
JCTVC-C506
m18608 2010-10-15
2010-10-15
2010-11-01
Description of Core Experiment 6: Intra Prediction
A Tabatabai (CE
2010-10-14
02:46:52
2010-10-15
04:39:15
258
JCTVC-C507
m18601
03:20:50
03:22:43
23:01:48
Improvement
coordinator)
2010-10-14
20:02:08
2010-10-14
20:05:45
2010-11-02
21:27:32
Description of Core Experiment 7: Alternative Transforms
R. Cohen, C. Yeo, R. Joshi
(CE coordinators)
JCTVC-C508
m18612
2010-10-15
03:43:14
2010-10-15
03:44:26
2010-10-30
04:12:21
Description of Core Experiment 8: In-loop filtering
T. Yamakage, T. Chujoh, Y.
W. Huang, K. Chono, I.S.
Chong, T. Yamamoto, J.
Lim, B. Jeon, J. Xu, M.
Narroschke (CE
coordinators)
JCTVC-C509
m18600
2010-10-14
17:55:12
2010-10-14
17:57:17
2010-11-19
15:51:40
Description of Core Experiment 9: Motion Vector Coding
J. Jung, B. Bross (CE
coordinators)
JCTVC-C510
m18606
2010-10-15
02:58:50
2010-10-15
03:08:27
2010-11-01
11:10:43
Description of Core Experiment 10: Number of Intra
Prediction Directions
K. Sugimoto (CE
coordinator)
JCTVC-C511
m18610
2010-10-15
03:35:54
2010-10-15
03:37:03
2010-11-03
22:40:47
V. Sze, K. Panusopone, J.
Description of Core Experiment 11: Coefficient Scanning
Chen, T. Nguyen, M. Coban
and Coding
(CE coordinators)
JCTVC-C512
m18605
2010-10-15
02:37:32
2010-10-15
02:38:58
2010-10-30
01:59:59
Description of Core Experiment 12: Adaptive Motion
Vector Resolution
JCTVC-C513
m18611
2010-10-15
03:41:16
2010-10-15
03:47:00
2010-11-05
22:06:15
Description of Core Experiment 13: Intra Smoothing
259
W.-J. Chien (CE coordinator)
M.Coban, B. Bross, J. Chen
(CE coordinators)
Annex B to JCT-VC report:
List of meeting participants
The participants of the third meeting of the JCT-VC, according to a sign-in sheet circulated during
the meeting (approximately 244 in total), were as follows:
1.
Alshina, Elena (Samsung)
2.
Amon, Peter (Siemens)
3.
Andersson, Kenneth (Ericsson)
4.
Aoki, Hirofumi (NEC)
5.
Asai, Kohtaro (Mitsubishi Electric)
6.
Au, Jicheng (MediaTek)
7.
Au, Oscar (Hong Kong Univ. Sci. & Tech.)
8.
Auyeung, Cheung (Sony)
9.
Bandoh, Yukihiro (NTT)
10. Bang, Gun (ETRI)
11. Barbarien, Joeri (Vrije Univ. Brussels)
12. Bici, Oguz (Nokia)
13. Bivolarsky, Lazar (Skype Technologies)
14. Boitard, Ronan (INRIA)
15. Bopardikar, Rajendra (Intel)
16. Bossen, Frank (DoCoMo USA Labs)
17. Boyce, Jill (Vidyo)
18. Bross, Benjamin (Fraunhofer HHI)
19. Brylants, Tim (Vrije Univ. Brussel)
20. Budagavi, Madhukar (Texas Instruments Inc)
21. Cao, Xiaoran (Tsinghua Univ.)
22. Chen, Jianle (Samsung)
23. Chen, Lulin (Harmonic)
24. Chen, Sijia (Cisco Systems)
25. Chien, Wei-Jung (Qualcomm)
26. Chiu, Yi-Jen (Intel)
27. Cho, In Joon (KBS)
28. Cho, Jaehee (Sejong Univ.)
29. Choi, Haechul (Hanbat Univ.)
30. Choi, Jin Soo (ETRI)
31. Choi, Kiho (Hanyang Univ.)
32. Chong, In Suk (Qualcomm)
33. Chono, Keiichi (NEC)
34. Chujoh, Takeshi (Toshiba)
35. Clare, Gordon (Orange Labs)
36. Coban, Muhammed (Qualcomm)
37. Cohen, Robert (Mitsubishi Electric)
38. Davies, Thomas (BBC R&D)
39. De Cock, Jan (Ghent Univ. - IBBT)
260
40. Ding, Wenping (Beijing Univ. Tech.)
41. Dong, Jie (Chinese Univ. Hong Kong)
42. Drugeon, Virginie (Panasonic)
43. Fan, Xiaopeng (Harbin / HKUST)
44. Fernandes, Felix (Samsung)
45. Flynn, David (BBC)
46. Fogg, Chad (Harmonic)
47. François, Edouard (Technicolor)
48. Fujibayashi, Akira (NTT DoCoMo)
49. Fukushima, Shigeru (JVC Kenwood)
50. Fuldseth, Arild (Tandberg)
51. Glantz, Alexander (Tech. Univ. Berlin)
52. Gu, Chenchen (Tencent)
53. Guillo, Laurent (INRIA)
54. Guo, Xun (MediaTek)
55. Han, Jong-Ki (Sejong Univ.)
56. Han, Woo-Jim (Samsung)
57. He, Dake (RIM)
58. He, Yun (Tsinghua Univ.)
59. Heo, Youngsu (Kyunghee Univ.)
60. Ho, Yo-Sung (GIST)
61. Hoang, Dzung (Zenverge, Inc.)
62. Hong, Soongi (Yonsei Univ.)
63. Hong, Sung-Wook (Sejong Univ.)
64. Hong, Yingjie (ZTE)
65. Horowitz, Michael (Vidyo, Inc.)
66. Hsiang, Shih-Ta (Motorola, Inc.)
67. Hsu, Chih-Wei (MediaTek)
68. Huang, Yu-Wen (MediaTek)
69. Huo, Junyan (Xi'dian Univ.)
70. Ichigaya, Atsuro (NHK)
71. Ishtiaq, Faisal (Motorola Inc.)
72. Jacobs, Marc (Vrije Univ. Brussel)
73. Jang, Euy-Doc (Korea Aerospace Univ.)
74. Jeon, Byeong Moon (LG)
75. Jeon, Byeungwoo (Sungkyunkwan Univ.)
76. Jeon, Yong-Joon (LG Electronics)
77. Jeong, Seyoon (ETRI)
78. Jin, Xin (Waseda Univ.)
79. Jung, Ji Wook (LG)
80. Jung, Joël (Orange Labs)
81. Jung, Tae-Young (Hanyang Univ.)
82. Kang, Jung Won (ETRI)
83. Karczewicz, Marta (Qualcomm)
84. Kazui, Kimihiko (Fujitsu)
261
85. Kim, Hae Kwang (Sejong Univ.)
86. Kim, Hui Yong (ETRI)
87. Kim, Hyun-Dong (Sejong Univ.)
88. Kim, Il Koo (Samsung)
89. Kim, In Kwon (Galaxia Commun.)
90. Kim, Jae-Gon (Korea Aerosp. Univ.)
91. Kim, Jeong-Pil (Sejong Univ.)
92. Kim, Jongho (Yonsei Univ.)
93. Kim, Kibaek (Hanyang Univ.)
94. Kim, Kyungyong (Kwangwoon Univ.)
95. Kim, Munchurl (KAIST)
96. Kim, Seong Hoon (Galaxia Commun.)
97. Kim, Seongwan (Yonsei Univ.)
98. Klomp, Sven (Leibniz Univ. Hannover)
99. Kogure, Takuyo (Panasonic )
100. Kondo, Kenji (Sony)
101. Koyama, Jumpei (Fujitsu)
102. Krutz, Andreas (Tech. Univ. Berlin)
103. Lai, Polin (Wang) (Samsung)
104. Lainema, Jani (Nokia)
105. Lakshman, Haricharan (Fraunhofer HHI)
106. Laroche, Guillaume (Canon)
107. Lee, Ju Ock (Sejong Univ.)
108. Lee, Sangyoun (Yonsei Univ.)
109. Lee, Shanfu (Huawei)
110. Lee, Yoomjim (Kyunghee Univ.)
111. Lee, Yung-Lyul (Sejong Univ.)
112. Lei, Shawmin (MediaTek)
113. Li, Guichun (Huawei)
114. Li, Junlin (Cisco Systems)
115. Li, Ming (ZTE)
116. Li, Wei (Xi'dian Univ.)
117. Li, Zhengguo (I2R)
118. Lim, Chongsoon (Panasonic Singapore Labs)
119. Lim, Jaehyun (LG Electronics)
120. Lim, Jeongyeon (SK Telecom)
121. Lim, Sung-Chang (ETRI)
122. Lim, Woong (Kwangwoon Univ.)
123. Lin, Sixin (Huawei)
124. Lin, Yongbing (Huawei)
125. Ling, Nam (Santa Clara Univ.)
126. Liu, Hui (Cisco Systems)
127. Liu, Lingzhi (Huawei / Hisilicon)
128. Liu, Yu (ASTRI)
129. Lopez, Patrick (Technicolor)
262
130. Luthra, Ajay (Motorola)
131. Ma, Ran (Shanghai Univ.)
132. Ma, Siwei (Peking Univ.)
133. Mao, Xunan (Tencent)
134. Marpe, Detlev (Fraunhofer HHI)
135. Martin-Cocher, Gaëlle (RIM)
136. Matsuo, Shohei (NTT)
137. McCann, Ken (Zetacast / Samsung)
138. Mochizuki, Seiji (Renesas)
139. Moon, Joo-Hee (Sejong Univ.)
140. Mrak, Marta (BBC)
141. Murakami, Tokumichi (Mitsubishi Electric)
142. Murakami, Tomokazu (Hitachi)
143. Narroschke, Matthias (Panasonic R&D Germany)
144. Nishi, Takahiro (Panasonic)
145. Oh, Seoung-Jun (Kwangwoon Univ.)
146. Ohm, Jens-Rainer (RWTH Aachen Univ.)
147. O'loughlin, Daniel (Research in Motion)
148. Onno, Patrice (Canon)
149. Panusopone, Krit (Motorola)
150. Park, Gwang Hoon (Kyunghee Univ.)
151. Park, Hyoungmee (Sejong Univ.)
152. Park, Jeonghoon (Samsung)
153. Park, Seungwook (LG)
154. Park, Un-Ki (Korea Aerospace Univ.)
155. Paschalakis, Stavros (Mitsubishi Electric R&D Europe B.V.)
156. Peng, Wen-Hsiao (NCTU/ITRI)
157. Qu, Xiaochao (Korea Univ.)
158. Raad, Mohamad (RaadTech Consulting)
159. Ralston, John (Droplet Tech)
160. Ridge, Justin (Nokia)
161. Sakaida, Shinichi (NHK)
162. Sampedro, Jesus (Polycom)
163. Sasai, Hisao (Panasonic)
164. Sato, Kazushi (Sony)
165. Saxena, Ankur (Samsung Telecom. America)
166. Schelkens, Peter (Vrije Univ. Brussel)
167. Segall, Andrew (Sharp)
168. Sekiguchi, Shun-ichi (Mitsubishi Electric)
169. Senzaki, Kenta (NEC)
170. Seo, Chan-Won (Sejong Univ.)
171. Shen, Bazhong (Broadcom)
172. Shibahara, Youji (Panasonic)
173. Shima, Masato (Canon)
174. Shimizu, Shinya (NTT)
263
175. Shiodera, Taichiro (Toshiba)
176. Sim, Donggyu (Kwangwoon Univ.)
177. Sjöberg, Rickard (Ericsson)
178. Song, Jin (Huawei)
179. Sprljan, Nikola (Mitsubishi Electric R&D Centre Europe)
180. Sühring, Karsten (Fraunhofer HHI)
181. Sugimoto, Kazuo (Mitsubishi Electric)
182. Sugio, Toshiyasu (Panasonic)
183. Suh, Jung Suk (Samsung)
184. Sullivan, Gary (Microsoft)
185. Sun, Huifang (Mitsubishi Electric)
186. Suzuki, Teruhiko (Sony)
187. Sze, Vivienne (Texas Instruments)
188. Tabatabai, Ali (Sony Electronics)
189. Takamura, Seishi (NTT)
190. Tan, Thiow Keng (NTT DoCoMo)
191. Tan, Yih Han (I2R)
192. Tanizawa, Akiyuki (Toshiba)
193. Temmermom, Frederik (Vrije Univ. Brussel)
194. Tonomura, Yoshide (NTT)
195. Topiwala, Pankaj (FastVDO)
196. Tsukagushi, Ikuo (Sony)
197. Tung, Yi-Shin (MStar Semiconductor)
198. Ueda, Motoharu (JVC Kenwood)
199. Ugur, Kemal (Nokia)
200. Vermeirsch, Kenneth (Ghent Univ. -- IBBT)
201. Wahadaniah, Viktor (Panasonic Singapore Labs)
202. Wan, Wade (Broadcom)
203. Wang, David (Polycom)
204. Wang, Pulin (Huawei)
205. Wang, Xianglin (Qualcomm)
206. Wang, Ye-Kui (Huawei)
207. Wedi, Thomas (Panasonic)
208. Wegner, Krzysztof (Poznań Univ. Tech.)
209. Wen, Xing (Hong Kong Univ. Sci. & Tech.)
210. Weng, Ying (BBC)
211. Wiegand, Thomas (Fraunhofer HHI)
212. Winken, Martin (Fraunhofer HHI)
213. Won, Kwanghyun (Sungkyunkwan Univ.)
214. Wu, Ping (Mitsubishi Electric)
215. Wu, Zhixiong (OKI)
216. Xu, Jizheng (Microsoft)
217. Xu, Lidong (Intel)
218. Xu, Yian (Cisco Systems)
219. Yamakage, Tomoo (Toshiba)
264
220. Yamamoto, Tomoyuki (Sharp)
221. Yang, Fuzheng (Xi'dian Univ.)
222. Yang, Haitao (Huawei)
223. Yang, Hongzhang (Freescale Semiconductor)
224. Yang, Jungyoup (Sungkyunkwan Univ.)
225. Yang, Ming Yuan (Huawei)
226. Yang, Zhijie (Broadcom)
227. Ye, Yan (Dolby Labs)
228. Yea, Sehoon (LG)
229. Yeo, Chuohao (I2R)
230. Yoon, Daeil (Sejong Univ.)
231. Yoshino, Tomonobu (KDDI)
232. Yu, Haoping (Huawei)
233. Yu, Lu (Zhejiang Univ.)
234. Yu, Yong (Broadcom)
235. Yuan, Yuan (Tsinghua Univ.)
236. Zhang, Li (Peking Univ.)
237. Zhang, Louis (AMD)
238. Zhang, Wen (ZTE)
239. Zhao, Haiwu (Shanghai Univ.)
240. Zhao, Qing Yi (Tencent)
241. Zheng, Jianhua (Huawei)
242. Zheng, Xiaozhen (Huawei)
243. Zhou, Minhua (TI)
244. Zou, Bill (DTS)
265
Annex I – Audio report
Source: Schuyler Quackenbush, Chair, Audio Subgroup
1
2
Opening Audio Plenary ......................................................................................................... 267
Administrative matters .......................................................................................................... 267
2.1 Communications from the Chair
267
2.2 Approval of agenda and allocation of contributions 267
2.3 Creation of Task Groups
267
2.4 Approval of previous meeting report 267
2.5 Review of AHG reports
267
2.6 Joint meetings 267
2.7 Received National Body Comments and Liaison matters 267
2.8 Plenary Discussion 267
3 Record of AhG meetings ....................................................................................................... 267
3.1 AhG Meeting on USAC -- Sunday 1000-1800
267
4 Task group activities ............................................................................................................. 275
4.1 Joint Meeting 275
4.1.1 With Requirements on Audio for HEVC (Wed 1400-1500)....................................... 275
4.2 Task Group discussions
275
4.2.1 MPEG-2, MPEG-4, MPEG-7, Audio Conformance, Reference Software, MPEG
Surround ............................................................................................................................... 275
4.2.2 MPEG-D Spatial Audio Object Coding ...................................................................... 276
4.2.3 MPEG-D Unified Speech and Audio Coding ............................................................. 277
5 Closing Audio Plenary and meeting deliverables ................................................................. 289
5.1 Plenary discussions 289
5.2 Responses to Liaison and NB comments
289
5.3 Recommendations for final plenary 289
5.4 Establishment of Ad-hoc Groups
290
5.5 Approval of output documents
290
5.6 Press statement
290
5.7 Agenda for next meeting
290
5.8 All other business
290
5.9 Closing of the meeting
290
Annex A Participants ............................................................................................................... 291
Annex B Audio Contributions and Schedule .......................................................................... 292
Annex C Task Groups ............................................................................................................. 298
Annex D Output Documents ................................................................................................... 299
Annex E Agenda for the 95th MPEG Audio Meeting ............................................................. 299
266
1
Opening Audio Plenary
The MPEG Audio Subgroup meeting was held during the 94th meeting of WG11, October 11-15,
2010, Guangzhou, China. The list of participants is given in 1.
2
2.1
Administrative matters
Communications from the Chair
The Chair summarised the issues raised at the Sunday evening Chair’s meeting, proposed task groups for the week, and
proposed agenda items for discussion in Audio plenary.
2.2
Approval of agenda and allocation of contributions
The agenda and schedule for the meeting was discussed, edited and approved. It shows the
documents contributed to this meeting and presented to the Audio Subgroup, either in the task
groups or in Audio plenary. The Chair brought relevant documents from Requirements, Systems to
the attention of the group. It was revised in the course of the week to reflect the progress of the
meeting, and the final version is shown in 2.
2.3
Creation of Task Groups
Task groups were convened for the duration of the MPEG meeting, as shown in 3. Results of task
group activities are reported below.
2.4
Approval of previous meeting report
The Chair asked for approval of the 93rd Audio Subgroup meeting report, which was registered as a
contribution. There was some discussion and wording concerning discussion in the Audio closing
plenary was modified, and the revised report was approved.
2.5
Review of AHG reports
There were no requests to review any of the AHG reports.
2.6
Joint meetings
Who
Req,
Audio
2.7
What
Audio for
HEVC
3
3.1
When
Wed,
14001500
Received National Body Comments and Liaison matters
Num.
M17985
2.8
Where
Audio
Source
ITU-T SG 16 to SC 29/WG 11
Approval of a New Question on Telepresence
systems
Respond
S. Quackenbush
Plenary Discussion
Record of AhG meetings
AhG Meeting on USAC -- Sunday 1000-1800
USAC Text and Reference Software corrections
Max Neuendorf, FhG, presented
m18456
Corrections to Reference Software and CD of
USAC
267
Max Neuendorf
This contribution proposes changed to the USAC CD text and Reference Software. The various
proposed changes are categorized into the following three categories:
Changes to the CD text (which are only editorial)
 Description of spectral noiseless coding
o Consistent use of variables
o Clarification of arithmetic coding context
o Correction to numerical constants
 Description of overlap-add for synthesis of output from adjacent coded frames
 Clarifications to MDCT-based TCX description
 Decoding coefficents for complex stereo prediction in MDCT
 Combination of SBR and MPS 212
 Re-organization of CD text to conform to ISO directive
Changes to the Reference Software
 Recalculation of pitch gain for use in the bass postfilter in TCX to ACELP transitions
 Corrections to handling higher bands in 32-band QMF analysis filterbank
o Should not initialize high QMF bands to zero in unified stereo coding
Changes to both CD text and Reference Software
 Simplification of AVQ bit stream syntax. This re-organizes the bit stream but does not affect
the decoded waveform
o Make all integer serialization msb first
o Remove interleaving of AVQ coefficients
The AhG recommends to
 Adopt the proposed changes to the CD text (which are only editorial) into a “Study on
USAC CD” output document.
 Make the proposed changes to the USAC reference software
 Adopt the proposed changes to the CD text and make the associated changes to the USAC
reference software
Schuyler Quackenbush, ARL, presented
m18077
Draft USAC CE Status and Workplan
S. Quackenbush
The presenter asked USAC CE proponents to please check the CE status tables and to bring them
up to date so that producing the output document on Friday is that much easier.
Improved bass- post filter
Philippe Gournay , VoiceAge, presented
m18435
VoiceAge listening test results for the
Philippe Gournay, Roch
enhanced bass-postfilter CE
Lefebvre
The contribution presented 16 kb/s listening test results for this CE. Systems under test are RM7
and RM7+CE. Seven items containing singing voice with music background were used as items in
addition to the CfP test items. There was no difference for absolute scores. When considering
differential scores, 4 of the 19 items are better and the mean is better at 95% level of significance.
Max Neuendorf, FhG, presented
m18451
FhG listening test report on CE on improving
Guillaume Fuchs, Max
the USAC bass post-filter
Neuendorf
The contribution presented 16 kb/s mono listening test results for this CE. There is no difference for
absolute scores. When considering differential scores, 2 items are better, 1 item worse at the 95%
level of significance.
The contribution also reports that the CE software, as integrated into the decoder, was run and
decoded the listening test bit streams to produce exactly the listening test decoded waveforms.
David Virette, Huawei, presented
268
m18467
Report on cross-check listening test for the
David Virette
CE on improved Bass-post filter for USAC
The contribution presented 16 kb/s mono listening test results for this CE. There is no difference for
absolute scores. When considering differential scores, 1 item worse (es01) at the 95% level of
significance.
Kristofer Kjörling, Dolby, presented
m18379
Finalization of CE on an improved bass- post
Barbara Resch, Leif
filter operation for the ACELP of USAC
Sehlström, Heiko Purnhagen,
Lars Villemoes, Kristofer
Kjörling, Bruno Bessette,
Philippe Gournay
The contribution presented 16 kb/s mono listening test results for this CE. There is no difference for
absolute scores. When considering differential scores, 4 items better, the mean better, none worse at
the 95% level of significance.
The contribution also presents a overview of the technology and a summary of the cross-check
results.
It notes that the current bass post-filter, used in ACELP coding mode, helps enhance the vocal
signal. However in the case that there is background music with strong low-frequency harmonics,
when the coder switches between ACELP, TXCX and FD modes, the suppression of the musical
low harmonics comes and goes and is quite audible.
When considering individual sites, for 2 items and for the mean, 2 of 4 sites agree on improvement.
When all data is pooled, there are a total of 32 listeners. When considering differential scores, 4
items better, the mean better and none worse at the 95% level of significance.
The AhG recommends adopting the CE technology into the “Study on USAC CD” output document.
Complexity reduction for time warping
Takeshi Norimatsu, Panasonic, presented
m18360
Panasonic cross check report on complexity
reduction for time warping in USAC
Takeshi Norimatsu, Tomokazu
Ishikawa, Haishan Zhong, Dan
Zhao, Kok Seng Chong
The contribution reports results for a 64 kb/s stereo listening test. It showed no significant
differences in either absolute or differential scores, for any item or for the mean.
Heiko Purnhagen, Dolby, presented
m18375
Dolby listening test results for CE on
Heiko Purnhagen, Kristofer
complexity reduction for time warping in
Kjörling
USAC
The contribution reports results for a 64 kb/s stereo listening test. It showed no significant
differences for absolute scores. For differential scores, 1 item was better.
Markus Multrus, FhG, presented
m18452
Completion of the Core Experiment on
Stefan Bayer
Reducing the Complexity of the USAC Time
Warping
The contribution gives an overview of the CE technology and a summary of the listening test results.
The technology uses time warping to reduce the pitch interval variations over a frame such that the
pitch epochs have a more nearly harmonic frequency representation and thus can be more
efficiently coded by the FD coder mode.
The following table presents computational complexity. All numbers are WMOPS.
RM8
RM8+CE
TW tool
19.6
9.5
only
Full
33.4
23.3
269
Decoder
The contribution reports results for a 64 kb/s stereo listening test conducted at FhG. It showed no
significant differences in either absolute or differential scores, for any item or for the mean.
When listening test data are pooled over all test sites (25 listeners), it showed no significant
differences in either absolute or differential scores, for any item or for the mean.
Heiko Purnhagen, Dolby, reported that he decoded the listening test bitstreams (which were the
RM8 bitstreams) using the CE proponent decoder and produced the waveforms that were used in
the listening test. He then decoded the bitstreams using the RM8 decoder and produced the
reference RM8 decoded waveforms.
The AhG recommends adopting the CE technology into the “Study on USAC CD” output document.
Improved SBR in USAC
Toru Chinen, Sony, presented
m18398
Sony listening test report on improved SBR in
Toru Chinen, Masayuki
USAC
Nishiguchi
The contribution reports the results of a 12 kb/s mono listening test. It showed no significant
differences in either absolute or differential scores, for any item or for the mean.
Max Neuendorf, FhG, presented
m18431
FhG Listening Test Report – improved SBR
Stephan Wilde, Max
Neuendorf
The contribution reports the results of a 12 kb/s mono listening test. It showed no significant
differences in absolute scores. For differential scores, 2 items were better at the 95% level of
significance.
The contribution also reports on software verification: It confirms that the listening test bitstreams
did decode to the listening test waveforms. Furthermore, it confirms that WD7 bitstreams decoded
to the WD7 reference waveforms.
Kristofer Kjörling, Dolby, presented
m18378
Finalization of CE on improved SBR
Kristofer Kjörling, Leif
Sehlström
The contribution gives an overview of the CE technology, presents new listening test results and
presents all listening test results as pooled data.
In RM7, SBR uses much of the machinery present in MPEG-4 SBR. One shortcoming present in
SBR is that the copy-up envelope can have considerable discontinuities (as, for example, shown in
es01), which might be so large that adjustment limiters prevent a target envelope from being
achieved. The CE technology is a “pre-adjustment” gain stage which insures that the high-band
envelope adjuster is able to make the adjustments needed in the high bank.
Computational complexity is quite low, approximately 0.1 WMPOS.
A control bit in the SBR header signals that the tool should be used from that point forward. Toru
Chinen, Sony, noted that ARIB in Japan are currently using HE-AAC and send an SBR header very
500 ms. Kristofer Kjörling, Dolby, noted that the SBR encoder would usually send the SBR header
whenever it was advantageous to change SBR configuration (which is specified in the SBR header).
For the pooled data, two items are better at the 95% level of significance.
The AhG recommends adopting the CE technology into the “Study on USAC CD” output document.
Harmonic transposer
Kihyun Choo, Samsung, presented
m18372
Crosscheck report on harmonic transposer
Kihyun Choo, Miyoung Kim,
CEs
Eunmi Oh
The contribution reports the results of a 16 kb/s mono listening test that compares all configurations
of the two CEs. There were no differences with respect to the absolute scores.
270
With respect to differential scores, it reports the following:
 QMF – FFT: 1 better, 1 worse.
 QMFxp – QMF: 1 worse.
 FFTxp – FFT: 1 worse
Kimitaka Tsutsumi, NTT DOCOMO, presented
m18466
NTT DOCOMO Cross-check Report on
Kimitaka Tsutsumi, Kei
Improved Harmonic Transposer in USAC
Kikuiri, Nobuhiko Naka
The contribution reports the results of a 16 kb/s mono listening test that compares all configurations
of the two CEs. There were no differences with respect to the absolute scores.
With respect to differential scores, it reports the following:
 QMF – FFT: 3 better
 QMFxp – QMF: no difference.
 FFTxp – FFT: 3 better, mean better
In addition, the contribution reports that NTT DOCOMO confirms that the listening test bitstreams
decode exactly to the listening test decoded waveforms.
Jeff Huang, Qualcomm, presented
m18459
Crosscheck listening test report for USAC on
Jeff Huang
FFT and QMF harmonic transposers
The contribution reports the results of a 16 kb/s mono listening test that compares all configurations
of the two CEs. There were no differences with respect to the absolute scores.
With respect to differential scores, it reports the following:
 QMF – FFT: no difference
David Virette, Huawei, presented
m18468
Report on cross-check listening test for the
David Virette
Ces on QMF based harmonic transposer and
improved harmonic transposer in USAC
The contribution reports the results of a 16 kb/s mono listening test that compares all configurations
of the two CEs. There were no differences with respect to the absolute scores.
With respect to differential scores, it reports the following:
 QMF – FFT: 1 better
 QMFxp – QMF: no difference
 FFTxp – FFT: no difference
In addition, the contribution reports that Huawei confirms that the listening test bitstreams decode
exactly to the listening test decoded waveforms.
Zhong Haishan, Panasonic, presented
m18501
Panasonic crosscheck report on improved
Zhong Haishan, Chong Kok
harmonic transposer
Seng, Zhao Dan, Takeshi
Norimatsu, Tomokazu
Ishikawa, Neo Sua Hong
The contribution reports the results of a 16 kb/s mono listening test that compares all configurations
of the two CEs. There were no differences with respect to the absolute scores.
With respect to differential scores, it reports the following:
 QMFxp – QMF: 2 better
 FFTxp – FFT: 5 better
 QMFxp – FFTxp: no difference
Zhong Haishan, Panasonic, presented
m18386
Finalization of CE on QMF based harmonic
271
Haishan Zhong, Kok Seng
transposer
Chong, Takeshi Norimatsu,
Tomokazu Ishikawa, Lars
Villemoes, Per Ekstrand,
Kristofer Kjörling, Stephan
Wilde, Sascha Disch, Frederik
Nagel, Max Neuendorf ,
The contribution gives an overview of the CE technology, including complexity information, and
also presents a summary of all listening test results.
The FFT transposer has high frequency resolution but also high complexity, while the QMF
transposer has much lower complexity, as shown in the following table:
Total
WMOPS
WMOPS
WMOPS
percentage Total WMOPS
Configuration
percentage
Transposer
Transposer
Decoder
Decoder
only
FFT based Harmonic Transposer
with 10% oversampling frames
5.79
100%
9.42
100%
(WD7)
QMF based harmonic transposer
0.86
14.8%
4.49
47.7%
When all listening test data is combined, analysis of differential MUSHRA scores shows:
 QMF – FFT: no difference
Harmonic transposer – Cross products technology
Kristofer Kjörling, Dolby, presented
m18384
Finalization of CE on improved harmonic
Lars Villemoes, Per Ekstrand,
transposer in USAC
Sascha Disch, Frederik Nagel
The contribution presented an overview of the RM8 transposer technology as compared to the CE
transposer technologies. It makes the point that the RM8 transposer results in many missing
harmonics which are perceived as “ghost” higher fundamentals. The proposed cross-product
technology permits “filling-in” the missing fundamentals via construction filterbank signals as a
sum of adjacent low-band filter signals.
QMF suffers when signals have very low fundamentals, since its low frequency resolution results in
many distinct fundamentals mapping to the same quantized representation.
When all listening test data is combined, analysis of differential MUSHRA scores shows:
 QMFxp – QMF: 4 better, mean better
 FFTxp – FFT: 4 better, mean better
When looking at individual test sites:
 QMFxp – QMF: no strong consensus
 FFTxp – FFT: 3 items for which at least 3 of 6 agree
The AhG recommends adopting the cross-product technology into the “Study on USAC CD” output
document.
Harmonic transposer – QMF technology
Kristofer Kjörling, Dolby, presented
m18389
Overview of performance of transposer
proposals, and suggested decoding modes
272
Kristofer Kjörling, Haishan
Zhong, Kok Seng Chong,
Takeshi Norimatsu, Tomokazu
Ishikawa, Lars Villemoes, Per
Ekstrand, Stephan Wilde,
Sascha Disch, Frederik Nagel,
Max Neuendorf ,
The contribution presents an overview of the proposed transposer technology. There are two CEs
which propose the following replacements or additions to the current RM7 FFT transposer
technology:
 QMF (replacement or addition)
 Cross products (addition)
It focuses on the 16 kb/s mono and stereo operating point because here the transposer requires the
greatest fraction of decoder resources.
For mono:
 QMF – 4.5 MOPS (reduces total decoder complexity to 48% of RM7)
 Cross products
o FFT – 7.4 MOPS (reduces total decoder complexity to 78% of RM7)
o QMF – 4.7 MOPS (reduces total decoder complexity to 50% of RM7)
For stereo:
 QMF – 8.6 MOPS (reduces total decoder complexity to 63% of RM7)
 Cross products
o FFT – 11.5 MOPS (reduces total decoder complexity to 85% of RM7)
o QMF – 8.8 MOPS (reduces total decoder complexity to 65% of RM7)
It notes that the quality of the QMF transposer is comparable to that of the FFT transposer. It further
notes that incorporating cross products into the transposer provides a significant increase in quality
while requiring either a decrease in complexity (FFT) or a modest increase in complexity (QMF). If
a decoder with FFT and cross products is used as a baseline of 100%, then the QMF transproser
with cross products results in a decoder that is 65% of the baseline complexity.
In terms of quality, when all data is pooled, 1 item is better for the differential score QMFxp –
FFTxp, however there is not strong agreement amongst the results from individual test sites.
The contribution proposes that there be a “Low Power” and a “High Quality” decoding modes,
where a single bitstream syntax can be decoded in either decoding mode. For each decoding mode,
the transposers in each would be:
 Low Power: QMFxp
 High Quality: FFTxp
Discussion
The Chair noted that, in USAC, there is no “low power” mode defined in USAC. Hence, the Chair
feels that the proposal is whether to have a FFTxp transposer (low complexity) or a QMFxp
transposer (very low complexity) or both.
Max Neuendorf, FhG, notes that when the individual test sites compare QMFxp versus FFTxp, for
many sites there was judged to be 1 or more item worse (i.e. QMFxp worse than FFTxp).
Werner Oomen, Philips, felt that there should be only one transposer in USAC and that the group
should pick one.
The Chair felt that there was not consensus in the group to make a decision at this time. The topic
will be brought up again later in the MPEG week.
T/F domain post-processing
Kihyun Choo, Samsung, presented
m18371
Crosscheck report on adaptive T/F domain
Kihyun Choo, Miyoung Kim,
post-processing for USAC
Eunmi Oh
The contribution presents results of a listening test comparing WD6+CE and WD6. The operating
points and test results for the statistic (WD6+CE - WD6) were
 12 kb/s mono
273

o Absolute scores: no difference
o Differential scores: 1 better
8 kb/s mono
o Absolute scores: no difference
o Differential scores: 2 better
Heiko Purnhagen, Dolby, presented
m18373
Dolby listening test results for CE on T/F
Heiko Purnhagen, Kristofer
post-processing in USAC
Kjörling
The contribution presents results of a listening test comparing WD6+CE and WD6. The operating
points and test results for the statistic (WD6+CE - WD6) were
 12 kb/s mono
o Absolute scores: no difference
o Differential scores: no difference
 8 kb/s mono
o Absolute scores: no difference
o Differential scores: 2 better, 1 worse (Normal distribution) or 1 better (Student t
distribution)
Jeff Huang, Qualcomm, presented
m18461
Crosscheck listening test report for USAC on
Jeff Huang
time frequency domain post-processing
The contribution presents results of a listening test comparing WD6+CE and WD6. The operating
points and test results for the statistic (WD6+CE - WD6) were
 12 kb/s mono
o Absolute scores: no difference
o Differential scores: : 3 better, 1 worse
David Virette, Huawei, presented
m18471
Finalization of CE on adaptive T/F domain
David Virette, Wei Xiao
post-processing for USAC
The contribution presents results of a listening test comparing WD6+CE and WD6. The operating
points and test results for the statistic (WD6+CE - WD6) were
 12 kb/s mono
o Absolute scores: no difference
o Differential scores: 3 better, mean better
 8 kb/s mono
o Absolute scores: no difference
o Differential scores: 5 better, mean better
When data from all test sites is pooled
 12 kb/s mono
o Absolute scores: no difference
o Differential scores: 4 better, mean better
 8 kb/s mono
o Absolute scores: no difference
o Differential scores: 5 better, mean better
It reviewed the complexity of the CE technology, which is shown in the following table:
Average
Maximum
RM6
PCU
PCU
mono@8kbps
0.24
0.56
mono@12kbps
0.31
0.73
8
274
The presenter noted that the post-processing control bits are transmitted in the bit stream only if the
coding mode is LP.
Kristofer Kjörling, Dolby, noted that the Spectrum Flattening Post Processing seems similar to the
“Improved SBR” tool, which also helps to flatten the SBR HF envelope. The Chair asked how this
post processor compares to the “Improved Bass Post-Filter” Philippe Gournay , VoiceAge noted
that the Bass Post Filter was limited to processing the signal below 500 Hz. The presenter noted that
this technology does noise shaping across the spectrum.
The Chair felt that there was not consensus in the group to make a decision at this time. The topic
will be brought up again later in the MPEG week.
4
Task group activities
4.1
Joint Meeting
4.1.1
With Requirements on Audio for HEVC (Wed 1400-1500)
The Requirements and Audio Chairs reviewed a document that presented a number of application
scenarios for new audio work, the most prominent of which was audio coding and presentation for
visual display systems that make use of High Efficiency Video Coding (HEVC).
Such visual displays might be Ultra-HD (UHD) devices, such as 4K x 2K displays. In such displays,
a much closer viewing distance is feasible and perhaps desirable. At such viewing distances there is
both visual and audio envelopment. This could have significant impact on audio presentation such
that much more accurate sound localization in terms of direction and distance is desirable. If there is
only one viewer, it might be that aspects of the audio presentation might be individualized in some
meaningful way.
The Chairs captured comments from the group, with respect to relevant application scenarios,
relevant technology and, most importantly, relevant requirements. The results of the joint meeting
are captured in the output document “Audio for HEVC.”
4.2
Task Group discussions
4.2.1
MPEG-2, MPEG-4, MPEG-7, Audio Conformance, Reference Software, MPEG
Surround
Ferenc Kraemer, Dolby, presented
m18406
Proposed addition to Intensity Stereo in
Ferenc Kraemer, Heiko
ISO/IEC 14496-26:2009, Audio conformance
Purnhagen
The contribution notes that many AAC encoders and decoders have a bug in the intensity stereo
processing. The bug entails the interaction between the intensity codebooks indicated
(INTENSITY_HCB and INTENSITY_HCB2, numbers 14 and 15) and the ms_mask bit for a scale
factor band. The correct behaviour is shown in the following table:
Intensity in-phase
IS
Codebook
M/S
Bitmask
Field
Phase
Position
of Left and
Right
Intensity out-of-phase
INTENSITY_HCB INTENSITY_HCB INTENSITY_HCB2 INTENSITY_HCB2
(#15)
(#15)
(#14)
(#14)
0
1
0
1
0°
180°
180°
0°
275
The conbribution notes that the ms_mask_present flag can be used to “preset” the values of the
ms_mask array. A buggy encoder or decoder ignores the role that the “preset” or “implicit” values
of the ms_mask bit plays when constructing the desired intensity phase.
The Chair asked the group to consider whether additional text in the informative part of the MPEG4 AAC and MPEG-4 AAC specifications to clarify the new encoder behaviour might be an
appropriate additional action.
It was the consensus of the Audio Subgroup to issue two DCOR, one against MPEG-2
Conformance and one against MPEG-4 Audio Conformance.
Ferenc Kraemer, Dolby, presented
m18426
Additional information on MPEG Surround
Andreas Hölzer, Christian
conformance testing
Ertel, Markus Lohwasser,
Michael Härtl, Ferenc
Kraemer, Frans de Bont
The contribution presents data on the output of the MPEG Surround conformance tool. It report on
the decoding performance, in terms of LSB’s of difference relative to a given reference, of a
number of important platforms, e.g. floating point and fixed point.
The presenter notes that MPEG Surround conformance criteria must be defined and expects to bring
more information to the next meeting.
4.2.2
MPEG-D Spatial Audio Object Coding
Leon Terentiv, FhG, presented
m18436
Report on corrections for MPEG SAOC
Jonas Engdegård, Heiko
Purnhagen, Oliver Hellmuth,
Jürgen Herre, Cornelia Falch,
Leon Terentiv, Maria Luis
Valero, Johannes Hilpert,
Andreas Hölzer, Werner
Oomen
The contribution proposes two small corrections to the Low Delay MPEG Surround part of the
SAOC specification.
It was the consensus of the Audio Subgroup to incorporate the proposed changes into a revised
Defect Report that is an output of this meeting.
Oliver Hellmuth, FhG, presented
m18437
Revised draft of SAOC verification test report
Jonas Engdegård, Heiko
Purnhagen, Oliver Hellmuth,
Jürgen Herre, Cornelia Falch,
Leon Terentiv, Maria Luis
Valero, Johannes Hilpert,
Andreas Hölzer, Werner
Oomen
The contribution is a draft of the SAOC test report. The presenter reviewed the document. It was
noted that perhas results for individual items should be added to the report, but as an Annex. The
Chair noted that figures that graphically
It is the consensus of the ASG to
o Issue the SAOC verification test report at this meeting, with an editing period. This will be a
public document.
o Look forward to the “Karaoke” solution at the 95th meeting. It is expected to be added to the
SAOC Defect Report (from this meeting) and to issue as a DCOR at the 95th meeting.
o Based on info at the 95th meeting, decide on whether to update the VRT with a Karaoke test.
Such a revised VTR would issue at the 96th meeting.
276
4.2.3
MPEG-D Unified Speech and Audio Coding
CE Educational Material
Markus Multrus, FhG, presented
m18454
Informative Encoder Description for USAC
Vignesh Subbaraman, Markus
Improved Noiseless Coding CE
Multrus, Kihyun Choo
The contribution contains text for the informative annex of the USAC specification that describes
how to build the spectral arithmetic coding tool for the USAC encoder. The presenter verified that
the MPEG Reference Encoder has full source code for the USAC Improved Noiseless Coding CE
functionality.
Kei Kikuiri, NTT DOCOMO, presented
m18473
Educational Information on Encoder
Kei Kikuiri, Atsushi
Implementation of inter-TES tool in USAC
Yamaguchi, Nobuhiko Naka
The contribution contains text for the informative annex of the USAC specification which describes
how to build the inter-TES tool in the USAC encoder. The presenter verified that the MPEG
Reference Encoder has only bitstream syntax support for the USAC inter-TES tool.
It was the consensus of the Audio Subgroup that the contribution provides sufficient educational
information. It was noted that Markus Multrus, FhG, reported in an email (Sep 24, 2010, 16:35
CEST) that the inter-TES normative decoder software has been integrated into the USAC reference
decoder. Hence this CE is concluded.
USAC Performance
Gregory Pallone, Orange Labs, presented
m18434
Report on USAC performance
Gregory Pallone, Pierrick
Philippe
The contribution reports on a listening test that compares the performance of USAC, MPEG-4 AAC
and MPEG-4 HE-AAC. Systems under test were:
 aac+64: HE AAC encoder @ 64kbps
 aac96: AAC encoder @ 96kbps
 aac128: AAC encoder @ 128kbps
 usac64: reference quality bitstream of WD7 @ 64 kbps
 usac96: reference quality bitstream of WD7+CE (improved stereo) @ 96 kbps
There were ten subjects after post-screening. When averaged over all listeners and all test items,
USAC demonstrates a slight increase in performance (in the mean) for the range of bit rates tested.
USAC Configuration, Payload and Transport Issues
Markus Multrus, FhG, presented
m18480
Comments on further USAC investigation
Max Neuendorf, Markus
Multrus, Stefan Doehla,
Werner Oomen, Heiko
Purnhagen
This contribution presents a more detailed list of requirements that should be addressed in the final
steps of the USAC development work.
It divides the requirements into three broad categories:
 Configuration of decoder
 Payload (i.e. Access Unit)
 Transport (of payloads)
Contributions on this topic should refer to what aspect of the “requirements” (as put forth in this
document) is addressed.
It was agreed to make the body of the contribution a section of the USAC CE Status and Workplan
document.
277
MPEG Reference Encoder
David Virette, Huawei, presented
m18470
A new signal classifier for USAC reference
David Virette, Lijing Xu, Wei
encoder
Xiao
The contribution describes a new software module for the USAC MPEG Reference Encoder that
analyzes the input signal and determines which coding mode should be applied. The performance of
this classifier was compared to both the current MPEG Reference Encoder and the RM7 bitstream
classification decisions.
It was the consensus of the Audio Subgroup to incorporate this new module into the SVN trunk of
the USAC Reference Encoder.
Jeongook Song, Yonsei University, presented
m18429
Status report on USAC Reference Software
Jeongook Song, Henney Oh,
JAME
Hong-Goo Kang
The contribution reports on the status of the JAME USAC encoder project. In the AhG period
running up to the 94th MPEG meeting, work was done to revise the psychoacoustic model tool and
corresponding bit-allocation loop tool, but that work is not yet complete and so no new code release
is available at this MPEG meeting.
The Chair hopes that, at the 95th MPEG meeting, a workplan can be created that coordinates a
listening test done by interested companies in the MPEG Audio Subgroup that will benchmark the
performance of the JAME software encoder against the Reference Quality Encoder.
Additional bandwidth extension
Henney Oh, Yonsei University, presented
m18428
Yonsei Crosscheck listening test report on
Jeongook Song, Eunwoo
additional bandwidth extension CE for USAC
Song, Henney Oh
The contribution presents listening test results for USAC operating at 8 kb/s for mono signals. It
compares RM7+CE and RM7. For absolute scores there were no differences at the 95% level of
significance. When analyzing differential scores, 3 items were worse and the mean was worse at the
95% level of significance assuming a Normal distribution. When analyzed with Student-t
distribution, 1 worse mean worse at the 95% level of significance.
David Virette, Huawei, presented
m18472
Progress report on additional bandwidth
David Virette, Wei Xiao
extension CE for USAC at low bit rates
The contribution gave an overview of the CE technology. This experiment had a bitrate of 8 kb/s
and a sampling rate of 19200 Hz. The technology extends the bandwidth up to 9.6 kHz. Two
architecutes were presented: one is “outside” of the SBR framework, which is appropriate for SBR
extension that covers nearly up to the Nyquist frequency (typically when the internal sampling rate
is adjusted lower for coding efficiency). The other is “inside” the SBR framework, which is
appropriate when the SBR extension is far short of the Nyquist frequency.
The contribution presents results of a listening test of RM7+CE and RM7. For absolute scores there
were no differences at the 95% level of significance. When analyzing differential scores, 1 item was
at the 95% level of significance
Anecdotally, the presenter noted that listeners had to weigh the perceived effects of greater
bandwidth and additional noise.
Next steps for this CE will be supported by the USAC CE Workplan.
Increased structural flexibility in SBR
278
Max Neuendorf, FhG, presented
m18432
Status update of CE proposal on increased
structural flexibility in SBR
Stephan Wilde, Markus
Multrus, Max Neuendorf,
Kristofer Kjörling, Heiko
Purnhagen
The contribution presented an overview of the proposed technology. The basic idea is to extend the
SBR functionality from 1:2 extension to 1:4 extension. The 1:4 extension mode is appropriate for
very low bit rate systems (e.g. 12 kb/s) which may have very low internal sampling rates in the
encoder/decoder framework. The 1:4 extension is achieved by using only 16 of the existing 64
bands of the QMF filterbank for the core coder signal. It notes that a similar technology has already
been specified in the DRM system.
The contribution also presents results of a listening test for RM7+CE versus RM7 for the 8 kb/s
mono operating mode. For absolute scores, 1 item is better and the mean is better. For differential
scores, 9 items are better and the mean is better. The CE technology brings 4 MUSHRA points
improvement for the mean and up to 7 points for Music3.
The presenter noted that RM7 and RM7+CE both uses an core coder sampling rate of 9.6 kHz, so
that the RM7 output sampling rate is 19.2 kHz and the RM7+CE output sampling rate is 2*19.2
kHz.
The presenter expects to bring a complete CE proposal to the 96th MPEG meeting.
Enhanced long term predictor (eLTP)
Henney Oh, Yonsei University, presented
m18430
Updated CE on enhanced long term predictor
Jeongook Song ,Henney
(eLTP) for USAC
Oh ,Hong-Goo Kang
The contribution presents an enhanced functionality, in that the eLTP tool now operates for all
USAC core coding modes.
The Audio Subgroup looks forward to more information at the next meeting.
TCX windowing
Seungkwon Beack, ETRI, presented
m18234
Progress report on the TCX windowing CE
Taejin Lee, Seungkwon Beack,
for USAC
Minje Kim, Kyeongok Kang
The contribution noted that there was not sufficient time in the AhG period to execute all aspects of
the workplan, so that there are no new results to report at this meeting.
The Audio Subgroup looks forward to more information at the next meeting.
Time warping
Zhao Dan, Panasonic, presented
m18419
Status report of time warping CE in USAC
Zhao Dan, Zhong Haishan,
Chong Kok Seng, Takeshi
Norimatsu, Tomokazu
Ishikawa, Neo Sua Hong
The contribution presented an overview of the time warping technology.
It also presented results of a listening test for the operating modes 24 kb/s mono and 20 kb/s mono.
Systems under test were:
 No_TW (WD6 no time warping)
 WDQ (WD6 time warping forced to be active)
 TW_CE (WD6+CE)
Statistic
Mode
Performance (differential
score analysis)
279
TW_CE –
No_TW
TW_CE –
WDQ
TW_CE –
No_TW
TW_CE –
WDQ
24
kb/s
24
kb/s
20
kb/s
20
kb/s
3 items better, mean better
3 items better, mean better
8 better, mean better
4 items better, mean better
The contribution notes that the CE technology provides some bit savings (up to 2%) and also
accommodates a higher dynamic range in the time warping parameter (via Huffman compression).
The Audio Subgroup looks forward to more information at the next meeting.
Pulse Indexing
Toru Chinen, Sony, presented
m18396
Sony listening test report on pulse indexing in
Toru Chinen, Masayuki
USAC
Nishiguchi
The contribution presents results of a listening test which evaluated the RM7+CE versus RM7 at 20
kb/s for mono signals. All 15 test items were clean speech. It showed no differences when analyzing
either absolute or differential scores at the 95% level of significance using Student-t distribution.
David Virette, Huawei, presented
m18469
Finalization of Enhanced Pulse Indexing CE
David Virette, Dejun Zhang,
for ACELP in USAC
Fuwei Ma
The contribution summarizes the Enhanced Pulse Indexing CE technology. A theoretical analysis
showed that it is able to provide the following additional compression efficiency:
ACELP
Bit savings
Mode
per frame
18k4
8 or 16
16k4
8 to 16
14k4
4 to 8
The contribution also presents the results of a listening test. When analyzing absolute scores, there
were no differences at the 95% level of significance. When analyzing differential scores, 2 items
were better at the 95% level of significance using Normal distribution.
The Sony and Huawei listening data were pooled and an analysis of the differential scores showed 1
item better (nadib2) and 1 worse (nadib1).
This was discussed later in the MPEG week (see later in this section).
PVC for SBR
Takeshi Norimatsu, Panasonic, presented
m18361
Panasonic cross check report on PVC for SBR
Takeshi Norimatsu, Tomokazu
envelope coding in USAC
Ishikawa, Haishan Zhong, Dan
Zhao, Kok Seng Chong
The contribution gave results of a listening test which evaluated the RM7+CE versus RM7 at 12
kb/s for mono signals. It showed no differences when analyzing absolute scores at the 95% level of
significance using Normal distribution. When analyzing differential scores, 3 items were better at
the 95% level of significance using Normal distribution, 2 items better when using the Student-t
distribution.
The contribution also reports on verification of the RM7+CE decoder: all listening test bitstreams
were decoded to produce exactly the WAV files used in the listening test.
Heiko Purnhagen, Dolby, presented
m18374
Dolby listening test results for CE on PVC for
Heiko Purnhagen, Kristofer
SBR in USAC
Kjörling
280
The contribution gave results of a listening test which evaluated the RM7+CE versus RM7 at 20
kb/s for mono signals. It showed no differences when analyzing absolute scores at the 95% level of
significance using Normal distribution. When analyzing differential scores, 2 items were better and
1 worse at the 95% level of significance using Normal distribution, 1 items better and 1 worse when
using the Student-t distribution.
Markus Multrus, FhG, presented
m18455
FhG Listening Test Report – PVC for SBR
Frederik Nagel
envelope coding
The contribution gave results of a listening test which evaluated the RM7+CE versus RM7 at 20
kb/s for mono signals. It showed no differences when analyzing absolute scores at the 95% level of
significance using Normal distribution. When analyzing differential scores, 3 items were better and
2 worse at the 95% level of significance using Normal distribution and 2 better, 2 worse if Studentt distribution is used.
Toru Chinen, Sony, presented
m18399
Report on PVC CE for SBR in USAC
Toru Chinen, Yuki Yamamoto,
Mitsuyuki Hatanaka, Masayuki
Nishiguchi
The contribution gave an overview of the Predictive Vector Coding (PVC). The CE aims at
reducing encoding bit rate and increasing the subjective quality by modifying the delta coding
scheme of SBR envelope scalefactors. The proposed scheme is based on the prediction of SBR
envelope scalefactors by using the energy of QMF subband samples below SBR range. The indices
of prediction coefficient matrices are coded by using vector quantization.
PVC increases the complexity of the SBR tool by 0.4 WMOPS (peak load) and requires 1096 bytes
of additional table storage. It provides 1.44% additional compression efficiency as averaged over all
CfP test items (as compared to RM7 reference bit streams).
The contribution gave results of a listening test at Sony. When analyzing differential scores, 6 items
were better and the mean was better.
When all data is pooled, 5 items are better and the mean is better. Of these, 1 item shows an
improvement of 8 MUSHRA points, 2 items have show an improvement of 5 points. When looking
at individual test site results, 2 items are better for 2 of 4 sites, 2 items are better for 3 of 4 sites.
This was discussed later in the MPEG week (see later in this section).
Tonal component coding in eSBR
Marek Domanski, Poznan University, presented
m18532
Telcordia and PUT listening test results for
Tomasz Zernicki ,Maciej
CE on improved tonal component coding in
Bartkowiak ,Marek Domanski
eSBR (USAC)
The contribution reviewed the CE technology. It proposes a separate sinusoidal encoder/decoder
and separate bit stream of sinusoidal model parameters, as shown in the following figure. The
sinusoidal model parameter bit stream is typically 2 kb/s. The sinusoidal modelling is only run for
the SBR high-band region.
281
It presents results of a listening test at 16 kb/s and 20 kb/s. When analyzing differential scores of
RM7+CE – RM7, at
 16 kb/s 3 of the new items with many high-frequency tonal components showed
improvement (e.g. saxophone, violin, accordion). None of the CfP items showed any
degradation.
 20 kb/s 4 of the new items showed improvement.
The presenter clarified that the RM7 encoder was the MPEG Reference Encoder compiled and run
on the CfP items and the new CE items.
As a next step, it was agreed to draft a workplan in which the CE listening test experiment is
repeated, but using the Reference Quality Encoder.
New CEs
New excitation coding for LPD mode
Roch Lefebvre, VoiceAge, presented
m18481
Proposed CE for extending the LPD mode in
Bruno Bessette ,Philippe
USAC
Gournay, Roch Lefebvre
The contribution proposes to add an additional excitation coding method for use in ACEP coding
mode.
Memory for next frame
adaptive codebook
decoded
pitch
parameter
s
shift for next frame
new
decoded
prequantizer
parameter
s
decoded
ACELP
parameter
s
excitation
signal
LPC
Synthesis
filter
inverse
DCT
AVQ
decoder
…
ACELP codebook
The additional tool gives the following advantages:
 Reduce the time and frequency dynamice of the residual (LPC excitation)
 Better control of bit allocation for LPC excitation
 More explicit control of coding noise in ACELP frame.
 Re-use existing tools
Achieve higher quality for the ACELP coding mode at higher bit rates (i.e. above 32 kb/s)
The contribution presents the results of a listening test for stereo signals. The following Systems or
Codec under Test (CuT) were tested:
 USAC WD6 at 32 kbps stereo (switched FD/LPD)
 Modified USAC WD6 (CuT) at 48 kbps stereo (LPD only)
 Modified USAC WD6 (CuT) at 64 kbps stereo (LPD only)
 USAC WD6 at 64 kbps stereo (FD only)
282
The results show that for the differential statistics
 (CuT at 48 kbps – USAC WD6 at 32 kbps) – all items and the mean showed improvement at
the 95% level of significance.
 (CuT at 64 kbps – USAC WD6 at 64 kbps) – 5 better, 1 worse, mean better, at the 95% level
of significance.
The proponent expects to bring a complete CE to the next meeting. It was agreed to draft a
workplan to coordinate cross-check efforts.
Enhanced Performance at Mid Bitrates
Markus Multrus, FhG, presented
m18479
Proposed Core-Experiment on Enhancing the
Markus Multrus, Philippe
USAC Codec at Mid Bitrates
Gournay, Nikolaus Rettelbach,
Bruno Bessette, Bernhard Grill
The contribution notes that USAC can switch between a block-based transform coder and a LP
coder. The transform coder has a fixed block length and cannot adjust its time resolution (e.g. via
internal sampling rate conversion) when switching to LP mode must be accommodated. It further
notes that temporal resolution can be achieved via either of two means:
 Shorten transform frame size
 Increase sampling rate
The proposed technology used both methods to achieve higher time resolution at mid bit rates (e.g.
24 kb/s). The proposed new operating mode is shown in the following figure:
USAC, proposed extra setting:
f = 3/8 fout
f =3/8 fout
Core-coder
framelength:
Analysis
QMF
24 bands
f = f out
SBR
Upsamling
Factor 8/3
768 samples
f = f out
MPS
Decoder
f = f out
Synthesis
QMF
64 bands
2048 samples
The computational complexity of the proposed system is comparable to a system run at 44.1 kHz
sampling rate (with a 1:2 SBR configuration). The memory complexity, comprised of tables for the
new transform block lengths, is approximately 960 32-bit works.
The contribution presents results of a listening test of 24 kb/s mono. When differential scores are
analyzed: (RM7+CE – RM7), 4 items are better at the 95% level of significance.
The presenter requests that a workplan that will specify an experimental setup and coordinate crosschecks, and expects to bring a complete CE to the next meeting.
Improved stereo coding
David Virette, Huawei, presented
m18474
CE proposal on improved stereo coding at
David Virette
low bit rates
The contribution describes the problem in USAC stereo coding that is addressed by the CE
technology. For low bit rate stereo coding, when only CLD and ICC parameters are used, the upmix
matrix may have a wrong behaviour for negative ICC parameters. The proposed technology uses an
IPD estimation to better reconstruct the stereo signal in case of negative ICC.
The contribution reports the results of a listening test at 20 kb/s stereo. With differential analysis of
the statistic (H6+CE – H6), there is 1 item better and mean better at the 95% level of significance,
where “H6” is Huawei’s internal USAC encoder which employs the same the tools as in RM6.
Werner Oomen, Philis, noted that there are already means used in USAC to neutralize the antiphase, and it might be interesting to compare the CE technology with current USAC tools.
283
The presenter requests that a workplan that will specify an experimental setup and coordinate crosschecks, and expects to bring a complete CE to the next meeting.
Lower-Complexity Decorrelator
Julien Robilliard, FhG, presented
m18433
Proposed decorrelator improvements in
Julien Robilliard, Matthias
USAC
Neusinger, Johannes Hilpert,
Erik Schuijers, Bert den
Brinker, Werner Oomen
The contribution presented results of an investigation on reducing the complexity of the MPS 212
decorrelator without compromising audio quality. A complexity analysis of the current and
proposed decorrelator is shown in the table below. The main point is that the decorrelator
complexity is reduced by more than 50%, and that the complete decoder complexity is reduced by
nearly 15%.
Complexity measure
(MOPS)
RM8
Complexity
Proposed filter filter
reduction
Decorrelator
tool
1.7
3.6
52.8%
Complete
USAC decoder
11.2
13.1
14.5%
The contribution presents results of a listening test for 32 kb/s stereo, showing no significant
differences for any item at the 95% level of significance.
The presenter requests that a workplan that will specify an experimental setup and coordinate crosschecks, and expects to bring a complete CE to the next meeting.
Improved applause coding
Julien Robilliard, FhG, presented
m18413
CE proposal on improved applause coding in
Achim Kuntz, Sascha Disch,
USAC
Erik Schuijers, Werner Oomen
The contribution is a joint proposal from FhG-IIS and Philips on Transient Steering Decorrelator
(TSD). It notes that the parametric stereo technology inherited from MPEG Surround has a number
of shortcomings:
 Narrowed sound stage
 Lack of envelopment
 Sound coloration
The solution proposed it to add a block that separates out transient components and passes them
through a new block especially adapted for decorrelating transient signals. This is shown in the
following block diagram:
284
Results of a listening test at 32 kb/s stereo were presented. Analysis of absolute MUSHRA scores
showed that 3 “applause” items showed significant improvement (more than 10 MUSHRA points)
and the 95% level of significance. Analysis of differential scores showed that all items are improved
at the 95% level of significance.
The presenter requests that a workplan that will specify an experimental setup and coordinate crosschecks, and expects to bring a complete CE to the next meeting.
Enhanced Mode Transitions
Sangoh Jeong, LG, presented
m18393
Core Experiment on Enhanced Mode
Kiho Cho, Nam Soo Kim,
Transitions in USAC
Sungyong Yoon, Sangoh
Jeong
The contribution proposes a technology for transitions between TCX and ACELP frames and from
FD to ACELP frames. The main idea is shown in the following figures.
It notes that the proposed technology can save approximately 1% of the RM7 bitrate, depending on
the operating mode. The complexity of the proposal is less than the FAC operation.
The results of a listening test are reported for 20 kb/s mono and 12 kb/s mono. It shows no
differences for either absolute scores or differential scores.
Philippe Gournay, VoiceAge, noted that the proposed technology has two potential shortcomings:
 Zero the filter state on the MDCT (i.e. TXC, FD) to ACELP transition
285

Base the TDAC on ACELP encoded signal
Max Neuendorf, FhG, agreed with Philippe Gournay in that FhG experimented with a LP filter state
reset and found that it had a negative impact on audio quality. He further noted that the adaptive
codebook in ACELP assumes a “time forward” sequence in the residual, while the folding operation
results in a “time reversed” interval that will not be efficiently encoded.
The presenter requests that a workplan that will specify an experimental setup and coordinate crosschecks, and expects to bring a complete CE to the next meeting.
Unified global gain syntax
Max Neuendorf, FhG, presented
m18450
Proposed unified global gain syntax element
Guillaume Fuchs, Markus
in USAC
Multrus, Max Neuendorf
The contribution proposes to modify syntax elements of the USAC bit stream to support a means to
change the of the USAC output level via changing a single global gain element.
The current gain elements are shown here:
Mode
Gain element
FD
8-bit “Global gain” per 1024 length blocks
TCX
7-bit “TXC global gain” per 256 to 1024 length blocks
ACELP
2-bit “Mean energy” per 256 length block
SBR
Codes absolute envelope energy. (This does not automatically scale when
global gain of core coder changes.)
The proposed modifications are shown here:
Mode
Gain element
FD
Stays the same: 8-bit “Global gain” per 1024 length
blocks
TCX
1 bit refinement: 7-bit “TXC global gain” per 256 to
1024 length blocks
ACELP
Remove “Mean energy”
SBR
Code absolute envelope energy realative to the core
codec energy.
This is shown graphically in the following figure:
286
Since the proposed change is not “noiseless,” the contribution presents the results of a listening test
at 16 kb/s mono. One system under test was the proposed technology, another was the proposed
technology, but decreased in level via the 8-bit Global gain element and then increased in level in
the decoded PCM domain. When examining absolute MUSHRA scores, there is no difference at the
95% level of significance. When examining differential scores, there is no difference at the 95%
level of significance.
Heiko Purnhagen, Dolby, asked what requirement is addressed with the proposal. In particular, he
noted that the SBR high-band level adjustment is only responsive to an SBR header. Hence,
adjustments in global gain will not impact the SBR high-band until a SBR header occurs in the
bitstream.
The presenter requests that a workplan that will specify an experimental setup and coordinate crosschecks, and expects to bring a complete CE to the next meeting.
Additional discussion on PVC for SBR
Toru Chinen, Sony, made a presentation on a proposed workplan. This was discussed and it was the
consensus of the Audio subgroup draft a workplan to execute the points:
 Gather new evidence based on RM7 code base.
 Bring evidence of performance, as a listening test, at one additional operating point, e.g.
8kb/s.
 If the PVC bit stream and decoded output does not change for the 12 kb/s operating point,
then that test does not have to be repeated.
Additional discussion on T/F domain post processing
David Virette, Huawei, made a presentation on the performance of T/F domain post processing at 8
kb/s and 12 kb/s mono. The new information pertains to calculating the 95% confidence intervals
using the Student-t distribution. In addition, he presented information on the complexity of the two
modules proposed in the CE.
He presented a comparison of the role of the eTES processing and the T/F post processing. He
noted that
 The eTES processing sharply adjusted the time-varing energy envelope
 The T/F domain post processing applies a smooth emphasis to the spectral peaks and smooth
de-emphasis to the spectral valleys, as in classical speech post processing.
He presented a graphic showing per-item listening test results, and the extent that the two modules
(flattening and post-processing) were active in the frames of each item.
He concluded by noting that post-filtering is well known in the speech coding community and has
been shown to provide performance improvements at low bit rates.
Finally, he requests that the post-processing tool be incorporated, and to rely on the Dolby
improved SBR envelope flattening to perform the spectral flattening function.
There was considerable discussion. In particular, it was noted that it is not clear
 Which flattening technique is best (CE from Dolby or CE from Huawei)
 Whether the T/F post processing provides merit in addition to the spectral flattening
The workplan will conduct additional listening tests at 12 kb/s mono and 8 kb/s mono operating
points. The systems under test will be:
1. RM7+iSBR
2. RM7+iSBR+T/Fpp
3. RM7+CEfull (flattening always active)
where
 iSBR is “Improved SBR” technology, which flattens the high-band spectral envelope
287


T/Fpp is the Time/Frequency post processing filter
CEfull is both the spectral flattening module and the Time/Frequency post processing
module proposed in the CE
In system 3, above, the spectral flattening module will be tuned so that it is active for all content
categories, namely speech, speech+music and music.
Based on the MUSHRA scores, the following difference statistics would be analyzed (where
numbers refer to systems, as labelled above). For each diff score, if there is significant improvement,
then an action is indicated.
Diff
Scor
e
2-1
3-2
Conclusion if significant positive value
Incorporate iSBR + T/Fpp
Replace iSBR with CEflattening and
incorporate T/Fpp
3-1
Additional discussion on QMF transposer
KK, Dolby, made a presentation on next steps on QMF transposer. The decisions are
 Keep both QMF and FFT transposer
 Replace FFT transposer by QMF transposer
The workplan will conduct additional listening tests at 12 kb/s mono and 8 kb/s mono operating
points. The baseline software will be RM7 plus the transposer cross-products technology.
RM8 Reference Software
Markus Multrus, FhG, discussed an issue pertaining to the RM8 reference software. A bug was
discovered in the decoder when it operates at 64 kb/s and with the time warping tool active. The
bug pertained to the window size that was a float, was stored as an int but then used again as a float.
The result was that, on second use, the window size was off by one. This bug resulted in a
maximum difference of 42 in the decoded waveform, although the RMS error was much smaller.
It was the consensus of the Audio Subgroup to:
 Correct the RM8 Reference Software
 Correct the RM8 decoded waveforms
An email to the mpeg-audio-call reflector reported that this update was done during the MPEG
meeting.
Performance of MPEG Reference Encoder Software
The Chair stated that it would be very desirable for the Audio Subgroup to conduct a small-scale
but well controlled listening test to measure the performance of the MPEG Reference Encoder
software. There was good discussion on this proposal and the following issues were identified. It is
believed that the JAME software project represent the best version of the MPEG Reference Encoder
software. The Yonsei experts verified that the JAME software is not encumbered by external (i.e.
non-MPEG) copyright, such that all source code files can have the MPEG copyright header.
The Chair noted that such a listening test could be conducted in April, 2011, in which case it will
not conflict with the USAC Verification Test work, which is envisioned to have listening tests in
June, 2011. A test in the April, 2011 time frame could use the Reference Quality bit streams made
available from the March MPEG meeting, which should be identical or at least very close in
performance to the Reference Quality bit streams used in the USAC Verification Test.
USAC Timeline and CE Logistics
288
The Chair presented a timeline for USAC activities, shown in the table below. This was discussed
and audio experts agreed to this schedule. However, the Chair acknowledged that the new ballot
intervals under which USAC must operate plus DIS editing period and ballot processing could
mean that the DIS ballot results are not available at the 97th meeting, thus making it impossible to
progress to FDIS in July 2010.
Meeting
Date
Activity
94th
Oct
Study on CD
2010
95th
Jan
The default position is that all CEs
2011
must be complete at this meeting.
Exceptions will be discussed on a
case by case basis.
DoC on CD
DIS text
96th
Mar
Workplan for Verification Test
2011
May
Integrate CE reference software into
2011
normative decoder
Jun,
Conduct Verification Test listening
2011
tests and make results available as
Excel spreadsheet
th
97
Jul
DoC on DIS
2011
FDIS text
ProduceVerification Test Report
Sep
IS
2011
Name for USAC specification
The Chair discussed the need for a name for the USAC specification, and expressed a strong
interest in having that name at the 95th MPEG meeting. The Chair suggested the following name
and acronym, and asked audio experts to please come to the next meeting with something much
better:
eXtented Audio Coding – XAC
During the meeting, other experts suggested:
NSA – Next Stage Audio
FEAC – Future Entertainment Audio Coder
FRAC – Full Range Audio Coder
C-AAC – Content Agnostic Audio Coder
MMAC – MPEG Media Audio Coder
5
5.1
Closing Audio Plenary and meeting deliverables
Plenary discussions
There were none.
5.2
Responses to Liaison and NB comments
Liaison statement and NB comment responses generated by Audio were presented and approved.
5.3
Recommendations for final plenary
The Audio recommendations were presented and approved.
289
5.4
Establishment of Ad-hoc Groups
The ad-hoc groups shown in the following table were established by the Audio subgroup. Unless
otherwise indicated, Ad Hoc group meetings will be held at the location of the next MPEG meeting
on the weekend preceding that meeting.
No.
Title
Mtg
11662 AHG on MPEG Surround, SAOC, USAC
0900-1800 on Sunday prior to
meeting
5.5
Approval of output documents
All output documents, shown in 4, were presented in Audio plenary and were approved.
5.6
Press statement
The Audio contribution to the press statement was presented and approved.
5.7
Agenda for next meeting
The agenda for the next MPEG meeting is shown in Annex F.
5.8
All other business
There was none.
5.9
Closing of the meeting
The 94th Audio Subgroup meeting was adjourned Friday at 1:15 pm.
290
1.
Participants
There were 39 participants in the Audio subgroup meeting, as shown in the following table.
First Name
Last Name
Country
Affiliation
Bruno
Bessette
CA
Voiceage Corporation
Ti Eu
Chan
SG
I2R
Toru
Chinen
JP
Sony
Kihyun
Choo
KR
Samsung
Philippe
Gournay
Canada
VoiceAge Corp. / Univ. of Sherbrooke
Oliver
Hellmuth
DE
Fraunhofer IIS
Jeff
Huang
USA
Qualcomm Inc.
Sangoh
Jeong
KR
LG Electronics
Kyeong Ok
Kang
Korea
ETRI
Kei
Kikuiri
JP
NTT DOCOMO
Korea Advanced Institute of Science and
Kwangki
Kim
KR
Technology.
Kristofer
Kjörling
SE
Dolby
Ferenc
Krämer
DE
Dolby
Jaeseong
Lee
KR
Yonsei University
Roch
Lefebvre
Canada
VoiceAge Corp. / Univ. of Sherbrooke
Takehiro
Moriya
JP
NTT
Markus
Multrus
DE
Fraunhofer IIS
Max
Neuendorf
DE
Fraunhofer IIS
Masayuki
Nishiguchi
JP
Sony
Takeshi
Norimatsu
JP
Panasonic
Henney
Oh
KR
Yonsei University
Werner
Oomen
NL
Philips Applied Technologies
Gregory
Pallone
FR
Orange Labs
Hochong
Park
KR
Kwangwoon University
Pierrick
Philippe
FR
Orange Labs
Heiko
Purnhagen
SE
Dolby
Schuyler
Quackenbush USA
ARL
Mohamad
Raad
Australia
RaadTech Consulting
Julien
Robilliard
DE
Fraunhofer IIS
Jeongook
Song
KR
Yonsei University
Leon
Terentiv
DE
Fraunhofer IIS
Kimitaka
Tsutsumi
JP
NTT DOCOMO
David
Virette
DE
Huawei Technologies
Oliver
Wuebbolt
DE
Technicolor
Wei
Xiao
CN
Huawei Technologies
Jianxin
Yan
CN
NELA
Sungyong
Yoon
KR
LG Electronics
Yunxuan
Zhao
CN
Huawei
Hai Shan
Zhong
Singapore Panasonic Singapore Laboratories
291
2.
No
Sunday
1000-1800
m18456
m18077
m18435
m18451
m18467
m18379
Audio Contributions and Schedule
Title
AhG – USAC
Corrections to Reference Software
and CD of USAC
Draft USAC CE Status and Workplan
USAC Ces – possible decisions
VoiceAge listening test results for the
enhanced bass-postfilter CE
FhG listening test report on CE on
improving the USAC bass post-filter
Report on cross-check listening test
for the CE on improved Bass-post
filter for USAC
Finalization of CE on an improved
bass- post filter operation for the
ACELP of USAC
m18360
Panasonic cross check report on
complexity reduction for time warping
in USAC
m18375
Dolby listening test results for CE on
complexity reduction for time warping
in USAC
Completion of the Core Experiment
on Reducing the Complexity of the
USAC Time Warping
Sony listening test report on improved
SBR in USAC
FhG Listening Test Report –
improved SBR
Finalization of CE on improved SBR
m18452
m18398
m18431
m18378
1300-1400
m18372
m18386
m18466
Lunch
Crosscheck report on harmonic
transposer Ces
Finalization of CE on QMF based
harmonic transposer
NTT DOCOMO Cross-check Report
on Improved Harmonic Transposer in
292
Authors
X
Max Neuendorf
X
S. Quackenbush
X
Philippe Gournay ,Roch
Lefebvre
Guillaume Fuchs, Max
Neuendorf
David Virette
X
Barbara Resch, Leif
Sehlström, Heiko
Purnhagen, Lars
Villemoes, Kristofer
Kjörling, Bruno Bessette,
Philippe Gournay
Takeshi Norimatsu,
Tomokazu Ishikawa,
Haishan Zhong, Dan
Zhao, Kok Seng Chong
Heiko Purnhagen,
Kristofer Kjörling
X
Stefan Bayer
X
Toru Chinen, Masayuki
Nishiguchi
Stephan Wilde, Max
Neuendorf
Kristofer Kjörling, Leif
Sehlström
X
Kihyun Choo, Miyoung
Kim, Eunmi Oh
Haishan Zhong, Kok
Seng Chong, Takeshi
Norimatsu, Tomokazu
Ishikawa, Lars Villemoes,
Per Ekstrand, Kristofer
Kjörling, Stephan Wilde,
Sascha Disch, Frederik
Nagel, Max Neuendorf ,
Kimitaka Tsutsumi, Kei
Kikuiri, Nobuhiko Naka
X
X
X
X
X
X
X
X
X
m18501
m18459
m18468
m18384
USAC
Panasonic crosscheck report on
improved harmonic transposer
Crosscheck listening test report for
USAC on FFT and QMF harmonic
transposers
Report on cross-check listening test
for the Ces on QMF based harmonic
transposer and improved harmonic
transposer in USAC
Finalization of CE on improved
harmonic transposer in USAC
m18389
Overview of performance of
transposer proposals, and suggested
decoding modes
m18371
Crosscheck report on adaptive T/F
domain post-processing for USAC
Dolby listening test results for CE on
T/F post-processing in USAC
Crosscheck listening test report for
USAC on time frequency domain
post-processing
Finalization of CE on adaptive T/F
domain post-processing for USAC
Review of AhG report and plenary
presentation
m18373
m18461
m18471
Monday
0900-1300
1300-1400
1400-1500
MPEG Plenary
Lunch
Audio Plenary
Welcome and Remarks
Report on Sunday Chairs meeting
Review main tasks for the week
SAOC Verification Test
USAC
Ongoing and new CEs
Verification Test
USAC Reference Encoder
Maintenance
New work
293
Zhong Haishan, Chong
Kok Seng, Zhao Dan,
Takeshi Norimatsu,
Tomokazu Ishikawa, Neo
Sua Hong
Jeff Huang
X
David Virette
X
Lars Villemoes, Per
Ekstrand, Sascha Disch,
Frederik Nagel
Kristofer Kjörling,
Haishan Zhong, Kok
Seng Chong, Takeshi
Norimatsu, Tomokazu
Ishikawa, Lars Villemoes,
Per Ekstrand, Stephan
Wilde, Sascha Disch,
Frederik Nagel, Max
Neuendorf ,
Kihyun Choo, Miyoung
Kim, Eunmi Oh
Heiko Purnhagen,
Kristofer Kjörling
Jeff Huang
X
David Virette, Wei Xiao
X
X
X
X
X
X
X
m17971
m17972
m18076
m18454
m18473
m18434
m18480
m18470
m18429
1630-1800
m18428
m18472
m18432
m18430
m18234
General
Ad Hoc on Audio Standards
Maintenance
Ad Hoc on SAOC, USAC, and Audio
and Systems Interactions
93rd MPEG Audio Subgroup Report
USAC – AhG Recommendations
USAC – CE Integration Phase
Informative Encoder Description for
USAC Improved Noiseless Coding
CE
Educational Information on Encoder
Implementation of inter-TES tool in
USAC
USAC
Report on USAC performance
Comments on further USAC
investigation
USAC Reference Encoder
A new signal classifier for USAC
reference encoder
Status report on USAC Reference
Software JAME
USAC CE progress reports
Yonsei Crosscheck listening test
report on additional bandwidth
extension CE for USAC
Progress report on additional
bandwidth extension CE for USAC at
low bit rates
Status update of CE proposal on
increased structural flexibility in SBR
Updated CE on enhanced long term
predictor (eLTP) for USAC
Progress report on the TCX
windowing CE for USAC
m18419
Status report of time warping CE in
USAC
1800-
HOD
Tuesday
294
S. Quackenbush
X
S. Quackenbush
X
S. Quackenbush
X
X
Vignesh Subbaraman,
Markus Multrus, Kihyun
Choo
Kei Kikuiri, Atsushi
Yamaguchi, Nobuhiko
Naka
X
Gregory Pallone, Pierrick
Philippe
Max Neuendorf, Markus
Multrus, Stefan Doehla,
Werner Oomen, Heiko
Purnhagen
X
David Virette, Lijing Xu,
Wei Xiao
Jeongook Song ,Henney
Oh ,Hong-Goo Kang
X
Jeongook Song ,Eunwoo
Song, Henney Oh
X
David Virette, Wei Xiao
X
Stephan Wilde, Markus
Multrus, Max Neuendorf,
Kristofer Kjörling, Heiko
Purnhagen
Jeongook Song ,Henney
Oh ,Hong-Goo Kang
Taejin Lee, Seungkwon
Beack, Minje Kim,
Kyeongok Kang
Zhao Dan, Zhong
Haishan, Chong Kok
Seng, Takeshi Norimatsu,
Tomokazu Ishikawa, Neo
Sua Hong
X
X
X
X
X
X
X
0900-1300
m18396
m18469
m18361
m18374
m18455
m18399
m18532
1300-1400
1400-1500
m18406
m18426
USAC – Ongoing CEs
Sony listening test report on pulse
indexing in USAC
Finalization of Enhanced Pulse
Indexing CE for ACELP in USAC
Panasonic cross check report on PVC
for SBR envelope coding in USAC
Dolby listening test results for CE on
PVC for SBR in USAC
FhG Listening Test Report – PVC for
SBR envelope coding
Report on PVC CE for SBR in USAC
Telcordia and PUT listening test
results for CE on improved tonal
component coding in eSBR (USAC)
Lunch
MPEG-2, MPEG-4, MPEG
Surround, SAOC
Proposed addition to Intensity Stereo
in ISO/IEC 14496-26:2009, Audio
conformance
Additional information on MPEG
Surround conformance testing
m18436
Report on corrections for MPEG
SAOC
m18437
Revised draft of SAOC verification
test report
1500-1800
m18481
USAC – New Ces
Proposed CE for extending the LPD
mode in USAC
Proposed Core-Experiment on
Enhancing the USAC Codec at Mid
m18479
295
Toru Chinen, Masayuki
Nishiguchi
David Virette, Dejun
Zhang, Fuwei Ma
Takeshi Norimatsu,
Tomokazu Ishikawa,
Haishan Zhong, Dan
Zhao, Kok Seng Chong
Heiko Purnhagen,
Kristofer Kjörling
Frederik Nagel
X
Toru Chinen, Yuki
Yamamoto, Mitsuyuki
Hatanaka, Masayuki
Nishiguchi
Tomasz Zernicki ,Maciej
Bartkowiak ,Marek
Domanski
X
Ferenc Kraemer, Heiko
Purnhagen
X
Andreas Hölzer, Christian
Ertel, Markus Lohwasser,
Michael Härtl, Ferenc
Kraemer, Frans de Bont
Jonas Engdegård, Heiko
Purnhagen, Oliver
Hellmuth, Jürgen Herre,
Cornelia Falch, Leon
Terentiv, Maria Luis
Valero, Johannes Hilpert,
Andreas Hölzer, Werner
Oomen
Jonas Engdegård, Heiko
Purnhagen, Oliver
Hellmuth, Jürgen Herre,
Cornelia Falch, Leon
Terentiv, Maria Luis
Valero, Johannes Hilpert,
Andreas Hölzer, Werner
Oomen
X
Bruno Bessette ,Philippe
Gournay, Roch Lefebvre
Markus Multrus, Philippe
Gournay, Nikolaus
X
X
X
X
X
X
X
X
X
Bitrates
m18474
m18433
1800Wednesday
0900-1100
1130-1200
m18413
CE proposal on improved stereo
coding at low bit rates
Proposed decorrelator improvements
in USAC
MPEG Plenary
Break-out on PVC
Break-out on Sinusoidal Coding
CE proposal on improved applause
coding in USAC
Core Experiment on Enhanced Mode
Transitions in USAC
m18450
Proposed unified global gain syntax
element in USAC
Lunch
Joint meeting with Requirements at
Audio for HEVC, at Audio
Joint Type I licensing
USAC Status and Workplan
preparation
Social:
Meet at main entrance, Bldg 3
(HOD, Chair’s bus leaves at 1650)
Bus leaves at 1700
1500-1630
1500-1600
1630-
Thursday
0900-
1000-1100
1300-1400
1500-1600
14001800Friday
0900-1300
X
Julien Robilliard,
Matthias Neusinger,
Johannes Hilpert, Erik
Schuijers, Bert den
Brinker, Werner Oomen
X
Achim Kuntz, Sascha
Disch, Erik Schuijers,
Werner Oomen
Kiho Cho, Nam Soo Kim,
Sungyong Yoon, Sangoh
Jeong
Guillaume Fuchs, Markus
Multrus, Max Neuendorf
X
Chairs
m18393
1300-1400
1400-1500
Rettelbach, Bruno
Bessette, Bernhard Grill
David Virette
USAC timeline and CE interaction
USAC Status and Workplan
USAC name
Type I Licensing
Lunch
Type I Licensing
USAC Status and Workplan
Resolutions
Chairs
Audio Plenary
Report on Thursday Chairs meeting
Recommendations for final plenary
Establishment Ad-hoc groups and
296
X
X
1000
1030
1300-1400
1400-
review AhG Mandates
Get document numbers
Submit AhG Mandates and
Resolutions
Approve Responses to NB comments
and Liaison
Approval of output documents:
Title: N10xx (short title).doc (NOT
*.docx!)
File w10xx.zip
Agenda for next meeting
Review of Audio presentation to
MPEG plenary
A.O.B.
Closing of the Audio meeting
Lunch
MPEG Plenary
297
3.
Task Groups
1. MPEG-2 and MPEG-4 Audio, MPEG Audio Conformance, MPEG reference software,
MPEG Surround
2. MPEG-D Spatial Audio Object Coding
3. MPEG-D Unified Speech and Audio Coding
4. Audio Exploration: Audio for HEVC
298
4.
No.
11654
11655
11656
11657
11658
11659
11660
11661
5.
Output Documents
Title
13818-4 – Conformance testing
ISO/IEC 13818-4:2004 DCOR 2, AAC Intensity Stereo
14496-26 – Audio Conformance
ISO/IEC14496-26:2010/Cor 2 BSAC Corrections
ISO/IEC 14496-26:2010/DCOR 3, Intensity Stereo
23003-2 – SAOC
Report on Spatial Audio Object Coding Verification Tests
Defect Report of ISO/IEC 23003-2:2010, Spatial Audio Object
Coding
23003-3 – Unified Speech and Audio Coding
Study on ISO/IEC 23003-3:201x/CD of Unified Speech and Audio
Coding
Status and Workplan for USAC CEs
Exploration – Future Audio
Audio for HEVC
Agenda for the 95th MPEG Audio Meeting
Agenda Item
1. Opening of the meeting
2. Administrative matters
2.1. Communications from the Chair
2.2. Approval of agenda and allocation of contributions
2.3. Review of task groups and mandates
2.4. Approval of previous meeting report
2.5. Review of AhG reports
2.6. Joint meetings
2.7. Received national body comments and liaison matters
3. Plenary issues
4. Task group activities
4.1. MPEG-1, MPEG-2, MPEG-4, and MPEG Surround
4.2. Spatial Audio Object Coding
4.3. Unified Speech and Audio Coding
4.4. Exploration: Audio for HEVC
5. Discussion of unallocated contributions
6. Meeting deliverables
6.1. Responses to Liaison and NB comments
6.2. Recommendations for final plenary
6.3. Establishment of new Ad-hoc groups
6.4. Approval of output documents
6.5. Press statement
7. Future activities
8. Agenda for next meeting
9. A.O.B
10. Closing of the meeting
299
TBP Available
No
10/10/15
No
No
10/10/15
10/10/15
YES 10/11/15
No
10/11/15
No
10/11/15
No
10/10/15
No
10/10/15
Annex J – 3DG report
Source: Marius Preda, Chair
1
Opening of the meeting
1.1
Approval of the agenda
The agenda is approved.
1.2
Goals for the week
The goals of this week are:











Discuss the software and conformance status for SC-3DMC in 3DGCM (MP25 AMD1)
Status of conformance for MP25 AMD1
Review contributions related to 3D graphics engines in MXM
Review contributions related to 3D graphics aspects of MPEG-V
Review contributions related to Representation of meshes with multiple attributes
Review contributions related to Multi-resolution 3D mesh coding
Review RGC (reconfigurable graphics codec) contribution
Investigate future developments of MPEG 3D Graphics Compression
Review Liaisons
Review the votes
Web-site
1.3
Standards from 3DGC
Std Pt Edit. Project Description
4
5 2001 Amd.26 RS for scalable
complexity 3D mesh
coding in 3DG
compression model
4 16 201x Amd.1 Efficient
representation of 3D
meshes with multiple
attributes
4 16 201x Amd.2 Multiresolution mesh
coding
4 25 2009 Amd.1 Scalable complexity
3D mesh coding for
3DG compression
model
4 27 2009 Amd.3 Conformance for
scalable complexity
3D mesh coding in
3DG compression
300
CfP
WD
CD FCD FDIS Gr.
09/10 10/04 11/01 3
09/10 10/07 11/01 11/07
3
10/07 11/01 11/07 12/01
3
09/07 09/10 10/04 10/10
3
10/01 10/04 11/01
3
model
1.4
Room allocation
3DGC:
3415
301
1.5
Allocation of contributions
Note: due to the absence of several delegates the discussions took place by email and audio conferences. Several contributions (indicated in the table
below) were postponed for being discussed during the next meeting.
N°
D1
Title
Schedule
Monday
D1
09:00~11:30
13:00~14:00
MPEG Plenary
Lunch Break
3DG Plenary
Roll call, Agenda, Goals, FAQ, Awareness Event etc.,
m17969 Report of AHG on 3DGC documents, experiments and software
maintenance
Summary: The IRS is not yet connected to SC3DMC. This should be done
before the next meeting.
AFX 4th edition is in late. Take AP during the week.
Marius Preda
m17970 Ad Hoc Group on Multi-Resolution 3D mesh Coding
Summary: WD1.0 does not contain the integrated schema. Issue on sharing
the source code.
Results of voting
No voting
m18555 Notification of the change of Forum Name of Category C liaison
partner
Resolution: send the information to Yukiko, she updated the Liaison
document
Khaled Mamou, Minsu 14:00~15:00
Ahn, Xiangyang Ji
Performance compression between 3DMeshCodec
(Technicolor) and 3DMCE
A codec that uses the redundancy of repetitive structures and
may bring benefits of up to 92% compression improvement
MultiResolution 3D Resolution: Integrate the codec on the benchmarking platform to
Mesh Coding compare with recent 3DG codecs.
(MR-3DMC) m18424 Initial ideas for texturing support in MR3DMC
Summary: Automatic generate and encode textures in a
multiresolution way. There are two modules: texture adaptation
and skin transplant. Similar as for mesh, a corse texture is
obtained. However, there is a conversion of a texture into a fine
302
Francisco Moran
Marius Preda
Seung Wook Lee,
ChangHwa Lyou,
SangKwon Jeong
Mary-Luc Champel
15:00 –
16:00
David Fuentes
Sánchez , Francisco
Morán Burgos
Observation
tune representation. There is a specific unwrapping method that
has the following proprieties: it preserves the angles (there is no
deformation of the triangles), therefore the unwrapping is unique
and deterministic. The skin transplant allows to choose the pixel
of the coarse texture by projecting each 3D point of the course
mesh (not only the mesh vertices) to the fine mesh and taking the
color from the projection.
Resolution: Continue the investigation, obtain the coarse texture,
investigate on coding of textures.
Revision of WD1.0 Unified Model
The definition of the bitstream structure (this structure is not
supporting the view dependent). The partition consists in spatial
Minsu Ahn
LODs that contain Quality LODs. The header contains
information related to the mesh size (no of cords, normals, … for
the fine resolution one) and quantization steps.
D2
Tuesday
Reconfigurable
Graphics Coding
17:25 –
18:00
D2
m18225, Contribution of 3DMC FUs for RGC framework
Summary: the FUs for 3DMC + definition of TOKENs + design
of networks.
The FUs + def of TOKEN should go in GTL.
Resolution: issue the WD 3.0 for RGC containing the following
sections:
1. GTL: definition of FUs + TOKEN
2. FU network description for different tools (3DMCe)
3. Implementation (attached to the document as source code
as well)
Seungwook Lee, Bon
Ki Koo, Minsoo Park,
Sinwook Lee, Sowon
Kim, Euee S. Jang
m18224, Contribution to RGC tutorial
Summary: The document is based on the existent RVC tutorial.
The XML FU network schema is defined in MPEG B. There are
several tools that allow to specify a decoder.
A codec development process is given as example, proposing to
start from a top level diagram. One idea to design FUs is to start
from the RefSoft. The bitstream parser FU is implementing the
Minsoo Park ,
Seungwook Lee , Bon
Ki Koo , Hyunok Oh ,
Sinwook Lee , Sowon 09:00~10:30
Kim , Taehee Lim ,
Hyungyu Kim , Euee S.
Jang
303
10:30~11:00
Joint with V
entropy decoder. The FUs are described textually. One C/C++
example of a FU is provided.
RSM is the RVC Simulation model.
Case study: 3DMC: the triangle decoder is inside the parser FU.
There are additional 5 algorithmic FUs.
Supporting tools for RGC framework: ORCC, Hopes.
MultiResolution 3D
Mesh Coding
(MR-3DMC)
WD2.0 Supported features and additional needs
4 of 6 requirements are now supported: lossless connectivity,
spatial scalability, quality scalability and near lossless encoding.
The remaining two are view dependent scalability and support
for multi-attributes per vertex.
Resolution: add in the CE the description of what should be
done to investigate the support for the two remaining
requirements.
Minsu Ahn
12:00~14:00
Lunch Break
MPEG-V
11:30 –
12:00
m18449 Proposed changes for validation rules of MPEG-V
Part 7
update the error message, accepted
m18585 Additional input for Sensor / Actuator Integration
use case
Presence sensor and proximity sensor
These are some user characteristics in DIA
(MobilityCharacteristics, Destination).
There is a PlaceType already in MPEG-7 that can be used as a
basis for DevicePositionType.
Resolution: The position sensors can be used and the engine will
make the connection between the sensors and the real users.
There is no need to define the presence and proximity sensors.
AhG mandate: Identify what it can be reused from MPEG-7 and
MPEG-21 DIA for location (this was transformed in a MPEG
304
Markus Waltl ,
Christian Timmerer
Jean H.A. Gelissen ,
Herman Tuininga
14:00-14:10
14:10-15:10
resolution).
FDIS(s) review
FCD for RefSoft and Conformance will be generated by Marcus
from the schemas.
Technical corrections on MPEG-V
A new schema is proposed for VWOEventType.
Biosensors.
Sanghyun Joo
Jae Joon Han
Multi-pointing, Gaze, Wind.
All the three sensors were discussed.
Resolution: add the capabilities for all of them
Descriptions for social network services
Summary: discussions on the need of having user characteristics
in MPEG=V.
Resolution: this is an important issue for MPEG but in the past
both MPEG-7 and MPEG-21 already specified related data
format. Contributions on evaluating them for the scope of
MPEG-V are encouraged.
D3
15:10-15:30
Joint with Req
Sanghyun Joo
Wednesday
D3
09:00~11:00
MPEG Plenary
RGC
17:30 –
18:00
An animated meeting on the RVC/RGC meeting related to the
specification language. The outcome was that there is a need to
document the requirements of such a language.
Issue re-discussed in a meeting hold Friday morning and
documented below.
Marius Preda
11:00 –
13:00
13:00~14:00
Lunch Break
Deadline for announcing the participation: 2010/10/29
Flyer draft at the end of the meeting / Final version in 2 weeks
MPEG-V
with S
14:00 –
15:30
Date: January 27, 2011, 10h00 to 17h00
Demonstrations:
305
Samsung (intelligent camera, bringing sensibility in a virtual
world, 3D demo),
ETRI (motion chair, multi-sensory experience, 3D demo),
GIST (The Haptic Experience)
Technicolor (Dynamic Multiresolution Rendering)
Institut TELECOM (Remote gaming & Virtual worlds, mobile
3D graphics)
Myongji University (Authoring tools of Sensory Effects)
Videos:
(Metaverse1) Virtual Travel & SoundScape
(Metaverse1) Virtual Presence
(Metaverse1) Collaborative Working
(Metaverse1) Dynamic control of Virtual Humans
Face 2 Face virtual chat
Christian’s movie
Workshop Program:
- Introduction (Marius) 10m
- Architecture and use case (Jean) 15m
- Capturing and controlling the real world (Part 2 & 5)
(Kyongro) 20m
- Sensory effects (Part 3) (Christian) 20m
- Interoperability with virtual worlds (Part 4) (Jae Joon)
20m
- Importance of interoperability (invited speaker Yesha,
Metaverse1) 15 m
No registration fee is needed to participate to the meeting. We’ll
have a web page to register the presence.
306
AFX
D4
4th Edition
Only small progress because of lack of time. Homework for the
editors.
Marius Preda
15:30 –
16:00
Marius Preda
9:00~9:30
Thursday
Review all the action points
Output documents:
MPEG-V
FDIS for Part 2, 3, 4, 5, 6 OK
FCD for Part 7 Accepted.
WDs 2.0: (editing period of 4 weeks, keep inside only what exists today)
1.
MPEGV &
MPEG-U
3DG
A way to declare an interface to have communication between the
scene and outside. Regardless on what is inside the scene, an
interface is declared for communication, with the possibility to reply.
The problem formulation: the VO should be able to carry behavior
information from one virtual world to another. This behavior may be
represented by using the scripting language. MPEG-U part 1 is not
the answer to the problem since it is specifying the communication
interface.
2. Why MPEG-U Part 2 is not belonging to MPEG-V?
a. In the use case of gesture recognition, there is a basic sensor
of detecting the feature points (MPEG-V Part 5) followed
by an interpretation of it (such as recognizing a circle), that
is in MPEG-U part 2
Multiresolution WD2.0 review
AP: investigate on the integration between the cascaded quantization and the
bitplane coding of the KLT method.
Finalize during the editing period the QualityLODData class
Editing period of 4 weeks.
Minsu
11:00~12:00
12:00~14:00
Lunch Break
3DG Plenary
9:30~10:00
RGC core experiment edition
(documented in an output document)
14:00~15:00
Review of RGC WD3.0 (2 weeks editing period)
3DG core experiments
D5
15:00~16:00
D5
Friday
307
3DG Plenary
RGC: update of the core experiment description but accepted
almost in the form that was proposed yesterday.
RGC/RVC a document in the requirements that the specification
language should fulfill was presented, discussed and accepted.
The document is registered as an input contribution m18614
“Proposed requirements for RVC and RGC specification
language and solution”
A resolution asking NBs to comment on it was issued.
10:00~11:00
12:00~14:00
14:00~
Lunch Break
MPEG Plenary
308
2
General issues
2.1
General discussion
2.1.1 Reference Software
It is recalled that the source code of both decoder AND encoder should be provided as part of the
Reference Software for all technologies to be adopted in MPEG standards. Moreover, not providing
the complete software for a published technology shall conduct to the removal of the corresponding
technical specification from the standard.
Currently all the AFX tools published in the third edition are supported by both encoder and decoder
implementation.
2.1.2 Web site
OrangeLabs announced interrupting the maintenance of the group web-site. A call for volunteers is
now issued. In the meantime 3DGC contributors are asked to check the web-site and provide
comments on the current version of the web-site.
3
Current Voting
Document title
none
4
General 3DG related activities
4.1
AhG on 3DG activities
DoC
Editor of DoC
Report of AHG on 3DGC documents, experiments and software maintenance
Title
Authors Francisco Moran
Resolution accepted
Ad Hoc Group on Multi-Resolution 3D mesh Coding
Title
Authors Khaled Mamou, Minsu Ahn, Xiangyang Ji
Resolution accepted
309
4.2
Promotions
4.2.1 Web Site
Title
Status of www.mpeg-3dgc.com
Authors
Summary The web site is not more maintained by OrangeLab.
Action Point: Transfer the web-site to other location and call for volunteers for
Resolution
maintenance.
4.3
Joint activities
4.3.1 MPEG-V - Information Exchange with Virtual Worlds (formally Metaverse)
Additional Use
m18253 Cases for
MPEG-V
jean.gelissen@philips.com
Proposal of GPS
Sensor
Capability Type,
Altitude Sensor Kyoungro Yoon , Doohyun
m18025
Capability Type, Kim , Min-Uk Kim
and Global
Position
Capability Type
Additional type
m18507 for Virtual
Object
Multimodel interfaces for Virtual Humans
and behavior markup, propose to adopt
part from BML, examples of how this can
be used,
Second: the TV platform as a Virtual
World I/O device: use of jointSPACE:
remote rendering: the content is streamed
as video. (it is an ETSI initiative as well)
Third: Hybrid communication technology:
Interfacing the virtual world with Internet
world
Resolution: add the use cases in Part 1.
Check is new requirements are needed (for
the next meeting)
These sensors are already in ADM1, here
there are presented the capability type for
each of them.
Resolution: Change the name in
MobileDevicePotisionCapability
Use the bounding box for defining the 3D
space.
Add the VOVisual in the common
characteristic type that may describe the
light, shadow and collision effect. A
complete description of light, shadow and
collision.
Resolution: there is no need to go this
level because these features are
Sanghyun Joo
310
implemented in the graphics engine.
m18509
Add Wheel Button in the mouse. Add the
combination of different input elements.
The “Click definition” is proposed.
Resolution: change the Event schema for
allowing combination of the
Mouse/Keyboard/UserDefinedInput.
Accept the wheel button.
Issue an internal document called
“Technical corrections on MPEG-V”.
Resolution: update the FDIS for adding
the maxOccurent= unbound for the
“choice” in the VWOEventType and
adding keyCode
AMD1: Add the mousewheel events in the
MouseEventCS
Sanghyun Joo,
Minor changes in
Bas du formulaire
MPEG-V Part 4
Update of
Reference SW
m18023
for MPEG-V
Part 2
Bum Suk Choi , Eunseo
Reflects the changes that are done in FDIS
Lee , Jong Hyun Jang ,
Resolution: accepted in a study of FCD
Kyoungro Yoon , Jonghyung
Lee , Ying Ying Chen
Update of
Reference SW
m18024
for MPEG-V
Part 5
Eunseo Lee , Bum Seok
Reflects the changes that are done in FDIS
Choi, Kwang Roh Park,
Resolution: accepted in a study of FCD
Kyoungro Yoon , Jonghyung
Lee , Ying Ying Chen
Proposed
changes for
Markus Waltl , Christian
m18449 validation rules
Timmerer
of MPEG-V Part
7
Proposal of
Header
information for
m18027 binary
transmission of
MPEG-V
information
postponed
A common header for binary
representation for Part 2 to 5. It indicates
that the stream is MPEG-V, the mode, the
parts and the year of the version.
Resolution: to generate a table with all the
types, ordered per Parts and to allocate
Kyoungro Yoon , Soohan
unique identifiers to all the types. The
Kim , Woo Chool Park , Hae types from Part 1 are 1xx, the ones from
Moon Seo
Part 2 are 2xx, …
Accepted for AMD1 for Part 6
TODO: review the binsarisation of the
elements in each part to remove the
identifier that is now present in the
common header.
311
m18252
Robot
Integration
m18352 Path Finding
Data format for
sensed
m18365 information on
Electrograph
sensor
m18377
Tactile
Information
Sensed
Information for
m18410
gas and dust
sensors
acordova @tue.nl ,
jean.gelissen @philips.com
Extension for controlling the robot from
the virtual world. The paper proposes
some new types for controlling the robot.
Resolution: compare the proposed types
with the ones already existent in MPEg-V
and identify what is missing for covering
the functionality. Try to extent to generic
tools for controlling the robots and not
only one robot.
In the application of virtual tourism, it is
needed to compute the path between two
points. The requirement is that MPEG-V
should be able to specify the path.
jean.gelissen @philips.com , We have currently a type called
roland @cs.uu.nl ,
GlobalPositionCommand that can define
real positions. We have to add a possible
name and description for each point as
well as a way to combine several potions
to obtain a trajectory.
SeungJu Han , Jae Joon
Han , Won-Chul Bang ,
James D.K. Kim , SangKyun Kim
The Electrograph sensors have only usage
recommendation in other standards but not
a formalized data format.
The locations for sensors were
investigated because the sensors measure
the electrical activity as differences
between two locations.
Waveform Pattern:
Maximum amplitude of wave
To describe the electrogram it is needed to
specify the waveform, the frequency, the
amplitude, phase, coherence and location
Proposal: label (EEG, ECG, EMG),
placement, timeseries, pattern, unit and
max amplitude
Resolution: prepare an Electrograph base
type and extend for the three types: EEG,
EMG, ECG. Dissociate the raw data from
metadata.
jean.gelissen @philips.com , Editorial points for Part 3
esko.dijk @philips.com
Jae Joon Han , SeungJu
Han , Won-Chul Bang ,
James D.K. Kim , Sang-kyun
Kim , YongSoo Joo
312
A study on the different types of gas that
may be sensed.
A set of types are proposed in a
classification scheme that can be
extended. The unit is ppm but may be also
something else.
DustType is also proposed that extend the
basetype with value and unit.
Resolution: add gas and dust in AMD1.
Add use cases in Part 1.
Definition and
Seong Yong Lim , Jihun
application
m18487
Cha , Injae Lee , Youngdomains of new
kwon Lim , Joong Yun Lee
sensors
Multipointing, Gaze tracking, Wind
sensor, Add in Part 1 some clarifications,
application domain.
Resolution: add in AMD 1 of Part 1
Sensor capability Seong Yong Lim , Jihun
m18488 and preference of Cha , Injae Lee , Youngnew sensors
kwon Lim , Joong Yun Lee
Present the capabilities of the proposed
sensors as composition of existent
capabilities and additional data.
Resolution: accepted partially. Change the
blinking in a duration (ms) and not an
on/off effect. Wind Adaptation preference
not accepted.
Binary
Seong Yong Lim , Jihun
m18489 representation of Cha , Injae Lee , Youngnew sensors
kwon Lim , Joong Yun Lee
Present the binarisation of the proposed
sensors.
Resolution: Change the synthax of
GazeTrackingSensorType to only have
binary type for blink. Remove the
position component from the Wind sensor
Descriptions for
m18508 social network
Sanghyun Joo
services
Add the userProfile and communication
type for users.
Resolution: investigate on the possibilities
to integrate multi-user worlds in MPEG-V
Extends the base capability type.
Resolution1. Accepted.
Jae Joon Han , SeungJu
Resolution2: Make clear that in the case of
Sensor capability
Han , Won-Chul Bang ,
multidimentional sensors the min/max,
m18510 for gas and dust
James D.K. Kim , Sang-kyun intervals , … from
sensors
Kim , YongSoo Joo
SensorCapabilityBaseType are
overwritten and not used.
4.3.2 Reconfigurable Graphics Coding
Several joint meetings with Video RVC group took place to review the related contributions. The
resolutions are reported in the table above.
313
4.4
Explorations
none
5
Liaison
none
6
1
Output documents and Resolutions of 3DGC
Part 4 Conformance
1.1
2
The 3DG subgroup recommends to appoint Ralph Sperschneider and Francisco Morán
Burgos as editors of 14496-4:2004/Cor.7:2010
Part 16 Animation Framework eXtension (AFX)
2.1
The 3DG subgroup recommends approval of the following documents
No.
11665
11666
11667
11668
2.2
3
3.1
Title
14496-16 – Animation Framework eXtension (AFX)
Description of 3DG Core Experiments
Working Draft 3.0 on Multiresolution Mesh Compression
Working Draft 3.0 on Reconfigurable Graphics Coding
Reconfigurable graphics coding tutorial v2.0
TBP Available
No
No
No
No
10/10/15
10/11/15
10/10/30
10/10/15
The 3DG subgroup recommends NBs to study the input contribution m18614
“Proposed requirements for RVC and RGC specification language and solution” and
provide comments by the next MPEG meeting
Part 25 3DGCM
The 3DG subgroup recommends approval of the following documents
No.
Title
14496-25– 3DG Compression Model
Text of 2nd Edition of ISO/IEC 14496-25 FDIS 3DG compression
11669
model
TBP Available
No
10/10/15
3.2
The 3DG subgroup recommends to integrate 14496-25:2009/FDAM 1 into 14496-25
2nd Edition and issue FDIS 14496-25 2nd Edition.
7
Establishment of 3DG Ad-Hoc Groups
314
N11670
AHG on 3DG documents, software maintenance and core experiments
Mandate: 1. Conduct the experiments on efficient representation of 3D meshes with
multiple attributes
2. Conduct the experiments on Reconfigurable Graphics Coding
3. Maintain and edit 3DG documents
4. Coordinate 3DG related conformance and reference software
5. Coordinate editing of the www.mpeg-3dgc.com web site
Chairmen: Francisco Morán Burgos, Seung Wook Lee
Until 95th meeting
Duration:
Sunday before 95th meeting
Meetings:
mpeg-3dgc AT gti. ssr. upm. es
Reflector:
Subscribe: https://mx.gti.ssr.upm.es/mailman/listinfo/mpeg-3dgc
N11671
AHG on Multi-Resolution 3D mesh Coding
Mandate: 1. Design an integrated approach combining the PTFAN and KLPMC
techniques
2. Provide a first version of the verification model software
3. Continue the evaluation experiments
4. Update the testing platform (MMW.com)
Chairmen: Khaled Mamou, Minsu Ahn, Xiangyang Ji
Until 95th meeting
Duration:
Sunday before 95th meeting
Meetings:
mpeg-3dgc AT gti. ssr. upm. es
Reflector:
Subscribe: https://mx.gti.ssr.upm.es/mailman/listinfo/mpeg-3dgc
8
Closing of the Meeting
See you in Daegu.
315
Download
Study collections