INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC 1/SC 29/WG 11 CODING OF MOVING PICTURES AND AUDIO N11555 ISO/IEC JTC 1/SC 29/WG 11 Guangzhou, CN – October 2010 Source: Leonardo Chiariglione Title: Report of 94th meeting Status Report of 94th meeting .......................................................................................................................... 1 Annex A – Attendance list .................................................................................................................. 15 Annex B – Agenda .............................................................................................................................. 25 Annex C – Input contributions ............................................................................................................ 28 Annex D – Output documents ............................................................................................................. 53 Annex E – Requirements report .......................................................................................................... 57 Annex F – Systems report ................................................................................................................... 63 Annex G – Video report .................................................................................................................... 109 Annex H – Audio report.................................................................................................................... 266 Annex I – 3DG report ....................................................................................................................... 300 Report of 94th meeting 1 Opening The 94th MPEG Meeting will take place from 11th to 15th October 2010 at Guangzhou Baiyun International Convention Center, Oriental International Convention Hotel. 2 Roll call of participants Annex 1 provides the list of participants 3 Approval of agenda Annex 1 provides the adopted agenda 4 Allocation of contributions Annex 1 provides the list of input contributions 1 5 Communications from Convenor There was no specific communication. 6 Report of previous meeting This could not be approved because of its unavailability due to some missing components. 7 Processing of NB Position Papers These were presented and discussed. When appropriate responses were provided 8 Work plan management 8.1 Media coding 8.1.1 Frame Packing Arrangement Signalling in MPEG-2 Video The following documents were approved 11612 11613 Request for ISO/IEC 13818-2:2000/Amd.4 Text of ISO/IEC 13818-2:2000/PDAM4 Frame Packing Arrangement Signalling in MPEG-2 Video 8.1.2 Progressive High Profile The following document was approved 11616 Text of ISO/IEC 14496-10:201x/PDAM1 Progressive High Profile 8.1.3 MPEG Frame-Compatible Enhancement (MFC) The following documents were approved 11681 11682 Requirements for MPEG Frame-Compatible Enhancement (MFC) Test Conditions and Evaluation Process for MFC 8.1.4 AFX 3rd edition The following document was approved 11665 Description of 3DG Core Experiments 8.1.5 Multiresolution 3D mesh Coding The following document was approved 11666 Working Draft 3.0 on Multiresolution Mesh Compression 8.1.6 Reconfigurable Graphics Coding The following documents were approved 2 11667 11668 Working Draft 3.0 on Reconfigurable Graphics Coding Reconfigurable graphics coding tutorial v2.0 8.1.7 Composite Font in Open Font Format The following documents were approved 11571 11572 Evaluation Report WD of ISO/IEC 14496-28 Composite Font 8.1.8 Contract Expression Language The following documents were approved 11573 11664 WD of ISO/IEC 21000-20 Contract Expression Language Description of CE on Contract Expression Language 8.1.9 Codec Configuration Representation The following document was approved 11623 Working Draft of ISO/IEC 23001-4/Amd.1 8.1.10 Video Tool Library The following documents were approved 11624 11625 11627 11628 Disposition of Comments on ISO/IEC 23002-4/FPDAM1 Text of ISO/IEC 23002-4/FDAM1 Software and Conformance Testing for Video Tool Library WD of ISO/IEC 23002-4/Amd.3 Functional Units for SVC Profiles Description of Core Experiments in RVC 8.1.11 Spatial Audio Object Coding The following document was approved 11657 Report on Spatial Audio Object Coding Verification Tests 8.1.12 Unified Speech and Audio Coding The following documents were approved 11659 11660 Study on ISO/IEC 23003-3:201x/CD of Unified Speech and Audio Coding Status and Workplan for USAC CEs 8.1.13 High-Efficiency Video Coding The following documents were approved 3 11642 11629 11643 11645 Process of Test Model Development for HEVC High Efficiency Video Coding (HEVC) Test Model 1 (HM 1) Encoder Description High Efficiency Video Coding (HEVC) text specification Working Draft 1 Status of HEVC Reference Software Copyright Management in JCT-VC 8.1.14 Media Context and Control – Control Information The following document was approved 11589 WD of ISO/IEC 23005-2 AMD 1 Additional Control Information and binary representation 8.1.15 Media Context and Control – Sensory Information The following document was approved 11590 WD of ISO/IEC FDIS 23005-3 AMD 1 Binary representation of sensory information 8.1.16 Media Context and Control – Virtual World Object Characteristics The following document was approved 11591 WD of ISO/IEC FDIS 23005-4 AMD 1 Binary representation of Virtual World Object Characteristics 8.1.17 Media Context and Control – Data Formats for Interaction Devices The following document was approved 11592 WD of ISO/IEC FDIS 23005-5 AMD 1 Additional sensors, interaction devices and binary representation 8.1.18 Media Context and Control – Common Types and Tools The following document was approved 11593 WD of ISO/IEC FDIS 23005-6 AMD 1 Binary representation of Common types and tools 8.1.19 3D Video Coding The following documents were approved 11678 11679 11630 11631 Applications and Requirements on 3D Video Coding Draft Call for Proposals on 3D Video Coding Technology Description of Exploration Experiments in 3D Video Coding Report on Experimental Framework for 3D Video Coding 4 8.1.20 Low-Complexity Video Coding The following document was approved 11673 Low-Complexity Video Coding – Draft Problem Statement 8.1.21 Future Audio The following document was approved 11661 Audio for High Efficiency Video Coding 8.1.22 Option-1 Licensable Video Coding The following document was approved 11676 Call for Evidence on Option-1 Video Coding Technology 8.2 Composition coding 8.2.1 Collaborative Applications by using MPEG Scene Description Standards The following document was approved 11569 Technologies under Considerations for Collaborative Applications by using MPEG Scene Description Standards 8.3 Description coding 8.3.1 Compact Descriptors for Visual Search The following documents were approved 11675 11674 Compact Descriptors for Visual Search: Evaluation Framework (draft) Compact Descriptors for Visual Search: Draft Call for Proposals 8.3.2 Extraction and Matching of Video Signature Tools The following document was approved 11621 Study Text of ISO/IEC 15938-8/PDAM6 Extraction and Matching of Video Signature Tools 8.3.3 AudioVisual Description Profile The following document was approved 11677 ISO/IEC 15938-9:2005/PDAM 1 5 8.4 Transport and File formats 8.4.1 File formats The following documents were approved 11570 11586 Request for ISO/IEC 14496-12:2008 AMD 3 DASH support and RTP reception hint track processing Text of ISO/IEC 14496-12:2008 PDAM 3 DASH support and RTP reception hint track processing 8.4.2 MVC operation point descriptor The following documents were approved 11560 11562 Study of ISO/IEC 13818-1:2007/FPDAM 6 Extension to AVC video descriptor and signaling of operation points for MVC WD of ISO/IEC 13818-1:2007/AMD 7 Signalling of stereoscopic video in MPEG-2 Systems 8.4.3 Dynamic Adaptive Streaming over HTTP The following documents were approved 11577 11578 11579 Request for the subdivision of ISO/IEC 23001-6 Text of ISO/IEC CD 23001-6 Dynamic Adaptive Streaming over HTTP Description of Evaluation Experiments on Dynamic Adaptive Streaming over HTTP 8.5 Multimedia architecture 8.5.1 SC3DMC for 3DGCM The following document was approved 11669 Text of 2nd Edition of ISO/IEC 14496-25 FDIS 3DG compression model 8.5.2 MPEG-V Use cases The following document was approved 11588 WD of ISO/IEC 23005-1 AMD 1 Additional Use Cases 8.5.3 MPST Architecture and Technologies The following document was approved 11581 Text of ISO/IEC CD 23001-1 2nd edition Multimedia Service Platform Technologies – Architecture 6 8.5.4 MXM API The following document was approved 11582 Text of 2nd edition of ISO/IEC CD 23006-2 MPEG Extensible Middleware API 8.5.5 Elementary Services The following document was approved 11583 Text of 2nd edition of ISO/IEC CD 23006-4 Elementary Services 8.5.6 Service Aggregation The following documents were approved 11692 11584 Request of subdivision of ISO/IEC 23006-5 Service Aggregation Text of ISO/IEC CD 23006-5 Service Aggregation 8.5.7 Widget Extensions The following documents were approved 11598 11585 Request for ISO/IEC 23007-1 AMD 1 Widget Extensions Text of ISO/IEC 23007-1 PDAM 1 Widget Extensions 8.6 Application formats 8.6.1 Interactive Music AF dynamic volume change The following document was approved 11599 Description of CE on dynamic volume change 8.7 Reference implementation 8.7.1 BIFS ExtendedCore2D Profile Reference Software The following documents were approved 11567 11568 Request for ISO/IEC 14496-5:2001/AMD 30 Reference software for BIFS ExtendedCore2D Profile Text of ISO/IEC 14496-5:2001/PDAM 30 Reference software for BIFS ExtendedCore2D Profile 8.7.2 LASeR adaptation tools Reference Software The following documents were approved 11565 DoC on ISO/IEC 14496-5:2001/FPDAM 28 Reference software for LASeR adaptation tools 7 11566 Text of ISO/IEC 14496-5:2001/FDAM 28 Reference software for LASeR adaptation tools 8.7.3 Video Signature Tools Reference Software The following documents were approved 11617 11618 DoC on ISO/IEC 15938-6:2001/PDAM4 Text of ISO/IEC 15938-6:2001/FPDAM4 Reference Software for Video Signature Tools 8.7.4 DASH Reference Software The following document was approved 11580 Workplan of Reference Software for DASH 8.7.5 Stereoscopic video AF Reference Software The following document was approved 11575 Text of ISO/IEC 23000-11 FDAM 1 Conformance and Reference Software 8.7.6 MPEG Rich Media UI Reference Software The following document was approved 11587 Study of ISO/IEC FCD 23007-3 Conformance and Reference Software 8.7.7 Media Context and Control – Reference Software The following document was approved 11594 Study of ISO/IEC FCD 23005-7 Conformance and Reference Software 8.8 Conformance 8.8.1 BIFS ExtendedCore2D Profile Conformance The following documents were approved 11563 11564 Request for ISO/IEC 14496-4:2004/AMD 40 Conformance for BIFS ExtendedCore2D Profile Text of ISO/IEC 14496-4:2004/PDAM 40 Conformance for BIFS ExtendedCore2D Profile 8.8.2 Video Signature Tools Conformance The following documents were approved 8 11619 11620 DoC on ISO/IEC 15938-7:2001/PDAM6 Text of ISO/IEC 15938-7:2001/FPDAM6 Conformance for Video Signature Tools 8.8.3 Stereoscopic video AF Conformance The following document was approved 11575 Text of ISO/IEC 23000-11 FDAM 1 Conformance and Reference Software 8.8.4 Interactive Music AF Conformance The following documents were approved 11575 11576 Study of ISO/IEC 23000-12 FPDAM 1 Conformance and Reference Software Workplan of ISO/IEC 23000-12 Conformance and Reference Software 8.8.5 MPEG Rich Media UI Conformance The following document was approved 11587 Study of ISO/IEC FCD 23007-3 Conformance and Reference Software 8.8.6 Media Context and Control – Conformance The following document was approved 11594 Study of ISO/IEC FCD 23005-7 Conformance and Reference Software 8.9 Maintenance 8.9.1 Systems coding standards The following document was approved 11561 List of errata items of ISO/IEC 13818-1:2007 8.9.2 Video coding standards The following documents were approved 11614 11615 11622 Text of ISO/IEC 14496-5:2001/Amd.5 DCOR1 Defect report on ISO/IEC 14496-10:201x Text of ISO/IEC 23001-4/DCOR1 8.9.3 Audio coding standards The following documents were approved 11654 11655 ISO/IEC 13818-4:2004 DCOR 2, AAC Intensity Stereo ISO/IEC14496-26:2010/Cor 2 BSAC Corrections 9 11656 11658 9 ISO/IEC 14496-26:2010/DCOR 3, Intensity Stereo Defect Report of ISO/IEC 23003-2:2010, Spatial Audio Object Coding Organisation of this meeting 9.1 Tasks for subgroups The following tasks were assigned Requirements Std Systems Pt Amd 2 4 2 10 28 7 3 Std 2 4 7 21 A B M V Pt Amd 1 5 7 10 6 5 ? 22 1 28 12 Amd2 2 Amd1 4 Amd4 8 Amd1 6 Cor 11 1 2 12 1 6 1 1 2 3 4 5 1 2 3 4 5 6 Low complexity video codec Stereo video Stereo video Composite Font Format 3DV MMT – CfP, Requirements, Context & Objectives, Use Cases Compact Descriptors for Visual Search MPEG-V extensions MPEG-V Profiles Type 1 licensing codec New standard areas Audio for HVC Carriage of JPEG2000 Signaling of Stereoscopic Video MVC operation point descriptors SVC FF RS Open Font Format Composite Font Format Semantic enhancement PDI Protection of presentation element MVCO RS Professional Archival AF Stereoscopic video AF RS & C Additional composition type Interactive music AF RS & C DASH Architecture MXM API RS & C Elementary Services Aggregated Services Architecture Control Information Sensory information Virtual world object characteristics Data representation for interaction devices Common types and tools 10 7 U 2 3 Conformance and reference software MPEG-V Profiles Advanced input device interface Reference Software and Conformance Video 2 4 4 4 7 2 10 10 10 Cor 1 3 ? 6 7 8 4 1 2 Stereo video Stereo video AVC profiles Miscellanea Compact Descriptors for Visual Search Video Signature Tools RS Video Signature Tools Conf Extraction and Use of MPEG-7 Descriptions Video Tool Library Conformance & RS Video Tool Library extensions 3DV/FTV HEVC 26 Cor BSAC for broadcasting 2 Spatial Audio Object Coding 2 Amd1 Spatial Audio Object Coding C & RS 3 USAC New audio issues (HEVC) 5 27 Scalability complexity 3DMC RS 16 2 Mesh representation supporting multiple attributes 3 Multiresolution 3DMC 27 2 Scalable complexity 3DMC Conformance 3 Scalable complexity 3DMC in part 25 Conformance Reconfigurable Graphics Coding C H Audio D 3DG 4 9.2 Joint meetings The following joint meetings were held Groups S, R R, V S, R, V, A E, V S, 3 R, V R, V V, VCEG R, 3, S R, A 3, V S, 3 R, V R, V V, VCEG What Font Format Type 1 licensing MP7 Profiles RGC MPEG-V Stereo Video Enhancement Visual search CfP AVC issues User description in MPEG-V Audio for HEVC RVC & RGC MPEG-V/U Frame compatible stereo enhancement Visual search CfP AVC issues 11 Day Mon Mon Tue Tue Tue Tue Tue Tue Tue Wed Wed Thu Thu Thu Thu Time1 16:00 16:30 09:00 09:00 14:00 14:00 16:00 17:00 17:30 14:00 15:00 09:00 11:00 12:00 14:00 Time2 16:30 18:00 10:00 10:00 17:30 15:00 17:00 18:00 18:00 15:00 16:00 10:00 12:00 13:00 15:00 Where R R R 3 3 V R J 3 A 3 3 R R V 10 WG management 10.1 Terms of reference The following document was approved 11600 Terms of reference 10.2 Liaisons 10.2.1 Letters The following documents were approved 11680 11635 11636 11637 11638 11639 11640 11641 11663 11694 11695 11696 Liaison letter to ITU-T SG 16 on Telepresence Systems Liaison statement to SC 6 on MMT Liaison statement to 3GPP on DASH Liaison statement to JTC 1/WG 7 on Sensor Networks Liaison statement to IEC TC 100 on the definition of “trick mode” Liaison statement to JTC 1/SC 24 on “zip” format Liaison statement to IEC TC 100 on IEC CDV 62516-2, BIFS in T-DMB receiver Liaison statement to IEC TC 100 on Richmedia for e-Book Statement of benefits from establishing a Category C liaison with DTG Liaison Statement to 3GPP TSG SA4 Liaison Statement to Wireless Gigabit Alliance Liaison Statement to ISO/IEC JTC 1/SC 37 10.2.2 Templates The following documents were approved 11683 11632 11633 11634 Liaison statement template on MPEG frame-compatible activities Liaison statement template on DASH Liaison statement template on MSPT Liaison statement template on signalling of stereoscopic video in MPEG-2 System 10.2.3 Statement of benefits The following document was approved 11697 Statement of benefits from establishing a Category C liaison with Wireless Gigabit Alliance 10.2.4 List of liaisons The following document was approved 11611 List of Organisations with which MPEG entertains liaisons 12 10.3 Ad hoc groups The following document was approved 11554 Ad hoc groups established at Guangzhou 10.4 Asset management The following documents were approved 11605 11606 11607 11608 11609 Schema assets Software assets Conformance assets Content assets URI assets 10.5 Facilities fees The following document was approved 11672 MPEG and facilities fees 10.6 IPR management The following document was approved 11610 Standards under development for which a call for patent statements is issued 10.7 Work plan and time line The following documents were approved 11601 11602 11603 MPEG Standards Table of unpublished FDISs Work plan and time line 11 Administrative matters 11.1 Schedule of future MPEG meetings The following schedule of future MPEG meetings was approved # 94 95 96 97 98 99 100 City Guangzhou Daegu Geneva Torino Geneva San José, CA Geneva Country CN KR CH IT CH US CH 13 yy 10 11 11 11 11 12 12 mm 10 01 03 07 11-12 02 04-05 dd-dd 11-15 24-28 21-25 18-22 28-02 06-10 30-04 101 Stockholm 102 Suzhou? 103 Geneva SE CN CH 12 07 12 10 13 01 16-20 15-19 ??-?? 11.2 Promotional activities The following documents were approved 11691 11595 11596 11557 MPEG-V awareness day Call for Participation Storyboard for MPEG-U promotion video Short descriptions Guangzhou press release 12 Resolutions of this meeting These were approved 13 A.O.B. There was no other business 14 Closing The meeting closed at 2010/10/15T20:20 with thanks to the hosting organisation 14 Annex A – Attendance list Name Michael Grafl Christian Timmerer Mohamad Raad Kenneth Vermeirsch Jan De Cock Marc Jacobs Joeri Barbarien Louis Zhang Jian Wang Gaelle Martin-Cocher Dake He Panos Nasiopoulos Mahsa Pourazad Roch Lefebvre Feng Pan Philippe Gournay Bruno Bessette Nikolce Stefanoski Marco Mattavelli Touradj Ebrahimi Jean-Pierre EVAIN Zongbiao Niu Carmen Cheng Qiao Shujuan Wenpeng Ding Hui Liu Sijia Chen Lai-Man Po Xiaoran Cao Yun He Hongzhang Yang Jinahua Zheng Yu Liu Xing WEN Oscar Au Hongbo Zhu JING WANG Shanfu Lee Wei Xiao Yang Mingyuan Qiu Shen Sixin Lin Changcai Lai Yongbing Lin Peiyu YUE Affiliation Klagenfurt University lagenfurt University RaadTech Consulting IBBT - Ghent University IBBT - Ghent University IBBT - VUB IBBT - VUB AMD Polycom Inc. Reasearch In Motion Limited Research In Motion Ltd. University of British Columbia University of British Columbia University of Sherbrooke ViXS Systems Inc. VoiceAge Corporation VoiceAge Corporation Disney Research, Zurich EPFL EPFL European Broadcasting Union ASTRI Beihang University Beijing University of Technology Cisco Cisco City University of Hong Kong Dept. of Electronic Engineering, Tsinghua University Dept. of Electronic Engineering, Tsinghua University Freescale Semiconductor China HiSilicon Technologies Co. Ltd, HK-ASTRI HONG KONG UNIV. OF SCIENCE & TECHNOLOGY HONG KONG UNIV. OF SCIENCE & TECHNOLOGY Hongbo Zhu HUAWEI Huawei Huawei Technologies HUAWEI TECHNOLOGIES CO., LTD Huawei Technologies Co., Ltd Huawei Technologies Co., Ltd Huawei Technologies Co., Ltd Huawei Technologies Co., Ltd Huawei Technologies Co., Ltd 15 country AT AT AU BE BE BE BR CA CA CA CA CA CA CA CA CA CA CH CH CH CH CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN Pulin Wang Jin Song Haitao Yang weizhong Chen Yunxuan Zhao Xiaozhen Zheng Zhang Shaobo Lidong Xu Weimin(Wilma) Su Shawmin Lei Yu-Wen Huang Jicheng An Ching-Yeh Chen Chih-Wei Hsu Yu-Lin Chang Xun Guo Jizheng Xu Jianxin Yan Feng Shao Tiejun Huang Siwei MA Li Zhang Lingyu Duang Wen Gao An Ping Song Li ZHAO Haiwu Chenchen Gu Xunan Mao Qingyi Zhao Xuemei Yan Feng Cen Min Lu Yuan Yuan Qifei Wang Yongbing Zhang Xiangyang Ji Junyan Huo Yanzhuo Ma Wei Li Fuzheng Yang Lu YU Zhang Wen Dong Wang Yingjie Hong Ming Li Zhuping Li Fang Wang HUAWEI TECHNOLOGIES CO., LTD. Huawei Technologies Co., Ltd. Huawei Technologies Co., Ltd. Huawei technologies CO.,LTD HUAWEI TECHNOLOGIES CO.,LTD Huawei Technologies Ltd., Co Huawei Techonologies Co. Ltd Intel China Research Center Intellectual Property & Standards China, Philips MediaTek MediaTek MediaTek MediaTek MediaTek MediaTek MediaTek (Beijing) Inc. Microsoft Research Asia National Engineering Laboratory for audio Coding Technology Ningbo University Peking University Peking University Peking University Peking University Peking Universtiy School of Communication and Information Engineering Shanghai jiaotong university Shanghai University Shenzhen Tencent Computer Systems Conpany Limited Tencent Technology (Beijing) Company Limited Tencent Technology (Shenzhen) Co.,Ltd Tencent Technology (Shenzhen) Co.,Ltd Tongji University Tsinghua University Tsinghua University Tsinghua University Tsinghua University Tsinghua University Xidian University Xidian University Xidian University Xidian University Zhejiang Unviersity ZTE ZTE Coporation ZTE Corporation ZTE Corporation ZTE Corporation ZTE Corporation 16 CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN CN Yanzi Yang Ferenc Kraemer Gerhard Tech Benjamin Bross Martin Winken Karsten Suehring Karsten Mueller Detlev Marpe Karsten Grueneberg Haricharan Lakshman Max Neuendorf Markus Multrus Julien Robilliard Bernhard Grill Ingo Hofmann Terentiev Leon Oliver Hellmuth David VIRETTE Joern Ostermann Zhijie Zhao Sven Klomp Thomas Wedi Matthias Narroschke Virginie Drugeon Thomas Stockhammer Jens-Rainer Ohm Gero Baese Peter Amon Aritz Sanchez de la Fuente Thomas Wiegand Oliver Wuebbolt Alexander Glantz Andreas Krutz Shujun Li Victor Rodriguez Doncel Pablo Carballeira Francisco Moran Burgos Jaime Delgado M. Oguz Bici Jani Lainema Dmytro Rusanovskyy Kemal Ugur Imed Bouazizi Helmut Burklin Guillaume LAROCHE Patrice ONNO Gordon Clare Mickael Raulet Laurent Guillo ZTE Corporation Dolby Germany GmbH Fraunhofer Heinrich Hertz Institute Fraunhofer HHI Fraunhofer HHI Fraunhofer HHI Fraunhofer HHI Fraunhofer HHI Fraunhofer HHI Fraunhofer HHI Fraunhofer IIS Fraunhofer IIS Fraunhofer IIS Fraunhofer IIS Fraunhofer IIS Fraunhofer Institut IIS Fraunhofer Institut IIS HUAWEI TECHNOLOGIES CO.,LTD. Institut fiuer Informationsverarbeitung Leibniz Universitaet Hannover Leibniz University Hannover Panasonic Panasonic R&D Center Gemany Panasonic R&D Center Germany Qualcomm Incorporated RWTH Aachen University Siemens Siemens AG Technical University of Berlin Technical University of Berlin Technicolor Technische University Berlin Technische University Berlin University of Konstanz Universitat Politecnica de Catalunya Universidad Politecnica de Madrid Universidad Politécnica de Madrid Universitat Politecnica de Catalunya Nokia Nokia Nokia Nokia Nokia Research Center AFNOR CANON RESEARCH CENTRE FRANCE SAS CANON RESEARCH CENTRE FRANCE SAS France Telecom Orange Labs IETR/INSA Rennes INRIA Rennes 17 CN DE DE DE DE DE DE DE DE DE DE DE DE DE DE DE DE DE DE DE DE DE DE DE DE DE DE DE DE DE DE DE DE DE E ES ES ES FI FI FI FI FI FR FR FR FR FR FR Ronan BOITARD Marius Preda Mihai Mitrea Pierrick Philippe Joel Jung Sylvain Kervadec Patrick Lopez Edouard Francois Mary-Luc Champel Jean-Claude Dufourd Jean LE FEUVRE Avraham Shimor Sergio Matone Leonardo Chiariglione Alessandra Mosca Giovanni Cordara Masato Shima Kimihiko Kazui Jumpei Koyama Tomokazu Murakami Yukiko Ogura Shuichi Aoki Atsuro Ichigaya Shinichi Sakaida Shigeru Fukushima Ueda Motoharu Tomonobu Yongbing Kazuo Sugimoto Tokumichi Murakami Kohtaro Asai Shun-ichi Sekiguchi Masayuki Tanimoto Meindert Wildeboer Taka Senoh Hirofumi Aoki Chono Keiichi Kenta SENZAKI Kota Iwamoto Masanori Sano Yukihiro Bandoh Takehiro Moriya Yoshihide Tonomura Shinya Shimizu Shohei Matsuo Seishi Takamura Tsutsumi Kimitaka Thiow Keng Tan INRIA Rennes Institut TELECOM Institut Télécom; Télécom Sudparis, ARTEMIS Department Orange Labs Orange Labs Orange Labs Technicolor technicolor Technicolor Telecom ParisTECH Telecom ParisTech SanDisk Corporation CEDEO.net CEDEO.net CSP s.c.a r.l. Telecom Italia Lab Canon Inc FUJITSU LABORATORIES LTD. FUJITSU LABORATORIES LTD. Hitachi, Ltd IPSJ/ITSCJ Japan Broadcasting Corporation Japan Broadcasting Corporation (NHK) Japan Broadcasting Corporation (NHK) JVC KENWOOD Holdings, Inc JVC KENWOOD Holdings, Inc KDDI R&D Laboratories Inc. Mitsubishi Electric Corporation Mitsubishi Electric Corporation Mitsubishi Electric Corporation Mitsubishi Electric Corporation Nagoya University Nagoya University National Institute of Information and Communications Technology NEC Corporation NEC Corporation NEC Corporation NEC Corporation NHK NTT NTT NTT NTT Corporation NTT Corporation NTT Cyber Space Laboratories NTT DOCOMO NTT DOCOMO, Inc 18 FR FR FR FR FR FR FR FR FR FR FR IL IT IT IT IT JP JP JP JP JP JP JP JP JP JP JP JP JP JP JP JP JP JP JP JP JP JP JP JP JP JP JP JP JP JP JP Akira Fujibayashi Kei Kikuiri Zhixiong Wu Takuyo Kogure TAKAHIRO NISHI Hisao Sasai Youji Shibahara Takeshi Norimatsu Sugio Toshiyasu Seiji Mochizuki Tadashi Uchiumi Yasuaki Tokumo Tomoyuki Yamamoto Jun Yonemitsu Shinobu Hattori Kazushi Sato Teruhiko SUZUKI Yasuaki Yamagishi Kenji Kondo Toru Chinen Masayuki Nishiguchi Ikuo Tsukagoshi Toshiaki Fujii Itaru Kaneko AKIYUKI TANIZAWA Tomoo Yamakage Takeshi Chujoh Shiodera Taichiro Jin Xin Kim Daeyoon Sungmoon Chun Seyoon Jeong JIN SOO CHOI Bang Gun Miran Choi Jin Young Lee Inseon Jang Kyeongok KANG WEON GEUN OH Jung Won Kang Taejin Lee Seungkwon Beack Sang-il Na Kugjin Yun YOUNGKWON LIM Sung-Chang Lim Wonsuk Lee Won-Sik CHEONG NTT DOCOMO, Inc NTT DOCOMO, INC. Oki Electric Industry Co., Ltd. Panasonic Panasonic Panasonic Corporation Panasonic Corporation Panasonic Corporation Panasonic Corporation Renesas Electronics Corporation Sharp corporation Sharp corporation Sharp corporation Sony Sony Corp Sony Corp Sony Corp. Sony Corp. Sony corporation Sony Corporation Sony Corporation Sony Corporation, HE Business Group, HE Development Division Tokyo Institute of Technology Tokyo Polytechnic University TOSHIBA Corporation Toshiba Corporation Toshiba Corporation Toshiba Corporation Waseda Univ. ECT Inc. Electronics and Telecommunications Research Institute Electronics and Telecommunications Research Institute Electronics and Telecommunications Research Institute ETRI ETRI ETRI ETRI ETRI ETRI ETRI ETRI ETRI ETRI ETRI ETRI ETRI ETRI 19 JP JP JP JP JP JP JP JP JP JP JP JP JP JP JP JP JP JP JP JP JP JP JP JP JP JP JP JP JP KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR Hokyom Kim Seung Wook Lee Eun Seo Lee Kwang Roh Park Seong Yong Lim YONGJU CHO Cong Thang Truong Bumsuk Choi SangHyun Joo Hui Yong KIM Chang Yul Kim KIHUM HAN Seonghoon Kim Inkwon Kim YO-SUNG HO Jeha Ryu Haechul Choi Kiho Choi Minsoo Park Euee S. Jang Hyunok Oh Tae-young Jung Kibaek Kim Jechang Jeong Junwoo Lee Dong Seok Jeong WEN-HSIAO PENG Munchurl Kim Bumshik Lee IN JOON CHO Min Uk Kim CHEN YINGYING Yungho Choi KYOUNGRO YOON Kwangki Kim Euy-Doc Jang Un Ki Park Jae Keun Kwak Kyung Won Kim Xiaochao Qu Soomin Lim Yong-Goo KIM Eunbin Oh Woong Lim Hochong Park NeungJoo Hwang Donggyu Sim SEOUNG-JUN OH Chulkeun Kim ETRI ETRI ETRI ETRI ETRI ETRI ETRI ETRI ETRI ETRI Feelingk Feelingk Galaxia Communications Co., Ltd Galaxia Communications Co., Ltd. Gwangju Institute of Science and Technology Gwangju Institute of Science and Technology (GIST) Hanbat National University Hanyang University Hanyang University Hanyang University Hanyang University Hanyang University Hanyang university Hanyang University Inha Univ. Inha University ITRI International/NCTU KAIST KAIST KBS Technical Research Institute Konkuk University KONKUK UNIVERSITY Konkuk University KONKUK UNIVERSITY Korea Advanced Institute of Science And Technology Korea Aerospace University Korea Aerospace University Korea Electronics Association Korea Electronics Technology Institute Korea University MSL Korea University MSL Korean German Institute of Technology (KGIT) Korean Standards Association Kwangwoon Univ. Kwangwoon University Kwangwoon University Kwangwoon University Kwangwoon University Kyuanhee UNIV 20 KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR Yong-Chul Park KyuHeon Kim JangWon Lee JeonHo Kang GilBok Lee Heo Youngsu Kim Kyungyong Park Gwanghoon Lee Yoonjin Yongjoon Jeon SUNGYONG YOON Seungwook Park Jaehyun Lim Jaewon Sung Sangoh Jeong Byeong Moon Jeon Ji Wook Jung Sehoon Yea Hyeon Jae Lee Suh Doug Young Lee Yonghun Kyoung-Ho Choi Sang-Kyun Kim Jae-Gon Kim Lai-Tee Cheok Bo Gyeong Kang Kihyun Choo Ha Hojin il koo Kim Rhyu Sungryeul Jaejoon Lee Seungju Han Elena Alshina Woo-Jin Han Song Jaeyeon Kyong-Sok Seo Hee Jean Kim Jae Joon Han Jianle Chen Seok Lee Minsu Ahn Hwang Seo-Young JeongHoon Park JUNG SUK SUH Kyungmo Park Chanwon SEO Jong-Ki Han Hae Kwang Kim Daeil Yoon Kyung Hee University Kyung Hee University Kyung Hee University Kyung Hee University Kyung Hee University Kyunghee University Kyunghee University Kyunghee University Kyunghee University LG Electronics LG Electronics LG Electronics LG Electronics LG Electronics LG Electronics LG Electronics LG Electronics LG Electronics LG Electronics Kyunghee university Kyunghee university Mokpo National University Myongji University Korea Aerospace University Samsung SAMSUNG Samsung Advanced Institute of Technology Samsung elec. Samsung Electonics, Co., Ltd Samsung Electonics, Co., Ltd Samsung Electronics Samsung Electronics Samsung Electronics Samsung Electronics Samsung electronics Samsung Electronics Samsung Electronics Samsung Electronics Advanced Institute of technology Samsung Electronics Co. Samsung Electronics Co. Ltd. Samsung Electronics Co., Ltd Samsung Electronics Co., LTD. Samsung Electronics Co., Ltd. Samsung Electronics Co.,Ltd. Samsung Electronics Co.,Ltd. Sejong University Sejong University Sejong University Sejong University 21 KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR Joo Hee Moon Ju Ock Lee Hyoungmee Park Yung Lyul Lee Hong Sung-Wook Kim Jeong-Pil CHO JAEHEE KIM HYUNDONG Jeongyeon Lim Min Cheol Hong Yang Jungyoup Kwanghyun Won Byeungwoo Jeon Yong Han Kim Song Jeongook SANGYOUN Lee Seongwan Kim Lee Jaeseong Soongi Hong Jongho Kim Henney Oh Delphine Roussy Fons Bruls Werner Oomen Jean H.A Gelissen Arild Fuldseth Marek Domanski Krzysztof Wegner Kristofer Kjoerling Heiko Purnhagen Kenneth Andersson Rickard Sjoberg Thomas Rusert Per Frojdh Yongwei Zhu Zhengguo Li Chuo Hao Yeo Yih Han Tan Haiyan Shu Haishan Zhong Viktor Wahadaniah DAN ZHAO ChongSoon Lim Gerard Fernando Ying Weng Nikola Sprljan Stavros Paschalakis Ping Wu Miroslaw Bober Sejong University Sejong University Sejong University Sejong University Sejong University Sejong University Sejong University/DMS Sejong University/DMS SK Telecom Soongsil University Sungkyunkwa University Sungkyunkwa University Sungkyunkwa University University of Seoul Yonsei Univ. Yonsei University Yonsei University Yonsei University Yonsei University Yonsei University Yonsei University Koninklijke Philips Electronics N.V. Philips Philips Applied Technologies Philips Research Lifestyle Program Cisco Systems Norway Poznan University of Technology Poznan University of Technology Dolby R&D Sweden AB Dolby R&D Sweden AB Ericsson Ericsson Ericsson AB Ericsson Research Institute for Infocomm Research Institute for Infocomm Research Institute for Infocomm Research Institute for Infocomm Research Institute for Infocomm Research Panasonic Singapore Laboratories Panasonic Singapore Laboratories Panasonic Singapore Laboratories Pte Ltd Panasonic Singapore Laboratories Pte Ltd British Broadcasting Corporation Mitsubishi Electric R&D Centre Europe Mitsubishi Electric R&D Centre Europe BV Mitsubishi Electric R&D Centre Europe BV Mitsubishi Electric R&D Centre Europe BV 22 KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR KR NL NL NL NL NO PL PL SE SE SE SE SE SE SG SG SG SG SG SG SG SG SG UK UK UK UK UK UK Ken McCann Chad Fogg Arianne Hinds Walt Husak Nenad Rijavec David Singer David Pope Schuyler Quackenbush Alexander Giladi Flynn David Marta Mrak Thomas Davies Yang Zhijie BZ (Bazhong) Shen Lei Zhang Yong Yu Wade Wan Panos Kudumakis Yian Xu Huipin Zhang Junlin Li Frank Bossen Alexandros Tourapis Michael Biber Yan Ye John Ralston Bill Zou Pankaj Topiwala Nam Ling Ye-Kui Wang Lingzhi Liu Xin Wang Yi-Jen Chiu RAJENDRA BOPARDIKAR Yi-Shin Tung HAOPING YU Iraj Sodagar Gary Sullivan Andrew Tescher Kilroy Hughes Huifang Sun Dong Tian Robert Cohen Anthony VETRO Vladimir Levantovsky Mandayam Narasimhan Krit Panusopone ZetaCast representing Samsung ANSI ANSI ANSI Apple Aptina, LLC Audio Research Labs Avail-TVN British Broadcasting Corporation British Broadcasting Corporation British Broadcasting Corporation Broadcom Broadcom Corp Broadcom Corp Broadcom Corp BROADCOM CORPORATION Centre for Digital Music, Queen Mary University of London Cisco Systems Cisco Systems, Inc Cisco Systems, Inc. DOCOMO USA Labs Dolby Laboratories Dolby Laboratories Dolby Laboratories Inc Droplet Technology, Inc DTS FastVDO Huawei (China) - Santa Clara University Huawei Technologies Huawei Technologies USA Huawei Technologies, USA Intel Corp Intel Corp ITRI USA/MStar Semiconductor Media Technology Research Department, Corporate Research Microsoft Microsoft Corp. Microsoft Corporation Microsoft Corporation Mitsubishi Electic Research Labs Mitsubishi Electic Research Labs Mitsubishi Electric Mitsubishi Electric Monotype Imaging Inc Motorola Motorola 23 UK US US US US US US US US US US US US US US US US US US US US US US US US US US US US US US US US US US US US US US US US US US US US US US Ajay Luthra Faisal Ishtiaq Shih-Ta Hsiang Mark Watson Justin Ridge Lulin Chen Jesus Sampedro Marta Karczewicz Ying Chen Muhammed Coban Xianglin Wang Pengjun Huang Peisong Chen Yuriy REZNIK In Suk Chong Wei-Jung Chien Daniel O'Loughlin Felix Fernandes Nhut Nguyen Sanjeev Verma WANG Lai Ankur Saxena Guichun Li Andrew Segall Lazar Bivolarski Ali Tabatabai Cheung Auyeung Vivienne Sze Minhua Zhou Madhukar Budagavi Jill Boyce Michael Horowitz Dzung Hoang Motorola Motorola Inc. Motorola, Inc. Netflix Inc. Nokia Omneon Video Networks POLYCOM Qualcomm Qualcomm Qualcomm Qualcomm Inc Qualcomm Inc. Qualcomm Incorporated Qualcomm Incorporated Qualcomm Incorporated Qualcomm Incorporated Research in Motion Ltd Samsung Samsung Samsung Electronics Samsung Telecom America Samsung Telecommunications America Santa Clara University Sharp Labs of America Skype Sony Electronics Sony Elenctronics Inc Texas Instruments Texas Instruments Inc Texas Instruments Inc. Vidyo Vidyo, Inc. Zenverge, Inc. 24 US US US US US US US US US US US US US US US US US US US US US US US US US US US US US US US US US Annex B – Agenda # # # Item 1 2 3 4 5 6 7 8 1 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 2 1 3 1 2 3 4 4 5 1 6 1 7 1 2 3 Opening Roll call of participants Approval of agenda Allocation of contributions Communications from Convenor Report of previous meeting Processing of NB Position Papers Work plan management Media coding Scalable resolution enhancement of frame-compatible stereoscopic 3D video AFX 3rd edition Multiresolution 3D mesh Coding Scalable-complexity 3D mesh compression Efficient representation of 3D meshes with multiple attributes Many-to-one range mappings in Open Font Format Composite Font in Open Font Format Video Tool Library Spatial Audio Object Coding Unified Speech and Audio Coding High-Efficiency Video Coding Media Context and Control – Control Information Media Context and Control – Sensory Information Media Context and Control – Virtual World Object Characteristics Media Context and Control – Data Formats for Interaction Devices Media Context and Control – Common Types and Tools 3D Video Coding Contract Expression Language Low-Complexity Video Coding Composition coding Advanced User Interaction Description coding Video Signature Descriptors Compact Descriptors for Visual Search Extraction and Matching of Video Signature Tools MPQF semantic enhancement Systems support IPMP Protection of Presentation Element Digital Item Presentation of Digital Item Transport and File formats File formats SAOC transport MVC operation point descriptor 25 4 5 6 7 8 9 10 11 12 9 1 2 10 Carriage of JPEG 2000 in MPEG-2 Systems Sub-track selection & switching Dynamic Adaptive Streaming over HTTP MPEG Media Transport Multimedia architecture 1 SC3DMC for 3DGCM 2 Architecture and Technologies 3 MXM API 4 Elementary Services 5 Service Aggregation Application formats 1 Stereoscopic Video AF composition type Reference implementation 1 SC3DMC Reference Software 2 Scalable Complexity 3D Mesh Coding in 3DG Compression Model Reference Software 3 Video Signature Tools Reference Software 4 Stereoscopic video AF Reference Software 5 Interactive Music AF Reference Software 6 Video Tool Library Reference Software 7 MXM and AIT Reference Software 8 MPEG Rich Media UI Reference Software 9 Media Context and Control – Reference Software Conformance 1 MPEG-4 Audio Conformance 2 3DG Conformance 3 Scalable Complexity 3DMC Conformance 4 3D Graphics Compression Model Conformance 5 Video Signature Tools Conformance 6 Stereoscopic video AF Conformance 7 Interactive Music AF Conformance 8 Video Tool Library Conformance 9 MXM and AIT Conformance 10 MPEG Rich Media UI Conformance 11 Media Context and Control – Conformance Maintenance 1 Systems coding standards 2 Video coding standards 3 Audio coding standards 4 3DG coding standards 5 Systems description coding standards 6 Visual description coding standards 7 Audio description coding standards 8 MPEG-21 standards 9 MPEG-A standards Organisation of this meeting Tasks for subgroups Joint meetings WG management 26 1 2 3 4 5 6 7 8 1 2 3 4 9 10 11 1 2 12 13 14 Terms of reference Officers Editors Liaisons Responses to National Bodies Work item assignment Ad hoc groups Asset management Reference software Conformance Test material URI IPR management Work plan and time line Administrative matters Schedule of future MPEG meetings Promotional activities Resolutions of this meeting A.O.B. Closing 27 Annex C – Input contributions No. Source Title 17964 Webmaster Guangzhou document register 17965 Yi-Shin Tung, Jens-Rainer Ohm Ad Hoc on Maintenance of MPEG-4 Visual related Documents, Reference Software and Conformance 17966 Marco Mattavelli, Kazuo Sugimoto Ad Hoc on Reconfigurable Video Coding 17967 Miroslaw Bober, Ryoma Oami Ad Hoc on MPEG-7 Visual 17968 Karsten Müller, Anthony Vetro Ad Hoc on 3D Video Coding 17969 Francisco Morán Burgos, Seung Wook Lee Ad Hoc on 3DG documents, software maintenance and core experiments 17970 Khaled Mamou, Minsu Ahn, Xiangyang Ji Ad Hoc on Multi-Resolution 3D mesh Coding 17971 S. Quackenbush Ad Hoc on Audio Standards Maintenance 17972 S. Quackenbush Ad Hoc on SAOC, USAC, and Audio and Systems Interactions 17973 Jaeyeon Song, Cyril Concolato Ad Hoc on Scene Representation 17974 David Singer, Per Fröjdh Ad Hoc on MPEG File Formats 17975 Kyuheon Kim, Kugjin Yun Ad Hoc on Application Format 17976 Vladimir Levantovsky Ad Hoc on Font Format Representation 17977 Xin Wang, Young Kwon Lim Ad Hoc on Advanced IPTV Terminal 17978 Marius Preda Ad Hoc on MPEG-V 17979 Xin Wang, Jaime Delgado Ad Hoc on Contract Expression Language 17980 Iraj Sodagar, David Singer, Young Kwon Lim Ad Hoc on Dynamic Adaptive Streaming over HTTP 17981 Walt Husak, Ajay Luthra, Gary Sullivan Ad Hoc on scalable resolution enhancement of frame-compatible stereoscopic 3D video 17982 Thomas Davies, John Ralston Ad Hoc on Lightweight Video Coding 17983 Yuriy Reznik, Giovanni Cordara, Miroslaw Bober Ad Hoc on Compact Descriptors for Visual Search 17984 Christian Timmerer Ad Hoc on MPEG Media Transport (MMT) 17985 ITU-T SG 16 via SC 29 Secretariat Liaison Statement from ITU-T SG 16 on Approval of a New Question on Telepresence systems [SC 29 N 11508] 17986 ITU-T SG 16 via SC 29 Secretariat Liaison Statement from ITU-T SG 16 to SC 29 on Video coding collaboration [SC 29 N 11525] 17987 SC 29 Secretariat Summary of Voting on ISO/IEC 1449626:2010/DCOR 2 [SC 29 N 11526] 17988 SC 37 via SC 29 Secretariat ISO/IEC JTC 1/SC 37's CD and FCDs: FCD 19794-6.2, FCD 19794-9, FCD 19794-5.3, FCD 29109-6.2, FCD 19794-8.3 and CD 19794-11.4 [SC 29 N 11531] 17989 IEC TC 100 via SC 29 Secretariat IEC CDV 62516-2 [SC 29 N 11538] 17990 WGA via SC 29 Secretariat Liaison Statement from Wireless Gigabit Alliance [SC 29 N 11539] 17991 3GPP via SC 29 Secretariat Liaison Statement from 3GPP [SC 29 N 11540] 17992 SC 29 Secretariat Summary of Voting on ISO/IEC 15938- 28 6:2003/PDAM 4 [SC 29 N 11543] 17993 DTG via SC 29 Secretariat Liaison Statement from Digital TV Group [SC 29 N 11551] 17994 ITTF via SC 29 Secretariat Table of Replies on ISO/IEC 1449626:2010/FDAM 2 17995 SC 29 Secretariat Summary of Voting on ISO/IEC 1449625:2009/FPDAM 1 17996 SC 29 Secretariat Summary of Voting on ISO/IEC 159387:2003/PDAM 6 17997 ITTF via SC 29 Secretariat Table of Replies on ISO/IEC FDIS 23003-2 17998 ITTF via SC 29 Secretariat Table of Replies on ISO/IEC 1449611:2005/FDAM 7 17999 ITTF via SC 29 Secretariat Table of Replies on ISO/IEC FDIS 23007-1 18000 ITTF via SC 29 Secretariat Table of Replies on ISO/IEC 144963:2009/FDAM 2 18001 ITTF via SC 29 Secretariat Table of Replies on ISO/IEC 144961:2010/FDAM 1 18002 ITTF via SC 29 Secretariat Table of Replies on ISO/IEC 159383:2002/FDAM 4 18003 SC 29 Secretariat Summary of Voting on ISO/IEC 144965:2001/FPDAM 26 18004 SC 29 Secretariat Summary of Voting on ISO/IEC 1449627:2009/FPDAM 3 18005 SC 29 Secretariat Summary of Voting on ISO/IEC 144965:2001/FPDAM 28 18006 3GPP via SC 29 Secretariat Liaison Statement from 3GPP 18007 Waqar Zia DASH Evaluation Experiment 5: Delivery Format of MPEG-2 TS 18008 Report of DASH Evaluation Experiment 9, Delivery Format to File Format Conversion (DFC) Ye-Kui Wang, Sungryeul Rhyu, Miska M. Hannuksela 18009 Jin Young Lee, Harry Pyle DASH Evaluation Experiment #1: Compositions of Media Presentation (CMP) Proposal Comparison 18010 Jin Young Lee DASH Evaluation Experiment #2: MPD Modification, Corrections, and Extension 18011 Nhut Nguyen, Harry Pyle DASH Evaluation Experiment #8, Clock Drift Control 18012 Gerard Fernando, Kyungmo Park DASH Evaluation Experiment #4: Delivery Format Addressing 18013 Waqar Zia, Ying Chen, Yago Sanchez DASH Evaluation Experiment #3: Enhancement to Representation Description (ERD) 18014 Ying Chen, Ingo Hofmann DASH Evaluation Experiment #7: Seamless Switching (SES) 18015 Thomas Stockhammer Updated Information on Reference Software for DASH 18016 Ye-Kui Wang, Thomas Stockhammer EE#6: Trick Mode and Random Access Signaling (TRA) 18017 Mark Watson Some comments on DASH EE#7 – Seamless Switching 29 18018 Kilroy Hughes, Quintin Burns Proposed Amendment: Support for Common Encryption 18019 Kilroy Hughes, Quintin Burns Proposed Amendment: Independently Decodable Fragments 18020 JTC 1/WG 7 via SC 29 Secretariat Liaison Statement from ISO/IEC JTC 1/WG 7 [SC 29 N 11560] 18021 IEC TC 100 via SC 29 Secretariat Liaison Statement from IEC TC 100 [SC 29 N 11561] 18022 JTC 1/SC 34 via SC 29 Secretariat Liaison Statement from JTC 1/SC 34 [SC 29 N 11563] 18023 Bum Suk Choi, Eunseo Lee, Jong Hyun Jang, Kyoungro Yoon, Jonghyung Lee, Ying Ying Chen Update of Reference SW for MPEG-V Part 2 18024 Eunseo Lee, Bum Seok Choi, Kwang Roh Park, Kyoungro Yoon, Jonghyung Lee, Ying Ying Chen Update of Reference SW for MPEG-V Part 5 Proposal of GPS Sensor Capability Type, Altitude Sensor Capability Type, and Global Position Capability Type 18025 Kyoungro Yoon, Doohyun Kim, Min-Uk Kim 18026 3DV/FTV EE results of depth estimation and Gun Bang, Neungjoo Hwang, Donggyu Sim, Woong coding experiment on ‘Lovebird1’ Lim, Gi-Mun Um, Won-Sik Cheong, Namho Hur sequence 18027 Kyoungro Yoon, Soohan Kim, Woo Chool Park, Hae Proposal of Header information for binary Moon Seo transmission of MPEG-V information 18028 K. Kazui, J. Koyama, A. Nakagawa (Fujitsu) Evaluation result of JCTVC-B031 18029 A. G. Tescher for USNB USNB Contribution: WG11 resolution 2.1.4 consideration 18030 A. G. Tescher for USNB USNB Contribution: WG11 resolution 2.2.2 consideration 18031 A. G. Tescher for USNB USNB Contribution: WG 11 resolution 3.3.2 consideration 18032 A. G. Tescher for USNB USNB Contribution: Suggested practices for the HEVC project in the JCT-VC 18033 A. G. Tescher for USNB USNB Contribution: HEVC standardization timeline 18034 A. G. Tescher for USNB USNB Contribution: Option-1 video codec consideration 18035 T. Murakami, T. Yokoyama, K. Nakamura (Hitachi) TE1: Cross-check result of DMVD proposal JCTVC-C124 (Mitsubishi) 18036 Fuzheng Yang, Wei Li, Shuai Wan, Yilin Chang CTB splitting on frame boundary for arbitrary resolution video 18037 F. Yang, W. Li, S. Wan, Y. Chang (Xi'dian Univ.) Proposal of frame parameters FLCTB and FSCTB for video content fitting 18038 Jungsun Kim TE7: Verification results of Peking Univ. proposal (mode-dependent residual reordering for intra prediction residual) 18039 J. Kim, B. Jeon (LG Electronics) TE6: Verification results of Huawei and Hisilicon proposals (short distance intra prediction using the correlation between lines and pixels) 18040 Purvin Pandit TE12: Report on 64x64 versus 32x32 maximum coding unit size 18041 G. J. Sullivan, J.-R. Ohm (co-chairs) JCT-VC AHG report: JCT-VC project management 30 18042 K. McCann (chair), M. Karczewicz, J. Ridge, S. Sekiguchi, T. Wedi, T. Wiegand (vice-chairs) JCT-VC AHG report: Test Model under Consideration (TMuC) editing 18043 F. Bossen, P. Chen, D. Flynn, W.-J. Han, K. Sühring, H. Schwarz, K. Ugur JCT-VC AHG report: Software development and TMuC software technical evaluation 18044 R. Cohen, R. Joshi 18045 JCT-VC AHG report: Alternative transforms T. Yamakage (chair), Y. J. Chiu, M. Karczewicz, M. Narroschke (co-chairs) JCT-VC AHG report: In-loop and postprocessing filtering 18046 K. Panusopone, M. Budagavi, W.-J. Han, D. He JCT-VC AHG report: Large block structures 18047 K. Chono (chair), T. Chujoh, C. S. Lim (vice-chairs) JCT-VC AHG report: Memory compression 18048 M. Budagavi, A. Segall JCT-VC AHG report: Parallel entropy coding 18049 J. Xu, W. Ding JCT-VC AHG report: Screen content coding 18050 D. Alfonso, J. Ridge, X. Wen JCT-VC AHG report: Complexity assessment 18051 E. Francois (Technicolor), L. Guillo (INRIA), A. Ichigaya (NHK), H. Yu (Huawei) 18052 E. Francois, D. Thoreau, P. Bordes (Technicolor), L. TE3 – Subtest 2: Report on Simplified Guillo (INRIA), Geometry Block Partitioning 18053 Yoshiaki Shishikui, Yasutaka Matsuo, Atsuro Ichigaya, Kazuhisa Iguchi, Shinichi Sakaida (NHK) TE12: Report on AMP evaluation Characteristics of Super Hi-Vision test sequences 18054 S. Park, J. Park, B. Jeon TE3: Motion compensation with adaptive warped reference 18055 E. Francois, P. Bordes (Technicolor) TE3 – Subtest 2: Cross-check of results from Huawei 18056 R. Cohen, A. Vetro, H. Sun (Mitsubishi) TE7: Cross-verification of the 1D Directional Unified Transform in TMuC 18057 R. Cohen, A. Vetro, H. Sun (Mitsubishi) Alternative performance measurement of MDDT and ROT TMuC 18058 C. Yeo, Y. H. Tan, Z. Li, S. Rahardja (I2R) TE7: Results for mode-dependent fast separable KLT for block-based intra coding 18059 C. Yeo, Y. H. Tan, Z. Li, S. Rahardja (I2R) TE7: Cross-check results of MDDT simplification proposal from Huawei 18060 C. Yeo, Y. H. Tan, Z. Li, S. Rahardja (I2R) Choice of transforms in MDDT for unified intra prediction 18061 C. Yeo, Y. H. Tan, Z. Li, S. Rahardja (I2R) Chroma intra prediction using reconstructed luma 18062 A. Fujibayashi, S. Kanumuri, F. Bossen, T. K. Tan (NTT DoCoMo) TE12: Performance of partition based illumination compensation (PBIC) 18063 T. K. Tan, Frank Bossen (NTT DoCoMo) TE5: Results for simplification of unified intra prediction 18064 Y. Suzuki, F. Bossen (NTT DoCoMo) TE11: Simulation results of merge/skip (3.2b and 3.2d) 18065 F. Bossen, T. K. Tan (NTT DoCoMo) TE12: Results for experiments on max CU size, RDOQ and AIS 18066 T. K. Tan, Junya Takiue (NTT DoCoMo) TE6.a: Cross verification for line based intra prediction 18067 T. K. Tan (NTT DoCoMo), M. Budagavi (TI), J. Lainema (Nokia) TE5: Summary report on simplification of unified intra prediction 18068 B. Bross (Fraunhofer HHI) TE12.4: Transform skip flag proposed by Fraunhofer HHI (off vs. on) 18069 B. Bross (Fraunhofer HHI) TE12.6: PIPE and LCEC tested against CABAC by Fraunhofer HHI 31 18070 B. Bross TE12.4: Transform coding HHI tested against Samsung proposal by Fraunhofer HHI 18071 B. Bross (Fraunhofer HHI) TE12.2: Interpolation filter SIFO and DIF tested against MOMS by Fraunhofer HHI 18072 B. Bross (Fraunhofer HHI) TE12.6: PIPE tested against TENVLC by Fraunhofer HHI 18073 B. Bross (Fraunhofer HHI) TE12.1: Block merging (MRG) test (off vs. on) by Fraunhofer HHI 18074 B. Bross (Fraunhofer HHI) TE12.3: Adaptive intra smoothing (AIS) test (slow vs. fast) by Fraunhofer HHI 18075 B. Bross (Fraunhofer HHI) TE12.3: Adaptive intra smoothing (AIS) test (off vs. on) by Fraunhofer HHI 18076 S. Quackenbush 94th MPEG Audio Subgroup Report 18077 S. Quackenbush Draft USAC CE Status and Workplan 18078 K. Iguchi, A. Ichigaya, Y. Shishikui(NHK), S. Sekiguchi, A. Minezawa (Mitsubishi Electric) Performance report of TMuC for Super HiVision 18079 M. Zhou, V. Sze (TI) TE 12: Evaluation of transform unit (TU) size 18080 M. Zhou (TI) TE12: Evaluation of IBDI and TPE (transform precision extension) 18081 V. Sze, M. Budagavi (TI) TE12: Evaluation of entropy coders: PIPE vs. CABAC 18082 V. Sze, M. Budagavi TE12: Evaluation of transform coefficient coding (HHI_TRANSFORM_CODING) with tool breakdown 18083 M. Zhou, M. Budagavi (TI) TE2: TI reference frame compression proposal 18084 M. Budagavi (TI) TE5: TI evaluation of unified intra prediction simplifications 18085 M. Budagavi (TI) TE8: TI Parallel context processing (PCP) proposal 18086 V. Sze, M. Budagavi (TI) TE8: Evaluation of RIM-PCP 18087 V. Sze, M. Budagavi (TI) TE8: Evaluation of RIM-V2V entropy coding 18088 M. Zhou (TI) Testing results on the simplified TMuC configurations 18089 M. Zhou (TI) Cross verification of Mitsubishi 0.7.4 configuration J. Kim, M. Kim (KAIST), H.-Y. Kim (ETRI), K. Sato (SONY), X. Shen, L. Yu (Zhejiang Univ.), K. Choi, E. 18090 S. Jang (Hanyang Univ.), B. Bross (HHI), W.-J. Han TE9: Report on large block structure (Samsung), J.-K. Jo, S.-N. Park, D. G. Sim, S.-J. Oh (Kwangwoon U 18091 B. Lee, M. Kim (KAIST), J. Kim, H.-Y. Kim (ETRI) Improved side information signaling for QVBT in TMuC 18092 S. Li, L. Yu (Zhejiang Univ.) TE3: Test results of second order prediction 18093 X. Zhu, L. Yu (Zhejiang Univ.) TE7: Cross-check for I2R proposal on modedependent fast separable KLT for block-based intra coding 18094 L. Wang, L. Yu (Zhejiang Univ.) TE10 subset 2: Performance on Wiener-based in-loop filters 18095 J. Chen, L. Yu (Zhejiang Univ.) TE2: A memory bandwidth measurement algorithm for reference frame compression 18096 C. S. Lim, V. Wahadaniah, S. Naing, H. W. Sun TE2: Reference frame compression using 32 18097 (Panasonic) image coder C. S. Lim, V. Wahadaniah, S. Naing, H. W. Sun (Panasonic) TE2: Cross-check of memory compression results from NEC (JCTVC-B057/JCTVC-Cxxx) 18098 T. Chujoh, T. Yamakage (Toshiba) TE2: Adaptive scaling for bit depth compression on IBDI 18099 T. Chujoh (TE coordinator) TE2: Summary of TE2 on IBDI and memory compression 18100 T. Chujoh, T. Yamakage (Toshiba) A framework for standardization of memory compression 18101 T. Chujoh, K. Kanou, T. Yamakage (Toshiba) TE3 subtest 4: High accuracy interpolation filter (HAIF) 18102 T. Shiodera, A. Tanizawa, T. Chujoh, T. Yamakage (Toshiba) 18103 A. Tanizawa, J. Yamaguchi, T. Shiodera, T. Chujoh, TE7: 1-dimensional directional unified T. Yamakage (Toshiba) transform TE6 subset a: Bidirectional intra prediction 18104 A. Tanizawa, J. Yamaguchi (Toshiba) TE7: Cross-verification of mode dependent residual reordering (Peking Univ.) 18105 T. Yamakage, T. Chujoh, T. Watanabe (Toshiba) TE10 subtest 2: Reduction of number of encoding passes for quadtree adaptive loop filter (QALF) 18106 K. Chono, T. Yamakage (TE coordinators) TE10: Summary of TE10 on in-loop filtering 18107 T. Yamakage, T. Chujoh, T. Watanabe (Toshiba) Coding efficiency report of modification by TMuC draft005 18108 T. Yamakage, T. Chujoh, T. Watanabe (Toshiba) Comparison of loop & post filtering for in-loop and post-processing filtering AHG 18109 T. Chujoh, T. Yamakage (Toshiba) Experimental results of ALF on low complexity 18110 T. Chujoh, K. Kanou, T. Yamakage (Toshiba) TE12.2: Experimental results of interpolation filter 18111 T. Chujoh, T. Yamakage (Toshiba) TE12.7: Experimental results of IBDI 18112 X. Zhao, L. Zhang, S. Ma, W. Gao (Peking Univ.) TE7: Results for mode-dependent residual reordering for intra prediction residual 18113 X. Zhao, L. Zhang, S. Ma, W. Gao (Peking Univ.) TE7: Cross-verification results of I2R proposal 18114 K. Chono, K. Senzaki, H. Aoki, J. Tajime, Y. Senda (NEC) TE10: Conditional joint deblocking-debanding filter 18115 K. Chono, K. Senzaki, H. Aoki, J. Tajime, Y. Senda (NEC) TE12: Report on combined intra prediction evaluation 18116 H. Aoki, K. Chono, K. Senzaki, J. Tajime, Y. Senda (NEC) TE2: 1-D DPCM-based memory compression 18117 H. Aoki, K. Chono, K. Senzaki, J. Tajime, Y. Senda (NEC) An extension of DPCM-based memory compression to 2-D structure with ADPCM 18118 H. Aoki, K. Chono, K. Senzaki, J. Tajime, Y. Senda (NEC) Performance improvement of DPCM-based memory compression by adaptive quantization 18119 F. C. A. Fernandes Low complexity rotational transform 18120 M. Yang, S. Lin, D. Wang, J. Zhou, H. Yu, (Huawei) TE1:Huawei report on DMVD improvements in TMuC 18121 S. Lin, D. Wang, M. Yang, H. Yu, (Huawei) Cross-checking of DMVD results from Mitsubishi and Intel 18122 X. Zheng (HiSilicon), H. Yu, (Huawei) TE3.2: Huawei & HiSilicon report on flexible motion partitioning coding 18123 L. Liu (HiSilicon), J. Zhou, H. Yu (Huawei) TE5: Huawei & HiSilicon report on verification test results 33 18124 C. Lai, Y. Lin (HiSilicon) TE6.a: HiSilicon report on short distance intra prediction 18125 H. Yang, J. Zhou, H. Yu(Huawei) TE7: Symmetry-based simplification of MDDT 18126 H. Yang, J. Zhou, H. Yu (Huawei), TE7: Report of cross-checking results for Toshiba proposal 18127 X. Zheng (HiSilicon), J. Zhou, H. Yu (Huawei), TE12: Huawei & HiSilicon report on AMP evaluation 18128 J. Zhou, D. Wang, H. Yu (Huawei), TE12: Huawei report on max transform unit size evaluation 18129 H. Yang, J. Zhou, H. Yu (Huawei), Symmetry-based scan order sharing scheme for MDDT 18130 H. J. Kim Fast CABAC 18131 A. Saxena, F. C. A. Fernandes (Samsung) Jointly optimal intra prediction and adaptive primary transform 18132 B. Bross (Fraunhofer HHI) TE12.1: Results for PU-based merging by Fraunhofer HHI 18133 B. Bross (Fraunhofer HHI) TE12.4: Transform unit quadtree tested against 2-level by Fraunhofer HHI 18134 G. Li (Santa Clara Univ.), L. Liu, N. Ling (Santa Clara Univ.), J. Zheng, P. Zhang (Hisilicon) Integration of plane mode in unified intra prediction 18135 W. Dai, M. Krishnan, P. Jesudhas, P. Topiwala (FastVDO) Fast integer transforms for the HEVC test model 18136 I.S. Chong, W. Chien, M. Karczewicz (Qualcomm) Adaptive loop filter with low encoder complexity 18137 J. Lou, K. Panusopone, L. Wang (Motorola) Zigzag scan for CABAC/PIPE 18138 Y.-J. Chiu (Intel), Y.-W. Huang (MediaTek), M. Wien TE1: Summary report of TE1 activity (RWTH Aachen Univ.), H. Yu (Huawei) 18139 Y.-J. Jeon, B.-M. Jeon (LG Electronics) TE11: Cross verification of Sony proposal on motion vector coding 18140 Y. Yu (Broadcom) Implementation analysis of transform block size 18141 K. Sugimoto, A. Minezawa, S. Sekiguchi (Mitsubishi TE12.1 cross-check report on block merging Electric) (MRG) on/off 18142 K. Sugimoto, A. Minezawa, S. Sekiguchi (Mitsubishi TE12.2 report on MV prediction AMVP/IMVP Electric) 18143 K. Sugimoto, A. Minezawa, S. Sekiguchi (Mitsubishi TE12.3 cross-check report on AIS on/off Electric) 18144 S. Sekiguchi, K. Sugimoto (Mitsubishi Electric) Suggested approach toward HEVC test model creation 18145 K. Sugimoto, S. Sekiguchi (Mitsubishi Electric) A suggested starting point for HEVC test model 18146 K. Sugimoto, A. Minezawa, S. Sekiguchi (Mitsubishi On selecting coding tools for HEVC test model Electric) 18147 Y. Itani, S. Sekiguchi (Mitsubishi Electric) TE1: Implicit direct vector derivation 18148 S. Sekiguchi, Y. Itani (Mitsubishi Electric) TE1: Spatio-temporal direct mode 18149 D. Y. Kim, J. P. Kim, Y. L. Lee (Sejong University), J. Chroma interpolation filtering using high Lim, J. Song (SKT) precision filter Alberto Messina, Robbie De Sutter, Werner Bailer, 18150 Masanori Sano, Jean-Pierre Evain, Patrick NdjikiNya, Some changes to WD on MPEG-7 AudioVisual Description Profile (AVDP) 18151 Y.-J. Chiu, L. Xu, Wenhao Zhang, Hong Jiang (Intel) TE1: Report of self derivation of motion estimation improvement in TMuC 18152 Y.-J. Chiu, L. Xu, Wenhao Zhang, Hong Jiang (Intel) TE1: Cross-checking of DMVD result from 34 Huawei TE12: Cross verification for adaptive loop filter (ALF) (on vs. off) 18153 Y.-J. Chiu, L. Xu, W. Zhang, H. Jiang (Intel) 18154 J. Yang, K. Won, B. Jeon (SKKU), J. Lim, J. Song (SKT) TE10 subtest 1: Results of intra deblocking filter testing by SKKU/SKT 18155 J. Yang, K. Won, B. Jeon (SKKU), J. Lim, J. Song (SKT) TE10 subtest 1: Cross-verification result of Microsoft deblocking filter proposal by SKKU/SKT 18156 J. Yang, K. Won, B. Jeon (SKKU), J. Lim, J. Song (SKT) TE11: Cross-verification result of Orange Labs motion vector competition proposal (3.1a) by SKKU 18157 J. Yang, K. Won, B. Jeon (SKKU), J. Lim, J. Song (SKT) Motion vector competition method with reduced candidate sets 18158 D. He, G. Korodi, P. Imthurn, J. Jamias, D. O’Loughlin, G. Martin-Cocher (RIM) Comments on V2V coding for TM/TMuC 18159 D. Hoang (Zenverge) Flexible scaling of quantization parameter 18160 X. Zhang (HKUST) TE12.4: Mode dependent direnctional transform (MDDT) test (off vs. on) by HKUST 18161 S. Fukushima (JVC) Decoder-side block boundary decision (DBBD) with OBMC 18162 M.Ueda (JVC) TE1.a: Implementation report of Refinement Motion Compensation using DMVD on TMuC 18163 C. Lai, Y. Lin (HiSilicon), J. Zhou (Huawei) TE6.a: Huawei & HiSilicon report on cross verification of MSRA's LIC 18164 B. Bross (Fraunhofer HHI) TE9-2: Report on performance tests for different sets of PU modes by Fraunhofer HHI 18165 Yongwei Zhu, Rongshan Yu, Susanto Rahardja An experimental study of different dynamic volume change encoding methods for IMAF 18166 Yongwei Zhu, Rongshan Yu, Susanto Rahardja A study on the computational complexity of EQ implementation in IMAF 18167 Y.-L. Chang, Y.-W. Huang, S. Lei (MediaTek) TE8: Crosscheck on TI’s proposal of parallel context processing by MediaTek 18168 J. An, K. Zhang, Y. Gao, X. Guo, C.-M. Fu, Y.-W. Huang, S. Lei (MediaTek) TE10 subtest 1: Improved deblocking filter 18169 C.-Y. Tsai, C.-M. Fu, C.-Y. Chen, Y.-W. Huang, S. Lei (MediaTek) TE10 subtest 2: Coding unit synchronous picture quadtree-based adaptive loop filter (QALF) 18170 C.-M. Fu, Y.-W. Huang, S. Lei (MediaTek) TE10 subtest 2: Crosscheck on TOSHIBA’s proposal of adaptive loop filter by MediaTek 18171 C.-Y. Tsai, Y.-W. Huang, S. Lei (MediaTek) TE10 subtest 2: Crosscheck on SHARP’s proposal of adaptive loop filter by MediaTek Y.-L. Chang, C.-M. Fu (MediaTek), A. Segall, Y. Su 18172 (Sharp), C.-Y. Chen, Y.-W. Huang, S. Lei (MediaTek) TE10 subtest 3: Controlled clipping 18173 C.-Ming Fu, C.-Y. Chen, Y.-W. Huang, S. Lei (MediaTek) TE10 subtest 3: Quadtree-based adaptive offset 18174 Y.-L. Chang, Y.-W. Huang, S. Lei (MediaTek) TE12: Crosscheck on deblocking filter in TMuC0.7 by MediaTek 18175 A. Ichigaya, K. Iguchi (NHK) TE12.2: Report on asymmetric motion prediction unit (AMP) on/off 18176 Y. Choe, S. Hong (Yonsei Univ.), Y-G. Kim (KGIT) TE2: Cross-check of memory compression 35 results from TOSHIBA (JCTVC-C075) 18177 P. Amon, A. Hutter, E. Wige, A. Kaup (Siemens) Reference frame denoising 18178 Y. Xu, J. Li, W. H. Chen, D. Tian (Cisco Systems) Context-adaptive hybrid variable length coding 18179 P. Chen, M. Karczewicz (Qualcomm) TE12: Report on deblocking filter 18180 R. Sjöberg, P. Wennersten (Ericsson) Fine granularity slices 18181 K. Andersson, R. Sjöberg (Ericsson) TMuC 0.7 results with sequence dependent QPs 18182 A. Norkin, R. Sjöberg, K. Andersson (Ericsson) TE10: Cross verification of Mediatek’s deblocking filter by Ericsson 18183 J. Samuelsson, R. Sjöberg, K. Andersson (Ericsson) TE6.a: Cross verification of line-based intra prediction 18184 K. Andersson (Ericsson) TE7: Cross verification of MDDT simplification JCTVC-B024 18185 K. Andersson (Ericsson) TE12: Cross verification of RDOQ 18186 P. Chen, M. Karczewicz (Qualcomm) TE12: Crosscheck on LCEC phase 2 18187 K. Chono, K. Senzaki, H. Aoki, J. Tajime, Y. Senda (NEC) TE10: Cross-verification result of SKKU/SKT deblocking filter 18188 K. Kondo, T. Suzuki (Sony) TE12.2: Results on MC interpolation filters 18189 K. Sato, T. Suzuki (Sony) TE12.2: Results on MV prediction 18190 K. Kondo, T. Suzuki (Sony) TE3 subset 4: Results on bi/single filter switching in FIF 18191 K. Kondo, T. Suzuki (Sony) TE3 subset 4: Cross verification on high accuracy interpolation filter 18192 K. Sato (Sony) TE11: Study on motion vector coding (experiment 3.3.a & 3.3.c) 18193 K. Sato (Sony) Proposal on large block structure and quantization 18194 K. Kondo, K. Sato, T. Suzuki (Sony) Comments on common test conditions 18195 Japan National Body JNB comments on Resolution 2.1.4: Methods for signaling frame packing arrangement in MPEG-2 video 18196 Japan National Body JNB comments on Resolution 2.2.2 (3D Signaling in MPEG-2 video) 18197 Japan National Body JNB comments on N11465 about frame packing arrangement SEI message 18198 E. Maani, L. Dong, W. Liu (Sony) TE6.b: Experiment results of DCIM 18199 X. Zheng (HiSilicon), H. Yu (Huawei) TE3.2: Report on cross-verification of simplified geometry block partitioning from Technicolor 18200 A. Tabatabai, C. Auyeung (Sony) TE12.3: Experimental results of edge based prediction 18201 A. Tabatabai (Sony) TE6: Summary report of intra prediction improvements tool experiments 18202 T. Ikai, T. Yamamoto (SHARP) TE10 subtest 2: Parallel adaptive loop filter 18203 T. Yamamoto, A. Segall (SHARP) TE12.3: Report on combined intra prediction 18204 T. Yamamoto, Y. Yasugi (SHARP) TE6.b: Cross verification of differential coding of intra modes (DCIM) 18205 T. Yamamoto, Y. Yasugi (Sharp) Analysis and improvement of differential coding of intra modes 18206 P. Tao, D. Li, W. Wu, J. Wen (Tsinghua Univ.) Horizontal spatial prediction for intra coding 36 18207 A. Segall (Sharp) TE2: Cross-check of memory compression results from Panasonic (JCTVC-C074) 18208 A. Segall (Sharp) TE6: Cross-check of bi-directional intrapredition results from Toshiba 18209 A. Segall (Sharp) TE10.3: Cross check of MediaTek's proposal on quadtree-based adaptive offset 18210 K. Misra, A. Segall (Sharp) TE12.4: Cross-check of rotational transform (ROT) 18211 A. Segall (Sharp) TE12.2: Cross-check of adaptive motion vector resolution (AMVRES) technology in the TMuC software 18212 T. Yoshino, S. Naito, S. Sakazawa (KDDI) Enhanced switching of interpolation filter for HEVC 18213 Background for Proposing Scalable Application Kyoungro Yoon, Min-Uk Kim, Tae-Beom Lim, Kyung Markup Language for AIT’s Present Won Kim, Yun Ju Lee, Jae Won Moon Content Elementary Service 18214 Kyung Won Kim, Tae-Beom Lim, Yun Ju Lee, Jae Won Moon, Kyoungro Yoon, Min-Uk Kim 18215 A. Fuldseth (Cisco) 18216 Proposal of Scalable Application Markup Language (SAML) for AIT’s Present Content Elementary Service TE12.6: PIPE tested against LCEC by Cisco A. Fuldseth, G. Bjøntegaard (Cisco), A. Hallapuro, K. Ugur, J. Lainema (Nokia) Recent improvements of the low complexity entropy coder (LCEC) in TMuC 18217 J. Lainema, K. Ugur (Nokia) TE5: Results for simplified intra prediction tests by Nokia 18218 K. Iwata (Renesas) TE6.a: Cross-check of bidirectional intra prediction (JCTVC-B042) 18219 J. Lainema, K. Ugur, O. Bici (Nokia) TE12.3: Results for planar prediction tests by Nokia 18220 S. Mochizuki, K. Iwata (Renesas) Evaluation of intra prediction based on repetitive pixel replenishment Takanori Senoh (senoh@nict.go.jp), Kenji Yamamoto (k.yamamoto@nict.go.jp), Ryutaro Oi 18221 (oi@nict.go.jp), Yasuyuki Ichihashi (yichihashi@nict.go.jp), Taiichiro Kurita (tkurita@nict.go.jp), Depth Estimation Experiment with Poznan Hall 18222 Minsoo Park, Seungwook Lee, Bon Ki Koo, Hyunok Oh, Taehee Lim, Euee S. Jang Bug report in discrepancy between the textual specification in VTL and RVC-CAL implementation in RSM 18223 J. Kim, S.-C. Lim, H. Y. Kim, H. Lee, J. S. Choi (ETRI) TE7: Cross-check result of Peking Univ.’s proposal (JCTVC-C089) Minsoo Park, Seungwook Lee, Bon Ki Koo, Hyunok 18224 Oh, Sinwook Lee, Sowon Kim, Taehee Lim, Hyungyu Kim, Euee S. Jang Contribution to RGC tutorial 18225 Seungwook Lee, Bon Ki Koo, Sinwook Lee, Minsoo Park, Sowon Kim, Taehee Lim, Euee S. Jang Contribution of 3DMC FUs for RGC framework 18226 S. Jeong, J. Lee, H. Y. Kim, S.-C. Lim (ETRI), K. Kim, H. Lee, G. Park (KHU) TE11: Cross-check result of merge/skip (3.2c) 18227 Jae Won Moon, Kyung Won Kim, Tae-Beom Lim, Yun Ju Lee, Kyoungro Yoon, Min-Uk Kim Proposal of Stack Operators for AIT’s Present Content Elementary Service 18228 Y. Choe, J. Kim (Yonsei Univ.), Y-G. Kim (KGIT) 37 TE7: Cross-verification results of Huawei proposal on simplified MDDT for intra prediction residual 18229 Tae-Beom Lim, Kyung Won Kim, Yun Ju Lee, Jae Won Moon, Kyoungro Yoon, Min-Uk Kim Initial Proposal of DCFunctions for AIT’s Present Content Elementary Service 18230 K. Zhang, X. Guo, J. An, M. Guo, Y.-W. Huang, S. Lei (MediaTek) Overlapped block intra prediction P. Wu, S. Paschalakis, N. Sprljan (Mitsubishi 18231 Electric) TE10 subtest 2: Cross-check results of JCTVCC082 [Toshiba] Reduction of number of encoding passes for quadtree-based adaptive loop filter (QALF) P. Wu, S. Paschalakis, N. Sprljan (Mitsubishi Electric) An enhanced block-based adaptive loop filter in TMuC0.7.0 platform 18232 TE12.6: Results for LCEC_PHASE2 tests by Nokia 18233 K. Ugur, A. Hallapuro, J. Lainema (Nokia) 18234 Taejin Lee, Seungkwon Beack, Minje Kim, Kyeongok Kang Progress report on the TCX windowing CE for USAC 18235 K. Ugur, J. Lainema (Nokia), R. Panchal, M. Karczewicz (Qualcomm) TE12.2: Results for SIFO-6Tap with DIF tests by Nokia and Qualcomm 18236 J. Chen, T. Lee, W.-J. Han (Samsung) TE9: Simulation results for various max. number of transform quadtree depth 18237 I.-K. Kim, T. Lee, W.-J. Han (Samsung) TE11: Report on experiment 3.2.c: Check skip and merge together 18238 T. Lee, J. Chen, W.-J. Han (Samsung) TE12.1: Experimental results of transform unit quadtree/2-level test 18239 E. Alshina, J. Chen, N. Shlyakhov, A. Alshin, W.-J. Han (Samsung) TE12.2: Experimental results of SIFO (DIF) / DCT-IF 18240 E. Alshina, A. Alshin, W.-J. Han (Samsung), R. Joshi, M. Coban (Qualcomm) TE12.4: Experimental results of MDDT and ROT by Samsung and Qualcomm 18241 J. Chen, V. Seregin, W.-J. Han (Samsung) TE12.4: Experimental results of transform coefficient coding 18242 A. Alshin, E. Alshina (Samsung) Bi-directional optical flow 18243 V. Seregin, J. Chen (Samsung) Low-complexity adaptive coefficient scanning 18244 J. Chen, V. Seregin (Samsung) Chroma intra prediction by scaled luma samples using integer operations 18245 Y. Piao, J.-H. Min, J. Chen (Samsung) Encoder improvement of unified intra prediction 18246 I.-K. Kim, T. Lee (Samsung) Simulation results of motion partition, block merging and motion competition schemes 18247 Y.-M. Hong, M.-S. Cheon, I.-K. Kim (Samsung) Low-complexity 16x16 and 32x32 transforms and partial frequency transform 18248 S. Lee, M.-S. Cheon, I.-K. Kim (Samsung) Efficient coefficient coding method for large transform in VLC mode 18249 Y. Sohn, K.-H. Lee, B.-K. Lee, I.-K. Kim (Samsung) Inner block oriented ALF processing for memory & bandwidth reduction 18250 W.-J. Han (Samsung) Cross-verification of JCTVC-C086: Experimental results of ALF on low complexity 18251 M. Mrak, T.Davies, D. Flynn, A. Gabriellini (BBC) Additional results for combined intra prediction 18252 acordova@tue.nl, jean.gelissen@philips.com Robot Integration 18253 jean.gelissen@philips.com Additional Use Cases for MPEG-V 18254 M. Narroschke (Panasonic) TE12.5: Results for adaptive loop filter using prediction and residual 18255 K. Ugur, J. Lainema (Nokia) Adaptive MV resolution with directional filters 18256 V. Drugeon (Panasonic) TE12.3: Results for edge based prediction 38 18257 V. Drugeon (Panasonic) TE06.b: Cross verification of differential coding of intra modes (DCIM) 18258 J. Kim, Y.Jeon, B. Jeon (LG Electronics) Encoding complexity reduction for intra prediction by disabling NxN partition 18259 P. Amon, A. Hutter, E. Wige, A. Kaup (Siemens) In-loop reference frame denoising 18260 A. Krutz, T. Sikora (Tech. Univ. Berlin) TE3: Summary report for inter prediction in HEVC 18261 A. Glantz, A. Krutz, T. Sikora (Tech. Univ. Berlin) TE3: Adaptive global motion temporal prediction for HEVC 18262 M. Esche, A. Krutz, A. Glantz, T. Sikora (Tech. Univ. In-loop filtering using temporal pixel trajectories Berlin) 18263 G. Martin-Cocher (RIM), M. Budagavi (TI) TE8 report 18264 D.-K. Kwon, H. Kim (TI) Frame coding in vertical raster scan order 18265 K. McCann (Samsung/Zetacast) TE12: Summary of evaluation of TMuC tools in TE12 18266 M. Sadafale, M. Budagavi (TI) Low-complexity configurable transform architecture for HEVC 18267 V. Sze, M. Budagavi (TI) Parallelization of HHI_TRANSFORM_CODING 18268 M. Budagavi (TI) A simple cache model for measuring motion compensation bandwidth 18269 I. S. Chong, M. Karczewicz (Qualcomm) TE10 subset 2: Cross check result of MediaTek ALF 18270 I. S. Chong, M. Karczewicz (Qualcomm) TE12: Cross check result of Panasonic's 3input-ALF 18271 Y. Suzuki, T.K. Tan, F. Bossen (NTT DOCOMO) TE12.1: Results for PU based merging 18272 N. Sprljan, S. Paschalakis, P. Wu (Mitsubishi Electric) TE12.7: Verification of JCTVC-C041 (partition based illumination compensation - PBIC) 18273 N. Sprljan, S. Paschalakis, P. Wu (Mitsubishi Electric) TE3 subtest 3: Local intensity compensation (LIC) for inter prediction 18274 Y. Zheng, M. Coban, M. Karczewicz (Qualcomm) Simplified intra smoothing 18275 Dong Tian, Anthony Vetro Comments on Draft Call for Proposals on 3D Video Coding Technology 18276 I. S. Chong, M. Karczewicz (Qualcomm) TE12: Evaluation of adptive in-loop filter 18277 K. Vermeirsch, J. De Cock, R. Van de Walle (Ghent Univ. - IBBT) Report of complexity analysis of geometric partitioning 18278 J. Sole, R. Joshi, M. Karczewicz (Qualcomm) Reduced complexity 32×32 transform by coefficient zero-out 18279 J. Zan, J. Meng, M. T. Islam, D. He (RIM) TE12.8: Results on RDOQ in high efficiency settings 18280 R. Panchal, M. Karczewicz (Qualcomm) TE12: Report on interpolation filter: SIFO/DIF 18281 D. Karwowski (Poznan Univ. Tech.) TE3: Cross-check results of local intensity compensation tool from Mitsubishi Electric 18282 R. Panchal, M. Karczewicz (Qualcomm) TE12: Crosscheck on interpolation filter: SIFO(DIF)/MOMS 18283 Document registration withdrawn 18284 I. S. Chong, M. Karczewicz (Qualcomm) Cross check result of JCTVC-C084 18285 D. Flynn (BBC) TE12.3: Report on planar intra prediction 18286 J. Zan, J. Meng, M. T. Islam, D. He (RIM) TE 8: Cross verification of TI-PCP proposal for significance map 39 18287 P. Onno (Canon) TE12.8: Cross-verification of partition-based illumination compensation (PBIC) 18288 T. Chujoh, K. Kanou, T. Yamakage (Toshiba) TE3 subset 4: Cross-verification on bi/single filter switching in FIF 18289 T. Chujoh, K. Kanou, T. Yamakage (Toshiba) TE12.2: Cross verification on DCT-IF for chroma 18290 J. Zan, G. Korodi, J. Meng, M. T. Islam, D. He (RIM) TE 8: Reports on V2V coding and context modeling by RIM 18291 M. Coban, R. Joshi, M. Karczewicz (Qualcomm) Low complexity adaptive coefficient scanning 18292 P. Chen, R. Panchal, W.-J. Chien, M. Karczewicz (Qualcomm) Overlapped block motion compensation in TMuC 18293 R. Joshi, M. Karczewicz (Qualcomm) TE 12: Report on evaluation of internal bit depth increase and transform precision extension 18294 R. Panchal, M. Karczewicz, P. Chen (Qualcomm) Encoder speedup for bidirectional averaging with rounding control 18295 R. Panchal, M. Karczewicz (Qualcomm) Test result of SIFO for sequencs with illumination change 18296 M. Budagavi, A. Gupte (TI) DCT+Hadamard low complexity large transform for Inter coding 18297 K. Misra, J. Zhao, A. Segall (SHARP) New results for entropy slices for highly parallel coding 18298 Y. Su, A. Segall (SHARP) On motion vector competition 18299 W.-J. Chien, P. Chen, M. Karczewicz (Qualcomm) TE 12: Evaluation of transform skip flag (RQT_ROOT) 18300 Y. Yu, K. Panusopone, L. Wang, S.-T. Hsiang, F. Ishtiaq (Motorola) TE12: Results for experiments on Max CU size 18301 K. Panusopone, X. Fang, L. Wang (Motorola) Flexible picture partitioning 18302 X. Wang (Qualcomm) TE4: Summary of TE4 on variable length coding 18303 X. Wang, M. Karczewicz, W.-J. Chien (Qualcomm) TE4: Report on VLC for coded block flag 18304 M. Karczewicz, W.-J. Chien, X. Wang (Qualcomm) Improvements on VLC 18305 W.-J. Chien, P. Chen, M. Karczewicz (Qualcomm) TE 12: Evaluation of block merging (MRG) 18306 W.-J. Chien, P. Chen, M. Karczewicz (Qualcomm) TE 12: Evaluation of adaptive motion vector resolution (AMVRES) 18307 W.-J. Chien, P. Chen, M. Karczewicz (Qualcomm) TE1: Cross-check result of DMVD proposal from JCTVC-C138 (JVC/Kenwood) 18308 W.-J. Chien, P. Chen, X. Wang, M. Karczewicz (Qualcomm) Test results of transform skip flag and phase 2 VLC integration 18309 Ye-Kui Wang, Zhenyu Wu DASH related clarifications to ISO base media file format 18310 R. Joshi, M. Coban, M. Karczewicz (Qualcomm) TE12: Report on evaluation of MDDT and ROT Ye-Kui Wang, Per Fröjdh, Thomas Stockhammer, 18311 David Furbeck, Imed Bouazizi Base DASH on updated 3GPP adaptive HTTP streaming specification 18312 A. Tabatabai, C. Auyeung, E. Maani, T. Suzuki (Sony) A study on the impact of intra smoothing 18313 X. Peng, J. Xu, F. Wu (Microsoft) TE6.a: Report of line-based coding 18314 X. Peng, J. Xu (Microsoft) TE6: Cross-verification of HiSilicon’s short distance intra prediction by Microsoft 18315 X. Peng, J. Xu (Microsoft) TE6: Cross-verification of MediaTek's 40 overlapped block intra prediction by Microsoft 18316 Z. Xiong, X. Sun, J. Xu (Microsoft) TE10 subset 1: Report of content-adaptive deblocking 18317 X. Xiong, J. Xu (Microsoft) TE10 subset 1: Cross- verification of NEC’s joint deblocking-debanding filter by Microsoft 18318 X. Peng, J. Xu, F. Wu (Microsoft) Improve intra frame coding by PU/TU reordering 18319 C. Lan, J. Xu, F. Wu (Microsoft), G. Shi (Xidian Univ.) Screen content coding results using TMuC 18320 B. Li, J. Xu, F. Wu, G. J. Sullivan (Microsoft), H. Li (Univ. Sci. Tech China) Redundancy reduction in Cbf and merge coding 18321 B. Li, J. Xu, G. J. Sullivan, F. Wu (Microsoft), H. Li (Univ. Sci. Tech China) Redundancy reduction in B-frame coding at temporal level zero 18322 D. He, G. Korodi, E.-h. Yang, G. Martin-Cocher (RIM) Opportunistic parallel V2V decoding 18323 Y.-L. Chang, Y.-W. Huang, S. Lei (MediaTek) TE8: Crosscheck result of the transcoder for JCTVC-B034 source selection for V2V entropy coding in HEVC 18324 M. Coban, Y. Zheng, M. Karczewicz (Qualcomm) TE12: Evaluation of fast adaptive intra smoothing 18325 M. Tok, A. Glantz, A. Krutz, T. Sikora (Tech. Univ. Berlin) TE3 subtest 1: Cross-check of results from LG 18326 A. Paul (Hanyang Univ. / NCKU) Region of block based dynamic video processing for HEVC 18327 A. Segall (Sharp), J. Xu (Microsoft) TE9-2.1 Report on forced RQT split according to PUs (Sharp, Microsoft) 18328 W.-J. Chien, P. Chen, X. Wang, M. Karczewicz (Qualcomm) Modified uni-directional inter prediction in generalized P and B pictures 18329 E. Francois (Technicolor), L. Guillo (INRIA) TE12.2: Technicolor & INRIA report on AMP evaluation 18330 B. Bross (Fraunhofer HHI) Cross-check of encoder speedup for bidirectional averaging with rounding control 18331 T. Yamakage (Toshiba) TE4: Verification results of JCTVC-C262 18332 J. Jung, G. Clare, S. Pateux (Orange Labs) TE11: Report on experiment 3.1.a: Disabling MV competition 18333 J. Jung, G. Clare, S. Pateux (Orange Labs) TE11: Report on experiment 3.2.b: MVCompetition on Inter, Skip and Direct modes 18334 J. Jung, G. Clare, S. Pateux (Orange Labs) TE11: Report on experiment 3.3.b: ‘temporally oriented’ set of predictors for MV-Competition 18335 J. Jung, G. Clare, S. Pateux (Orange Labs) TE11: Verification of experiment 3.2.d by NTT DOCOMO 18336 J. Jung (Orange Labs) TE11: Summary report for TE11 on motion vector coding 18337 A. Wells (Ambarella), C. Fogg (Harmonic) Level limits for Progressive High Profile 18338 G. J. Sullivan (Microsoft) On the AVC frame packing arrangement SEI message 18339 G. J. Sullivan (Microsoft), H. Schwarz (Fraunhofer HHI) 18340 H. Zhu (Zhu) Miscellaneous AVC (ITU-T H.264 | ISO/IEC 14496-10) errata issues Re-compensation based on partial coefficients 41 18341 H. Zhu (Zhu) Decoding improvement on the PA-Coder 18342 Deliang Fu, Lu Yu 3DV EE4 Report on Book_arrival Sequence 18343 Deliang Fu, Lu Yu 3DV EE4 Report on Poznan_Street Sequence 18344 W.-J. Han (Samsung) Cross-verification of JCTVC-C213: additional results for combined intra prediction 18345 Mickael Raulet, Matthieu Wipliez, Jérôme Gorin, Nicolas Siret Addendum: Specification of typing rules for RVC-CAL 18346 Mickaël Raulet, Matthieu Wipliez Automatic classification of the FUs 18347 Ye-Kui Wang DSAH - support of rewind trick mode 18348 Ye-Kui Wang On random access point signaling in file formats 18349 Jaejoon Lee, Seok Lee, HoCheon Wey, Du Sik Park 3DV EE4 Results on Balloons sequence 18350 Jaejoon Lee, Seungsin Lee, Seok Lee, HoCheon Wey, Du Sik Park Comments on Test Scenario for Optional Case of Call for Proposal on 3D Video Coding Technology 18351 Korean NB via SC 29 Secretariat Korean NB comment on 3DTV signaling in MPEG-2 System 18352 jean.gelissen@philips.com, roland@cs.uu.nl, Path Finding 18353 Yasuaki Tokumo, Maki Takahashi, Shuichi Watanabe, Norio Ito, Consideration of Clock Drift Control for DASH 18354 Masayuki Tanimoto, Toshiaki Fujii, Mehrdad Panahpour Tehrani, Menno Wildeboer, 3DV/FTV EE1 and EE4 report on Champagne Tower sequence 18355 Masayuki Tanimoto, Toshiaki Fujii, Mehrdad Panahpour Tehrani, Menno Wildeboer, 3DV/FTV EE4 report on Kendo sequence 18356 Masayuki Tanimoto, Toshiaki Fujii, Mehrdad Panahpour Tehrani, Menno Wildeboer 3DV/FTV EE2 report on VSRS extrapolation Yasuaki Tokumo, Maki Takahashi, Shuichi 18357 Watanabe, Yoshiaki Ogisawa, Takashi Kaneko, Norio Ito, Modification of Enhancement to Representation Description on DASH for signaling of 3D content Menno Wildeboer (Nagoya Univ.), Patrick Lopez 18358 (Technicolor), Dong Tian (Mitsubishi), Yin Zhao (Zhejiang Univ.), Cheon Lee (GIST) Update of the Draft Report on Experimental Framework for 3D Video Coding 18359 DVB via SC 29 Secretariat Liaison Statement from DVB 18360 Takeshi Norimatsu, Tomokazu Ishikawa, Haishan Zhong, Dan Zhao, Kok Seng Chong Panasonic cross check report on complexity reduction for time warping in USAC 18361 Takeshi Norimatsu, Tomokazu Ishikawa, Haishan Zhong, Dan Zhao, Kok Seng Chong Panasonic cross check report on PVC for SBR envelope coding in USAC 18362 Inseon Jang, Jeongil Seo, Laurent Primaux Updated text and reference SW of ISO/IEC 23000-12/FPDAM1 IM AF Pablo Carballeira, Julián Cabrera, Fernando 18363 Jaureguizar, Gianluca Cernigliaro, Juan Casal, Narciso GarcÃa Results on 3DVC EE4 for Newspaper 18364 Tadashi Uchiumi, Makoto Ohtsu, Junsei Sato, Yoshiya Yamamoto, Atsutoshi Shimeno Comments on Requirements and Draft Call for Proposal on 3D Video Coding 18365 SeungJu Han, Jae Joon Han, Won-Chul Bang, James D.K. Kim, Sang-Kyun Kim Data format for sensed information on Electrograph sensor Proposal for new box to support random access for DASH 18366 Gerard Fernando, Yang Yanzi, Wang Fang 18367 Yoshiya Yamamoto, Tadashi Uchiumi, Makoto Ohtsu, Junsei Sato, Atsutoshi Shimeno 18368 A method of disparity vector prediction using depth map 3DV EE4 Resulr 42 18369 Patrick Lopez 3DV EE4 Results on Beergarden 18370 Gerard Fernando, Yang Yanzi, Wang Fang Carriage of MPEG-2 TS support information in Index file for DASH 18371 Kihyun Choo, Miyoung Kim, Eunmi Oh Crosscheck report on adaptive T/F domain post-processing for USAC 18372 Kihyun Choo, Miyoung Kim, Eunmi Oh Crosscheck report on harmonic transposer CEs 18373 Heiko Purnhagen, Kristofer Kjörling Dolby listening test results for CE on T/F postprocessing in USAC 18374 Heiko Purnhagen, Kristofer Kjörling Dolby listening test results for CE on PVC for SBR in USAC 18375 Heiko Purnhagen, Kristofer Kjörling Dolby listening test results for CE on complexity reduction for time warping in USAC 18376 Jean-Claude Dufourd, Jean Lefeuvre, Cyril Concolato, Kyungmo Park, Jaeyeon Song Proposed amendment for MPEG-U widgets 18377 jean.gelissen@philips.com, esko.dijk@philips.com Tactile Information 18378 Kristofer Kjörling, Leif Sehlström Finalization of CE on improved SBR Barbara Resch, Leif Sehlström, Heiko Purnhagen, 18379 Lars Villemoes, Kristofer Kjörling, Bruno Bessette, Philippe Gournay Finalization of CE on an improved bass- post filter operation for the ACELP of USAC 18380 Cyril Concolato, Jean Le Feuvre, Romain Bouqueau, Comments on DASH WD 18381 Cyril Concolato, Jean Le Feuvre, Romain Bouqueau Report on an Open Source implementation of MPEG-DASH 18382 Jean Le Feuvre Conformance Sequences for BIFS ExtendedCore2D 18383 Jean Le Feuvre Reference Software for BIFS ExtendedCore2D 18384 Lars Villemoes, Per Ekstrand, Sascha Disch, Frederik Nagel Finalization of CE on improved harmonic transposer in USAC 18385 Sungryeul Rhyu, Seo-Young Hwang, Kyungmo park, Annotation and Attributes for Media Jaeyeon Song, Bogyeong Kang, Nhut Nguyen Compositions Haishan Zhong, Kok Seng Chong, Takeshi Norimatsu, Tomokazu Ishikawa, Lars Villemoes, Per Finalization of CE on QMF based harmonic 18386 Ekstrand, Kristofer Kjörling, Stephan Wilde, transposer Sascha Disch, Frederik Nagel, Max Neuendorf, 18387 Yonghun Lee, Doug Young Suh, Jaeyeon Song, kyungmo Park, Compelling use-case for M17770(DFA-EE) 18388 Sungryeul Rhyu, Seo-Young Hwang, Nhut Nguyen, Timed metadata Support in DASH Kristofer Kjörling, Haishan Zhong, Kok Seng Chong, Takeshi Norimatsu, Tomokazu Ishikawa, 18389 Lars Villemoes, Per Ekstrand, Stephan Wilde, Sascha Disch, Frederik Nagel, Max Neuendorf, Overview of performance of transposer proposals, and suggested decoding modes 18390 Chulkeun Kim, Doug Young Suh, Jaeyeon Song, Kyungmo Park Kyuheon Kim, Jangwon Lee, Jung-Han Kim, Hee 18391 Jean Kim, Gahyun Ryu, Gilyoon Kim, Do Young Joung, Namguk Kim Benefits of TRA representation File Format for 3D Support of DASH 18392 Hojin Ha, Sungryeul Rhyu, Seo-Young Hwang, Jaeyeon Song Random Access Point (RAP) Indication 18393 Kiho Cho, Nam Soo Kim, Sungyong Yoon, Sangoh Jeong Core Experiment on Enhanced Mode Transitions in USAC 18394 Seo-Young Hwang, Sungryeul Rhyu, Hojin Ha Content Insertion on DASH 18395 fons bruls 3DV EE4 Results on Mobile 43 18396 Toru Chinen, Masayuki Nishiguchi Sony listening test report on pulse indexing in USAC 18397 fons bruls Comments on Requirements and Draft Call for Proposal on 3D Video Coding 18398 Toru Chinen, Masayuki Nishiguchi Sony listening test report on improved SBR in USAC 18399 Toru Chinen, Yuki Yamamoto, Mitsuyuki Hatanaka, Masayuki Nishiguchi Report on PVC CE for SBR in USAC 18400 Thomas Stockhammer Editor's Proposed Updates on DASH Working Draft 18401 Thomas Stockhammer Adding Minimum RAP Frequency to MPD 18402 Reference Software for DASH 18403 Thomas Stockhammer Reference Software for DASH 18404 Junaid Jameel Ahmad, Shujun Li, Marco Mattavelli, Matthieu Wipliez, Mickaël Raulet Crypto Tools Library (CTL): Applying RVC-CAL for Multimedia Security Applications 18405 Helmut Bürklin, Stéphane Gouache, Truong Cong Thang, Jin Young Lee Extensions to Media Presentation Description to support HTTP Streaming from multiple servers 18406 Ferenc Kraemer, Heiko Purnhagen Proposed addition to Intensity Stereo in ISO/IEC 14496-26:2009, Audio conformance 18407 Kim Pelsor, Jamie Marshall, Use Case for Collaborative BiFS Scenes : Distance Learning 18408 Dr Michael McGannon, Jamie Marshall, Use Case for Collaborative BiFS Scenes : Proactive Medicine 18409 Fritz-Joachim Westphal, Mario Kind, Jamie Marshall, Use Case for Collaborative BiFS Scenes : Mihai Mitrea, Consultations without Walls 18410 Jae Joon Han, SeungJu Han, Won-Chul Bang, James D.K. Kim, Sang-kyun Kim, YongSoo Joo 18411 T. Davies, D. Flynn (BBC) Sensed Information for gas and dust sensors TE12.4 Cross-check of unified MDDT/ROT 18412 Mihai Mitrea, Jamie Marshall, Françoise Preteux, Pieter Simoens, Bart Dhoedt, , Use Case for Collaborative BiFS Scenes : A day in a Professor's life 18413 Achim Kuntz, Sascha Disch, Erik Schuijers, Werner Oomen CE proposal on improved applause coding in USAC Jin Young Lee, Kwang-deok Seo, Hyung Jung Kim, 18414 Truong Cong Thang, Jung Won Kang, Seong-Jun Bae, Soon-heung Jung, Sang Taick Park Scalable Random Access for SVC based DASH Jin Young Lee, Euy-doc Jang, Jae-Gon Kim, Truong Virtual Segmentation of TS Packetized Video 18415 Cong Thang, Jung Won Kang, Seong-Jun Bae, using Key-frame Information Soon-heung Jung, Sang Taick Park Jin Young Lee, Euy-doc Jang, Jae-Gon Kim, Truong TS Header Extension for Efficient Adaptation in 18416 Cong Thang, Jung Won Kang, Seong-Jun Bae, the HTTP Streaming Soon-heung Jung, Sang Taick Park Jamie Marshall, Mihai Mitrea, Bojan Joveski, Ludovico Gardenghi, Françoise Preteux, Use Cases for Collaborative BiFS Scenes : Overview Kwang-deok Seo, Sang-woo Shim, Jin Young Lee, 18418 Truong Cong Thang, Jung Won Kang, Seong-Jun Bae, Soon-heung Jung, Sang Taick Park File Format Adaptation Based on HTTP Hint Track for HTTP Streaming Zhao Dan, Zhong Haishan, Chong Kok Seng, 18419 Takeshi Norimatsu, Tomokazu Ishikawa, Neo Sua Hong Status report of time warping CE in USAC 18420 Dmytro Rusanovskyy, Miska M. Hannuksela 3DV EE4 Results on Pantomime sequence 18417 44 3D Video Coding Results of Exploration Experiments (Book Arrival) 18421 Gerhard Tech, Karsten Müller 18422 Bojan Joveski, Ludovico Gardenghi, Jamie Marshall, Image Transmission for Mobile Thin Clients : A Mihai Mitrea, Françoise Preteux Comparative Study UK National Body comments on HEVC Timescales 18423 UKNB of WG11 18424 David Fuentes Sánchez, Francisco Morán Burgos Initial ideas for texturing support in MR3DMC 18425 fons bruls Philips (in coop with 3D4YOU) response to request for HQ stereo content for MFC AHG Andreas Hölzer, Christian Ertel, Markus 18426 Lohwasser, Michael Härtl, Ferenc Kraemer, Frans de Bont Additional information on MPEG Surround conformance testing 18427 fons bruls MFC initial results & findings 18428 Jeongook Song, Eunwoo Song, Henney Oh Yonsei Crosscheck listening test report on additional bandwidth extension CE for USAC 18429 Jeongook Song, Henney Oh, Hong-Goo Kang Status report on USAC Reference Software JAME 18430 Jeongook Song, Henney Oh, Hong-Goo Kang Updated CE on enhanced long term predictor (eLTP) for USAC 18431 Stephan Wilde, Max Neuendorf FhG Listening Test Report – improved SBR 18432 Stephan Wilde, Markus Multrus, Max Neuendorf, Kristofer Kjörling, Heiko Purnhagen Status update of CE proposal on increased structural flexibility in SBR Julien Robilliard, Matthias Neusinger, Johannes 18433 Hilpert, Erik Schuijers, Bert den Brinker, Werner Oomen Proposed decorrelator improvements in USAC 18434 Gregory Pallone, Pierrick Philippe Report on USAC performance 18435 Philippe Gournay, Roch Lefebvre VoiceAge listening test results for the enhanced bass-postfilter CE Jonas EngdegÃ¥rd, Heiko Purnhagen, Oliver Hellmuth, Jürgen Herre, Cornelia Falch, Leon 18436 Terentiv, Maria Luis Valero, Johannes Hilpert, Andreas Hölzer, Werner Oomen Report on corrections for MPEG SAOC Jonas EngdegÃ¥rd, Heiko Purnhagen, Oliver Hellmuth, Jürgen Herre, Cornelia Falch, Leon 18437 Terentiv, Maria Luis Valero, Johannes Hilpert, Andreas Hölzer, Werner Oomen Revised draft of SAOC verification test report 18438 Jean-Claude Dufourd, Jean Lefeuvre, Cyril Concolato, Kyungmo Park, Jaeyeon Song Study of FCD of 23007-3 Truong Cong Thang, Jin Young Lee, Jung Won 18439 Kang, Seong-Jun Bae, Soon-heung Jung, Sang Taick Park Contribution to DASH EE#2 Truong Cong Thang, Jin Young Lee, Jung Won 18440 Kang, Seong-Jun Bae, Soon-heung Jung, Sang Taick Park Use Cases of Multiple Byte Ranges for DASH Truong Cong Thang, Jin Young Lee, Jung Won 18441 Kang, Seong-Jun Bae, Soon-heung Jung, Sang Taick Park, Comments on the Use of RAP in Segments 18442 Thorsten Lohmar, Per Fröjdh Test sequences in DASH format 18443 Per Fröjdh, Torbjörn Einarsson DASH File Format and MPD Considerations 18444 Christopher Müller, Christian Timmerer, Proposal for using MPEG-21 Digital Item Declaration and W3C XInclude/XLink for Media Presentation Description 45 Truong Cong Thang, Jin Young Lee, Jung Won 18445 Kang, Seong-Jun Bae, Soon-heung Jung, Sang Taick Park, Proposal on Signaling for DASH 18446 Thomas Rusert, Per Fröjdh Compression Efficiency Evaluation of FrameCompatible Stereoscopic Video Coding with Resolution Enhancement 18447 Thomas Rusert, Per Fröjdh Comments on Coding Conditions in Call for Proposals on 3D Video Coding Technology 18448 Sanbao Xu, Martin Pettersson, Thomas Rusert, Mats Folkesson Evaluation of Subjective Test Methodologies for 3D Video 18449 Markus Waltl, Christian Timmerer Proposed changes for validation rules of MPEG-V Part 7 18450 Guillaume Fuchs, Markus Multrus, Max Neuendorf Proposed unified global gain syntax element in USAC 18451 Guillaume Fuchs, Max Neuendorf FhG listening test report on CE on improving the USAC bass post-filter 18452 Stefan Bayer Completion of the Core Experiment on Reducing the Complexity of the USAC Time Warping Jaime Delgado, VÃctor RodrÃguez-Doncel, 18453 Francesco Gallo, Elisa Todarello, Annarita Di Carlo, Walter Allasia, Eva RodrÃguez Results of the Core Experiment for Contracts Mapping 18454 Vignesh Subbaraman, Markus Multrus, Kihyun Choo, Informative Encoder Description for USAC Improved Noiseless Coding CE 18455 Frederik Nagel FhG Listening Test Report – PVC for SBR envelope coding 18456 Max Neuendorf Corrections to Reference Software and CD of USAC 18457 Zhijie Zhao, Marco Munderloh, Joern Ostermann MPEG-2 TS RAP indexing for DASH Jorn Janneck, Marco Mattavelli, Shujun Li, Johan Eker, Carl Von Platen, Mickael Raulet, Ghislain 18458 Roquier, Matthieu Wipliez, Ihab Amer, Christophe Lucarz, Junaid Jameel Ahmad Writing dataflow networks components (FUs) with different models of computations using RVC-CAL: a tutorial 18459 Jeff Huang Crosscheck listening test report for USAC on FFT and QMF harmonic transposers Jorn Janneck, Marco Mattavelli, Shujun Li, Mickael Raulet, Matthieu Wipliez, Johan Eker, Carl Von 18460 Platen, Ghislain Roquier, Ihab Amer, Christophe Lucarz, Pascal Faure, Junaid Jameel Ahmad A proposal of RVC-CAL extensions for improved support of I/O processing 18461 Jeff Huang Crosscheck listening test report for USAC on time frequency domain post-processing 18462 Miska M. Hannuksela Contribution to DASH EE#5: MPEG2-TS Reception Hint Track 18463 Miska M. Hannuksela, Imed Bouazizi Contribution to DASH EE#7: bitstreamStructureId 18464 Miska M. Hannuksela, Imed Bouazizi Contribution to DASH EE#9: MPD Indication for File Conversion of Scalable Streams 18465 Miska M. Hannuksela Contribution to DASH EE#9: File Construction Instruction Format 18466 Kimitaka Tsutsumi, Kei Kikuiri, Nobuhiko Naka NTT DOCOMO Cross-check Report on Improved Harmonic Transposer in USAC 18467 David Virette Report on cross-check listening test for the CE 46 on improved Bass-post filter for USAC 18468 David Virette Report on cross-check listening test for the CEs on QMF based harmonic transposer and improved harmonic transposer in USAC 18469 David Virette, Dejun Zhang, Fuwei Ma Finalization of Enhanced Pulse Indexing CE for ACELP in USAC 18470 David Virette, Lijing Xu, Wei Xiao A new signal classifier for USAC reference encoder 18471 David Virette, Wei Xiao Finalization of CE on adaptive T/F domain post-processing for USAC 18472 David Virette, Wei Xiao Progress report on additional bandwidth extension CE for USAC at low bit rates 18473 Kei Kikuiri, Atsushi Yamaguchi, Nobuhiko Naka Educational Information on Encoder Implementation of inter-TES tool in USAC 18474 David Virette CE proposal on improved stereo coding at low bit rates 18475 Ying Chen, Marta Karczewicz Comments on Scalable Resolution Enhancement of Frame-Compatible Stereoscopic 3D Video 18476 Michael Grafl, Christian Timmerer Input on w11413 (WD of ISO/IEC 23006-4 2nd edition Elementary Service Protocols) 18477 Michael Grafl, Christian Timmerer Input on w11414 (WD of ISO/IEC 23006-5 Service Aggregation) 18478 Walt Husak, Ajay Luthra, Gary Sullivan Use cases and requirements for scalable resolution enhancement of frame-compatible stereoscopic 3D video 18479 Markus Multrus, Philippe Gournay, Nikolaus Rettelbach, Bruno Bessette, Bernhard Grill Proposed Core-Experiment on Enhancing the USAC Codec at Mid Bitrates 18480 Max Neuendorf, Markus Multrus, Stefan Doehla, Werner Oomen, Heiko Purnhagen Comments on further USAC investigation 18481 Bruno Bessette, Philippe Gournay, Roch Lefebvre Proposed CE for extending the LPD mode in USAC 18482 David Furbeck DASH MPD Delta Files 18483 Xin Wang, Shaobo Zhang Content Protection and DRM in DASH 18484 Zhang Shaobo, Xin Wang Signaling for Seamless Switching for DASH 18485 Yuriy Reznik thoughts on next steps for CDVS project 18486 Koohyar Minoo, Vivian Kung, David Baylon, Krit Panusopone, Ajay Luthra, Jae Hoon Kim On scalable resolution enhancement of framecompatible stereoscopic 3D video 18487 Seong Yong Lim, Jihun Cha, Injae Lee, Young-kwon Definition and application domains of new Lim, Joong Yun Lee sensors 18488 Seong Yong Lim, Jihun Cha, Injae Lee, Young-kwon Sensor capability and preference of new Lim, Joong Yun Lee sensors 18489 Seong Yong Lim, Jihun Cha, Injae Lee, Young-kwon Binary representation of new sensors Lim, Joong Yun Lee 18490 Seong Yong Lim, Jihun Cha, Injae Lee, Young-kwon Proposed mechanism to support AUI interface Lim 18491 V. Sze, M. Budagavi (TI) High-efficiency entropy coding simplifications 18492 Jaewon Sung, Byeong-Moon Jeon Comments on depth data of 3DV test sequence 18493 David Singer On decode times in movie fragments 18494 Mark Watson, Thomas Stockhammer DASH: Considerations on Overlap during switching 47 18495 Mark Watson DASH: URL selection 18496 Mark Watson, Thomas Stockhammer DASH EE#7 – Segment Index 18497 watsonm@netflix.com DASH: Considerations on Segment Duration 18498 Mark Watson Input for DASH EE#1 (CMP): Pixel Aspect Ratio 18499 Shinobu Hattori, Teruhiko Suzuki, Comments on Requirements and Call for Proposal on 3D Video Coding Technology 18500 Teruhiko Suzuki, Ikuo Tsukagoshi, Takahiro Nishi Comments on scalable enhancement of frame compatible stereo 3D Zhong Haishan, Chong Kok Seng, Zhao Dan, 18501 Takeshi Norimatsu, Tomokazu Ishikawa, Neo Sua Hong Panasonic crosscheck report on improved harmonic transposer 18502 Rob Glidden, Cliff Reader, Gerard Fernando Option-1 (royalty-free) video codec industry considerations 18503 Next Generation Broadcasting Forum (Korea) Proposed Text of ISO/IEC 23000-11/FDAM1 Stereoscopic Video AF Conformance and Reference Software 18504 Doug Young Suh, Kyungmo Park, Sungryeul Rhyu DASH: Field data for quantitative evaluation 18505 Wonsuk Lee, Seungyun Lee Java implementation for generic metadata APIs on MXM 18506 Olgierd Stankiewicz, Krzystof Wegner Frame range extension of Poznan Street and Poznan Carpark sequences (3DV/EE1) 18507 Sanghyun Joo (joos@etri.re.kr) Additional type for Virtual Object 18508 Sanghyun Joo (joos@etri.re.kr) Descriptions for social network services 18509 Sanghyun Joo (joos@etri.re.kr) Minor changes in MPEG-V Part 4 18510 Jae Joon Han, SeungJu Han, Won-Chul Bang, James D.K. Kim, Sang-Kyun Kim, YongSoo Joo sensor capability for gas and dust sensors 18511 Cheon Lee, Yo-Sung Ho 3DV EE1 Results on Newspaper 18512 Cheon Lee, Yo-Sung Ho EE1 Results on Café 18513 Min-Koo Kang, Cheon Lee, Yo-Sung Ho 3DV EE4 Results on Newspaper 18514 Cheon Lee, Yo-Sung Ho Common-hole Filling for Boundary Noise Removal in VSRS 18515 Min-Koo Kang, Cheon Lee, Yo-Sung Ho 3DV EE4 Results on Cafe 18516 P. Chen, L. Guo, M. Karczewicz (Qualcomm) Geometry motion partition 18517 T. Nguyen (Fraunhofer HHI) Improved intra smoothing for UDI and new AIS fast mode 18518 Qiuwen Zhang, Ping An, Yan Zhang, Ran Ma, Qian Zhang, Zhaoyang Zhang A Depth Compression for MVD View Synthesis 18519 Qiuwen Zhang, Ping An, Qian Zhang, Yan Zhang, Zhaoyang Zhang Enhancement of Depth Estimation 18520 Feng Shao, Gangyi Jiang, Mei Yu, Xu Wang Stereoscopic video coding with asymmetric luminance and chrominance qualities 18521 Kyong-Sok Seo, Byeong-Doo Choi, Dae-Sung Cho Comments on Scalable resolution enhancement of frame-compatible stereoscopic 3D video 18522 E. Francois, P. Bordes (Technicolor) Geometry adaptive block partitioning (GEO) cross-check 18523 Gaëlle Martin-Cocher Requirements for LCVC 18524 Zhijie Zhao, Marco Munderloh, Joern Ostermann Contribution to EE#2 MCE: Extension and Modification 48 18525 Depth map enhancement for view synthesis 18526 sergio matone Comments on WorkingDraft 23006-4 18527 sergio matone Comments on WorkingDraft of 23006-5 18528 sergio matone Contribution for 23006-1 18529 Mattia Donna Bianco Proposal of an hierarchical representation of a LASeR scene 18530 sergio matone Comments on Negotiate ES 18531 Vijay Chandrasekhar, Bernd Girod The Stanford Mobile Visual Search Data set Tomasz Zernicki, Maciej Bartkowiak, Marek Domanski Telcordia and PUT listening test results for CE on improved tonal component coding in eSBR (USAC) 18535 Qian Zhang, Ping An, Qiuwen Zhang, Ran Ma, Zhaoyang Zhang Depth map enhancement for view synthesis 18536 Jae Joon Han, SeungJu Han, Won-Chul Bang, James D.K. Kim, Sang-Kyun Kim, Yong Soo Joo Update of Reference SW for MPEG-V Part 4 18537 H.-Joong Kim, X. Qu (Korea Univ.), W.-J. Han (Samsung) Showing the possibility of fast CABAC 18538 Miran Choi, MyungGil Jang, Chang-ki Lee, HyunKi Kim, AIT Elementary Service - Search License 18539 Miran Choi, Hyojeong Oh, Jeong Hur, Soojong Lim, Yeochan Yun AIT Elementary Service - Search Contract 18532 18533 18534 18540 Giovanni Cordara Summary of discussions on CDVS databases and software 18541 Tao Chen, Yoshiichiro Kashiwagi, Takahiro Nishi Subjective Picture Quality Evaluation of MVC Stereo High Profile for Full-Resolution Stereoscopic HD Video Applications 18542 Rongrong Ji, Lingyu Duan, Tiejun Huang, Hongxun Yao, Wen Gao A Case Study for Compact Descriptors for Visual Search - Location Discriminative Mobile Landmark Search 18543 Qifei Wang, Kun Xu, Qian Ma, Xiangyang Ji, Qionghai Dai 3DV EE1 Results on “Poznan_Carpark― Test Sequence TE7: Results for Simplification of MDDT Transform 18544 W. Ding, Y. Shi, B. Yin (Beijing Univ. Tech) 18545 Cyril Concolato, Romain Bouqueau, Jean Le Feuvre Seamless switching for TS in DASH 18546 H. Schwarz, D. Marpe, T. Wiegand (Fraunhofer HHI) 18547 Mariam Saleh, Mickaël Raulet, Ihab Amer, Marco Mattavelli Investigations for representing rectangular blocks using the merging concept RVC-CAL non BTYPE Version 18548 J. Zan, J. Meng, M. T. Islam, D. He (RIM) Cross Verification of Low Complexity Adaptive Coefficient Scanning 18549 Mark Watson DASH – Quality of Experience and performance metrics 18550 TTA (Korea) via SC 29 Secretariat Liaison Statement from TTA (Korea) 18551 Mark Watson, Thomas Stockhammer, Kilroy Hughes, DASH: MPD Representation Groups and MPD Fragmentatio Experimental Results for Inter-View Direct Mode with Decompressed Disparity Maps for Multiview Video Coding 18552 Jacek Konieczny, Marek DomaÅ„ski 49 18553 danny.cohen@oracle.com Danny Cohen Evaluation of the OMS Video Encoding 18554 Gary J. Sullivan, Jens-Rainer Ohm, Meeting report of the 2nd meeting of the Joint Collaborative Team on Video Coding (JCTVC), Geneva, CH, 21-28 July, 2010 18555 Seung Wook Lee, ChangHwa Lyou, SangKwon Jeong Notification of the change of Forum Name of Category C liaison partner 18556 K. Sato (Sony) Cross verification of MV Coding Proposal by JCTVC-C293 18557 T. Suzuki (Sony) Cross verification of DCT-IF for chroma 18558 David Singer, Alex Giladi Usage of MPEG-2 Transport Streams 18559 Alex Giladi Segment Index Syntax for MPEG-2 TS 18560 alex.giladi@gmail.com Comments on MPEG-2 Transport Streams in DASH 18561 Jaime Delgado, Victor Rodriguez-Doncel Proposal of initial CEL Working Draft 18562 hj08.lee@lge.com, w.zia@lglab.eu LGE proposal for PIP attributes for CMP 18563 Sang-il Na, Weon-Geun Oh, Kyung-Ho Choi, DongSeok Jeong Database contribution for CDVS 18564 Weon-Geun Oh, Sang-il Na, Jun-Woo Lee, DongSeok Jeong Service scenario of CDVS 18565 Zhijie Zhao, Yanzi Yang Support of virtual/non- segmentation in DASH 18566 H J Lee, w.zia@lglab.eu DASH DTS M2TS Compression gzip comparison 18567 H J Lee, w.zia@lglab.eu A few comments on LGE proposal about delivery of MPEG-2 TS 18568 R. Cohen (Merl), C. Yeo (I2R), R. Joshi (Qualcomm) TE7: Summary report for MDDT Simplification 18569 B. Bross, H. Kirchhoffer, H. Schwarz, T. Wiegand (Fraunhofer HHI) Fast intra encoding for fixed maximum depth of transform quadtree Jianwen Chen, Yaocheng Rong, Feng Xu, Yan Xu, 18570 Yun He, Xingguo Zhu, Lu Yu, Zhihang Wang, Li Zhang, Cliff Reader, Siwei Ma, Wen Gao Response to Call for Evidence on Option-1 Video Coding Technology 18571 W.-J. Han (Samsung) Cross-verification of JCTVC-C311: Fast intra encoding for fixed maximum depth of transform quadtree 18572 Y. Zheng, R. Joshi, M. Coban, M. Karczewicz (Qualcomm) TE8: Crosscheck Result of the Transcoder of JCTVC-B034 for V2V Entropy Coding in HEVC 18573 J. Kim, J. Park, B. Jeon (LG Electronics) Verification results of Samsung's proposal JCTVC-C206 18574 Yiping Chen Response to the AIT 2nd CfP: Authenticate User and Identify User 18575 Dr. Ken Lunde (Adobe Systems Inc.), Tony Tseung (Apple Inc.), Julio Gonzalez (Apple Inc.) Response to the Call for Proposals on Composite Font Standard 18576 Walt Husak 3D Test Shoot 18577 Alexandros Tourapis Comments on m17925 18578 Khaled Mamou, Minsu Ahn, Xiangyang Ji Ad Hoc Group on Multi-Resolution 3D Mesh Coding 18579 Tiejun Huang for CNNB Comments on HEVC Test Model 18580 Tiejun Huang for CNNB Comments on Option-1 Video coding for Internet Application 18581 D. Hoang (Zenverge) On the computation of PSNR and BD-Rate 18582 Shehrzad Qureshi, Steven Saunders, John Ralston Dynamic computational complexity metric for 50 video codec evaluation 18583 T. Yamakage (Toshiba) Verification Results of Simplified Intra Smoothing (JCTVC-C234) 18584 K. Ugur (BoG coordinator) Breakout Report on Interpolation Filters 18585 Jean H.A. Gelissen, Herman Tuininga Additional input for Sensor / Actuator Integration use case 18586 R. Sjöberg (Ericsson) TMuC text on max CU size seems broken 18587 18588 Sang-il Na, Weon-Geun Oh, Ju-Kyong Jin, DongSeok Jeong The process of Test Model development for the HEVC initiative 18589 G. Sullivan, J.-R. Ohm (chairs) 18590 Fast Matching for the MPEG-7 Video Signature Tools T. Wiegand, H. Schwarz, B. Bross, A. Fuldseth, X. Wang, W.-J. Han (BoG coordinators) BoG report: residual quadtree structure 18591 Sangyoun Lee, Seongwan Kim, Jaeho Lee Cross verification for ETRI proposal 18592 Helmut Bürklin, Mark Watson Update on Media Presentation Description to support HTTP Streaming from multiple servers 18593 JTC 1/SC 6 via SC 29 Secretariat Liaison Statement from JTC 1/SC 6 18594 K. Sühring, D. Flynn, F. Bossen (on behalf of JCTHEVC software guidelines VC) 18595 Miska Hannuksela, Imed Bouazizi Clock Drift Compensation 18596 Per Fröjdh, Thomas Stockhammer Support for trick modes in DASH 18597 K. McCann (TE coordinator) TE12: Summary of results of evaluation of TMuC tools in TE12 18598 Karsten Grüneberg, Ying Chen, Yasuaki Tokumo, 3D Video Support in DASH Thomas Schierl 18599 Mark Watson Evaluation Experiment on HTTP Streaming and Quality of Experience metrics 18600 J. Jung, B. Bross (CE coordinators) Description of Core Experiment 9: Motion Vector Coding 18601 R. Cohen, C. Yeo, R. Joshi (CE coordinators) Description of Core Experiment 7: Alternative Transforms 18602 E. Francois, P. Chen, X. Zheng (CE coordinators) Description of Core Experiment 2: Flexible Motion Partitioning 18603 E. Alshina, Y.-L. Lee, P. Chen, H. Lakshman (CE coordinators) Description of Core Experiment 4: Interpolation for MC (Chroma) 18604 Y.-J. Chiu, H. Yu, Y.-W. Huang, S. Sekiguchi (CE coordinators) Description of Core Experiment 1: DecoderSide Motion Vector Derivation 18605 W.-J. Chien (CE coordinator) Description of Core Experiment 12: Adaptive Motion Vector Resolution 18606 K. Sugimoto (CE coordinator) Description of Core Experiment 10: Number of Intra Prediction Directions 18607 F. Bossen (on behalf of JCT-VC) Common test conditions and software reference configurations 18608 A Tabatabai (CE coordinator) Description of Core Experiment 6: Intra Prediction Improvement 18609 X. Wang (CE coordinator) Description of Core Experiment 5: Low Complexity Entropy Coding Improvements 18610 V. Sze, K. Panusopone, J. Chen, T. Nguyen, M. Coban (CE coordinators) 51 Description of Core Experiment 11: Coefficient Scanning and Coding 18611 M.Coban, B. Bross, J. Chen (CE coordinators) Description of Core Experiment 13: Intra Smoothing T. Yamakage, T. Chujoh, Y. W. Huang, K. Chono, Description of Core Experiment 8: In-loop 18612 I.S. Chong, T. Yamamoto, J. Lim, B. Jeon, J. Xu, M. filtering Narroschke (CE coordinators) Description of Core Experiment 3: Interpolation Filtering for MC (Luma) 18613 T. Chujoh (CE coordinator) 18614 Marco Mattavelli, Mickael Raulet, Shujun Li, Jorn Janneck Proposed requirements for RVC and RGC specification languages and solution 18615 Iain-James MARSHALL, Mihai MITREA, Bojan JOVESKI Technologies under Consideration for Collaborative Applications by using MPEG scene description standards 18616 T. K. Tan, G. J. Sullivan, J.-R. Ohm (on behalf of JCT-VC) Summary of HEVC working draft 1 and HEVC test model (HM) 18617 G. J. Sullivan, J.-R. Ohm (chairs) Meeting report of the third meeting of the Joint Collaborative Team on Video Coding (JCTVC), Guangzhou, CN, 7–28 October, 2010 18618 K. McCann, B. Bross, S. Sekiguchi (editors) High Efficiency Video Coding (HEVC) Test Model 1 (HM 1) Encoder Description 18619 T. Wiegand, W.-J. Han, J.-R. Ohm, G. J. Sullivan (editors) Thomas Stockhammer, Per Fröjdh, Iraj Sodagar, 18620 Sungryeul Rhyu 52 High Efficiency Video Coding (HEVC) text specification Working Draft 1 Draft CD as submitted to Secretary of ISO/IEC JTC 1/SC 29: Text of ISO/IEC 23001-6: Dynamic adaptive streaming over HTTP (DASH) Annex D – Output documents No. Title 11552 List of Documents from the 94th Meeting in Guangzhou, China 11553 Resolutions of the 94th Meeting in Guangzhou, China 11554 List of AHGs Established at the 94th Meeting in Guangzhou, China 11555 Report of the 94th Meeting in Guangzhou, China 11556 Guidelines for Electronic Distribution of MPEG M and N Documents 11557 Press Release of the 94th Meeting in Guangzhou, China 11558 Meeting Notice of the 95th Meeting in Daegu, Korea 11559 Guide for WG 11 Meeting Hosts 11560 Study of ISO/IEC 13818-1:2007/FPDAM 6 Extension to AVC video descriptor and signaling of operation points for MVC 11561 List of errata items of ISO/IEC 13818-1:2007 11562 WD of ISO/IEC 13818-1:2007/AMD 7 Signaling of frame compatible and service compatible stereoscopic video 11563 Request for ISO/IEC 14496-4:200x/AMD XX Conformance for BIFS ExtendedCore2D Profile 11564 Text of ISO/IEC 14496-4:200x/PDAM XX Conformance for BIFS ExtendedCore2D Profile 11565 DoC on ISO/IEC 14496-5:2001/FPDAM 28 Reference software for LASeR adaptation tools 11566 Text of ISO/IEC 14496-5:2001/FDAM 28 Reference software for LASeR adaptation tools 11567 Request for ISO/IEC 14496-5:200x/AMD XX Reference software for BIFS ExtendedCore2D Profile 11568 Text of ISO/IEC 14496-5:200x/PDAM XX Reference software for BIFS ExtendedCore2D Profile 11569 Technologies under Consideration for use of BIFS on Collaborative Application 11570 Request for ISO/IEC 14496-12:2008 AMD 3 DASH Support and RTP reception track processing 11571 Evaluation Report 11572 WD of ISO/IEC 14496-28 Composite Font 11573 WD of ISO/IEC 21000-20 Contract Expression Language 11574 Text of ISO/IEC 23000-11/FDAM 1 Stereoscopic Video AF Ref. Soft and Conf. 11575 Study of ISO/IEC 23000-12 FPDAM 1 Conformance and Reference Software 11576 Workplan of ISO/IEC 23000-12 Conformance and Reference Software 11577 Request for the subdivision of ISO/IEC 23001-6 11578 Text of ISO/IEC CD 23001-6 Dynamic Adaptive Streaming over HTTP 11579 Description of Evaluation Experiments on Dynamic Adaptive Streaming over HTTP 11580 Workplan of Reference Software for DASH 11581 Text of ISO/IEC CD 23001-1 2nd edition Multimedia Service Platform Technologies - Architecture 11582 Text of ISO/IEC CD 23006-2 2nd edition MPEG Extensible Middleware API 11583 Text of ISO/IEC CD 23006-4 2nd edition Elementary Services 11584 Text of ISO/IEC CD 23006-5 Service Aggregation 11585 Text of ISO/IEC 23007-1 PDAM 1 Widget Extensions 11586 Text of ISO/IEC 14496-12:2008 PDAM 3 DASH Support and RTP reception track processing 11587 Study of ISO/IEC FCD 23007-3 Conformance and Reference Software 11588 WD of ISO/IEC 23005-1 AMD 1 Additional Use Cases 11589 WD of ISO/IEC 23005-2 AMD 1 Additional Control Information and binary representation 53 11590 WD of ISO/IEC FDIS 23005-3 AMD 1 Binary representation of sensory information 11591 WD of ISO/IEC FDIS 23005-4 AMD 1 Binary representation of Virtual World Object Characteristics 11592 WD of ISO/IEC FDIS 23005-5 AMD 1 Additional sensors, interaction devices and binary representation 11593 WD of ISO/IEC FDIS 23005-6 AMD 1 Binary representation of common types and tools 11594 Study of ISO/IEC FCD 23005-7 Conformance and Reference Software 11595 Storyboard for MPEG-U promotion video 11596 Short descriptions 11597 Guide to the MPEG Subversion Repository 11598 Request for ISO/IEC 23007-1 AMD 1 Widget Extensions 11599 Description of CE on dynamic volume change 11600 Terms of Reference 11601 MPEG Standards 11602 Unpublished standards at FDIS level 11603 MPEG work plan and time line 11604 MPEG Standard Editors 11605 Schema assets 11606 Software assets 11607 Conformance assets 11608 Content assets 11609 URI assets 11610 Standards under development for which a call for patent statements is issued 11611 List of Organisations with which MPEG entertains liaisons 11612 Request for ISO/IEC 13818-2:2000/Amd.4 11613 Text of ISO/IEC 13818-2:2000/PDAM4 Frame Packing Arrangement Signalling in MPEG-2 Video 11614 Text of 14496-5:2001/Amd.5 DCOR1 11615 Defect report on ISO/IEC 14496-10:201x 11616 Text of ISO/IEC 14496-10:201x/PDAM1 Progressive High Profile 11617 DoC on ISO/IEC 15938-6:2001/PDAM4 11618 Text of ISO/IEC 15938-6:2001/FPDAM4 Reference Software for Video Signature Tools 11619 DoC on ISO/IEC 15938-7:2001/PDAM6 11620 Text of ISO/IEC 15938-7:2001/FPDAM6 Conformance for Video Signature Tools 11621 Study Text of ISO/IEC 15938-8/PDAM6 Extraction and Matching of Video Signature Tools? 11622 Text of ISO/IEC 23001-4/DCOR1 11623 Working Draft of ISO/IEC 23001-4/Amd.1 11624 Disposition of Comments on ISO/IEC 23002-4/FPDAM1 11625 Text of ISO/IEC 23002-4/FDAM1 Software and Conformance Testing for Video Tool Library 11626 AHG on MPEG-7 Visual 11627 WD of ISO/IEC 23002-4/Amd.3 Functional Units for SVC Profiles 11628 Description of Core Experiments in RVC 11629 Test Model 1 of High Efficiency Video Coding 11630 Description of Exploration Experiments in 3D Video Coding 11631 Report on Experimental Framework for 3D Video Coding 11632 Liaison letter template on DASH 54 11633 Liaison letter template regarding Multimedia Service Platform Technologies (Formerly Advanced IPTV Terminal) 11634 Liaison statement template on signaling of stereoscopic video in MPEG-2 Systems 11635 Liaison statement to SC 6 on MMT 11636 Liaison statement to 3GPP TSG-SA4 on DASH 11637 Liaison statement to JTC 1/WG 7 on Sensor Networks 11638 Liaison statement to IEC TC 100 on the definition of “trick mode― 11639 Liaison statement to JTC 1/SC 24 on “zip― format 11640 Liaison statement to IEC TC 100 on IEC CDV 62516-2, using BIFS in T-DMB receiver 11641 Liaison statement to IDPF ePub WG on Richmedia technologies 11642 Process of Test Model Development for HEVC 11643 Working Draft 1 of High Efficiency Video Coding 11644 AHG on Reconfigurable Video Coding 11645 Status of HEVC reference software copyright management in JCT-VC 11646 AHG on Scene Representation 11647 AHG on MPEG File Formats 11648 AHG on Application Format 11649 AHG on Font Format Representation 11650 AHG on Advanced IPTV Terminal 11651 AHG on MPEG-V 11652 AHG on Contract Expression Language 11653 AHG on Dynamic Adaptive Streaming over HTTP 11654 ISO/IEC 13818-4:2004 DCOR 2, AAC Intensity Stereo 11655 ISO/IEC14496-26:2010/Cor 2 BSAC Corrections 11656 ISO/IEC 14496-26:2010/DCOR 3, Intensity Stereo 11657 Report on Spatial Audio Object Coding Verification Tests 11658 Defect Report of ISO/IEC 23003-2:2010, Spatial Audio Object Coding 11659 Study on ISO/IEC 23003-3:201x/CD of Unified Speech and Audio Coding 11660 Status and Workplan for USAC CEs 11661 Audio for High Efficiency Video Coding 11662 AHG on SAOC, USAC, and Audio and Systems Interactions 11663 Statement of benefits from establishing a Category C liaison with DTG 11664 Description of CE on Contract Expression Language 11665 Description of 3DG Core Experiments 11666 Working Draft 3.0 on Multiresolution Mesh Compression 11667 Working Draft 3.0 on Reconfigurable Graphics Coding 11668 Reconfigurable graphics coding tutorial v2.0 11669 Text of 2nd Edition of ISO/IEC 14496-25 FDIS 3DG compression model 11670 AHG on 3DG documents, software maintenance and core experiments 11671 AHG on Multi-Resolution 3D mesh Coding 11672 Standardisation projects and facilities fees 11673 Low-Complexity Video Coding – Draft Problem Statement 11674 Compact Descriptors for Visual Search: Draft Call for Proposals 11675 Compact Descriptors for Visual Search: Evaluation Framework (draft) 55 11676 Call for Evidence on Option-1 Video Coding Technology 11677 ISO/IEC 15938-9:2005/PDAM 1 11678 Applications and Requirements on 3D Video Coding 11679 Draft Call for Proposals on 3D Video Coding Technology 11680 Liaison letter to ITU-T SG 16 on Telepresence Systems 11681 Requirements for MPEG Frame-Compatible Enhancement (MFC) 11682 Test Conditions and Evaluation Process for MFC 11683 Liaison statement template on MPEG frame-compatible activities 11684 AHG on scalable resolution enhancement of frame-compatible stereoscopic 3D video 11685 AHG on Evaluation of Responses to the Call for Evidence 11686 AHG on Lightweight Video Coding 11687 AHG on 3D Video Coding 11688 AHG on Compact Descriptors for Visual Search 11689 AHG on MPEG Media Transport (MMT) 11690 AHG on MPEG-7 Profiles 11691 MPEG-V Awareness Day: Call for Participation 11692 Request fof subdivision of ISO ISO/IEC 23006-5 Service Aggregation 11693 VOID 11694 Liaison Statement to 3GPP 11695 Liaison Statement to Wireless Gigabit Alliance 11696 Liaison Statement to ISO/IEC JTC 1/SC 37 11697 Statement of benefits from establishing a Category C liaison with Wireless Gigabit Alliance 56 Annex E – Requirements report Source: Jörn Ostermann (Leibniz Universität Hannover) 1 Requirements documents approved at this meeting No. 11630 11631 11661 11673 11674 11675 11676 11677 11678 11679 11680 11681 11682 11683 11684 11685 11686 11687 Title Description of Exploration Experiments in 3D Video Coding Report on Experimental Framework for 3D Video Coding Audio for High Efficiency Video Coding Low-Complexity Video Coding – Draft Problem Statement Compact Descriptors for Visual Search: Draft Call for Proposals Compact Descriptors for Visual Search: Evaluation Framework (draft) Call for Evidence on Option-1 Video Coding Technology ISO/IEC 15938-9:2005/PDAM 1 Applications and Requirements on 3D Video Coding Draft Call for Proposals on 3D Video Coding Technology Liaison letter to ITU-T SG 16 on Telepresence Systems Requirements for MPEG Frame-Compatible Enhancement (MFC) Test Conditions and Evaluation Process for MFC Liaison statement template on MPEG frame-compatible activities AHG on scalable resolution enhancement of frame-compatible stereoscopic 3D video AHG on Evaluation of Responses to the Call for Evidence AHG on Lightweight Video Coding AHG on 3D Video Coding 11688 AHG on Compact Descriptors for Visual Search 11689 11690 AHG on MPEG Media Transport (MMT) AHG on MPEG-7 Profile 2 Systems 2.1 MPEG Font Formats rd At the 93 meeting, MPEG issued the Call for Proposals N11538 Composite Font Standard: Call for Proposals and the associated document N11537 Requirements and Use Cases for the Composite Font Standard. The Requirements group thanks Adobe and Apple for their responses to the call. MPEG received sufficient technology in order to start the standardization process. The work is carried out in the Systems Subgroup. 57 3 Video 3.1 MPEG-7 AVDP Several broadcasters requested a new profile for MPEG-7 covering several aspects of audio, video and systems. The main functionality of this audio visual description profile (AVDP) is describing the results of automatic media analysis with audio/video low-level features for audio-visual content. They are to be used for the archives of the broadcast industry. Changes to the WD were approved resulting in the current PDAM N11677 ISO/IEC 15938-9:2005/PDAM 1. The current work plan foresees the amendment to be finalized in 11/2011. In order to assure that the profile covers the relevant applications foreseen in the broadcast industry, EBU will organize an evaluation of the tools (Figure 1) listed as part of the new AVDP. Work is progressed within the adhoc group N11690 AHG on MPEG-7 Profile. Figure 1 - Tools of AVDP. 3.2 Miscellaneous nd At the 92 MPEG meeting, a proposal for a new profile for surveillance videos (m17461) was entertained. At this meeting, no support for the proposal was presented. 4 Explorations 4.1 Compact Descriptors for Visual Search (CDVS) 58 Images are foreseen as input for visual search. The temporal extension of images also known as video will enable life interaction. ETRI and Stanford University contributed databases for use in this potential standardization activity. While this increases the amount of test data, more contributions of databases is required. Additional input on destructors, high quality reference and low-quality test data is required. It appears that MPEG needs a database with text images. More video would also be beneficial. Furthermore, the annotations of the databases need to be unified. A CfP will only be meaningful if the before mentioned issues get resolved. The adhoc group N11688 AHG on Compact Descriptors for Visual Search will work on these issues. MPEG met with JPEG on CDVS. The activities JPSEC and JPSearch may benefit from the CDVS activity. Future meetings are envisioned when MPEG and JPEG have collocated meetings. In order to prepare the CfP and evaluation, MPEG issued the documents N11675 Compact Descriptors for Visual Search: Evaluation Framework (draft) and N11674 Compact Descriptors for Visual Search: Draft Call for Proposals. Given the open issues, the earliest time for issuing a call will be the 95th meeting with an evaluation planned prior to the 97th meeting. 4.2 MPEG Frame-Compatible Stereo Enhancement for AVC (MFC) The current popularity of stereoscopic video makes distribution using a frame-compatible stereoscopic 3D video service over AVC desirable. However, these solutions provide only half of the full spatial resolution for each channel. Transmission of an enhancement layer to achieve full resolution 3D video might be a solution to this issue. While there was a significant push and effort during the meeting to issue a CfP at the next meeting, even more significant issues are still to be solved: At this point, there are no test sequences available. As anchor, SVC bit streams need to be created. Furthermore, there is currently no evidence that there exists a solution superior to SVC. From a formal point of view, MPEG progressed this matter by issuing N11681 Requirements for MPEG Frame-Compatible Enhancement (MFC) and N11682 Test Conditions and Evaluation Process for MFC. MPEG hopes that members provide test sequences at the 95th meeting such that evidence may be evaluated at the 96th meeting. The adhoc group N11684 AHG on scalable resolution enhancement of frame-compatible stereoscopic 3D video works on this activity. Several members of the group assume that a similar solution based on MPEG-2 will not be required by the market. A liaison statement N11683 Liaison statement template on MPEG frame-compatible activities will be sent to several interested organisations. 4.3 Audio As in several previous meetings, MPEG had a brainstorming session on future audio activities. At this meeting, the potential work areas were driven by potential HEVC application scenarios. The main aspects are large screen, small viewing distances and hence near field audio perception. They include home cinema requiring to render a multi-channel audio program onto a different number of speakers or to define wavefield synthesis. Other applications include real-time communications, camcorders, storage (random access issues), 3D video and telepresence as well as personal 3D video. Requirements might focus on resolution of sound source localization and enhanced spatial audio for stereo displays. Results of the current discussion are summarized in N11661 Audio for High Efficiency Video Coding. Members are encouraged to further elaborate their ideas. 59 The audio group is encouraged to enlist more universities in their activities. 4.4 Leightweight Video Coding th At the 90 meeting, MPEG reviewed requirements on Low Complexity Video Coding as described in N17005 What can we do about Interlace?. Following input from the US National Body at the 92nd meeting, the Requirements Group issued a resolution encouraging its members to provide further input on this topic. In response to this resolution, one document was submitted describing the low complexity video coding algorithm VC2 which is already standardized. Despite this rather limited response and due to the potential relevance of the topic especially for mobile and UHDTV devices, MPEG updated the output document N11673 Low-Complexity Video Coding – Draft Problem Statement. Spatial random access is a functionality currently not provided by standardized solutions. The adhoc group N11686 AHG on Lightweight Video Coding will progress this matter. In order to decide whether there is room for another standard, MPEG encourages its members to provide high-quality, high-bit depth and high-rate 4:4:4 video signals. JPEG might issue a call on this leightweight video coding. 4.5 MMT The area of work for MMT (Figure 2) might include adaptive progressive transport (download/stream), cross layer optimization, hybrid delivery and conversational services. It is foreseen that MMT will also provide adaptive streaming support for mp4 files. Naturally, MMT will provide some kind of interworking with DASH. Since the CfP was issued at the 93rd meeting and responses are due prior to the 95th meeting, no activity took place at this meeting. However, N11689 AHG on MPEG Media Transport (MMT) was established in order to evaluate the responses. The Systems subgroup and its members working on MMT are encouraged to define test beds and software environments such that the development of the MMT standard can be guided by the outcome of core experiments. Figure 2 - Networks of interest in the context of MPEG Media Transport. 60 4.6 3D Video Coding Dolby Labs and Sun-Yat Sen University provided new test material. The selection of test sequences for the envisioned CfP is now finalized. The next steps related to test material include definition of rates for anchors based on MVC. The test procedure including instructions of subjects, evaluation of subjects and selection of displays needs to be defined. The current plan foresees the use of consumer 3D displays. A CfP will be issued at the 95th or 96th meeting. Evaluation is planned to be carried out prior to the 98th meeting. The call will be open to all technologies including AVC, MVC, HEVC and others. MPEG will decide on the use of the proposed techniques, i.e. technology proposed based on AVC might be used in the standard in order to adapt HEVC to 3DV. The following documents were updated: N11678 Applications and Requirements on 3D Video Coding, N11679 Draft Call for Proposals on 3D Video Coding Technology, N11630 Description of Exploration Experiments in 3D Video Coding. The adhoc group working on 3DV is N11687 AHG on 3D Video Coding. MPEG made publicly available N11631 Report on Experimental Framework for 3D Video Coding. The purpose of the document is to provide a high-level overview of the experimental framework that is developed by MPEG to support the envisioned 3D video applications (Figure 3). This document also aims to indicate assumptions on the input data as well as supplementary data derived from the input data. Left Output to Auto-Stereo N-view Display 3D Video Codec Center Right Depth Estimation Multiview Rendering virtual views Figure 3 – Auto-stereoscopic output with 3-view configuration as an example for 3DV. 4.7 Option-1 Licensable Codecs For several meetings, MPEG had discussions on Option-1 licensable codecs. Especially small companies consider the per-stream licensing fees as cumbersome. At its 91st meeting, MPEG issued document N11221 Possible future actions on standardization with Type 1 licensing where the legal issues are summarized and discussed. Type 1 licensing refers to option 1 of the joint patent declaration form, where an intellectual property holder can indicate that he will not charge for his IP. Laymen refer to this type of licensing as royalty-free. In order to finalize the discussion MPEG issued a resolution at the 92nd meeting related to royaltyfree video codecs. Until this 93rd meeting, the CH, CN, IT, KR, and UK NB indicated support. No input document on this support was received. The FR and US NB indicated that they have no resources available. Given that the minimum resources for a new work item is available, MPEG decided to further explore this type of standard. In response to N11533 Call for Evidence on Option-1 Video Coding Technology, MPEG received 2 late inputs from Oracle (US NB), Peking University, Tsinghua University, and Zhejiang University (CN NB). One proposal included the full set of coded video sequences. During this meeting, these sequences were evaluated in an informal subjective test. Compared to MPEG-1, a noticeable improvement in subjective quality and an objective improvement of 1db to 4 db was observed. 61 Comparisons against other standards like VP8 were not available. Given that the proponents envision further improvements of their codecs, a second call for evidence N11676 Call for Evidence on Option-1 Video Coding Technology was made. The due date is the 95th meeting. The adhoc group N11685 AHG on Evaluation of Responses to the Call for Evidence prepares the evaluation. No progress was made with respect to the working guidelines for developing an Option-1 licensable codec. Between the 93rd and 94th MPEG meeting, an indefinite moratorium on airtime fees for Internet video that is free for end users was announced by MPEG LA. Therefore, MPEG asks its members whether there is still a need to develop a type-1 licensable codec and what its performance requirements are. 4.8 User Identification In many applications relevant to MPEG standardization, an identification of individual users or a description of users may be desirable. Currently, a user description in the context of MPEG-V is under investigation. A more focused and all-embracing effort related to user identification might be required. This identification of users may be useful for content adaptation and for exploitation of user preferences. Obviously, such an user identification bears some risks related to privacy. The Requirements subgroup kindly encourages organizations to provide input on user identification/description. 62 Annex F – Systems report Source: Young-Kwon Lim, Chair 1 No. X 11560 11561 11562 X 11563 11564 X 11565 Output of the Meeting Title TBP Available Editor No 10/10/15 Sam Yes Yes 10/10/15 10/10/29 Sam Sam No 10/10/15 Jean L. No 10/10/15 Jean L. No 10/10/15 Text of ISO/IEC 14496-5:2001/FDAM 28 Reference software for LASeR No 10/10/15 Request for ISO/IEC 14496-5:2001/AMD 30 Reference software for BIFS No 10/10/15 SeoYoung SeoYoung Jean Text of ISO/IEC 14496-5:2001/PDAM 30 Reference software for BIFS No 10/10/22 Jean No 10/10/15 Jamie No 10/10/15 Dave No 10/10/29 Dave No No 10/10/15 10/10/15 Vladimir Vladimir No 10/10/15 10/10/15 Xin No No 10/10/29 Kugjin No 10/10/15 Inseon No 10/10/15 Inseon 13818-1 Systems Study of ISO/IEC 13818-1:2007/FPDAM 6 Extension to AVC video descriptor and signaling of operation points for MVC List of errata items of ISO/IEC 13818-1:2007 WD of ISO/IEC 13818-1:2007/AMD 7 Signaling of frame compatible and service compatible stereoscopic video 14496-4 Conformance Request for ISO/IEC 14496-4:2004/AMD 39 Conformance for BIFS ExtendedCore2D Profile Text of ISO/IEC 14496-4:2004/PDAM 39 Conformance for BIFS ExtendedCore2D Profile 14496-5 Reference Softwares DoC on ISO/IEC 14496-5:2001/FPDAM 28 Reference software for LASeR adaptation tools 11566 adaptation tools 11567 ExtendedCore2D Profile 11568 ExtendedCore2D Profile X 11569 X 11570 11586 X 14496-11 Scene Description and Application Engine Technologies under Consideration for use of BIFS on Collaborative Application 14496-12 ISO Base Media File Format Request for ISO/IEC 14496-12:2008 AMD 3 DASH support and RTP reception hint track processing Text of ISO/IEC 14496-12:2008 PDAM 3 DASH support and RTP reception hint track processing 14496-28 Composite Font 11571 Evaluation Report 11572 WD of ISO/IEC 14496-28 Composite Font X 21000-20 Contract Expression Language WD of ISO/IEC 21000-20 Contract Expression Language 11573 11664 Description of CE on Contract Expression Language X 11574 X 11575 11576 23000-11 Stereoscopic Video Application Format Text of ISO/IEC 23000-11/FDAM 1 Stereoscopic Video AF Ref. Soft and Conf. 23000-12 Interactive Music AF Study of ISO/IEC 23000-12 FPDAM 1 Conformance and Reference Software Workplan of ISO/IEC 23000-12 Conformance and Reference Software 63 11599 Description of CE on dynamic volume change No 10/10/15 Yongwei 23001-6 Dynamic Adaptive Streaming over HTTP Request for the subdivision of ISO/IEC 23001-6 No 10/10/15 Text of ISO/IEC CD 23001-6 Dynamic Adaptive Streaming over HTTP No 10/10/20 Description of Evaluation Experiments on Dynamic Adaptive Streaming No 10/10/15 Thomas St. Thomas St. Jin Workplan of Reference Software for DASH No 10/10/15 Thomas St. 10/10/19 Xin No 10/10/19 Sergio No 10/10/19 Michael No 10/10/15 No 10/10/21 Michael No No 10/10/15 10/10/15 JC JC No 10/10/15 Jean L. Y 10/10/30 Joo No 10/10/22 Jean G. No 10/11/12 Seungju 11590 WD of ISO/IEC FDIS 23005-3 AMD 1 Binary representation of sensory No 10/11/12 Kyoungro 10/11/12 Marius No 10/11/12 Kyoungro No 10/11/12 Kyoungro No 10/10/22 Christian No 10/10/15 Kyungmo X 11577 11578 11579 over HTTP 11580 X 23006-1 Architecture and Technologies 11581 Text of ISO/IEC CD 23001-1 2nd edition Multimedia Service Platform Yes Technologies - Architecture X 23006-2 Application Programing Interface 11582 Text of 2nd edition of ISO/IEC CD 23006-2 MXM APIs X 23006-4 Elementary Services 11583 Text of 2nd edition ISO/IEC CD 23006-4 Elementary Services X 23006-5 Service Aggregation Request of subdivision of ISO/IEC 23006-5 Service 11692 Aggregation 11584 Text of ISO/IEC CD 23006-5 Service Aggregation X 23007-1 Widgets 11598 Request for ISO/IEC 23007-1 AMD 1 Widget Extensions 11585 Text of ISO/IEC 23007-1 PDAM 1 Widget Extensions X 23007-3 Conformance and Reference Software 11587 Study of ISO/IEC FCD 23007-3 Conformance and Reference Software X 23005 Media context and control 11691 MPEG-V awareness day Call for Participation X 23005-1 Media context and control – Architecture 11588 WD of ISO/IEC 23005-1 AMD 1 Additional Use Cases X 23005-2 Media context and control – Control Information 11589 WD of ISO/IEC 23005-2 AMD 1 Additional Control Information and binary representation X 23005-3 Media context and control – Sensory Information information X 23005-4 Media context and control – Virtual World Object Characteristics 11591 WD of ISO/IEC FDIS 23005-4 AMD 1 Binary representation of virtual No X world object characteristics 23005-5 Media context and control – Data Formats for Interaction Devices 11592 WD of ISO/IEC FDIS 23005-5 AMD 1 Additional sensors, interaction devices and binary representation X 23005-6 Media context and control – Common Data Format 11593 WD of ISO/IEC FDIS 23005-6 AMD 1 Binary representation of common data format X 23005-7 Media context and control – Conformance and Reference Software 11594 Study of ISO/IEC FCD 23005-7 Conformance and Reference Software X Promotion 11595 Storyboard for MPEG-U promotion video 64 11596 Short descriptions X Assets and Standing Documents 11605 MPEG Schema Assets Updates 11609 MPEG URIs and MIME Types X 11632 11633 11634 11635 11636 11637 11638 11639 11640 11641 11663 10/10/15 10/10/15 Christian Christian No 10/10/15 Christian No 10/10/15 Iraj No 10/10/15 Xin No 10/10/15 Sam No No 10/10/15 10/10/15 Young Iraj No No 10/10/15 10/10/15 Young Young No No 10/10/15 10/10/15 Young JC No 10/10/15 No 10/10/15 Jean L. Young SVN support 11597 Guide to the MPEG Subversion Repository X No No Liaison Liaison statement template on Dynamic Adapative Streaming over HTTP Liaison statement template on Multimedia Service Platform Technology Liaison statement template on signalling of stereoscopic video in MPEG-2 System Liaison statement to SC 6 on MPEG Media Transport Liaison statement to 3GPP on Dynamic Adapative Streaming over HTTP Liaison statement to JTC 1/WG 7 on Sensor Networks Liaison statement to IEC TC 100 on the definition of “trick mode” Liaison statement to JTC 1/SC 24 on “zip” format Liaison statement to IEC TC 100 on IEC CDV 62516-2, BIFS in T-DMB receiver Liaison statement to IEC TC 100 on Richmedia for e-Book Statement of benefits from establishing a Category C liaison with DTG 65 2 General Input Documents 2.1 AHG reports Session Number Title Plenary m17973 Ad Hoc on Scene Representation Plenary m17974 Ad Hoc on MPEG File Formats Plenary m17975 Ad Hoc on Application Format Plenary m17976 Ad Hoc on Font Format Representation Plenary m17977 Ad Hoc on Advanced IPTV Terminal Plenary m17978 Ad Hoc on MPEG-V Plenary m17979 Ad Hoc on Contract Expression Language Plenary m17980 Ad Hoc on Dynamic Adaptive Streaming over HTTP Plenary m17984 Ad Hoc on MPEG Media Transport (MMT) 2.2 Dispositions Accepted Xin Wang , Young Kwon Lim Marius Preda Xin Wang , Jaime Delgado Iraj Sodagar , David Singer , Young Kwon Lim Christian Timmerer Accepted Accepted Accepted Accepted Accepted Accepted Accepted Accepted General technical contributions Session Number Title Plenary m18001 Table of Replies on ISO/IEC 144961:2010/FDAM 1 2.3 Demo 2.4 FAQ Source ITTF via SC 29 Secretariat . 2.5 Source Jaeyeon Song , Cyril Concolato David Singer , Per Frojdh Kyuheon Kim , Kugjin Yun Vladimir Levantovsky AOB None. 66 Dispositions Noted 3 MPEG-2 Systems (13818-1) 3.1 Topics 3.1.1 ISO/IEC 13818-1:2007 AMD 5 Carriage of JPEG2000 over MPEG-2 3.1.2 ISO/IEC 13818-1:2007 AMD 6 Extension to AVC video descriptor and operation points for MVC 3.2 Contributions Session MPEG2 MPEG2 MPEG2 MPEG2 MPEG2 3.3 Number Title m18029 USNB Contribution: WG11 resolution 2.1.4 consideration m18195 JNB comments on Resolution 2.1.4: Methods for signaling frame packing arrangement in MPEG-2 video m18351 Korean NB comment on 3DTV signaling in MPEG2 System m18359 Liaison Statement from DVB m18550 Liaison Statement from TTA (Korea) Source A. G. Tescher for USNB Dispositions Noted (N1562) Japan National Body Noted (N1562) Korean NB via SC 29 Secretariat Noted (N1562) DVB via SC 29 Secretariat TTA (Korea) via SC 29 Secretariat Noted (N1562) Noted (N1562) Summary of discussions • 3.4 ignalling of We received 5 input contributions related to system level signaling of 3D content coded MPEG-2 video: – USNB and KNB support option to use new stream_type and a new descriptor. – JNB did not support a new stream_type due to compatibility issues with deployed systems. – KNB and TTA requested adding support for service compatible mode in addition to frame compatible mode. - DVB indicated that they do have any plans currently to use MPEG-2 video for 3D services. Action Points ITU-T Rec. H.222.0 (05/2006)/Amd.5|ISO/IEC 13818-1:2007/FPDAM 5 ballot until 201012-03 ITU-T Rec. H.222.0 (05/2006)/Amd.5|ISO/IEC 13818-1:2007/FPDAM 6 ballot until 201012-03 67 4 MPEG-4 Reference Software (14496-5) 4.1 Topics 4.1.1 ISO/IEC 14496-5:2001 AMD 28 Reference software for LASeR adaptation tools 4.1.2 ISO/IEC 14496-5:2001 AMD 29 PMSI for LASeR 4.2 Contributions Session Scene 4.3 Number Title m18005 Summary of Voting on ISO/IEC 144965:2001/FPDAM 28 Source SC 29 Secretariat Dispositions Accepted (N11565, N11566) Summary of discussions m18005 One disapproval with comment from France. Disposition is provided appropriately. 4.4 Action Points ISO/IEC 14496-5:2001/PDAM 29 ballot until 2010-11-03 5 5.1 MPEG-4 BIFS (14496-11) Topics 5.1.1 Exploration - Remote and Collaborative work - Haptic interaction 5.2 Contributions Session Scene Scene Scene Scene Number Title m17998 Table of Replies on ISO/IEC 1449611:2005/FDAM 7 m18382 Conformance Sequences for BIFS ExtendedCore2D m18383 Reference Software for BIFS ExtendedCore2D m18407 Use Case for Collaborative BiFS Scenes : Distance Learning 68 Source ITTF via SC 29 Secretariat Dispositions Noted Jean Le Feuvre Accepted (N11564) Accepted (N11566) Accepted (N11569) Jean Le Feuvre Kim Pelsor , Jamie Marshall , Scene Scene m18408 Use Case for Collaborative BiFS Scenes : Proactive Medicine m18409 Use Case for Collaborative BiFS Scenes : Consultations without Walls Scene m18412 Use Case for Collaborative BiFS Scenes : A day in a Professor's life Scene m18417 Use Cases for Collaborative BiFS Scenes : Overview Scene m18422 Image Transmission for Mobile Thin Clients : A Comparative Study 5.3 Dr Michael McGannon , Jamie Marshall , Accepted (N11569) Fritz-Joachim Westphal , Mario Kind , Jamie Marshall , Mihai Mitrea , Mihai Mitrea , Jamie Marshall , Françoise Preteux , Pieter Simoens , Bart Dhoedt , , Jamie Marshall , Mihai Mitrea , Bojan Joveski , Ludovico Gardenghi , Françoise Preteux , Bojan Joveski , Ludovico Gardenghi , Jamie Marshall , Mihai Mitrea , Françoise Preteux Accepted (N11569) Accepted (N11569) Accepted (N11569) Noted Summary of discussions M18382 A set of conformance sequences produced by GPAC have been uploaded in this contribution (mp4 and xmt). Another set of files in the used by bifsenc/mux in IM1 is also provided. There is no repository for sequences, so where to put them ? This should go to the WD on BIFS ExtendedCore2D profile conformance M18383 IM1-2D has been updated to reflect the latest versions of BIFS. The project files were updated to Visual Studio 8 (and should be upgradable easily now). MPEG-J support is not activated because it could not be rebuilt. Some 3DGC code needs some help. Parsing, decoding and encoding of the new tools is functional for nodes and commands. There is utility software to check the behaviour of the nodes (GPAC). So there are two interoperable open implementations of the standard. This should go into a WD of BIFS ExtendedCore2D Reference Software. This raises the question of free readonly access to the MPEG SVN. The five contributions are presented together through m18417. m18407: question about the virtual lab, sharing a lab software, clarification that if there is software, it is on the server, and all communication with the software is managed by the collaborative environment. m18408: what is the specificity of this use case ? possibly the security/privacy issues, and how to establish and communicate trust. m18409: specificity is the mobility, the sharing of high quality material and the variety of usage environment, but not so collaborative. 69 m18412: session migration, collaborate on external software, remote upgrades There is a missing tool on allowing collaboration on external software, a sort of extended VNC. Should it be connected to BIFS? No! We should specify what are the needs: a scene description format, possibly XML-base, with events, maybe an update mechanism (but could be in the collaborative layer). M18422 This describes using mosaic of images to share an application within a BIFS environment. The criticism expressed in the contribution was clarified to be criticism of the GPAC implementation, not of the standard. Guidelines were provided to the author about the proper use of CacheTexture, once the implementation in GPAC has been improved. 5.4 Action Points 6 MPEG-4 ISO Base File Format (14496-12) 6.1 Topics 6.1.1 ISO/IEC 14496-12:2008/PDAM 2 Support for sub-track selection & switching Subtrack selection and switching Adaptive progressive download 6.2 Contributions 6.3 Summary of discussions 6.4 Action Points ISO/IEC 14496-12:2008/DCOR 4 ballot until 2010-11-10 ISO/IEC 14496-12:2008/FPDAM 2 ballot until 2011-02-03 7 LASeR & SAF (14496-20) 7.1 Topics 7.1.1 Exploration Advanced User Interaction Interface 7.2 Contributions 70 Session Scene 7.3 Number Title m18529 Proposal of an hierarchical representation of a LASeR scene Source Mattia Donna Bianco Dispositions Rejected Summary of discussions m18529: the contribution was presented, together with further discussions on the reflector. Some of it comes from a misunderstanding, and further discussions on the reflector may be needed. However, the contribution cannot be accepted. 7.4 Action Points 8 Open Font Format (14496-22) 8.1 Topics 8.1.1 Composite Fonts 8.2 Contributions . Session 8.3 Number Title Source Dispositions Summary of discussions M18575 8.4 Action Points 9 Contract Expression Language (21000-20) 9.1 Topics 9.1.1 ISO/IEC 21000-20 Contract Expression Language 9.2 Contributions Session Number Title Source 71 Dispositions MSP m18453 Results of the Core Experiment for Contracts Mapping MSP m18561 Proposal of initial CEL Working Draft 9.3 Jaime Delgado , VÃctor RodrÃguezDoncel , Francesco Gallo , Elisa Todarello , Annarita Di Carlo , Walter Allasia , Eva RodrÃguez Jaime Delgado , Victor RodriguezDoncel Accepted N11664 Accepted N11573 Summary of discussions M18453 A lot of work done for mapping 4 contracts M18561 Good to be a starting point. More discussions needed to finalize a core structure of CEL. 9.4 Action Points 10 Stereoscopic Video AF (23000-11) 10.1 Topics 10.1.1 ISO/IEC 23000-11 AMD1 Conformance and Reference Software 10.1.2 ISO/IEC 23000-11 AMD2 Additional composition type 10.2 MAF 10.3 Contributions m18503 Proposed Text of ISO/IEC 23000-11/FDAM1 Stereoscopic Video AF Conformance and Reference Software Next Generation Broadcasting Forum (Korea) Accepted N11574 Summary of discussions M18503 has been reviewed with minor changes in conformance points, conforming files, and Annex B, and then is finally approved. The reference SW is to support all the three conformance points, and the four conforming files are provided as well. In order to finalize the FDAM doc, it is required to have two weeks editing period. 72 10.4 Action Points ISO/IEC 23000-11:2009/PDAM 2 ballot until 2010-11-03 11 Interactive Music AF (23000-12) 11.1 Topics 11.1.1 ISO/IEC 23000-12 AMD1 Conformance and Reference Software 11.2 Contributions Session MAF MAF MAF 11.3 Number Title m18165 An experimental study of different dynamic volume change encoding methods for IMAF m18166 A study on the computational complexity of EQ implementation in IMAF m18362 Updated text and reference SW of ISO/IEC 23000-12/FPDAM1 IM AF Source Yongwei Zhu , Rongshan Yu , Susanto Rahardja Dispositions Accepted N11599 Yongwei Zhu , Rongshan Yu , Susanto Rahardja Noted Inseon Jang , Jeongil Seo , Laurent Primaux Accepted N11575 Summary of discussions M18165: proposed the duration_update, which seems to have a less parsing complexity and improve bit savings. However, it is not so sure whether it is affect the quality of the audio sequence. It is recommended to conduct the experiment to find out the saving of complexity and bits without any degradation of the quality in audio sequence. m18166: EQ has been requested as a function in the requirement document, however, it is not supported in the current IMAF. Thus, it is requested to bring a syntax and semantics for satisfying the requirement. M18362: Approved it. 11.4 Action Points ISO/IEC 23000-12:2010/PDAM 1 ballot until 2010-12-05 73 12 Dynamic Adaptive Streaming over HTTP (23001-6) 12.1 Topics • • • 12.2 Manifest Delivery format for ISO BM FF Delivery format for MPEG-2 TS Contributions Session Number DASH m18007 DASH m18008 DASH m18009 DASH m18010 DASH m18011 DASH m18012 DASH m18013 DASH m18014 DASH m18015 DASH m18016 DASH m18017 DASH m18018 Title DASH Evaluation Experiment 5: Delivery Format of MPEG2 TS Report of DASH Evaluation Experiment 9, Delivery Format to File Format Conversion (DFC) DASH Evaluation Experiment #1: Compositions of Media Presentation (CMP) Proposal Comparison DASH Evaluation Experiment #2: MPD Modification, Corrections, and Extension DASH Evaluation Experiment #8, Clock Drift Control DASH Evaluation Experiment #4: Delivery Format Addressing DASH Evaluation Experiment #3: Enhancement to Representation Description (ERD) DASH Evaluation Experiment #7: Seamless Switching (SES) Updated Information on Reference Software for DASH EE#6: Trick Mode and Random Access Signaling (TRA) Some comments on DASH EE#7 – Seamless Switching Proposed Amendment: Support for Common 74 Source Waqar Zia Dispositions Refer m17980 Ye-Kui Wang , Sungryeul Rhyu , Miska M. Hannuksela Refer m17980 Jin Young Lee , Harry Pyle Refer m17980 Jin Young Lee Refer m17980 Nhut Nguyen , Harry Pyle Gerard Fernando , Kyungmo Park Refer m17980 Refer m17980 Waqar Zia , Ying Chen , Yago Sanchez Refer m17980 Ying Chen , Ingo Hofmann Thomas Stockhammer Refer m17980 Refer m17980 Ye-Kui Wang , Thomas Stockhammer Refer m17980 Mark Watson Refer m17980 Accepted N11578 Kilroy Hughes , Quintin Burns DASH m18019 DASH m18309 DASH m18311 DASH m18347 DASH m18348 DASH m18353 DASH m18357 DASH m18366 DASH m18370 DASH m18380 DASH m18381 DASH m18385 DASH m18387 DASH m18388 DASH m18390 Encryption Proposed Amendment: Independently Decodable Fragments DASH related clarifications to ISO base media file format Base DASH on updated 3GPP adaptive HTTP streaming specification DSAH - support of rewind trick mode On random access point signaling in file formats Consideration of Clock Drift Control for DASH Kilroy Hughes , Quintin Burns Accepted N11578 Ye-Kui Wang , Zhenyu Wu Accepted N11578 Ye-Kui Wang , Per Fröjdh , Thomas Stockhammer , David Furbeck , Imed Bouazizi Ye-Kui Wang Accepted N11578 Ye-Kui Wang Yasuaki Tokumo , Maki Takahashi , Shuichi Watanabe , Norio Ito , Yasuaki Tokumo , Maki Takahashi , Shuichi Watanabe , Yoshiaki Ogisawa , Takashi Kaneko , Norio Ito , Gerard Fernando , Yang Yanzi , Wang Fang Modification of Enhancement to Representation Description on DASH for signaling of 3D content Proposal for new box to support random access for DASH Carriage of MPEG-2 TS Gerard Fernando , Yang support information in Index Yanzi , Wang Fang file for DASH Comments on DASH WD Cyril Concolato , Jean Le Feuvre , Romain Bouqueau , Report on an Open Source Cyril Concolato , Jean Le implementation of MPEGFeuvre , Romain DASH Bouqueau Annotation and Attributes Sungryeul Rhyu , Seofor Media Compositions Young Hwang , Kyungmo park , Jaeyeon Song , Bogyeong Kang , Nhut Nguyen Compelling use-case for Yonghun Lee , Doug M17770(DFA-EE) Young Suh , Jaeyeon Song , kyungmo Park , Timed metadata Support in Sungryeul Rhyu , SeoDASH Young Hwang , Nhut Nguyen , Benefits of TRA Chulkeun Kim , Doug 75 Accepted N11579 Accepted N11578 Noted Noted Accepted N11579 Accepted N11579 Accepted N11578 Noted Accepted N11579 Accepted N11578 Accepted N11578 Accepted representation DASH m18391 File Format for 3D Support of DASH DASH m18392 Random Access Point (RAP) Indication DASH m18394 Content Insertion on DASH DASH m18400 DASH m18401 DASH m18403 DASH m18405 Editor's Proposed Updates on DASH Working Draft Adding Minimum RAP Frequency to MPD Reference Software for DASH Extensions to Media Presentation Description to support HTTP Streaming from multiple servers Scalable Random Access for SVC based DASH DASH DASH DASH DASH Young Suh , Jaeyeon Song , Kyungmo Park Kyuheon Kim , Jangwon Lee , Jung-Han Kim , Hee Jean Kim , Gahyun Ryu , Gilyoon Kim , Do Young Joung , Namguk Kim Hojin Ha , Sungryeul Rhyu , Seo-Young Hwang , Jaeyeon Song Seo-Young Hwang , Sungryeul Rhyu , Hojin Ha Thomas Stockhammer Thomas Stockhammer Thomas Stockhammer Helmut Bürklin , Stéphane Gouache , Truong Cong Thang , Jin Young Lee m18414 Jin Young Lee , Kwangdeok Seo , Hyung Jung Kim , Truong Cong Thang , Jung Won Kang , Seong-Jun Bae , Soonheung Jung , Sang Taick Park m18415 Virtual Segmentation of TS Jin Young Lee , Euy-doc Packetized Video using Key- Jang , Jae-Gon Kim , frame Information Truong Cong Thang , Jung Won Kang , SeongJun Bae , Soon-heung Jung , Sang Taick Park m18418 File Format Adaptation Kwang-deok Seo , Based on HTTP Hint Sang-woo Shim , Jin Track for HTTP Streaming Young Lee , Truong Cong Thang , Jung Won Kang , Seong-Jun Bae , Soon-heung Jung , Sang Taick Park m18439 Contribution to DASH EE#2 Truong Cong Thang , Jin Young Lee , Jung Won Kang , Seong-Jun Bae , Soon-heung Jung , Sang Taick Park 76 N11578 Accepted N11578 Accepted N11579 Accepted N11578 Accepted N11578 Accepted N11579 Accepted N11580 Accepted N11578 Accepted N11579 Accepted N11579 Accepted N11579 Accepted N11579 DASH m18440 Use Cases of Multiple Byte Ranges for DASH DASH m18441 Comments on the Use of RAP in Segments DASH m18442 DASH m18443 DASH m18444 DASH m18445 Test sequences in DASH format DASH File Format and MPD Considerations Proposal for using MPEG-21 Digital Item Declaration and W3C XInclude/XLink for Media Presentation Description Proposal on Signaling for DASH DASH m18457 MPEG-2 TS RAP indexing for DASH DASH m18462 DASH m18463 DASH m18464 DASH m18465 DASH DASH m18482 m18483 DASH m18484 DASH m18493 DASH m18494 Contribution to DASH EE#5: MPEG2-TS Reception Hint Track Contribution to DASH EE#7: bitstreamStructureId Contribution to DASH EE#9: MPD Indication for File Conversion of Scalable Streams Contribution to DASH EE#9: File Construction Instruction Format DASH MPD Delta Files Content Protection and DRM in DASH Signaling for Seamless Switching for DASH On decode times in movie fragments DASH: Considerations on 77 Truong Cong Thang , Jin Young Lee , Jung Won Kang , Seong-Jun Bae , Soon-heung Jung , Sang Taick Park Truong Cong Thang , Jin Young Lee , Jung Won Kang , Seong-Jun Bae , Soon-heung Jung , Sang Taick Park , Thorsten Lohmar , Per Fröjdh Per Fröjdh , Torbjörn Einarsson Christopher Müller , Christian Timmerer , Accepted N11579 Truong Cong Thang , Jin Young Lee , Jung Won Kang , Seong-Jun Bae , Soon-heung Jung , Sang Taick Park , Zhijie Zhao , Marco Munderloh , Joern Ostermann Miska M. Hannuksela Noted Miska M. Hannuksela , Imed Bouazizi Miska M. Hannuksela , Imed Bouazizi Accepted N11579 Accepted N11579 Miska M. Hannuksela Accepted N11579 Accepted N11579 Accepted N11579 Accepted N11578 Accepted N11579 Accepted N11579 Accepted N11579 David Furbeck Xin Wang , Shaobo Zhang Accepted N11579 Zhang Shaobo , Xin Wang Accepted N11579 David Singer Accepted N11579 Mark Watson , Thomas Accepted Overlap during switching DASH: URL selection Stockhammer Mark Watson DASH EE#7 – Segment Index DASH: Considerations on Segment Duration Input for DASH EE#1 (CMP): Pixel Aspect Ratio DASH: Field data for quantitative evaluation Mark Watson , Thomas Stockhammer watsonm@netflix.com DASH m18495 DASH m18496 DASH m18497 DASH m18498 DASH m18504 DASH m18524 Contribution to EE#2 MCE: Extension and Modification DASH m18545 Seamless switching for TS in DASH DASH m18549 DASH m18551 DASH m18558 DASH m18559 DASH m18560 DASH m18562 DASH m18565 DASH m18566 DASH m18392 DASH – Quality of Experience and performance metrics DASH: MPD Representation Groups and MPD Fragmentatio Usage of MPEG-2 Transport Streams Segment Index Syntax for MPEG-2 TS Comments on MPEG-2 Transport Streams in DASH LGE proposal for PIP attributes for CMP Support of virtual/nonsegmentation in DASH DASH DTS M2TS Compression gzip comparison Random Access Point (RAP) Indication DASH m18416 TS Header Extension for Efficient Adaptation in the HTTP Streaming 78 Mark Watson Doug Young Suh , Kyungmo Park , Sungryeul Rhyu Zhijie Zhao , Marco Munderloh , Joern Ostermann Cyril Concolato , Romain Bouqueau , Jean Le Feuvre Mark Watson Mark Watson , Thomas Stockhammer , Kilroy Hughes , David Singer , Alex Giladi Alex Giladi alex.giladi@gmail.com hj08.lee@lge.com , w.zia@lglab.eu Zhijie Zhao , Yanzi Yang H J Lee , w.zia@lglab.eu Hojin Ha , Sungryeul Rhyu , Seo-Young Hwang , Jaeyeon Song Jin Young Lee , Euy-doc Jang , Jae-Gon Kim , Truong Cong Thang , Jung Won Kang , SeongJun Bae , Soon-heung Jung , Sang Taick Park N11578 Accepted N11579 Accepted N11578 Noted. Accepted N11578 Accepted N11579 Noted Accepted N11579 Accepted N11579 Accepted N11578 Accepted N11579 Accepted N11579 Accepted N11579 Accepted N11579 Accepted N11579 Accepted N11579 Accepted N11579 Accepted N11579 12.3 Summary of discussions m18464: the flag will be considered as part of Representation ‘annotation” in more generalized case and will be discussed there. m18465: will bring back an enhanced version of proposal that also includes less on MPD signaling. Show the evidence of how often the client can create a valid MP4 file, using the existing players (whether file with missing track in mdat can be played by legacy player?) -“If satisfied, proposal 1 (one initialization segment for all reps) get added as client/service guidelines (informative)” no input. ExpriationTimeUTC: Samsung will have contribution at the Jan meeting. m18353: seems not directly related to EE8 m18562 moved to EE1 (CMP). m18370: supporting information + index file+ MPD additions for index file m18462: dual approach using sidx and MP4 or separate file m18457: two solutions: 1) index file similar to m18370 2) include RAP index in PID stream, prefer PID approach. Need to transcode new TS. :m18558 & m18560: fixes and improvements on the baseline. m18392: extensions 2 M2TS, goes to M2TS extension work, outside of DASH m18559:similar to m18392 but use private data m18545: the audio glitch during switching of video streams m18566: - map PTS to MPD timeline, gzip vs LG proposal: not completed. m18439: ETRI, proposes “periodref”, Xlink and XInclude are not appropriate. m18551: section 3. Propose a new element “Include” in MPD inside any period or representation. Both 18439 and 18551 do not recommend XInclude or Xlink. Some people think Xlink may able to do same. Will bring evidence tomorrow. Will decide on the solution after CMP review. M18388: timemetadata, no consensus yet. m18366: msra box for accessing RAP. No major advantage to WD is shown yet. m18387: new proposal to extend sidx box, will be reviewed with TRA proposals. m18440: no strong support m18565: no new decision m18385: mediacomposition group 79 m18444: DID, MPD selection m18498: aspect ratio. New attributes for signaling the aspect ratio. m18551: RepresenationGroup The concept of RepresentationGroup in m18551 was originally introduced by ETRI in section 5.4 of m18009 as componentRepresentations Following attributes in RepresentatonGroup element (m18551) was originally introduced by ETRI in section 5.4 of m18009; lang mimeType minBandwidth maxBandwidth minWidth maxWidth minHeight maxHeight minFramerate maxFrame m18562: view priority and PIP location are provided as representation attributes in MPD. But since MPD is not designed to convey presentation/layout information, therefore the proposed attributes are not added to MPD. m18385 will provide a use case and detail example of its proposal compared with WD by Tues. DRM part of CMP will be reviewed during the week. 18390: adding RAPperiod to MPD. Accepted the maxmiumRAPperiod into MPD. Keep the constraint of one minimum RAP. 18401: minimum RAP frequency. m18347: rewind trick mode: alternatePlayoutRate have negative value + additional text to clarify the meaning of negative/positive. MP4FF needs also to signal for reverse sample order in reverse order. m18441: remove the constraint of having minimum one RAP. Propose RandAcess element in MPD to signal segments with RAPs. m18443: temporal ordering of tick mode. Not to use one trun. Sample grouping using “tele” box in MP4. Use segment type bx to brand the segment as temporal scalability. A new attribute in trick mode element of MPD for temporal ordering, also number of levels. Sample grouping functionality might exist in AVC FF. Accept m18443 with the refinement on AVC extension box, resolve the disagreement on index box after sidx. Collaboration with 18387. Update on m18016: m18387: extend the sidx box. Decision: m18441: bring back the proposal with more details. A basic question to answer generally: if the values of attributes are out of range, what would be the behavior of the client? (contributions are welcomed to answer this). 80 m18463: need two decoded 2 I frames sometimes to switch seamlessly. To avoid overlapping, proposing bitstreamStrcutureId attribute for representation meaning that they can be switched and the old reference frame used for new ones. m18493: sidx is not easy to create. MPD gives the accurate time for segments. Add the tfdt box after Track Fragment Header box. Tfad express the clean start. m18494: switching while overlapping representation. encodedSampleNooverlapFlag at represenationgroup m18496: sidx box has two purposes: timing recovery and RAP indexing. TrackFragmentDecodeTime can do it better. Sidx Define decoding time, should have defined the composition time. Accept it to WD. Ask 3gpp to align the 3GPP release 9. Come back with fragmentAlignmentFlag to propose. m18497: for information. Will be presented later. M18405 Multiple URl with simple priority. M18495 . If there is any issues with CDN, it will brought up. We will look at the exact syntax and accept it if there is no issue. M18497 Add to WD: add “duration” to period. Make period start time optional Only for VOD case, if it doesn’t break anything else in MPD. M18445: - supp. Can be delivered as content, no need to be part of MPD. Average bandwidth: not average. If use the same definition of Rep’s bandwidth. No consensus on this. Bring modified proposal. M18524: - no strong support for flag. Group: WD will be clarified that this is the case. M18357: Add stereoVideo to represenation and generic FramePackingType M18484: deferred to later. M18549 Propose QoE metrics for informative part. Exact parameters to be further defined and refined. Approve an EE on this: Mark to provide the description. M18483 On, UsageRights, UsageRightUri MPDIntegrity: Need further studies. Possibly start an EE. M18385 On downloadableDRMInfo 81 M18015 was presented in Paris and reviewed here again. DASH Media and DASH Client imple,ments the major feature of the software. Built on Mac, Linux. m18402: duplicate registration. m18403: the software plan m18442: Test sequences and a site that provide VOD and live content for DASH. m18444: Demoed a player which parses MPD and plays on representation. M18381: implementation based on GPAC open source project. Parsing MPD. It doesn’t support switching - m18309: • clalify spec that track fragment boxes covering the same time period: corrigenda issue • restriction on the composition time of following samples • Movie fragments covering the same time period should be time-aligned. -> this should be addressed in DASH spec. • Allow gaps in movie fragment box: corrigenda issue. No additional FF amendment required. Either corrigenda issues or better DASH. M18348: signaling rap in FF. 1) Define RAP to include all 3 types of RAPs. 2) Sync sample table box should have only IDR RAPs. 3) Add 2 fields to rap sample group entry. 4) Add one filed to syntax of roll sample group entry. Discuss with dave. May be already addressed. 5) Clarify raps in mfra ox can be any raps. Give a definition of sync sample. Part of corrigenda. Editors need to go look at Part 12 and 15 and see what is missing about RAPs and propose for next corrigenda. m18391: 3 new boxes for 3-D signaling. Svmi, scdi and mfsi see part 15 extension and if it can be addressed there. m18018 and m18019: -Trik box: start work with RAP sample group box. Integrated sample group to be defined to address this. Will be addressed as part of editing of the PDAM. -sample encryption box -> study if it can be implemented with time metadata track, - track encryption box add to PDAM, - pssh: draft a metada item, instread of using uuid, use a 4cc register id. Goes to PDAM. m18418: HTTPHintSampleEntry, not part of delivery. M180415: signaling the RAP indexes in MPD. There are 3 different methods: inside M2TS signaling, sidx based signaling and signaling in MPD. 12.4 Action Points Detailed schedule for DASH sessions. Day Time Topic 16:00 – Review of EE results Mon. ??:?? 09:00 – New contributions Tue. ??:?? 12:00 – Reference Software Wed. 82 13:00 14:00 – 16:00 09:00 – 13:00 Wed. Thu. MPEG-2 TS, File Format Issues Remaining issues 13 Multimedia Service Platform Architecture (23006-1) 13.1 Topics 13.1.1 ISO/IEC 23006-1 MXM Architecture and technologies 13.2 Contributions Session Number 13.3 Summary of discussions 13.4 Action Points 14 MSP APIs (23006-2) 14.1 Topics Title Source Disposition 14.1.1 ISO/IEC 23006-2 MXM APIs 14.2 Contributions Session MSP Number Title m18528 Contribution for 23006-2 Source sergio matone MSP m18505 Java implementation for generic metadata APIs on MXM Wonsuk Lee , Seungyun Lee 14.3 Summary of discussions M18528 This is an Editor’s input to improve the text. 83 Dispositions Accepted N11582 Accepted N11582 M18505 This provides software for generic metadata API for MXM. It should be integrated into the existing package. The authors will provide an input document to provide the API specification for generic metadata. 14.4 Action Points 15 MSP Elementary Service Protocol (23006-4) 15.1 Topics 15.1.1 ISO/IEC 23006-4 MXM Protocols 15.2 Contributions Session MSP MSP MSP MSP MSP MSP MSP Number Title m18526 Comments on WorkingDraft 23006-4 m18476 Input on w11413 (WD of ISO/IEC 23006-4 2nd edition Elementary Service Protocols) m18213 Background for Proposing Scalable Application Markup Language for AIT’s Present Content Elementary Service m18214 Proposal of Scalable Application Markup Language (SAML) for AIT’s Present Content Elementary Service m18227 Proposal of Stack Operators for AIT’s Present Content Elementary Service m18229 Initial Proposal of DCFunctions for AIT’s Present Content Elementary Service m18538 AIT Elementary Service Search License 84 Source sergio matone Michael Grafl , Christian Timmerer Dispositions Accepted N11583 Accepted N11583 Kyoungro Yoon , MinUk Kim , Tae-Beom Lim , Kyung Won Kim , Yun Ju Lee , Jae Won Moon Noted Kyung Won Kim , Tae-Beom Lim , Yun Ju Lee , Jae Won Moon , Kyoungro Yoon , Min-Uk Kim Noted Jae Won Moon , Kyung Won Kim , Tae-Beom Lim , Yun Ju Lee , Kyoungro Yoon , Min-Uk Kim Tae-Beom Lim , Kyung Won Kim , Yun Ju Lee , Jae Won Moon , Kyoungro Yoon , Min-Uk Kim Miran Choi , MyungGil Jang , Chang-ki Lee , Noted Noted Accepted N11583 MSP 15.3 m18539 AIT Elementary Service Search Contract HyunKi Kim , Miran Choi , Hyojeong Accepted N11583 Oh , Jeong Hur , Soojong Lim , Yeochan Yun Summary of discussions M18526 & m18476 This is an Editor input to improve the text The contributions m18213, m18214, m18227, m18229, and a demo of the use of Scalable Application Markup Language (SAML) were presented. It was agreed that the contributors will try to re-implement the demo using MPEG-21 DIA. They will evaluate, whether an extension needs to be specified in order to include and use MPEG-21 DIA information for the choice/selection mechanism in MPEG-21 DID. The result will be presented at the 95th MPEG meeting. M18538 This is an input on Search License ES. Accepted to be part of Part 4 ES. M18539 This is an input on Search Contract ES. Accepted to be part of Part 4 ES. 15.4 Action Points 16 MSP Service Aggregation (23006-5) 16.1 Topics 16.1.1 ISO/IEC 23006-5 MSP Service Aggregation 16.2 Contributions Session MSP MSP 16.3 Number Title m18527 Comments on WorkingDraft of 23006-5 m18477 Input on w11414 (WD of ISO/IEC 23006-5 Service Aggregation) Summary of discussions 85 Source sergio matone Michael Grafl , Christian Timmerer Dispositions Accepted N11584 Accepted N11584 M18527 & m18477 Editors input to improve the text 16.4 Action Points 17 Richmedia UI Widgets (23007-1) 17.1 Topics 17.1.1 Package, Delivery and Presentation of Widgets 17.1.2 Communication 17.2 Contributions Session Scene Scene 17.3 Number Title m17999 Table of Replies on ISO/IEC FDIS 23007-1 m18376 Proposed amendment for MPEG-U widgets Source ITTF via SC 29 Secretariat Jean-Claude Dufourd , Jean Lefeuvre , Cyril Concolato , Kyungmo Park , Jaeyeon Song Dispositions Noted Accepted (N11585) Summary of discussions m18376: The contribution is presented. A comment from MPEG-V is that numberarray may also be needed. An answer is that there is then a need to provide mapping of the array types to a type acceptable to all protocols (such as mapping to/from string type). Question on whether the stringarray implementation caused any problem, and the answer was no, beyond the mapping, hence the proposed restriction in the contribution. Question on the implementation of security restrictions, which led to a discussion to be continued. 17.4 Action Points 18 Advanced User Interaction Interface (23007-2) 18.1 Topics 18.1.1 Advanced User Interaction Interface 86 18.2 Contributions Session Scene 18.3 Number Title m18490 Proposed mechanism to support AUI interface Source Seong Yong Lim , Jihun Cha , Injae Lee , Young-kwon Lim Dispositions Noted Summary of discussions m18490 : Presentation of the contribution. The presented interface seems more like a higher level of MPEG-V than an MPEG-U interface. Request to do a state of the art on mouse gestures, same as was done on the iphone SDK. Question about what language to use to describe the gestures, and then what language to use to pass the information about a recognized gesture into a scene in a particular format: the answer for both is that the proposal is flexible. Once the gesture is recognized, it is important to adapt the information to the target scene format, even if the actual effect in the scene is out of scope. There seems to be a need for a database of recognized gestures, an entity which matches the output of the semantic generator to one recognized gesture (independent from the scene format), and then a module to translate the recognized gesture into some scene description specific stuff. Suggestion that MPEG-U part 1 can be used for some of the communications between MPEG-V and the scene. Request for a walkthrough of the system, a concrete example of a flow 18.4 Action Points 19 MPEG-U Conformance and Ref. SW (23007-3) 19.1 Topics 19.1.1 Conformance and Reference Software 19.2 Contributions Session Scene 19.3 Number Title m18438 Study of FCD of 23007-3 Summary of discussions m18438: 87 Source Jean-Claude Dufourd , Jean Lefeuvre , Cyril Concolato , Kyungmo Park , Jaeyeon Song Dispositions Accepted (N11587) the contribution was presented. The conformance sequences are not completed, mostly negative sequences are still missing. These will be discussed and provided or removed until the next meeting. 19.4 Action Points ISO/IEC FCD 23007-3 ballot until 2010-12-05 20 MPEG-V Architecture (23005-1) 20.1 Topics 20.1.1 ISO/IEC 23005-1 MPEG-V Architecture 20.1.2 Exploration on Haptics 20.2 Contributions Session MPEGV MPEGV MPEGV MPEGV MPEGV Number Title m18027 Proposal of Header information for binary transmission of MPEG-V information m18252 Robot Integration m18253 Additional Use Cases for MPEG-V m18508 Descriptions for social network services m18352 Path Finding 20.3 Summary of discussions 20.4 Action Points Source Kyoungro Yoon , Soohan Kim , Woo Chool Park , Hae Moon Seo Dispositions Refer 3DGC report acordova@tue.nl , jean.gelissen@philips.com jean.gelissen@philips.com Refer 3DGC report Refer 3DGC report Refer 3DGC report Sanghyun Joo (joos@etri.re.kr) jean.gelissen@philips.com , Refer 3DGC report roland@cs.uu.nl , ISO/IEC FCD 23005-1 ballot until 2010-03-02 88 21 Control Information (23005-2) 21.1 Topics 21.1.1 ISO/IEC 23005-2 Control Information 21.2 Contributions 21.3 Summary of discussions 21.4 Action Points ISO/IEC FCD 23005-2 ballot until 2010-04-14 22 Sensory Information (23005-3) 22.1 Topics 22.1.1 ISO/IEC 23005-3:201x/AMD 1 Binary representation 22.2 Contributions Session MPEGV MPEGV MPEGV MPEGV MPEGV MPEG- Number Title m18025 Proposal of GPS Sensor Capability Type, Altitude Sensor Capability Type, and Global Position Capability Type m18365 Data format for sensed information on Electrograph sensor m18377 Tactile Information m18487 Definition and application domains of new sensors m18488 Sensor capability and preference of new sensors m18489 Binary Source Kyoungro Yoon , Doohyun Kim , Min-Uk Kim Dispositions Refer 3DGC report SeungJu Han , Jae Joon Han , Won-Chul Bang , James D.K. Kim , SangKyun Kim jean.gelissen@philips.com , esko.dijk@philips.com Seong Yong Lim , Jihun Cha , Injae Lee , Youngkwon Lim , Joong Yun Lee Seong Yong Lim , Jihun Cha , Injae Lee , Youngkwon Lim , Joong Yun Lee Seong Yong Lim , Jihun Refer 3DGC report 89 Refer 3DGC report Refer 3DGC report Refer 3DGC report Refer 3DGC V MPEGV MPEGV representation of new sensors m18410 Sensed Information for gas and dust sensors m18510 sensor capability for gas and dust sensors 22.3 Summary of discussions 22.4 Action Points Cha , Injae Lee , Youngkwon Lim , Joong Yun Lee Jae Joon Han , SeungJu Han , Won-Chul Bang , James D.K. Kim , Sangkyun Kim , YongSoo Joo Jae Joon Han , SeungJu Han , Won-Chul Bang , James D.K. Kim , SangKyun Kim , YongSoo Joo report Refer 3DGC report Refer 3DGC report ISO/IEC FCD 23005-3 ballot until 2010-04-14 23 Virtual World Object Characteristics (23005-4) 23.1 Topics 23.1.1 ISO/IEC 23005-4:201x/AMD 1 Binary representation 23.2 Contributions Session MPEGV MPEGV MPEGV Number Title m18507 Additional type for Virtual Object m18509 Minor changes in MPEGV Part 4 m18536 Update of Reference SW for MPEG-V Part 4 23.3 Summary of discussions 23.4 Action Points Source Sanghyun Joo (joos@etri.re.kr) Sanghyun Joo (joos@etri.re.kr) Jae Joon Han , SeungJu Han , WonChul Bang , James D.K. Kim , Sang-Kyun Kim , Yong Soo Joo ISO/IEC FCD 23005-4 ballot until 2010-04-14 90 Dispositions Refer 3DGC report Refer 3DGC report Refer 3DGC report 24 Data formats for interaction devices (23005-5) 24.1 Topics 24.1.1 ISO/IEC 23005-5:201x/AMD 1 Additional sensors, interaction devices and binary representation 24.2 Contributions Session MPEGV Number Title m18024 Update of Reference SW for MPEG-V Part 5 24.3 Summary of discussions 24.4 Action Points Source Eunseo Lee , Bum Seok Choi, Kwang Roh Park, Kyoungro Yoon , Jonghyung Lee , Ying Ying Chen Dispositions Refer 3DGC report ISO/IEC FCD 23005-5 ballot until 2010-04-14 25 Conformance and reference software (23005-7) 25.1 Topics 25.1.1 ISO/IEC 23005-5 Reference Software 25.2 Contributions Session MPEGV Number Title m18023 Update of Reference SW for MPEG-V Part 2 MPEGV m18449 Proposed changes for validation rules of MPEGV Part 7 25.3 Summary of discussions 91 Source Bum Suk Choi , Eunseo Lee , Jong Hyun Jang , Kyoungro Yoon , Jonghyung Lee , Ying Ying Chen Markus Waltl , Christian Timmerer Dispositions Refer 3DGC report Refer 3DGC report 25.4 Action Points ISO/IEC FCD 23005-7 ballot is note started yet. 25.5 Summary of discussions 25.6 Action Points 26 Exploration – MMT 26.1 Topics • • • • • Transport- and file format friendly stream format Cross layer optimization between video and transport layer Error resilience for MPEG streams Conversion between transport mechanisms Content adaptation to different networks 26.2 Contributions 26.3 Summary of discussions 26.4 Action Points CfP 27 Liaison 27.1 List of input liaison letters Session Number Title Plenary m17985 Liaison Statement from ITU-T SG 16 on Approval 92 Source ITU-T SG 16 via SC 29 Secretariat Dispositions Jean G. Plenary m17989 Plenary m17993 Plenary m18006 Plenary m18020 Plenary m18021 Plenary m18022 of a New Question on Telepresence systems [SC 29 N 11508] IEC CDV 62516-2 [SC 29 N 11538] Liaison Statement from Digital TV Group [SC 29 N 11551] Liaison Statement from 3GPP Liaison Statement from ISO/IEC JTC 1/WG 7 [SC 29 N 11560] Liaison Statement from IEC TC 100 [SC 29 N 11561] Liaison Statement from JTC 1/SC 34 [SC 29 N 11563] Plenary m18359 Plenary m18550 IEC TC 100 via SC 29 JC Secretariat DTG via SC 29 Iraj Secretariat 3GPP via SC 29 Secretariat JTC 1/WG 7 via SC 29 Secretariat Iraj Young IEC TC 100 via SC 29 Young Secretariat JTC 1/SC 34 via SC 29 Secretariat Young Sam Sam 93 28 References 28.1 Timeline of standards under development P r P t Edit . Projec t 2 1 AMD5 Transport of JPEG2000 2 1 4 5 4 5 4 5 4 5 4 5 4 1 2 1 2 1 5 1 5 2 0 2 2 2 2 2 8 1 2 1 2 2 200 7 200 7 200 1 200 1 200 1 200 x 200 x 200 x 200 8 201 0 201 0 200 9 200 8 200 8 200 8 200 8 200 8 200 2 200 6 200 4 4 4 4 4 4 4 7 7 2 1 2 1 2 4 8 CfP Description WD CD DCO R PDA M 09/10 DIS FPDA M DAM FDIS FDA M COR 10/07 11/01 09/10 10/07 11/01 AMDx x AMDx x AMDx x AMD2 8 AMD2 9 AMD2 AVC File Format Ref. Soft 09/0 7 09/0 7 TBS SVC File Format Ref. Soft TBS MVC File Format Ref. Soft TBS Reference software for LASeR adaptation tools Reference software for PMSI for LASeR Sub-track selection & switching 10/07 11/01 11/07 10/01 10/07 11/01 COR1 Minor corrections 10/07 11/01 COR1 Minor corrections 10/07 11/01 AMD6 MVC operation point descriptor 10/10 AMD1 Sub-track definition 10/07 11/01 COR1 10/07 AMD1 Support for many-to-one range mappings COR1 1st Composite Font Format 09/04 Presentation element Protection of Presentation Information MVCO Reference Software 94 09/10 10/01 10/0 7 AMD1 MPQF reference software . AMD2 Semantic enhancement AMD1 . AMD2 . AMD2 11/07 10/1 0 09/0 7 08/1 0 08/1 0 10/07 10/07 11/01 11/07 12/01 09/07 09/10 10/07 09/10 10/07 11/01 09/04 10/07 11/01 09/04 10/07 11/01 10/07 11/01 11/07 1 2 1 A 2 0 5 A 6 A 9 A B 1 1 1 1 1 2 6 M 1 M 2 M 3 M 4 M 5 U 2 U 3 V 1 V 1 V 2 V 2 V 3 V 3 V 4 V 4 V 5 A A 8 200 x 200 x 200 9 200 8 200 x 200 x 201 0 200 x 200 x 200 x 200 x 200 x 200 x 200 x 200 x 200 x 200 x 200 x 200 x 200 x 200 x 200 x 200 x 200 x 1st ed. Contract Expression Language 10/0 7 08/0 1 2nd Ed. MS AF COR1 PA AF AMD1 Minor correction /COR1 AMD1 SVAF Ref. Soft. And Conf. 09/0 2 09/0 7 AMD2 Additional composition type AMD1 IM AF Ref. SW and Conformance st 1 ed. Dynamic Adaptive Streaming over HTTP nd 2 ed. Architecture 10/0 4 10/0 7 10/10 11/01 11/07 09/02 10/01 10/07 10/01 10/07 10/07 11/01 09/07 09/10 10/10 10/07 11/01 11/07 10/01 10/07 11/01 10/10 11/01 11/07 10/10 11/01 11/07 2nd ed. MXM APIs 10/10 11/01 11/07 2nd ed. Reference SW & Conformance 10/10 11/01 11/07 2nd ed. Elementary Service Protocol 10/10 11/01 11/07 2nd ed. Service Aggregation 10/10 11/01 11/07 1st ed. 10/07 11/01 11/07 1st ed. Advanced User Interaction Interface Conf. & Ref. SW 10/01 10/07 11/01 200x Architecture 09/07 09/10 10/07 11/01 11/07 12/01 09/07 09/10 10/07 11/01 11/07 12/01 09/07 09/10 10/07 11/01 11/07 12/01 09/07 09/10 10/07 11/01 11/07 12/01 09/07 09/10 10/07 AMD1 Streaming Support 200x Control Information AMD1 Additional Control Information 200x Sensory Information AMD1 Binary representation 200x Virtual world object characteristics AMD1 Binary representation 200x Data formats for interaction devices 95 10/0 7 10/0 7 08/1 0 09/0 4 09/0 2 10/0 7 09/0 2 10/0 7 09/0 2 10/0 7 09/0 2 10/0 7 09/0 2 V 5 V 6 V 7 200 x 200 x 200 x 28.2 Pr AMD1 Additional sensors and interaction devices 200x Common types and tools 200x Conformance testing and reference software 10/0 7 09/0 2 09/0 2 11/01 11/07 12/01 09/07 09/10 10/07 10/01 10/07 11/01 Standing Documents 1 1 1 Pt 1 1 1 No. N7675 N7676 N7677 Meeting 05/07 Nice 05/07 Nice 05/07 Nice N7678 N7679 N7680 05/07 Nice 05/07 Nice 05/07 Nice 11 1 1 1 1 6 11 12 14 15 13 13 17 18 20 Documents MPEG-1 White Paper – Multiplex Format MPEG-1 White Paper – Terminal Architecture MPEG-1 White Paper – Multiplexing and Synchronization MPEG-2 White Paper – Multiplex Format MPEG-2 White Paper – Terminal Architecture MPEG-2 White Paper – Multiplexing and Synchronization MPEG-2 White Paper – MPEG-2 IPMP MPEG-4 White Paper – MPEG-4 Systems MPEG-4 White Paper – Terminal Architecture MPEG-4 White Paper – M4MuX MPEG-4 White Paper – OCI MPEG-4 White Paper – DMIF MPEG-4 White Paper – BIFS MPEG-4 White Paper – ISO File Format MPEG-4 White Paper – MP4 File Format MPEG-4 White Paper – AVC FF White Paper on MPEG-4 IPMP MPEG IPMP Extensions Overview White Paper on Streaming Text White Paper on Font Compression and Streaming Presentation Material on LASER 2 2 2 1 1 1 2 4 4 4 4 4 4 4 4 4 4 4 4 4 4 N7503 N7504 N7610 N7921 N8148 N8149 N7608 N8150 N7923 N7924 N7505 N6338 N7515 N7508 N6969 4 4 7 7 21 A A 20 22 1 1 9 X X White Paper on LASeR White Paper on Open Font Format MPEG-7 White Paper - MPEG-7 Systems MPEG-7 White Paper – Terminal Architecture MPEG-21 White Paper – MPEG-21 File Format MPEG Application Format Overview MAF Overview Document N7507 N7519 N7509 N8151 N7925 N9421 N9840 A X MAF Overview Presentation N9841 B E X X N7922 N6335 E X MPEG-B White Paper – BinXML MPEG Multimedia Middleware Context and Objectives 1rst M3W White paper 05/07 Poznan 05/07 Poznan 05/10 Nice 06/01 Bangkok 06/04 Montreux 06/04 Montreux 05/10 Nice 06/04 Montreux 06/01 Bangkok 06/01 Bangkok 05/07 Poznan 04/03 München 05/07 Poznan 05/07 Poznan 05/01 HongKong 05/07 Poznan 05/07 Poznan 05/07 Poznan 06/04 Montreux 06/01 Bangkok 07/10 Shenzhen 08/04 Archamps 08/04 Archamps 06/01 Bangkok 04/03 München N7510 05/07 Poznan 96 E E E X X X E E E X X X E E E X X X 2nd M3W White Paper : Architecture Tutorial on M3W M3W White Paper : Multimedia Middleware Architecture M3W White Paper : Multimedia API M3W White Paper : Component Model M3W White Paper : Resource and Quality Management M3W White Paper : Component Download M3W White Paper : Fault Management M3W White Paper : System Integrity Management 97 N8152 N8153 N8687 06/04 Montreux 06/04 Monreux 06/10 Hanzhou N8688 N8689 N8690 06/10 Hanzhou 06/10 Hanzhou 06/10 Hanzhou N8691 N8692 N8693 06/10 Hanzhou 06/10 Hanzhou 06/10 Hanzhou 28.3 Mailing Lists Reminder Topic General Systems List File Format Scene Representati on AF AIT MPEG-V MXM MPEG Media Transport HTTP Delivery Information Reflector : gen-sys@lists.uni-klu.ac.at Subscribe: http://lists.uni-klu.ac.at/mailman/listinfo/gen-sys Archive: http://lists.uni-klu.ac.at/mailman/private/gen-sys/ Reflector : mp4-sys@lists.uni-klu.ac.at Subscribe: http://lists.uni-klu.ac.at/mailman/listinfo/mp4-sys Archive: http://lists.uni-klu.ac.at/mailman/private/mp4-sys/ Reflector : mpeg-laser@lists.uni-klu.ac.at Subscribe: http://lists.uni-klu.ac.at/mailman/listinfo/mpeglaser Archive: http://lists.uni-klu.ac.at/pipermail/mpeg-laser/ Reflector : maf-sys@lists.uni-klu.ac.at Subscribe: http://lists.uni-klu.ac.at/mailman/listinfo/maf-sys Archive: http://lists.uni-klu.ac.at/mailman/private/maf-sys/ Reflector: jiptv@lists.uni-klu.ac.at Subscribe: http://lists.uni-klu.ac.at/mailman/listinfo/jiptv Archive: http://lists.uni-klu.ac.at/mailman/private/jiptv/ Reflector: metaverse@lists.uni-klu.ac.at Subscribe: http://lists.uniklu.ac.at/mailman/listinfo/metaverse Archive: http://lists.uni-klu.ac.at/mailman/private/metaverse/ Reflector: mxm@lists.uni-klu.ac.at Subscribe: http://lists.uni-klu.ac.at/mailman/listinfo/mxm Archive: http://lists.uni-klu.ac.at/mailman/listinfo/mxm Reflector: mmt@tnt.uni-hannover.de Subscribe: https://mailhost.tnt.unihannover.de/mailman/listinfo/mmt Reflector: mmt@tnt.uni-hannover.de Subscribe: https://mailhost.tnt.unihannover.de/mailman/listinfo/mmt 98 Kindly Hosted by Klagenfur t University Klagenfur t University Klagenfur t University Klagenfur t University Klagenfur t University Klagenfur t University Klagenfur t University University of Hannover University of Hannover 28.4 Latest References and Publication Status Reference on the ISO Web Site : http://www.itscj.ipsj.or.jp/sc29/open/29view/29n9270c.htm Pr Pt Standard No. 2 2 1 1 ISO/IEC 13818-1:2000 (MPEG-2 Systems 2nd Edition) ISO/IEC 13818-1:2000/COR1 (FlexMux Descr.) N3844 01/01 Pisa 2 1 ISO/IEC 13818-1:2000/COR2 (FlexMuxTiming_ descriptor) N4404 01/12 Pattaya 2 1 ISO/IEC 13818-1:2000/Amd.1 (Metadata on 2) & COR1 on Amd.1 N5867 2 1 ISO/IEC 13818-1:2000/Amd.2 (Support for IPMP on 2) N5604 03/07 Trondheim 03/03 Pattaya 2 1 ISO/IEC 13818-1:2000/Amd.3 (AVC Carriage on MPEG-2) N5771 2 1 ISO/IEC 13818-1:2000/Amd.4 (Metadata Application CP) N6847 2 1 ISO/IEC 13818-1:2000/Amd.5 (New Audio P&L Sig.) N6585 2 1 ISO/IEC 13818-1:2000/COR3 (Correction for Field Picture) N6845 04/07 Redmond 04/10 Palma 2 1 ISO/IEC 13818-1:2000/COR4 (M4MUX Code Point) N7469 05/07 Poznan 2 1 ISO/IEC 13818-1:2000/COR5 (Corrections related to 3rd Ed.) N7895 06/01 Bangkok 2 2 1 1 ISO/IEC 13818-1:2007 (MPEG-2 Systems 3rd Edition) ISO/IEC 13818-1:2007/Amd.1 (Transport of Streaming text) N8369 2 1 ISO/IEC 13818-1:2007/Amd.2 (Carriage of Auxialiry Video Data) N8798 06/07 Klagenfurt 07/01 Marrakech 99 Date Status Doc. With 00/12 Published Consolidate d Consolidate d Consolidate d Consolidate d Consolidate d Consolidate d Consolidate d Consolidate d Consolidate d Consolidate d Published Published N/A N/A ISO Award Proposed N/A N/A N/A N/A Proposed N/A N/A N/A Proposed N/A N/A N/A N/A N/A N/A N/A N/A N/A N/A N/A N/A TBP TBP Published N/A TBP 03/07 Trondheim 04/10 Palma Purpose 2 1 ISO/IEC 13818-1:2007/Cor.1.2 (Reference to AVC Specification) N9365 2 1 ISO/IEC 13818-1:2007/Cor.3 2 1 ISO/IEC 13818-1:2007/Amd.3 (SVC in MPEG-2 Systems) 2 1 ISO/IEC 13818-1:2007/Amd.3/Cor.1 2 1 ISO/IEC 13818-1:2007/Amd.4 (Transport of Multiview Video) 2 4 11 1 ISO/IEC 13818-1:2003 (IPMP on 2) ISO/IEC 14496-1 (MPEG-4 Systems 1st Ed.) N1093 7 N1005 8 N1093 8 N1074 5 N5607 N2501 4 1 ISO/IEC 14496-1/Amd.1 (MP4, MPEG-J) N3054 99/12 Hawaii 4 1 ISO/IEC 14496-1/Cor.1 N3278 4 1 ISO/IEC 14496-1:2001 (MPEG-4 Systems 2nd Ed.) N3850 00/03 Noordwijk. 01/01 Pisa 4 1 ISO/IEC 14496-1:2001/Amd.1 (Flextime) 4 1 ISO/IEC 14496-1:2001/Cor.1 N4264 01/07 Sydney 4 1 ISO/IEC 14496-1:2001/Cor.2 N5275 02/10 Shangai 4 1 ISO/IEC 14496-1:2001/Cor.3 N6587 4 1 ISO/IEC 14496-1:2001/Amd.2 (Textual Format) N4698 4 1 ISO/IEC 14496-1:2001/Amd.3 (IPMP Extensions) N5282 4 1 ISO/IEC 14496-1:2001/Amd.4 (SL Extension) N5471 04/07 Redmond 02/03 Jeju Island 02/10 Shanghai 02/12 Awaji 100 07/10 Shenzhen 09/10 Xian Published N/A TBP COR ITTF 08/07 Hannover 09/10 Xian Published N/A COR ITTF to be published 09/07 London FDAM ITTF to be published 03/03 Pattaya 98/10 Atl. City Published Consolidate d Consolidate d Consolidate d Consolidate d Consolidate d Consolidate d Consolidate d Consolidate d Consolidate d Consolidate d Consolidate N/A N/A Proposed Done N/A Done N/A N/A N/A N/A N/A Done N/A N/A N/A N/A N/A N/A N/A N/A N/A N/A N/A N/A to be published 4 1 ISO/IEC 14496-1:2001/Amd.7 (AVC on 4) N5976 4 1 ISO/IEC 14496-1:2001/Amd.8 (ObjectType Code Points) N6202 03/10 Brisbanne 03/12 Hawaii 4 1 ISO/IEC 14496-1:200x/Amd.1 (Text Profile Descriptors) N7229 05/04 Busan 4 1 ISO/IEC 14496-1:200x/Cor4 (Node Coding Table) N7473 05/07 Poznan 4 1 N5277 4 1 ISO/IEC 14496-1:200x/Amd.1 (Text Profile Descriptors) N7229 02/10 Shanghai 05/04 Busan 4 1 ISO/IEC 14496-1:200x/Cor.1 (Clarif. On audio codec behavior) N8117 4 1 ISO/IEC 14496-1:200x/Amd.2 (3D Profile Descriptor Extensions) N8372 4 1 ISO/IEC 14496-1:200x/Cor.2 (OD Dependencies) N8646 4 1 ISO/IEC 14496-1:200x/Amd.3 (JPEG 2000 support in Systems) N8860 4 1 ISO/IEC 14496-1 (MPEG-4 Systems 4th Ed.) 4 1 ISO/IEC 14496-1:2010/Amd.1 (Usage of LASeR in MPEG-4 systems and Registration Authority for MPEG-4 descriptors) 4 4 ISO/IEC 14496-1:200x/Amd.17 (ATG Conformance) N1094 3 N1124 8 N8861 4 4 ISO/IEC 14496-1:200x/Amd.22 (AudioBIFS v3 conformance) N9295 4 4 ISO/IEC 14496-1:200x/Amd.23 (Synthesized Texture conformance) N9369 4 4 ISO/IEC 14496-1:200x/Amd.24 (File Format Conformance) N9370 ISO/IEC 14496-1 (MPEG-4 Systems 3rd Ed.) 101 06/04 Montreux 06/07 Klagenfurt 06/10 Hangzhou 07/01 Marrakech 09/10 Xian d Consolidate d Consolidate d Consolidate d Consolidate d Consolidate d Consolidate d Consolidate d Consolidate d Consolidate d Consolidate d Published N/A N/A N/A N/A N/A N/A N/A N/A N/A Proposed N/A N/A N/A N/A N/A N/A N/A N/A N/A N/A N/A 10/04 Dresden Published N/A N/A 07/01 Marrakech 07/07 Lausanne 07/10 Shenzhen 07/10 Published N/A N/A Published N/A N/A Published N/A N/A Published N/A N/A 4 Shenzhen 07/10 Shenzhen 08/04 Archamps 08/04 Archamps 09/07 London 4 ISO/IEC 14496-1:200x/Amd.25 (LASeR V1 Conformance) N9372 4 4 ISO/IEC 14496-1:200x/Amd.26 (Open Font Format Conf.) N9815 4 4 ISO/IEC 14496-1:200x/Amd.27 (LASeR Amd.1 Conformance) N9816 4 4 ISO/IEC 14496-1:200x/Amd.37 (Additional File Format Conformance) 4 4 4 4 4 4 5 5 5 6 8 11 N1075 0 N9020 N9672 N9674 4 11 ISO/IEC 14496-11/Amd.1 (AFX) N5480 02/03 Jeju 05/01 HongKong 02/12 Awaji 4 11 ISO/IEC 14496-11/Amd.2 (Advanced Text and Graphics) N6205 03/12 Hawaii 4 11 ISO/IEC 14496-11/Cor.1 N6203 03/12 Hawaii 4 11 ISO/IEC 14496-11/Cor.3 Valuator/AFX related correction N6594 4 11 ISO/IEC 14496-11/Amd.3 Audio BIFS Extensions N6591 4 11 ISO/IEC 14496-11/Amd.4 XMT and MPEG-J Extensions N6959 4 11 ISO/IEC 14496-11/Cor.3 (Audio BIFS Integrated in 3 rd Edition) N7230 4 11 ISO/IEC 14496-11/Cor.5 (Misc Corrigendum) N8383 ISO/IEC 14496-1:200x/Amd.12 (File Format) ISO/IEC 14496-1:200x/Amd.16 (SMR Ref. Soft) ISO/IEC 14496-1:200x/Amd.17 (LASeR Ref. Soft) ISO/IEC 14496-6:2000 ISO/IEC 14496-8 (MPEG-4 on IP Framework) ISO/IEC 14496-11 (MPEG-4 Scene Description 1st Edition) N4712 N6960 102 07/04 San Jose 08/01 Antalya 08/01 Antalya 04/07 Redmond 04/07 Redmond 05/01 HongKong 05/04 Busan 06/07 Klagenfurt Published N/A N/A Published N/A N/A Published N/A N/A Published N/A PDAM PDAM PDAM Published Published Published ITTF ITTF ITTF N/A N/A N/A Consolidate d Consolidate d Consolidate d Consolidate d Consolidate d Consolidate d Consolidate d Published N/A N/A N/A N/A N/A N/A N/A N/A N/A Proposed N/A N/A N/A N/A N/A N/A to be published to be published to be published N/A N/A N/A N/A Proposed Proposed 4 11 4 4 11 11 ISO/IEC 14496-11/Amd.6 Scene Partitioning 4 11 ISO/IEC 14496-11/Amd.7 ExtendedCore2D Profile 4 12 ISO/IEC 14496-12 (ISO Base Media File Format) N9021 N1024 7 N1125 1 N5295 4 12 ISO/IEC 14496-12/Amd.1 ISO FF Extension N6596 4 12 ISO/IEC 14496-12/Cor.1 (Correction on File Type Box) N7232 4 12 ISO/IEC 14496-12/Cor.2 (Miscellanea) N7901 4 12 N8659 4 12 ISO/IEC 14496-12/Amd.1 (Description of timed metadata) ISO/IEC 14496-12/Cor.3 (Miscellanea) N9024 06/01 Bangkok 06/10 Hangzhou 07/04 San Jose 4 12 ISO/IEC 14496-12/Amd.2 (Flute Hint Track) N9023 07/04 San Jose 4 12 N 4 12 ISO/IEC 14496-12 (ISO Base Media File Format 3rd edition) ISO/IEC 14496-12/Cor.1 4 12 ISO/IEC 14496-12/Cor.2 4 12 4 12 ISO/IEC 14496-12/Amd.1 General improvements including hint tracks, metadata support, and sample groups ISO/IEC 14496-12/Cor.3 ISO/IEC 14496-11/Amd.5 Symbolic Music Representation ISO/IEC 14496-11/Cor.6 (AudioFx Correction) N8657 06/10 Hangzhou 07/04 San Jose 08/10 Busan Published N/A TBP Published Published N/A N/A N/A TBP 10/04 Dresden FDAM SC29 Published N/A Proposed Consolidate d Consolidate d Consolidate d Consolidate d Consolidate d Consolidate d Published N/A N/A N/A N/A N/A N/A N/A N/A N/A N/A N/A N/A 02/10 Shanghai 04/07 Redmond 05/04 Busan N1025 0 N1044 1 N1058 0 08/10 Busan COR N/A N/A 09/02 Lausanne 09/04 Maui COR N/A N/A FDAM N/A N/A N1075 3 09/07 London COR N/A N/A 103 4 13 ISO/IEC 14496-13 (IPMP-X) N5284 4 14 ISO/IEC 14496-14:2003 (MP4 File Format) N5298 4 14 ISO/IEC 14496-14:2003/Cor.1 (Audio P&L Indication) N7903 4 14 4 15 ISO/IEC 14496-14:2003/Amd.1 Handling of MPEG-4 audio enhancement layers ISO/IEC 14496-15 (AVC File Format) N1113 8 N5780 4 15 ISO/IEC 14496-15/Amd.1 (Support for FREXT) N7585 4 4 15 15 ISO/IEC 14496-15/Cor.1 ISO/IEC 14496-15/Cor.2 (NAL Unit Restriction) N7575 N8387 4 4 15 15 ISO/IEC 14496-15/Amd.2 (SVC File Format Extension) ISO/IEC 14496-15 (AVC File Format 2nd edition) 4 4 4 17 18 18 4 4 19 20 ISO/IEC 14496-17 (Streaming Text) ISO/IEC 14496-18 (Font Compression and Streaming) ISO/IEC 14496-18/Cor.1 (Misc. corrigenda and clarification) ISO/IEC 14496-19 (Synthesized Texture Stream) ISO/IEC 14496-20 (LASeR & SAF) N9682 N1113 9 N7479 N6215 N8664 4 20 4 N6217 N7588 N8666 20 ISO/IEC 14496-20/Cor.1 (Misc. corrigenda and clarification) ISO/IEC 14496-20/Amd.1 (LASeR Extension) 4 20 ISO/IEC 14496-20/Cor.2 (Profile Removal) N9381 4 20 ISO/IEC 14496-20/Amd.2 (SVGT1.2 Support) N9384 N9029 104 02/10 Shanghai 02/10 Shanghai 06/01 Bangkok 10/01 Kyoto 03/07 Trondheim 05/10 Nice 05/10 Nice 06/07 Klagenfurt 08/01 Antalya 10/01 Kyoto 05/07 Poznan 03/12 Hawaii 06/10 Hangzhou 03/12 Hawaii 05/10 Nice 06/10 Hangzhou 07/04 San Jose 07/10 Shenzhen 07/10 to be published Proposed IS ITTF Published N/A Proposed Published N/A N/A Published N/A Published N/A FDAM ITTF COR COR ITTF ITTF N/A N/A FDAM Published ITTF N/A N/A Proposed FDAM Published COR ITTF N/A ITTF TBP Proposed N/A Published Consolidate d Consolidate d Consolidate d Consolidate d Consolidate N/A N/A Proposed TBP N/A N/A N/A N/A N/A N/A N/A N/A Proposed Final Text Editing N/A Shenzhen 2nd 4 4 4 20 20 20 ISO/IEC 14496-20 (LASeR & SAF edition) ISO/IEC 14496-20/Amd.1 SVGT1.2 support ISO/IEC 14496-20/Amd.2 Adaptation 4 20 ISO/IEC 14496-20/Amd.3 PMSI 4 20 ISO/IEC 14496-20/Cor. 1 4 22 ISO/IEC 14496-22 (Open Font Format) 4 4 22 22 ISO/IEC 14496-22 (Open Font Format 2nd edition) ISO/IEC 14496-20/Amd.3 PMSI 7 7 7 7 7 7 7 1 1 1 1 1 2 7 ISO/IEC 15938-1 (MPEG-7 Systems) 7 N N N1075 9 N1095 4 N1137 6 N8395 ISO/IEC 15938-7/Amd.2 (Fast Access Ext. Conformance) N N1095 4 N4285 N6326 N6328 N7490 N7532 N4288 N8672 12 ISO/IEC 15938-12 MPEG Query Format N9830 7 12 ISO/IEC 15938-12/Cor.1 7 12 ISO/IEC 15938-12/Cor.2 21 21 5 9 ISO/IEC 21000-5 (Open Release Content Profile) ISO/IEC 21000-9 (MPEG-21 File Format) N1045 2 N1095 9 N9687 N6975 21 9 ISO/IEC 21000-9/Amd.1 (MPEG-21 Mime Type) N9837 ISO/IEC 15938-1/Amd.1 (MPEG-7 Systems Extensions) ISO/IEC 15938-1/Cor.1 (MPEG-7 Systems Corrigendum) ISO/IEC 15938-1/Cor.2 (MPEG-7 Systems Corrigendum) ISO/IEC 15938-1/Amd.2 (BiM extension) ISO/IEC 15938-2 (MPEG-7 DDL) 105 09/07 London d Published Published Published N/A N/A N/A TBP N/A N/A 09/10 Xian Published N/A N/A COR ITTF N/A Published N/A TBP Published Published N/A N/A TBP N/A 01/07 Sydney 04/03 Munich 04/03 Munich 05/07 Poznan 05/10 Nice 01/07 Sydney 06/10 Hangzhou 08/04 Archamps 09/02 Lausanne 09/10 Xian Published Published Published Published Published Published Published N/A N/A N/A N/A N/A N/A N/A Done N/A N/A N/A N/A Done N/A Published N/A N/A Published N/A N/A Published N/A N/A 08/01 Antalya 05/01 HongKong 08/04 Published Published N/A N/A TBP Done Published N/A Done 10/07 Geneva 06/07 Klagenfurt 09/10 Xian 21 15 ISO/IEC 21000-15 (Security in Event Reporting) N9839 21 21 16 19 ISO/IEC 21000-16 (MPEG-21 Binary Format) ISO/IEC 21000-19 (Media Value Chain Ontology) A 4 ISO/IEC 23000-4 (Musical Slide Show MAF) N7247 N1114 6 N9037 A 4 ISO/IEC 23000-4 (Musical Slide Show MAF 2nd Ed.) N9843 A 4 A 4 A A 6 6 A A 7 7 A 8 ISO/IEC 23000-4 Amd.1 Conformance & Reference Software ISO/IEC 23000-4 Amd.2 Conformance & Reference Software for Protected MSS MAF ISO/IEC 23000-6 (Professional Archival MAF) ISO/IEC 23000-6 Amd.1 Conformance and Reference Software ISO/IEC 23000-7 (Open Access MAF) ISO/IEC 23000-7 Amd.1 Conformance and Reference Software ISO/IEC 23000-8 (Portabe Video AF) A 9 ISO/IEC 23000-9 (Digital Multi. Broadcasting MAF) N9397 A 9 N9854 A 9 A 10 ISO/IEC 23000-9/Cor.1 (Digital Multi. Broadcasting MAF) ISO/IEC 23000-9/Amd.1 (Conformance & Reference SW) ISO/IEC 23000-10 (Video Surveillance AF) A A 10 11 ISO/IEC 23000-11/Amd.1 Conformance & Reference SW N ISO/IEC 23000-11 (Stereoscopic video MAF) N9397 Archamps 08/04 Archamps 05/04 Busan 10/01 Kyoto Published N/A TBP Published Publsihed N/A N/A TBP TBP Consolidate d FDIS N/A TBP ITTF TBP N N/A TBP N N/A TBP 07/04 San Jose 08/04 Archamps TBP TBP N N N9698 N 08/01 Antalya FDIS ITTF TBP TBP N9853 08/04 Archamps 07/10 Shenzhen 08/04 Archamps FDIS ITTF TBP FDIS ITTF TBP N N9397 106 07/10 Shenzhen 07/10 Published TBP FDIS SC29 TBP FDIS ITTF TBP Published FDIS ITTF TBP TBP Shenzhen A A B B 11 12 1 1 B 1 B 1 B 1 B B E ISO/IEC 23000-11/Cor.1 ISO/IEC 23000-11 (Interactive Music AF) ISO/IEC 23001-1 (XML Binary Format) ISO/IEC 23001-1/Cor.1 (Misc. Editorial and technical clar.) ISO/IEC 23001-1/Cor.2 (Misc. Editorial and technical clar.) ISO/IEC 23001-1/Amd.1 (Reference Soft. & Conf.) N N N7597 N8680 N9296 2 3 1 ISO/IEC 23001-1/Amd.1 (Exten. On encoding of wild cards) ISO/IEC 23001-2 (Fragment Request Unit) ISO/IEC 23001-3 (IPMP XML Messages) ISO/IEC 23008-1 Architecture E 2 ISO/IEC 23008-2 Multimedia API N8893 E 3 ISO/IEC 23008-3 Component Model N8894 E 4 ISO/IEC 23008-4 Ressource & Quality Management N8895 E E E E M 5 6 7 8 1 ISO/IEC 23008-5 Component Download ISO/IEC 23008-6 Fault Management ISO/IEC 23008-7 System Integrity Management ISO/IEC 23008-7 Reference Software ISO/IEC 23006-1 Architecture and Technologies M 2 ISO/IEC 23006-2 MXM API M 3 ISO/IEC 23006-3 Reference Software N9053 N9054 N9055 N N1116 3 N1116 5 N1116 8 N9049 N8886 N9051 N9416 N8892 107 Published Published FDIS COR ITTF ITTF TBP TBP TBP N/A COR ITTF N/A FDAM ITTF N/A PDAM ITTF FDIS FDIS FDAM ITTF ITTF ITTF TBP TBP N/A FDAM ITTF N/A FDAM ITTF N/A FDAM ITTF N/A FDAM FDAM FDAM ITTF ITTF ITTF 10/01 Kyoto FDIS Editor N/A N/A N/A N/A N/A 10/01 Kyoto FDIS Editor N/A 10/01 Kyoto FDIS Editor N/A 05/10 Nice 06/10 Hangzhou 07/04 San Jose 07/01 Marrakech 07/07 Lausanne 07/04 San Jose 07/04 San Jose 07/01 Marrakech 07/01 Marrakech 07/01 Marrakech 07/01 Marrakech 07/04 San Jose 07/04 San Jose 07/04 San Jose to be published N/A M 4 ISO/IEC 23006-4 MXM Protocols U 1 ISO/IEC 23007-1 Widgets N1117 0 N1125 6 108 10/01 Kyoto FDIS SC29 N/A 10/04 Dresden FDIS SC29 N/A Annex G– Video report Source: Jens Ohm and Gary Sullivan, Chairs 1 MPEG-2 video During the July 2010 meeting, it had been requested to define a similar approach for MPEG-2 Video as was defined for AVC using an SEI message supporting "frame compatible" stereo video with various frame packing arrangements. It was suggested to enable this capability as a user data extension and to put a 4-byte ID after the user_data_start_code to avoid conflicts with existing systems. To initiate further study on this, document N11462 was issued, and NBs were requested to study and comment on the viability of this solution or to propose better solutions. Two comments by NBs were received. From these, the concept of the solution of N11462 appears to be a viable way, keeping compatibility with existing devices and simplifying the launch of new emerging services in that area. To proceed to the next step, PDAM4 was issued. The plan is to proceed to FPDAM 11/01, FDAM 11/07. The method of signaling allows left/right or top/bottom placement of the two stereo views, and switching between mono and stereo within one stream. It is planned to provide a new edition of MPEG-2 part 2 after finalization of this amendment. Document(s) reviewed: m18196 m18030 JNB comments on Resolution 2.2.2 (3D Signaling in MPEG-2 video) USNB Contribution: WG11 resolution 2.2.2 consideration Document(s) approved: No. Title 11612 Request for ISO/IEC 13818-2:2000/Amd.4 11613 Text of ISO/IEC 13818-2:2000/PDAM4 Frame Packing Arrangement Signalling in MPEG-2 Video Japan National Body A. G. Tescher for USNB TBP Available N 10/10/15 N 10/10/29 2 MPEG-4 part 2 Software During several periods of AHG work, various corrections had been provided to the MPEG-4 part 2 reference software, which is located in the MPEG SVN server. As more recently no new bugs had been detected, it can be assumed that the software has reached a mature status. Therefore, it was decided to modify the latest published version by an appropriate corrigendum. Document(s) reviewed: m17965 Ad Hoc on Maintenance of MPEG-4 Visual related Documents, Reference Software and Conformance Document(s) approved: No. Title 11614 Text of ISO/IEC 14496-5:2001/Amd.5 DCOR1 109 Yi-Shin Tung, Jens-Rainer Ohm TBP Available N 10/10/15 3 Development of AVC The video subgroup held two discussion sessions together with VCEG (ITU-T Q6/SG16) to review input documents related to AVC. The main topics of discussion were a) Potential corrigendum issues (follow-up of the defect report N11465 issued during the July 2010 meeting). b) Establishment of a “Progressive High Profile”. Regarding the corrigendum work, the most critical issue was the clarification of the positioning indicators for the video data grid related to the AVC frame packing arrangement SEI message. A solution was found that appears to be compatible with existing implementations, which was to designate the (0,0) coordinate position as a special case to be treated as indicating the same positioning as would be indicated by the positioning that had been shown in the example figures. A need for clarification of the intended buffering capacity requirements for MVC was also identified, discussed and agreed. Regarding 2D content in 3D application deployments – it was remarked that this should be considered for an amendment/extension activity rather than a corrigendum activity. A new version of the defect report was issued, which was made publicly available. More editorial work is needed on various items before it can be issued as DCOR. Following the plan established at the Geneva meeting, a PDAM was issued for defining a new Progressive High Profile. NB comments were requested on issues of level 4.2 restrictions and a potential Progressive Main Profile. No technical changes were made relative to the previous WD. Instead of defining a new conformance point via a new value of profile_idc, the specification uses the constraint_set4_flag in a manner similar to its use for multiview/interlaced Stereo High profile cases. One contribution (m18337) requested for the Progressive High Profile to impose further restrictions on level 4.2 and higher levels. Specifically, it was proposed that a maximum number of slices per row of macroblocks be established, with a maximum value of two slices per row, and that no predictions blocks smaller than 8x8 should be allowed. These issues require further study on their possible impact and desirability; therefore NBs were requested by a resolution to comment. The specification of a new, similarly-constructed “Progressive Main Profile” was also suggested in the same contribution. Document(s) reviewed: m18197 m18031 m18337 JNB comments on N11465 about frame packing arrangement SEI message Request: (0,0) should be compatible with figures USNB Contribution: WG 11 resolution 3.3.2 consideration Request: (0,0) should be "undefined" (in conflict with JNB) – a compromise was drafted in which (0,0) would indicate "default" positions as illustrated. Various issues on MVC Level Limits for Progressive High Profile Restrict number of slices (2) in rows of macroblocks for level 4.2 and higher Restrict level 4.2 and higher to only 8x8 prediction blocks Specify Progressive main profile as well 110 Japan National Body A. G. Tescher for USNB Aaron Wells m18338 On the AVC frame packing arrangement SEI message Sample aspect ratio 2D content in 3D signaling (comment: this should be an amendment rather than corrigendum) Grid position indicator Semantics sentence placement G. J. Sullivan (Microsoft) m18339 Miscellaneous AVC (ITU-T H.264 | ISO/IEC 14496-10) errata issues G. J. Sullivan (Microsoft), H. Schwarz (Fraunhofer HHI) Document(s) approved: No. Title 11615 Defect report on ISO/IEC 14496-10:201x 11616 Text of ISO/IEC 14496-10:201x/PDAM1 Progressive High Profile TBP Available Y 10/10/31 N 10/10/15 4 MPEG-7 Visual The main activities in MPEG-7 Visual technical work were related to the proceeding of software and conformance amendments for video signature tools into FPDAM status, and issuing a Study of PDAM 6 of 15938-8 (extraction and matching) related to video signature tools. As expressed by a resolution of the Geneva meeting, software related to indexing for fast matching that had been included in XM 39 was still to be provided. After this was done, the related technology was included in the description of the Study document. An AHG was appointed (N11626, as recorded in N11554) to further the work until the 95th meeting. The mandates relate to maintenance of the various MPEG-7 Visual related parts. Document(s) approved: No. Title 11617 DoC on ISO/IEC 15938-6:2001/PDAM4 11618 Text of ISO/IEC 15938-6:2001/FPDAM4 Reference Software for Video Signature Tools 11619 DoC on ISO/IEC 15938-7:2001/PDAM6 11620 Text of ISO/IEC 15938-7:2001/FPDAM6 Conformance for Video Signature Tools 11621 Study Text of ISO/IEC 15938-8/PDAM6 Extraction and Matching of Video Signature Tools TBP Available N 10/10/15 N 10/11/07 N N 10/10/15 10/11/07 N 10/11/07 5 Reconfigurable Video Coding (RVC) The main activities in RVC were as follows: Regarding MPEG-B part 4, a DCOR on ISO/IEC 23001-4 (N11622) was issued which contains various editorial corrections and clarifications of mathematical operations. In addition, a first Working Draft of ISO/IEC 23001-4/Amd.1 (N11623) was approved, which contains a more precise specification of the computation model, RVC-CAL I/O extended support, definition of typing rules for large number arithmetic and (non-normative) definition of FU dataflow classes, classification rules and methods. 111 Regarding MPEG-C part 4, FDAM1 (software and conformance) was released. Besides Amd.2 Tools for AVC High Profile (progressive only) that is currently under PDAM ballot, WD of ISO/IEC 23002-4/Amd.3 was issued which defines FUs for SVC Profiles (N11627). Report of RVC Breakout Activity 5.1 RVC-related activity scheduling at the meeting Room allocation for Video (SHAOGUAN) and RVC Breakout (3416) were arranged as shown: Date Topic Room Time Video Plenary (planning of this week) Video 1230 Monday RVC agenda time allocation approval RVC 1530 – 1630 Joint meeting REQ on Type1 Licensing Reqs 1630 - 1830 Reconfigurable Graphic Framework joint 3DG 9 – 1130 Tuesday meeting Status of Conformance Testing FPDAM1 RVC 1130 - 12 Review of input contributions RVC 14-16 Review of input contributions RVC 16-18 MPEG Plenary Yellow 9-11 Wednesday Review and approval of output documents RVC 17-18 Video Plenary Video 17-19 Thursday Friday MPEG Plenary 5.2 Review of Input contributions Scheduling and notes recorded for particular input contributions were as shown below: Time Document No. T 12 – 13 Title Notes & Recommendations Mickael Raulet , Matthieu Wipliez, Jarome Gorin, Nicolas Siret Typing rules should be included in the WD of AMD1 of ISO/IEC 23001-4 A new core experiment should be added studying the big integers supports and their typing rules. Junaid Jameel Ahmad Crypto Tools Library (CTL): Applying Shujun Li , Marco RVC-CAL for Multimedia Security Mattavelli , Matthieu Applications Wipliez , Mickael Raulet Contribution providing a first step towards the possibility to embed encryption in codecs and generate encrypted bitstreams that maintains conformance and syntax. Future contributions will be provided to demonstrate it. The need of big integers in RVC-CAL has been demonstrated. A proposal of RVC-CAL extensions for Jorn Janneck, Marco Addendum: Specification of typing rules for RVC-CAL m18345 T 15 – 18 m18404 m18460 Authors 112 improved support of I/O processing m18458 m18346 18547 Mattavelli, Shujun Li, Mickael Raulet, Matthieu Wipliez, Johan Eker, Carl Von Platen, Ghislain Roquier, Ihab Amer, Christophe Lucarz, Pascal Faure, Junaid Jameel Ahmad Proposed extensions of RVC-CAL I/O support should be included in the WD of AMD1 of ISO/IEC 23001-4. Supporting tools should be updated so as to support the I/O defined. Writing dataflow networks components Jorn Janneck, Marco (FUs) with different models of Mattavelli, Shujun Li, computations using RVC-CAL: a Mickael Raulet, tutorial Matthieu Wipliez, Johan Eker, Carl Von Platen, Ghislain Roquier, Ihab Amer, Christophe Lucarz, Pascal Faure, Junaid Jameel Ahmad The contribution presents theory and example of FU dataflow classes using RVC-CAL. It proposes a core experiment that aims at re-writing FUs that can be classified as belonging to a more specialized class. Mickael Raulet , Matthieu Wipliez The contribution reports the classification of FUs in the RVC toolbox. The classification is made automatically by a tool embedded in Orcc environment. It is recommended to use it in the core experiment proposed in m18458 Mariam Saleh, Mickael Raulet , Ihab RVC-CAL non BTYPE Version Amer, Marco Mattavelli The contribution reports the results of conformance testing of the non BType version of the decoder Automatic classification of the FUs An AHG was appointed (N11644, as recorded in N11554) to further the work until the 95th meeting. Document(s) approved: No. Title 11622 Text of ISO/IEC 23001-4/DCOR1 11623 Working Draft of ISO/IEC 23001-4/Amd.1 11624 Disposition of Comments on ISO/IEC 23002-4/FPDAM1 113 TBP N N N Available 10/10/15 10/10/15 10/10/15 11625 11627 11628 Text of ISO/IEC 23002-4/FDAM1 Software and Conformance Testing for Video Tool Library WD of ISO/IEC 23002-4/Amd.3 Functional Units for SVC Profiles Description of Core Experiments in RVC N 10/10/22 N 10/10/15 N 10/10/15 6 Explorations – 3D Video In preparation of a Call for Proposals, the 3D Video activity and associated breakout sessions operated under the management of the Requirements subgroup. Technical Document(s) reviewed (see AHG report M17968) m18026 3DV/FTV EE result of coding experiment on Lovebird1 sequence m18221 Depth Estimation Experiment with Poznan Hall m18342 m18343 3DV EE4 Report on Book_arrival Sequence 3DV EE4 Report on Poznan_Street Sequence m18349 3DV EE4 Results on Balloons sequence m18354 3DV/FTV EE1 and EE4 report on Champagne Tower sequence m18355 3DV/FTV EE4 report on Kendo sequence m18356 3DV/FTV EE2 report on VSRS extrapolation m18358 Update of the Draft Report on Experimental Framework for 3D Video Coding m18363 Results on 3DVC EE4 for Newspaper m18367 A method of disparity vector prediction using depth map m18369 m18395 3DV EE4 Results on Beergarden 3DV EE4 Results on Mobile m18420 3DV EE4 Results on Pantomime sequence m18511 m18512 3D Video Coding Results of Exploration Experiments (Book Arrival) Frame range extension of Poznan Street and Poznan Carpark sequences (3DV/EE1) 3DV EE1 Results on Newspaper EE1 Results on Cafe m18513 3DV EE4 Results on Newspaper m18514 m18515 Common-hole Filling for Boundary Noise Removal in VSRS 3DV EE4 Results on Cafe m18421 m18506 114 Donggyu Sim, Woong Lim, GiMun Um, Won-Sik Cheong, Namho Hur Takanori Senoh, Kenji Yamamoto, Ryutaro Oi, Yasuyuki Ichihashi, Taiichiro Kurita, Deliang Fu, Lu Yu Deliang Fu, Lu Yu Jaejoon Lee, Seok Lee, HoCheon Wey, Du Sik Park Masayuki Tanimoto, Toshiaki Fujii, Mehrdad Panahpour Tehrani, Menno Wildeboer Masayuki Tanimoto, Toshiaki Fujii, Mehrdad Panahpour Tehrani, Menno Wildeboer Masayuki Tanimoto, Toshiaki Fujii, Mehrdad Panahpour Tehrani, Menno Wildeboer Menno Wildeboer (Nagoya Univ.), Patrick Lopez (Technicolor), Dong Tian (Mitsubishi), Yin Zhao (Zhejiang Univ.), Cheon Lee (GIST) Pablo Carballeira , Julián Cabrera, Fernando Jaureguizar, Gianluca Cernigliaro, Juan Casal, Narciso Garcia Yoshiya Yamamoto, Tadashi Uchiumi, Makoto Ohtsu, Junsei Sato, Atsutoshi Shimeno Patrick Lopez fons bruls Dmytro Rusanovskyy, Miska M. Hannuksela Gerhard Tech, Karsten Müller Olgierd Stankiewicz , Krzystof Wegner Cheon Lee, Yo-Sung Ho Cheon Lee, Yo-Sung Ho Min-Koo Kang, Cheon Lee, YoSung Ho Cheon Lee, Yo-Sung Ho Min-Koo Kang, Cheon Lee, Yo- m18518 A Depth Compression for MVD View Synthesis m18519 Enhancement of Depth Estimation m18520 Stereoscopic video coding with asymmetric luminance and chrominance qualities m18535 Depth map enhancement for view synthesis m18543 3DV EE1 Results on Poznan_Carpark• Test Sequence m18552 Experimental Results for Inter-View Direct Mode with Decompressed Disparity Maps for Multiview Video Coding Sung Ho Qiuwen Zhang, Ping An, Yan Zhang, Ran Ma, Qian Zhang, Zhaoyang Zhang Qiuwen Zhang, Ping An, Qian Zhang, Yan Zhang, Zhaoyang Zhang Feng Shao, Gangyi Jiang, Mei Yu, Xu Wang Qian Zhang, Ping An, Qiuwen Zhang, Ran Ma, Zhaoyang Zhang Qifei Wang , Kun Xu , Qian Ma , Xiangyang Ji , Qionghai Dai Jacek Konieczny , Marek Domanski Document(s) approved: No. Title 11630 Description of Exploration Experiments in 3D Video Coding 11631 Report on Experimental Framework in 3D Video Coding TBP Available N 10/10/29 Y 10/10/15 7 Explorations – Scalable resolution enhancement of frame-compatible stereo video In continuation of the work begun at the previous meeting for evaluating the potential need for scalable resolution enhancement of frame-compatible stereo video, a number of activities took place on this subject in Guangzhou at this meeting. This work should primarily be considered to fall under the Requirements subgroup at this stage, and although a summary of the work on this topic is also provided here, the Requirements report should be reviewed for further information on this activity. An ad hoc group had met on the day preceding the WG 11 meeting. The AHG had recommended for WG 11 to Acquire more common set of test sequences. The "Beer garden" sequence is available but has asymmetric source quality due to precompression. Dolby is expected to provide sequences to MPEG by the end of this year. Further clarify and fine tune the test conditions (e.g. bit rates) that can be used with the common test sequences to evaluate the performance of existing as well as possible new coding tools that can provide reasonably higher coding efficiency than the existing tools in MPEG-4 Part 10, Create anchors with the target of issuing CfP in the next WG11 meeting if the established test conditions show that the performance of existing tools can be improved significantly. It was agreed that AVC base layer compatibility is required – at least it must support that. It is not yet known whether frame-compatible MPEG-2 will be deployed in large numbers. It was remarked regarding m18478 that the wording "The new codec shall not introduce significant degradation" may not be fully appropriate, and it was suggested to reword this to something more describing having a "visually acceptable quality base layer quality". A break-out activity was coordinated by W. Husak & A. Tourapis during the meeting to: discuss test conditions prepare use case & requirements output doc 115 prepare toward fulfilling the recommendations of the ahg plan for establishing new ahg In M18541, Panasonic reported on Subjective Picture Quality Evaluation of MVC Stereo High Profile for Full-Resolution Stereoscopic HD Video Applications. Panasonic tested the use of the Stereo High profile with a 12 Mbps first view and a varying bit rate for the 2nd view, relative to the base layer with a 1920x1080 shutter-glass frame-sequential 103" display. Confidence intervals overlapped for 25-50% supplemental bit rate, and 50% supplemental bit rate scored slightly higher than 100% added bit rate simulcast. For animation, a lower second-view relative bit rate seemed acceptable. For equal total bit rate, it seemed better to allocate 1/3 of those bits (50% relative to the main view) than to allocate substantially fewer (15% relative to the main view). Interlaced video appeared to need a higher bit rate than progressive scan video. These subjective results seemed better than the objective results reported in JVT-AE022, in terms of indicating the visual coding efficiency of the MVC Stereo High profile design. The results of the work and plans for future activity were agreed to be reflected in two output documents, one describing requirements for MPEG Frame-Compatible Enhancement (MFC), and one describing plans for the test conditions and evaluation process for MFC. And editing period was authorized for preparation of the latter document. An Ad Hoc group was established to further the work until the Daegu meeting. Relevant input documents reviewed: m17981 m18425 m18427 m18446 m18475 m18478 AHG report on scalable resolution enhancement of frame-compatible stereo video Philips (in coop with 3D4YOU) response to request for HQ stereo content for MFC AHG MFC initial results & findings Compression Efficiency Evaluation of Frame-Compatible Stereoscopic Video Coding with Resolution Enhancement Comments on Scalable Resolution Enhancement of Frame-Compatible Stereoscopic 3D Video Use cases and requirements for scalable resolution enhancement of framecompatible stereoscopic 3D video m18486 On scalable resolution enhancement of frame-compatible stereoscopic 3D video m18500 Comments on scalable enhancement of frame compatible stereo 3D m18521 Comments on Scalable resolution enhancement of frame-compatible stereoscopic 3D video m18541 Subjective Picture Quality Evaluation of MVC Stereo High Profile for Full-Resolution Stereoscopic HD Video Applications Fons Bruls Fons Bruls Thomas Rusert , Per Fröjdh Ying Chen , Marta Karczewicz Walt Husak, Ajay Luthra , Gary Sullivan Koohyar Minoo , Vivian Kung, David Baylon, Krit Panusopone, Ajay Luthra Teruhiko Suzuki , Ikuo Tsukagoshi Kyong-Sok Seo, ByeongDoo Choi, Dae-Sung Cho Tao Chen, Yoshiichiro Kashiwagi, Takahiro Nishi Document(s) approved: No. Title TBP Available 11681 Requirements for MPEG Frame-Compatible Enhancement (MFC) N 10/10/15 11682 Test Conditions and Evaluation Process for MFC N 10/10/29 116 Annex H– JCT report Source: Jens Ohm and Gary Sullivan, Chairs Summary The Joint Collaborative Team on Video Coding (JCT-VC) of ITU-T WP3/16 and ISO/IEC JTC 1/SC 29/WG 11 held its third meeting during 7-15 October, 2010 at the Guangzhou Baiyun International Convention Center – Oriental International Convention Hotel, in Guangzhou, China. The JCT-VC meeting was held under the chairmanship of Dr. Gary Sullivan (Microsoft/USA) and Dr. Jens-Rainer Ohm (RWTH Aachen/Germany). The JCT-VC meeting sessions began at approximately 9:00 a.m. on Thursday 7 October. Meeting sessions were held on all days (including weekend days) until the meeting was closed at approximately 1:30 p.m. on Friday 15 October. Approximately 244 people attended the JCT-VC meeting, and approximately 300 input documents were discussed. The meeting took place in a colocated fashion with a meeting of ISO/IEC JTC 1/SC 29/WG 11 (MPEG) – one of the two parent bodies of the JCT-VC. The subject matter of the JCT-VC meeting activities consisted of work on the new next-generation video coding standardization project now referred to as High Efficiency Video Coding (HEVC). The primary goals of the meeting were to review the work that was performed in the interim period since the second JCT-VC meeting in implementing the Test Model under Consideration (TMuC), review results from Tool Experiments (TE), review technical input documents, establish a first Working Draft and HEVC Test Model (HM), and plan a set of Core Experiments (CE) for further investigation of proposed technology. The JCT-VC produced five important output documents from the meeting: A document describing the process of test model development for HEVC, the HEVC Test Model 1 (HM1), the HEVC specification Working Draft 1 (WD1), a document describing software guidelines and a document providing an overview about the HM1 and WD1 adoptions. Furthermore, 13 documents describing the planning of CEs were drafted. For the organization and planning of its future work, the JCT-VC established thirteen "Ad Hoc Groups" (AHGs) to progress the work on particular subject areas. The next JCT-VC meetings will be held during 20-28 January 2011 in Daegu, Korea under the auspices of JTC 1/SC 29/WG 11, and during 15-23 March 2011 in Geneva, Switzerland under the auspices of ITU-T Q6/16. A new document distribution site http://phenix.it-sudparis.eu/jct/ became operational shortly before the Guangzhou meeting and was exclusively used for distribution of all documents. The new site allows a more automated process of document registration and download. Further improvements are expected in simplified bulk download and particularly for selective download of updated documents. The reflector to be used for discussions by the JCT-VC and all of its AHGs is the JCT-VC reflector: jct-vc@lists.rwth-aachen.de. For subscription to this list, see http://mailman.rwth-aachen.de/mailman/listinfo/jct-vc. 1 1.1 Administrative topics Organization The ITU-T/ISO/IEC Joint Collaborative Team on Video Coding (JCT-VC) is a group of video coding experts from the ITU-T Study Group 16 Visual Coding Experts Group (VCEG) and the ISO/IEC JTC 1/ SC 29/ WG 11 Moving Picture Experts Group (MPEG). The parent bodies of the JCT-VC are ITU-T WP3/16 and ISO/IEC JTC 1/SC 29/WG 11. 117 The Joint Collaborative Team on Video Coding (JCT-VC) of ITU-T WP3/16 and ISO/IEC JTC 1/SC 29/WG 11 held its third meeting during 7-15 October, 2010 at the Guangzhou Baiyun International Convention Center – Oriental International Convention Hotel, in Guangzhou, China. The JCT-VC meeting was held under the chairmanship of Dr. Gary Sullivan (Microsoft/USA) and Dr. Jens-Rainer Ohm (RWTH Aachen/Germany). 1.2 Meeting logistics The JCT-VC meeting sessions began at approximately 9:00 a.m. on Thursday 7 October. Meeting sessions were held on all days (including weekend days) until the meeting was closed at approximately 1:30 p.m. on Friday 15 October. Approximately 244 people attended the JCT-VC meeting, and approximately 300 input documents were discussed. The meeting took place in a colocated fashion with a meeting of ISO/IEC JTC 1/SC 29/WG 11 (MPEG) – one of the two parent bodies of the JCT-VC. The subject matter of the JCT-VC meeting activities consisted of work on the new next-generation video coding standardization project now referred to as High Efficiency Video Coding (HEVC). Information regarding logistics arrangements for the meeting had been provided at http://avslab.org/news/mpeg. 1.3 Primary goals The primary goals of the meeting were to review the work that was performed in the interim period since the second JCT-VC meeting in implementing the Test Model under Consideration (TMuC), review results from Tool Experiments (TE), review technical input documents, and establish a first Working Draft and HEVC Test Model (HM). 1.4 Documents The documents of the JCT-VC meeting are listed in Annex A of this report. The documents can be found at http://phenix.it-sudparis.eu/jct/. The formal deadline for registering and uploading non-administrative contributions was October 1, 2010. Some confusion over document handling occurred as the result of the transition to a new electronic document archive site just prior to the meeting. Documents with numbers JCTVC-C295 through JCTVC-C320 were all registered after October 2 and uploaded after October 3 (based on the timestamps of the document archive site, which appear to correspond to the local time zone of Paris, France), of which only JCTVC-C296, JCTVC-C297, JCTVC-C299, JCTVC-C301, JCTVC-C302 and JCTVC-C303 were available by the first meeting day. Four of these (JCTVC-C310, JCTVC-C317, JCTVC-C319, and JCTVC-C320) were summary reports or side-activity reports generated during the meeting and thus should not necessarily be considered late. The majority of the others were cross-verification reports. Among the other documents, JCTVC-C030, JCTVC-C068, JCTVC-C114, JCTVC-C165, JCTVCC174, JCTVC-C175, JCTVC-C178, JCTVC-C179, JCTVC-C180, JCTVC-C181, JCTVC-C182, JCTVC-C225, JCTVC-C231, JCTVC-C244, JCTVC-C247, JCTVC-C248, JCTVC-C287, JCTVCC288 and JCTVC-C292 had been uploaded after October 3 (based on the timestamps of the document archive site, which appear to correspond to the local time zone of Paris, France), but had been uploaded in some form by the first day of the meeting. Documents JCTVC-C272 and JCTVCC284 were missing prior to the meeting and were provided by the third meeting day during the meeting. Documents JCTVC-C243, JCTVC-C308, and JCTVC-C309, all of which were crossverification experiment reports, were only uploaded substantially after the meeting had ended. Registration timestamps, initial upload timestamps, and final upload timestamps are listed in Annex A of this report. These timestamps reflect activity on the new document site, and some documents had been uploaded (and in some cases, made available for distribution) on the prior document site before being placed on the new document site. The timestamps of the document archive site appear to correspond to the local time zone of Paris, France. 118 Documents JCTVC-C289, JCTVC-290 and JCTVC-291 were available, but were initially missing an IPR statement. As a general policy, missing documents were not to be presented, and late documents (and substantial revisions) could only be presented when sufficient time for studying was given after the upload. This policy was not strictly enforced at this meeting in a few cases – partly because of the transition to the use of a new document archive site and partly because there were network access difficulties during the meeting – although some late documents were not presented due to lack of time for their presentation. Otherwise, there were no objections raised by the group regarding presentation of late contributions. The situation about missing input documents must be significantly improved in upcoming meetings. An issue discussed at the meeting was what to do in the case of a submission of a "placeholder" contribution document that is basically empty of content, with perhaps only a brief abstract and some indication of an intent to provide a more complete submission as a revision. It was agreed that such submissions do not seem to be constructive, and should be rejected if there is a way to do this in the document handling interface; as such submissions seem to be an attempt to avoid a submission being identified as late while not providing adequate content for study by the participants. Contribution document registrations JCTVC-C069, JCTVC-C107, JCTVC-C133, JCTVC-C151, JCTVC-C177, JCTVC-C221, JCTVC-C222, JCTVC-C242, and JCTVC-C294 were withdrawn (no documents had been provided, or an incomplete document had been provided, or the registration to JCT-VC had been made in error when intending to submit a document to the WG11 parent body instead). The report documents of the previous meeting, particularly the meeting report JCTVC-B200, the draft TMuC encoder description JCTVC-B204 and decoder specification JCTVC-B205 were approved. Draft versions 6 and 7 of JCTVC-B205 were made available by Saturday. 1.5 Attendance The list of participants in the JCT-VC meeting can be found in Annex B of this report. The meeting was open to those qualified to participate either in ITU-T WP3/16 or ISO/IEC JCT1/SC29/WG11 (including experts who had been personally invited by the Chairs as permitted by ITU-T or ISO/IEC policies). Participants had been reminded of the need to be properly qualified to attend. Those seeking further information regarding qualifications to attend future meetings may contact the Chairs. 1.6 Agenda The agenda for the meeting was as follows: IPR policy reminder and declarations Contribution document allocation Reports of Ad Hoc group activities Reports of Tool Experiment activities Review of results of previous meeting Consideration of contributions and communications on HEVC project guidance Consideration of HEVC technology proposal contributions Consideration of information contributions Coordination activities 119 Future planning: Determination of next steps, discussion of working methods, communication practices, establishment of coordinated experiments, establishment of AHGs, meeting planning, refinement of expected standardization timeline, other planning issues Other business as appropriate for consideration 1.7 IPR policy reminder Participants were reminded of the IPR policy established by the parent organizations of the JCT-VC and were referred to the parent body web sites for further information. The IPR policy was summarized for the participants. The ITU-T/ITU-R/ISO/IEC common patent policy shall apply. Participants were particularly reminded that contributions proposing normative technical content shall contain a non-binding informal notice of whether the submitter may have patent rights that would be necessary for implementation of the resulting standard. The notice shall indicate the category of anticipated licensing terms according to the ITU-T/ITU-R/ISO/IEC patent statement and licensing declaration form. Contributions of software source code for incorporation into the Reference Software for the standard shall be provided with a suitable copyright disclaimer header text in a form acceptable to the parent bodies to enable publication of the source code and to enable users of the software to copy the software and use it for research and standardization purposes and as a basis for the development of products.(while the submitter separately retains any associated patent rights for licensing to be conducted outside of ITU-T/ITU-R/ISO/IEC). This obligation is supplemental to, and does not replace, any existing obligations of parties to submit formal IPR declarations to ITU-T/ITU-R/ISO/IEC. Participants were also reminded of the need to formally report patent rights to the top-level parent bodies (using the common reporting form found on the database listed below) and to make verbal and/or document IPR reports within the JCT-VC as necessary in the event that they are aware of unreported patents that are essential to implementation of a standard or of a draft standard under development. Some relevant links for organizational and IPR policy information are provided below: http://www.itu.int/ITU-T/ipr/index.html (common patent policy for ITU-T, ITU-R, ISO, IEC and guidelines and forms for formal reporting to the parent bodies) http://ftp3.itu.int/av-arch/jctvc-site (JCT-VC contribution templates) http://www.itu.int/ITU-T/studygroups/com16/jct-vc/index.html (JCT-VC founding charter) http://www.itu.int/ITU-T/dbase/patent/index.html (ITU-T IPR database) http://www.itscj.ipsj.or.jp/sc29/29w7proc.htm (SC29 Procedures) The chairs invited participants to make any necessary verbal reports of previously-unreported IPR in draft standards under preparation, and opened the floor for such reports: No such verbal reports were made. It was mentioned that further discussions about an appropriate software disclaimer header text are currently conducted between JCT-VC and parent body management, as discussed in JCTVC-C001. To proceed on this, JCT-VC issued the following resolution to the WG11 parent body: "The JCTVC has requested review of the reference software copyright handling status reported from the JCTVC chairs as found in N11645 [a document produced for WG11 corresponding to the content of JCTVC-C001]. NBs are requested to comment on the suitability of the proposed approaches." 1.8 Communication practices The documents for the meeting can be found at http://phenix.it-sudparis.eu/jct/. This is a new site that the group transitioned to using just prior to the meeting. Previously, JCT-VC documents had 120 been made available at http://ftp3.itu.int/av-arch/jctvc-site. The earlier uploaded documents for the meeting were put on the prior site until the transition to using the new site was announced. JCT-VC email lists are managed through the site http://mailman.rwthaachen.de/mailman/options/jct-vc, and to send email to the reflector, the email address is jctvc@lists.rwth-aachen.de. Only members of the reflector can send email to the list. It was emphasized that reflector subscriptions and email sent to the reflector must use their real names when subscribing and sending messages and must respond to inquiries regarding their type of interest in the work. For the case of TE/CE documents and AHGs, email addresses of participants and contributors may be obscured or absent (and will be on request), although these will be available (in human readable format – possibly with some "obscurification") for primary TE/CE coordinators and AHG chairs. 1.9 Terminology Some terminology used in this report is explained below: AHG: Ad hoc group AIS: Adaptive intra smoothing. ALF: Adaptive loop filter AVC: Advanced video coding – the video coding standard formally published as ITU-T Recommendation H.264 and ISO/IEC 14496-10. BD: Bjøntegaard-delta – a method for measuring percentage bit rate savings at equal PSNR or decibels of PSNR benefit at equal bit rate (e.g., as described in document VCEG-M33 of April 2001). BR: Bit rate. CABAC: Context-adaptive binary arithmetic coding. CBF: Coded block flags. CE: Core experiment – a coordinated experiment conducted after the 3rd meeting. DT: Decoding time. ET: Encoding time. HE: High efficiency – a set of coding capabilities designed for enhanced compression performance (contrast with LC). Often loosely associated with RA. HEVC: High Efficiency Video Coding – the video coding standardization initiative under way in the JCT-VC. HM: HEVC Test Model – a video coding design containing selected coding tools that constitutes our draft standard design – now also used especially in reference to the (nonnormative) encoder algorithms (see WD and TM). IBDI: Internal bit-depth increase – a technique by which lower bit depth (8 bits per sample) source video is encoded using higher bit depth signal processing, ordinarily including higher bit depth reference picture storage (ordinarily 12 bits per sample). JM: Joint model – the primary software codebase developed for the AVC standard. LC: Low complexity – a set of coding capabilities designed for reduced implementation complexity (contrast with HE). Often loosely associated with LD. LCEC: Low-complexity entropy coding. 121 LD: Low delay – a set of coding conditions designed to enable interactive real-time communication, with less emphasis on ease of random access (contrast with RA). Often loosely associated with LC. LUT: Look-up table. MC: Motion compensation. MDDT: Mode-dependent directional transform. MRG: block merging mode for CUs. MV: Motion vector. PCP: Parallelization of context processing. PIPE: Probability interval partitioning entropy coding (roughly synonymous with V2V for most discussion purposes, although the term PIPE tends to be more closely associated with proposals from Fraunhofer HHI while the term V2V tends to be more closely associated with proposals from RIM). QP: Quantization parameter. RA: Random access – a set of coding conditions designed to enable relatively-frequent random access points in the coded video data, with less emphasis on minimization of delay (contrast with LD). Often loosely associated with HE. R-D: Rate-distortion. RDO: Rate-distortion optimization. RDOQ: Rate-distortion optimized quantization. ROT: Rotation operation for low-frequency transform coefficients. RQT: Residual quadtree. TE: Tool Experiment – a coordinated experiment conducted after the 1st or 2nd JCT-VC meeting. TM: Test Model – a video coding design containing selected coding tools; as contrasted with the TMuC, see HM. TMuC: Test Model under Consideration – a video coding design containing selected proposed coding tools that are under study by the JCT-VC for potential inclusion in the HEVC standard. TPE: Transform precision extension. UDI: Unified directional intra. Unit types: o CU: coding unit. o LCTU: largest coding tree unit. o PU: prediction unit. o TU: transform unit. V2V: variable-length to variable-length prefix coding (roughly synonymous with PIPE for most discussion purposes, although the term PIPE tends to be more closely associated with 122 proposals from Fraunhofer HHI while the term V2V tends to be more closely associated with proposals from RIM). WD: Working draft – the draft HEVC standard corresponding to the HM. 1.10 Liaison activity The JCT-VC did not send or receive formal liaison communications at this meeting. 1.11 Contribution topic overview The approximate subject categories and quantity of contributions per category for the meeting was summarized and is provided as follows. AHG reports and TE summary reports (10) TE1: Decoder-side motion derivation (10) TE2: IBDI and memory compression (14) TE3: Inter prediction (14) TE4: Variable length coding (3) TE5: Simplified TMuC intra prediction (5) TE6: Intra prediction (18) TE7: Alternative transforms / MDDT Simplification (19) TE8: Parallel entropy coding (10) TE9: Large block structures (4) TE10: In-loop filtering (20) TE11: Motion vector coding (12) TE12: Evaluation of TMuC tools (79) Project planning and test model establishment (14) TMuC settings and common test conditions (4) Application-specific topics (4) Loop filtering (5) Block structures and partitioning (12) Motion compensation and interpolation filters (10) Motion vector coding (2) B picture reference list redundancy (2) Quantization control (1) Entropy coding (13) Intra prediction (24) Transforms and residual coding (10) IBDI and memory compression (3) 123 1.12 Complexity analysis (2) Video sequence viewing during the meeting The following informal viewing sessions were held on Saturday in Room 3410 (4th floor, building 3) during the meeting: From 10 am to 2 pm, viewing of TMuC-encoded video with Super High Vision resolution (cropped areas). From 2 pm to 5 pm, viewing of in-loop filtering TE video. In a resolution conveyed to the WG11 parent body, the JCT-VC thanked Prof. Liang Fan of Sun Yat-sen University, Mr. Taichiro Shiodera of Toshiba, and Mr. Kenta Senzaki of NEC for assistance with video viewing experiment during the 3rd meeting of the JCT-VC. 2 AHG reports The activities of ad hoc groups that had been established at the prior meeting are discussed in this section. 2.1.1.1.1.1.1.1.1 JCTVC-C001 AHG report: JCT-VC project management [G. J. Sullivan, J.-R. Ohm (co-chairs)] (missing prior, available first day) This document reported on the work of the JCT-VC ad hoc group on Project Management. The work of the JCT-VC overall had proceeded well in the interim period. A large amount of discussion was carried out on the group email reflector. All report documents from the preceding meeting had been made available. The various ad hoc groups and tool experiments had made progress and various reports from those activities had been submitted. One key topic discussed at the management level in the interim period was the need to establish an appropriate copyright status for the test model and reference software being developed by the JCTVC, as noted in the JCT-VC Terms of Reference. The intent is for the software to be developed as part of the work to develop the HEVC standard and also for it to be published as reference software by ITU-T and ISO/IEC. In regard to the software copyright issue, it was suggested to conclude as follows: That the software copyright management should be established in a manner that protects the patent rights of contributors (which are subject to the ITU-T/ITU-R/ISO/IEC Common Patent Policy) and minimizes any concerns regarding avoiding any other liabilities for contributors and users. In particular, it has been expressed that the scope of copyright rights to use of the software should not be substantially constrained. The MXM form of the BSD license as quoted above from WG11 N10791 may be a good candidate language to consider, and nearly all of those who have contributed to the software thus far have agreed to allow its use. The only reservation that has been expressed by any contributors regarding the use of the MXM form of the BSD license is a further desire to ensure full clarity that no patent rights are granted by the availability of the software. If the MXM form of the BSD license cannot be used for some reason, an alternative candidate language has been provided herein for consideration. As it does not currently seem feasible to immediately establish a particular language declaring the copyright status of the software, it must be understood by all contributors that the JCT-VC and its parent bodies plan to establish such a statement and that the act of contributing to the software involves agreeing to allow the group to do so (in a manner consistent with addressing the concerns stated above to the extent feasible). 124 These recommendations were reviewed and the intent was agreed. Consideration by the parent bodies was suggested, and the topic was addressed in a resolution produced as an output of the meeting. 2.1.1.1.1.1.1.1.2 JCTVC-C002 AHG report: Test Model under Consideration (TMuC) editing [K. McCann (chair), M. Karczewicz, J. Ridge, S. Sekiguchi, T. Wedi, T. Wiegand (vice-chairs)] (missing prior, uploaded on first day) Following the Geneva meeting, this AHG began the editorial work on the TMuC decoder description using JCTVC-B205 draft000 as the initial draft. This was identical to JCTVC-A205 draft007, the final draft from the Dresden meeting, but with all changes accepted. Seven additional drafts of this output document were produced in the interim period. The last two of these drafts were produced just as the meeting approached, and had not yet been made available for download until after the 3rd meeting began. There was no initial draft from the Geneva meeting for JCTVC-B204, the corresponding encoding document, which would also provide some tutorial information to explain how TMuC works. Shunichi Sekiguchi led the production of this document. The recommendations of the TMuC Editing AHG were to: Approve the edited JCTVC-B204 (TMuC encoder) and JCTVC-B205 (TMuC decoder) documents as JCT-VC outputs Continue to edit the TMuC encoder and decoder documents to ensure that all agreed elements of the TMuC are fully described Compare the TMuC documents with the TMuC software and resolve any discrepancies that may exist Improve the editorial consistency by ensuring that all “normative-style” text is in the decoder document, which may be referred to by the encoder document Encourage more people to volunteer to contribute text to the TMuC encoder document Consider generating a new pair of documents to describe the future Test Model, assuming that the TMuC continues to exist and evolve in parallel with the Test Model The further discussion of this report included the following remarks: We need to work to resolve any discrepancies between document and software There should be no normative phrasing style in the encoder description document (and not an excessive amount of informal informative remarks in the decoder specification either) The establishment of a TM may be an opportunity to increase the level of editorial discipline. 2.1.1.1.1.1.1.1.3 JCTVC-C003 AHG report: Software development and TMuC software technical evaluation [F. Bossen (chair), P. Chen, D. Flynn, W.-J. Han, K. Sühring, H. Schwarz, K. Ugur (vice-chairs)] (missing prior, uploaded during meeting) Initially, a non-final draft of this AHG report was discussed. The final report was submitted later during the meeting. The activities of the AHG included integration of missing tools into a common code base, resolution of compatibility issues between tools, bug fixes, etc. A near-complete implementation of TMuC had become available. A brief summary of activities related to each mandate is given below. Development of the software was coordinated with the parties needing to integrate changes. Several tracks were typically pursued in parallel to speed up development. The distribution 125 of the software was made through the SVN servers set up at HHI and BBC, as announced on the jct-vc reflector. No documentation was produced at this time. Version 0.6 of the software was delivered according to schedule. Due to the presence of a number of known bugs in version 0.6, a version 0.7 was delivered a few days later with a number of fixes and was recommended as the base version to be used for experiments. Reference encodings were generated using version 0.7 by several parties and distributed through an ftp server set up at the BBC. As of version 0.8.1, almost all tools included in TMuC had been integrated into a common code base. Missing tools were: Motion vector scaling (Qualcomm). Qualcomm indicated that they no longer wished to proceed with this integration. Geometric partitions (Qualcomm). This tool had been integrated into the 0.8qualcomm branch but has not yet been merged into a release version of the software. 3-input adaptive loop filter (Panasonic). This tool had been integrated into the 0.8panasonic branch but had not yet been merged into a release version of the software. In addition to these missing tools, the following code contributions had not been merged into a release version of the software due to lack of time: Bug fixes related to the PU-based merging tool which are present in the 0.8-hhibugfix branch Unification of PIPE and load balancing which is present in the 0.8-rim-bugfix branch A number of additional macros were added to the software to improve its configurability. Requests for new control switches were handled through the public issue tracker set up by the BBC. Multiple versions of the TMuC software were produced and announced on the jct-vc email reflector. A detailed history of changes made to the software can be viewed at http://hevc.kw.bbc.co.uk/trac/timeline Released versions of the software are available on the SVN server set up at the following URL: https://hevc.hhi.fraunhofer.de/svn/svn_TMuCSoftware/tags/{version number} where {version number} corresponds to one of the versions described below (e.g., 0.7). Intermediate code submissions can be found on a variety of branches available at https://hevc.hhi.fraunhofer.de/svn/svn_TMuCSoftware/branches/{branch name} where {branch name} corresponds to a branch (e.g., 0.8-qualcomm). The AhG recommended to the JCT-VC to: Use the TMuC software for the development of TM software Clean-up the TMuC software to remove parts that are not relevant anymore Address remaining critical issues filed on the issue tracker, if any The discussion of this report included the following remarks: Improving configurability is desirable There had been some parallel software development; after which merging was then necessary – David Flynn particularly helped with that. Documentation of software configuration was not adequately advanced. 126 It is especially important to keep track of intended normative behavior during the software development process, and ensure that normative behavior cannot be changed casually. 0.6 version was produced on the planned date, but it did not contain all planned elements (two missing at that time – one not pursued and the other rescheduled a month later). Various bugs had been identified within a couple of days and version 0.7 was issued and recommended for use in testing, and was used to generate the common conditions anchors made available on BBC hosted ftp site. Some extremely minor platform dependencies were noticed on Win32 systems – possibly compiler optimization (e.g., up to 0.05 dB PSNR differences). Some tools were still missing in 0.8.1 (motion vector scaling not pursued, geometric partitions just submitted and not yet merged, 3-input loop filter due to branch divergence issues – now seems to be available). Substantial new kinds of configurability were added. It is important to identify and preserve the distinction between what should be considered as a new proposal and what is just an improvement of configurability, and what should be considered as a bug fix. It is possible to see the software integration timeline in the software tracking system. For the "load balancing" aspect in V2V entropy coding, software had been submitted but not yet merged. PU-based merging fixes remained to be finalized. 2.1.1.1.1.1.1.1.4 JCTVC-C004 AHG report: Alternative transforms [R. Cohen, R. Joshi (cochairs)] Discussions for this AHG were carried out over the main JCT-VC reflector (jct-vc@lists.rwthaachen.de), usually with the term “AHG – Transforms” in the subject line. The discussions were mainly related to how to test alternative transform-related tools in TE12. An example of this was testing of MDDT and ROT. After some discussions, it was agreed to test MDDT and ROT under 3 different conditions, namely, MDDT=0 and ROT=0, MDDT=1 and ROT=0, MDDT=0, ROT=1. There was also some discussion about EDGE_BASED_PREDICTION and MDDT not being combined in an optimal fashion. A related TE (TE7) explored alternatives to the MDDT transform currently in the TMuC. A synopsis of TE7-related contributions was also included in this AHG report. Several cross-verifiers found that that the encoder produced slightly different results depending upon the platform, in part due to known issues with TMuC 0.7. The contributions related to alternative transforms that had been registered for this meeting were noted. Tool Experiments TE7 and TE12 related to the work of this AHG, and various contributions had been submitted in relation to these TEs. The AHG report included a summary of TE7 related contributions. 2.1.1.1.1.1.1.1.5 JCTVC-C005 AHG report: In-loop and post-processing filtering [T. Yamakage (chair), Y. J. Chiu, M. Karczewicz, M. Narroschke (co-chairs)] This document summarized the In-loop and post-processing filtering Ad hoc activities between the 2nd JCT-VC meeting in Geneva, Switzerland (21 to 28 July, 2010) and the current 3rd JCT-VC meeting in Guangzhou, China (7 to 15 October, 2010). There were a few email exchanges for this period. A related TE (TE10) covered most of mandates. 127 Other than the work on TE10, there were two contributions JCTVC-C085 and JCTVC-C113 that compared loop and post filtering. Although the examined post filtering methods in JCTVC-C085 may not be the best, the contribution made a comparison (in a fair manner in terms of coding conditions) for three methods. JCTVC-C113 presents experimental results using a different platform (i.e. JMKTA 2.7). There is also a report of signaling the position of an ALF flag in JCTVC-C084 and some verification results were supposed reported as JCTVC-C243. However, JCTVC-C243 was not uploaded until substantially after the meeting had ended. The related contributions were reviewed in the AHG report. Related to Mandate 1, 2, 3 and 4: TE10 has conducted experiments for three technical categories (deblocking/debanding filters, Wiener-based in-loop filters and Image clipping and offset). A summary of experimental results is reported in JCTVC-C083. Subtest 2 (Wiener-based in-loop filters) covered the complexity assessment at encoder side as well as coding efficiency assessment. There are complexity related contributions outside of TE10 activities, JCTVC-C113, JCTVC-C086 and its verification JCTVC-C212. In addition, a contribution (JCTVC-C173) proposes a coupled control of deblocking filter and Wiener-based filter, and a contribution JCTVC-C214 for TE12 to study various inputs to Wiener-based filters. There are contributions JCTVC-C195, JCTVC-C219 and JCTVC-C222 to enhance Wiener-based in-loop filtering. Informal subjective viewing for TE10 was conducted during this meeting. Related to Mandate 5: There are two contributions JCTVC-C085 and JCTVC-C113 that compare loop and post filtering. Although the examined post filtering methods in JCTVC-C085 may not be the best, the contribution made a comparison (in a fair manner in terms of coding conditions) for three methods. JCTVCC113 presents experimental results using a different platform (i.e. JMKTA 2.7). There was a report of signaling position of ALF_flag JCTVC-C084 and verification results in JCTVC-C243. However, JCTVC-C243 was uploaded substantially after the meeting had ended. Subjective viewing was conducted at the current meeting for some proposals (in room 3410 on 4th floor of building 3). T. Yamakage assisted in coordinating the scheduling of subjective viewing. It was generally agreed that we should avoid spending substantial time on viewing of cases where no substantial difference in quality will be perceived. 2.1.1.1.1.1.1.1.6 JCTVC-C006 AHG report: Large block structures [K. Panusopone (chair), M. Budagavi, W.-J. Han, D. He (vice-chairs)] (missing prior, available first day) This report summarized the large block structures ad hoc activities between the 2nd JCT-VC meeting in Geneva, Switzerland (21 to 28 July, 2010) and the current 3rd JCT-VC meeting in Guangzhou, China (7 to 15 October, 2010). There were some discussions relating to this ad hoc group on the main JCT-VC reflector during the interim period. The discussions included reporting of encoder crashes with some CU/PU/TU size combinations, and how to handle these issues. For example, the horizontal padding and vertical padding (or --PAD option) should be used to ensure that both frame width and height are a multiple of LCTU. There were also a lot of activities relating to large block structure occurred in TE 9 and TE 12; namely, Unit Defintions tests in TE 12 and performance tests for different CTB and TU sizes in TE 9. There are some input documents proposing solutions either to handle boundary issues, to allow flexible block partitioning, or to improve signaling efficiency. Additionally, TE 9 activities related to the mandates of this AHG. There also had been some initial discussions with the AhG on in-loop and post-processing filtering for a potential collaboration on evaluating performance of the current ALF filtering control information at the CU level. 128 2.1.1.1.1.1.1.1.7 JCTVC-C007 AHG report: Memory compression [K. Chono (chair), T. Chujoh, C. S. Lim (vice-chairs)] The ad hoc group (AHG) on memory compression was formed during the July 2010 JCT VC meeting in Geneva. The bulk of this AHG activity has been devoted to discuss and develop an MC read memory access bandwidth measure module for TMuC software in order to access the results of TE2: IBDI and Memory Compression. Approximately 70 e-mail messages were exchanged between the members from 4 parties/companies. The following items are listed as the benefits of memory compression for video encoders and decoders: Reduction in MC read memory access bandwidth; Reduction in DPB write memory access bandwidth; and Graceful power degradation by enabling the low resolution decoding of a bit-stream. The second item can be reasonably assessed by comparing the compression ratios of memory compression schemes since the DPB write memory access is not random access. Similarly, the third item can be reasonably assessed by counting the number of low resolution samples. Therefore, AHG activity has been devoted to discussing the first item. Indeed, it was mentioned by JCT-VC chairs during the last JCV-VC meeting that the compression ratios of memory compression schemes are not a comparable measure since overheads that come from the overlaps of the size of memory compression unit access in MC read random access are unclear. Through the discussion on the first item as described in detail below, the AHG developed a module that computes run-time memory access bandwidth measure for TMuC decoder. In order to have a comparable measure, some TE2 members suggested developing a module that computes run-time memory access bandwidth for given the combination of MC block size, the taplength of MC interpolation filter, Motion Vector (MV), sample-bits per pixel, etc. AHG members agreed to the suggestion and discussed a YUV storage structure in reference picture buffer. By considering the TMuC design in which the same MC interpolation filter is applied to UV components and a different MC interpolation filter is applied to Y component than UV's, the member decided to employ a YUV format comprising of a plane Y component and horizontallyinterleaved UV components. AHG members developed the module and updated it to account for the overlaps of a memory compression unit size. Given a combination of MC block size, the tap-length of MC interpolation filter, MV, sample-bits per pixel, memory compression unit size, the updated module computes the memory access samples. Then AHG members discussed a model of compressed reference storage and decided to use a 1-D storage model. In the 1-D storage model, a memory compressor compresses each of compression units and stores its compressed versions, whose size is byte-aligned, into 1-D storage in the raster scan order. A memory decompressor accesses a set of compressed reference data units stored in the 1-D storage and outputs the decompressed compression units (MC reference samples) to a MC module. Since the 1-D storage access is restricted by memory alignment and memory burst sizes, the actual memory access size (referred to as run-time memory access bandwidth) is determined on the basis of the associated memory read overheads as well as the number of compressed reference data units. Although run-time memory access bandwidth is varied by memory allocation methods, AHG members agreed that the raster scan is used as a default setting and that TE2 proponents can report run-time memory access bandwidth measures obtained by their specific memory allocation methods in addition to those obtained by the default setting. The reason is that the optimal memory allocation method depends on a memory compression unit size as well as memory alignment and memory burst sizes. After having the 1-D storage model, NEC offered their resources to implement the module and to integrate it into the top of TMuC0.7 decoder. The module can be turn on and off by setting a macro 129 "#define MC_MEMORY_ACCESS_CALC 1" in TypeDef.h. Panasonic and TI inspected it prior to the module distribution to TE2 participants. (Note that current implementation supports the runtime memory access only for Anchor encoding settings, i.e., SIFO with 12tap and AMV res., and DIF without AMV res. Furthermore there are two restrictions: for a SIFO with 12tap and AMV res. case, the number of tap-lengths is not switched as the TMuC document specifies; for a DIF case, the shapes of memory access on diagonal fractional pixel positions are assumed to be rectangular. However, the above restrictions presumably do not affect the assessment of MC read memory access bandwidth reduction obtained by memory compression schemes since Anchor and Proposal MC read memory access bandwidths are computed by the same way.) A template excel sheet containing the memory access bandwidth measurement of the anchor streams generated based on JCTVC-B300.doc was created and distributed to TE2 participants along with the software. Using the module created by NEC, the total MC read memory bandwidth of the anchor streams for high efficiency and low complexity settings are computed based on combinations of DDR address and burst alignment bits, (8,8), (32, 64), (64, 128), (128,128), and (256, 256). Using the same memory bandwidth computation tool provided by NEC, the TE2 participants were requested to provide the total MC read memory bandwidth measurement of the same anchor streams based on each proponent’s proposed memory compression unit size and compression ratio. It was also discussed that having a simple cache model to measure memory bandwidth after the use of cache will be desirable since many video coding solutions in the marketplace use cache memory. Video coding solutions that do not use cache are also widely used in the marketplace, so the cache model needs to have the option of being switched on and off. TI offered their resources to implement a simple cache model on top of NEC’s memory bandwidth measurement module in TMuC0.7 decoder. After a quick informal survey of existing L1 and L2 cache architectures on recent microprocessors and DSPs, a single level cache with following parameters was implemented in TMuC-0.7 decoder with capacity (cache size): 128 KB, line size: 64 bytes, organization: 4-way set-associative, replacement strategy: FIFO (a least Recently Used (LRU) strategy is industry standard, but FIFO is easier to implement), DDR alignment: 64 bits. The aim of the cache model is to have a simple cache simulation that captures the most relevant parameters for comparing bandwidth consumed, with and without memory compression, and that is fair to all block structures used in memory compression. The goal was not to develop a very sophisticated cache model. The software cache model was released TE2 members after it was inspected by NEC and others. The memory bandwidth usage is printed in the same format used by NEC's memory bandwidth measurement module. The code changes are isolated to be in #if MC_MEMORY_ACCESS_CACHE. MC_MEMORY_ACCESS_CALC needs to be enabled before cache can be used. It was noted that lossless aspects of memory compression (e.g entropy coding) may not need to be standardized. The recommendations of the AHG were to: Adopt the developed run-time memory access modules to the main branch of the software with a macro to be turned on or off by compilation Study MC memory access bandwidths of the combination of different minimum inter PU partition sizes and several TMuC MC interpolation schemes Study essential functionalities of memory compression schemes that need to be standardized in order to keep the interoperability of encoders and decoders In the discussion of the report, it was remarked that there may potentially be subjective impairments associated with memory compression that should be considered. It was suggested that a reduction of memory access bandwidth on the order of 50% may be a realistic goal. 130 2.1.1.1.1.1.1.1.8 JCTVC-C008 AHG report: Parallel entropy coding [M. Budagavi, A. Segall (co-chairs)] (missing prior, available first day) Parallel entropy coding tools were proposed in several contributions at the April 2010 JCTVC meeting in response to HEVC CfP. The parallelism proposed in those contributions can be broadly classified into three categories: Bin-level parallelism, which parallelizes the binary arithmetic coder (BAC), as in: o N-bin BAC in JCTVC-A101 o PIPE/V2V in JCTVC-A116 and JCTVC-A120 Syntax element-level parallelism, which parallelizes the BAC, the context modeler, and the binarizer, as found in syntax element partitioning JCTVC-A101. Slice-level parallelism, as in: o Entropy slices, JCTVC-A105 o Interleaved entropy slices, JCTVC-A101 o Sub-streams coding, JCTVC-A114 In the July JCTVC meeting, context processing was identified as an additional bottleneck. The following context parallelization tools were proposed in the July meeting and a Tool Experiment TE8 was established to further study this is more detail. Parallel context processing (PCP) o Coefficient Sign PCP (JCTVC-B088 Section 3.2) o Coeff Level BinIdx 0 PCP (JCTVC-B088 Section 3.3) o Significance map PCP (JCTVC-B088 Section 3.4) o Coding order for significance map on bin decoding throughput (JCTVC B036 Section 2) Results on entropy slices were also presented in the July JCTVC meeting, as in entropy slices for parallel entropy coding (JCTVC-B111). There was no email activity related to this AhG on the JCTVC reflector. However, several AhG members interested in this area were actively participating in proposal/cross-verification of parallel entropy coding within TE8 and TE12. Thirteen related contributions were noted. The AhG recommended to review all relevant input contributions. A question that was discussed was how to measure the amount of parallelism that is achieved. There are some metrics that have been proposed in contributions, and we should study these to determine their acceptability. 2.1.1.1.1.1.1.1.9 JCTVC-C009 AHG report: Screen content coding [J. Xu, W. Ding (cochairs)] (missing prior, uploaded first meeting day) The document summarized the activities of the screen content coding AHG between the 2nd and the 3rd JCT-VC meetings. The mandates of the AHG were to identify and describe use case scenarios including non-camera-view content; investigate and collect appropriate test material including computer screen captures, mixed video and graphics, animated graphics, game content, CAD video, news content, text overlays, etc.; and identify potential needs for action in HEVC standardization. There was very little activity for this AHG during the interim period, and only one related input document was submitted for the 2rd meeting (JCTVC-C276). The topic was discussed and further input was encouraged. MPEG and VCEG also discussed the topic at their co-located meetings. To further progress on this, JCT-VC issued the following 131 resolution to WG11: "The JCT-VC has received input contributions that indicate a potential need to consider the coding of "screen content" (text and graphics mixed into a video source, computer desktop or mobile device display content, scrolling text over video, etc.), which may have different characteristics than the camera-captured content as used in the HEVC CfP. NBs are requested to comment about the relevance of such content to the development of the HEVC standard." 2.1.1.1.1.1.1.1.10 JCTVC-C010 AHG report: Complexity assessment [D. Alfonso (chair), J. Ridge, X. Wen (vice-chairs)] This report summarized the activities of the Ad Hoc Group on Complexity Assessment between the 2nd JCT-VC meeting held in Geneva in July 2010 and the current meeting in Guangzhou. There was moderate activity on the e-mail reflector related to Complexity Assessment since the July meeting. A total of 18 e-mails on the subject were exchanged. It had been pointed out that the concept of "complexity" still lacks a fully satisfactory definition. It had been suggested that computational complexity could be assessed by measuring the CPU clock cycles using the "rdtsc" instruction, but there was no consensus on the reliability of this kind of measure. It had been suggested to use profile tools, e.g. Vtune or oprofile, although no results had been proposed so far. Some simulation results showed a discrepancy between computational complexity measures obtained with the TMuC 0.7 decoder on Linux machines and those from a Windows XP machine running Cygwin. The reason for that had not been determined yet. Several Tool Experiments ran between the last meeting and the current meeting that were aimed at providing both coding efficiency and complexity measurement results. The AHG recommended the JCT-VC experts to review the complexity results considering possible issues with precision, accuracy and reproducibility on different systems. The AHG encouraged further simulation experiments and discussions among JCT-VC experts toward reaching a consensus on suitable complexity definitions and methods to measure it. 3 TE1: Decoder-side motion vector derivation 3.1.1.1.1.1.1.1.1 JCTVC-C115 TE1: Summary report of TE1 activity [Y.-J. Chiu (Intel), Y.-W. Huang (MediaTek), M. Wien (RWTH Aachen Univ.), H. Yu (Huawei)] This contribution summarized the results and various contributions relating to TE1. The contributions and results in this area are further discussed below in this section. 3.1.1.1.1.1.1.1.2 JCTVC-C097 TE1: Huawei report on DMVD improvements in TMuC [M. Yang, S. Lin, D. Wang, J. Zhou, H. Yu (Huawei)] The goal of this Tool Experiment (TE1) was to measure the performance of Decoder-side Motion Vector Derivation (DMVD) techniques in TMuC software. There are two categories in the TE1, one is TE.1a (DMVD Inter Prediction) and the other is TE.1b (DMVD Direct Mode). This contribution continued the work on the Template Matching based DMVD (TM-DMVD) and Spatial and Temporal Direct Mode (STDM) techniques. STDM and TM-DMVD work together in the TE.1a, and STDM works alone in the TE.1b. Both techniques had been implemented into the TMuC 0.7 software and tested following the common test conditions in JCTVC-B300 and JCTVC-B301. This document described these techniques in detail, which included algorithm description, software implementation, coding performance, and complexity evaluation and analysis. According to the test results, the average bit rate savings for the random access and low delay configurations were 3.90% and 2.43%, respectively in TE1.a, and 2.9% and 0.5%, respectively in TE1.b. 132 It was noted that some of the tests for Class A and Class B sequences in the TE1.a random access configuration were not finished yet, and the authors indicated that final results would be updated when these became available. As of the time of writing this report, the latest uploaded version of the contribution still said that the results were not yet finished. TE1.a Results for Random Access were not finished yet when reviewed Current decoder runtime increase to 230% for RA, 300% for LD. "low complexity" method presented by last meeting not implemented in TMuC yet – estimate to bring complexity increase to 160%. TE1.b Current decoder runtime increase to 118% for RA, 129% for LD. Encoder runtime increase negligible AMVP was disabled in all TE1 3.1.1.1.1.1.1.1.3 JCTVC-C128 TE1: Cross-checking of DMVD result from Huawei [Y.-J. Chiu, L. Xu, W. Zhang, H. Jiang (Intel)] This contribution was a cross check of the results of JCTVC-C097 on TE1.a, reporting matching results as far as the results that were available. 3.1.1.1.1.1.1.1.4 JCTVC-C125 TE1: Spatio-temporal direct mode [S. Sekiguchi, Y. Itani (Mitsubishi Electric)] This contribution was a cross check of the results of JCTVC-C097 on TE1.b, reporting matching results. 3.1.1.1.1.1.1.1.5 JCTVC-C138 TE1.a: Implementation report of refinement motion compensation using DMVD on TMuC [M. Ueda (JVC)] In TE1 activity, the tool experiment targets at exploring the performance and value of Decoder-side Motion Vector Derivation (DMVD) and at analyzing the interaction of DMVD with other tools under consideration for HEVC. This proposal focused on the tool "Refinement Motion Compensation (RMC) using DMVD" which belongs to TE1.a: DMVD Inter Prediction. The feature in RMC is that bi-predictive MC blocks can be obtained by transmitting only one motion vector using inter-reference DMVD. In this contribution, the implementation of the basic RMC tool on TMuC0.7 software and result of evaluating the performance were reported. The simulation results reportedly showed that the proposed technique provides 1.8% BD-rate gain for the random access case and 0.0% for the low delay case as tested against the TE1 common anchor under TE1 test conditions. Decoder runtime increased to 138%, encoder runtime to 146% for RA case The method seems not to be working for bi-predictive extrapolation currently Results were relative to TMuC anchors without AMVP 3.1.1.1.1.1.1.1.6 JCTVC-C266 TE1: Cross-check result of DMVD proposal from JCTVC-C138 (JVC/Kenwood) [W.-J. Chien, P. Chen, M. Karczewicz (Qualcomm)] The results of JCTVC-C138 were reportedly confirmed in terms of coding performance and speed. 3.1.1.1.1.1.1.1.7 JCTVC-C124 TE1: Implicit direct vector derivation [Y. Itani, S. Sekiguchi (Mitsubishi Electric)] 133 This contribution reported results of performance verification on implicit direct vector derivation technique that was approved as a part of TE1 at the last (Geneva) meeting. The evaluated technique provides a new design of B-skip and B-direct modes with locally adaptive direct MV derivation relying on a decoder-side decision. A performance evaluation using TMuC software was conducted according to the test conditions defined in TE1, and the results reportedly showed 1.8% coding gain on average for RA case (and up to 3.5% at maximum gain). The increase of decoding time was about 10% relative to the TE1 anchor, and the encoding time was reportedly almost same as that of the anchor. This contribution also provided notes on normative text in the form of modification of the TMuC specification. For LD case, 0.4% BR reduction on average Decoder time increases 13% for RA case and 5% for LC case; no significant increase in encoder run time Comparison against anchors without motion vector competition and without CU merging (skip/direct on) 3.1.1.1.1.1.1.1.8 JCTVC-C024 TE1: Cross-check result of DMVD proposal JCTVC-C124 (Mitsubishi) [T. Yokoyama, K. Nakamura (Hitachi)] This contribution confirmed the results of JCTVC-C124 w.r.t. performance and encoding/decoding time. 3.1.1.1.1.1.1.1.9 JCTVC-C127 TE1: Report of self derivation of motion estimation improvement in TMuC [Y.-J. Chiu, L. Xu, W. Zhang, H. Jiang (Intel)] This contribution presented the techniques of decoder-side motion vector derivation (DMVD) side to increase the coding efficiency of B pictures. With the motion vector (MV) information self derived at the video decoder side, the transmission of these MVs from the video encoder side to the video decoder side is skipped. The proposed self derivation of motion estimation (SDME) techniques were reported to be friendly to parallel implementation by utilizing on the temporal correlation among the available samples in the previously-decoded reference pictures, instead of operating on the previously reconstructed samples of the neighboring area of the current picture, which poses an inherent decoding ordering causal relationship among decoding blocks of current picture. Experiments reportedly demonstrated an overall 4.8% BD bit rate improvement for the test of random access (high efficiency) category under the TE1 test conditions with 73% increase in encoding time and 150% increase in decoding time increase on TMuC 0.7 Reference Software platform. A fast version of the direct-candidate simplified SDME reportedly demonstrated an overall 3.1% BD bit rate improvement with a 26% reduction in encoding time and 18% increase in decoding time when compared to the TE1 anchor. Another version of the rounded-candidate simplified SDME had reportedly demonstrated an overall 3.7% BD bit rate improvement with 10% increase in encoding time and 13% increase in decoding time when compared to the TE1 anchor. Also, the proposed SDME reportedly demonstrated 2.7% BD bit rate improvement over the anchor of TE12 under the random access test category (high efficiency) on a shorter version of the common test video sequences defined for TE12. A description of the normative syntax change in the form of TMuC specification was included. In the discussion, it was remarked that the reduction of encoding time is not specific to DMVD, but could be achieved similarly by appropriate encoder optimization (avoid unnecessary checking of intra modes). 3.1.1.1.1.1.1.1.10 JCTVC-C098 TE1: Cross-checking of DMVD results from Mitsubishi and Intel [S. Lin, D. Wang, M. Yang, H. Yu (Huawei)] The results of JCTVC-C124 w.r.t. performance and encoding/decoding time were reportedly confirmed. 134 The results of JCTVC-C127 w.r.t. performance and encoding/decoding time (x-check for the fast modes not done) were reportedly confirmed. 3.1.1.1.1.1.1.1.11 TE1 summary discussion Aspects remarked upon in the discussion of TE1 included the following: Current TE1 settings were with AMVP (motion competition) off and CU merging off. Gains were reported of 4.8% BR reduction average. New methods were proposed with significantly less decoder complexity than before. Results should be compared against the default TMuC settings (which probably brings the gains down by 2-2.5%). Some of the methods seem to reach a region of more moderate increase in decoding complexity. It was suggested to continue investigation and experiment work on this topic. 4 TE2: IBDI and memory compression 4.1 Summary 4.1.1.1.1.1.1.1.1 JCTVC-C076 TE2: Summary of TE2 on IBDI and memory compression [T. Chujoh (TE coordinator)] This contribution is a summary of activies and results relating to tool experiment 2: IBDI and memory compression as described in JCTVC-B302r2. Nine companies and one university had been registered in TE2 and four proposed tools had been evaluated on the common condition with crossverification. There are two test conditions. One is the case of keeping the bit depth of reference memory to the input bit depth on IBDI and the other is the case of reduction of reference memory access bandwidth and size. A substantial amount of information was in this summary report. The proposed techniques were summarized as shown in the table below: Tool Proponent Cross-checker Method 1 NEC Panasonic, JVC DPCM of 8 samples 2 Panasonic TOSHIBA Hadamard transform and bit-plane coding 3 TI TOSHIBA Wavelet transform and DC prediction by 8x8 block 4 TOSHIBA NEC, Yonsei Unv. Adaptive scaling by 4x4 block and de-scaling by sample The summary report included tables illustrating the impact on measures relating to coding efficiency, complexity, and memory access for HE and LC configurations It was remarked that for the various memory compression schemes in general, the behavior on static regions may be a particular concern. It was also noted that more complex techniques need to be significantly better than 8 bit rounding of IBDI results in order to to be useful. 4.2 1-D DPCM-based memory compression 4.2.1.1.1.1.1.1.1 JCTVC-C093 TE2: 1-D DPCM-based memory compression [H. Aoki, K. Chono, K. Senzaki, J. Tajime, Y. Senda (NEC)] In this contribution, a performance report on the 1-D DPCM-based memory compression method described in JCTVC-B057 was presented. The method was designed to reduce reference frame memory size and memory access bandwidth for motion compensation, with relatively low 135 complexity. The method employs 1-D DPCM. Experimental results reportedly showed that average coding losses are 10.7% for test cases without IBDI and 2.0% for test cases with 12-bit IBDI. Subjective quality degradation caused by such coding loss was reported to be invisible for most test cases. Average increase of decoding time is as small as 3.0% and memory bandwidth reduction ratios are 40.6% for test cases without IBDI and 49.0% for test cases with 12-bit IBDI. Experimental results on bandwidth reportedly showed that bandwidth reduction depends on memory architecture and implementation on mapping of compressed image data onto frame memory. It was remarked that the 2% loss with IBDI seems to be most of the gain that IBDI provides. It was remarked that such a proposal should be compared against just rounding to 8 bits when storing. JCTVC-C094 and JCTVC-C095 are related proposal documents for a new proposal. It was asked whether the decoder is required to do clipping after reconstruction of each sample. The proponent indicated that this is not necessary. Instead, the encoder must perform quantization in a way that avoids reconstruction overflow. 4.2.1.1.1.1.1.1.2 JCTVC-C074 TE2: Cross-check of memory compression results from NEC (JCTVC-B057/JCTVC-C093) [C. S. Lim, V. Wahadaniah, S. Naing, H. W. Sun (Panasonic)] (had been uploaded on the ITU ftp site and was only later put on the new site) The purpose of this document was to present a cross-check of memory compression results from NEC (JCTVC-B057/JCTVC-C093) for Tool Experiment 2 (IBDI and memory compression). The cross-check was reportedly completed successfully and the results from NEC were reportedly reproduced without any problem. 4.3 Reference frame compression using image coder 4.3.1.1.1.1.1.1.1 JCTVC-C073 TE2: Reference frame compression using image coder [C. S. Lim, V. Wahadaniah, S. Naing, H. W. Sun (Panasonic)] This contribution presented results of Tool Experiment 2 (IBDI and memory compression) from the reference frame compression scheme proposed by Panasonic (JCTVC-B103). The presented reference frame compression scheme uses an image coder comprising a transform, scanning, and bit-plane coding. The presented scheme was implemented on TMuC version 0.7.3 revision 188 and experiments were conducted using the common test conditions defined in the Tool Experiment 2 document (JCTVC-B302). The memory compression rate used in the experiments for High Efficiency setting was 12-bit to 5.33-bit (44.4%). In comparison with the TMuC software without reference frame compression, experiment results reportedly show an average coding efficiency drop of 2.9% for the High Efficiency, Random Access setting and an average coding efficiency drop of 4.7% for the High Efficiency, Low Delay setting. The memory compression rate used in the experiments for Low Complexity setting was 8-bit to 5.33-bit (66.7%). In comparison with the TMuC software without reference frame compression, experiment results reportedly show an average coding efficiency drop of 2.9% for the Low Complexity, Random Access setting and an average coding efficiency drop of 4.0% for the Low Complexity, Low Delay setting. It was noted from the results in all test settings that the larger drop in performance occurs in the lower resolution images. It was remarked that the complexity of the proposed reference frame compression method seems relatively high, and the coding efficiency impact is not negligible. 4.3.1.1.1.1.1.1.2 JCTVC-C178 TE2: Cross-check of memory compression results from Panasonic (JCTVC-C074) [A. Segall (Sharp)] (missing prior, available first day) 136 This contribution provided a cross-check of memory compression as proposed by NEC and results provided by Panasonic. This cross-check was performed within the context of TE2. The experimental results reportedly closely matched the rate-distortion results provided by Panasonic. The reported maximum Y BD Rate differed from the provided results by at most 0.1%. The crosschecker indicated that he had read through the software and found it to match the proposal algorithm. 4.4 TI reference frame compression proposal 4.4.1.1.1.1.1.1.1 JCTVC-C060 TE2: TI reference frame compression proposal [M. Zhou, M. Budagavi (TI)] (late IPR declaration prior) This document reported testing results of the reference frame compression (RFC) algorithm proposed by Texas Instruments at the Geneva JCT-VC meeting. The algorithm was integrated into the TMuC-0.7 reference software and tested with the JCT-VC common testing conditions. For the High Efficiency (HE) configurations in which IBDI was enabled, the proposed algorithm reportedly provided 12 bits to 8 bits compression at the cost of an average BD-rate increase of 0.3% for random access configurations and 1.7% for low-delay configurations, respectively. The average motion compensation memory bandwidth was reportedly reduced by amounts ranging from 14.7% to 45.7%. For the low complexity (LC) configurations in which the IBDI is off, the proposed algorithm provided 8 bits to 4 bits compression, with a reported average BD-rate increase of 2.5% for random access LC and 3.4% for low-delay LC, respectively. The memory bandwidth saving is 50% if the growing window is employed. A cross check was reportedly done by Toshiba, but no document was submitted for that, other than a summary in JCTVC-C076. Further study was encouraged. 4.5 Adaptive scaling for bit depth compression on IBDI 4.5.1.1.1.1.1.1.1 JCTVC-C075 TE2: Adaptive scaling for bit depth compression on IBDI [T. Chujoh, T. Yamakage (Toshiba)] In this contribution, results of an adaptive scaling for bit depth compression on IBDI were reported. This was one of the proposals in Tool Experiment 2 on IBDI and memory compression. The purpose of this tool experiment was to improve coding efficiency by increasing internal process of video codec while minimizing reference frame memory access bandwidth and reducing reference frame memory access bandwidth and reference frame memory size. This contribution reported an average 0.8% loss for bit depth compression on IBDI. By design, it was reported that the method should provide a benefit relative to fixed rounding. There was reportedly a substantial impact of fixed rounding to 8 bits for the Class E sequences. A substantial memory access bandwidth reduction was reported. It was remarked that performance may depend on the applied QP / fidelity. Another proposal JCTVC-C077 also related to this, as described elsewhere in this report. 4.5.1.1.1.1.1.1.2 JCTVC-C150 TE2: Cross-check of memory compression results from TOSHIBA (JCTVC-C075) [Y. Choe, S. Hong (Yonsei Univ.), Y-G. Kim (KGIT)] This document reported cross-checking results of adaptive scaling of bit depth compression for IBDI (Internal Bit Depth Increase) coding, which was proposed by Toshiba as described in JCTVCC075. The proposal was evaluated based on the TMuC 0.7 with BUGFIX50TMP software and configuration files which were based on JCTVC-B302r1. The software package was obtained from Toshiba. The verification was reported to have been carried out by encoding the test sequences and decoding the generated bitstreams, collecting PSNR/bit rate numbers and encoding/decoding times and verifying that the numbers generated matched those in the proponent's spreadsheets. 137 The results were reported to be consistent with the data provided by Toshiba. The contribution did not indicate whether the algorithm or software were carefully studied. 4.6 Summary conclusions on TE2 It was agreed that further study of the techniques involved in TE2 was desirable. However, it was agreed not to include these proposed techniques in the test model. 5 TE3: Inter prediction 5.1.1.1.1.1.1.1.1 JCTVC-C220 TE3: Summary report for inter prediction in HEVC [A. Krutz, T. Sikora (Tech. Univ. Berlin)] This document summarized the interim activity relating to TE3 – Inter prediction in HEVC (JCTVC-B303). The TE was separated into four subtests that were Subtest 1 – Warped motion compensated and second order prediction, Subtest 2 – Flexible motion partitioning, Subtest 3 – Multi hypothesis inter prediction, and Subtest 4 – Improved inter prediction with enhanced MC filter. The results that had been generated prior to the writing of this document were summarized. Due to the short period of time, integration in the new software and running the experiments was reportedly a big task. Therefore, not all participants who intended to actively contribute to this TE at the last meeting had shown results for their proposed tools. 5.1.1.1.1.1.1.1.2 JCTVC-C033 TE3: Motion compensation with adaptive warped reference [S. Park, J. Park, B. Jeon (LG Electronics)] This document presented experimental results of TE3 subtest 1, which aimed to improve temporal prediction performance by using warping motion. This tool uses an additional reference picture which is warped version of the original reference picture based on homography relation between the current picture and the reference picture. Using a test set different than the typical common conditions, the experimental results reportedly showed 3.1% overall bit rate savings under ‘low delay & high efficiency’ condition and 3.5% overall bit rate savings under the ‘random access & high efficiency’ condition as compared with TMuC 0.7.3. The results were achieved by adding additional sequences into the common test set. Using only the test set would have resulted in very small measured gain or even loss. The highest gain for the TS was Cactus, 3.1% BR savings. The maximum gain reported gain was for "station 2" LD at 41.1%. The following remarks and observations were made in the discussion of this contribution: Sequences that are added usually do not have much texture and are easy to code. With the current results there may be no reason to take action. The sequences where gain is shown are mostly easy to code (blur, low amount of texture) and were not selected as group test sequences before because of this reason. Further study is recommended, but more evidence should be shown with appropriate test material. 5.1.1.1.1.1.1.1.3 JCTVC-C282 TE3 subtest 1: Cross-check of results from LG [M. Tok, A. Glantz, A. Krutz, T. Sikora (Tech. Univ. Berlin)] The purpose of this document was to cross-check JCTVC-C033 on motion compensation with adaptive warped reference picture as submitted by LG Electronics. The verification task was reported to have been done successfully, with results that matched those proposed in JCTVC-C033. A verification of the source code had not been done. LG Electronics had provided Windows 64 binary executable files and encoded bitstreams for the cross check activity. This sort of "black box" testing does not appear to be a satisfactory cross-check for JCT-VC purposes. 138 5.1.1.1.1.1.1.1.4 JCTVC-C031 TE3 subtest 2: Report on simplified geometry block partitioning [E. Francois, D. Thoreau, P. Bordes (Technicolor), L. Guillo (INRIA)] This document relates to TE3 subset2 "flexible block partitioning". It reported results of Simplified Geometry adaptive block partitioning (SGEO) as proposed initially in document JCTVC-B085. The reported BD-rate results were as follows. Compared to the TE03 reference, SGEO average Y BDrate gainswere, respectively, 3.1% for Random Access High Efficiency, and 2.9% for Low Delay High Efficiency. Compared to GEO, SGEO average BD-rate loss is less than 1% for both Random Access High Efficiency and Low Delay High Efficiency. SGEO average encoding time reduction was reported to be around 20-25% compared to GEO. The decoding time was reported to not be significantly impacted. The following remarks and observations were made regarding this contribution: Current implementation does not support CU merge Encoding time was reported to be around 300% compared to the case of TMuC reference with AMP off with SGEO. The 4% gain appears attractive, but the current complexity is too high. 5.1.1.1.1.1.1.1.5 JCTVC-C170 TE3.2: Report on cross-verification of simplified geometry block partitioning from Technicolor [X. Zheng (HiSilicon), H. Yu (Huawei)] This contribution confirmed the results of JCTVC-C031 to the extent of the cross-check testing that was reported (all of class D were said to have been completed and some other sequences). Only very minute differences were reported, likely due to platform compiler issues. It was reported that the data of Technicolor’s proposal were matched with the cross-verification data. For random_access case, when turning off MRG for the anchor, the ratio of encoding time between Technicolor’s proposal and anchor decrease from 3.2 times to 2.8 times, while the BD-rate saving has slight decrease. For low delay case, when turning off MRG for anchor, the ratio of encoding time between Technicolor’s proposal and the anchor decreased from 2.8 times to 2.6 times, while BD-rate saving had slight increase. 5.1.1.1.1.1.1.1.6 JCTVC-C099 TE3.2: Huawei & HiSilicon report on flexible motion partitioning coding [X. Zheng (HiSilicon), H. Yu (Huawei)] This contribution reported Huawei & HiSilicon's implementation status of flexible motion partitioning on the TMuC platform. Based on the test results achieved so far, the proposed partitioning method can reportedly bring about 1.6% bit rate saving over the TE3.2 anchor for random access high efficiency configuration, and 2% bit rate saving for low delay high efficiency configuration compared against anchors with AMP off. The coding performance can reportedly be further improved with the addition of more flexible partitioning cases. The following remarks and observations were noted in discussion: Encoding time compared to TMuC default with AMP off increased to about 200%. The current complexity is too high. This needs to be further studied. Subjective effects should be studied. 5.1.1.1.1.1.1.1.7 JCTVC-C034 TE3 subtest 2: Cross-check of results from Huawei [E. Francois, P. Bordes (Technicolor)] The purpose of this document was to report cross-checking of the Flexible Block Partitioning tool proposed by Huawei/Hisilicon for TE03 subset 2. The source code had been analyzed and the conformance of the code with the tool description had been verified. The source code had been compiled successfully using Windows32 with VC 2005. The verification task had reportedly been done successfully and the results matched those presented in JCTVC-C099. 139 The cross-check of JCTVC-C099 was performed for classes C, D and E. Results were confirmed with minor mismatches (due to different platforms). Regarding encoding and decoding time comparisons, the test had also been performed on a limited set of cases. The time ratios between FMPHV encoding and default TMuC encoding measured were approximately of 140-150% for both Random Access and Low Delay cases. The time ratios between FMPHV encoding and “TmuC without AMP” encoding measured were approximately of 230-240% for both Random Access and Low Delay cases. Decoding time was almost not impacted. 5.1.1.1.1.1.1.1.8 JCTVC-C236 Report of complexity analysis of geometric partitioning [K. Vermeirsch, J. De Cock, R. Van de Walle (Ghent Univ. - IBBT)] This contribution was a report on a complexity analysis which was carried out at the Multimedia Lab of Ghent University. In this analysis a geometric partition mode is defined in function of a partition boundary line having a certain orientation (angle) and offset from the block center. First, the complexity and compression gains of geometric partitioning were determined as a function of the number of allowed orientations. Secondly, the analysis investigated the histograms of the location of the partition boundary within blocks w.r.t. the block center. Based on these observations, a strategy for complexity reduction can reportedly be put forward. By only testing a subset of partition modes, the complexity can reportedly be reduced by a factor of 40 while retaining over 50% of the gains of the more exhaustive approach. The contribution was considered useful to give hints in reducing encoder complexity in geometric partitioning. 5.1.1.1.1.1.1.1.9 JCTVC-C233 TE3 subtest 3: Local intensity compensation (LIC) for inter prediction [N. Sprljan, S. Paschalakis, P. Wu (Mitsubishi Electric)] This contribution presented the status of implementation of the Local Intensity Compensation (LIC) tool, as a successor to the previously reported Macroblock Weighted Prediction (MBWP), in TMuC 0.7.1. Intensity compensation is done in a form of a locally adapted weighted prediction where parameters are selected for partitions within a prediction unit. The implementation had only been completed partially, and the work on its additional features and fine tuning was ongoing. Although the first results with low complexity configurations show loss of up to 0.7%, it was also observed that LIC can be beneficial for some sequences. This document also described future work which addresses improvements to this method. The contribution was noted – further study of this technique is needed to reach a conclusion about it. 5.1.1.1.1.1.1.1.10 JCTVC-C240 TE3: Cross-check results of local intensity compensation tool from Mitsubishi Electric [D. Karwowski (Poznan Univ. Tech.)] The contribution presented experimental results on coding efficiency and computational complexity for the Local Intensity Compensation (LIC) tool proposed by Mitsubishi Electric. Experiments were done in the context of Subtest 3: Multi-hypothesis inter prediction defined in Tool Experiment 3: Inter Prediction in HEVC. The aim of the experiments was to cross check the results obtained by authors of the method (Mitsubishi Electric). When reviewed, no presenter was available for this document – it was reviewed and seemed to confirm results of JCTVC-C233. 5.1.1.1.1.1.1.1.11 JCTVC-C078 TE3 subtest 4: High accuracy interpolation filter (HAIF) [T. Chujoh, K. Kanou, T. Yamakage (Toshiba)] In this contribution, experimental results of high accuracy interpolation filters were reported. This was one of the proposals in subtest 4 of tool experiment 3 on inter prediction. The purpose of this tool experiment was to improve the performance of inter prediction with enhanced motion compensation filters. The experimental results reportedly showed that the interpolation filter which derives each fractional sample position directly and has more steep frequency characteristics with a long tap length can achieve good performance, and the interpolation filter and the in-loop filter have 140 strong relationship as a filter processing technology. At the nest stage, computational reduction for the interpolation filter should be discussed in this subtest. The following remarks were made: Uses an asymmetric 6-tap filter for ¼ and ¾ positions, and an 8-tap filter for ½ positions and boundaries. This also shows again the dependency of interpolation filter type and ALF gains. Complexity investigation not fully conclusive, e.g. 6/8 filter in some cases has longer runtime than 8-tap filter. Further study is needed. 5.1.1.1.1.1.1.1.12 JCTVC-C165 TE3 subset 4: Cross verification on high accuracy interpolation filter [K. Kondo, T. Suzuki (Sony)] (missing prior, uploaded by first meeting day) This contribution reported cross check results on an enhanced MC filter, called the High Accuracy Interpolation Filter (HAIF), in Tool Experiment3 (TE3) subtest4. In the test, the coding performance and complexity were reportedly measured under common test conditions, which were defined in JCTVC-B300. The proposed tools had been evaluated using the common conditions. Detailed results were summarized in an attached Excel sheet. The contributor received source code of JCTVC-C078 implementation; and confirmed the results with minor deviations (different platforms). For low complexity case, HAIF 8 tap reportedly improved BD rate about 4 % in luma. When it is used together with ALF, the gain is increased to 8 %. It was confirmed that HAIF can improve coding efficiency using together with ALF. Therefore there is a reported benefit to use two techniques together. Encoding time increased about 11 % by HAIF only. Decoding time was increased about 7 %. When HAIF and ALF are used together, encoding time is increased about 15% and decoding time increased about 16 %. 5.1.1.1.1.1.1.1.13 JCTVC-C164 TE3 subset 4: Results on bi/single filter switching in FIF [K. Kondo, T. Suzuki (Sony)] This contribution reported results on enhanced MC filter, Bi/Single filter switching in fixed interpolation filter, in Tool Experiment3 (TE3) subtest4. In the test, the coding performance and complexity were measured under common test conditions, which were defined in JCTVC-B300. Proposed tools had been evaluated on the common conditions. Detailed results were summarized in the attached Excel sheet. The following observations were reported: Decoder run time is reported not to change Encoder run time changes to 60% in case of HE Encoder runtime reduction – certainly due to single-pass operation. For LC, 6-tap filter shown to give approximately 1% BR reduction compared to DCT-IF 6tap. Further study was encouraged. 5.1.1.1.1.1.1.1.14 JCTVC-C247 TE3 subset 4: Cross-verification on bi/single filter switching in FIF [T. Chujoh, K. Kanou, T. Yamakage (Toshiba)] (missing prior, available first day) 141 Small differences were reported in cross-checking JCTVC-C164. However, the results werenearlythe same. Different OS platform usage seemed to explain the differences. The contribution did not describe wether the algorithm and software were carefully studied. 5.1.1.1.1.1.1.1.15 Conclusion on TE3: The following remarks and suggestions were recorded for TE3. TE 3.1 (warping) – further study (no experiment establishment needed) TE 3.2 (geometric partitioning) – make a separate experiment (2 methods plus AMP) TE 3.3 (illumination) – further study. TE 3.4 (interpolation filters) – an experiment on interpolation filters will be needed anyway 6 TE4: Variable length coding 6.1.1.1.1.1.1.1.1 JCTVC-C261 TE4: Summary of TE4 on variable length coding [X. Wang (Qualcomm)] This document summarized the activities in the Tool Experiment TE4 on Variable Length Coding (VLC). A group of five companies and universities had registered for participation in TE4. One TE report, JCTVC-C262, had been submitted on VLC coding for coded block flags. 6.1.1.1.1.1.1.1.2 JCTVC-C262 TE4: Report on VLC for coded block flag [X. Wang, M. Karczewicz, W.-J. Chien (Qualcomm)] Document JCTVC-C262 reported TE results based on a coding scheme presented in a previous proposal JCTVC-B098. In JCTVC-B098, coding schemes for coded block flag both at CU level and at block level were proposed. Due to the fact that the proposed coded block flag coding at CU level is very close to the one introduced in "phase 2" VLC integration, JCTVC-C262 further reported results of coded block flag coding only at block level. According to the proposal, if a coded block flag for a certain video component at the coding unit level is not zero and the transform block size is smaller than the coding unit size, the coded block flags from the four quarter-sized blocks are grouped and coded together. Due to coded block flag at the coding unit level, the coded block flags for the four quarter-sized blocks cannot all be zero. As a result there are 15 different combinations in total for these flags. A new VLC table was designed and tested. It was reported that on average around 0.2% coding gain can be obtained using the three low complexity coding configurations. It was noted that the proposal has an interaction with the use of a residual quadtree. The proposal does not apply if residual quadtree encoding with more than two levels is used. The proposal only applies to the LCEC entropy coding scheme. The coding gain is rather minor, but the proposal is a small change. It was suggested that if the proposal is adopted, it should be possible to switch it off in the software, to ease future testing. Decision: It was agreed to adopt the proposal into the TMuC / TM (provided it does not conflict with other adoptions). See also the discussion of JCTVC-C277 and JCTVC-C319. 6.1.1.1.1.1.1.1.3 JCTVC-C288 TE4: Verification results of JCTVC-C262 [T. Yamakage (Toshiba)] (missing prior, available first day) This contribution shows verification results of JCTVC-C262 (Report on VLC for coded block flag) for TE4. The contributor started with source code and compiled it themselves to produce executables, but did not study the software algorithmically. Results were reported to be slightly different but not significantly, relative to JCTVC-C262. No mismatch between the encoder and decoder was observed. Encoding and decoding time comparison with the TMuC 0.7.4 anchor (including macro change) showed approximately no impact. 142 7 TE5: Simplified TMuC intra prediction 7.1.1.1.1.1.1.1.1 JCTVC-C046 TE5: Summary report on simplification of unified intra prediction [T. K. Tan (NTT DoCoMo), M. Budagavi (TI), J. Lainema (Nokia)] This is the summary report for the TE5 on Simplification of Unified Intra Prediction. The description of the experiment can be found in JCTVC-B305r1. This tool experiment was successfully conducted and verified the simplification proposed in JCTVC-B093 and JCTVC-B118 within the context of the unified intra prediction method proposed in JCTVC-B100. Simulation results reportedly show that both methods do not result in a significant loss in coding efficiency compared to the TMuC 0.7. In the angular intra prediction method, two arrays of reference samples are used, corresponding to a row of samples lying above the current block to be predicted, and a column of samples lying to the left of the same block. Given a dominant prediction direction (horizontal or vertical), one of the reference arrays is defined to be the "main" array and the other array the "side" array. In the prediction process, only samples from the main array are used for prediction when the angle value is positive. When the angle value is negative a per-sample test is performed to determine whether samples from the main or the side array should be used for prediction. Additionally when the side array is used, the index into the array is obtained by a mapping operation that requires either a division operation or a table look-up into a sizable table. JCTVC-B093 simplifies the prediction process when the angle value is negative. The main array is extended by projecting samples from the side array onto it according to the prediction direction. This projection entails copying a subset of the samples from the side array into the main array. Note that in the copy process, the inverse angle may be replaced with a lookup table. During the prediction process, only the extended main array is used and the same simple linear interpolation formula is used to predict all samples in the block. JCTVC-B118 replaces divisions in angular intra prediction with a lookup table and additions. All six coding conditions cases were tested. Decision: After discussion of the relevant contributions, it was agreed to adopt these two simplifications into the TMuC and TM. 7.1.1.1.1.1.1.1.2 JCTVC-C042 TE5: Results for simplification of unified intra prediction [T. K. Tan, Frank Bossen (NTT DoCoMo)] This contribution reported simulation results for Tool Experiment 5 as described in JCTVC-B305r1. Two tools were compared. The first tool, based on JCTVC-B093, simplifies the angular intra prediction by eliminating the sample-based checking and division steps. The second tool, based on JCTVC-B118, replaces the division in angular intra prediction with a lookup table and additions. The results reportedly confirmed that both tools do not change the objective performance of the angular prediction process in any significant way. However, the simplified angular prediction (JCTVC-B093) provides the additional advantage of removing bottlenecks in the reference TMuC algorithm to enable a lower-complexity method for intra prediction for which parallel processing can be achieved for all prediction angles on typical SIMD architectures. 7.1.1.1.1.1.1.1.3 JCTVC-C061 TE5: TI evaluation of unified intra prediction simplifications [M. Budagavi (TI)] In HEVC TMuC-0.7 Intra prediction, pixels in current block are predicted from their neighboring left (side reference) and top pixels (main reference) using angular intra prediction. Prediction of pixels from side reference involves division of angle which is computationally complex to implement. Two proposals were made at the last JCT-VC meeting to simplify intra prediction from side reference – JCTVC-B118 and JCTVC-B093. This contribution presents results of evaluation of these two proposals for simplification of intra prediction. The simplifications are achieved using the following two tools: division elimination (JCTVC-B118, JCTVC-B093) and fractional pixel 143 position calculation optimization (JCTVC-B093). Based on the results presented in this contribution, the contributor recommended that these two tools be adopted into TMuC and Test Model (TM). 7.1.1.1.1.1.1.1.4 JCTVC-C100 TE5: Huawei & HiSilicon report on verification test results [L. Liu (HiSilicon), J. Zhou, H. Yu (Huawei)] From the results it was confirmed that the proposed methods do not result in any significant difference in coding efficiency compared to the TMuC 0.7. Both simplified angular prediction (JCTVC-B093) and JCTVC-B118 successfully eliminate the division using a lookup table. Simplified angular prediction also has the additional benefit of being more suited for typical SIMD architectures due to the single reference array and a single equation for the creation of all predicted samples in the block. It was recommended that the division elimination and sample creation method of simplified angular prediction be added to the TMuC and Test Model (TM). 7.1.1.1.1.1.1.1.5 JCTVC-C186 TE5: Results for simplified intra prediction tests by Nokia [J. Lainema, K. Ugur (Nokia)] This document reported Nokia results for the TE5 tests and verifies the correctness of the results submitted by other TE participants. This contribution was based on a separate independent implementation of the proposal. 8 TE6: Intra prediction 8.1.1.1.1.1.1.1.1 JCTVC-C172 TE6: Summary report of intra prediction improvement tool experiments [A. Tabatabai (Sony)] This contribution was a summary of Tool Experiment 6, Intra Prediction Improvements. From a total of 6 proposed toolsets, 4 had been evaluated by at least one organization. For cross checking, the recommended test conditions of intra-only were used for both high compression efficiency and low complexity as defined in the document JCTVC-B306_r3. The tools tested in this tool experiment were as shown in the table below: 144 Tool Technology TE6.a.1 Weighted Bidirectional Prediction (WBP) and Adaptive Subblock Coding Order (ASCO) TE6.a.2 Short distance intra prediction Proponent Toshiba JCTVC-B042 JCTVC-C079 Cross-checker(s) Sharp JCTVC-C179 Renesas JCTVC-C187 Huawei & Hisilicon JCTVC-B040 JCTVC-C101 TE6.a.3 Overlapped block intra prediction TE6.a.4 Line-based intra prediction MediaTek JCTVC-A109 JCTVC-C193 Microsoft JCTVC-A118 JCTVC-C270 Sony JCTVC-B109 JCTVC-C169 LGE JCTVC-C028 DOCOMO JCTVC-C045 Ericsson JCTVC-C157 Microsoft JCTVC-C271 Microsoft JCTVC-C272 TE6.b.1 Differential coding of intra modes (DCIM) 8.2 Huawei JCTVC-C139 DOCOMO JCTVC-C045 Sharp JCTVC-C175 Panasonic JCTVC-C217 TE6.a.1 Weighted Bidirectional Prediction (WBP) and Adaptive Sub-block Coding Order (ASCO) 8.2.1.1.1.1.1.1.1 JCTVC-C079 TE6 subset a: Bidirectional intra prediction [T. Shiodera, A. Tanizawa, T. Chujoh, T. Yamakage (Toshiba)] This contribution presents the detailed experimental results of Bidirectional Intra Prediction (BIP) for Tool Experiment 6 on intra prediction improvement. BIP was included in the CfP submission of JCTVC-A117 and was proposed in the contribution of JCTVC-B042 at JCT-VC Geneva meeting. In this document, experimental results using TMuC software under the common test conditions defined by JCTVC-B300 were reported. For the I slice only coding structure, the average BD-rate gain was 2.1% on low complexity conditions and 2.3% on high efficiency conditions without increase of the encoding time and the decoding time. The reason that the encoding time was not increased was noted to be fast mode decision and turning off some tools that were enabled in the reference configuration – adaptive intra smoothing (AIS) and edge-based prediction. This feature would ordinarily be expected to increase encoding complexity. It was remarked that JCTVC-C207 encoding optimization improves efficiency by 0.8% with only a fast mode decision technique without adding new tools. Further study was encouraged. 8.2.1.1.1.1.1.1.2 JCTVC-C179 TE6: Cross-check of bi-directional intra-predition results from Toshiba [A. Segall (Sharp)] (missing prior, available first day) This contribution cross-checks the bi-directional intra-prediction results provided by Toshiba. This cross-check was performed within the context of TE6 for "Intra high-performance" and "Intra lowcomplexity" configuration settings. The experimental results reportedly closely matched the ratedistortion results provided by Toshiba. The maximum Y BD Rate differs from the provided results by at most 0.1%. The Toshiba software source code was used, and the software was reportedly reviewed in detail. 145 8.2.1.1.1.1.1.1.3 JCTVC-C187 TE6.a: Cross-check of bidirectional intra prediction (JCTVCB042) [K. Iwata (Renesas)] This document described results of cross-checking the software containing the tool of bidirectional intra prediction (BIP) proposed by Toshiba Corporation. The verification task was reported to have been done successfully and the results matched with proponent’s data. The Toshiba software source code was used (without studying the software algorithm). 8.3 TE6.a.2 Short distance intra prediction 8.3.1.1.1.1.1.1.1 JCTVC-C101 TE6.a: HiSilicon report on short distance intra prediction [C. Lai, Y. Lin (HiSilicon)] This document reported the Tool Experiment TE6.a activities related to the evaluation of short distance intra prediction (SDP). Based on the test results achieved so far, line based SDP was reported to provide about 0.9% of bit rate reduction over TMuC anchors for the Intra LC configuration with comparable encoding and decoding time. The proposed tool includes two kinds of 1x16, 16x1, 2x8, 8x2 and "resample based" prediction. The "resample based" prediction was not tested, and the HE configuration context was not tested due to software development difficulties. It was remarked that the LC configuration context may not be the best in which to evaluate such a technique, since it seems to have substantial complexity impact. Further investigation was encouraged. 8.3.1.1.1.1.1.1.2 JCTVC-C028 TE6: Verification results of Huawei and Hisilicon proposals (short distance intra prediction using the correlation between lines and pixels) [J. Kim , B. Jeon (LG Electronics)] This document reported verification results of JCTVC-C101 on short distance intra prediction. The proponents provided their source coded which was compared under the condition of intra low complexity condition, and the results reportedly matched those of JCTVC-C101. 8.3.1.1.1.1.1.1.3 JCTVC-C045 TE6.a: Cross verification for line based intra prediction [T. K. Tan, Junya Takiue (NTT DoCoMo)] This contribution reported simulation results and cross verification of subset Tool Experiment 6.a as described in JCTVC-B306. This contribution documents the findings based on the study and simulation conducted using the software provided by Huawei and Microsoft. This contribution was a cross-check verification for both JCTVC-C101 and JCTVC-C270. Software was received from the proponents, and was somewhat studied to identify the algorithmic differences. In the case of JCTVC-C101 (JCTVC-B040) verification, the software changes were more difficult to identify and study, due to not being controlled by a macro switch. The results reportedly closely matched those reported for JCTVC-C101 and JCTVC-C270. The contributor remarked that deeper analysis of the changes made in these proposals in regard to the prediction process and to the difference coding process was desirable. The contributor also remarked that it would be beneficial to study harmonization of JCTVC-C101 and JCTVC-C270 as well as refinement of each scheme itself. 8.3.1.1.1.1.1.1.4 JCTVC-C157 TE6.a: Cross verification of line-based intra prediction [J. Samuelsson, R. Sjöberg, K. Andersson (Ericsson)] This contribution contained cross-checking results of Line-based Intra Prediction (LIP) presented by Huawei in JCTVC-B040 and later implemented in TMuC v7.0. Simulations run for the Intra Only Low Delay test case reportedly showed an average BD-rate reduction of 0.9%, matching what was reported in JTCVC-C101. The software was studied to determine that it matched the described algorithm. 146 8.3.1.1.1.1.1.1.5 JCTVC-C271 TE6: Cross-verification of HiSilicon's short distance intra prediction by Microsoft [X. Peng, J. Xu (Microsoft)] This document summarized the Tool Experiment TE6 activities related to the cross verification of short distance intra prediction by HiSilicon. The source code provided by HiSilicon was reported to match with their document description, except that the resample-based intra prediction was disabled. Only intra prediction by exploring the correlations between lines was enabled. In prediction, four modes were allowed on 16x16 and 8x8 blocks, which are 1x16, 16x1, 2x8 and 8x2. Each line had its own prediction mode among nine, which needs to be coded in the bit stream. After prediction, a DCT is applied on the line before it is quantized. The results reportedly essentially matched those reported in JTCVC-C101. 8.4 TE6.a.3 Overlapped block intra prediction 8.4.1.1.1.1.1.1.1 JCTVC-C193 Overlapped block intra prediction [K. Zhang, X. Guo, J. An, M. Guo, Y.-W. Huang, S. Lei (MediaTek)] This contribution presented the results of overlapped block intra prediction (OBIP) in Tool Experiment TE6.a. In the proposed OBIP method, a position-dependent weighted sum of several possible predictors is treated as the final prediction. The weighting values are obtained by offline training and are fixed during the encoding/decoding procedure. The OBIP method had been implemented into TMuC software. Experimental results reportedly showed that OBIP can attain an average bit-rate reduction of 1% and 1.7% for all-intra high efficiency and low complexity, respectively. The technique uses a position-dependent weighted combination of three predictors. The weighting combination is selected from one of nine weighting matrices. Only 4x4 and 8x8 blocks use this (as implemented). Edge-based prediction and planar prediction were turned off (which reportedly has essentially negligible PSNR impact) for reasons of tool incompatibility. The non-overlapped modes were retained in the scheme as well as the additional OBIP modes. Further study was proposed, and encouraged. 8.4.1.1.1.1.1.1.2 JCTVC-C272 TE6: Cross-verification of MediaTek's overlapped block intra prediction by Microsoft [X. Peng, J. Xu (Microsoft)] (missing prior – later provided) This contribution reported cross-verification test results for OBIP as proposed in JCTVC-C193. The results reportedly matched those reported in JCTVC-C193. The software was studied as well as compiled and run. 8.5 TE6.a.4 Line-based intra prediction 8.5.1.1.1.1.1.1.1 JCTVC-C270 TE6: Report of line-based coding [X. Peng, J. Xu, F. Wu (Microsoft)] This document reported experimental results of the line-based coding scheme, which was first presented in JCTVC-A118. The line-based coding takes a line as the reconstruction unit and utilizes content-adaptive prediction filters for accurate predictions. When integrated into the TMuC0.7 software, it reportedly showed 2.1% and 2.4% bit rate saving on average, under all intra high efficiency and low complexity conditions, respectively. Up to 5.1% bit rate saving was reportedly achieved on sequences with rich edges. Line lengths of 4, 8, and 16 were tested. Syntax is not used to indicate the prediction parameters – rather, they are derived based on the reconstructed area. The reported encoding and decoding times were approximately doubled or tripled. It was remarked that the complexity impact of the proposed scheme is substantial. Further study was encouraged. 147 Also see remarks above regarding JCTVC-C101. 8.5.1.1.1.1.1.1.2 JCTVC-C045 TE6.a: Cross verification for line based intra prediction [T. K. Tan, Junya Takiue (NTT DoCoMo)] This contribution was also described above. It contained cross verification reporting for both JCTVC-C101 (JCTVC-B040) and JCTVC-C270. 8.5.1.1.1.1.1.1.3 JCTVC-C139 TE6.a: Huawei & HiSilicon report on cross verification of MSRA's LIC [C. Lai, Y. Lin (HiSilicon), J. Zhou (Huawei)] This contribution provided a summary of the cross-checks performed by Huawei Technologies and Hisilicon Technologies on MSRA’s LIC (Line Intra Coding) proposal. The simulation results reportedly exactly matched the results in terms of bit rate and PSNR as provided by Microsoft. 8.6 TE6.b.1 Differential coding of intra modes (DCIM) 8.6.1.1.1.1.1.1.1 JCTVC-C169 TE6.b: Experiment results of DCIM [E. Maani, L. Dong, W. Liu (Sony)] This document presented experimental results of Differential Coding of Intra Modes (DCIM) under the test conditions defined for Tool Experiment 6. DCIM uses neighborhood edge estimation to predict Intra prediction direction and differentially encodes the modes with respect to the predicted direction. This enables a higher precision in the Intra prediction directions without substantially increasing the mode signaling overhead. An additional bit is transmitted per block (i.e., PU) to signal to the decoder weather DCIM is used. MDDT was disabled due to tool implementation compatibility issues, although ROT was used. Approximately 1.9% overall gain was reported. For the LC configuration, the work was affected by a bug (bug fix 79) in the underlying TMuC software. The cross verifications with the bug fix had not yet been completed at the time of presentation. Encoding time is approximately doubled (possibly somewhat slowed by increased use of ROT due to disabling of MDDT); decoding time is reportedly decreased by about 40% (due to disabling of MDDT). The contributor recommended further study, which was encouraged. 8.6.1.1.1.1.1.1.2 JCTVC-C175 TE6.b: Cross verification of differential coding of intra modes (DCIM) [T. Yamamoto, Y. Yasugi (SHARP)] (missing prior, available first day) This contribution provided cross-check results of the method called Differential Coding of Intra Modes (DCIM) proposed by Sony in the context of TE6 (TE6.b: Edge based intra prediction). The experiments defined in TE6 were carried out using the software provided by Sony. The experimental results reportedly showed that the proposed method provides 1.9% gain in High Efficiency / Intra Only operation. It was reportedly confirmed that these results matched the results provided by the proponent. Testing for the LC configuration context had encountered difficulties as described above (bug fix 79). It was suggested that perhaps, in general, we should not use the "LC" configurations to test techniques that require a substantial complexity increase. 8.6.1.1.1.1.1.1.3 JCTVC-C217 TE06.b: Cross verification of differential coding of intra modes (DCIM) [V. Drugeon (Panasonic)] This contribution was a cross-check for differential coding of intra modes (DCIM). Partial results were provided. The results reportedly closely matched those reported by Sony. The software was studied and determined to match the algorithm description as proposed in July. 148 8.6.1.1.1.1.1.1.4 JCTVC-C176 Analysis and improvement of differential coding of intra modes [T. Yamamoto, Y. Yasugi (Sharp)] This contribution related to the DCIM technology tested in TE6. Edge based intra prediction is a method to utilize the gradient of the neighboring samples to decide the direction to predict the target partition. Since one of its variants has been incorporated in the TMuC and the other variant (DCIM: Differential Coding of Intra Modes) was being evaluated in a TE, the Edge based intra prediction was one of the actively researched areas. In this contribution, some analysis of DCIM was presented for providing a better understanding of the technology. Also, some modification and (limited) experimental results were reported – with reported benefits of approximately 0.2% in HE and 0.7% in LC configurations. It was asked whether the parsing of the bitstream is dependent on the values of the reconstructed samples. The contributor said that it does not. 9 TE7: MDDT Simplification 9.1.1.1.1.1.1.1.1 JCTVC-C027 TE7: Summary report for MDDT Simplification [R. Cohen (Mitsubishi Electric Research Laboratories), C. Yeo (I2R), R. Joshi (Qualcomm) The purpose of Tool Experiment 7 (TE7) was to explore the performance and value of simplifying Mode-Dependent Directional Transforms (MDDT) within the HEVC Test Model under Consideration (TMuC). The five proposals submitted for TE7 along with their corresponding crossverification documents are summarized in the table below. Proponent Proposal TE7 Cross-Checks Other Remarks Doc. Doc. related if no contributions match I2R B024 C037 C070 C090 C158 C039 (Zhejiang (Peking (Ericsson) Univ.) Univ.) Huawei B039 C102 C038 C192 C106 (I2R) (Yonsei Univ., KGIT) Toshiba B042 C080 C035 C103 (MERL) (Huawei) Beijing B073 C305 Not Univ. available Tech. prior to meeting, so crosscheckers were reassigned. Peking B102 C089 C027 C081 C190 Some Univ. (LGE) (Toshiba) (ETRI) crosscheckers reported crashes, but otherwise results match. 149 The proposed schemes generally use a combination of the following techniques to simplify MDDT: Exploit symmetry of Intra prediction directions Reduce the number of different transforms used Use of lower-complexity transform architectures Reordering of residuals and reduction of scan orders Training of new transforms for use with the above techniques Abstracts for each of the proposals are as tabulated below: 150 Propone Doc. nt I2R JCTV CB024 Huawei JCTV CB039 Toshiba JCTV CB042 Beijing Univ. Tech. JCTV CB073 Peking Univ. JCTV CB102 Tool Abstract Mode-Dependent Directional Transform (MDDT) was previously introduced to improve transform coding of intra-predicted blocks, and is now a component of the Test Model under Consideration (TMuC). This proposal presents a simplification of the MDDT scheme that requires only two transform matrices: a DCT and a derived KLT. The derived 4x4 KLT also has a structure that can be exploited to reduce the operation count of the transform operation. Derivation of the KLT is based on an assumed image correlation model. Experimental results reportedly show that the proposed technique matches the performance of MDDT even though the approach requires no training, and has significantly lower computation and storage costs. Directional textures exist in the residuals after directional intra predictions. Based on the symmetry of directions, the residual blocks of all directional prediction modes are classified into several groups, with the group number being much less than the number of directional prediction modes. Then, separable KLT is obtained and applied to each group. Simulation results reportedly show that the proposed SMDDT (Simplified MDDT) can use fewer transform matrices to achieve the same or even slightly better coding performance than MDDT in KTA software. 1 Dimensional Directional Unified Transform (1DDUT) is a spatial transform scheme for intra coding. 1DDUT has additional two 1D directional transform matrices (type A and B) compared with AVC. The transform matrices are pre-determined. Three combinations of these two 1D directional transform matrices are also pre-determined each intra prediction mode. After intra prediction, the residual signal is transformed using pre-determined set of 1D directional transform matrices each prediction mode. And transform coefficients are converted into 1D signals from 2D signals according to the adaptive coding scan, which is a same as the scheme of MDDT. Mode Dependent Directional Transform (MDDT) can improve the coding efficiency of AVC but it also brings high computation complexity. In this contribution, a new design for implementing fast MDDT transform through integer lifting steps is presented. First, MDDT is approximated by a proper transform matrix that can be implemented with butterfly-style operation. Then the butterfly-style transform is factored into a series of integer lifting steps to eliminate the need of multiplications. Experimental results reportedly show that the proposed fast MDDT can significantly reduce the computation complexity while introducing negligible loss in the coding efficiency. Due to the merit of integer lifting steps, the proposed fast MDDT is reversible and can reportedly be implemented on hardware easily. To further simplify the mode-dependent directional transform (MDDT), a mode-dependent residual reordering (MDRR) method is proposed. In the proposed MDRR, between the prediction and transform stages, a certain kind of reordering is implemented on the residual samples for each mode in spatial. The reordering is manipulated in a way that the distribution statistics of reordered residual samples present less modedependent characteristic. After the reordering, the nine Intra modes in 151 I4MB and I8MB modes will be assigned into three groups, and only one transform matrix is assigned for each group. With the proposed MDRR, the number of transform matrices in MDDT is significantly reduced, while negligible coding performance difference is observed between the MDDT and the proposed scheme. 9.1.1.1.1.1.1.1.2 JCTVC-C310 TE7: Summary report for MDDT Simplification [Robert Cohen, Chuohao Yeo , Rajan Joshi] (late reg.) This summary report document provided a brief summary and table of documents related to TE7 on MDDT simplification. 9.2 I2R proposed simplification of MDDT 9.2.1.1.1.1.1.1.1 JCTVC-C037 TE7: Results for mode-dependent fast separable KLT for block-based intra coding [C. Yeo, Y. H. Tan, Z. Li, S. Rahardja (I2R)] To simplify the operation of Mode Dependent Directional Transform (MDDT), a Mode-Dependent Fast Separable KLT for Block-based Intra Coding was previously introduced that requires only two transform matrices: a "DCT" and a "derived KLT". The derived 4x4 KLT also reportedly has a structure that can be exploited to reduce the operation count of the transform operation. This contribution provides test results as stipulated in TE7. Experimental results reportedly show that the proposed technique matches the performance of MDDT even though the approach requires no training, and has lower operation count and storage costs. It was remarked that the "derived KLT" is essentially the same as a form of DST. The proposal uses two transforms per block size instead of 18. The scheme was applied to two block sizes: 4x4 and 8x8. A coding loss of 0.1% was reported overall for HE intra-only conditions relative to the anchor (which used MDDT). It was remarked that an adaptive scan order is part of the scheme. It was asked how much the coding loss would be if using just one transform. We don't know the answer to that. Roughly 1-2% was suggested as a reasonable guess. 9.2.1.1.1.1.1.1.2 JCTVC-C070 TE7: Cross-check for I2R proposal on mode-dependent fast separable KLT for block-based intra coding [X. Zhu, L. Yu (Zhejiang Univ.)] This contribution presented cross check results of I2R proposal on mode-dependent fast separable KLT for block-based intra coding. Due to lack of time, not all data was checked – only some randomly selected points had been checked. The sequences checked reportedly had the same PSNR and bit rate with the corresponding data given by proponent. The source code was studied and determined to match the proposed algorithm. 9.2.1.1.1.1.1.1.3 JCTVC-C090 TE7: Cross-verification results of mode-dependent fast separable KLT for block-based intra coding from I2R [X. Zhao, L. Zhang, S. Ma, W. Gao (Peking Univ.)] This contribution reported Peking Univ. cross-verification results of the I2R proposal on modedependent fast separable KLT for block-based intra coding in response to the Tool Experiment TE7. Some minor differences were observed between Peking Univ. and I2R results, which could be induced by the different platforms used (Peking Univ.: WindowXP, I2R: Linux). However, the average rate-distortion performance results of I2R and Peking Univ. were reported to be equal. 9.2.1.1.1.1.1.1.4 JCTVC-C158 TE7: Cross verification of MDDT simplification JCTVC-B024 [K. Andersson (Ericsson)] This contribution was a cross-verification of the TE7 MDDT Simplification that was proposed in JCTVC-B024. The PSNRs, bit rates and BD-rate measurements provided by IR2 were matched. 152 The result reportedly shows that the BD bit rate loss is 0.1% and 0.0% for high efficiency configurations all intra and random access respectively compared to the anchor TMuC 0.7 reference configuration. The encoding and decoding time were approximately unaffected. 9.2.1.1.1.1.1.1.5 JCTVC-C039 Choice of transforms in MDDT for unified intra prediction [C. Yeo, Y. H. Tan, Z. Li, S. Rahardja (I2R)] A simplification of Mode-Dependent Directional Transform (MDDT) was previously proposed in JCTVC-B024 that requires only two transform matrices: a "DCT" and a "derived KLT". However, the choice of transforms in JCTVC-B024 was only specified for the 9 intra prediction modes present in AVC. In this proposal, a choice of transforms is presented for the 34 intra prediction modes possible in the unified intra prediction in TMuC. As in the earlier proposal, only two transform matrices are necessary, and no training is required. This document provided information on how the proposed scheme was harmonized with Unified Intra Prediction (adding more prediction direction types to the transform selection table) for use in TE7. The testing performed in TE7 used this scheme. 9.3 Huawei proposed simplification of MDDT 9.3.1.1.1.1.1.1.1 JCTVC-C102 TE7: Symmetry-based simplification of MDDT [H. Yang, J. Zhou, H. Yu (Huawei)] In the last meeting, symmetry-based simplification of MDDT (SMDDT) was proposed to reduce the number of transform matrices in the current MDDT implementation. And the SMDDT technique was included in TE 7 for further investigation. According to the TE7 specification, SMDDT has been implemented in the TMuC software and evaluated under the common test conditions since the last meeting. This document reported the performance of the proposed SMDDT technique and discussed the complexity. Five transforms, with flipping and transposing, were used. Fewer scan orders were used in this proposal than in the I2R proposals. It was claimed that this proposal requires less data storage. Overall, a 0.1% bit rate increase was reported. 9.3.1.1.1.1.1.1.2 JCTVC-C038 TE7: Cross-check results of MDDT simplification proposal from Huawei [C. Yeo, Y. H. Tan, Z. Li, S. Rahardja (I2R)] This contribution provided a summary of the cross-checks performed by the Institute for Infocomm Research on the Huawei MDDT simplification proposal. The simulation results reportedly closely match the results provided by Huawei. The software source code was studied algorithmically as well as executed. 9.3.1.1.1.1.1.1.3 JCTVC-C192 TE7: Cross-verification results of Huawei proposal on simplified MDDT for intra prediction residual [Y. Choe, J. Kim (Yonsei Univ.), Y-G. Kim (KGIT)] This document reported cross-verification results of simplified MDDT (SMDDT) for intra prediction residual proposed by Huawei, according to the description of TE7. The verification task had been done successfully and the results reportedly match almost exactly those provided by Huawei. The algorithm was studied as well as executing the software. 9.3.1.1.1.1.1.1.4 JCTVC-C106 Symmetry-based scan order sharing scheme for MDDT [H. Yang, J. Zhou, H. Yu (Huawei)] This proposal was used in the TE for the scheme proposed in JCTVC-C102. However, it can reportedly also apply in the context of other schemes and was therefore proposed separately in this contribution. In this proposal, a symmetry-based scan order (SSO) sharing scheme was proposed to further simplify the mode-dependent directional transform (MDDT). Based on the symmetry among all 153 intra prediction directions, the number of scan maps can reportedly be reduced by the proposed SSO while the coding performance is kept unchanged. The proposal suggested that the proposed SSO scheme be adopted as an improvement of the current MDDT implementation in TMuC. When implemented in the current MDDT implementation in TMuC, it was reported that this scan order method results in a memory savings with no performance loss. Disabling scan order adaptivity reportedly resulted in very little compression performance loss (0.0% to 0.1%). It was remarked that if the gain from scan order adaptivity is that small, perhaps we shouldn't use it (in the HHI coefficient coding scheme as well). The contributor suggested that a harmonization of simplified MDDT schemes may also be feasible. Further study was encouraged. 9.4 Toshiba proposed simplification of MDDT 9.4.1.1.1.1.1.1.1 JCTVC-C080 TE7: One-dimensional directional unified transform [A. Tanizawa, J. Yamaguchi, T. Shiodera, T. Chujoh, T. Yamakage (Toshiba)] This contribution presented experimental results of 1 Dimensional Directional Unified Transform (1DDUT) for Tool Experiment 7 (MDDT simplification in HEVC). 1DDUT is a method of reducing the number of transform matrices of MDDT by using a set of pre-defined 1 D transform matrices depending on the intra prediction direction. 1DDUT was included in the CfP submission of JCTVC-A117 and the improved method was proposed in the contribution of JCTVC-B042 at JCTVC Geneva meeting. In this document, the experimental results in TMuC software version 0.7.1 on the high efficiency conditions based on the common test conditions for both I slice only coding structure and the random access coding structure defined by TE7 were reported. For I slice only coding structure, the BD-Bit rate gain compared with the anchor (MDDT on) was reported to be 0.1% on average, and for the random access coding structure, the BD-Bit rate gain was reported as 0.0% on average. 9.4.1.1.1.1.1.1.2 JCTVC-C035 TE7: Cross-verification of the 1D directional unified transform in TMuC [R. Cohen, A. Vetro, H. Sun (Mitsubishi)] This document reported the outcome of Mitsubishi Electric Research Laboratory’s (MERL) crossverification of Toshiba’s 1D Directional Unified Transform (1DDUT) in TMuC. This activity is part of Tool Experiment 7 on MDDT simplification. A brief analysis of the architecture and software was followed by a summary of experimental results. Differences (negligible, averaging 0.0%) in bit rates and PSNR values can be accounted for by the difference in simulation platforms. The conclusion is that the MERL cross-verification results match those produced by Toshiba. The software was studied and indicated to be straightforward and to reflect what is described in JCTVC-C080, although a bit different (simpler) than what was proposed in Geneva. 9.4.1.1.1.1.1.1.3 JCTVC-C103 TE7: Report of cross-checking results for Toshiba proposal [H. Yang, J. Zhou, H. Yu (Huawei)] This document reported cross checking results for Toshiba’s proposal in TE 7 activities. The decoded sequences matched the reconstructed sequences. The RD data provided by Toshiba were reportedly the same as the results obtained from the crosschecking (with only very minor differences likely due to compiler and platform issues). 9.5 Beijing Univ. Tech. proposed simplification of MDDT 9.5.1.1.1.1.1.1.1 JCTVC-C305 TE7: Results for Simplification of MDDT Transform [W. Ding, Y. Shi, B. Yin (Beijing Univ. Tech.)] (late) MDDT has been implemented as a direct matrix multiplication, which results in higher computation cost than that of an ordinary integer transform. This contribution proposed two symmetric transform matrices to simplify the MDDT matrices. The proposed symmetric transform matrices can be 154 implemented using a lifting structure. Experimental results reportedly demonstrate that this proposal reduced the run-time complexity of MDDT with approximately 0.5% coding performance loss. The operation count was asserted to be lower in this contribution than in the others, by enforcing symmetric transform matrices, which may be why the loss is a bit larger. The contribution was submitted late, and no cross check was available. 9.6 Peking Univ. proposed simplification of MDDT 9.6.1.1.1.1.1.1.1 JCTVC-C089 TE7: Results for mode-dependent residual reordering for intra prediction residual [X. Zhao, L. Zhang, S. Ma, W. Gao (Peking Univ.)] In this contribution, the Peking Univ. results in response to the Tool Experiment TE7 on MDDT simplification were presented, with the tool description and simulation results. To simplify the mode-dependent directional transform (MDDT), a mode-dependent residual reordering (MDRR) method was proposed. In the proposed MDRR, between the prediction and transform stages, a mode-dependent reordering is implemented on the residual samples for each mode in the spatial domain. After the reordering, a single transform matrix is applied. In all-intra HE configuration, a 0.3% average loss was reported. 9.6.1.1.1.1.1.1.2 JCTVC-C027 TE7: Verification results of Peking Univ. proposal (modedependent residual reordering for intra prediction residual) [J. Kim, B. Jeon (LG Electronics)] This document reported verification results of JCTVC-C089 on mode-dependent residual reordering for intra prediction residual, submitted by Peking University. The proponents provide their source code for cross-checking. For HE intra, the simulation succeeded on all platforms of Window7 32bit and WindowXP 64bit and reportedly produced the same results as what the proponent provided. For HE random access, the simulation succeeded and reportedly produced the same results as what the proponent provided when the same platform was used as what the proponent used. An accompanying spreadsheet shows different results for V-PSNR for BasketballDrive_1920x1080_QP22 but it was reported to seem to be caused by minor rounding issues. The results of HE random access coding on Windows 7 32bit and on WindowXP 64bit have differences for some sequences such as BQsquare_416x240. An encoder crash occurred for some sequences such as Racehorse_832x480_QP37 on Windows XP 64bit. The software was not studied – just compiled and run. 9.6.1.1.1.1.1.1.3 JCTVC-C081 TE7: Cross-verification of mode dependent residual reordering (Peking Univ.) [A. Tanizawa, J. Yamaguchi (Toshiba)] This contribution contained cross-checking of software containing the tool "Mode-dependent residual reordering (MDRR)" of JCTVC-B102 proposed by Peking University. The verification task was reported to have been completed successfully and the results reportedly matched those reported in JCTVC-C089. The encoded images matched those of the decoded ones, and the PSNR and bit rate values were reportedly identical to the results provided by Peking University except for one case. In the case of RaceHorses (QP=37), an encoder crash happened. However, the contributor suggested that this might be caused by a bug in the memory access in TMuC version 0.7. The software was not studied – just compiled and run. 9.6.1.1.1.1.1.1.4 JCTVC-C190 TE7: Cross-check result of Peking Univ.'s proposal (JCTVCC089) [J. Kim, S.-C. Lim, H. Y. Kim, H. Lee, J. S. Choi (ETRI)] This document presented verification results of JCTVC-C089 proposed by Peking University. Some encoder crashes were experienced. When the encoder did not crash, matching results were obtained. The cross-checker verified that no unrelated areas of the software that had been modified, but did not closely study exactly what the proposal part of the software was doing algorithmically. 155 10 TE8: Parallel entropy coding 10.1.1.1.1.1.1.1.1 JCTVC-C223 TE8 report [G. Martin-Cocher (RIM), M. Budagavi (TI)] This contribution was a summary of tool experiment 8, Parallel Entropy Coding. Five proposed tools had been evaluated using the conditions defined in document JCTVC-B308r1. One of the test conditions defined in JVCTV-B308 was an optional hardware test. The hardware cross-verifications had not been performed. In this TE, five tools had been evaluated according to the conditions defined in JCVTC-B308r1 and their performances had reportedly been verified. The TE participants, as a group, recommended that the following two tools be adopted into TM and TMuC: Coefficient Sign PCP (JCTVC-B088 Section 3.2) Coeff Level BinIdx 0 PCP (JCTVC-B088 Section 3.3) Decision: This was Agreed. They also recommended that a core/tool experiment be started on the following two tools for improving context processing of significance map, since interaction with HHI_TRANSFORM_CODING needs to be studied more carefully: Significance map PCP (JCTVC-B088 Section 3.4) Coding order for significance map on bin decoding throughput( JCTVC B036 Section 2) 10.2 Coefficient Sign PCP, Coeff Level BinIdx 0 PCP, and significance map PCP 10.2.1.1.1.1.1.1.1 JCTVC-C062 TE8: TI parallel context processing (PCP) proposal [M. Budagavi (TI)] Context-Adaptive Binary Arithmetic Coding (CABAC) is one of two entropy engines used by the AVC video coding standard. The processing in the CABAC engine is highly serial in nature. Consequently, in order to decode high bit rate video bit-streams in real-time, the CABAC engine needs to be run at extremely high frequencies which consumes a significant amount of power and in the worst case may not be feasible. Techniques to parallelize CABAC can be broadly classified into three categories: bin-level parallelism, syntax element-level parallelism, and slice-level parallelism. Bin-level parallelism techniques such as NBAC/PIPE/V2V parallelize binary arithmetic coder (BAC) of CABAC. However, due to serial bottlenecks in context processing, there is limited overall throughput improvement in the entropy coder. To address this issue, several techniques that parallelize context processing were advocated. The following three techniques for parallelization of context processing (PCP) were presented at the last JCTVC meeting in JCTVC-B088: Coefficient Sign PCP (JCTVC-B088 Section 3.2) Coeff Level BinIdx 0 PCP (JCTVC-B088 Section 3.3) Significance map PCP (JCTVC-B088 Section 3.4) The first two of these design elements were reported to have no impact on coding efficiency. Further study was suggested for the third topic. 10.2.1.1.1.1.1.1.2 JCTVC-C141 TE8: Crosscheck on TI's proposal of parallel context processing by MediaTek [Y.-L. Chang, Y.-W. Huang, S. Lei (MediaTek)] This document reported cross-check results for JCTVC-B088 on parallelization of context level processing submitted by Texas Instruments (TI). The first two tools listed above were checked. The software source code for the cross checking was obtained from TI, and was studied and it was concluded that it algorithmically matched the proposed technology design. The verification task 156 was reportedly completed successfully and the results reportedly matched those provided by TI exactly. 10.2.1.1.1.1.1.1.3 JCTVC-C245 TE8: Cross verification of TI-PCP proposal for significance map [J. Zan, J. Meng, M. T. Islam, D. He (RIM)] This contribution reported the results of cross-checking activities performed by RIM on TI parallel context model proposal on significance map (described in JCTVC-B088 Section 3.4). It was reported that the software was studied and determined to match the TI proposal, and the results of the experiments matched those reported by TI. 10.3 Coding order for significance map on bin decoding throughput (JCTVC B036 Section 2) and V2V coding tree (JCTVC-B034) 10.3.1.1.1.1.1.1.1 JCTVC-C249 TE8: Reports on V2V coding and context modeling by RIM [J. Zan, G. Korodi, J. Meng, M. T. Islam, D. He (RIM)] This contribution discussed the coding order for the significance map on bin decoding throughput (JCTVC B036 Section 2) and the V2V coding tree (JCTVC-B034) This contribution reported experiment results on V2V codes and modified context processing as described in JCTVC-B308 and proposed by the contributor. According to the plans established in JCTVC-B308, the following tools were tested: V2V codes proposed by RIM Improved context processing in HEVC Specifically, the following three tests were run. Bernoulli tests of RIM V2V codes against PIPE V2V codes Modified context processing in TMuC0.7 against the high efficiency anchors as defined in JCTVC-B300. RIM V2V codes in TMuC0.7 against high efficiency anchors as defined in JCTVC-B300. It was remarked that Huffman coding can be sped up by various techniques – when the VLC table is short, the codewords can be concatenated to create a larger table that requires fewer data fetches to process. Generally speaking, the RIM proposed code tables are larger than the HHI proposed code tables and they capture more source bits per codeword. It was noted that this causes extra latency/buffering and affects the overhead for flushing out the partially filled codewords when flushing is necessary. In terms of overall coding efficiency for coding video data, the HHI tables seem very slightly better for predictive coding cases (due to needing fewer bits for flushing at the end of the codestream) and the RIM table seem very slightly better for intra (due to better fitting to entropy due to using larger tables). Significant overlap with JCTVC-C134 was noted. In regard to the proposed modified context processing, the motivation is to increase how often the same context is used for the decoding of consecutive bins, so that multiple bins could be processed at once through the finite state machine. For the modified proposal, a 0.0% to 0.1% degradation of coding efficiency was reported with a 1.4x to 2.1x effect on the "throughput" for the part of the bitstream that is devoted to these symbols. (See the contribution and JCTVC-C063 for the definition of throughput.) 10.3.1.1.1.1.1.1.2 JCTVC-C134 Comments on V2V coding for TM/TMuC [D. He, G. Korodi, P. Imthurn, J. Jamias, D. O'Loughlin, G. Martin-Cocher (RIM)] This document discussed V2V coding. It was asserted that the design of a V2V tree directly impacts, not only its compression performance, but its throughput, in both software and hardware 157 implementations. The contributor recommended further evaluation of V2V codes by measuring their compression performance, encoding and decoding throughput, sizes, and power consumption in hardware. The contribution discussed the "Codeset 1" and "Codeset 2" code tables currently in the TMuC. In Bernoulli tests using the 12 probabilities used in the design of the V2V codes, CABAC and the two codesets were compared. It was remarked that this penalizes CABAC in the comparison, since CABAC was not customized for those specific probabilities. It was noted that the Bernoulli tests do not include the work needed to select which V2V code to apply in the V2V cases, which is another type of apparent bias against CABAC in the test. The "codeset 1" tables are generally somewhat smaller, and (perhaps as a result) have slightly less coding efficiency than the "codeset 2" tables (on the specific tested probabilities) – although both have very little excess entropy (0.15% versus 0.12%). It was remarked that some assumptions may be built into this analysis about how the VLC would be implemented that may not reflect the various ways that optimized encoder and decoders would perform. It was suggested that some measure such as average bits per look-up operation may be beneficial to collect in experiments. It was noted that some of the bit rates discussed in the contribution were very high. It was suggested that some aspects of power consumption may not be captured in the provided power analysis. It was remarked that area versus power, for example, has a different priority balance in different applications. The contributor asserted that using a code that has a small number of distinct codeword lengths is desirable. A participant remarked that the "codeset 2" tables have reduced coding efficiency (approx 0.5%) under some coding conditions in video test classes that result in the use of small slices, due to needing increased overhead bits for flushing the codewords at the end of the slice. 10.3.1.1.1.1.1.1.3 JCTVC-C063 TE8: Evaluation of RIM parallel context processing (PCP) proposal [V. Sze, M. Budagavi (TI)] The coding efficiency loss of RIM PCP was reported to have been evaluated to be between 0.0 to 0.1% using TMuC-0.7 with QC_MDDT disabled. The implemented RIM PCP includes reordering of significant_coeff_flag and last_significant_coeff_flag by context. The update skipping mentioned in the original proposal was not included in the code provided for evaluation; the context updates are still being done serially. The software reports RIM PCP throughput impact to be between 1.4 to 1.9 for significant_coeff_flag amd 1.5 to 2.1 for last_significant_coeff_flag. Test results were reported to have been verified to mostly match with results provided by RIM. The contributor indicated that using the HHI transform coefficient coding proposal may make it more difficult to take advantage of the throughput increase opportunity. The contributor indicated that they had studied the software algorithm and that it functioned according to the proposal. It was remarked that some of the computed throughput increase might be difficult to achieve in practice. Further study of the context modeling aspect was encouraged. 10.3.1.1.1.1.1.1.4 JCTVC-C064 TE8: Evaluation of RIM-V2V entropy coding [V. Sze, M. Budagavi (TI)] This document reported on the Bernoulli test used to compare the coding efficiency, encoding and decoding throughput of various proposed entropy coding approaches – namely RIM-V2V, HHIPIPE and BAC. It also provided additional complexity comparison of the RIM-V2V tables versus the HHI-PIPE tables. To measure the coding efficiency, Bernoulli sequences of one billion samples were generated for 12 distinct probabilities. Each of the entropy coding approaches was used to 158 encode the same bin sequence, and the total encoded bits were measured. While the RIM and HHI proposed tables were designed with the specific 12 probabilities in mind, the BAC used existing states, nearest to the 12 probabilities (i.e. BAC was not customized for the probabilities). Furthermore, the tests were not performed in the TMuC context with actual video data. Thus, it was reported to be difficult to draw a conclusion on coding efficiency. The encoding and decoding throughput were estimated based on simulation time. However, since software code for each approach was written in a very different style/structure, it was reported that no conclusion can be drawn on throughput based on the proposed use of simulation time. Further investigation was recommended. Test results given by the Bernoulli test software had been verified to match with results provided by RIM. The TMuC software has 24 tables from RIM and 12 tables from HHI, and the TMuC document has the 12 tables from HHI, and the 12 tables now proposed by RIM were first documented in a TE plan document. Similar measurements from the Bernoulli testing were reported in this contribution as what was reported by RIM. The tested software was reported to have had differences in coding style and degree of optimization. The contributor indicated that it was difficult to determine whether this affected the analysis results. The contribution indicated that the extra buffering needed by PIPE (relative to CABAC) was an issue. Area estimates for PIPE were 15x higher than for CABAC. Further study of the various issues was suggested. 10.3.1.1.1.1.1.1.5 JCTVC-C280 TE8: Crosscheck result of the transcoder for JCTVC-B034 source selection for V2V entropy coding in HEVC [Y.-L. Chang, Y.-W. Huang, S. Lei (MediaTek)] This document reported cross-check activity for the transcoder part of JCTVC-B034 for V2V entropy coding as proposed by Research In Motion (RIM). The verification task was reported to have been done successfully and the results reportedly matched those provided by RIM. There were some small variations (up to -0.12%) reported in random access conditions. The source code provided by RIM was studied to verify the proposed algorithmic functionality. 10.3.1.1.1.1.1.1.6 JCTVC-C313 TE8: Crosscheck Result of the Transcoder of JCTVC-B034 for V2V Entropy Coding in HEVC [Y. Zheng , R. Joshi , M. Coban , M. Karczewicz (Qualcomm)] (late reg.) The purpose of this late information document was to crosscheck the transcoder implementation of JCTVC-B034 on source selection for V2V entropy coding previoiusly submitted by Research In Motion (RIM). The verification task had reportedly been completed successfully and the results were reported to closely match those provided by RIM. 11 TE9: Large block structures 11.1.1.1.1.1.1.1.1 JCTVC-C067 TE9: Report on large block structure testing [J. Kim, M. Kim (KAIST), J. Kim, H.-Y. Kim (ETRI), K. Sato (SONY), X. Shen, L. Yu (Zhejiang Univ.), K. Choi, E. S. Jang (Hanyang Univ.), B. Bross (HHI), W.-J. Han (Samsung), J.-K. Jo, S.-N. Park, D. G. Sim, S.-J. Oh (Kwangwoon Univ.)] Summary, remarks, and observations: Confirms that restrictions CU 64 and TU 32 are reasonable (for current test set) Extensive statistic analysis on usage of various block sizes vs. QP/rates 11.1.1.1.1.1.1.1.2 JCTVC-C140 TE9-2: Report on performance tests for different sets of PU modes by Fraunhofer HHI [B. Bross (Fraunhofer HHI)] 159 Summary, remarks, and observations: Relates to JCTVC-C200 – TUs spanning PU boundaries has a little advantage Using merge syntax for rectangular shapes does not produce loss 11.1.1.1.1.1.1.1.3 JCTVC-C198 TE9: Simulation results for various max. number of transform quadtree depth [J. Chen, T. Lee, W.-J. Han (Samsung)] (The version of JCTVC-C198 prior to Saturday had no results in it.) Summary, remarks, and observations: Advantage of RQT in intra comes by allowing more prediction modes New suggestion is to augment quadtreetulog2maxsize & -minsize by parameter tumaxdepth – none of these parameters is a syntax element to be conveyed to the decoder – would be fixed in the standard (or in a profile) Advantage of the new restriction is saving of quadtree signaling bits Complexity in intra encoding is decreased to 75% compared to RQT but still increased to approx. 180% compared to the less flexible 2-level method (RQT off in TMuC) vs. BR gain 1.2% Decoder complexity is only marginally affected Encoders could decide to restrict the number of levels (with penalty on performance) and run faster New version of the document that would contain information about using various numbers of levels was not available yet when the document was first discussed Reported in revisit: o Results relative to RQT on (which is about 1.2% BR decrease and 250% encoder time) o Depth 3 HE +0.1% , 75% encoder time; depth 2 +0.3, 60% encoder time o Depth 3 LC +0.3%, 82% encoder time, depth 2 0.3, 58% encoder time Refer to JCTVC-C311/JCTVC-C312 which implements a fast encoder 11.1.1.1.1.1.1.1.4 JCTVC-C284 TE9-2.1 Report on forced RQT split according to PUs [A. Segall (Sharp), J. Xu (Microsoft)] (missing prior, provided by third meeting day) This contribution confirmed the results of JCTVC-C198 (without an exact match due to different platform usage). 12 TE10: In-loop filtering 12.1.1.1.1.1.1.1.1 JCTVC-C083 TE10: Summary of TE10 on in-loop filtering [K. Chono, T. Yamakage (TE coordinators)] In this tool experiment, in-loop filtering had been tested for the following functionalities: Deblocking/debanding filters (four proposals) Wiener-based in-loop filters (four proposals) Image clipping and offset (two proposals) As for the details of each proposal, please refer the documents described below. 160 Seven companies and one university participated as proponents and three companies and one university participated as cross checkers. Three post filters were investigated. The best of these (in terms of RD perf.) was selected. The difference of the anchor against post filter from JCTVC-C113 was reported as 0.9/0.3 on average. 12.1.1.1.1.1.1.1.2 JCTVC-C130 TE10 subtest 1: Results of intra deblocking filter testing by SKKU/SKT [J. Yang, K. Won, B. Jeon (SKKU), J. Lim, J. Song (SKT)] This document reported TE 10 test results of intra deblocking filter (DF) which was proposed in JCTVC-B075. The proposed method employs exactly the same DF scheme as that in the TMuC, except filter strength control for intra blocks. It is implemented on TMuC 0.7 S/W, and its experimental results under the TE10 subtest 1 procedure reportedly show BDBR gain of 0.6% (intra only), 0.3% (random access), and 0.1% (low delay). Encoding time is reportedly increased less than 1% and decoding time is reportedly the same as the TMuC0.7. The proposed method reportedly provides improved visual quality. It was remarked that this proposed modification compromises preservation of edges vs. removal of artifacts, and it is difficult to say whether one or the other is better. 12.1.1.1.1.1.1.1.3 JCTVC-C161 TE10: Cross-verification result of SKKU/SKT deblocking filter [K. Chono, K. Senzaki, H. Aoki, J. Tajime, Y. Senda (NEC)] Summary, remarks, and observations: The cross-check found no problem with the software, results were reported to be accurate. Visual inspection did not unveil relevant improvements or degradations. Method requires to store the directional mode (slight additional complexity). 12.1.1.1.1.1.1.1.4 JCTVC-C142 TE10 subtest 1: Improved deblocking filter [J. An, K. Zhang, Y. Gao, X. Guo, C.-M. Fu, Y.-W. Huang, S. Lei (MediaTek)] This contribution describes MediaTek’s proposal of a modified deblocking filter based on the AVC deblocking filter (DF). The modified deblocking was asserted to improve intra block boundaries by using intra mode dependent deblocking filter (MDDF) and modified boundary strength and thresholds (MBST). The modified deblocking filter reportedly outperforms the TMuC0.7 anchor in the objective measure RD sense in all six test configurations. Bit rate reductions were reported as 1.6%, 1.2%, 1.3%, 1.4%, 0.6%, and 0.5% for intra, random access, low delay, intra LC, random access LC, and low delay LC, respectively. Though the visual differences between the IDF and TMuC0.7 anchor are small, some improvements could reportedly be observed, especially in the image areas with obvious texture directions. The encoding time reportedly decreases by 1.8%, and the decoding time increases by 6%, compared to TMuC0.7. It was remarked that the report about computation time may not be completely reliable; 3% would mean doubling the complexity of deblocking (according to JCTVC-C147). See also cross-check below. Further work was encouraged, but it was remarked that complexity should not increase significantly. 12.1.1.1.1.1.1.1.5 JCTVC-C156 TE10: Cross verification of Mediatek's deblocking filter by Ericsson [A. Norkin, R. Sjöberg, K. Andersson (Ericsson)] This contribution reported cross-check activity summarized as follows: The PSNR and BD rate match The source code does not exactly match the description of JCTVC-B077 The decoder runtime is approx. 7.5x that of current TMuC deblocking 161 Directional filtering seemed to help on the RaceHorses sequence, but it is not as clear whether it is better than the TMuC deblocking on other sequences 12.1.1.1.1.1.1.1.6 JCTVC-C273 TE10 subset 1: Report of content-adaptive de-blocking [Z. Xiong, X. Sun, J. Xu (Microsoft)] This document presented a content-adaptive deblocking scheme to improve the visual quality of block-based compressed video. For large smooth regions with small variation, an extra smoothing deblocking mode was introduced to suppress the blocking artifacts. Experimental results reportedly demonstrate that the proposed method can improve the visual quality while maintaining the objective fidelity of heavily compressed video. The following discussion remarks were recorded: Does not affect complexity compared to current deblocking From subjective viewing, flickering was observed (in the all-intra case) which may be caused by the stronger filtering; no significant difference overall. 12.1.1.1.1.1.1.1.7 JCTVC-C131 TE10 subtest 1: Cross-verification result of Microsoft deblocking filter proposal by SKKU/SKT [J. Yang, K. Won, B. Jeon (SKKU), J. Lim, J. Song (SKT)] This contribution confirmed PSNR and bit rate results of JCTVC-C273, reporting essentially the same encoding and decoding time. 12.1.1.1.1.1.1.1.8 JCTVC-C091 TE10: Conditional joint deblocking-debanding filter [K. Chono, K. Senzaki, H. Aoki, J. Tajime, Y. Senda (NEC)] This contribution presented a performance report on a conditional joint deblocking-debanding filter described in JCTVC-B056. The conditional joint deblocking-debanding filter injects small pseudo noise into images around an intra-block boundary which is supposed to be a part of areas of low detail with subtle changes in pixel intensity. The injected small pseudo noise masks banding noise around intra-block boundaries while keeping the rate-distortion performance with the help of the subsequent Wiener filter. Furthermore, when IBDI is used, it also introduces a random quantization effect on the IBDI output image. The conditional joint deblocking-debanding filter was asserted to significantly improve visual quality of decoded video, especially when IBDI is used. Simulation results reportedly verify that the conditional joint deblocking-debanding filter reduces blocking and banding noise with a negligible impact on the video coding efficiency of the TMuC. It was proposed that the conditional joint deblocking-debanding filter be adopted in the TMuC software and that its performance be further evaluated with toolsets specified in HEVC Test Model. The following discussion remarks were recorded: For low-complexity, no Wiener filter is operated; therefore PSNR decreases, affecting BD rate increase by 1.2% in all-intra (less in inter) Does the deterministic pseudo noise introduce visible temporal artifacts? Could it produce artifacts like e.g. mosquito noise in certain test sequences? Is it not possible to do the dithering as postfiltering? Should it be better after the Wiener filter? Various solutions should be studied. This would have quite significant implication on the amount of standard text (e.g. would be necessary to describe tables of pseudo noise, exactly describe the rules of insertion, etc.) Further experimentation was recommended in particular on a) the best position in the loop b) possible implications on producing artifacts, and c) alternative as post processing. 162 12.1.1.1.1.1.1.1.9 JCTVC-C274 TE10 subset 1: Cross- verification of NEC's joint deblockingdebanding filter by Microsoft [X. Xiong, J. Xu (Microsoft)] This cross-check report confirmed the RD values of JCTVC-C091, reporting similar encoding/decoding times. No subjective tests were reported in the contribution. 12.1.1.1.1.1.1.1.10 JCTVC-C143 TE10 subtest 2: Coding unit synchronous picture quadtreebased adaptive loop filter (QALF) [C.-Y. Tsai, C.-M. Fu, C.-Y. Chen, Y.-W. Huang, S. Lei (MediaTek)] This contribution described MediaTek’s work on adaptive loop filter (ALF), namely coding unit synchronous picture quadtree-based adaptive loop filter (CS-PQALF). The proposed method uses multi-level quadtree partitions to allow local adaptivity for Wiener loop filtering. The partitions boundaries are aligned with the boundaries of the largest coding units (LCUs). Each partition is a basic filter unit (FU) and can be enhanced by different Wiener filters to reduce mean square error between the reconstructed picture and the original picture. In this proposal, the number of encoding passes is reduced from 16, for the ALF in TMuC, to 2, for the CS-PQALF, while reportedly maintaining the good coding efficiency performance of the anchor. Simulation results were compared with the anchor adopted in the subtest 2 of tool experiment 10 (TE 10). The proposed CSPQALF increases 0.1% and 0.3% BD-rates for high efficiency random access (HE-RA) configuration and high efficiency low delay (HE-LD) configuration, respectively. The encoder execution times are increased by 1% for both HE-RA and HE-LD. The decoder execution times are increased by 3% and 1% for HE-RA and HE-LD, respectively. Note that the software execution time cannot really reflect the reduction of hardware external memory access overhead. Additional experiments on combining four techniques including controlled clipping (JCTVC-C146), improved deblocking filter (JCTVC-C142), quadtree-based adaptive offset (JCTVC-147), and CS-PQALF sreportedly how 2.2% and 2.5% BD-rate reductions for HE-RA and HE-LD, respectively. The main advantage claimed is reduction of number of coding passes; run time does not really reflect that (encoding/decoding time very slightly increased), memory bandwidth is the more critical issue than computation power. Adaptive filter sizes include square 3x3 ... 9x9, rhombus 5x5 ... 9x9. In terms of performance (subjective & objective), a negligible difference was reported compared to TMuC anchors. Similar reductions appear possible with other methods (see e.g. JCTVC-C082 and JCTVC-C113) More coefficients must be stored & encoded. It was asked where would be the best place to encode the filter on/off flag: slice or CU level. Adaptation of filters was controlled by a quadtree transmitted in slice header (at most 16 partitions per frame), boundary aligned with CU quadtree. 12.1.1.1.1.1.1.1.11 JCTVC-C229 TE10 subset 2: Cross check result of MediaTek ALF [I. S. Chong, M. Karczewicz (Qualcomm)] This contribution reportedly confirmed the results of JCTVC-C143. 12.1.1.1.1.1.1.1.12 JCTVC-C173 TE10 subtest 2: Parallel adaptive loop filter [T. Ikai, T. Yamamoto (SHARP)] In this contribution, the Parallel adaptive loop filter technique from JCTVC-B064, which was implemented into TMuC0.7, was evaluated according to the TE10 common condition. The proposed technique uses both pre-DF (De-blocking filter) signal and post-DF signal as inputs, where both inputs’ weights are optimized with Wiener-filter technique. So the whole system can be seen as one in-loop filter where De-blocking filter and Wiener-based filter are combined. This combination provides coding efficiency improvement as well as the functionality to process the two filters in parallel. The experimental results reportedly showed that the proposed technique provides 0.7% / 0.4% / 0.4% bit rate reduction (IntraOnly / RandamAccess / LowDelay) compared to the anchor, where Sum-modified Laplacian based ALF is used. The complexity of the proposed method 163 was also evaluated and the results reportedly showed that the decoding time is 93% to 96% of the anchor and the encoding time is almost the same as the anchor (100%). The following remarks and observations were recorded: Method: Wiener filter applied to deblock filter input, weighted superposition of de-blocking and loop filter output. Filter on/off signaled at CU level, filter parameters (incl. weighting params) signaled at slice header. Uses 5x5, 7x7 and 9x9 rhombic filters. Loop filter itself basically identical to TMuC. Also uses 16 passes. Slight bit rate decrease compared to anchors. Encoding time similar to anchor, decoding time slightly less. Avoids pixel-wise operation. Information: It was tried to apply filtering to de-blocking output as well, but that did not result in significant gain. Subjectively, there seemed to be a tendency to look sharper, but sometimes this could reproduce more coding artifacts. 12.1.1.1.1.1.1.1.13 JCTVC-C145 TE10 subtest 2: Crosscheck on SHARP's proposal of adaptive loop filter by MediaTek [C.-Y. Tsai, Y.-W. Huang, S. Lei (MediaTek)] This cross-check report confirmed the reported reduction of JCTVC-C173 in BD rates and enc/dec time measurements. 12.1.1.1.1.1.1.1.14 JCTVC-C082 TE10 subtest 2: Reduction of number of encoding passes for quadtree adaptive loop filter (QALF) [T. Yamakage, T. Chujoh, T. Watanabe (Toshiba)] This contribution described an encoding technique to reduce the number of encoder passes for Wiener-based filter design, and detailed experimental results of Quadtree-based Adaptive Loop Filter (QALF) were reported. This is one of the proposals for Subtest 2 (Wiener-based in-loop filters) in Tool Experiment 10 (Loop filtering). The number of additional encoder passes compared to no adaptive loop filter encoding is two by this technique, which is drastically reduced from QC_ALF adopted in TMuC v0.7, while the loss of the coding efficiency from the TMuC 0.7.1 anchor is 0.5%. A supplemental information to show the coding efficiency loss (0.1%) is also provided that compares multi-pass QALF and the proposed 2-pass QALF. Note that the additional passes are not encoding a picture, but filtering a picture that requires less complexity compared to encoding a picture. This technique is also applicable to the methods that adopt block-based filtering control. The following remarks and observations were recorded: Argument for loop filter vs. post filter: Assurance of the (minimum) quality of the output is desired by industry. A slight decrease in decoding time may be caused by the less frequent usage of the loop filter. There was discussion about relation between loop filtering and interpolation filtering. Obviously the best choice of interpolation filter depends on the loop filter used. However the design of the filter coefficients is not normative. The contribution shows that it is possible to reduce the number of encoder passes in ALF coefficient derivation without a large penalty in coding efficiency 164 12.1.1.1.1.1.1.1.15 JCTVC-C144 TE10 subtest 2: Crosscheck on TOSHIBA's proposal of adaptive loop filter by MediaTek [C.-M. Fu, Y.-W. Huang, S. Lei (MediaTek)] This cross-check report confirmed BD rate values of JCTVC-C082, but encoding/decoding runtime measured may be unreliable. 12.1.1.1.1.1.1.1.16 JCTVC-C194 TE10 subtest 2: Cross-check results of JCTVC-C082 [Toshiba] Reduction of number of encoding passes for quadtree-based adaptive loop filter (QALF) [P. Wu, S. Paschalakis, N. Sprljan (Mitsubishi Electric)] This cross-check report confirmed BD rate values within a small margin (using a different platform), with encoding/decoding runtime slightly different. 12.1.1.1.1.1.1.1.17 JCTVC-C071 TE10 subset 2: Complexity analysis on Wiener-based in-loop filters [L. Wang, L. Yu (Zhejiang Univ.)] In this contribution, the complexity of Wiener-based adaptive in-loop filter (ALF) algorithms (A121, B064, B077, A117) in the subset2 of TE10 was analyzed along with their coding efficiencies. The contribution analyzeed three aspects: i) filter features, ii) encoding time, and iii) decoding time. The following remarks and observations were recorded: Only a few frames of the sequences were used to determine encoding and decoding time, so the numbers may not be reliable. New versions of software which are faster exist in most cases. It is not clear which versions of software were used (not distributed by the CE coordinator). 12.1.1.1.1.1.1.1.18 Conclusions on TE10 subtest 1&2: The following overall remarks and observations were recorded: Alternative de-blocking proposals do not seem especially interesting at the moment (either no real difference, or too complex) Comfort noise insertion for de-banding could be interesting, but needs further study. Neither subjective evaluation nor objective measures unveiled substantial differences between the various in-loop filters. Reduction of number of passes in optimization in ALF appears possible. There is a clear difference between no filtering and filtering (roughly 5% BR gain, with a subjectively noticeable benefit) Post filtering suffers from temporal discontinuity. As the loop filter has an influence on other elements of the design, it is important to have one in the TM, but it seems to be less important which one this is. Further experimentation on improvements should go on anyway. Regarding the 3-input approach investigated in TE 12 which is another option, subjective viewing was performed. Otherwise, the level of encoder runtime increase of 25% currently seems not to be justified vs. 1% additional BR reduction. After some (informal) subjective viewing, a report was given Monday evening: For one case (Park Scene QP37) a majority of test persons indicated that they had observed slight improvement (though there was 2% bit rate reduction by the 3-input method compared to anchor ALF) 165 In general, those visual tests may have the problem that they compare sequences encoded at (slightly) different bit rates. This puts a slight disadvantage on methods that have a tendency decrease the bit rate. 12.1.1.1.1.1.1.1.19 JCTVC-C146 TE10 subtest 3: Controlled clipping [Y.-L. Chang, C.-M. Fu (MediaTek), A. Segall, Y. Su (Sharp), C.-Y. Chen, Y.-W. Huang, S. Lei (MediaTek)] This proposal reported MediaTek and Sharp joint work on controlled clipping. Controlled clipping describes a process that clips predicted or reconstructed pixel values to minimum and maximum values that are signaled in the bitstream. The clipping process is applied at four stages, namely postprediction, post-reconstruction, post-deblocking, and post-adaptive loop filter (post-ALF). Results reportedly show that the proposed controlled clipping achieves average 0.6% and 0.4% BD-rate reductions for high efficiency random access (HE-RA) and high efficiency low delay (HE-LD) configurations, respectively. The encoding time measures were reportedly increased by 2% and 0% for HE-RA and HE-LD, respectively, and the decoder times were reportedly increased by 6% and 2%, respectively. The following remarks and observations were recorded: Clipping parameters are sent in picture level and/or slice level. Comparison against post clipping was intended, but no gain was found compared to that. In previous implementation (not joined proposal of two companies), gain was found. Would there be a guarantee that clipping at various places in the loop is better than postclipping? Probably not. Further study was encouraged. 12.1.1.1.1.1.1.1.20 JCTVC-C147 TE10 subtest 3: Quadtree-based adaptive offset [C.-Ming Fu, C.-Y. Chen, Y.-W. Huang, S. Lei (MediaTek)] This contribution described a MediaTek proposal of quadtree-based adaptive offset (QAO). QAO consists of two parts, quadtree partitioning and offset compensation. The former splits a picture into multi-level quadtree partitions, and each partition is compensated by one offset compensation method. The latter reduces errors between reconstructed pixels and original samples of a current picture by using four kinds of offset compensation methods, including uniform band offset, nonuniform band offset, cross pattern edge offset, and diagonal cross pattern edge offset. Simulation results reportedly show that in comparison with the TE10 anchor, which enables adaptive loop filter (ALF) and many high efficiency tools of TMuC0.7 as required in JCTVC-B300, this proposal can achieve 1.2% and 2.0% bit rate reductions for high efficiency random access (HE-RA) configurations and high efficiency low delay (HE-LD) configurations, respectively. The encoding time was reportedly increased by 2% and 1% for HE-RA and HE-LD, respectively, and the decoder time was reportedly increased by 9% and 9%, respectively. The following remarks and observations were recorded: Each quadtree partition can switch the mode: Wiener, one of the four offset methods (similar to original Samsung proposal A125)Subjective viewing did not unveil differences Complexity increase would be too high to be justified compared to the small gain Further study was encouraged. 12.1.1.1.1.1.1.1.21 JCTVC-C180 TE10.3: Cross check of MediaTek's proposal on quadtreebased adaptive offset [A. Segall (Sharp)] (missing prior, available first day) This contribution reported cross-check activity for the MediaTek proposal on quadtree-based adaptive offset. This cross-check was performed within the context of TE10. 166 MediaTek sent software, and the experiments were run, and closely matching PSNR results were reported. No visual inspection was done. 13 TE11: Motion vector coding 13.1.1.1.1.1.1.1.1 JCTVC-C293 TE11: Summary report for TE11 on motion vector coding [J. Jung (Orange Labs)] This document summarized the activities in Tool Experiment TE11 on motion vector coding. A total of 9 companies or universities had registered to the TE: Sony, Sungkyunkwan University, Samsung, NTT DOCOMO, LGE, HHI, Sharps Labs of America, ETRI / Kyung Hee University and Orange Labs. A few emails were exchanged mainly to coordinate the cross-check activity and exchange the modified versions of the TMuC software. Also it can be noticed that for some topics, either the contribution was cancelled by the proponent (lack of time running the experiment, or insufficient results), or the cross-check was not fully completed (all configurations, or sequences). Several experiments on motion vector coding had been performed for TE11 with various configurations of the software. For the Inter mode, it was reported that the MV Competition scheme provides significant gains. For the Skip, Direct, Merge modes, both Skip + Direct (with MV Competition) and Merge were reported to bring significant gains. The following options were reported to seem possible for the TM: Keep Merge only at CU and PU level Keep Skip + Direct only at CU level Keep Merge at PU level, Skip and Direct at CU level When MV Competition is concerned (for either Inter, Skip and Direct modes) it was reported that: Having a temporal predictor brings gains Having the ability to signal the set of predictors (including the ordering of the predictors) brings gains At this stage, experiments combining IMVP and MV Competition were reported to have failed to achieve noticeable improvements. 13.1.1.1.1.1.1.1.2 JCTVC-C289 TE11: Report on experiment 3.1.a: Disabling MV competition [J. Jung, G. Clare, S. Pateux (Orange Labs)] (missing IPR statement prior) The following remarks and observations were recorded: Removing the MV competition on Inter, and Skip/Direct modes reportedly provides an average coding loss of 2.6% to 4.1%. (by tendency higher for RA and LC). Decoder runtime reportedly increases by MV competition (not measured on the full set, only QVGA class D) 0.1-2.4%; with the highest increase for the lowest bit rate. Conclusion: Keep MV competition for inter and skip, considering its compression gain. 13.1.1.1.1.1.1.1.3 JCTVC-C132 TE11: Cross-verification result of Orange Labs motion vector competition proposal (3.1a) by SKKU [J. Yang, K. Won, B. Jeon (SKKU)] This cross-check report confirmed the results of JCTVC-C289. 13.1.1.1.1.1.1.1.4 JCTVC-C290 TE11: Report on experiment 3.2.b: MV-Competition on Inter, Skip and Direct modes [J. Jung, G. Clare, S. Pateux (Orange Labs)] (missing IPR statement prior) 167 The TMuC reference applies MV-Competition for Inter modes, and Merge. Skip and Direct modes are disabled. This experiment evaluated a configuration with MV-Competition for Inter, Skip and Direct modes, with Merge disabled. The document reported that the scheme with MV-Competition enabled on Inter, Skip and Direct mode provides on average better results than the current TMuC reference. Average BR gain was reported as 1.5%, gain on LC was higher – around 2.5-3%. 13.1.1.1.1.1.1.1.5 JCTVC-C043 TE11: Simulation results of merge/skip (3.2b and 3.2d) [Y. Suzuki, F. Bossen (NTT DoCoMo)] The following remarks and observations were recorded: - Confirmed the results of JCTVC-C290 for the HE case (LC not done). - Reported increased encoder run time by 10%, with a marginal decoder time impact. - Experiment 3.2d : Switching off skip, direct, CU merge increases BR by roughly 5%, no impact on encoding/decoding time (only done for HE case). 13.1.1.1.1.1.1.1.6 JCTVC-C292 TE11: Verification of experiment 3.2.d by NTT DOCOMO [J. Jung, G. Clare, S. Pateux (Orange Labs)] (missing prior, available first day) The following remarks and observations were recorded: - Confirmed the results of JCTVC-C043 3.2d HE cases. - Additional results on LC were: RA: Skip 8% / Merge 6%, LD: Skip 10% / Merge 6% - For LC, no results on encoder/decoder run times were provided. 13.1.1.1.1.1.1.1.7 JCTVC-C199 TE11: Report on experiment 3.2.c: Check skip and merge together [I.-K. Kim , T. Lee , W.-J. Han (Samsung)] In this document, the coding efficiency of a block merging process (MRG) and Skip/Direct mode of TMuC software was presented. The bit rate impact of Skip/Direct mode was reported as -0.7% (random access), -2.6% (random access low complexity), 0.1% (low delay) and -2.9% (low delay low complexity) compared to MRG which is turned on by default in the TMuC. This contribution also presented a combination of MRG and Skip/Direct where Skip/Direct was enabled on top of MRG. The reported bit rate impact was -1.2% (random access condition), -2.4% (random access low complexity condition), -0.5% (low delay condition) and -3.3% (low delay low complexity condition). The following remarks and observations were recorded: - Compared to configuration skip+direct on CU merge off, 0.5% BR gain on average if both enabled. No significant increase in encoder run time. - Skip+Direct vs. CU based merging seems not always to be identical in effect (as was assumed before). 13.1.1.1.1.1.1.1.8 JCTVC-C191 TE11: Cross-check result of merge/skip (3.2c) S. Jeong, J. Lee, H. Y. Kim, S.-C. Lim (ETRI), K. Kim, H. Lee, G. Park (KHU) This cross-check report confirmed the results of JCTVC-C199. The decoder run time was only marginally increased, encoder run time increased around 10+%. It was remarked that the high gains reported in the LC case may be caused by deficiencies of LCEC vs. PIPE, and some of this may be resolved in "LCEC phase II". 13.1.1.1.1.1.1.1.9 JCTVC-C166 TE11: Study on motion vector coding (experiment 3.3.a & 3.3.c) [K. Sato (Sony)] This contribution reported results of two experiments as follows: 168 Experiment -1: Comparison of with & without Temporal Predictor Experiment -2: Code Number Swapping of Motion Predictors The following remarks and observations were recorded: Experiments were performed only for 50 frames. Average numbers were missing; for experiment 2 the gain seems to be around 0 overall. 13.1.1.1.1.1.1.1.10 JCTVC-C116 TE11: Cross verification of Sony proposal on motion vector coding [Y.-J. Jeon, B.-M. Jeon (LG Electronics)] This cross verification contribution confirmed the results of JCTVC-C166 on experiment 2 (with a reported gain of approximately 0); no cross-check was provided on experiment 1 in the contribution. 13.1.1.1.1.1.1.1.11 JCTVC-C291 TE11: Report on experiment 3.3.b: "temporally oriented" set of predictors for MV-Competition [J. Jung, G. Clare, S. Pateux (Orange Labs)] (missing IPR statement prior) This contribution reported that by changing the order of predictors (temporal first), additional gain is possible – in particular for static background scenes. The contribution included a suggestion to signal the set of predictors in MV competition (potentially different for skip as well) at the slice level. No results on that were available yet. It was reported that this would come with practically no cost at the decoder, but the encoder would need to do some extra optimization work. Further work was encouraged 13.1.1.1.1.1.1.1.12 JCTVC-C308 Cross verification of MV Coding Proposal by JCTVC-C293 [K. Sato (Sony)] (late reg.) (verbally presented in TE11, uploaded after the meeting ended) This cross-check contribution was registered very late. When brought up for discussion, it had not yet been uploaded and the contributor verbally reported that the work was not yet complete, but that he results achieved so far had confirmed JCTVC-C291. The contribution was eventually uploaded significantly after the meeting had ended. 14 TE12 14.1 TE12: Overall TE12 summary 14.1.1.1.1.1.1.1.1 JCTVC-C225 TE12: Summary of evaluation of TMuC tools in TE12 [K. McCann (Samsung/Zetacast)] (missing prior, uploaded first day) This contribution was a summary of tool experiment 12 (TE12) on exploring the performance of individual tools in the Test Model under Consideration (TMuC). Detailed results were reported by the TE12 participants in individual input documents. The work of TE12 was very substantial, and it was a very important effort. The incremental contribution of each individual tool in the TMuC is highly dependent on the settings of the other tools in use; in effect, it is a multi-dimensional non-linear system. The starting points used in TE12 were the default configurations of the TMuC specified in JCTVC-B300, since these represent the current best estimate of realistic use cases. Each tool in the TMuC was tested individually as follows. Some tools are on/off in nature (e.g. ROT). This type of tool was tested by being turned on (if off in the default configuration) or off (if on in the default configuration) unless it has a direct alternative (see below). 169 Some tools have direct alternatives either within the TMuC itself or else added to the software for comparison purposes (e.g. PIPE and the LCEC entropy coding within the TMuC, with CABAC for comparison). This type of tool was tested by being replaced by each direct alternative in turn (e.g. PIPE was tested against both LCEC and CABAC). Some tools can have minimum or maximum limits defined (e.g. Large Transform block size). This type of tool was tested by having either the minimum or maximum limit made stricter as appropriate (e.g. maximum transform size being reduced from 64x64 to 32x32). Tools for which there is no alternative in the TMuC (e.g. inter transforms) were not tested. In most cases, only one tool at a time was exercised by giving it an alternative setting from that of the default configuration. However, some limited testing of combinations of tools was also performed, in order to improve understanding of the interaction between tools (e.g. between ROT and MDDT). All of the tools listed in the Tool Experiment 12 test plan were evaluated, with the exception of one encoder-only test on CABAC-based RDO on/off. In most cases the tests used TMuC version 0.7 and subsequent versions of the TMuC released by the Software AHG. However, three of the tests used company-specific branches: PU-Based Merging (tested with TMuC 0.8-hhi-bugfix), Enhanced CIP (tested with 0.7-BBC branch) and ALF default/3-input ALF (tested with 0.7-panasonic branch). Detailed results were reported by the TE12 participants in individual input documents. It was anticipated that the results of this work would be used to guide improvements in default operating points. In addition, the TE work provides insight on how the various tools contribute to the overall performance of the TMuC and hence helped inform the process of forming the first Test Model. However, it should be noted that the various tools in TMuC interact in a complex non-linear manner, so that careful interpretation of the results is required. A total of 70 documents directly relating to TE12 were registered. The categories were as follows: 1. Unit definitions 2. Motion representation 3. Intra-frame prediction 4. Transforms 5. Filtering 6. Entropy coding 7. Other decoder tools 8. Encoder-only tools Most subtests used TMuC 0.7, while some used 0.7.x (which would include some bug fixes that do not affect the default configuration). All of the 0.7.x versions should be able to be well compared to the distributed anchors. Some variants are noted below: PU merging testing used "0.8 HHI bug fix" software; Note that a special patch was used in Enhanced CIP on/off testing as discussed in JCTVCC213 with cross-check JCTVC-C297, which are not classified here as TE12. "0.7 Panasonic branch" was used for 3-input ALF tests (e.g., JCTVC-C214). LCEC and PBIC tests used 0.7.4. CABAC based RDO on/off encoder-only evaluation was not tested. 14.1.1.1.1.1.1.1.2 JCTVC-C320 TE12: Summary of results of evaluation of TMuC tools in TE12 [K. McCann (TE coordinator)] (late registration) 170 Due to the large effort and large number of contributions involved in TE12, the original summary report for the TE (JCTVC-C225) did not contain a summary of the results. Such a report of results was prepared during the meeting and was reported in this contribution. Although the effort involved in the preparation of this contribution was greatly appreciated, there was insufficient time to properly review the document at the meeting due to the late arrival of the document. 14.2 TE12 TMuC coding unit definitions 14.2.1.1.1.1.1.1.1 JCTVC-C029 TE12: Report on 64x64 versus 32x32 maximum coding unit size for low complexity configuration [P. Pandit (Harmonic)] This contribution presented results for the Max CU size experiment under TE12 for the low complexity configuration. Simulations reportedly showed that when using CU32 size, for Intra only case the average BD-Bit rate gain is about 0.2%, for Random access case the average BD-Bit rate loss is about 7% and for low delay case the average BD-Bit rate loss is about 8.8%. Some comments on the visual quality were also made. Cross verification of the results was reportedly done with Motorola. The tested cases included LC configuration for intra, low delay & random access Visual comparison was done for QP 37 14.2.1.1.1.1.1.1.2 JCTVC-C044 TE12: Results for experiments on max CU size, RDOQ and AIS [F. Bossen, T. K. Tan (NTT DoCoMo)] Other aspects of this contribution are discussed in another section below. This contribution reported the simulation results for Tool Experiment 12 as described in JCTVCB312r5. Three sets of experiments were simulated. The experiments are the comparison of the maximum coding unit (Max CU) size of 32 and 64, the comparison of rate distortion optimized quantization (RDOQ) on and off, and the disabling of the adaptive intra smoothing (AIS) with the default set to always off. The tested cases included HE configuration for intra, low-delay & random access Decoder run time was 2.5x longer without 64x64 (the reason being related to adaptive scanning in MDDT) With 32x32 and smaller only, there was marginal loss in intra-only, around 3% average for the inter cases Most of the gain was in classes B and E (as was the case in JCTVC-C029) 14.2.1.1.1.1.1.1.3 JCTVC-C259 TE12: Results for experiments on Max CU size [Y. Yu, K. Panusopone, L. Wang, S.-T. Hsiang, F. Ishtiaq (Motorola)] Test Experiment 12 was set up at the second JCT-VC meeting. TE12 aimed to explore the performance of individual tools in the Test Model under Consideration (TMuC). As part of TE12, the CU size was tested to decide the best maximum CU size between 32 vs 64. The simulation results were cross checked with NTT Docomo and Harmonic. From the simulation results, it was found that CU size 64 is consistently better than CU size 32. Therefore, the contributor recommended that a maximum CU size 64 should be used in the TMuC. The results mostly confirmed the findings of JCTVC-C029 and JCTVC-C044 (a different TMuC version was used, with small deviations in PSNR) It was suggested to possibly restrict the intra case to 32x32 (note: there were currently only 5 prediction directions used in intra 64x64). 171 14.3 TE12 TMuC prediction units 14.3.1.1.1.1.1.1.1 JCTVC-C052 TE12.1: Block merging (MRG) test (off vs. on) by Fraunhofer HHI [B. Bross (Fraunhofer HHI)] This document reported the results of testing the block merging mode for CUs (MRG) within the scope of tool experiment 12 evaluating TMuC Tools. For the test, MRG was disabled and tested against the MRG-enabled default. In a high efficiency scenario, average bit rate savings up to 0.7% were reported when disabling MRG. In a low complexity scenario, disabling MRG yields average bit rate savings up to 2.9%. MRG disabled encoder runtimes range from 106% to 127% and decoder runtimes from 96% to 116% of the default MRG enabled runtimes have been measured. Disabling CU based merging (reinvoking skip & direct) gives roughly 1% BR reduction overall. 14.3.1.1.1.1.1.1.2 JCTVC-C109 TE12.1: Results for PU-based merging by Fraunhofer HHI [B. Bross (Fraunhofer HHI)] This document reported the results of testing the block merging mode for PUs (MRG_PU) within the scope of tool experiment 12 evaluating TMuC Tools. For the test, MRG_PU was disabled and tested against the block merging mode for CUs (MRG) default. In a high efficiency scenario, average bit rate savings up to 1.9% were reported when disabling MRG. In a low complexity scenario, disabling MRG yielded average bit rate savings up to 2.4%. The collection of experiment results was not yet complete. A "Bug fix" done in TMuC 0.8 on the PU based merging that was there before but was switched off by default (wrong RD decision at the encoder – ticket #95 and coding of PU based merging ticket #97 avoiding some unnecessary signaling). The results reportedly showed that PU based merging gives additional gain for the cases tested so far. Implicitly, this replaces CU based merging. It was commented that proposals with similar goals are in TE11. 14.3.1.1.1.1.1.1.3 JCTVC-C118 TE12.1 Cross-check report on block merging (MRG) on/off [K. Sugimoto, A. Minezawa, S. Sekiguchi (Mitsubishi Electric)] This contribution confirmed the results of JCTVC-C052. Brief checking of parameter settings and software was performed. 14.3.1.1.1.1.1.1.4 JCTVC-C231 TE12.1: Results for PU based merging [Y. Suzuki, T.K. Tan, F. Bossen (NTT DoCoMo)] (missing prior, available first day) This contribution provided a cross check with more results than JCTVC-C109, with results reported as follows. For HE LD, 1.5% BR gain, 2% for HE RA. Also results were reported on case-by-case basis which show that using only skip/direct (as in JCTVC-C052) BQSquare, BQTerrace and Vidyo3 provide losses. 14.3.1.1.1.1.1.1.5 JCTVC-C264 TE12: Evaluation of block merging (MRG) [W.-J. Chien, P. Chen, M. Karczewicz (Qualcomm)] This contribution confirmed the results of JCTVC-C052. Brief checking of parameter settings and software was done. It was remark that from the results, CU based merging does not seem to make sense. However, investigation of software related to CU based merging may not have been performed to the detailed extent that could definitely exclude the presence of bugs that would improve the performance if fixed. 14.4 TE12 TMuC transform units 14.4.1.1.1.1.1.1.1 JCTVC-C056 TE12: Evaluation of transform unit (TU) size [M. Zhou, V. Sze (TI)] 172 This document reported the evaluation results of the Transform Unit (TU) size for TE12, and provided a brief complexity analysis of transforms. Experimental results reportedly show that the average coding loss caused by disabling the 64x64 transform is around 0.1% to 0.6%; disabling both 32x32 and 64x64 transforms lead to an average coding loss of 1.1% to 4%. The results also reportedly show that disabling the 4x4 transform leads to huge coding loss. When both the 4x4 and 64x64 transforms are disabled, the average coding loss is increased to 3.9% to 8.3%, and disabling 4x4, 32x32 and 64x64 transforms (i.e. only 8x8 and 16x16 transforms are enabled) can lead to quality degradation of 5.2% to 10.6% on average. Due to the high complexity and limited coding efficiency gain of the 64x64 transform, it was recommended that the 64x64 transform be removed from reference configurations. The contribution also recommended that the quality issues of disabling the 4x4 transform be investigated in the future. The following remarks and observations were recorded: There is relatively small loss when 64x64 transform is disabled (for LD LC case which is worst of all 0.6% on average over all classes, 1.3% average for class E LD LC). 32x32 disabling causes larger loss (1 to 3.4% depending on case). Disabling 4x4 causes larger loss compared to AVC. Almost 4x gate count when 64x64 needs to be implemented compared to 32x32. This could be dealt with as a profiling issue; the current test material seems not to benefit too much from using 64x64 (32x32 could be default in the TM, and further study of 64x64 could be performed in the future). 14.4.1.1.1.1.1.1.2 JCTVC-C105 TE12: Huawei report on max transform unit size evaluation [J. Zhou, D. Wang, H. Yu (Huawei)] This document summarized the Tool Experiment TE12 activities related to the evaluation of Max TU Size. This evaluation was performed by Huawei and TI. Confirms the results by TI, with a small difference due to the use of Windows vs. Linux. The contributor received the parameter files from TI and ran on newly compiled software. 14.4.1.1.1.1.1.1.3 JCTVC-C200 TE12.1: Experimental results of transform unit quadtree/2level test [T. Lee, J. Chen, W.-J. Han (Samsung)] This contribution presented the experimental results of TE12 Residual quadtree transform (RQT), which was to explore the performance of RQT in the Test Model under Consideration (TMuC). Experimental results of RQT off compared to TMuC showed 1.2%, 0.5% and 0.2% bit rate savings under intra only, random access and low delay conditions, respectively, in high efficiency settings. It shows 0.6%, -0.2% and 0.8% bit rate savings under the same conditions, respectively, in low complexity settings. This contribution also analyzed the performance and complexity of the current TMuC transform unit design by testing several variants of it: no transform unit across partition boundary and no transform unit splitting inside a partition. The following remarks and observations were recorded: Turning RQT off gives loss in luma PSNR but also has less encoder complexity in the intra case (where the BR increase is 1.2%), in inter cases there is a smaller BR increase but no complexity advantage by turning RQT off. Restricting TU to maximum of PU size (disallowing TU across PU partitions gives loss of around 1%. Restricting TU not becoming smaller than TU size gives loss as well. 173 Conclusion: RQT on would only make sense provided that that the suggestion below would compensate for the complexity. Intra prediction mode flag could also be used Gain of TU across PU partition boundaries may be due to more efficient CBP It could be that gain also comes mainly from asymmetric partitioning A new suggestion (related to TE9_C198) was to limit the number of TU quadtree levels by using signaling. JCTVC-C276 is also related as well. These issues are further discussed elsewhere in this report. 14.4.1.1.1.1.1.1.4 JCTVC-C110 TE12.4: Transform unit quadtree tested against 2-level by Fraunhofer HHI [B. Bross (Fraunhofer HHI)] This document reported the results of testing the transform unit quadtree (RQT) against the 2-level representation within the scope of tool experiment 12 evaluating TMuC Tools. The 2-level TU representation was tested against the RQT default. BD-rates from -0.2% up to 0.7% were reported. The encoder runtimes of the 2-level representation vary from 31% of the RQT encoder runtimes for the intra only coding constraint set to 102% of the RQT encoder runtimes for the random access coding constraint sets. This confirms the results of JCTVC-C200. 14.5 TE12 Interpolation filters 14.5.1.1.1.1.1.1.1 JCTVC-C050 TE12.2: Interpolation filter SIFO and DIF tested against MOMS by Fraunhofer HHI [B. Bross (Fraunhofer HHI)] This document reported the results of testing the 12-tap SIFO and 6-tap DIF interpolation filters against 6-tap MOMS filtering within the scope of tool experiment 12 evaluating TMuC Tools. In a high efficiency configuration scenario, MOMS was tested against the SIFO default. Bit rate savings are highly sequence dependent. Thus, average BD-rates of -4.6% are shown as well as BD-rates of 12% when comparing MOMS to the SIFO reference. The average encoder runtime of MOMS is 62%-68% of the SIFO encoder runtime. On the other hand, the average decoder runtime of MOMS is 111% to 122% of the SIFO decoder runtime. In a low complexity scenario, MOMS was tested against the DIF default. MOMS performs better than DIF in terms of coding efficiency with average bit rate savings around 5% while encoder runtimes up to 139% and decoder runtimes up to 133% of the DIF runtimes have been measured. The following remarks and observations were recorded: Results show slight increase of BR (0.3%) for HE case, but reduction of encoding time by roughly 35%, increase of decoding time by roughly 15% For LC (6-tap), rougly 6% BR decrease, increase encoding time by roughly 25%, decoding time by >30% Note: It is reported that the results may be wrong due to a bug in the fixed-point implementation which is effective in class E (static background) Sequence dependent, significant gain comes from BQSquare, BQTerrace, BlowingBubbles, PartyScene Uses 12 bit precision in case of IBDI off, 16 bit otherwise. Due to usage of IIR filter, also full-pel values are filtered. 174 Potentially this implies that 2 buffers are needed (as in the current TMuC implementation). Bringing it to one buffer would certainly affect the performance. 14.5.1.1.1.1.1.1.2 JCTVC-C087 TE12.2: Experimental results of interpolation filter [T. Chujoh, K. Kanou, T. Yamakage (Toshiba)] In this contribution, detailed results of interpolation filters were reported. This is one of reports in Tool Experiment 12: Evaluation of TMuC Tools. The interpolation filter of the high efficiency anchor is SIFO (Switched Interpolation Filter with Offset) and that of the low complexity anchor is DIF (Directional Interpolation Filter). In this experiment, some DCT-IFs (Discrete Cosine Transform – Interpolation Filters) are compared to anchor filters. The experimental results show that the coding gain of DCT-IF 12-tap compared to SIFO in the case of high efficiency anchor is average of 0.1% and the coding gain of DCT-IF 6-tap compared to DIF in the case of low complexity anchor is average of 3.6%. The encoding of DCT-IF 12-tap compared to SIFO in the case of high efficiency anchor is about 2/3 and the decoding time is reportedly decreased by 5%. The encoding and decoding times of DCT-IF 6-tap compared to DIF in the case of low complexity anchor were reportedly stable. For HE cases, little gain of DCT-IF 12 (0.1% on average), reduction of encoding time by roughly 30%. For LC cases, 1.7% gain on average for DCT-IF 6-tap, no significant increase in encoding time, for 8-tap filter 3.5% gain, increase of encoding time by 10%. It was remarked that the performance seems to be highly variable over different sequences, e.g. roughly 14% for BQSquare in LD LC. 14.5.1.1.1.1.1.1.3 JCTVC-C241 TE12: Crosscheck on interpolation filter: SIFO(DIF)/MOMS [R. Panchal, M. Karczewicz (Qualcomm)] The following remarks and observations were recorded for this contribution: Confirms results of JCTVC-C050, emphasize different treatment of chroma (higher precision filters) by MOMS (SIFO compared against 0.7) Confirm sreport of JCTVC-C087, difference of TMuC 0.7 and 0.8 due to bug fix; 0.8 roughly reduces encoding time by 25%, BR by 0.15% for SIFO Emphasizes that SIFO has additional advantage of illumination comp (not effective in current test set). 14.5.1.1.1.1.1.1.4 JCTVC-C239 TE12: Report on interpolation filter: SIFO/DIF [R. Panchal, M. Karczewicz (Qualcomm)] The following remarks and observations were recorded for this contribution: Confirms results of Toshiba (C087) Gain of SIFO12 vs. DIF6 in RA/LC 5.5%BR, 200%ET, 115% DT, LD/LC 5.9%, 176%/110% Loss of DIF6 vs. SIFO in RA/HE 3.3%BR/59%ET/96%DT, LD/HE 3.9%BR/65%ET/97%DT 14.5.1.1.1.1.1.1.5 JCTVC-C162 TE12.2: Results on MC interpolation filters [K. Kondo, T. Suzuki (Sony)] This document reported the results of MC interpolation filters, especially SIFO/DIF as compared with DCT-IF for interpolation filtering in TE12. In the test, the coding performance and complexity were measured under common test conditions, which were defined in JCTVC-B300. The coding 175 performance and complexity of MOMS were reported in addition to DCT-IF. These results were shown in the second part of the contribution. Detailed results were summarized in an attached Excel sheet. The contribution confirmed the results of JCTVC-C087, with a slight difference due to different platforms. Additional results: for LC with 6-tap DCT-IF BR reduction 2.8% RA, 0.7% LD, no difference in ET & DT for HE with 8-tap DCT-IF RA: BR increase 0.2% / 61% ET, LD: BR 1.1% / 65% ET for LC with 4-tap DCT-IF 0.8% BR increase RA, 7.4% BR increase LD still ET roughly 100% The contribution also confirmed the results of JCTVC-C050, with the following aspects noted: Additional results on 4-tap MOMS for LC cases show BR red. 5.4%/6.1% BR red., 7%/6% ET inc. 10%/14% DT inc. for RA/LD cases compared to DIF anchors (note: MOMS with bug) Confirms significant gain of MOMS in chroma due to use of longer filter (bilinear in reference cases) Analysis of MOMS complexity: Due to usage of an IIR filter, the entire frame needs to be processed first. Memory complexity is significantly increased – higher memory access than conv. 8-tap. 14.5.1.1.1.1.1.1.6 JCTVC-C197 TE12.2: Results for SIFO-6Tap with DIF tests by Nokia and Qualcomm [K. Ugur, J. Lainema (Nokia), R. Panchal, M. Karczewicz (Qualcomm)] This contribution presented the Tool Experiment 12 results on Switched Interpolation Filter with Offset (SIFO) that switches separable and directional interpolation filters with 6 taps. BR increase roughly 2% average for HE, BR decrease roughly 3% for LC ET roughly 50% increase for LC, 10% decrease for HE DT number not known, probably slight decrease in HE, slight increase in LC cases Emphasized that switching between directional and 2D filters (at slice level) gives benefit (mostly for LC case) Also reports advantage vs. DCT-IF 6-tap, particularly in LD LC gain is achieved over all classes. 14.5.1.1.1.1.1.1.7 JCTVC-C201 TE12.2: Experimental results of SIFO (DIF) / DCT-IF [E. Alshina, J. Chen, N. Shlyakhov, A. Alshin, W.-J. Han (Samsung)] This document reported the results of the chroma DCT-IF extension within the scope of tool experiment 12 evaluating TMuC Tools. For the test, DCT-IF 6 tap was used to replace the existing default chroma interpolation (bilinear filter for high efficiency and DIF for low complexity). The interpolation filter for luma remained as it was for both high efficiency and low delay configurations, which is SIFO for high efficiency and DIF for low complexity, respectively. Experiments results reportedly showed that, in the high efficiency test configuration, the average BD-rate reduction of Y, Cb, Cr components are 0.2%, 5.5% and 6.6%, respectively, for random access, and 0.5%, 2.5% and 2.9% for low delay. In the low complexity test configuration, the average BD-rate reduction of Y, Cb, Cr components are 0.4, 12.7 and 14.1%, respectively, for random access and 1.4, 12.4 and 15.3% for low delay. Encoding and decoding time was similar to that of anchor software, and deviations exist from measurement inaccuracies. 176 Encoding/decoding time increased slightly (2%Enc/7%Dec for LC case) – concern was expressed for this LC case. BD BR saving in chroma may not be the best criterion for judgment, as the effect of better chroma interpolation is more about improving the PSNR of the chroma components, and the percentage of total bit rate for chroma is low. The current default using bilinear interpolation seems to have a drift or rounding problem in chroma interpolation, and the gains reported may partially relate to the fact that this is resolved by higher-order interpolation. The suggestion was made in the contribution to replace the 2-tap filter by DCT-IF 6-tap filter. 14.5.1.1.1.1.1.1.8 JCTVC-C248 TE12.2: Cross verification on DCT-IF for chroma [T. Chujoh, K. Kanou, T. Yamakage (Toshiba)] (missing prior, available first day) This contribution confirmed the results of JCTVC-C201. JCTVC-C309 Cross verification of DCT-IF for chroma [T. Suzuki (Sony)] (late reg., uploaded after meeting had ended) This late document was reported verbally during the meeting: It referred to JCTVC-C201, and was not finished yet when reviewed. However, the results reportedly confirmed the results appropriately for classes C and D. The actual document contribution was uploaded significantly after the meeting had ended. 14.5.1.1.1.1.1.1.9 Overall summary of interpolation filter investigations: It would be preferable to have the same chroma filter when doing an experiment where different luma interpolation filters are compared. The issue of chroma interpolation should be further investigated and must be better understood. MOMS: Good performance, but has open issues. Bug reported, claimed to potentially improve performance for class E. Bidirectional IIR filter requires doubling the memory, and resolving that may reduce the performance. SIFO has highest objective gain typically, but higher encoder complexity. For DCT-IF, DIF, SIFO in the various configurations (4..12tap) the group decided to provide a table about coding performance and enc./decoder complexity. Kemal Ugur was asked to coordinate a breakout to collect and organized the data to take as a basis for decision. 14.6 TE12 TMuC motion representation 14.6.1.1.1.1.1.1.1 JCTVC-C030 TE12: Report on AMP evaluation [E. Francois (Technicolor), L. Guillo (INRIA), A. Ichigaya (NHK), H. Yu (Huawei)] (missing prior, available first day) The following remarks and observations were recorded for this contribution: Turning AMP off results in 1-2% BR increase for the 4 inter configurations, no large variations over the classes (slightly more gain in class E makes the LD cases a bit better). Slight deviations of results were observed on Windows vs. Linux platforms. Encoding time was reduced by roughly 33% by turning off AMP. 177 The relationship with other methods investigated was in TE3. Block partitioning appears useful, but not necessarily by AMP. 14.6.1.1.1.1.1.1.2 JCTVC-C104 TE12: Huawei & HiSilicon report on AMP evaluation [X. Zheng (HiSilicon), J. Zhou, H. Yu (Huawei)] This report confirmed the results of JCTVC-C030. 14.6.1.1.1.1.1.1.3 JCTVC-C149 TE12.2: Report on asymmetric motion prediction unit (AMP) on/off [A. Ichigaya, K. Iguchi (NHK)] This report confirmed the results of JCTVC-C030. 14.6.1.1.1.1.1.1.4 JCTVC-C286 TE12.2: Technicolor & INRIA report on AMP evaluation [E. Francois (Technicolor), L. Guillo (INRIA)] The report confirmed results of JCTVC-C030. 14.6.1.1.1.1.1.1.5 JCTVC-C119 TE12.2 report on MV prediction AMVP/IMVP [K. Sugimoto, A. Minezawa, S. Sekiguchi (Mitsubishi Electric)] The following remarks and observations were recorded for this contribution: Switching IMVP on instead of AMVP produces BR increase (<0.5% on average, 1% for class A), and encoding time also increases slightly. IMVP seems not to give value. Further investigation of AMVP was performed in TE11. 14.6.1.1.1.1.1.1.6 JCTVC-C163 TE12.2: Results on MV prediction [K. Sato, T. Suzuki (Sony)] This contribution confirmed the results of JCTVC-C119, with identical results at the bitstream level (both run under Linux). 14.6.1.1.1.1.1.1.7 JCTVC-C265 TE12: Evaluation of adaptive motion vector resolution (AMVRES) [W.-J. Chien, P. Chen, M. Karczewicz (Qualcomm)] The following remarks and observations were recorded for this contribution: Result were provided only for HE cases TE12 result: the BR increase by turning AMVRES off is roughly 0.5%, the effect on encoding time was roughly 10%. The new proposal of "unified AMVRES" that adapts to prediction direction results in – 1.4% difference on low delay, 0.6% for RA. Additional results on GPB=1+ AMVRES=0 / GPB=0 + AMVRES=1 shows 4.7% / 3.5% decrease in BR and encoding time increase 215% / 112% compared to LD HE anchor. 14.6.1.1.1.1.1.1.8 JCTVC-C182 TE12.2: Cross-check of adaptive motion vector resolution (AMVRES) technology in the TMuC software [A. Segall (Sharp)] (missing prior, available first day) This contribution confirmed the results of JCTVC-C265. 14.7 TE12 TMuC partition-based illumination compensation 14.7.1.1.1.1.1.1.1 JCTVC-C041 TE12: Performance of partition based illumination compensation (PBIC) [A. Fujibayashi, S. Kanumuri, F. Bossen, T. K. Tan (NTT DoCoMo)] 178 This contribution was a TE12 contribution presenting the performance of Partition Based Illumination Compensation (PBIC) tool integrated into the TMuC software. In addition to the test conditions of TE12, the contribution compared PBIC to two other tools, namely SIFO and GRefMode, present in the TMuC software that can handle illumination changes. This comparison was also done when a linear fade was applied to the test sequences. The results reportedly showed that PBIC has no impact on coding efficiency for the regular test sequences and a large gain (11-20% depending on the config) for the test sequences with linear fade. PBIC reportedly has a minimal impact (2-5% increase) on encoder/decoder run times. The authors said that this comparison indicated that PBIC is the top performing tool in terms of coding efficiency and has the least increase in encoder run time. The following remarks and observations were recorded for this contribution: For the common conditions test sequences, PBIC gives no gain or even produces a small loss (1%) in low complexity modes (could be due to the entropy coding). For the same test sequences (first 2 seconds) with artificial fade (1 s fade out/in each), PBIC was better in terms BR reduction (12% on average) than SIFO (5%, TMuC) and GRefMode (10%, from A124), and less complex. For more natural sequences with illumination variations (Nuts, Shuttlestart, Flower), a 2.6% benefit was reported on average. It was remarked that cross fades may be more relevant than fade-in/-out The PBIC proposal seems to be somewhat better than other potential methods of illumination compensation, but shows no gains on the current test set, and it is probably not a high priority to include dedicated tools for illumination variations. 14.7.1.1.1.1.1.1.2 JCTVC-C232 TE12.7: Verification of JCTVC-C041 (partition based illumination compensation - PBIC) [N. Sprljan, S. Paschalakis, P. Wu (Mitsubishi Electric)] This contribution confirmed the results of JCTVC-C041 on the common conditions set, and with some of the artificial fades, although there were some small deviations due to rounding errors. It was reported that the author had investigated the source code. 14.7.1.1.1.1.1.1.3 JCTVC-C246 TE12.8: Cross-verification of partition-based illumination compensation (PBIC) [P. Onno (Canon)] This contribution confirmed the results of JCTVC-C041 on the common conditions set. 14.8 TE12 TMuC filtering 14.8.1.1.1.1.1.1.1 JCTVC-C235 TE12: Evaluation of adaptive in-loop filter [I. S. Chong, M. Karczewicz (Qualcomm)] The effect of the adaptive in-loop filter on BD measures was reported as follows: High complexity Intra : 2.2% (max 4.2% for class A) Random access :3.7% (max 5.8% for class B) Low delay: 5.2% (max 8.5% for class E) Low complexity Intra :-2.2% Random access:-4.8% 179 Low delay: -4.3% Decoding time increased by around 15% (10-20 depending on cases). The report on encoding time (no significant difference) seems to contradict with the X-check, probably caused the the cluster environment which may not deliver reasonable numbers. 14.8.1.1.1.1.1.1.2 JCTVC-C129 TE12: Cross verification for adaptive loop filter (ALF) (on vs. off) [Y.-J. Chiu, L. Xu, W. Zhang, H. Jiang (Intel)] This contribution confirmed the results of JCTVC-C235 in terms of BR and DT; Encoding time seems also not to be consistent (increase by 30% switching ALF off?) due to usage of a computing cluster. 14.8.1.1.1.1.1.1.3 JCTVC-C153 TE12: Report on deblocking filter [P. Chen, M. Karczewicz (Qualcomm)] The following remarks and observations were recorded for this contribution: Marginal changes in BD rate were observed. The deblocking filter does not have significant impact on PSNR 14.8.1.1.1.1.1.1.4 JCTVC-C148 TE12: Crosscheck on deblocking filter in TMuC0.7 by MediaTek [Y.-L. Chang, Y.-W. Huang, S. Lei (MediaTek)] This contribution confirmed the results of JCTVC-C153 in BD rate. The encoding/decoding time was roughly 3% lower when deblocking was switched off. 14.8.1.1.1.1.1.1.5 JCTVC-C214 TE12.5: Results for adaptive loop filter using prediction and residual [M. Narroschke (Panasonic)] This contribution was a part of the TE12: It presented the results of the Adaptive loop filter using the prediction and residual as filter inputs. An average Y-BD-rate gain was reported of 1.4% for Intra High Efficiency, 1.3% for Random Access High Efficiency, and 0.8% for Low Delay High Efficiency compared to the anchors. It was proposed to include it in the TM. The test results were not complete yet due to recent bug fix work (the results were more complete in the cross check). A gain of roughly 1% average (higher on intra) compared to ALF was reported. Encoder complexity was roughly 25-30% higher than with ALF. Decoder complexity was also increased by roughly 10%. 14.8.1.1.1.1.1.1.6 JCTVC-C230 TE12: Cross check result of Panasonic's 3-input-ALF [I. S. Chong, M. Karczewicz (Qualcomm)] This contribution confirmed the results of JCTVC-C214. In comparison against ALF, the BD BR improvement was 1.2% for Intra Low Complexity, 1.1% for Random Access Low Complexity, and 0.9% for Low Delay Low Complexity 14.8.1.1.1.1.1.1.7 JCTVC-C086 Experimental results of ALF on low complexity [T. Chujoh, T. Yamakage (Toshiba)] The BR reduction in LC configurations was: RA 4.83%/2.77%/7.56% for ALF alone / DCT-IF 6-tap alone / combination LD 4.27%/0.67%/7.11% for ALF alone / DCT-IF 6-tap alone / combination The suggestion of the authors was to adopt ALF in the test model. It was remarked that the relationship of ALF and interpolation filters still needs to be better understood 180 JCTVC-C212 Cross-verification of JCTVC-C086: Experimental results of ALF on low complexity [W.-J. Han (Samsung)] This contribution confirmed the BR reduction results reported in other contributions for this feature. 14.8.1.1.1.1.1.1.8 Overall conclusions on filtering: ALF results were consistent, with average rate gain slightly less than 5%, decoder runtime increased around 15%. Encoder runtime reports were not fully consistent due to usage of clusters. The relation between interpolation and loop filtering needs to be better understood. Comparison against post filter (TE 10) should be considered. 14.9 TE12 TMuC intra-frame prediction 14.9.1 Combined intra prediction 14.9.1.1.1.1.1.1.1 JCTVC-C092 TE12: Report on combined intra prediction evaluation [K. Chono, K. Senzaki, H. Aoki, J. Tajime, Y. Senda (NEC)] This contribution presented a report on Combined Intra Prediction (CIP) evaluation. Simulation results reportedly showed that: BD-Rate increases caused by disabling CIP are 0.3%, 0.2%, and 0.1% for the High Coding Efficiency (HCE) All Intra, Random Access, Low Delay coding conditions; BD-Rate increases caused by disabling CIP are 0.2%, 0.1%, and 0% for the Low Complexity (LC) All Intra, Random Access, Low Delay coding conditions; Encoding time decrease observed by disabling CIP is about 20% for the HCE and LC AllIntra coding conditions; significant encoding time decrease was not observed by disabling CIP in other coding conditions; and Significant decoding time decrease was not observed in all coding conditions. It was proposed that CIP is not used in default coding configurations in the TMuC software due to its reported small coding gain and that CIP should be re-evaluated on the top of an HEVC test model based coding tool set established as determined by the TE12 results. It was further noted that when CIP is used, increased encoder complexity (e.g., by 20% for all-intra runtime measure) is needed to determine whether to use CIP or not. It was suggested that storing some information could be used in the encoder rather than recomputing it, somewhat reducing the encoding runtime increase. It was remarked that the recursive nature of the processing in the decoder may make it difficult to implement this mode in a highly-optimized fast implementation. 14.9.1.1.1.1.1.1.2 JCTVC-C174 TE12.3: Report on combined intra prediction [T. Yamamoto, A. Segall (SHARP)] (missing prior, available first day) The resultsin this contribution were reported to be consistent with that reported in JCTVC-C092. 14.9.1.1.1.1.1.1.3 JCTVC-C213 Additional results for combined intra prediction [M. Mrak, T. Davies, D. Flynn, A. Gabriellini (BBC)] This document is not actually directly within the planned scope of TE12, but is closely related and is thus discussed in this section. This document provides further simulation results for the Combined Intra Prediction (CIP) tool. As implemented in TMuC 0.7, the configuration of CIP was reported to be suboptimal. This document 181 proposed improvements to the CIP implementation, and the results reportedly showed that the gain of CIP can be two to three times greater than in the original implementation. The asserted suboptimality was asserted to be a result of several factors, which include: RD search with CIP for one PU splitting type only, as determined by ANG search; Suboptimal selection of constants used for CIP prediction and signaling; CIP's open-loop component computed without taking into account AIS; RD search without CIP when ROT is used. Some results were shown for a modified CIP, with roughly 0.5% reported improvement beyond that of the prior CIP design proposal on intra conditions. Work on further improvements was reported to be ongoing. It was remarked that some of the new runtime measurement results seem difficult to justify. After discussion, it was concluded that, for now, CIP should not be in the TM. Further study was encouraged. 14.9.1.1.1.1.1.1.4 JCTVC-C297 Cross-verification of JCTVC-C213: additional results for combined intra prediction [W.-J. Han (Samsung)] (late registration, missing prior, available first day) This contribution was a cross-verification report of JCTVC-C213: additional results for combined intra prediction. Provided source code was investigated. Both encoder and decoder running were done. TMuC 0.7 software had been used and the provided patch file (several lines of changes) was applied to the software. The provided patch included 1) adjustment of weights between open-loop and close-loop predictors 2) removal of encoder-side penalty due to the fast estimation process of the combined intra prediction. Both changes seem not relevant to the coding tool itself. All results were reported to be perfectly matched to those of JCTVC-C213. 14.9.2 Adaptive intra smoothing 14.9.2.1.1.1.1.1.1 JCTVC-C054 TE12.3: Adaptive intra smoothing (AIS) test (off vs. on) by Fraunhofer HHI [B. Bross (Fraunhofer HHI)] This document reported results of testing adaptive intra smoothing (AIS) within the scope of tool experiment 12 evaluating TMuC Tools. For the test, "AIS fast" was disabled and tested against the AIS enabled default. The test was thus testing "adaptive smoothing" versus "always smoothing" which was reported to perform better than using no filter, but the selection of which non-adaptive case works better depends on the configuration of other related coding tools, in particular including UDI. For an intra coding only constraint set, average bit rate increases between 2.4% and 4% were reported for the fast implementation. Encoder runtimes for this constraint set are 61% of the default AIS enabled implementation. For the inter-predictive coding constraint sets, the average bit rate increases between 0.4% and 1.5% are smaller. Encoder and decoder runtimes are 92% and 97% of the default AIS enabled runtimes. It was remarked that the setting of the default operation is part of the proper analysis of the technique. A good analysis should include smoothing always off, smoothing always on, and adaptive smoothing selection. 14.9.2.1.1.1.1.1.2 JCTVC-C120 TE12.3: Cross-check report on AIS on/off [K. Sugimoto, A. Minezawa, S. Sekiguchi (Mitsubishi Electric)] The purpose of this document is to report the result of a cross-check on Adaptive Intra Smoothing ON/OFF conducted by HHI according to the conditions specified in JCTVC-B312. The verification 182 task was reported to have been done successfully and the results reported exactly matched those provided by HHI. However, some difference in measured encoding time on the same platform was reported for nonAIS operation. The contributor reported that the non-adaptive case was always faster than the adaptive case. This was suggested to be some problem in the random-access case anchor time measures used by the contributor of JCTVC-C054. The benefit relative to the always off case measured in all intra coding was measured as 0.5-0.6% for intra-only configurations. 14.9.2.1.1.1.1.1.3 JCTVC-C044 TE12: Results for experiments on max CU size, RDOQ and AIS [F. Bossen, T. K. Tan (NTT DoCoMo)] This section discusses the AIS part of this contribution – other parts of the same document were addressed in other categories. Similar behavior was reported as in the other contributions. Disabling AIS with disabling smoothing always loses about 0.5% relative to using AIS, but makes the encoder run substantially slower (e.g., 40%). Fast decision encoding reduces the encoding time of AIS, but has an R-D penalty that is actually more severe than just always disabling smoothing. Decision: As suggested by the contributor, it was agreed to disable AIS and set DEFAULT_IS to 0 in the TM and reference configurations. This may depend on various factors that could be affected by further study. This does not affect consideration of proposed modified techniques as suggested in some other related input contributions such as JCTVC-C302, JCTVC-C234 and JCTVC-C269. 14.9.2.1.1.1.1.1.4 JCTVC-C053 TE12.3: Adaptive intra smoothing (AIS) test (slow vs. fast) by Fraunhofer HHI [B. Bross (Fraunhofer HHI)] This document reported results of testing slow and fast encoder estimations for AIS within the scope of tool experiment 12 evaluating TMuC Tools. For the test, "AIS fast" was enabled and tested against the "AIS slow" default. For an intra coding only constraint set, average bit rate increases between 0.9% and 1.2% were reported for the fast implementation. Encoder runtimes for this constraint set reportedly ranged from 76% to 72% of the default slow implementation. For the interpredictive coding constraint sets, the average bit rate increases between 0.1% and 0.5% were smaller. Encoder and decoder runtimes are reportedly similar for these cases and deviations cannot be clearly distinguished from measurement inaccuracies. These results seemed consistent with that reported in JCTVC-C044, specifically consistent with the assertion that "fast AIS" operation performs worse (as tested) than always disabling the smoothing. 14.9.2.1.1.1.1.1.5 JCTVC-C281 TE12: Evaluation of fast adaptive intra smoothing [M. Coban, Y. Zheng, M. Karczewicz (Qualcomm)] This contribution presented test results for the fast adaptive intra smoothing listed in Tool Experiment 12: Evaluation of TMuC Tools. The related presentation material was requested to be uploaded. Similar results were reported as described for other contributions on this topic (JCTVC-C044 and JCTVC-C053). 14.9.3 Edge-based prediction 14.9.3.1.1.1.1.1.1 JCTVC-C171 TE12.3: Experimental results of edge based prediction [A. Tabatabai, C. Auyeung (Sony)] Contribution JCTVC-C171 presented results of edge based prediction under TE12.3 for the high and the low complexity intra configuration. Simulations reportedly showed that when edge based prediction is disabled, for both high and low complexity intra configuration, the average BD bit rate 183 is reduced by about 0.1%. Cross verification of the results was done with Panasonic. The measured average BD bit rate was the same as measured by Panasonic. However, the results in this contribution have some small differences from the results of Panasonic. The related presentation material was requested to be uploaded. The experiments were conducted with TMuC version 0.7.1 with "bug fix 51". Further details regarding macro settings were described in the document. Visual comparison was not performed. The minor benefit measured for this coding tool led to the suggestion in the contribution that edge prediction should not be included in the first test model of HEVC. This was agreed. There are some new related variations discussed in the TE6 context that remained under consideration at the time of discussion of this contribution. 14.9.3.1.1.1.1.1.2 JCTVC-C216 TE12.3: Results for edge based prediction [V. Drugeon (Panasonic)] Similar results were reported as above for JCTVC-C171. The current implementation of edge based prediction in the current TMuC software using the test configuration proposed by TE12 did not show any gain. Interaction with other coding tools was suggested as the reason that the tested edge-based prediction scheme did not show gain. To test this hypothesis, testing was performed with CIP, ROT, AIS, MDDT, adaptive scan, and unified intra prediction all disabled in the software. In that context the edge-based prediction scheme was reported to provide an average gain of 1.8% (with maximum gain of 4.8%) for intra test conditions. The question was asked how much coding performance was reduced by the disabling of the other intra coding tools – and the contributor indicated that they did not know. The related presentation material was requested to be uploaded. A relationship with technology in TE6 was also noted by the contributor. Further study of similar techniques was suggested by the contributor, toward identifying how such technology may ultimately prove beneficial, depending on the testing context in regard to other coding tools. Such further study was encouraged. It was agreed that edge based prediction should not be in TM. 14.9.4 Planar prediction 14.9.4.1.1.1.1.1.1 JCTVC-C188 TE12.3: Results for planar prediction tests by Nokia [J. Lainema, K. Ugur, O. Bici (Nokia)] This document reported Nokia results for the TE12.3 experiment on disabling planar intra prediction in the TMuC 0.7 environment. Since planar prediction is reported to be a tool to improve subjective quality, rather than objective quality, visual examples demonstrating the claimed benefit of planar prediction were also included. The contribution also described an asserted inefficiency in indication of the planar intra prediction mode in the low complexity entropy coding configurations, and proposed a fix for that. In the high efficiency intra coding configuration case, basically no difference in objective performance was measured. In the low complexity intra coding configuration case, an objective loss of 0.9% in bit rate was reported. Random access and low delay testing resulted in the expected results – diminished impact in objective terms. Joint coding of the planar mode flag and the intra split flag was tried for the low complexity configuration case. With this alteration, the objective loss of 0.9% was reduced to a loss of 0.3%. Artifact examples were shown in the document with QP=32 and 37, showing a substantial decrease in visual artifacts for smooth gradually-changing regions (which are essentially what is modeled by this mode), such as sky areas. 184 The contributor was asked about the complexity of the technique – who responded that there may actually be a net complexity reduction from the technique, since no residual difference coding is performed when this prediction mode is used (in the way it is currently found in the TMuC). It was suggested that adapting the quantizer step size in smooth regions may be able to mitigate the illustrated artifacts. The contributor indicated that such techniques may increase encoding complexity and that the handling of block edges in this technique helps avoid blocking artifacts that would be produced in other techniques. Some interaction with other proposed techniques such as MDDT was also suggested to potentially exist. The question was asked how often the mode gets selected – this suggested to typically be 10-15% in all intra coding. A TENTM co-submitter expressed the opinion that the technique substantially helps subjectively. 14.9.4.1.1.1.1.1.2 JCTVC-C244 TE12.3: Report on planar intra prediction [D. Flynn (BBC)] (missing prior) This contribution reviewed before it was available for download, due to network issues. This report presented the configuration and results from testing the intra planar prediction mode. When this mode is enabled, there is, for most configurations, no significant change in BD-Rate, with low-complexity configurations, there is a loss of between 0.9% (Intra) and 0.2% (Random Access) BD-Rate. With all configurations, there is a reduction in encoder runtime associated with enabling the intra planar prediction mode. These results are identical to those produced by Nokia in JCTVC-C188 within tolerance of ±1kbit/sec, ±0.005dB PSNR. The root cause of the loss in compression performance had not been analyzed. It was reportedly suspected that it is brought about by the combination of the following: The planar mode, as implemented, effects an implicit skip=true flag, and it was suspected that the reductions in runtime that the planar mode brings is through the use of the skip flag. It was suspected that the error for planar blocks is not distributed uniformly across the block, rather, the error is greatest at the lower and right edges. Such an error can potentially affect subsequent prediction units since no weighting is applied to the distortion calculation. It was suggested that further study and potential improved harmonization with related coding elements may show some promise for the technique. Further refinement of other elements may also show benefits (e.g., regarding residual skip handling). Subjectively, the technique was suggested to have a nice matching of adjacent regions that are predicted the same way. But the benefits seem region-dependent (e.g., better for sky than for faces) and dependent on non-normative encoding techniques (e.g., well optimized AVC intra coding looks substantially better than naïve decision encoding). As currently used, it does actually reduce encoding and decoding runtimes (by roughly 5-8%) – although it was remarked that we should not read too much into small differences in speed in the software at this stage. A decision on what is to be done in other related areas may affect what we would decide to do with this technology, at least at this stage of study and integration. Somewhat mixed feelings were expressed, but the strength of evidence on this feature at this time did not seem to justify immediate inclusion in the first TM. Further study was encouraged. 14.10 TE12 TMuC transforms and coefficient coding 14.10.1Transform skip flag 14.10.1.1.1.1.1.1.1 JCTVC-C047 TE12.4: Transform skip flag test (off vs. on) by Fraunhofer HHI [B. Bross (Fraunhofer HHI)] 185 This document reported the results of testing the transform skip flag for the transform unit quadtree (RQT_ROOT) within the scope of tool experiment 12 evaluating TMuC Tools. For the test, RQT_ROOT was enabled and tested against the RQT_ROOT disabled default. In a high efficiency scenario, similar performance was reported when enabling RQT_ROOT – with bit rate savings were reported to be around 0.1%. In a low complexity scenario (using LCEC entropy coder), enabling RQT_ROOT reportedly performs better than the default in terms of coding efficiency – with average bit rate savings around 3.6%. Encoder runtimes ranging from 93% to 106% and decoder runtimes from 91% to 103% of the default runtimes had been measured. It was remarked that this experiment used version 0.7.2, which was modified for "LCEC Phase 2" in 0.7.4. The question was asked about the potential interaction of this proposal with that one. The transform skip flag handling was asserted to be essentially the same in this proposal as in the Dresden HHI proposal. The benefit was found primarily when using LCEC entropy coding – will we be keeping that? It seemed that if we keep LCEC entropy coding, then we should also use this scheme (pending any conflicting later decisions at the meeting). 14.10.1.1.1.1.1.1.2 JCTVC-C258 TE12: Evaluation of transform skip flag (RQT_ROOT) [W.-J. Chien, P. Chen, M. Karczewicz (Qualcomm)] This contribution was a cross-verification report with similar results as reported in JCTVC-C047. The algorithm and code was studied in detail – which resulted in a related contribution discussed below. All test cases were performed. 14.10.1.1.1.1.1.1.3 JCTVC-C267 Test results of transform skip flag and phase 2 VLC integration [W.-J. Chien, P. Chen, X. Wang, M. Karczewicz (Qualcomm)] This contribution was not directly part of the planned TE12, but is sufficiently closely related to merit being considered together with it. This contribution presented test results for Transform Skip flag (HHI_RQT_ROOT) under "Phase 2 LCEC VLC integration". According to the simulation results, further investigations and testing should be conducted to indentify the interaction of the transform skip flag and the phase 2 VLC. If we don't use the residual quadtree scheme, the proposed Phase 2 scheme was reported to provide a similar capability. If we do use the residual quadtree scheme and the LCEC entropy coding, the proposed skip flag provides a benefit. And the two schemes have roughly similar actual signaling. LCEC phase 2 was something planned for integration in Geneva, but was integrated a bit late, and was asserted to have been intended as original Dresden output (based on JCTVC-A119). Similarly, the RQT_ROOT scheme with the skip flag was asserted to have been intended as original Dresden output as well (based on JCTVC-A116). If we use the residual quadtree approach, it was suggested to use the transform skip flag with the RQT_ROOT scheme. This was agreed. If we have LCEC entropy coding and don't use the residual quadtree approach, it was suggested to use the "Phase 2" scheme (not yet reviewed). – further discussion of this avenue did not appear necessary. 14.10.1.1.1.1.1.1.4 JCTVC-C185 Recent improvements of the low complexity entropy coder (LCEC) in TMuC [A. Fuldseth (Cisco), A. Hallapuro, K. Ugur, J. Lainema (Nokia)] This contribution was not directly part of the planned TE12, but is sufficiently closely related to merit being considered together with it. This contribution reports on recent improvements of the low complexity entropy coder (LCEC) in the Test Model under Consideration (TMuC) context. 186 The low complexity entropy coder (LCEC), based on the JCTVC-A119 HEVC CfP proposal has been adopted into TMuC as described in JCTVC-B205. This has reportedly proved to be a challenging task, especially since the coding structures of the two algorithms are quite different. In particular, the side information parameters of JCTVC-A119 could not be directly mapped to the side information parameters of TMuC and the strategy of JCTVC-A119 had to be modified as appropriate. LCEC has been integrated into the TMuC software in four phases: Phase 0: Phase 1: TMuC v0.7, used as anchor in TE12 low complexity configurations, (including the PIPE vs. LCEC experiment). Phase 2: TMuC v0.1, preliminary version provided by Samsung/BBC. TMuC v0.7.4, disabled by default, but evaluated as part of TE12. Phase 3: Recent improvements not integrated in the official TMuC software. In this document, BD-Rate results for LCEC phase 2 and phase 3 relative to phase 1 were provided. Also, results with and without residual quad tree (RQT) and the transform skip flag (HHI_RQT_ROOT) were reported. The reason is that significant side information parameters such as coded block flag (cbf) are treated differently dependent on the values of these RQT parameters. Finally, a comparison between PIPE and LCEC was given. The Class E sequences reportedly benefit substantially from the proposed scheme. The gap between LCEC (as proposed) and PIPE was reported to be approximately 10%. No cross verification of "phase 3" had been done at this point – see TE12 entropy coding testing. 14.10.2MDDT and ROT 14.10.2.1.1.1.1.1.1 JCTVC-C136 TE12.4: Mode dependent directional transform (MDDT) test (off vs. on) by HKUST [X. Zhang (HKUST)] This contribution reported coding performance tests of MDDT according to Tool Experiment 12 JCTVC-B312. Some difficulties were reported in completing the testing process. Incomplete test results were presented for this reason. 0.7.0 software was used. When MDDT is disabled in the software, ROT is enabled for all block sizes. When MDDT is enabled in the software, ROT is enabled on some block sizes but not others. (ROT was integrated first – when MDDT was later integrated, since it applied only to smaller block sizes, the MDDT enabling switch was designed to revert to the prior ROT behavior when MDDT is off.) The results seem somewhat difficult to interpret because of this behavior. This is not really a simple test of "off" versus "on" for some tool, since the ROT behavior changes when the MDDT behavior changes. In the HE test cases, "MDDT on with ROT on for large block sizes" versus "MDDT off with ROT on for all block sizes" showed no significant performance difference. It was remarked that for LC configuration, some bug fix to the entropy coding behavior was made after version 0.7.0 of the software (bug fix 79) – this may call into question the LC results in this contribution. 14.10.2.1.1.1.1.1.2 JCTVC-C181 TE12.4: Cross-check of rotational transform (ROT) [K. Misra, A. Segall (Sharp)] (missing prior, available first day) This contribution cross-checked the performance of the rotational transform in the context of TE12. The experimental results reportedly showed that, with MDDT enabled, using ROT for the large block sizes introduces an average loss (Y BD-rate for each condition) ranging from 0.0% to 0.6%. This closely matched the rate-distortion results provided by Qualcomm. 187 The software was studied, and was consistent with the description. The 4x4 ROT variant was not tested. Encoding and decoding time measures were too unstable to provide confident measures. 14.10.2.1.1.1.1.1.3 JCTVC-C202 TE12.4: Experimental results of MDDT and ROT by Samsung and Qualcomm [E. Alshina, A. Alshin, W.-J. Han (Samsung), R. Joshi, M. Coban (Qualcomm)] The presentation for this contribution was requested to be uploaded This document described results of three tool evaluation tests on alternative transform field. These experiments were carried out and cross-verified by Samsung and Qualcomm in a framework of TE12 activity. A modified combination of MDDT and ROT was proposed in the contribution. The current combination of MDDT and ROT in the TMuC when enabled together, reportedly provides about 2% benefit for HE intra only (and 1% benefit for RA), with about 45% increase in encoding time. The contributor indicated that this degree of coding time increase is not really necessary – and contribution JCTVC-C250 discusses that topic. The new combination proposed in the contribution reportedly provides about 3% benefit for HE intra only (and 1% benefit for RA). It was remarked that TE7 also has some other transform modification. It was noted that there is also a lower-complexity ROT proposal in JCTVC-C096. A contributor asked if it was possible to estimate the gain from the transform aspects of this and the adaptive scanning that is part of it. There are really two features being tested together here, in terms of scan and transform changes. The opinion was offered that most of the gain is from the scan rather than the transform aspects. Contributions JCTVC-C106, JCTVC-C114, and JCTVC-C263 also discuss adaptive scanning. Considering the complexity of these features, their inclusion in the TM at this time did not appear justified for the benefit shown. 14.10.2.1.1.1.1.1.4 JCTVC-C268 TE12: Report on evaluation of MDDT and ROT [R. Joshi, M. Coban, M. Karczewicz (Qualcomm)] Roughly consistent results were presented in this contribution as in the others JCTVC-C136, JCTVC-C181, JCTVC-C202. 14.10.2.1.1.1.1.1.5 JCTVC-C298 TE12.4 Cross-check of unified MDDT/ROT [T. Davies , D. Flynn (BBC)] (late registration, missing prior) This contribution document contained cross-check results of JCTVC-C202, specifically on providing a unified framework for MDDT and ROT mode selection. The cross-check had only partially been completed, although the results collected so far matched those proposed in JCTVCC202. 14.10.2.1.1.1.1.1.6 JCTVC-C036 Alternative performance measurement of MDDT and ROT in TMuC [R. Cohen, A. Vetro, H. Sun (Mitsubishi)] This contribution was not part of the TE, but is included here since it contains closely related information. This document evaluated the performance of ROT and MDDT both individually and combined in TMuC, using a reference in which both ROT and MDDT were disabled. The results reportedly show that for all-Intra high efficiency conditions in TMuC 0.7, these tools produce average BDRate improvements from 1.6% to 2.1%, where ROT slightly outperforms both MDDT and their combination. For random access high efficiency, the gains were reportedly between 0.7% and 0.9%, with a similar relationship among individual and combined tests. Encoder run times approximately doubled for ROT, whereas decoder run times doubled or tripled with MDDT, depending upon the configuration. Possible future improvements were addressed as well. 188 The need for subjective evaluation was noted. 14.10.3Transform coefficient coding 14.10.3.1.1.1.1.1.1 JCTVC-C049 TE12.4: Transform coding HHI tested against Samsung proposal by Fraunhofer HHI [B. Bross (Fraunhofer HHI)] This document reported results of testing the transform coefficient coding proposed by Fraunhofer HHI against the transform coefficient coding proposed by Samsung within the scope of tool experiment 12 evaluating TMuC Tools. For the test, the Samsung coding was tested against the HHI coding default. It was reported that the Samsung coding results in 1.3% to 2.0% luma bit rate increase compared the HHI coding. The Samsung coding runtime of the encoder is between 92% and 110% of the HHI coding runtime and the Samsung coding runtime of the decoder is between 98% and 120% of the HHI coding runtime. The HHI scheme has adaptive scan order, context modeling for the significance map, and context modeling for the absolute coefficient levels. The Samsung scheme uses fixed scan order and different context modeling. Contributions JCTVC-C106, JCTVC-C114, and JCTVC-C263 also discussed adaptive scanning. 14.10.3.1.1.1.1.1.2 JCTVC-C059 TE12: Evaluation of transform coefficient coding (HHI_TRANSFORM_CODING) with tool breakdown [V. Sze, M. Budagavi, M. Zhou (TI)] The coding efficiency gain of HHI_TRANSFORM_CODING was evaluated with TMuC-0.7.3 to be between 1.3 to 2.0%. A breakdown of this gain across the three main tools was also studied and reported as follows: adaptive scanning order of significance map (0.2 to 0.4%), proposed context modeling of significance map (0.9 to 1.4%) and a proposed context modeling of coefficient level (0.1 to 0.2%). The coding efficiency gain of HHI_TRANSFORM_CODING was also evaluated with TMuC-0.7 by the contributor and was reported to have been verified to be a match with results provided by Samsung. However, the three tools increase the complexity of context modeling – making it more difficult to parallelize for throughput. Decision: The conclusion was to put the HHI scheme in TM, and further study the complexity issues. 14.10.3.1.1.1.1.1.3 JCTVC-C203 TE12.4: Experimental results of transform coefficient coding [J. Chen, V. Seregin, W.-J. Han (Samsung)] This contribution reported similar results as in JCTVC-C049. 14.11 TE12 TMuC entropy coding 14.11.1PIPE vs. CABAC vs. LCEC 14.11.1.1.1.1.1.1.1 JCTVC-C048 TE12.6: PIPE and LCEC tested against CABAC by Fraunhofer HHI [B. Bross (Fraunhofer HHI)] This document reported results of testing the PIPE and LCEC entropy coders against CABAC within the scope of tool experiment 12 evaluating TMuC Tools. In a high efficiency scenario, CABAC was tested against the PIPE default. Bit rate savings up to 0.3% are shown when using CABAC. In a low complexity scenario, CABAC was tested against the LCEC default. CABAC performed better than LCEC in terms of coding efficiency, with average bit rate savings up to 15.5% while encoder runtimes up to 170% and decoder runtimes up to 113% of the LCEC runtimes had been measured. 189 Intra CABAC vs. PIPE 0.2%, PIPE vs. LCEC 12.5%. Random access CABAC vs. PIPE 0.3%, PIPE vs. LCEC 15.2%. Low delay CABAC vs. PIPE 0.3%, PIPE vs. LCEC 15.5%. It was remarked that the RDO encoder decision-making uses CABAC. It seems like this could make a difference. 14.11.1.1.1.1.1.1.2 JCTVC-C051 TE12.6: PIPE tested against LCEC by Fraunhofer HHI [B. Bross (Fraunhofer HHI)] This contribution presented test results that were essentially consistent with those in JCTVC-C048. 14.11.1.1.1.1.1.1.3 JCTVC-C058 TE12: Evaluation of entropy coders: PIPE tested against CABAC [V. Sze, M. Budagavi (TI)] This document reported performance evaluation results of PIPE (Probability Interval Partitioning) in the Test Model under Consideration (TMuC) as part of TE12. The coding efficiency loss of PIPE over CABAC was reported to be between 0.2 to 0.3% using TMuC-0.7. These results were reported to have been verified to be a match with results provided by HHI. Several outstanding issues pertaining to the throughput and complexity regarding the PIPE were discussed in the contribution. The contribution suggested that the difficulty of parallelizing context modeling indicated that more investigation is required to determine how PIPE impacts the overall throughput of the entropy coder. The reported estimated memory size requirement for PIPE is 5x larger than CABAC. The cost of the additional buffers required by PIPE were suggested to also merit additional analysis. More investigation was also requested on how the parallel bitstreams will be handled (i.e. interleaved or separate); if they are interleaved, the complexity and latency implications of the various approaches was requested to be analyzed. The contributor advocated that further study is required on PIPE. Regarding interleaved versus separate modes of PIPE operation, at the moment the TMuC encoder makes a decision after buffering the bits for the picture, and the decoder follows what the encoder selected. It was remarked that reduced power requirements may be more important than reduced area requirements. It was remarked that TE8 studied some related implementation aspects. 14.11.1.1.1.1.1.1.4 JCTVC-C184 TE12.6: PIPE tested against LCEC by Cisco [A. Fuldseth (Cisco)] This document presented TE12 results on PIPE vs. LCEC comparisons. Depending on the configuration (low delay/random access/intra and low complexity/high efficiency), average BDRate figures between 12.3% and 18.0% were reported. It was noted that the quantity of this compression difference may be affected by other proposed modifications. It was remarked that some degree of encoding optimization (e.g., whether the encoder resets the statistical measures at the beginning of each picture or in some R-D decision-making) may also affect the coding efficiency difference. 14.11.2LCEC "phase 2" testing 14.11.2.1.1.1.1.1.1 JCTVC-C196 TE12.6: Results for LCEC_PHASE2 tests by Nokia [K. Ugur, A. Hallapuro, J. Lainema (Nokia)] This contribution presented Tool Experiment 12 results on LCEC_PHASE2. LCEC_PHASE2 represents the second integration phase of the low complexity entropy coding (LCEC). A bit rate improvement was reported as follows: Intra 0.3%, random access 0.4%, low delay 1.6%. 190 The contributor recommended to adopt Phase 2 into the TMuC. 14.11.2.1.1.1.1.1.2 JCTVC-C160 TE12: Crosscheck on LCEC phase 2 [P. Chen, M. Karczewicz (Qualcomm)] This contribution presents the test results for LCEC tool (phase 2) listed in Tool Experiment 12: Evaluation of TMuC Tools. Similar gains were reported as described in JCTVC-C196. 14.12 TE12 TMuC IBDI and transform precision extension 14.12.1.1.1.1.1.1.1 JCTVC-C057 TE12: Evaluation of IBDI and TPE (transform precision extension) [M. Zhou (TI)] This document reported evaluation results of the IBDI and TPE (Transform Precision Extension) for TE12. Experimental results on TMuC-0.7 reportedly show that the total gain with both the IBDI and TEP turned on is 0.9% to 5.0% on average; the average IBDI gain when TPE is off is 0.8% to 4.7%, and it is reduced to -0.1% to 3.3% when TPE is turned on; the average TPE gain when IBDI is off is 0.8% to 1.4%, but it totally vanishes when IBDI is on; The results also reportedly show that the additional left shift bits in the TPE (1 bit shift in T32x32, 2 bits shift in T16x16, and 3 bits shift in T8x8, respectively) do not provide additional gain compared to the 4 bits left shift across the transforms. IBDI (Internal Bit-Depth Increase) Input data is left shifted by 4 bits before passed to video codec Internal data precision of each video coding processing stage increased by 4 bits Reference frame has higher precison (8 bits -> 12 bits) which leads to higher memory bandwidth and increased off-chip storage requirements IBDI is on in the reference HE configuration, off in LC configurations TPE (Transform Precision Extension) Expands the intermediate data precision after the first-stage of transform. The final transform output data precison reamains unchanged. Number of TPE extension bits LC configurations (IBDI off): 7 for T8x8, 6 for T16x16, 5 for T32x32 and 4 for T64x46 Number of TPE extension bits HE configurations (IBDI on): for T8x8, 2 for T16x16, 1 for T32x32 and - for T64x64 It was suggested to study the use of 8 bit reference pictures with 12 bit prediction signals and 12 bit residual signals. It should be easy to modify the software to support it if it is not already configurable in this fashion. Results are shown in the table below (with rounding control enabled), indicating the range of benefit averaged across all test sequences within a particular configuration, out of 6 configurations total – HE & LC for all-intra, low delay, and random access. The italicized boldface entries are suggested as the particularly important ones: 191 Anchor combination Combination under Test Average Gain (BD-rate) IBDI off, TPE off IBDI on, TPE (5,6,7) on 0.8% to 5.0% IBDI off, TPE off IBDI on, TPE off 0.8% to 4.7% IBDI off, TPE (5,6,7) on IBDI on, TPE (5,6,7) on -0.1% to 3.3% IBDI off, TPE off IBDI off, TPE (5,6,7) on 0.8% to 1.4% IBDI on, TPE off IBDI on, TPE (5,6,7) on No gain IBDI off, TPE (4,4,4) on IBDI off, TPE (5,6,7) on No gain Recommendations from the proponent: TPE be turned on in HE and LC configurations by replacing o const int g_iShift8x8 = 7; o const int g_iShift16x16 = 6; o const int g_iShift32x32 = 5; o const int g_iShift64x64 = 4; In TlibCommon\Source\CommonDef.h with o const int g_iShift8x8 = 4; o const int g_iShift16x16 = 4; o const int g_iShift32x32 = 4; o const int g_iShift64x64 = 4; In the transform, it was remarked that increased bit depth between the horizontal and vertical transform stages is somewhat burdensome for implementation. It was also suggested to consider IBDI with 2 bit expansion rather than 4 bit expansion. The rounding control feature for bidirectional averaging (adopted into TMuC in Geneva) was noted to affect the degree of benefit for these techniques. It was remarked that when the rounding control feature was integrated into the software, an additional encoder distortion measurement improvement for biprediction was also included – which resulted in making the encoder substantially slower while improving its R-D behavior). JCTVC-C253 was suggested to be related to this topic. It was concluded that the TM should include 12 bit support (i.e., effectively IBDI) and 4-bit TPE. For LC configuration; use TPE, for HE configuration, use IBDI (without additional TPE). 14.12.1.1.1.1.1.1.2 JCTVC-C088 TE12.7: Experimental results of IBDI [T. Chujoh, T. Yamakage (Toshiba)] In this contribution, results of IBDI (Internal Bit-Depth Increase) and TPE (Transform Precision Extension) testing were reported. This is one of reports in Tool Experiment 12: Evaluation of TMuC Tools. The experimental results reportedly show that the coding gain of TPE in the case of intra coding averages about 1% and the coding gain of IBDI in the case of inter is a total of 3% benefit. The encoding and decoding times were reportedly stable. 14.12.1.1.1.1.1.1.3 JCTVC-C252 TE12: Report on evaluation of internal bit depth increase and transform precision extension [R. Joshi, M. Karczewicz (Qualcomm)] Similar results were presented in this contribution as in JCTVC-C057 and JCTVC-C088. 192 14.13 TE12 TMuC encoder-only aspects 14.13.1.1.1.1.1.1.1 JCTVC-C044 TE12: Results for experiments on max CU size, RDOQ and AIS [F. Bossen, T. K. Tan (NTT DoCoMo)] This section discusses the RDOQ part of this contribution – other parts of the same document were addressed in other categories. Rate-distortion optimized quantization (RDOQ) is a well-known technique for improving the ratedistortion performance associated with the encoding of transform coefficients. In the following experiments this optimization is disabled to quantify its contribution to coding efficiency and to encoding time. Only low-complexity configurations were tested. The reduction of encoding time observed when disabling RDOQ ranges from 8 to 17%. While this difference is significant, the drop in BD-rate performance can be signigicant, up to 3.2% on average for the random-access configuration. It should be noted that the experiments were run on version 0.7 of the software and thus do not include updates to the low-complexity entropy coder (LCEC_PHASE2 macro). Considering the reported results, the contributor suggested to keep RDOQ enabled by default for future work. 14.13.1.1.1.1.1.1.2 JCTVC-C238 TE12.8: Results on RDOQ in high efficiency settings [J. Zan, J. Meng, M. T. Islam, D. He (RIM)] This contribution reported results of cross-checking activities performed by RIM on RDOQ (described in JCTVC-B312) for all three high-efficiency settings. Test results had been compared to those generated by Ericsson. For "intra" and "low delay" modes, RIM test results reportedly matched Ericsson’s; for the "random access" mode, there is minor (less than 0.1%) differences on four sequences: 1) the PeopleOnStreet sequence on the V component; 2) the Kimono sequence on the U component; 3) the ParkScene sequence on the U and V component; and 4)the RaceHorsesC sequence on the V component. Overall, the difference in results between the RIM and Ericsson tests was reported to be very minor and all the data on the summary tab of the results spreadsheet matched each other. The test results were reported to have shown that RDOQ is an effective technique to enhance the coding performance (Rate and PSNR) of the video coding algorithm. Specifically, for intra only setting, the gain by using RDOQ on Y-component was reported to be on average 4.7% (in BD-rate); for random access setting, the gain on Y-component was reported to be on average 4.4%; and for low-delay setting, the gain on Y-component was reported to be on average 2.5%. It was observed that RDOQ offers the largest coding performance gain on Class B sequences. Considering the reported results, the contributor suggested to keep RDOQ enabled by default for future work. 14.13.1.1.1.1.1.1.3 JCTVC-C159 TE12: Cross verification of RDOQ [K. Andersson (Ericsson)] Both low complexity and high efficiency test cases were tested by Ericsson. Similar results were reported by Ericsson as in the other RDOQ contributions. Considering the reported results, the contributor suggested to keep RDOQ enabled by default for future work. 15 Project planning and Test Model establishment 15.1 Project timeline 15.1.1.1.1.1.1.1.1 JCTVC-C022 WG11 USNB contribution: Suggested practices for the HEVC project in the JCT-VC [A. G. Tescher (for USNB to WG11)] 193 The USNB to WG11 requested to establish the following practices for the HEVC project in the JCT-VC: The group-adopted "Test Model" (which is expected to be produced in version 1 form at the October 2010 meeting) should contain only the minimum set of well-tested tools that together form a coherent design that has been confirmed to show good capability. This "minimum set" may include such variations as would be appropriate for forming multiple profiles of the design (but should not otherwise contain significant duplication of functionality). In the event that insufficient analysis is available to establish a consensus on the appropriate minimum set of well-tested tools by the October 2010 meeting, it was suggested that obtaining this consensus should be a higher priority than the goal of adopting a version 1 of the Test Model in October 2010. An encoder text description, a decoder text specification, and software matching the encoder and decoder algorithms should be provided for the tools in the Test Model within one meeting cycle of the adoption of a tool into the Test Model. (It may be appropriate for the encoder text description to be in a separate document from the decoder specification.) Appropriate software coding guidelines should be established and followed to ensure high quality software development of the Test Model. These aspects were tentatively agreed on Sunday a.m. 15.1.1.1.1.1.1.1.2 JCTVC-C023 WG11 USNB contribution: HEVC standardization timeline [A. G. Tescher (for USNB to WG11)] Considering the timeline plan for the HEVC standardization approval process and the substantial need for maturation of the design and text drafting efforts for this new standard, the USNB to WG11 proposed that the timeline plan should be adjusted to: CD ballot to be issued from the April/May 2012 meeting DIS (formerly FCD) ballot to be issued from the October 2012 or January 2013 meeting FDIS to be issued from the July 2013 meeting. 15.1.1.1.1.1.1.1.3 JCTVC-C299 UK National Body comments on HEVC Timescales [UKNB to WG11] (late registration, missing prior, available first day) The UKNB welcomed the progress that had been made so far in the development of the new High Efficiency Video Coding (HEVC) standard. They said that the creation of the "Test Model under Consideration" has been a useful first step in making the transition from a competitive to a collaborative method of working, and that the momentum that has built up around this approach should be maintained. They indicated that there is a need to deliver this important new standard to the market in a timely manner, and requested for MPEG to maintain the timescales that were defined in the Call for Proposals, as follows: Test model selection by 2010/10 Final standard approval by 2012/07 The UKNB further requested MPEG to provide details of the planned intermediate stages of this process, bearing in mind the revised procedural rules to be followed by ISO/IEC JTC 1 and the need to align key steps with the ITU approval process. Further discussion was then postponed to Wednesday (see below) 194 15.1.1.1.1.1.1.1.4 m18579 Comments on HEVC Test Model [CNNB to WG11, late registration, missing prior, available later] The Chinese NB to WG11 submitted this late document, which was not registered as a JCT-VC document. Since it was brief, it is quoted below for the record: "According to the tentative timeline described in Joint Call for Proposals on Video Compression Technology (W11113), the Test Model of HEVC is expected to be created at the October 2010 meeting. CNNB believes that a stable version of Test Model with good quality is essential to the further development of the HEVC standard. Each tool in the Test Model shall be well-tested and be confirmed with good performance. The group should give quality and stability as highest priority in the development of the Test Model. Considering the frequent updates and inadequate testing to the TMuC software during the interim period, CNNB suggests the group adopts only such well-tested tools in the TMuC to create the first version of TM and expects the group could get consensus at this meeting." These remarks were taken into consideration in the subsequent discussion, which is reported below. 15.1.1.1.1.1.1.1.5 Schedule plan discussion (held Wednesday) The current WG 11 plan / UK suggestion was summarized as follows: (WD 11/01) CD July 11 FCD/DIS Jan 12 FDIS July 12. The WG 11 US NB suggestion was summarized as follows: (WD 11/01) CD May 12 FCD/DIS Oct 12 or Jan 13 FDIS July 13. It was remarked that we could establish a WD now (as the TM is a complete solution), rather than waiting until 2011/01. Later at the meeting, we did agree to issue a WD as an output of this meeting. Participants remarked that a CD should be Stable in text Fully verified by software Meeting the goals in terms of coding efficiency (roughly) Opinions expressed were recorded as follows: It would take 4-5 meeting cycles to come from current TM to CD. H.262 / MPEG-2 took 5 cycles; MPEG-4pt2 took more cycles than that. AVC took 16 meeting cycles from initial H.26L reference to final standard (12 to CD). "We should keep the meeting schedule to put sufficient pressure on ourselves", vs. "We could release the pressure and always go faster if things come together more quickly than anticipated." The market is changing rapidly (and new applications are coming), and the timeline should be kept but could be changed later if necessary. Milestones were met so far (CfP/TM), why should we shift the upcoming milestones compared to what was said in the CfP? There is a danger that proprietary codecs may appear and take over if we wait for too long Stable software is needed to serve as the basis of the work, and stability of text depends on that (need for verification of the text by the software). And being able to run the software is one thing, having an opportunity to actually understand it and properly specify it is another. 195 As code is not well documented yet, and a considerable amount of new tools are still needed to achieve the goal of coding efficiency, it will be difficult to reach adequate maturity in the 3 meeting cycle periods between now and a July 2011 CD milestone. Dual track idea – make a low complexity variation first? This might counteract the desire to have most tools in common for LC and HE. We must also make sure that the solution keeps the complexity reasonable – for both HE and LC. Even for an LC variant, there is an HE intent. The current specification (as from TMuC) is still far from being mature. We still need better understanding about the interaction of the different parts of the TM. The compromise then suggested was: CD Jan 12, DIS July 12, FDIS Jan 13. The coding efficiency goal is important and should not be missed. The market will take it up when HEVC achieves 50% compared to AVC. The compromise suggested would be agreeable if Jan 13 is really staying the end point (i.e. unless a substantial instability or failure of reaching the goals is detected in the standard later). This compromise was agreed and recorded in a resolution reported to the WG11 parent body: "The JCT-VC thanks the NBs of UK and US on their valuable comments related to standardization timelines of HEVC. The situation has been considered, particularly including the feasibility of proceeding from the current test model and WD to a mature CD text that fulfils the requirements of the project within the available number of meeting cycles. It is believed that with a timeline of CD 12/02, DIS 12/07 and FDIS 13/01, the probability of producing a high-quality standard would be maximized. With such a timeline being established, only exceptional circumstances should require further delay. Considering comments about the process for test model development that has been followed by the JCTVC, which were raised by these NB comments and by the NB of China, a document N11642 [a document produced for WG11 corresponding to the content of JCTVC-C401] has been issued which provides an explanation of this process." 15.2 Test model design: normative aspects 15.2.1.1.1.1.1.1.1 JCTVC-C121 Suggested approach toward HEVC test model creation [S. Sekiguchi, K. Sugimoto (Mitsubishi Electric), more co-authors added later in a revision] This joint contribution proposed a suggested approach toward definition of HEVC Test Model, which was suggested to be referred to as the "HM" in the contribution, and addressed some software related issues. As the result of test model discussion in the last Geneva meeting, the group agreed to define an HEVC Test Model with only the minimum set of well-tested tools that together form a coherent design that is confirmed to show good capability. Given this agreement, this joint contribution proposed a suggested approach toward definition of HM based on contributions available at the Guangzhou meeting. Significant efforts on TE12 have been discussed at the Guangzhou meeting to identify how much coding efficiency and complexity can be seen per each TMuC tool. And the results of other established TEs should have been discussed for evaluation of other tools those are currently not included into the TMuC. However, these tool basis performance reports will be assessments of each individual tool relative to B300 condition, and it is still difficult to identify what is the best minimum combination of available tools. Meanwhile, there are several contributions each of those reports performance evaluation of candidate tool set (JCTVC-C066, JCTVC-C122, JCTVC-C123). It can be found that each tool set evaluated in these separate 196 contributions has some sort of overlap on a common core set of coding tools, which means that those core tools have intensively been validated as tool combination via cross-verification by multiple parties. This joint contribution recommended for the group to review these tool set basis performance evaluations, to identify a base structure of the HM first, and to define the HM by adding other individual tools to the base structure as those would be identified as efficient. This contribution also addressed some software related issues. The contribution suggested the following: To review all TE reports considering software related issues To review all tool set basis performance evaluation reports available at this meeting To define a base structure of HM first that follows "one tool per one functionality" To define the HM by adding only the currently available tools, which will be evaluated by TEs and provides sufficient coding gain at agreeable complexity, to the base structure of HM 15.2.1.1.1.1.1.1.2 JCTVC-C122 A suggested starting point for HEVC test model [K. Sugimoto, S. Sekiguchi (Mitsubishi Electric), more co-authors added later in a revision] This contribution proposed a set of TMuC coding tools for both high-efficiency and low-complexity conditions, as a starting point of HEVC Test Model (HM) with experimental data jointly verified by multiple companies. The proposed toolset reportedly followed a "one tool per one functionality" principle and was asserted to achieve reasonable coding efficiency relative to the TMuC anchor used for TE12 assessment, while reducing encoding and decoding time significantly. The contributors recommended to construct a base structure of the HM based on the suggested toolset and consider further additions of other tools via careful assessment of all TE results for creation of HM. It was noted that the anchor used in this contribution had LCEC phase 2 enabled, but was otherwise encoded according to JCTVC-B300 with TMuC 0.7.4. The HE variation proposed in the contribution is shown as follows, with topics identified as open during the discussion being shown in italic font: Tool name Coding unit Transform unit Large transform Angular Intra Prediction TMuC default setting (JCTVCB300) Min size: 8x8 Max size: 64x64 Quadtree Proposed setting Parameters for proposed setting Same as B300 Same as B300 Max size: 64 Samsung based Max size: 32 I64: 5 modes I32: 34 modes I16: 34 modes I8: 34 modes I4: 17 modes I2: 3 modes (non-norm?) I64: 5 modes I32: 17 modes I16: 17 modes I8: 17 modes I4: 17 modes I2: 3 modes (non-norm?) 197 QuadTreeTUFlag : 0 QuadTreeTULog2MaxSize :5 MaxTrSize : 32 #if UNIFIED_DIRECTIONAL_INTRA const UChar g_aucIntraModeNumAng[7] = {3, // 2x2 17, // 4x4 17, // 8x8 17, // 16x16 17, // 32x32 5, // 64x64 5 // 128x128}; Rotational transform (ROT) Mode depend. directional transform (MDDT) Adaptive Intra Smoothing (AIS) on/off Combined Intra Prediction (CIP) Planar prediction Edge-based prediction Deblocking filter Adaptive loop filter (ALF) Entropy coder Enabled Disabled const UChar g_aucIntraModeBitsAng[7] = {2, // 2x2 5, // 4x4 5, // 8x8 5, // 16x16 5, // 32x32 3, // 64x64 3 // 128x128}; ROT : 0 Enabled Disabled #define QC_MDDT 0 Enabled Disabled AIS : 0 Enabled Disabled CIP : 0 Enabled Disabled #define PLANAR_INTRA 0 Enabled Disabled EdgePredictionEnable : 0 Enabled Same as B300 Same as B300 Diamond Same as B300 Same as B300 PIPE Transform coefficient coding Internal BitDepth Increase (IBDI) Rate Distortion Optimized Quantization (RDOQ) CABACbased RDO Asymmetric Motion HHI Same as B300 Same as B300 or CABAC Same as B300 Same as B300 4 bits Same as B300 Same as B300 On (non-norm) Same as B300 Same as B300 (non-norm) On (non-norm) Enabled Same as B300 Same as B300 (non-norm) Disabled AMP: 0 198 Prediction Unit (AMP) Block Merging (MRG) Motion vector prediction (AMVP/IMVP) Interpolation filter Adaptive motion vector resolution (AMVRES) Transform Skip Flag PU-Based Merging Partitionbased intensity compensation Transform precision extension Enabled Disabled MRG: 0 AMVP method Same as B300 Same as B300 SIFO DCT-IF 12 tap Enabled Disabled Disabled Same as B300 Same as B300 Disabled Same as B300 Same as B300 Disabled Same as B300 Same as B300 Enabled Disabled InterpFilterType: 0 DIFTap: 12 AMVRES : 0 undefine TRANS_PRECISION_EXT The LC variation proposed in the contribution is shown as follows (showing only the rows that differ with the HE variation shown above), with topics identified as open during the discussion being shown in italic font: Tool name Adaptive loop filter (ALF) Entropy coder Internal BitDepth Increase (IBDI) CABACbased RDO Interpolation filter Adaptive motion vector resolution (AMVRES) LCEC Phase 2 Transform precision TMuC default setting (JCTVCB300) Disabled Proposed setting LCEC Off Same as B300 Same as B300 Same as B300 Same as B300 Off (non-norm) DIF Disabled Same as B300 Same as B300 (non-norm) DCT-IF 6 tap InterpFilterType: 0 DIFTap: 6 Same as B300 Same as B300 Disabled Enabled Enabled Parameters for proposed setting Same as B300 Same as B300 #define LCEC_PHASE2 (1) Same as B300 Same as B300 199 extension Shift bits for transform precision extension 8x8: 7 bit 16x16: 6 bit 32x32: 5 bit 64x64: 4 bit 8x8: 4 bit 16x16: 4 bit 32x32: 4 bit 64x64: 4 bit const int g_iShift8x8 = 4; const int g_iShift16x16 = 4; const int g_iShift32x32 = 4; const int g_iShift64x64 = 4; Some particular other subjects of discussion not shown in the table above were recorded as follows: CU-based merging / MRG – harmonization with skip/direct PU-based merging – not included Bidirectional rounding control 15.2.1.1.1.1.1.1.3 JCTVC-C066 Cross verification of Mitsubishi 0.7.4 configuration [M. Zhou (TI)] This document reported cross-check results of the simplified TMuC-0.7.4 configurations proposed in JCTVC-C122. Independent simulations were reported to have verified that the results that Mitsubishi provided for JCTVC-C122 were accurate. 15.2.1.1.1.1.1.1.4 JCTVC-C065 Testing results on the simplified TMuC configurations [M. Zhou (TI)] This document reported evaluation results of simplified TMuC0.7.4 configurations, the tools included in the simplified configurations are identical to the ones proposed in JCTVC-C122, with the exception that the CABAC instead of PIPE is used in the HE configurations and 12-tap instead of 6-tap DCT-IF is used for the LC configurations. Also tested was the performance of the DCT-IF with different filter lengths (6, 8, 10, and 12) under the simplified configurations. The contributor recommended to use CABAC in the HE reference configurations due to an expressed uncertainty of the PIPE development for throughput improvement purposes, and considering 8-tap DCT-IF for both the HE and LC reference configurations. In terms of compression performance, using CABAC rather than PIPE was asserted to provide some slight (0.1-0.3%) improvement, and additional improvement (3% in random access and 4% in low delay) was reported for LC configurations by using longer DCT-IF (8 or 12 tap rather than 6 tap). 15.2.1.1.1.1.1.1.5 JCTVC-C269 A study on the impact of intra smoothing [A. Tabatabai, C. Auyeung, T. Suzuki (Sony)] This contribution discussed the effect of disabling intra smoothing for the suggested HEVC Test Model in JCTVC-C122. The "always off" setting was reported to be better than always smoothing by about 1.4% for Intra HE, and 1.9% for Intra LC, with respect to the results in JCTVC-C122 with PIPE. Cross verification of the results was reportedly done with NEC. This was tested in the context of the configuration suggested in JCTVC-C122. It was noted that a late input contribution JCTVC-C302 was a new proposal with some relationship to this. 15.2.1.1.1.1.1.1.6 JCTVC-C123 On selecting coding tools for HEVC test model [K. Sugimoto, A. Minezawa, S. Sekiguchi (Mitsubishi Electric)] As the result of test model discussion in the last (Geneva) meeting, the group agreed to define an HEVC Test Model with a minimum set of coding tools consisting of those that can jointly provide sufficient coding gain. This contribution discussed the selection of tools for the HM to create a reliable starting point for the the future standard. Some remarks and observations made during the discussion of the contribution were as follows: LCEC Phase 1 was suggested to have some problems. (This may have been related to the issue fixed in TMuC 0.7.4.) 200 Interpolation filter length was suggested to be studied. Some coding tools seemed to have different effects on different video resolutions. It was suggested to pay attention to complexity issues. 15.2.1.1.1.1.1.1.7 Further discussion on TM definition RQT : o HE: 3-level QT and fast intra encode per JCTVC-C311 Encoder settings: HHI_RQT_INTRA_SPEEDUP_MOD = 0 with HHI_RQT_INTRA_SPEEDUP = 1 (slower search should remain configurable in the software) o LC: 2-level QT, aligned with LCEC Phase 2 (with QCOM CBP VLC), with same fast intra. o Same maximum QT depth for luma and chroma. o Clear description of this was to be provided as a BoG report document JCTVC-C319 by the 1st half of morning of Thursday, with its preparation coordinated by Thomas Wiegand. This was done and the result was reviewed and adopted as recorded below. Entropy coder (CA)PIPE or CABAC o Entropy coding is a core part and should not be exchanged arbitrarily; however, both of these technologies are using same context modeling and are almost identical in performance, and the choice does not affect other tools (PIPE 0.1-0.3% BR increase). o Complexity advantage of PIPE is not clear. o It was agreed that the TM would use CABAC rather than PIPE, further investigation (AHG, CE) was planned on PIPE. Max. 17 vs. 34 intra directions – it was agreed to keep it for the TM of this meeting as it is in current configuration of JCTVC-B300 (maximum set of 34 directions), and to set up an experiment for further investigation. AIS – settled: this will be turned off in the TM / WD Planar Prediction: o There is a slight loss in BD BR, but some experts think that this could improve subjective quality. o The current TMuC does not have encoder decisions based on subjective quality. o Conclusion: Not in TM, further study should be performed. Interpolation filters Suggestions made: a) For LC SIFO 6-tap, for HE SIFO 8/12-tap b) For LC DCT-IF 6-tap, for HE DCT-IF 12-tap c) DCT-IF 8-tap for both cases d) For LC DIF, for HE DCT-IF 8-tap 201 e) For LC SIFO 6-tap, for HE DCT-IF 12-tap For HE: DCT-IF 12-tap was reported to give the best quality – and its use was agreed. For LC: DIF Chroma interpolation will be further investigated in a CE At the close of the meeting, the decision on chroma interpolation remained somewhat unclear, and an email discussion was conducted to clarify the situation. Within approximately two weeks of the meeting, this discussion closed with an agreement to use bilinear 1/8th sample interpolation for the TM / WD chroma interpolation. More detailed remarks and notes are provided below to show some background for the above decisions: o The number of filter tap operations of 6-tap DIF vs. 6-tap separable is roughly 60% both for average and worst case. o The encoder complexity of SIFO appears higher (reported as about 30-70% computation time increase at encoder). In current optimization, all filter types are checked during encoding to determine the best filter for the next frame. o SIFO also requires a larger set of filters. o As a first conclusion, SIFO was removed from the list of candidates. o According to encoder/decoder computation times, DIF and DCT/IF 6tap appear quite close. o In the number of tap operations, DIF needs less (see above). o DIF also needs lower bit depth. o DCT-IF provides 1.8% BR reduction. o If IF takes 30% of decoder time, and DIF would reduce from 100 to 60, this would come to 0.6x30 approx. 15% of entire decoder time. Realistic may be 8%. o Agreement: use DIF – CE/AHG to further investigate. Bidirectional rounding control should be included. 15.2.1.1.1.1.1.1.8 JCTVC-C319 BoG report: residual quadtree structure [T. Wiegand, H. Schwarz, B. Bross (Fraunhofer HHI), A. Fuldseth (Cisco), X. Wang (Qualcomm), W.-J. Han (Samsung)] This document contained the report of the break-out group on residual quadtree structure. Some concern was expressed that the document did not contain sufficient informative description, and backgroun information was explained verbally during the discussion and was then added in a revision. Decision: Adopted. 15.3 Test model design: non-normative aspects and test methodology 15.3.1.1.1.1.1.1.1 JCTVC-C253 Encoder speedup for bidirectional averaging with rounding control [R. Panchal, M. Karczewicz, P. Chen (Qualcomm)] In the 2nd JCT-VC meeting, bidirectional averaging with rounding control (BiRound) was proposed and adopted in the TMuC. In this contribution further encoder optimizations were proposed leading to a reported 50% complexity reduction of the BiRound with no changes in the encoder bitstream. In TMuC 0.8, a new function with 'BiRound' parameter is always used for bidirectional motion estimation. To achieve further speedup, the new function is used only when 'BiRound' parameter is 1. Otherwise, the original function is used. 202 Results were provided with LD HE configuration. The old version as implemented in TMuC 0.7 showed an increase in encoding time by 8% and 1.1% BR reduction, a speedup which produces bitexact result is reported with 4% increase, and a sub-optimum speedup gives 0.9% reduction with no change. Conclusion: Decision: Adopted. Should be on when IBDI is off and vice versa. The fast bit-exact version shall be used in the encoder. 15.3.1.1.1.1.1.1.2 JCTVC-C287 Cross-check of encoder speedup for bidirectional averaging with rounding control [B. Bross (Fraunhofer HHI)] (missing prior, available first day) This contribution confirmed the bit rate results reported in JCTVC-C253, with the reported encoder speedup being slightly better. 15.3.1.1.1.1.1.1.3 JCTVC-C315 On the computation of PSNR and BD-Rate [D. Hoang (Zenverge)] (late reg.) This late contribution was not reviewed in group discussion due to lack of time. The contribution asserted that the computations of PSNR and BD-Rate by the current TMuC software and spreadsheet macros were flawed, and recommendations were made on how to improve these computations. 16 TMuC settings and common test conditions 16.1.1.1.1.1.1.1.1 JCTVC-C084 Coding efficiency report of modification by TMuC draft005 [T. Yamakage, T. Chujoh, T. Watanabe (Toshiba)] This contribution showed experimental results of the modification by TMuC draft005 that signals the ALF filtering control map in the slice header instead of signaling it in the CU header. Experimental results reportedly showed that the coding efficiency loss is 0.04% on average while the interleaving process of the ALF filtering control map at encoder side is removed. This was one of the features of "TMuC low priority" disabled by default (only implemented after TMuC 0.7) Several experts pointed out that this proposal simplifies the encoder (no buffering of encoded data). Decision: There was no objection by the group, and it was agreed to Adopt this proposal. One expert points out that there might be other ways to achieve this goal. 16.1.1.1.1.1.1.1.2 JCTVC-C243 Cross check result of JCTVC-C084 [I. S. Chong, M. Karczewicz (Qualcomm)] (missing prior, uploaded substantially after the meeting) This contribution was verbally presented prior to upload. It reportedly confirmed the results reported in JCTVC-C084. 16.1.1.1.1.1.1.1.3 JCTVC-C155 TMuC 0.7 results with sequence dependent QPs [K. Andersson, R. Sjöberg (Ericsson)] This contribution investigated performance when targeting CfP bit rates using sequence specific QPs rather than those specified in the common conditions (QP=22, 27, 32 and 37) for low delay and random access configurations. The sequence specific QPs was asserted to give very similar coding efficiency as for the common conditions and to give bit rates generally closer to application bit rates and thus to be better suited for subjective assessment. It was noted that the worst case encoding time was reduced by about 20% when using the proposed alternative values. The following remarks were recorded in discussion of this contribution: 203 Typically, the example QP settings suggested have a much lower range of operational points – would this really be desirable? Some cases go into the low-quality range particularly. No action was taken on this, as the current settings seem appropriate for investigation of the merit of single coding tools. 16.1.1.1.1.1.1.1.4 JCTVC-C168 Comments on common test conditions [K. Kondo, K. Sato, T. Suzuki (Sony)] This contribution proposed to change the test conditions as follows. To reduce the number of frames To increase the number of test sequences The common conditions test configurations were summarized in JCTVC-B300. Currently test sequences used by CfP are used and all frames must be encoded. Coding performance results with a smaller number of frames, e.g. 120, were reported to be not so different from the results of using all frames. Instead of reducing the number of total coded frames, it was proposed to test a larger variety of test sequences, e.g. VCEG test sequences, 4K sequence, 8K sequences, etc. To test a variety of test sequences was asserted to be more helpful to achieve an understanding of the the coding performance rather than encoding a larger number of frames in a smaller number sequences. Comments recorded during the discussion were as follows: Subjective tests should also be performed, such that long sequences are needed. Extending the test set to provide more variety appears useful, but needs careful study. Having long sequences is also an advantage, as it enforces to not use overly complex methods –with parallel processing, the length of the sequences is the main bottleneck anyway. For extreme resolutions, a shorter length may be considered. Work on fast motion estimation should be started. It was suggested to conduct further study in an AHG (chaired by T. Suzuki) 17 Application-specific topics 17.1.1.1.1.1.1.1.1 JCTVC-C021 Evaluation result of JCTVC-B031 [K. Kazui, J. Koyama, A. Nakagawa (Fujitsu)] A scheme which reportedly minimizes the loss of coding efficiency in very low delay coding was proposed at the 2nd JCT-VC meeting. This contribution shows evaluation result of the proposed scheme using TMuC software version 0.7.3. Coding gains from the conventional multiple slices scheme in BD-Bit rate were reported as 9.0% and 15.9% when the sizes of LCTB are 64x64 and 32x32, respectively. In addition, a new buffer model for a coding delay of less than one frame period was proposed in this contribution. Remarks recorded during the discussion included the following: A very high number of slices was used per frame – is this realistic? Certainly other solutions would be possible to achieve the same goal. Constant bit rate was assumed for each slice – is that realistic? It was suggested to contribute software to the work of AHG on slices. 204 Further study was encouraged. 17.1.1.1.1.1.1.1.2 JCTVC-C032 Characteristics of Super Hi-Vision test sequences [Y. Shishikui, Y. Matsuo, A. Ichigaya, K. Iguchi, S. Sakaida (NHK)] This contribution presented the characteristics of Super Hi-Vision (SHV) test sequences, which reportedly differ from those of the other test sequences. The analysis of test sequences indicated that SHV test sequences contain an extremely large amount of motion that affects spectrum characteristics of the signal to a large extent. SHV test signals also contain a larger noise component due to their high bandwidth, although this does not affect subjective picture quality as much as was expected by the objective amount. The Sequences were interpolated by Bayer filter to achieve a full 8Kx4K resolution, and spectral characteristics are different from the 2K and 4K sequences (high flat-spectrum noise component appears to be present below quarter of sampling frequency). 17.1.1.1.1.1.1.1.3 JCTVC-C055 Performance report of TMuC for Super Hi-Vision [K. Iguchi, A. Ichigaya, Y. Shishikui(NHK), S. Sekiguchi, A. Minezawa (Mitsubishi Electric)] This contribution reported the performance of TMuC 0.7.0 for Super Hi-Vision (SHV) sequences. While the TMuC achieved substantial coding gain relative to the AVC CfP anchor, it took from 3.1 to 6.1 times encoding time than JM encoding with CfP anchor condition. The following remarks and observations were recorded: There was a suggestion that cropped sequences could be used that would give roughly same results with much lower simulation time. BD bit rate improvement is approx. 60% for Steam Locomotive, 40% for Nebuta PSNR curves show larger gap (higher gain) towards higher bit rates: Does the TMuC loop filter improve prediction under noisy conditions? Bit rates ranged from 500 Mbps to 1.5 Gbps, PSNR 30-37 dB 10 bit to 8 bit conversion was done by rounding True 8K sequences (not upsampled) would be desirable to judge without influence of the upsampling filter. 17.1.1.1.1.1.1.1.4 JCTVC-C276 Screen content coding results using TMuC [C. Lan, J. Xu, F. Wu (Microsoft), G. Shi (Xidian Univ.)] New coding tools had been presented for coding screen contents using an AVC framework in JCTVC-B084. This document presented new coding results using the TMuC, for intra coding and for random-access cases. The following remarks and observations were recorded during the discussion: Recorded at 10 fps BD rate saving around 20% for random access inter, 28% for all intra. More information about the characteristics of these sequences was requested – e.g., How often do frame changes occur, is there subpixel motion? What would be typical test material? This may be the most difficult issue, as this has high influence on potential selection of tools. As this is a new application area, the importance should be reviewed by the parent bodies. Video (particularly news) includes lot of this type of content nowadays. 205 18 Non-TE Technical Contributions 18.1 Loop filtering JCTVC-C085 Comparison of loop & post filtering for in-loop and post-processing filtering AHG [T. Yamakage, T. Chujoh, T. Watanabe (Toshiba)] This contribution showed experimental results of comparison between in-loop filtering and post filtering. This was a response to one of the mandates of the In-loop and post-processing filtering AHG. Coding efficiency of several filtering schemes was compared for in-loop or post filtering and discussed. Remarks recorded included the following: The current QC_ALF of the TMuC performs 4% better than a similar stand-alone post-filter The method from AVC (SEI) AAPF performs 2.5% worse than QC_ALF (an amount which slightly decreases when operated in the loop) Results also reportedly show that a filter must be explicitly designed as a loop filter or post filter JCTVC-C113 Adaptive loop filter with low encoder complexity [I.S. Chong, W. Chien, M. Karczewicz (Qualcomm)] In this contribution, low complexity versions of the Adaptive Loop Filter (ALF) with at most two encoding passes were proposed. Simulation results for one-pass and two-pass ALF are presented to demonstrate that one-pass and two-pass methods significantly lower the complexity of the original ALF with small performance degradation (less than 0.1% for two-pass and less than 0.7% for onepass). The one-pass method derives the map by using a filter from the previous frame and then optimizing coefficients for the current frame based on regions where the filter is switched on. A two-pass method with one more iteration was described A post filter from the VCEG KTA software was also compared (losing 0.9% BR relative to the anchor) The method is very similar to the one proposed in JCTVC-C082 JCTVC-C195 An enhanced block-based adaptive loop filter in TMuC0.7.0 platform [P. Wu, S. Paschalakis, N. Sprljan (Mitsubishi Electric)] In this contribution an Enhanced Block-based Adaptive Loop Filter (EBALF) algorithm has been implemented and integrated into TMuC0.7.0 platform as a complete functional block for in-loop filtering processing. Without altering the current default adaptive loop filter technology, which is contributed by Qualcomm as QC_ALF in the TMuC software, this EBALF functional block can be placed before or after QC_ALF, resulting in an in-loop filtering composed of two stages. The simulation results reportedly confirmed that with QC_ALF "on" performance as the reference point, the joint filtering can potentially further improve the coding efficiency by up to 1.5% BD rate reduction in some cases. For Class A tests, the average BD rate reduction were 0.7% and 0.6% in both test cases. This could reportedly imply that performance of QC_ALF can be improved further, particularly for high resolution sequences. The simulation results cover encoder_randomaccess and encoder_lowdelay configuration settings with full length and all classes of the sequences as defined in the Tool Experiment 10 subtest 2 category. The Excel spreadsheet results were provided. This contribution was reportedly not intended to propose an alternative technology to replace QC_ALF 206 in TMuC but rather to be aiming to assist the current general effort on evaluating the existing TMuC video coding tools, such as that in Tool Experiment 12 (TE12) and 10 (TE10). The results reported independently in this document regarding the adaptive loop filtering technology can be treated as additional information to all other related test results in TE12 and TE10. It was remarked that a gain (0.6-0.7% BR reduction) was achieved only for class A and B partially, and that on average there was practically no gain. Decoder runtime increases by 15-20%. No action was taken in response. JCTVC-C211 Inner block oriented ALF processing for memory & bandwidth reduction [Y. Sohn, K.-H. Lee, B.-K. Lee, I.-K. Kim (Samsung)] This proposal described an adaptive loop filter that was asserted to have low memory, bandwidth and computational complexity requirements. ALF in TMuC requires spatial neighboring reconstructed data for CU based ALF processing. This document proposes applying filtering only to a restricted inner LCU area where there is no need to use data outside of the LCU for filtering. This approach was reported to result in marginal performance degradation with memory, bandwidth and computational complexity reduction. The experimental results reportedly showed 1.0% performance degradation in low delay configuration and 0.8% performance degradation in random access configuration, respectively. In expense of this performance degradation, it was asserted to reduce about 24 Kbytes external DDR memory and 0.77 Kbytes internal SRAM, and to reduce memory bandwidth about 5.98 MHz in the case of 4Kx2K@30p video sequence and 64 bit BUS. It was asserted to also reduce computational complexity for ALF processing itself. Concern was expressed that for two adjacent filtered blocks, boundary effects may occur. This would need to be a normative decoder issue (making it switchable would not help to reduce maximum decoder complexity). Most experts believed that not every application case of loop filtering would benefit from that. A suggestion by the chair was that if limitation of memory accesses is desired, it might be better to limit the number of filtered blocks in a profile. JCTVC-C219 In-loop reference frame denoising [P. Amon, A. Hutter, E. Wige, A. Kaup (Siemens)] This proposal presented an algorithm for in-loop denoising of the reference frame. The algorithm modifies the temporal predictor while the decoded picture is unchanged. Knowledge of the noise power within the reference frame is used in order to affect the inter frame prediction. For noise filtering of the reference frame, a denoising algorithm is implemented inside the AVC reference software JM 15.1. It was reported that the bit rate can be decreased for (high resolution) noisy image sequences especially for higher qualities at medium to high data-rates. The following remarks were recorded: The suggestion was to take into account additive noise in the design of the filter coefficients. The implementation was in the AVC context. The algorithm currently does not work on intra. It is not necessary to transmit filter parameters; as these are estimated at the decoder. The reported improvement was mostly at high bit rates, and may be hardly visible. The concept may eventually help for applications that require lossless or near-lossless coding (e.g. medical, studio). The algorithm does not remove quantization noise. It is more intended to separate content that is consistent over time from noise that is uncorrelated between frames and therefore 207 disturbs the prediction. At low rates, it was suggested that this would be removed anyway by the quantization. 18.2 Further study was suggested. Block structures and partitioning JCTVC-C025 CTB splitting on frame boundary for arbitrary resolution video [F. Yang, W. Li, S. Wan, Y. Chang (Xi'dian Univ.)] The conception of CTB has been proposed. But the modes of the CTB on the frame boundary may be limited when the video resolution is not an integral multiple of the LCTB. A method of CTB splitting on the frame boundary for arbitrary resolution video was proposed in this document, where the non-square CTB and a new representation of the CTB mode reportedly significantly contribute to the coding efficiency. The proposed method was asserted to be compatible with the current TmuC, with no extra syntax needed. Remarks recorded during discussion were: Largest gain comes from small image sizes, for classes other than D typically around 0.1 percent, but would require considerable change of CB design. Adds irregularity to the CTB, which makes it necessary to augment the parsing of the tree by additional conditions that depend on frame size. No action taken. JCTVC-C026 Proposal of frame parameters FLCTB and FSCTB for video content fitting [F. Yang, W. Li, S. Wan , Y. Chang (Xi'dian Univ.)] The size of CTBs in a frame in the TmuC is limited by the parameters of LCTB and SCTB, which are set on the sequence level. However, the optimal CTB splitting mode for a frame depends on the frame content, the frame type, and the quantization parameter, which indicates that different LCTB and SCTB should be used for different frames. Therefore, two frame parameters of FLCTB and FSCTB were proposed to adapt CTB splitting to the frame content. Remarks recorded during discussion were: No results available yet There is some doubt that this provides gain. No action taken. JCTVC-C154 Fine granularity slices [R. Sjöberg, P. Wennersten (Ericsson)] In order to increase the granularity of possible slice boundary positions this contribution proposed to change the definition of slices from being a sequence of largest coding tree blocks (LCTB:s) to a sequence of the smallest coding tree blocks (SCTBs) in order to increase the control of the slice sizes in terms of bytes. This contribution also contained a proposal to replace UVLC for signaling of slice start positions with a fixed length code (FLC) with a one-bit shortcut for indicating the first block in a picture since this address is present in all pictures. Remarks were recorded in discussion as follows: Applications of slice-wise processing are parallelization for UHD (where the current design is not a problem) and packetization in videoconferencing. In terms of coding efficiency, the main disadvantage of slice-wise processing is the breaking of prediction chain and entropy coder adaptation. The main argument for change of CTB signaling would be a more clean design. 208 Conclusions: o Regarding the slice processing, defining slices as sequence of largest CU give a disadvantage in regard to flexibility. o An estimate of solving the problem by restricting to 32x32 slices (taken from TE12) points to 5% BR increase. o A suggested first step would be to start slice implementation (suggested for an AHG chaired by Rickard Sjöberg) with the current CU structure, and only after that further more concrete evidence can be collected Another late "Proposal 3" was presented in the presentation slides (not in the contribution document). Slice implementation would appear to be more simple if the design would return to JCTVC-A205 draft 2 (min_coding_unit_size and hierarchy_depth); In fact it was suggested that the current syntax seems to be broken as it does not support rectangular CU (only square). It was suggested that this other proposal should be registered as a new document (first discussed with the TMuC editors). 18.2.1.1.1.1.1.1.1 JCTVC-C318 TMuC text on max CU size seems broken [R. Sjöberg (Ericsson)] (late reg.) The issue reported in JCTVC-C318 was partially presented in the slide deck of JCTVC-C154, where it was called "proposal 4". The slide deck did not give very much information about the issue; and it was not in the word document and seemed to be somewhat independent, so it was suggested to register this as a separate input document. This document arrived quite late, and there was insufficient time for this late contribution to be reviewed. It may be beneficial for the contribution to be studied by those with detailed knowledge about the tree implementation. The subject matter of the document at the close of the meeting may (or may not) be addressed by the breakout activity report in JCTVC-C319. JCTVC-C167 Proposal on large block structure and quantization [K. Sato (Sony)] For common test conditions in video codec standardization, the value of quantization parameter is usually fixed within a picture, or a sophisticated scheme like RDOQ is sometimes employed for coding efficiency improvement. However in the real applications, a QP-changing scheme with macroblock-by-macroblock variation of QP is usually employed for visual quality improvement (and rate buffer control). Adaptive quantization as defined in MPEG-2 Test Model was suggested to be a very old technique but still one of the effective techniques for subjective quality optimization. It determines QP according to the activity calculated from the values of original samples within a macroblock. The basic concept is that human eyes are sensitive to degradation in flat areas, but insensitive to busy areas, so smaller QP is allocated for the former areas and bigger QPs are allocated for the latter areas. This was suggested to work effectively, especially when the size of the image is large (like HD). One of the key applications of HEVC will be beyond-HD image coding, so techniques like adaptive quantization seem likely to also be useful for HEVC. The HEVC TMuC employs the concept of a coding unit (CU). The largest coding unit (LCU) corresponds to macroblock in the existing video coding standard like MPEG-2 or AVC. The main difference is that the size of the latter is fixed to 16x16 pixels while the one of the former can be up to 128x128. In the existing video coding standards, QP can be set at the macroblock-level. However, as the size of LCU becomes large, it may contain both flat areas and busy areas, and if QP can only be set at LCU-level, such adaptive quantization may become more difficult. Therefore it was proposed to define the CU syntax so that QP can be changed not only with LCU level but with all CU levels. This modification can enable QP changing for subjective quality and was asserted to be useful for real applications. Remarks recorded during the discussion included the following: 209 More evidence would be necessary (e.g. demonstrating that the switching of QP at 16x16 blocks gives an advantage over switching at 64x64) What would be the bit rate increase if no switching is used? At least it would be necessary to have a syntax element e.g., at the picture or slice level to disable the encoding of cu_qp_delta. Further study was suggested. JCTVC-C224 Frame coding in vertical raster scan order [D.-K. Kwon, H. Kim (TI)] In HEVC, the CUs (Coding Units) in a frame are encoded in a horizontal-first raster scan order. It was reported that the frame coding in this raster scan order is not always effective for intra prediction and requires large on-chip memory to cover large vertical motion for motion estimation (ME) and motion compensation (MC), which is very challenging especially for UHD (ultra high definition) video such as 4k x 2k and 8k x 4k. In this contribution, it was proposed to support frame coding in a vertical-first raster scan order along with the horizontal-first raster scan order. Then the best coding scan order could be selected adaptively for each frame based on coding efficiency, horizontal/vertical motion, on-chip memory saving, etc. Remarks recorded during the discussion included the following: Changing order (by rotating 90%) improves performance for some sequences by up to 3% , but nothing on average (i.e. loss for others). This introduces a 1 frame delay How would an encoder decide this? No action taken. JCTVC-C260 Flexible picture partitioning [K. Panusopone, X. Fang, L. Wang (Motorola)] This document proposed use of a 2D coordinate system to partition a picture into largest coding tree units (LCTU) in a more flexible manner. The 2D coordinate system plane is divided into square blocks by vertical and horizontal lines. The block size is the same as the selected LTCU size. The top-left corner of a picture may be aligned with the coordinate system origin, or shifted away from the coordinate origin by an offset. The picture partition follows the square blocks in the coordinate systems. Remarks recorded during the discussion were as follows: Alignment of CU grid by default in top-left; Corner mode allows one of 4 corners; explicit mode allows positioning of CU block grid with any offset Results were reported with 30 frame length sequences, coded as IPPP (not common conditions), with adaptations for each frame; and a gain was reported of 0.63% on average, the largest gain being in class E. These results were achieved by running 4 encodings with the 4 corner options and selecting the best. It was suggested that the 400% encoder complexity increase would certainly not be justified by the small gain. No action taken. JCTVC-C275 Improve intra frame coding by PU/TU reordering [X. Peng, J. Xu, F. Wu (Microsoft)] This document presented an adaptive PU/TU reordering scheme for prediction in TMuC intra-frame coding. Within a PU, the coding order of TUs was rearranged to achieve better prediction for the 210 inner TUs. Within a CU, a different coding order of PU can be selected also to reportedly obtain a better prediction. Remarks included: - The ordering is dependent on prediction direction. - An increase in encoder runtime roughly to 170% was reported. - The gain in bit rate for HE Intra was roughly 0.4% - It is not clear how much of the gain comes from TU or PU re-ordering. - No action taken. JCTVC-C283 Region of block based dynamic video processing for HEVC [A. Paul (Hanyang Univ. / NCKU)] A region based adaptive video processing scheme based on SAD of macro blocks was proposed in this proposal. The average SAD of all the blocks in this region is calculated for computing a threshold value and thereafter this threshold value is used to adaptively select the search range. The proposed algorithm reportedly achieves 60% time reduction with negligible PSNR loss. There was difficulty arranging presentation of this contribution and no action was taken on it. JCTVC-C306 Investigations for representing rectangular blocks using the merging concept [H. Schwarz, D. Marpe , T. Wiegand] (late reg.) This document investigated the impact on coding efficiency when the symmetric rectangular block modes (2NxN and Nx2N) were removed from the syntax and replaced by corresponding merge flags. For the same encoder complexity (given an algorithmic comparison and verified by the measurement of encoding times), an average increase in BD rate of 0.13% was reported for a removal of the symmetric rectangular blocks. A variation with lower encoding complexity (the average encoding time is 78% of that for the reference) reportedly yielded an increase in BD rate of 0.8%. If the encoder complexity is increased in comparison to the reference (the measured average encoding time is 144% of that for the reference), an average decrease in BD rate of 0.06% was reported. The following remarks were recorded during the discussion: This is one example about redundancy in the block structure (i.e. the same structure can be expressed in different ways). The current implementation of doing a full search on rectangular shapes may not be the best approach. It was suggested to establish an AHG to study the block structures (chairs WJ Han, TK Tan, T Wiegand) This should include intra & inter. JCTVC-C311 Fast intra encoding for fixed maximum depth of transform quadtree [B. Bross, H. Kirchhoffer , H. Schwarz , T. Wiegand (Fraunhofer HHI)] (late reg.) This document described an encoder-side complexity reduction for intra encoding. The presented fast encoding method produces bitstreams that are conforming to the specification while reportedly achieving a similar encoder complexity as for the transform quadtree disabled case. Two fast modes had been implemented: In speedup mode 1, the different intra prediction modes are tested only on the maximum allowed transform block size. The best performing mode is tested with the full transform tree partitioning and the best performing partitioning is chosen. Speedup mode 2 does basically the same except testing the full transform tree partitioning with the two best performing intra modes instead of the best one. 211 Results were provided only for 50 frames of each sequence. Mode 1: Enc. Time 101%, BR reduction 0.3%. Mode 2: Enc. Time 120%, BR reduction 0.8%. It was indicated that a new version with more additional results would be made available which shows also results with tree depth=2 which reports 0.1% and 0.3% BD BR gains for modes 1 and 2. In addition, results were reported on disabling MDDT, ROT, edge pred and smoothing, with a degradation of 0.5% for mode 1 (96%/111% encoder for HE and LC), mode 2: 0.8%/0.2% for HE/LC with 116%/132% encoder time. JCTVC-C312 Cross-verification of JCTVC-C311: Fast intra encoding for fixed maximum depth of transform quadtree [W.-J. Han (Samsung)] (late reg.) This late contribution reportedly confirmed the results of the original quadtree approach with 3 levels and the scheme described in JCTVC-C311, with computation times that differed slightly. 18.2.1.1.1.1.1.1.2 Discussion on JCTVC-C311/JCTVC-C312 The following remarks were recorded during the discussion of JCTVC-C311 and JCTVC-C312: Concern was expressed that for LC conf this may not give advantage. It was suggested to enable RQT only for HE mode, or to restrict it to a different depth for LC (possibility would be to restrict in LC depth=1 for intra, depth=2 for inter; depth=3 for both intra and inter in HE case). Note: depth=1 is said to be the same as RQT off for intra – this should be further verified. Note2: Using a different depth for inter and intra is currently not supported It was mentioned that the configurability that RQT provides has implications on other tools’ performance. Therefore it would be important to have it in the TM. It could be studied in the context of the AHG on block structures + CE Definitely this needs to be further study in the AHG on block structures It was said that RQT on/off has implications on the choice of block sizes and modes for intra prediction (signaling is different and therefore other modes may be selected). From the software perspective, it would be undesirable to use the previous on/off version as these are completely independent paths A breakout group was formed to discuss the subject further, resulting in the BoG report JCTVC-C319 as discussed elsewhere in this report. 18.3 Motion compensation and interpolation filters JCTVC-C126 Chroma interpolation filtering using high precision filter [D. Y. Kim, J. P. Kim, Y. L. Lee (Sejong University), J. Lim, J. Song (SKT)] In the TMuC, chroma interpolation is performed with the bilinear interpolation to obtain the 1/8pixel vectors in the same way as in AVC. In this contribution, a chroma interpolation method using the 6-tap AVC luma MC interpolation filter with high precision was proposed. Experimental results reportedly show that the average BD-rate improvements on chroma U and V components are 12.2% and 13.4%, respectively, in the random access, low complexity case, and the average BD-rate improvements on U and V components were reported as 4.1% and 5.0%, respectively, in the random access, high efficiency case. The average BD-rate improvements on U and V components was reported as 12.0% and 14.9%, respectively, in the low delay, low complexity case, and the average BD-rate improvements on U and V components was reported as 2.0% and 2.4%, 212 respectively, in the low delay, high efficiency case. The average BD-rate improvements on Luma were reported as 1.3% and 0.4%, respectively, in low complexity and high efficiency of the low delay case, and the average BD-rate improvements on Luma were reported as 0.2% and 0.4%, respectively, in low complexity and high efficiency of the random access case. Remarks recorded during the review included: The results are similar to what has been reported before in TE12 and other TEs. Establishing a CE on luma and chroma subpel interpolation was recommended It is still unclear what the real benefit in terms of overall BR gain is. Complexity is certainly increased compared to bilinear interpolation. Subjective quality may also be an issue in this context. JCTVC-C137 Decoder-side block boundary decision (DBBD) with OBMC [S. Fukushima (JVC)] This proposal focused on the tool "Decoder-side Block Boundary Decision (DBBD)" which was proposed in JCTVC-A108 as one of tools for HEVC CfP response. DBBD aims to perform motion compensation with various block patterns without additional coding bits by deciding the MC boundary on the decoder-side. In this proposal, DBBD with OBMC was proposed for improving the quality of MC block in addition to the previous proposal. The simulation results reportedly show that the proposed technique provides 3.1% BD bit rate gain for random access, and 2.2% for low delay as against JM16.2 anchor under HEVC common test conditions. Remarks included the following: Only used in 16x8 and 8x16 modes Encoding time increased to roughly 400-500% (Currently ME is performed for every possible boundary position). Decoder time increased roughly by 10% AMP does something similar (with signaling), reducing the bit rate by roughly 2% but only increasing encoding time by 33%, with negligible decoder complexity impact. No action taken. JCTVC-C183 Enhanced switching of interpolation filter for HEVC [T. Yoshino, S. Naito, S. Sakazawa (KDDI)] Adaptive Interpolation Filter (AIF) technology (e.g., as in prior KTA work) has been proposed to improve the interpolation filter performance for fractional-pel motion compensation. However, this scheme requires high computational cost for calculating the filter coefficients on the encoder side. This contribution proposed an Enhanced Switch Interpolation Filter (ESIF) approach that predefines several interpolation filter coefficients in order to reduce the computational cost for calculating the coefficients. This scheme allows updating the number of filter coefficient sets and the filter coefficients slice by slice. The experimental results reportedly showed that the BD-bit rate against AVC reached 1.9% under CS2 condition. Remarks recorded during the discussion included the following: The Powerpoint presentation did not match with the contribution (a set of 8 filters was proposed instead of 4). The current scheme has 500% coder complexity. The "Low complexity scheme" with 130% coder complexity for filter decision gains 0.30.4% on average. 213 No action taken. JCTVC-C204 Bi-directional optical flow [A. Alshin, E. Alshina (Samsung)] (missing prior, available first day) A modified B-slice prediction was proposed. By combining the optical flow concept and high accuracy gradients evaluation this proposed technique allows sample-wise refinement of motion. This approach does not require any signaling about motion vector refinement to the decoder. Sample-wise refinement is performed by solving optical flow equations between two reference images. For gradient calculation, DCT-based interpolation is used. Uses an anchor with both MRG and skip/direct and DCT-IF12 used for comparison The reported gain was roughly 2%, with decoder runtime roughly 150% It was suggested to include this in an experiment on decoder-side estimation JCTVC-C215 Adaptive MV resolution with directional filters [K. Ugur, J. Lainema (Nokia)] This contribution presented an adaptive motion vector resolution concept using 1D directional filters. The goal of this contribution was to demonstrate the possible coding efficiency gains in interpolation with low complexity. It also presented a design of adaptive motion vector resolution in TMuC, with directional filters. Remarks recorded included the following: Saves computations at decoder through additional 8-pel positions (not a dense grid of all possible positions) which allows more 1D interpolations. Encoding time increased to 124% (due to additional search), decoding time decreased to 80% (although the time measurement results may not be reliable). Bit rate reduction was reported at 1.3% on average; where the gain comes due to additional 8th pel positions; and the gain was mostly in class D. Bilinear interpolation is used for chroma, whereas the LC version of TMuC may have a rounding problem; and it was suggested that this could explain part of the gain. It was suggested to study this further in the context of AMVRES and MC interpolation. JCTVC-C251 Overlapped block motion compensation in TMuC [P. Chen, R. Panchal, W.-J. Chien, M. Karczewicz (Qualcomm)] In this contribution, some results of overlapped block motion compensation (OBMC) in geometry partition coding were reported. The use of OBMC to asymmetric motion partitions was also proposed, with simulation results in support being presented. Remarks included the following: Is combined with geometric partitioning Geometric partitioning reportedly gives 3.9% BR reduction for RA, 3.6% for LD; and 0.8% of this is due to OBMC. If combined with AMP, the gain of OBMC is 0.4% and 0.3% for RA & LD, respectively. The increase of complexity at the decoder was asserted to be marginal. The presentation did not match with the available document, and it was indicated that a new version would be uploaded. 214 JCTVC-C301 Geometry motion partition [P. Chen, L. Guo, M. Karczewicz (Qualcomm)] (late registration, missing prior, available first day) Geometry motion partition has been integrated into TMuC. The software implementation is based on the design proposed in JCTVC-A121. In the software, geometry motion partition is applied to CU size of 16x16 and 32x32. Geometry mode decision has two stages. First, fast geometry mode decision pre-selects 16 best modes based on SAD calculation. Then the best mode is selected based on full RD. Geometry modes are tested only if the best non-geometry mode is a non-skip mode. Remarks recorded included the following: For results, see above under JCTVC-C251 Encoder runtime is 200%; geom. Partition test only done after skip decision. It was suggested to include this in a core experiment on asymmetric & geometric partitioning (previously in TE3). Note: checking of the sample membership to the left or right partition may also be of concern for complexity; if masks are used for that purpose, it should be considered that the number is low. The presentation did not match with the available document, and the contributor was requested to upload a new version. JCTVC-C303 Geometry adaptive block partitioning (GEO) cross-check [E. Francois, P. Bordes (Technicolor)] (late registration, missing prior, available first day) This cross-check contribution reported exactly the same results in BR reduction as JCTVC-C301 (3 class B sequences were missing). No checking of runtime was reported. JCTVC-C254 Test result of SIFO for sequences with illumination change [R. Panchal, M. Karczewicz (Qualcomm)] SIFO was asserted to be not just an interpolation filter but it also to have the capability to compensate for illumination changes. SIFO results were provided for some sequences which had illumination changes to highlight that for such sequences, the offset component of SIFO provides useful gains. The test was done using TMuC 0.8 in the low delay configuration. Remarks recorded included the following: The presentation did not match with the available document, and uploading of a new version was requested. It was asked why should interpolation and illumination compensation be combined? It was suggested that these are two different issues (the latter also applying to full-pel positions). The answer by the proponents was that it would be possible to do it separately. JCTVC-C295 Re-compensation based on partial coefficients [H. Zhu (Zhu)] (late registration, missing prior) The proposal reportedly provided a new idea for video coding. The algorithm divides the quantized DCT coefficients to two parts, using the first part to get a first reconstruction, and using the first reconstruction R0 and the compensation signals Cmp0 and Cmp1 to compute a new weight for the motion compensation. No experiment results were provided, and no action was taken in response. 215 18.4 Motion vector coding JCTVC-C208 Simulation results of motion partition, block merging and motion competition schemes [I.-K. Kim, T. Lee (Samsung)] In the current TMuC design, there are two approaches for frame partitioning: prediction unit (PU) splitting to make smaller motion partitions and block merging (MRG) to make larger motion partitions. In this proposal, anchor results are obtained by running TMuC-0.81 using reference configuration with disabling all motion partitions, motion competition and MRG. By enabling MRG on top of the anchor, improved BD bit rate is reported to be about 2.0% (random access), 0.2% (random access LC), 2.4% (low delay) and 0.0% (low delay LC). But higher BD bit rate savings can reportedly be obtained by enabling motion partition: 4.2% (random access), 5.8% (random access LC), 4.1% (low delay) and 5.1% (low delay LC). By enabling motion competition, improved BD bit rate savings are reported to be about 4.3% (random access), 5.6% (random access LC), 3.2% (low delay) and 5.2% (low delay LC). The best performance can reportedly be obtained by combining motion partitions and motion vector competition which shows 7.2% (random access), 9.2% (random access LC), 6.2% (low delay) and 8.3% (low delay LC) BD bit rate savings. The results of TE12 were reported to have been confirmed. JCTVC-C257 On motion vector competition [Y. Su, A. Segall (SHARP)] A method was proposed to improve the performance of motion vector competition in applications that may experience errors during transmission. The focus of the contribution was the dependency between parsing and co-located motion vectors that is incorporated into the motion vector competition design. This dependency results in coding gain; however, it also results in an inability to parse a bit-stream if a previous frame is lost. To overcome this problem, it was proposed to send the motion vector candidate as a flag, which signals if the selected predictor is the temporal collocated block, and subsequently send an index into the set of spatial predictors when the temporal collocated block is not selected. Remarks recorded in the consideration of the contribution were as follows: This was presented near the end of the meeting, after difficulty coordinating a prior presentation time The goal is reduction of the buffer size; the advantage of robustness under loss was not proven yet. In the worst case the buffer is around 33% of a picture buffer in size. This may require computation of a median of 16 values, which adds complexity. A loss around 0.1% in bit rate was reported. It was suggested to include further investigation of this in a CE 18.5 B picture reference list redundancy JCTVC-C278 Redundancy reduction in B-frame coding at temporal level zero [B. Li, J. Xu, G. J. Sullivan, F. Wu (Microsoft), H. Li (Univ. of Sci. &Tech. China)] In the current TMuC, the usage called GPB is widely used to replace a P slice with a B slice using two reference frame lists that contain the same pictures. As list 1 is identical to list 0, there is redundancy comparing using only list 0 with using only list 1 as references. This contribution suggested not to use list 1 prediction in this kind of B slice, restricting the available predictors to using bi-prediction and list 0 prediction. Compared to TMuC0.7 anchors, such a change reportedly leads to 0.1% bit rate savings for random-access cases and 0.9% bit-saving for low-delay cases, on average. 216 Encoding time was also decreased compared to the anchor (around 10%). See further notes regarding the JCTVC-C285. JCTVC-C285 Modified uni-directional inter prediction in generalized P and B pictures [W.-J. Chien, P. Chen, X. Wang, M. Karczewicz (Qualcomm)] This contribution proposed a modification of uni-directional inter-prediction in generalized P and B pictures (GPB). The modification restricts the reference list selection for uni-directional interprediction. Simulation results reportedly showed that 0.88% BD rate reduction can be achieved. The same method was proposed in JCTVC-C278. It was remarked that the encoding time decrease is no argument, as this is implementation specific At first it was suggested for an AHG to study this further and check the two methods, work out syntax and provide stable cross-checked software and results, and report about implications on the MMCO and RPLR Decision: After further study and discussion, it was agreed that this seems to be quite simple to implement without complications (as confirmed by SW coordinator), and the proposal was adopted. 18.6 Quantization control 18.6.1.1.1.1.1.1.1 JCTVC-C135 Flexible scaling of quantization parameter [D. Hoang (Zenverge)] The current Test Model under Consideration (TMuC) employs a quantization parameter (QP) scaling that is borrowed from the AVC standard. In AVC, the quantization step size increases by approximately 12.25% with each increment of QP, so that the quantization step size doubles when QP is incremented by 6. For the purpose of rate control, the contributor asserted that this 12.25% increment may be too coarse for certain applications, such as low-delay coding. This contribution proposed a specification that allows the granularity of the quantization parameter (QP) to be varied at the slice and picture levels. Backward-compatibility with the AVC approach is maintained at one of the granularity settings. In addition to varying the granularity, this contribution also proposed a variable QP offset that can be specified at the slice and picture levels. It was remarked that there have been prior efforts to address the coarse quantization step size increment of AVC. In JCTVC-A114, the quantization step size was proposed to double every 16 increments of QP. In AVC, the QP can vary from 0 to 51, inclusive (for 8 bit per sample coding). A full transition of QP from 0 to 51 reportedly translates to a 362-fold increase in the quantization step size. The contributor said that, in practice, only a small range of QP values are used when coding a scene at a given bit rate. It was also asserted that at scene changes, QP can change due to a change in scene complexity and then typically varies only slightly within a scene. It was remarked that the complexity of supporting the proposed inverse quantization process in the decoder may not be trivial. It was suggested that dynamic range increase may be needed to obtain the suitable precision for frequency-specific norm adjustment, and that necessary LUT sizes and perhaps some other memory increase may be needed for the decoder to support this. The relationship between luma and chroma quantization was also discussed. It was noted that the stability of the transform design may not be at a stage where such a proposal would be timely. Further study was encouraged. 18.7 Entropy coding 18.7.1 Coefficient scanning and context selection 18.7.1.1.1.1.1.1.1 JCTVC-C114 Zigzag scan for CABAC/PIPE [J. Lou, K. Panusopone, L. Wang (Motorola)] (missing prior, uploaded on first meeting day) 217 In TMuC0.7, the macro "HHI_TRANSFORM_CODING" enables a set of transform coefficient coding tools. It is switched on by default when the entropy coding option is CABAC/PIPE. The set of transform coefficient coding tools is based on JCTVC-A116. Among these tools, an adaptive scan is applied for significance map coding. The contributor reported that experimental results indicate that this adaptive scan scheme achieves only a negligible performance gain, but introduces additional memory and computational complexity as comparing to the traditional zigzag scan. The contributor proposed to use a conventional zigzag scan rather than the adaptive scan for significance map coding when the macro "HHI_TRANSFORM_CODING" is switched on, or CABAC/PIPE is selected. This topic was also discussed in the TE12 context – e.g., see section on JCTVC-C059. The compression impact was estimated by the contributor was estimated here and in JCTVC-C059 as being in the range of 0.0 to 0.3%, depending on coding condition and test sequence class. Further investigation of the topic was encouraged. 18.7.1.1.1.1.1.1.2 JCTVC-C227 Parallelization of HHI_TRANSFORM_CODING [V. Sze, M. Budagavi (TI)] See also JCTVC-C059 and JCTVC-C114. While various parallel bin encoding approaches have been proposed, context modeling remains a parallelism challenge in the entropy coding engine. HHI_TRANSFORM_CODING uses a highly adaptive context modeling approach for the coefficient significance map, where context selection depends on coefficients in neighboring positions. While it provides coding gains between 0.8% to 1.1%, it reportedly introduces significant dependencies which are difficult to parallelize when using zig-zag scan or adaptive scan order. In this document, the contributor proposed a "wavefront" diagonal scan order to enable parallel processing in conjunction with the use of the highly adaptive context selection for significance map. A simplification of the context selection was also proposed in the contribution. These modifications were reportedly implemented in TMuC-0.7.3 and the coding efficiency impact was reportedly evaluated. The modifications reportedly come at a cost of 0.2% to 0.4% coding loss. Despite this loss, the parallelized HHI_TRANSFORM_CODING with simplified context selection (reduce dependency by one or two scans) and wavefront scan (down left direction) was asserted to still provide gains between 0.9 to 1.4% when compared with not using HHI_TRANSFORM_CODING. The contributor proposed conducting a CE/TE for further study of the topic. 18.7.1.1.1.1.1.1.3 JCTVC-C205 Low-complexity adaptive coefficient scanning [V. Seregin, J. Chen (Samsung)] This contribution proposed a new scanning method for transform coefficients. It was proposed for the scanning mode for every Transform Unit (TU) to be defined explicitly based on RD estimation and a corresponding mode index signaled to the decoder. The proposed technique was tested relative to TMuC 0.7 and a comparison was made to the existing methods in TMuC 0.7. The performance was evaluated on 1 second duration sequences that were otherwise following based the common test conditions specified in JCTVC-B300 – the sequences were shortened to reduce the testing time. The proposed method reportedly provided 0.2-0.9% improvement for luma on average under HE conditions. In LC configurations the proposed method reportedly had similar performance as TMuC. This would increase encoding complexity, to determine the selected scan order to be applied. It was remarked that, instead of using syntax to indicate the scan order, the scan order could be inferred based on the intra prediction direction. Some proposals relating to this were also discussed at the meeting. The contributor proposed conduction a CE/TE for further study of the topic. 18.7.1.1.1.1.1.1.4 JCTVC-C250 Low complexity adaptive coefficient scanning [M. Coban, R. Joshi, M. Karczewicz (Qualcomm)] 218 An adaptive coefficient scanning process is used to capture coefficient statistics based on the intra prediction mode. This contribution proposed to introduce a partial sort at the coefficient level for determining the scan pattern. The proposed scheme reportedly reduces the complexity of the existing adaptive coefficient scan algorithm, with a negligible average BD bit rate penalty. If MDDT is disabled, the adaptive scan described in this contribution is not used. However, it could hypothetically be used without MDDT. The contributor indicated that without MDDT, approximately 0.6% improvement under HE all-intra conditions, and 3% improvement under LC all-intra conditions was observed in some preliminary testing with short video sequences. The gain discussed for the LC case included another effect discussed in another contribution JCTVC-C263. The proposed modification reportedly cuts the decoder run-time approximately in half. Some of the results presented were not in the original contribution. Some of the results had reportedly been cross-checked by RIM as reported in JCTVC-C307. 18.7.1.1.1.1.1.1.5 JCTVC-C307 Cross Verification of Low Complexity Adaptive Coefficient Scanning [J. Zan , J. Meng , M. Towhidul Islam , D. He] (late reg.) This contribution reported cross verification results on "low complexity adaptive coefficient scanning" as described in JCTVC-C250. Due to limited time, RIM had only tested 5 sequences from each class; and for each test sequence, two QP values (22 and 37) for the "intra" and "intraLC" modes. The test results were compared with those from Qualcomm: 1) for the tests that macro "FAST_ADAPTIVE_SCAN" were enabled, both results reportedly matched on BD bit rate, and PSNRs on all test points; 2) for the tests that macro"FAST_ADAPTIVE_SCAN" were disabled, mismatches were reported between the results and those from Qualcomm except for the Class D sequences, although the difference was very minor (<0.1%). On decoding time, for the "intra LC" mode, an average of 43% decoding time was recorded on all available test points, and the Qualcomm data was an average of 37% on all QPs and all sequences; for the "intra" mode, an average of 60% decoding time was recorded on all available test points, and the Qualcomm data was an average of 51% on all QPs and all sequences. It was noted that the tests could only be done on some selected sequences, with the lowest and highest QP values, due to time constraints. From the data collected on these test points, it was reportedly shown that the decoding time was greatly reduced. This contribution was submitted late during the meeting, with very limited opportunity for review. It was not presented or discussed in detail. 18.7.2 LCEC modification proposals 18.7.2.1.1.1.1.1.1 JCTVC-C263 Improvements on VLC [M. Karczewicz, W.-J. Chien, X. Wang (Qualcomm)] This contribution proposed changes relating to variable length coding (VLC) of transform coefficients, intra prediction modes, and inter modes. In this contribution, a modification was proposed to the current transform coefficient coding for intra blocks. Coding schemes were also proposed for coding the prediction mode of both intra and inter blocks respectively. With a different VLC table selection method and different symbol combination, coding performance of the prediction modes was also proposed to be modified. For LC all-intra, an average gain of 4.3% was reported, with MDDT disabled and ROT enabled. With ROT turned off, the contributor suggested that the gain would likely be similar. For LC random access, an average gain of 3.2% was reported. For LC low delay, an average gain of 2.4% was reported. The results had not been cross verified. Some complexity impact was reported due to the use of adaptive scanning. 219 The establishment of a core experiment on the subject was proposed (possibly split into different subject areas). The LCEC coding in the TMuC software includes the coding of a terminating bit that is unnecessary and not described in the corresponding text. We agreed that this is considered a bug, and should be fixed. This somewhat affects the reported results, as noted by the contributor. It was remarked that his has overlap with JCTVC-C185, and should be evaluated in the same CE/TE. 18.7.2.1.1.1.1.1.2 JCTVC-C210 Efficient coefficient coding method for large transform in VLC mode [S. Lee, M.-S. Cheon, I.-K. Kim (Samsung)] This contribution presented a coefficient coding method for large transforms in LCEC mode. Tandberg, Ericsson, and Nokia’s proposed coefficient coding method, which is reportedly currently implemented in the TMuC, was reportedly extended for the efficient coding of the coefficients from large transforms including 16x16, 32x32, and 64x64. Experimental results reportedly showed that the proposed method provides gain for random access and intra only configurations without a significant complexity increase. For LC all-intra, an average 1.6% improvement was reported. The results had not been cross verified. 18.7.2.1.1.1.1.1.3 JCTVC-C152 Context-adaptive hybrid variable length coding [Y. Xu, J. Li, W. H. Chen, D. Tian (Cisco Systems)] This contribution presented a context-adaptive hybrid variable length coding method for entropy coding. This method is based on the reported observation that the non-zero transform coefficients are more clustered in the low frequency region while being more scattered in the high frequency region. The code tables are adaptively chosen based on the context information from coded neighboring blocks or from the coded portion of the current block. The proposed algorithm was implemented and tested relative to JM 10.1 High Profile with fixed 8x8 transform size, and the simulation results were compared with CAVLC. Further work would be needed to test the proposal in the TM design context to determine its applicability to the new design. The contributor indicated that such results may be available by the time of the next meeting. 18.7.3 V2V coding 18.7.3.1.1.1.1.1.1 JCTVC-C279 Opportunistic parallel V2V decoding D. He, G. Korodi, E.-h. Yang, G. Martin-Cocher (RIM) A buffer-based technique that distributes codewords produced by V2V codes into interleaved fixedlength phrases was proposed. On the decoder side, since subsequent phrases can be accessed independently without waiting for the decoding of the current phrase to complete, an "opportunistic parallel decoding" can reportedly be achieved. The proposal described the concept of this modified scheme. Some padding bits are needed at the end of the bitstream when applying this scheme. However, the scheme avoids the need for the pointers that are otherwise needed for identifying the location of the PIPE streams, and also avoids the bit-wise interleaving of the PIPE data. A participant asked what length might be recommended for the "phrases", and the contributor suggested such examples as 16, 32, etc. (multiples of 8 bits). It was suggested that some measurements of figures of merit for the technique should be presented. Further study was encouraged. 220 18.7.4 Modified CABAC coding 18.7.4.1.1.1.1.1.1 JCTVC-C296 Decoding improvement on the PA-Coder [H. Zhu (Zhu)] (late registration, missing prior, available first day) The contribution proposed a fast channel for MPS decoding of the PA-coder previously proposed in JCTVC-A027. Once a new bit is decoded, a threshold value is computed using a table lookup in the previous design. The threshold value is compared with the current value to decide the MPS or LPS. The PA arithmetic coder was described as having reduced computational resource requirements relative to CABAC and PIPE, with slightly less compression capability. The PA coder had been integrated into the JM, and the contributor indicated that integration of the coder with the TMuC software was reportedly expected to be completed soon. No action was taken in response to the contribution. 18.7.4.1.1.1.1.1.2 JCTVC-C300 High-efficiency entropy coding simplifications [V. Sze , M. Budagavi (TI)] (late registration, missing prior) PIPE was described as a bin-level tool aimed to improve throughput by using 12 bin encoders in parallel. Binary symbols (bins) are assigned to each of the bin encoders depending on their probabilities; with the bins of different probabilities being processed in parallel. The contributor asserted that there is a large complexity (i.e. area cost) associated with the PIPE implementation due to the use of V2V (variable length codes to variable length codes) for the bin encoder, with an estimated area over 5x larger than CABAC. In this document, a multi-CABAC approach was proposed using a simplified binary arithmetic coding engine with quantized rLPS tables for the bin encoder to reduce complexity. The resulting area savings was estimated to be over 7x. The parallelism across bin encoders is still maintained, thus providing improved throughput at the bin level. The coding efficiency of TMuC-0.7.3 with the modified multi-CABAC and quantized rLPS tables was reported to have a 0.0-0.1% coding gain over PIPE. The contributor indicated that the buffering associated with the probability interval partitioning remains a challenge in the proposed design. A participant commented that CABAC power consumption could be an issue in such a design. The fact that the probability estimate is static for each BAC decoder would substantially simplify the BAC engine operation. The compatibility of the scheme in regard to the "load balancing" scheme found in the TMuC would need to be studied. It was asked how to properly measure area, power, throughput, the difficulty of implementation in software, and, when comparing to LCEC, also coding efficiency. Further study was encouraged. 18.7.4.1.1.1.1.1.3 JCTVC-C304 Showing the possibility of fast CABAC [H.-J. Kim , X. Qu , W.J. Han] (late registration, missing prior, provided later during the meeting) This late document was initially uploaded as an empty file. It was remarked that it had a confusing IPR statement, and it was indicated that a new version would be provided. Due to lateness, there was limited opportunity for its review. It described a proposed entropy coding method that was asserted to have lower complexity than CABAC. Results were reported on some test sequences. A 2-6% bit rate increase was reported along with a reported complexity reduction by 30-35%. No action was taken in response. 18.7.5 Entropy slices 18.7.5.1.1.1.1.1.1 JCTVC-C256 New results for entropy slices for highly parallel coding [K. Misra, J. Zhao, A. Segall (SHARP)] The concept of an Entropy Slice was previously proposed for the HEVC design. Entropy slices enable separate definitions for neighborhood in the entropy decoding and reconstruction loops. 221 Motivations for entropy slices were described. As a first benefit, it was asserted that the system enables parallel decoding with negligible overhead. This includes both the context adaptation and bin coding stages, and it is compatible with all of the entropy coding engines currently in the TMuC. As a second benefit, it was asserted that the degree of parallelization is flexible – an encoder may generate a bitstream that supports a wide range of decoder parallelization factors without knowledge of the target platform. As a third benefit, it was asserted that the entropy slice concept enables more meaningful profile and level definitions in terms of the impact on entropy coding. Specifically, profiles/levels can include limits that are relevant to the operation of the entropy coding component. This was asserted to be useful for all applications, but with a significant benefit to emerging, higher rate and higher resolution scenarios. The document provided some experiment results for entropy slices. Software is also provided in the contribution. The proposal discussed the ability to initialize the state of the entropy coder either using syntax (e.g., cabac_init_idc) or using the end state from some other entropy slice. It was remarked that storing the end state from a preceding slice has a memory impact. "Wavefront" processing operation had been suggested – and a straightforward method of providing this capability was described. The basic concept of desiring enhanced high-level parallelism of the entropy coding stage to be in the HEVC design is agreed (as recorded in the Geneva meeting report). It was remarked that the need to store the parsing results prior to completing some of the decoding process for the picture will have an impact on memory and memory bandwidth, and that this could be a serious concern. The overhead bits for indicating the location of the wavefront parallelism entry points were not accounted for in the coding efficiency experiment results. It was remarked that, for coded frames that are relatively small, this overhead could be a substantial percentage of the data. We can test coding efficiency impact, including overhead bits, and including comparison to regular slices. Further study was encouraged (AHG or CE). 18.8 Intra prediction 18.8.1.1.1.1.1.1.1 JCTVC-C040 Chroma intra prediction using reconstructed luma [C. Yeo, Y. H. Tan, Z. Li, S. Rahardja (I2R)] Intra coding in the current TMuC achieves high compression efficiency, in part due to the intra prediction process that exploits spatial directional correlation. However, intra prediction of chroma components in YUV 4:2:0 video uses a limited set of possible predictions available for coding of luma components. Also, the coding of chroma components proceeds somewhat independently of luma components, and ignores any possible correlation between them. This contribution proposed a way of using reconstructed luma pixels to help with intra prediction of chroma pixels. Specifically, it uses the regions with similar luma texture characteristics to identify a location for prediction of chroma components. By making use of the reconstructed co-located luma block to perform template matching in the luma plane, co-located chroma blocks of the matched luma blocks are identified for us as predictors. Simulations results reportedly indicate that, on average, the proposed approach is able to obtain 2.3% chroma bit-rate reduction and 0.5% luma bit-rate reduction relative to the anchor for all intra coding. An approximate 2x encoding time and 3x decoding time impact was reported for intra. It was remarked that something like this that links together luma and chroma operations would need to provide a very good gain to be a desirable feature. The complexity seemed rather high, but further study was encouraged. 18.8.1.1.1.1.1.1.2 JCTVC-C206 Chroma intra prediction by scaled luma samples using integer operations [J. Chen, V. Seregin (Samsung)] 222 This contribution proposed an integerization of the algorithm presented in JCTVC-B021 in which chroma samples are predicted from colocated reconstructed luma samples by a linear model relationship. Only addition and multiplication operations are involved to calculate the model parameter instead of using divisions as in the proposed approach. Experiments reportedly show that the average BD-rate reduction of Y, Cb, Cr components are 1.3%, 6.8% and 5.6% for intra coding conditions and 0.7%, 6.7% and 4.6% for random access conditions. The proponent asserted that the complexity of the proposed method is similar or even smaller than that of the method used in AVC. The values of the parameters of the linear prediction model are derived at the decoder side from neighbouring reconstructed samples. Reportedly the late document JCTVC-C314 contained a cross-verification. Further study was encouraged. 18.8.1.1.1.1.1.1.3 JCTVC-C314 Verification results of Samsung's proposal JCTVC-C206 [J. Kim , J. Park , B. Jeon] (late reg.) The contents of this late document were verbally reported during the presentation of JCTVC-C206. The document was not reviewed in detail. The PSNR and bit-rate results reported in this document were reported to be the same as the results reported in the document JCTVC-C206. 18.8.1.1.1.1.1.1.4 JCTVC-C111 Integration of plane mode in unified intra prediction [G. Li (Santa Clara Univ.), L. Liu, N. Ling (Santa Clara Univ.), J. Zheng, P. Zhang (Hisilicon)] The current default intra prediction method in the TMuC, Unified Intra Prediction, reportedly has up to 33 directional modes and 1 non-directional mode, which is the intra DC mode. This contribution proposed adding a plane mode and using an "multiple predictor set" intra mode prediction. Experimental results reportedly showed that coding efficiency can be improved with the proposed method. Partial results for intra HE and LC configurations were provided. An average gain of about 0.6% was reported. There seems to be several elements discussed together in this contribution. The encoder search complexity is increased. It was noted that the encoder search is limited by using fast search in the current TMuC, and better compression can be obtained just by making a more effective search. Further study was encouraged. 18.8.1.1.1.1.1.1.5 JCTVC-C189 Evaluation of intra prediction based on repetitive pixel replenishment [S. Mochizuki, K. Iwata (Renesas)] This contribution proposed a method of intra prediction based on repetitive pixel replenishment (Intra RPR). The proposal uses a displacement vector with area extrapolation for intra coding. Preliminary partial simulation results reportedly showed that this proposed technique has an average of 0.8%, 0.2%, 1.3% BD-rate improvement, respectively at Y, U and V, against TMuC 0.1, with about double encoding time. Only the encoding the first 20 pictures of each sequence was tested. The reported maximum savings per sequence was 4.8% (0.22 dB) on the Basketball drill relative to TMuC 0.1. The contributor indicated that they are working on integration into a newer TMuC version. The design, as presented, was somewhat preliminary – some modifications were planned (making the prediction size adaptive, and harmonization with other intra coding tools). Further study was encouraged. 18.8.1.1.1.1.1.1.6 JCTVC-C207 Encoder improvement of unified intra prediction [Y. Piao, J.-H. Min, J. Chen (Samsung)] 223 This contribution proposed an encoding scheme for unified directional intra prediction (UIP) which was introduced in JCTVC-B100. This was only a non-normative proposal – only the encoder would be changed – not the decoder. Bit rate reductions of 0.5% for HE all-intra and 0.3% for LC all-intra were reported relative to the TMuC method, with lower search complexity. The results were not cross-verified. Two variants were proposed. Decision: The S1 (9-9-4-4-5) scheme was adopted as the HM encoder method, with additional further study planned, and the S2 scheme will additionally be available in the HM software as a selectable configuration. 18.8.1.1.1.1.1.1.7 JCTVC-C218 Encoding complexity reduction for intra prediction by disabling NxN partition [J. Kim, Y. Jeon, B. Jeon (LG Electronics)] This contribution proposed a simplification of the intra prediction process to reduce encoding complexity. In the current TMuC specification, two kinds of splitting, 2Nx2N and NxN, are allowed for intra prediction and it is signaled by intra_split_flag. In the proposed method, NxN is not allowed for each intra prediction unit (PU) except when the PU has the maximum depth. The proposed modification reportedly achieves approximately 4.5% encoding time reduction with 0.2% loss in BD bit rate under the intra HE setting of JCTVC-B300. When the rotational transform (ROT) and combined intra transform (CIP) are turned off, the proposed method reportedly achieves about 36% encoding time reduction with 0.4% loss in BD bit rate relative to the anchor. It was noted that this proposal includes syntax modification. It was remarked that the software includes a macro switch for a similar treatment in the inter case. Further study was suggested – perhaps as part of block structure testing rather than intra prediction. 18.8.1.1.1.1.1.1.8 JCTVC-C234 Simplified intra smoothing [Y. Zheng, M. Coban, M. Karczewicz (Qualcomm)] In this contribution, a simplified intra smoothing scheme was proposed to improve the current AIS scheme. In the proposed scheme, the prediction samples are filtered according to the prediction unit size and the intra prediction mode. The proposed scheme reportedly achieves 0.4% BD bit rate reduction on average for HE and LC intra configurations, relative to not using any intra smoothing. The encoding complexity was reportedly reduced by about 50% relative to the prior AIS scheme. No syntax is used to indicate whether to filter or not. Visual quality was not studied. Random access and low delay configurations were not tested. The late document JCTVC-C316 reportedly contained a cross-verification report. Further study was encouraged. 18.8.1.1.1.1.1.1.9 JCTVC-C316 Verification Results of Simplified Intra Smoothing (JCTVCC234) [T. Yamakage (Toshiba)] The contents of this late document were verbally reported during the presentation of JCTVC-C234. This contribution reported verification results of JCTVC-C234 (Simplified Intra Smoothing) by Qualcomm. Not all results had not been checked, but the reported results were indicated to be similar to Qualcomm’s results. 18.8.1.1.1.1.1.1.10 JCTVC-C302 Improved intra smoothing for UDI and new AIS fast mode [T. Nguyen, B. Bross, D. Marpe, T. Wiegand (HHI)] (late registration, missing prior, available first day) In this contribution, an adaption of an intra smoothing scheme for UDI was proposed to improve the overall coding performance. Furthermore, this contribution presented modified fast mode for AIS. 224 For when AIS is not in use, the contribution proposed a LUT without syntax to determine when filtering is to be applied. For when AIS is active, the contribution proposed a syntax indicator using the table as part of the entropy coding, and a fast encoding search method to guide the selection. Relative to not using filtering, the LUT-only scheme in the proposal was reported to provide about 0.3% for HE and LC all-intra. Relative to not using filtering, the scheme using syntax in the proposal was reported to provide about 0.8% for HE and 0.9% for LC in all-intra coding. It was suggested that visual quality effects would be desirable to assess. The contributor did not suggest to put this into the TM, but rather to do further study. 18.8.1.1.1.1.1.1.11 JCTVC-C269 A study on the impact of intra smoothing [A. Tabatabai, C. Auyeung, T. Suzuki (Sony)] Please see the discussion of this contribution in section 15.2. 18.9 Transforms and residual coding 18.9.1 Residual segmentation 18.9.1.1.1.1.1.1.1 JCTVC-C068 Improved side information signaling for QVBT in TMuC [B. Lee, M. Kim (KAIST), J. Kim, H.-Y. Kim (ETRI)] (missing prior, available first day) In this contribution, new syntax was proposed for the quadtree based transform to reduce side information for the transform in TMuC. The available maximum and minimum transform size in the current TMuC is 64x64 and 4x4 respectively, which were implemented with the quadtreestructure manner in the current version of TMuC. The structure is reportedly well adapted to the characteristics of the input signal while the signaling for transform types may cause large amounts of side information as the depth of the quadtree increases. Moreover, the coded_block_flags (cbf) for luma and chroma components are another overhead for the quadtree transform structure. The contribution proposed a modification of the side information including split_transform_unit_flag and cbf. The encoding and decoding for split_transform_unit_flag was proposed to be skipped when the cbf for luma (Y) and chroma (UV) are all zero. A new flag with a single bit was proposed to signal the patterns of the quantized coefficient for luma and chroma components. The work reported was preliminary and no overall gain was shown. JCTVC-C277 Redundancy reduction in Cbf and merge coding [B. Li, J. Xu, F. Wu, G. J. Sullivan (Microsoft), H. Li (Univ. of Sci. &Tech. China)] In the current TMuC, tree based coding is widely used to provide a hierarchical representation of information. In some cases, the attribute of one block can be derived from the attributes of the parent block and other "brother" blocks so that the encoder does not have to code the attribute. This document presented methods to reduce redundancy in coding Cbf and merge. Remarks recorded during the review of this contribution included the following: Decision: The first element in the proposal contribution was regarding saving CBF bits depending on split decisions gives 0.1-0.2% reduction. It was agreed to adopt this – it adds one more condition in parsing and seems straightforward. The second element of the proposal contribution was regarding PU based merging and CU based merging. Further study of this aspect was suggested, and it was suggested that we would need a more precise description relative to the upcoming TMuC text and that we may have other solutions e.g. context modeling. 225 18.9.2 Transform complexity reduction 18.9.2.1.1.1.1.1.1 JCTVC-C096 Low complexity rotational transform [F. C. A. Fernandes (Samsung)] The Rotational Transform (ROT) is a secondary transform that improves coding efficiency and is implemented in the JCT-VC Test Model under Consideration (TMuC). This contribution proposed a lifting-factorization complexity reduction for the 8x8 ROT. The ROT is split into compound Given’s rotation matrices which are then each factored into a triplet of integer lifting matrices. The lifting-matrix coefficients can reportedly be implemented in hardware with fewer elemental adders than required for a TMuC ROT hardware implementation. With BD-rate of -0.003% this technique reportedly achieves 46% and 21% reduction in the elemental-adder count of forward and inverse ROT implementations, respectively. The contributor recommended that the JCT-VC evaluate this transform in a tool-evaluation experiment or a core experiment. 18.9.2.1.1.1.1.1.2 JCTVC-C112 Fast integer transforms for the HEVC test model [W. Dai, M. Krishnan, P. Jesudhas, P. Topiwala (FastVDO)] Beyond the 4x4 and 8x8 transform sizes already found in AVC, larger transform sizes of 16x16, 32x32 and 64x64 have been included in the Test Model under Consideration (TMuC) for HEVC. However, since large transform sizes have non-trivial computational complexity, one may ask what is the cost-benefit analysis of their inclusion in the forthcoming Test Model. The larger transforms in TMuC are all reportedly based on Chen's fast DCT algorithm because of its regular butterfly structure and its extensibility to any desired transform sizes of order N=2m with m>=1. For complexity reasons, integer rather than floating-point transform computation is performed. In this document, it was reported that the complexity can be reduced with no loss of performance. Fast transforms of each candidate size were proposed that reportedly not only provide virtually identical coding performance, but offer useful gains in computational complexity. In this way, the complexity of the transforms can reportedly be kept to a minimum, even when using large transform sizes. It was remarked that lifting techniques tend to convert parallel operations into serial ones. However, it was noted that pipelining can mitigate that. A participant asked how to best evaluate complexity – this is more than just counting operations. 18.9.2.1.1.1.1.1.3 JCTVC-C255 DCT+Hadamard low complexity large transform for Inter coding [M. Budagavi, A. Gupte (TI)] This contribution proposed a class of transforms for large block sizes which is a combination of DCT+Hadamard transforms for reducing computational complexity of Inter transforms. The best performing 32x32 transform in this group reportedly provides most of the coding gains of 32x32 DCT for high-efficiency configurations but with 50% reduction in number of multiplications assuming direct matrix multiplication DCT. If Chen’s DCT implementation is assumed, then the number of multiplications is reportedly reduced by 24%. 18.9.2.1.1.1.1.1.4 JCTVC-C209 Low-complexity 16x16 and 32x32 transforms and partial frequency transform [Y.-M. Hong, M.-S. Cheon, I.-K. Kim (Samsung)] This contribution proposed a new 32-point fast DCT scheme based on Loeffler’s design principles. Fast integer realization of the 16-point and 32-point transform were provided based on the proposed transform. In addition, a partial frequency transform scheme was proposed to reduce the complexity of the transforms of the current TMuC further. The proposed approaches reportedly reduce the number of operations significantly with negligible performance loss compared to the current TMuC. 18.9.2.1.1.1.1.1.5 JCTVC-C237 Reduced complexity 32x32 transform by coefficient zero-out [J. Sole, R. Joshi, M. Karczewicz (Qualcomm)] 226 Large block size transforms (up to 64×64) are being considered for HEVC for improving coding efficiency. However, such transforms may be difficult and/or costly to implement in hardware. This proposal presented an approach to simplifying the 32×32 transform by zeroing out the high frequency coefficients. The simplification reduces the number of transform coefficients by 75℅ and reportedly attains performance close to the full 32×32 transform. The loss in terms of BD-rate is 0.14℅ for random access, high efficiency configuration and 0.12℅ for low-delay, high efficiency configuration. Similar to JCTVC-C209, this contribution suggested computation of only the lower frequency coefficients of a large block size transform. 18.9.2.1.1.1.1.1.6 JCTVC-C117 Implementation analysis of transform block size [Y. Yu (Broadcom)] This contribution analyzed the complexity of large transform block sizes, especially for a generic hardware implementation. Based on the analysis in this contribution, the hardware cost for a 64x64 transform block was reported to be roughly eight times that of an 8x8 transform block, which is the maximum transform block size of the AVC coding standard. The hardware cost for a 32x32 transform block was reported to be roughly four times that of an 8x8 transform block. The analysis also provides a compression gain comparison between different transform block sizes. The experimental results reportedly show that 80% to 90% of the total compression gain (2-4%) from large transform block sizes can be captured by 16x16 and 32x32 transform block sizes. Given the cost and compression gain, and that HEVC is also attempting to reduce overall coding complexity, this contribution suggests limiting the maximum transform block size to either 32x32 or 16x16. It was remarked that some of the impact depends on throughput requirements – e.g., whether to process stages of a transform serially or in parallel. The primary focus of the contribution was on hardware implementation – software may behave differently. It was noted that quantization is a closely related topic together with transforms. 18.9.2.1.1.1.1.1.7 JCTVC-C226 Low-complexity configurable transform architecture for HEVC [M. Sadafale, M. Budagavi (TI)] This contribution proposed a matrix multiplication architecture for DCT/IDCT implementation that is configurable and can be re-used across various transform block sizes for HEVC. Matrix multiplication implementation reportedly has the advantage that it is friendly to parallel processing with minimal dependency and control logic. In hardware, matrix multiplication reportedly results in low-area architecture, while in software it reportedly leads to efficient implementation on SIMD processors. Another asserted advantage of matrix multiplication architecture is that it is a unifying architecture in the sense that is flexible enough to support other transforms being considered in HEVC such as directional and 1D transforms. Also matrix multiplication reportedly has better fixed-point behavior than Chen’s DCT/IDCT which reportedly allows for elimination of the existing quantization matrices in TMuC. The memory requirement for storing dequantization matrices in the TMuC decoder reportedly goes down from 7.5 KB to 12 bytes. There is a similar reported memory savings in the TMuC encoder. A fixed-point version of matrix multiplication DCT/IDCT along with reduced size quantization/dequantization matrices optimization was implemented in TMuC-0.7.3. Simulation results reportedly indicate that there is no significant loss in coding efficiency (average 0.0 to -0.1%) when compared to the Chen DCT/IDCT factorization in TMuC-0.7.3. The proponent acknowledged that integration of this scheme into the TMuC would cause a substantial increase in encoder and decoder software simulation runtimes. 227 18.9.3 Alternative types of transforms 18.9.3.1.1.1.1.1.1 JCTVC-C108 Jointly optimal intra prediction and adaptive primary transform [A. Saxena, F. C. A. Fernandes (Samsung)] This document proposed applying either a conventional discrete cosine transform (DCT) or a derived discrete sine transform (DST) for intra prediction as a primary transform in TMuC 0.7. For each of the upto 34 modes in unified intra-prediction in TMuC 0.7, the derived transform (separable along the horizontal and vertical directions) was asserted to be theoretically optimal with performance close to KLT. The proposed primary transform is based on the intra-prediction modes with no additional signaling information, works in a single-pass and reportedly does not incur any additional computational complexity. No training was required to derive the transform. It requires the storage of one sine matrix in addition to the conventional DCT at each block size. Comparison is performed in TMuC 0.7 between the proposed adaptive DCT/DST primary transform and the conventional DCT for two cases, viz., when the secondary transform in intra prediction, i.e., rotational transform is off and on. Simulation results reportedly show a BD Rate improvement of 0.2% and 0.1% averaged over all the video sequences, when the rotational transform is off and on, respectively. Only 25 frames of each video sequence were initially tested. Data was later collected for longer sequences, and was verbally reported to be consistent with the smaller test results. No syntax is used to indicate the selection of transform type. There was a previous similar proposal from I2R (see JCTVC-C037). This proposal is basically the same idea, applied to other block sizes. The contributor indicated that applying it to all block sizes may improve the results. Further study was encouraged. 18.10 IBDI and memory compression See also the section on TE2. 18.10.1.1.1.1.1.1.1 JCTVC-C077 A framework for standardization of memory compression [T. Chujoh, T. Yamakage (Toshiba)] A framework for standardization of memory compression was proposed. A purpose of the contribution was to discuss how to standardize memory compression. Memory compression can reduce the bandwidth of memory access; however, most memory compression techniques depend on hardware architectures and the process of memory compression causes a degradation of coding efficiency and an increase of computational complexity for most implementations. This contribution does not propose a common memory compression algorithm, but rather proposes a framework of memory compression. Since the essence of a conformance point is the output of the decoding process, it was suggested that a bitstream format and a bit rate not be specified; rather, a specification of the distortion and a system to control the distortion would be specified in this framework. The purpose of this framework is to enable introducing arbitrary memory compression effects into the encoder and the decoder in a matching fashion. It was discussed whether encoder/decoder mismatch should be allowed. Perhaps not, but it may be a question worth discussing. In the discussion, the creation of a related AHG was suggested. 18.10.1.1.1.1.1.1.2 JCTVC-C094 An extension of DPCM-based memory compression to 2-D structure with ADPCM [H. Aoki, K. Chono, K. Senzaki, J. Tajime, Y. Senda (NEC)] In this contribution, a 2-D extension of the previously-proposed 1-D DPCM-based memory compression method was presented. According to some previous reports, there are conditions where 2-D image mapping to frame memory provides better memory access bandwidth performance. In such cases, memory compression with a 2-D structure may be more straightforward and may improve coding efficiency. In the proposed 2-D extension, the top-most and left-most samples in 228 every memory compression unit are coded in the same manner as the 1-D case, and other samples are coded with ADPCM. The 1-D DPCM-based method can be regarded as a specialized version of the proposed 2-D ADPCM-based method, where its unit height is one. For each sample coded with ADPCM, the prediction value is selected in the same manner as the DC and AC prediction of the MPEG-4 part 2 standard. Experimental results were reported to have shown that the proposed 2-D ADPCM-based memory compression method has higher coding efficiency than the previouslyproposed 1-D DPCM-based memory compression method. Average coding losses were reported to be 7.9% for test cases without IBDI and 1.5% for test cases 12-bit IBDI, while those of the 1-D methods are 10.0% and 1.9%, respectively. The average increase of decoding time was reported to be 3.8%. Memory bandwidth reduction ratios were reported as 45.1% for test cases without IBDI and 44.3% for test cases with 12-bit IBDI. It was proposed for the method to be adopted in TMuC software with the previously-proposed method in JCTVC-C093, and to be further evaluated in various aspects. 18.10.1.1.1.1.1.1.3 JCTVC-C095 Performance improvement of DPCM-based memory compression by adaptive quantization [H. Aoki, K. Chono, K. Senzaki, J. Tajime, Y. Senda (NEC)] This contribution presented a method for improving coding efficiency of DPCM-based memory compression. In the proposed method, a nonlinear quantization matrix for each memory compression unit (MCU) was adaptively selected from multiple quantization matrices so as to fit quantizer characteristics to image characteristics of each MCU. Each index to the selected quantization matrix is embedded into the compressed data and signaled to the frame memory decompressor. The actual frame memory size and memory access bandwidth reportedly does not increase as the result of this additional signaling, since the indices can reportedly be embedded into a gap area for memory alignment. Experimental results have reportedly shown that the proposed adaptive quantization can reduce the coding loss from 10.7% to 6.9% for test cases without IBDI, and from 2.0% to 1.3% for test cases with 12-bit IBDI, respectively. When applied to the 2-D version presented in JCTVC-C094, more gains were reported to have been achieved and the coding loss can reportedly be reduced to 4.8% for test cases without IBDI and to 1.0% for test cases with 12-bit IBDI, respectively. The average increase of decoding times against anchors was reported to be 14.7% for LC cases and 12.4% for HCE cases when four matrices were used. It can reportedly be shown that the proposed adaptive quantization provides flexibility on the trade-off between coding efficiency and complexity to the DPCM-based memory compression scheme. Anomolous results were reported for Class E coding effects of memory rounding. This should be investigated. 18.11 Complexity analysis JCTVC-C228 A simple cache model for measuring motion compensation bandwidth [M. Budagavi (TI)] Motion compensation reference data accesses dominate the external memory bandwidth requirements in video decoder system. External memory access bandwidth requirements are expected to increase with HEVC since HEVC motion compensation interpolation filters are longer than those in AVC and since HEVC is expected to be used with larger resolution video such as 4Kx2K and 8Kx4K. External memory bandwidth used directly impacts power consumed and cost; in fact, power consumed due to external memory accesses in a video decoder can reportedly equal the power consumed by computational processing in video decoding. Hence techniques that reduce external memory bandwidth are desirable. Memory compression is one technique that is being studied in TE2 and Memory compression AhG for reducing external memory accesses. This contribution proposed a cache model for use in measuring external memory access bandwidth. The cache model is built on top of NEC’s memory bandwidth measurement module in TMuC-0.7. The 229 main asserted goal of this work was to have a simple cache model that captures the most relevant parameters for comparing bandwidth consumed with and without memory compression and that is fair to all block structures used in memory compression. The goal was reportedly not to develop a very sophisticated cache model. This cache model can also be used to compare memory bandwidth requirements of various interpolation filters. The cache model was reportedly developed as a part of Memory Compression AhG. It was agreed to include this in the reference software. The AHG on complexity analysis should perform an example study e.g. for multi-pass and singlepass loop filter optimization to assess the validity of the measurement. JCTVC-C072 TE2: Improved memory bandwidth measurement model for reference frame compression [J. Chen, X. Shen, L. Yu (Zhejiang Univ.)] A memory-bandwidth measurement model for reference frame compression algorithm was proposed. This proposal analyses reference frame decompression processes and divides them into parsing process and decoding process, in which three new concepts, parsing unit, decoded area and decoding area, are defined. Based on this, a memory access area conduction method had been designed to determine the memory access area. It was asserted that the proposed model can still be efficient when memory compression unit is not equal to decompressible unit. It expands the scope of application and can measure memory bandwidth for more general reference frame compression algorithms. The contribution was noted. It was remarked that this is based on software that was provided to TE2 participants. It was suggested to work with TE coordinators to determine its appropriateness for upcoming experiments. 19 Establishment of ad hoc groups The ad hoc groups established to progress work on particular subject areas until the next meeting are described in the table below. The discussion list for all of these ad hoc groups will be the main JCT-VC reflector (jct-vc@lists.rwth-aachen.de). Title and Email Reflector Chairs Mtg JCT-VC project management (jct-vc@lists.rwth-aachen.de) Coordinate overall JCT-VC interim efforts Report on project status to JCT-VC reflector Provide report to next meeting on project coordination status G. J. Sullivan, J.-R. Ohm (cochairs) N 230 HEVC Draft and Test Model editing (jct-vc@lists.rwth-aachen.de) Produce and finalize JCTVC-C402 HEVC Test Model 1 (HM 1) Encoder Description Produce and finalize JCTVC-C403 HEVC text specification Working Draft 1 Gather and address comments for refinement of these documents Coordinate with the Software development and HM software technical evaluation AhG to address issues relating to mismatches between software and text T. Wiegand, K. McCann (cochairs), B. Bross, W.-J. Han, J.R. Ohm, J. Ridge, S. Sekiguchi, G. J. Sullivan (vice chairs) Software development and HM software technical F. Bossen (chair), evaluation D. Flynn, K. Sühring (vice chairs) (jct-vc@lists.rwth-aachen.de) Coordinate development of the HM software and its distribution to JCTVC members Produce documentation of software usage for distribution with the software Deliver TMuC 0.9 software version and the reference configuration encodings according to JCTVC-C500 for use in common conditions experiments. Edit JCTVC-C500 to reflect configurations required to run the software in "HM mode". Deliver HM 1.0 and corresponding reference configuration encodings according to JCTVC-C500. (HM 1.0 will be a stripped version of TMuC 0.9 wherein tools not included in HM are removed from the codebase.) Coordinate with HEVC Draft and Test Model editing AhG to identify any mismatches between software and text 231 N N Slice support development and characterization R. Sjöberg (chair), Y. Chen, K. Kazui (vice chairs) (jct-vc@lists.rwth-aachen.de) Implement basic slice support (independentlydecodable sequences of largest coding units in raster-scan order) into the HM software. Identify issues relating to the draft text description of slice functionality Coordinate the integration of slice support with the software development and HM software technical evaluation ad hoc group Study technical proposals relating to slice structured coding Study the coding efficiency and loss resilience impact of slice-structured coding Identify and discuss additional issues relating to slice-structured coding Spatial Transforms (jct-vc@lists.rwth-aachen.de) Study the transforms in the HM design, including compression performance, computational complexity, dynamic range, storage requirements, etc. Perform analysis of block transform design and architecture, including software and hardware considerations. Discuss transform-related Core Experiments, and identify potential synergies or incompatibilities related to the tools being tested in the CEs. Report the results and conclusions of these discussions and experiments to the JCT-VC. P. Topiwala (chair), M. Budagavi, R. Cohen, R. Joshi (vice chairs) In-loop and post-processing filtering T. Yamakage (chair), K. Chono, Y. J. Chiu, I. S. Chong, (jct-vc@lists.rwth-aachen.de) M. Narroschke (vice chairs) Study enhancement schemes of in-loop filtering, including de-blocking/de-banding/de-noising filters, and adaptive Wiener-based filters including variants with various inputs, combination of filters Study trade-offs and characteristics of filter designs including complexity and subjective and objective performance Discuss relationships and evaluation procedures for the filtering techniques Identify possibilities for harmonization of enhanced in-loop filtering technologies Study the relationship between in-loop and postprocessing filtering 232 N N Coding block structures (jct-vc@lists.rwth-aachen.de) Study techniques of HM relating to coding block structure Characterize the trade-offs involved in coding block structure issues, including complexity, redundancy and compression performance aspects Identify opportunities for harmonization and simplification of coding block structure K. Panusopone (chair), W.-J. Han, T. K. Tan, T. Wiegand (vice chairs) N Reference pictures memory compression (jct-vc@lists.rwth-aachen.de) Study motion compensation memory access bandwidth of HM design Study coding efficiency loss caused by fixed rounding of extended bit depth values for reference picture storage. Study the visual quality impact of reference picture memory compression Study use cases for memory compression techniques Study the appropriate scope of standardization for memory compression Report on conclusions reached K. Chono (chair), T. Chujoh, C. S. Lim, A. Tabatabai, M. Zhou (vice chairs) N Entropy coding (jct-vc@lists.rwth-aachen.de) Study the entropy coding complexity and compression characteristics of CABAC, LCEC, PIPE/V2V, and other entropy coding designs Characterize throughput, memory, silicon area, power requirements, etc. Study parallel context processing, syntax-element partitioning, and other parallelism approaches for entropy coding Study and develop approaches for hardware and software evaluation of entropy coding methods Identify and discuss additional issues on entropy coding M. Budagavi (chair), G. Martin-Cocher, A. Segall (vice chairs) N 233 Entropy slices (jct-vc@lists.rwth-aachen.de) Study entropy slice coding methods Evaluate the compression performance and other characterstics of entropy slices relative to conventional slices Analyze impacts of throughput, latency, and memory bandwidth for entropy slice methods for various architectures Develop and perform experiments evaluating entropy slice methods Identify and discuss additional issues relating to entropy slices A. Segall (chair) V. Sze, Y.-W. Huang (vice chairs) N Video test material selection (jct-vc@lists.rwth-aachen.de) Identify, collect, and make available a variety of video sequence test material Study the coding performance and characteristics of test materials Identify and recommend appropriate test materials and corresponding test conditions for use in HEVC development T. Suzuki (chair) N Complexity assessment (jct-vc@lists.rwth-aachen.de) Investigate measurement criteria to assess the complexity of encoder and decoder algorithmic design elements, including computational complexity, parallelism, memory bandwidth, memory capacity, dynamic range requirements, etc. Identify and recommend tools for algorithmic complexity assessment D. Alfonso (chair), J. Ridge, X. Wen (vice chairs) N Motion compensation interpolation (jct-vc@lists.rwth-aachen.de) Study the coding efficiency and complexity characteristics of proposed interpolation filtering methods Study the potential for using a unified motion compensation interpolation approach for high efficiency and low complexity configurations Draft proposed core experiments relating to motion compensation interpolation Identify and discuss additional issues relating to motion compensation filtering K. Ugur (chair), E. Alshina, P. Chen, T. Chujoh (vice chairs) N 234 20 Core Experiment planning and Common Conditions 20.1 General issues for CEs Because the first working draft and HEVC test model (referred to as the HM) have now been selected, group coordinated experiments are now referred to as "core experiments" rather than "tool experiments". A preliminary CE description is to be approved at the meeting at which the CE plan is established. It is possible to define sub-experiments within particular CEs, for example designated as CE X.a, CE X.b, etc., for a CE X, where X is the basic CE number. As a general rule, it was agreed that each CE should be run under the same testing conditions using one software codebase, which should be based on the HM software codebase. The general agreed common conditions for experiments were described in the output document JCTVC-C500. A deadline of twp weeks after the meeting was established for organizations to express their interest in participating in a CE to the CE coordinators and for finalization of the CE descriptions by the CE coordinator with the assistance and consensus of the CE participants. Final CEs should clearly describe specific tests to be performed, not describe vague activities. Activities of a less specific nature are delegated to Ad Hoc Groups rather than designated as CEs. Experiment descriptions should be written in a way such that it is understood as a JCT-VC output document (written "third party perspective", not a company proponent perspective). The experiment descriptions should generally not express opinions or suggest conclusions – rather, they should just describe what technology will be tested, how it will be tested, who will participate, etc. CE descriptions should not contain verbose descriptions of a technology (at least not unless the technology is not adequately documented elsewhere). Instead, the CE descriptions should refer to the relevant proposal contributions for any necessary further detail. However, the complete detail of what technology will be tested must be available – either in the CE description itself or in referenced documents that are also available in the JCT-VC document archive. Those who proposed technology in the respective context (by this or the previous meeting) can propose a CE or CE sub-experiment. Harmonizations of multiple such proposals and minor refinements of proposed technology may also be considered. Other subjects would not be designated as CEs. Any technology must have at least one cross-check partner to establish a CE – a single proponent is not enough. It is highly desirable have more than just one proponent and one cross-checker. It is strongly recommended to plan resources carefully and not waste time on technology that may have little benefit – it is also within the responsibility of the CE coordinator to take care of this. CE participants are obligated to submit a report of the results of their experiment testing and any further analysis of the subject matter. Each participant is required to submit a report – measures will be taken if this principle is violated without good reason. A summary report written by the coordinator (with the assistance of the participants) is also expected to be provided to the subsequent meeting. Non-final CE plan documents were reviewed and given tentative approval during the meeting. The particular CEs planned at this meeting were documented in the documents numbered JCTVCC5xx, with xx = 01, 02, etc. The CE description for each planned CE is described in an associated output document JCTVCB3xx for CExx, where "xx" is the CE number. It must be understood that the JCT-VC is not obliged to accept the test methodology or outcome of a CE as being adequate. Good results from a CE do not impose an obligation on the group to accept the result (e.g., if the expert judgment of the group is that further data is needed or that the test methodology was flawed). 235 20.2 Common Conditions Preferred Common Conditions for experiment testing that are intended to be appropriate for both CEs and other experiments were selected by the group and described in output document JCTVCC500. 21 Output documents The following documents were agreed to be produced as outputs of the meeting. Names recorded below indicate those responsible for document production. 21.1.1.1.1.1.1.1.1 JCTVC-C400 Meeting Report of 3rd JCTVC Meeting [G. J. Sullivan, J.-R. Ohm] 21.1.1.1.1.1.1.1.2 JCTVC-C401 The process of Test Model development for the HEVC initiative [G. J. Sullivan, J.-R. Ohm] 21.1.1.1.1.1.1.1.3 JCTVC-C402 High Efficiency Video Coding (HEVC) Test Model 1 (HM 1) Encoder Description [K. McCann (primary), B. Bross, S. Sekiguchi] 21.1.1.1.1.1.1.1.4 JCTVC-C403 High Efficiency Video Coding (HEVC) text specification Working Draft 1 [T. Wiegand (primary), W.-J. Han, G. J. Sullivan, J.-R. Ohm] 21.1.1.1.1.1.1.1.5 JCTVC-C404 HEVC Software guidelines [D. Flynn, K. Sühring] 21.1.1.1.1.1.1.1.6 JCTVC-C405 Summary of HEVC Model (HM) and Working Draft 1 [T.K. Tan, G. J. Sullivan , J.-R. Ohm] 21.1.1.1.1.1.1.1.7 JCTVC-C500 Common HM test conditions and software reference configurations [F. Bossen] 21.1.1.1.1.1.1.1.8 JCTVC-C501 Core Experiment 1: Decoder-side Motion Vector Derivation [Y.-J. Chiu (primary), H. Yu, Y.-W. Huang, S. Sekiguchi] 21.1.1.1.1.1.1.1.9 JCTVC-C502 Core Experiment 2: Flexible Motion Partitioning [E. Francois (primary), X. Zheng, P. Chen] 21.1.1.1.1.1.1.1.10 JCTVC-C503 Core Experiment 3: Interpolation for MC (Luma) [T. Chujoh] 21.1.1.1.1.1.1.1.11 JCTVC-C504 Core Experiment 4: Interpolation for MC (Chroma) [E. Alshina (primary), Y.-L. Lee, P. Chen, H. Lakshman] 21.1.1.1.1.1.1.1.12 JCTVC-C505 Core Experiment 5: Low Complexity Entropy Coding Improvement [X. Wang] 21.1.1.1.1.1.1.1.13 JCTVC-C506 Core Experiment 6: Intra prediction improvements [A. Tabatabai (primary), A. Tanizawa, A. Segall, H. Yu, S. Lei, J. Xu, J. Chen, Y.-J. Jeon, G. Li, M. Mrak] 21.1.1.1.1.1.1.1.14 JCTVC-C507 Core Experiment 7: Alternative transforms [R. Cohen (primary), C. Yeo, R. Joshi] 21.1.1.1.1.1.1.1.15 JCTVC-C508 Core Experiment 8: In-Loop filtering [T. Yamakage (primary), T. Chujoh, Y. W. Huang, K. Chono, I. S. Chong, T. Yamamoto, J. Lim, B. Jeon, J. Xu, M. Narroschke] 21.1.1.1.1.1.1.1.16 JCTVC-C509 Core Experiment 9: MV coding [J. Jung, B. Bross] 236 21.1.1.1.1.1.1.1.17 JCTVC-C510 Core Experiment 10: Number of Intra Prediction Directions [K. Sugimoto] 21.1.1.1.1.1.1.1.18 JCTVC-C511 Core Experiment 11: Coefficient Scanning and Coding [V. Sze (primary), K. Panusopone, J. Chen, T. Nguyen, M. Coban] 21.1.1.1.1.1.1.1.19 JCTVC-C512 Core Experiment 12: Adaptive Motion Vector Resolution [W.-J. Chien] 21.1.1.1.1.1.1.1.20 JCTVC-C513 Core Experiment 13: Intra Smoothing [M. Coban (primary), B. Bross, J. Chen] 22 Future meeting plans and closing of the meeting Future meeting plans were kept as previously planned: Meeting under ITU-T SG 16 auspices when it meets (starting meetings on the Wednesday of the first week and closing it on the Wednesday of the second week of such meeting), and Otherwise meeting under ISO/IEC JTC 1/SC 29/WG 11 auspices when it meets (starting meetings on the Thursday prior to such meetings and closing it on the last day of the WG 11 meeting). The JCT-VC chairmen proposed to hold the 4th JCT-VC meeting during 20-28 January 2011 under WG11 auspices in Daegu, KR. Further JCT-VC meetings are proposed to be held during 16-23 March 2011 under ITU-T auspices in Geneva, CH, and during 14-22 July 2011 under WG 11 auspices in Torino, IT. The local hosts of the meeting and the WG11 parent body under whose auspices the meeting was held were thanked for the arrangements provided for the meeting. The local hosts included Sun YatSen University, Peking University, and Guangzhou CHNAVS Digital Technology Co., Ltd. The meeting was closed at approximately 1:30 p.m. on Friday 15 October. 237 Annex A to JCT-VC report: List of documents JCT-VC number MPEG number JCTVC-C001 m18041 2010-09-27 22:20:52 2010-10-07 01:42:48 2010-10-07 01:42:48 JCT-VC AHG report: JCT-VC project management G. J. Sullivan, J.-R. Ohm (co-chairs) JCTVC-C002 2010-09-27 m18042 22:25:08 2010-10-07 04:39:50 2010-10-07 04:39:50 JCT-VC AHG report: Test Model under Consideration (TMuC) editing K. McCann (chair), M. Karczewicz, J. Ridge, S. Sekiguchi, T. Wedi, T. Wiegand (vice-chairs) JCTVC-C003 2010-09-27 m18043 22:27:18 2010-10-13 01:40:29 2010-10-13 01:40:29 JCT-VC AHG report: Software development and TMuC software technical evaluation F. Bossen (chair), P. Chen, D. Flynn, W.-J. Han, K. Sühring, H. Schwarz, K. Ugur (vice-chairs) JCTVC-C004 m18044 2010-09-27 22:29:13 2010-10-02 10:02:13 2010-10-08 10:12:20 JCT-VC AHG report: Alternative transforms R. Cohen, R. Joshi (cochairs) JCTVC-C005 m18045 2010-09-27 22:31:47 2010-10-01 15:02:14 2010-10-07 09:04:55 T. Yamakage (chair), Y. J. JCT-VC AHG report: In-loop and post-processing filtering Chiu, M. Karczewicz, M. Narroschke (co-chairs) JCTVC-C006 m18046 2010-09-27 22:34:03 2010-10-04 20:24:30 2010-10-04 20:24:30 JCT-VC AHG report: Large block structures K. Panusopone (chair), M. Budagavi, W.-J. Han, D. He (vice-chairs) JCTVC-C007 m18047 2010-09-27 22:35:40 2010-10-01 12:46:27 2010-10-05 12:57:57 JCT-VC AHG report: Memory compression K. Chono (chair), T. Chujoh, C. S. Lim (vice-chairs) JCTVC-C008 m18048 2010-09-27 22:40:30 2010-10-05 01:57:23 2010-10-05 01:57:23 JCT-VC AHG report: Parallel entropy coding M. Budagavi, A. Segall (cochairs) JCTVC-C009 m18049 2010-09-27 22:47:50 2010-10-07 04:41:43 2010-10-07 04:41:43 JCT-VC AHG report: Screen content coding J. Xu, W. Ding (co-chairs) JCTVC-C010 m18050 2010-09-27 22:55:14 2010-10-02 23:17:02 2010-10-02 23:17:02 JCT-VC AHG report: Complexity assessment D. Alfonso (chair), J. Ridge, X. Wen (vice-chairs) JCTVC-C021 m18028 2010-09-25 01:53:37 2010-10-02 23:19:39 2010-10-05 13:03:21 Evaluation result of JCTVC-B031 K. Kazui, J. Koyama, A. Nakagawa (Fujitsu) Created First upload Last upload Title 238 Source JCTVC-C022 m18032 2010-09-25 12:24:14 2010-09-25 12:25:31 2010-09-25 12:25:31 WG11 USNB contribution: Suggested practices for the HEVC project in the JCT-VC A. G. Tescher (for USNB to WG11) JCTVC-C023 m18033 2010-09-25 12:27:32 2010-09-25 12:28:38 2010-09-25 12:28:38 WG11 USNB contribution: HEVC standardization timeline A. G. Tescher (for USNB to WG11) JCTVC-C024 m18035 2010-09-27 08:40:56 2010-10-01 10:49:44 2010-10-02 11:21:12 TE1: Cross-check result of DMVD proposal JCTVC-C124 T. Murakami, T. Yokoyama, (Mitsubishi) K. Nakamura (Hitachi) JCTVC-C025 m18036 2010-09-27 19:25:11 2010-10-01 11:21:22 2010-10-01 11:21:22 CTB splitting on frame boundary for arbitrary resolution video F. Yang, W. Li, S. Wan, Y. Chang (Xi'dian Univ.) JCTVC-C026 m18037 2010-10-01 17:46:23 2010-10-02 23:23:44 2010-10-02 23:23:44 Proposal of frame parameters FLCTB and FSCTB for video content fitting F. Yang, W. Li, S. Wan, Y. Chang (Xi'dian Univ.) JCTVC-C027 m18038 2010-09-27 19:31:40 2010-10-02 23:24:28 2010-10-05 17:05:48 TE7: Verification results of Peking Univ. proposal (modeJ. Kim, B. Jeon (LG dependent residual reordering for intra prediction Electronics) residual) JCTVC-C028 m18039 2010-09-27 19:34:29 2010-10-02 23:25:17 2010-10-02 23:25:17 TE6: Verification results of Huawei and Hisilicon proposals (short distance intra prediction using the correlation between lines and pixels) J. Kim, B. Jeon (LG Electronics) JCTVC-C029 m18040 2010-09-27 19:56:17 2010-10-02 23:25:50 2010-10-02 23:25:50 TE12: Report on 64x64 versus 32x32 maximum coding unit size for low complexity configuration P. Pandit (Harmonic) JCTVC-C030 m18051 2010-09-28 03:15:03 2010-10-04 02:56:39 2010-10-05 08:56:34 TE12: Report on AMP evaluation E. Francois (Technicolor), L. Guillo (INRIA), A. Ichigaya (NHK), H. Yu (Huawei) JCTVC-C031 m18052 2010-09-28 03:22:04 2010-10-02 10:26:11 2010-10-08 13:16:23 TE3 subtest 2: Report on simplified geometry block partitioning E. Francois, D. Thoreau, P. Bordes (Technicolor), L. Guillo (INRIA) JCTVC-C032 m18053 2010-09-28 06:28:12 2010-10-02 23:26:35 2010-10-05 08:34:08 Characteristics of Super Hi-Vision test sequences Y. Shishikui, Y. Matsuo, A. Ichigaya, K. Iguchi, S. Sakaida (NHK) JCTVC-C033 m18054 2010-09-28 07:07:40 2010-10-02 23:28:06 2010-10-11 11:39:59 TE3: Motion compensation with adaptive warped reference S. Park, J. Park, B. Jeon (LG Electronics) JCTVC-C034 m18055 2010-09-28 20:36:36 2010-10-02 10:27:44 2010-10-08 03:48:17 TE3 subtest 2: Cross-check of results from Huawei E. Francois, P. Bordes (Technicolor) JCTVC-C035 m18056 2010-09-28 21:04:38 2010-10-02 09:16:00 2010-10-02 09:16:00 TE7: Cross-verification of the 1D directional unified transform in TMuC R. Cohen, A. Vetro, H. Sun (Mitsubishi) JCTVC-C036 m18057 2010-09-28 21:06:39 2010-10-02 09:16:35 2010-10-09 14:12:55 Alternative performance measurement of MDDT and ROT in TMuC R. Cohen, A. Vetro, H. Sun (Mitsubishi) 239 JCTVC-C037 m18058 2010-09-28 21:22:52 2010-10-02 23:31:56 2010-10-07 17:54:33 TE7: Results for mode-dependent fast separable KLT for C. Yeo, Y. H. Tan, Z. Li, S. block-based intra coding Rahardja (I2R) JCTVC-C038 m18059 2010-09-28 21:23:48 2010-10-02 23:32:29 2010-10-07 17:55:18 TE7: Cross-check results of MDDT simplification proposal from Huawei C. Yeo, Y. H. Tan, Z. Li, S. Rahardja (I2R) JCTVC-C039 m18060 2010-09-28 21:24:39 2010-10-02 23:32:57 2010-10-07 17:56:13 Choice of transforms in MDDT for unified intra prediction C. Yeo, Y. H. Tan, Z. Li, S. Rahardja (I2R) JCTVC-C040 m18061 2010-09-28 21:25:38 2010-10-02 23:33:57 2010-10-07 17:56:49 Chroma intra prediction using reconstructed luma C. Yeo, Y. H. Tan, Z. Li, S. Rahardja (I2R) JCTVC-C041 m18062 2010-09-28 21:34:51 2010-10-01 09:57:38 2010-10-05 14:01:53 TE12: Performance of partition based illumination compensation (PBIC) A. Fujibayashi, S. Kanumuri, F. Bossen, T. K. Tan (NTT DoCoMo) JCTVC-C042 m18063 2010-09-28 21:36:09 2010-10-03 05:22:19 2010-10-03 05:22:19 TE5: Results for simplification of unified intra prediction T. K. Tan, Frank Bossen (NTT DoCoMo) JCTVC-C043 m18064 2010-09-28 21:37:37 2010-10-02 11:02:42 2010-10-02 11:03:26 TE11: Simulation results of merge/skip (3.2b and 3.2d) Y. Suzuki, F. Bossen (NTT DoCoMo) JCTVC-C044 m18065 2010-09-28 21:39:08 2010-10-03 05:23:08 2010-10-03 05:23:08 TE12: Results for experiments on max CU size, RDOQ and AIS F. Bossen, T. K. Tan (NTT DoCoMo) JCTVC-C045 m18066 2010-09-28 21:42:07 2010-10-02 23:34:53 2010-10-08 05:32:50 TE6.a: Cross verification for line based intra prediction T. K. Tan, Junya Takiue (NTT DoCoMo) JCTVC-C046 m18067 2010-09-28 21:44:15 2010-10-02 23:35:46 2010-10-02 23:35:46 TE5: Summary report on simplification of unified intra prediction T. K. Tan (NTT DoCoMo), M. Budagavi (TI), J. Lainema (Nokia) JCTVC-C047 m18068 2010-09-28 21:49:34 2010-10-02 23:36:17 2010-10-02 23:36:17 TE12.4: Transform skip flag test (off vs. on) by Fraunhofer HHI B. Bross (Fraunhofer HHI) JCTVC-C048 m18069 2010-09-28 21:51:55 2010-10-02 23:36:50 2010-10-02 23:36:50 TE12.6: PIPE and LCEC tested against CABAC by Fraunhofer HHI B. Bross (Fraunhofer HHI) JCTVC-C049 m18070 2010-09-28 21:53:13 2010-10-02 23:37:18 2010-10-02 23:37:18 TE12.4: Transform coding HHI tested against Samsung proposal by Fraunhofer HHI B. Bross (Fraunhofer HHI) JCTVC-C050 m18071 2010-09-28 21:54:35 2010-10-02 23:37:53 2010-10-02 23:37:53 TE12.2: Interpolation filter SIFO and DIF tested against MOMS by Fraunhofer HHI B. Bross (Fraunhofer HHI) JCTVC-C051 m18072 2010-09-28 21:55:37 2010-10-02 23:38:24 2010-10-02 23:38:24 TE12.6: PIPE tested against LCEC by Fraunhofer HHI B. Bross (Fraunhofer HHI) JCTVC-C052 m18073 2010-09-28 21:57:07 2010-10-02 23:38:53 2010-10-02 23:38:53 TE12.1: Block merging (MRG) test (off vs. on) by Fraunhofer HHI B. Bross (Fraunhofer HHI) JCTVC-C053 m18074 2010-09-28 2010-10-02 2010-10-02 TE12.3: Adaptive intra smoothing (AIS) test (slow vs. B. Bross (Fraunhofer HHI) 240 21:58:26 23:39:24 23:39:24 fast) by Fraunhofer HHI TE12.3: Adaptive intra smoothing (AIS) test (off vs. on) by Fraunhofer HHI B. Bross (Fraunhofer HHI) JCTVC-C054 m18075 2010-09-28 21:59:57 2010-10-02 23:39:52 2010-10-02 23:39:52 JCTVC-C055 2010-09-29 m18078 23:45:27 2010-10-02 23:40:24 2010-10-15 10:09:49 Performance report of TMuC for Super Hi-Vision K. Iguchi, A. Ichigaya, Y. Shishikui(NHK), S. Sekiguchi, A. Minezawa (Mitsubishi Electric) JCTVC-C056 m18079 2010-09-29 23:57:16 2010-10-02 00:41:39 2010-10-02 00:41:39 TE12: Evaluation of transform unit (TU) size M. Zhou, V. Sze (TI) JCTVC-C057 m18080 2010-09-29 23:58:56 2010-10-02 00:42:29 2010-10-02 00:42:29 TE12: Evaluation of IBDI and TPE (transform precision extension) M. Zhou (TI) JCTVC-C058 m18081 2010-09-30 00:01:06 2010-10-02 02:50:32 2010-10-05 11:51:49 TE12: Evaluation of entropy coders: PIPE tested against V. Sze, M. Budagavi (TI) CABAC JCTVC-C059 m18082 2010-09-30 00:03:32 2010-10-02 02:51:15 2010-10-08 06:34:31 TE12: Evaluation of transform coefficient coding (HHI_TRANSFORM_CODING) with tool breakdown V. Sze, M. Budagavi, M. Zhou (TI) JCTVC-C060 m18083 2010-09-30 00:04:53 2010-10-02 00:43:29 2010-10-04 13:04:42 TE2: TI reference frame compression proposal M. Zhou, M. Budagavi (TI) JCTVC-C061 m18084 2010-09-30 00:05:42 2010-10-02 08:38:43 2010-10-09 02:06:09 TE5: TI evaluation of unified intra prediction simplifications M. Budagavi (TI) JCTVC-C062 m18085 2010-09-30 00:06:21 2010-10-02 08:39:45 2010-10-09 09:40:07 TE8: TI parallel context processing (PCP) proposal M. Budagavi (TI) JCTVC-C063 m18086 2010-09-30 00:07:15 2010-10-02 02:52:02 2010-10-10 08:13:02 TE8: Evaluation of RIM parallel context processing (PCP) V. Sze, M. Budagavi (TI) proposal JCTVC-C064 m18087 2010-09-30 00:08:17 2010-10-02 02:52:40 2010-10-10 08:19:41 TE8: Evaluation of RIM-V2V entropy coding V. Sze, M. Budagavi (TI) JCTVC-C065 m18088 2010-09-30 00:10:37 2010-10-02 00:44:08 2010-10-04 13:05:34 Testing results on the simplified TMuC configurations M. Zhou (TI) JCTVC-C066 m18089 2010-09-30 00:11:31 2010-10-02 00:44:35 2010-10-02 00:44:35 Cross verification of Mitsubishi 0.7.4 configuration M. Zhou (TI) TE9: Report on large block structure testing J. Kim, M. Kim (KAIST), J. Kim, H.-Y. Kim (ETRI), K. Sato (SONY), X. Shen, L. Yu (Zhejiang Univ.), K. Choi, E. S. Jang (Hanyang Univ.), B. Bross (HHI), W.-J. Han JCTVC-C067 m18090 2010-09-30 00:24:33 2010-10-03 05:26:12 2010-10-06 19:29:37 241 (Samsung), J.-K. Jo, S.-N. Park, D. G. Sim, S.-J. Oh (Kwangwoon Univ.) JCTVC-C068 m18091 2010-09-30 00:28:31 2010-10-05 15:45:15 2010-10-11 18:16:12 Improved side information signaling for QVBT in TMuC B. Lee, M. Kim (KAIST), J. Kim, H.-Y. Kim (ETRI) JCTVC-C069 m18092 2010-09-30 00:36:29 2010-10-03 05:27:20 2010-10-03 05:27:20 TE3: Test results of second order prediction [Withdrawn] S. Li, L. Yu (Zhejiang Univ.) JCTVC-C070 m18093 2010-09-30 00:38:45 2010-10-02 23:41:19 2010-10-02 23:41:19 TE7: Cross-check for I2R proposal on mode-dependent fast separable KLT for block-based intra coding X. Zhu, L. Yu (Zhejiang Univ.) JCTVC-C071 m18094 2010-09-30 00:40:05 2010-10-02 23:41:48 2010-10-09 13:12:41 TE10 subset 2: Complexity analysis on Wiener-based in- L. Wang, L. Yu (Zhejiang loop filters Univ.) JCTVC-C072 m18095 2010-09-30 00:41:35 2010-10-02 23:42:24 2010-10-07 09:09:05 TE2: Improved memory bandwidth measurement model for reference frame compression J. Chen, X. Shen, L. Yu (Zhejiang Univ.) JCTVC-C073 m18096 2010-09-30 00:47:07 2010-10-02 23:42:50 2010-10-08 12:17:08 TE2: Reference frame compression using image coder C. S. Lim, V. Wahadaniah, S. Naing, H. W. Sun (Panasonic) JCTVC-C074 m18097 2010-09-30 00:48:41 2011-01-15 00:06:27 2011-01-15 00:06:27 TE2: Cross-check of memory compression results from NEC (JCTVC-B057/JCTVC-C093) [Initially uploaded to the prior document archive site] C. S. Lim, V. Wahadaniah, S. Naing, H. W. Sun (Panasonic) JCTVC-C075 m18098 2010-09-30 00:51:25 2010-10-02 05:46:14 2010-10-09 11:53:28 TE2: Adaptive scaling for bit depth compression on IBDI T. Chujoh, T. Yamakage (Toshiba) JCTVC-C076 m18099 2010-09-30 00:53:40 2010-10-01 16:18:45 2010-10-07 04:25:02 TE2: Summary of TE2 on IBDI and memory compression T. Chujoh (TE coordinator) JCTVC-C077 m18100 2010-09-30 00:54:32 2010-10-02 11:00:51 2010-10-09 11:56:15 A framework for standardization of memory compression T. Chujoh, T. Yamakage (Toshiba) JCTVC-C078 m18101 2010-09-30 00:57:13 2010-10-02 05:48:00 2010-10-09 08:12:21 TE3 subtest 4: High accuracy interpolation filter (HAIF) T. Chujoh, K. Kanou, T. Yamakage (Toshiba) JCTVC-C079 m18102 2010-09-30 00:58:22 2010-10-02 01:23:19 2010-10-09 01:03:09 TE6 subset a: Bidirectional intra prediction T. Shiodera, A. Tanizawa, T. Chujoh, T. Yamakage (Toshiba) JCTVC-C080 m18103 2010-09-30 00:59:49 2010-10-02 01:35:44 2010-10-08 11:57:55 TE7: One-dimensional directional unified transform A. Tanizawa, J. Yamaguchi, T. Shiodera, T. Chujoh, T. Yamakage (Toshiba) JCTVC-C081 m18104 2010-09-30 01:01:17 2010-10-02 01:30:04 2010-10-02 01:30:04 TE7: Cross-verification of mode dependent residual reordering (Peking Univ.) A. Tanizawa, J. Yamaguchi (Toshiba) 242 JCTVC-C082 m18105 2010-09-30 01:03:24 2010-10-01 15:03:34 2010-10-10 05:51:17 TE10 subtest 2: Reduction of number of encoding passes T. Yamakage, T. Chujoh, T. for quadtree adaptive loop filter (QALF) Watanabe (Toshiba) JCTVC-C083 m18106 2010-09-30 01:04:51 2010-10-01 15:04:28 2010-10-11 02:34:48 TE10: Summary of TE10 on in-loop filtering K. Chono, T. Yamakage (TE coordinators) JCTVC-C084 m18107 2010-09-30 01:06:25 2010-10-01 15:05:17 2010-10-12 06:38:08 Coding efficiency report of modification by TMuC draft005 T. Yamakage, T. Chujoh, T. Watanabe (Toshiba) JCTVC-C085 m18108 2010-09-30 01:07:49 2010-10-01 15:06:09 2010-10-09 11:33:40 Comparison of loop & post filtering for in-loop and postprocessing filtering AHG T. Yamakage, T. Chujoh, T. Watanabe (Toshiba) JCTVC-C086 m18109 2010-09-30 01:08:53 2010-10-01 15:44:36 2010-10-01 15:44:36 Experimental results of ALF on low complexity T. Chujoh, T. Yamakage (Toshiba) JCTVC-C087 m18110 2010-09-30 01:09:49 2010-10-01 15:46:14 2010-10-01 15:46:14 TE12.2: Experimental results of interpolation filter T. Chujoh, K. Kanou, T. Yamakage (Toshiba) JCTVC-C088 m18111 2010-09-30 01:10:37 2010-10-01 15:47:41 2010-10-01 15:47:41 TE12.7: Experimental results of IBDI T. Chujoh, T. Yamakage (Toshiba) JCTVC-C089 m18112 2010-09-30 01:20:39 2010-10-02 23:43:44 2010-10-08 11:32:37 TE7: Results for mode-dependent residual reordering for X. Zhao, L. Zhang, S. Ma, intra prediction residual W. Gao (Peking Univ.) JCTVC-C090 m18113 2010-09-30 01:30:24 2010-10-02 23:44:30 2010-10-02 23:44:30 TE7: Cross-verification results of mode-dependent fast separable KLT for block-based intra coding from I2R X. Zhao, L. Zhang, S. Ma, W. Gao (Peking Univ.) JCTVC-C091 m18114 2010-09-30 01:34:44 2010-10-01 21:46:06 2010-10-06 21:58:49 TE10: Conditional joint deblocking-debanding filter K. Chono, K. Senzaki, H. Aoki, J. Tajime, Y. Senda (NEC) JCTVC-C092 m18115 2010-09-30 01:36:05 2010-10-01 15:49:32 2010-10-04 15:56:10 TE12: Report on combined intra prediction evaluation K. Chono, K. Senzaki, H. Aoki, J. Tajime, Y. Senda (NEC) JCTVC-C093 m18116 2010-09-30 01:37:18 2010-10-01 20:05:59 2010-10-06 15:55:14 TE2: 1-D DPCM-based memory compression H. Aoki, K. Chono, K. Senzaki, J. Tajime, Y. Senda (NEC) JCTVC-C094 m18117 2010-09-30 01:38:23 2010-10-01 20:06:55 2010-10-06 18:05:00 H. Aoki, K. Chono, K. An extension of DPCM-based memory compression to 2Senzaki, J. Tajime, Y. Senda D structure with ADPCM (NEC) JCTVC-C095 m18118 2010-09-30 01:39:11 2010-10-01 20:07:41 2010-10-12 04:12:35 Performance improvement of DPCM-based memory compression by adaptive quantization H. Aoki, K. Chono, K. Senzaki, J. Tajime, Y. Senda (NEC) JCTVC-C096 m18119 2010-09-30 01:47:25 2010-10-02 23:45:04 2010-10-07 11:26:21 Low complexity rotational transform F. C. A. Fernandes (Samsung) 243 JCTVC-C097 m18120 2010-09-30 01:52:51 2010-10-02 23:45:31 2010-10-08 08:22:26 TE1:Huawei report on DMVD improvements in TMuC M. Yang, S. Lin, D. Wang, J. Zhou, H. Yu (Huawei) JCTVC-C098 m18121 2010-09-30 01:54:56 2010-10-02 23:45:57 2010-10-07 07:58:52 TE1: Cross-checking of DMVD results from Mitsubishi and Intel S. Lin, D. Wang, M. Yang, H. Yu (Huawei) JCTVC-C099 m18122 2010-09-30 01:58:27 2010-10-02 23:46:25 2010-10-09 08:15:47 TE3.2: Huawei & HiSilicon report on flexible motion partitioning coding X. Zheng (HiSilicon), H. Yu (Huawei) JCTVC-C100 m18123 2010-09-30 02:02:22 2010-10-02 23:46:56 2010-10-02 23:46:56 TE5: Huawei & HiSilicon report on verification test results L. Liu (HiSilicon), J. Zhou, H. Yu (Huawei) JCTVC-C101 m18124 2010-09-30 02:04:44 2010-10-03 00:00:52 2010-10-03 00:00:52 TE6.a: HiSilicon report on short distance intra prediction C. Lai, Y. Lin (HiSilicon) JCTVC-C102 m18125 2010-09-30 02:07:42 2010-10-03 00:01:31 2010-10-09 11:11:12 TE7: Symmetry-based simplification of MDDT H. Yang, J. Zhou, H. Yu (Huawei) JCTVC-C103 m18126 2010-09-30 02:10:17 2010-10-03 00:02:02 2010-10-03 00:02:02 TE7: Report of cross-checking results for Toshiba proposal H. Yang, J. Zhou, H. Yu (Huawei) JCTVC-C104 m18127 2010-09-30 02:12:38 2010-10-03 00:02:39 2010-10-03 00:02:39 TE12: Huawei & HiSilicon report on AMP evaluation X. Zheng (HiSilicon), J. Zhou, H. Yu (Huawei) JCTVC-C105 m18128 2010-09-30 02:14:42 2010-10-03 00:03:14 2010-10-03 00:03:14 TE12: Huawei report on max transform unit size evaluation J. Zhou, D. Wang, H. Yu (Huawei) JCTVC-C106 m18129 2010-09-30 02:16:10 2010-10-03 00:03:43 2010-10-09 11:06:00 Symmetry-based scan order sharing scheme for MDDT H. Yang, J. Zhou, H. Yu (Huawei) JCTVC-C107 m18130 2010-09-30 02:21:55 Fast CABAC [Withdrawn] H. J. Kim (Korea Univ.) JCTVC-C108 m18131 2010-09-30 02:26:17 2010-10-03 00:05:08 2010-10-12 07:00:26 Jointly optimal intra prediction and adaptive primary transform A. Saxena, F. C. A. Fernandes (Samsung) JCTVC-C109 m18132 2010-09-30 02:28:49 2010-10-03 01:34:36 2010-10-15 05:38:13 TE12.1: Results for PU-based merging by Fraunhofer HHI B. Bross (Fraunhofer HHI) JCTVC-C110 m18133 2010-09-30 02:30:17 2010-10-03 03:03:37 2010-10-03 03:03:37 TE12.4: Transform unit quadtree tested against 2-level by Fraunhofer HHI B. Bross (Fraunhofer HHI) JCTVC-C111 2010-09-30 m18134 02:32:25 2010-10-03 03:04:57 2010-10-08 08:26:33 Integration of plane mode in unified intra prediction G. Li (Santa Clara Univ.), L. Liu, N. Ling (Santa Clara Univ.), J. Zheng, P. Zhang (Hisilicon) JCTVC-C112 m18135 2010-09-30 02:34:15 2010-10-03 03:10:10 2010-10-11 08:03:54 Fast integer transforms for the HEVC test model W. Dai, M. Krishnan, P. Jesudhas, P. Topiwala (FastVDO) 244 JCTVC-C113 m18136 2010-09-30 02:38:17 2010-10-01 23:49:10 2010-10-05 04:56:56 Adaptive loop filter with low encoder complexity I.S. Chong, W. Chien, M. Karczewicz (Qualcomm) JCTVC-C114 m18137 2010-09-30 02:41:44 2010-10-07 10:37:25 2010-10-07 10:37:25 Zigzag scan for CABAC/PIPE J. Lou, K. Panusopone, L. Wang (Motorola) JCTVC-C115 2010-09-30 m18138 02:54:46 2010-10-03 05:29:32 2010-10-08 17:43:16 TE1: Summary report of TE1 activity Y.-J. Chiu (Intel), Y.-W. Huang (MediaTek), M. Wien (RWTH Aachen Univ.), H. Yu (Huawei) JCTVC-C116 m18139 2010-09-30 02:56:51 2010-10-03 03:11:33 2010-10-05 10:55:26 TE11: Cross verification of Sony proposal on motion vector coding Y.-J. Jeon, B.-M. Jeon (LG Electronics) JCTVC-C117 m18140 2010-09-30 03:23:03 2010-10-03 03:14:16 2010-10-03 03:14:16 Implementation analysis of transform block size Y. Yu (Broadcom) JCTVC-C118 m18141 2010-09-30 03:27:12 2010-10-03 03:15:16 2010-10-04 11:16:43 TE12.1 cross-check report on block merging (MRG) on/off K. Sugimoto, A. Minezawa, S. Sekiguchi (Mitsubishi Electric) JCTVC-C119 m18142 2010-09-30 03:27:57 2010-10-03 03:18:57 2010-10-03 03:18:57 TE12.2 report on MV prediction AMVP/IMVP K. Sugimoto, A. Minezawa, S. Sekiguchi (Mitsubishi Electric) JCTVC-C120 m18143 2010-09-30 03:28:41 2010-10-03 03:20:04 2010-10-04 11:25:35 TE12.3 cross-check report on AIS on/off K. Sugimoto, A. Minezawa, S. Sekiguchi (Mitsubishi Electric) JCTVC-C121 m18144 2010-09-30 03:30:08 2010-10-03 03:21:14 2010-10-14 01:34:45 Suggested approach toward HEVC test model creation S. Sekiguchi, K. Sugimoto (Mitsubishi Electric) JCTVC-C122 m18145 2010-09-30 03:31:08 2010-10-03 03:23:20 2010-10-06 18:27:41 A suggested starting point for HEVC test model K. Sugimoto, S. Sekiguchi (Mitsubishi Electric) JCTVC-C123 m18146 2010-09-30 03:32:06 2010-10-03 03:38:02 2010-10-03 03:38:02 On selecting coding tools for HEVC test model K. Sugimoto, A. Minezawa, S. Sekiguchi (Mitsubishi Electric) JCTVC-C124 m18147 2010-09-30 03:33:11 2010-10-03 03:39:49 2010-10-15 02:10:32 TE1: Implicit direct vector derivation Y. Itani, S. Sekiguchi (Mitsubishi Electric) JCTVC-C125 m18148 2010-09-30 03:34:14 2010-10-03 03:40:52 2010-10-03 03:40:52 TE1: Spatio-temporal direct mode S. Sekiguchi, Y. Itani (Mitsubishi Electric) JCTVC-C126 m18149 2010-09-30 03:37:21 2010-10-03 05:30:25 2010-10-06 07:34:38 Chroma interpolation filtering using high precision filter D. Y. Kim, J. P. Kim, Y. L. Lee (Sejong University), J. Lim, J. Song (SKT) 245 JCTVC-C127 m18151 2010-09-30 05:39:59 2010-10-03 05:32:53 2010-10-08 10:11:44 TE1: Report of self derivation of motion estimation improvement in TMuC Y.-J. Chiu, L. Xu, W. Zhang, H. Jiang (Intel) JCTVC-C128 m18152 2010-09-30 05:41:20 2010-10-03 05:33:38 2010-10-03 05:33:38 TE1: Cross-checking of DMVD result from Huawei Y.-J. Chiu, L. Xu, W. Zhang, H. Jiang (Intel) JCTVC-C129 m18153 2010-09-30 05:42:33 2010-10-03 05:34:20 2010-10-08 10:13:51 TE12: Cross verification for adaptive loop filter (ALF) (on Y.-J. Chiu, L. Xu, W. Zhang, vs. off) H. Jiang (Intel) JCTVC-C130 m18154 2010-09-30 08:09:06 2010-10-02 16:56:13 2010-10-08 06:23:27 TE10 subtest 1: Results of intra deblocking filter testing by SKKU/SKT J. Yang, K. Won, B. Jeon (SKKU), J. Lim, J. Song (SKT) JCTVC-C131 m18155 2010-09-30 08:11:10 2010-10-02 11:49:32 2010-10-02 11:54:33 TE10 subtest 1: Cross-verification result of Microsoft deblocking filter proposal by SKKU/SKT J. Yang, K. Won, B. Jeon (SKKU), J. Lim, J. Song (SKT) JCTVC-C132 m18156 2010-09-30 08:12:22 2010-10-02 11:51:54 2010-10-02 11:51:54 TE11: Cross-verification result of Orange Labs motion vector competition proposal (3.1a) by SKKU J. Yang, K. Won, B. Jeon (SKKU) JCTVC-C133 m18157 2010-09-30 08:13:24 2010-10-02 14:50:11 2010-10-02 14:50:11 Motion vector competition method with reduced candidate sets [Withdrawn] K. Won, J. Yang, B. Jeon (SKKU), J. Lim, J. Song (SKT) JCTVC-C134 m18158 2010-09-30 08:20:46 2010-10-03 00:56:47 2010-10-09 11:45:55 Comments on V2V coding for TM/TMuC D. He, G. Korodi, P. Imthurn, J. Jamias, D. O'Loughlin, G. Martin-Cocher (RIM) JCTVC-C135 m18159 2010-09-30 08:26:07 2010-10-02 00:16:34 2010-10-11 07:58:53 Flexible scaling of quantization parameter D. Hoang (Zenverge) JCTVC-C136 m18160 2010-09-30 08:35:21 2010-10-01 17:42:26 2010-10-07 10:02:08 TE12.4: Mode dependent direnctional transform (MDDT) X. Zhang (HKUST) test (off vs. on) by HKUST JCTVC-C137 m18161 2010-09-30 08:55:58 2010-10-01 17:59:25 2010-10-01 17:59:25 Decoder-side block boundary decision (DBBD) with OBMC S. Fukushima (JVC) JCTVC-C138 m18162 2010-09-30 09:03:54 2010-10-01 18:08:35 2010-10-12 03:40:33 TE1.a: Implementation report of refinement motion compensation using DMVD on TMuC M. Ueda (JVC) JCTVC-C139 m18163 2010-09-30 09:17:39 2010-10-03 03:42:06 2010-10-10 17:35:36 TE6.a: Huawei & HiSilicon report on cross verification of MSRA's LIC C. Lai, Y. Lin (HiSilicon), J. Zhou (Huawei) JCTVC-C140 m18164 2010-09-30 14:06:56 2010-10-03 03:43:34 2010-10-07 04:03:00 TE9-2: Report on performance tests for different sets of PU modes by Fraunhofer HHI B. Bross (Fraunhofer HHI) JCTVC-C141 m18167 2010-09-30 16:24:37 2010-10-02 06:39:30 2010-10-02 06:39:30 TE8: Crosscheck on TI's proposal of parallel context processing by MediaTek Y.-L. Chang, Y.-W. Huang, S. Lei (MediaTek) JCTVC-C142 m18168 2010-09-30 2010-10-02 2010-10-05 TE10 subtest 1: Improved deblocking filter J. An, K. Zhang, Y. Gao, X. 246 16:29:48 06:41:47 08:56:03 Guo, C.-M. Fu, Y.-W. Huang, S. Lei (MediaTek) JCTVC-C143 m18169 2010-09-30 16:31:16 2010-10-02 06:43:40 2010-10-05 17:31:12 TE10 subtest 2: Coding unit synchronous picture quadtree-based adaptive loop filter (QALF) C.-Y. Tsai, C.-M. Fu, C.-Y. Chen, Y.-W. Huang, S. Lei (MediaTek) JCTVC-C144 m18170 2010-09-30 16:32:20 2010-10-02 06:44:45 2010-10-02 06:44:45 TE10 subtest 2: Crosscheck on TOSHIBA's proposal of adaptive loop filter by MediaTek C.-M. Fu, Y.-W. Huang, S. Lei (MediaTek) JCTVC-C145 m18171 2010-09-30 16:35:47 2010-10-02 06:45:31 2010-10-02 06:45:31 TE10 subtest 2: Crosscheck on SHARP's proposal of adaptive loop filter by MediaTek C.-Y. Tsai, Y.-W. Huang, S. Lei (MediaTek) JCTVC-C146 2010-09-30 m18172 16:38:49 2010-10-02 09:26:54 2010-10-02 09:26:54 TE10 subtest 3: Controlled clipping Y.-L. Chang, C.-M. Fu (MediaTek), A. Segall, Y. Su (Sharp), C.-Y. Chen, Y.-W. Huang, S. Lei (MediaTek) JCTVC-C147 m18173 2010-09-30 16:39:44 2010-10-02 06:47:02 2010-10-05 08:57:57 TE10 subtest 3: Quadtree-based adaptive offset C.-Ming Fu, C.-Y. Chen, Y.W. Huang, S. Lei (MediaTek) JCTVC-C148 m18174 2010-09-30 16:40:27 2010-10-02 06:47:59 2010-10-02 06:47:59 TE12: Crosscheck on deblocking filter in TMuC0.7 by MediaTek Y.-L. Chang, Y.-W. Huang, S. Lei (MediaTek) JCTVC-C149 m18175 2010-09-30 17:44:55 2010-10-03 03:45:06 2010-10-03 03:45:06 TE12.2: Report on asymmetric motion prediction unit (AMP) on/off A. Ichigaya, K. Iguchi (NHK) JCTVC-C150 m18176 2010-09-30 17:51:58 2010-10-01 08:03:22 2010-10-07 16:25:23 TE2: Cross-check of memory compression results from TOSHIBA (JCTVC-C075) Y. Choe, S. Hong (Yonsei Univ.), Y-G. Kim (KGIT) JCTVC-C151 m18177 2010-09-30 17:56:58 Reference frame denoising [Withdrawn] P. Amon, A. Hutter, E. Wige, A. Kaup (Siemens) JCTVC-C152 m18178 2010-09-30 17:58:55 2010-10-03 03:46:04 2010-10-11 02:12:23 Context-adaptive hybrid variable length coding Y. Xu, J. Li, W. H. Chen, D. Tian (Cisco Systems) JCTVC-C153 m18179 2010-09-30 23:22:00 2010-10-01 22:09:49 2010-10-01 22:09:49 TE12: Report on deblocking filter P. Chen, M. Karczewicz (Qualcomm) JCTVC-C154 m18180 2010-09-30 23:34:07 2010-10-02 23:36:10 2010-10-10 16:49:20 Fine granularity slices R. Sjöberg, P. Wennersten (Ericsson) JCTVC-C155 m18181 2010-09-30 23:38:19 2010-10-02 23:32:09 2010-10-12 18:24:33 TMuC 0.7 results with sequence dependent QPs K. Andersson, R. Sjöberg (Ericsson) JCTVC-C156 m18182 2010-09-30 23:40:22 2010-10-02 23:32:47 2010-10-09 17:33:36 TE10: Cross verification of Mediatek's deblocking filter by A. Norkin, R. Sjöberg, K. Ericsson Andersson (Ericsson) JCTVC-C157 m18183 2010-09-30 23:42:33 2010-10-02 23:33:13 2010-10-02 23:33:13 TE6.a: Cross verification of line-based intra prediction 247 J. Samuelsson, R. Sjöberg, K. Andersson (Ericsson) JCTVC-C158 m18184 2010-09-30 23:43:54 2010-10-02 23:33:42 2010-10-02 23:33:42 TE7: Cross verification of MDDT simplification JCTVCB024 K. Andersson (Ericsson) JCTVC-C159 m18185 2010-09-30 23:45:20 2010-10-02 23:34:19 2010-10-02 23:34:19 TE12: Cross verification of RDOQ K. Andersson (Ericsson) JCTVC-C160 m18186 2010-10-01 00:46:38 2010-10-01 22:27:08 2010-10-01 22:27:08 TE12: Crosscheck on LCEC phase 2 P. Chen, M. Karczewicz (Qualcomm) JCTVC-C161 m18187 2010-10-01 03:44:41 2010-10-01 17:53:36 2010-10-09 07:52:08 TE10: Cross-verification result of SKKU/SKT deblocking filter K. Chono, K. Senzaki, H. Aoki, J. Tajime, Y. Senda (NEC) JCTVC-C162 m18188 2010-10-01 03:50:50 2010-10-01 13:25:57 2010-10-07 11:21:48 TE12.2: Results on MC interpolation filters K. Kondo, T. Suzuki (Sony) JCTVC-C163 m18189 2010-10-01 03:52:03 2010-10-01 12:28:00 2010-10-07 11:24:05 TE12.2: Results on MV prediction K. Sato, T. Suzuki (Sony) JCTVC-C164 m18190 2010-10-01 03:53:09 2010-10-03 03:47:01 2010-10-06 18:48:25 TE3 subset 4: Results on bi/single filter switching in FIF K. Kondo, T. Suzuki (Sony) JCTVC-C165 m18191 2010-10-01 03:55:00 2010-10-07 17:07:35 2010-10-15 12:39:26 TE3 subset 4: Cross verification on high accuracy interpolation filter K. Kondo, T. Suzuki (Sony) JCTVC-C166 m18192 2010-10-01 03:55:32 2010-10-03 03:48:12 2010-10-07 03:26:25 TE11: Study on motion vector coding (experiment 3.3.a & K. Sato (Sony) 3.3.c) JCTVC-C167 m18193 2010-10-01 03:56:12 2010-10-03 03:49:10 2010-10-03 03:49:10 Proposal on large block structure and quantization K. Sato (Sony) JCTVC-C168 m18194 2010-10-01 03:57:05 2010-10-03 05:36:59 2010-10-07 11:25:57 Comments on common test conditions K. Kondo, K. Sato, T. Suzuki (Sony) JCTVC-C169 m18198 2010-10-01 05:36:07 2010-10-03 05:37:39 2010-10-12 05:24:59 TE6.b: Experiment results of DCIM E. Maani, L. Dong, W. Liu (Sony) JCTVC-C170 m18199 2010-10-01 05:38:00 2010-10-03 03:50:11 2010-10-09 08:18:28 TE3.2: Report on cross-verification of simplified geometry X. Zheng (HiSilicon), H. Yu block partitioning from Technicolor (Huawei) JCTVC-C171 m18200 2010-10-01 05:39:20 2010-10-03 05:39:31 2010-10-05 06:35:24 TE12.3: Experimental results of edge based prediction A. Tabatabai, C. Auyeung (Sony) JCTVC-C172 m18201 2010-10-01 05:42:20 2010-10-03 05:40:18 2010-10-12 05:32:26 TE6: Summary report of intra prediction improvements tool experiments A. Tabatabai (Sony) JCTVC-C173 m18202 2010-10-01 05:45:36 2010-10-01 14:30:06 2010-10-09 11:43:19 TE10 subtest 2: Parallel adaptive loop filter T. Ikai, T. Yamamoto (SHARP) JCTVC-C174 m18203 2010-10-01 05:47:23 2010-10-04 10:56:35 2010-10-04 10:56:35 TE12.3: Report on combined intra prediction T. Yamamoto, A. Segall (SHARP) 248 JCTVC-C175 m18204 2010-10-01 05:48:11 2010-10-04 10:57:06 2010-10-20 07:56:10 TE6.b: Cross verification of differential coding of intra modes (DCIM) T. Yamamoto, Y. Yasugi (SHARP) JCTVC-C176 m18205 2010-10-01 05:53:40 2010-10-01 14:30:50 2010-10-05 10:42:21 Analysis and improvement of differential coding of intra modes T. Yamamoto, Y. Yasugi (Sharp) JCTVC-C177 m18206 2010-10-01 06:35:49 Horizontal spatial prediction for intra coding [Withdrawn] P. Tao, D. Li, W. Wu, J. Wen (Tsinghua Univ.) JCTVC-C178 m18207 2010-10-01 07:11:57 2010-10-06 05:02:26 2010-10-06 05:02:26 TE2: Cross-check of memory compression results from Panasonic (JCTVC-C074) A. Segall (Sharp) JCTVC-C179 m18208 2010-10-01 07:14:25 2010-10-06 05:03:24 2010-10-06 05:03:24 TE6: Cross-check of bi-directional intra-predition results from Toshiba A. Segall (Sharp) JCTVC-C180 m18209 2010-10-01 07:18:37 2010-10-06 17:00:44 2010-10-10 02:53:54 TE10.3: Cross check of MediaTek's proposal on quadtree-based adaptive offset A. Segall (Sharp) JCTVC-C181 m18210 2010-10-01 07:21:01 2010-10-06 05:04:13 2010-10-07 08:39:06 TE12.4: Cross-check of rotational transform (ROT) K. Misra, A. Segall (Sharp) JCTVC-C182 m18211 2010-10-01 07:24:57 2010-10-06 10:20:39 2010-10-07 08:40:33 TE12.2: Cross-check of adaptive motion vector resolution A. Segall (Sharp) (AMVRES) technology in the TMuC software JCTVC-C183 m18212 2010-10-01 08:05:51 2010-10-02 08:01:01 2010-10-11 14:56:39 Enhanced switching of interpolation filter for HEVC T. Yoshino, S. Naito, S. Sakazawa (KDDI) JCTVC-C184 m18215 2010-10-01 09:06:12 2010-10-03 00:03:20 2010-10-06 18:39:04 TE12.6: PIPE tested against LCEC by Cisco A. Fuldseth (Cisco) JCTVC-C185 m18216 2010-10-01 09:08:53 2010-10-03 00:03:56 2010-10-07 08:16:58 Recent improvements of the low complexity entropy coder (LCEC) in TMuC A. Fuldseth, G. Bjøntegaard (Cisco), A. Hallapuro, K. Ugur, J. Lainema (Nokia) JCTVC-C186 m18217 2010-10-01 09:13:37 2010-10-03 03:51:56 2010-10-03 03:51:56 TE5: Results for simplified intra prediction tests by Nokia J. Lainema, K. Ugur (Nokia) JCTVC-C187 m18218 2010-10-01 09:16:59 2010-10-03 03:53:00 2010-10-03 03:53:00 TE6.a: Cross-check of bidirectional intra prediction (JCTVC-B042) K. Iwata (Renesas) JCTVC-C188 m18219 2010-10-01 09:17:39 2010-10-03 03:54:10 2010-10-09 03:43:41 TE12.3: Results for planar prediction tests by Nokia J. Lainema, K. Ugur, O. Bici (Nokia) JCTVC-C189 m18220 2010-10-01 09:18:20 2010-10-03 03:58:20 2010-10-12 03:38:20 Evaluation of intra prediction based on repetitive pixel replenishment S. Mochizuki, K. Iwata (Renesas) JCTVC-C190 m18223 2010-10-01 11:24:27 2010-10-01 16:05:22 2010-10-02 19:16:05 TE7: Cross-check result of Peking Univ.'s proposal (JCTVC-C089) J. Kim, S.-C. Lim, H. Y. Kim, H. Lee, J. S. Choi (ETRI) JCTVC-C191 m18226 2010-10-01 11:28:00 2010-10-01 17:07:12 2010-10-04 14:16:38 TE11: Cross-check result of merge/skip (3.2c) S. Jeong, J. Lee, H. Y. Kim, S.-C. Lim (ETRI), K. Kim, H. 249 Lee, G. Park (KHU) JCTVC-C192 m18228 2010-10-01 11:32:07 2010-10-01 20:55:06 2010-10-07 17:31:12 TE7: Cross-verification results of Huawei proposal on simplified MDDT for intra prediction residual Y. Choe, J. Kim (Yonsei Univ.), Y-G. Kim (KGIT) JCTVC-C193 m18230 2010-10-01 11:37:51 2010-10-02 05:48:06 2010-10-15 02:08:37 Overlapped block intra prediction K. Zhang, X. Guo, J. An, M. Guo, Y.-W. Huang, S. Lei (MediaTek) JCTVC-C194 m18231 2010-10-01 11:50:13 2010-10-03 03:59:21 2010-10-03 03:59:21 TE10 subtest 2: Cross-check results of JCTVC-C082 [Toshiba] Reduction of number of encoding passes for quadtree-based adaptive loop filter (QALF) P. Wu, S. Paschalakis, N. Sprljan (Mitsubishi Electric) JCTVC-C195 m18232 2010-10-01 11:52:32 2010-10-03 04:01:29 2010-10-03 04:01:29 An enhanced block-based adaptive loop filter in TMuC0.7.0 platform P. Wu, S. Paschalakis, N. Sprljan (Mitsubishi Electric) JCTVC-C196 m18233 2010-10-01 13:32:59 2010-10-03 04:02:43 2010-10-03 04:02:43 TE12.6: Results for LCEC_PHASE2 tests by Nokia K. Ugur, A. Hallapuro, J. Lainema (Nokia) JCTVC-C197 m18235 2010-10-01 13:43:57 2010-10-03 04:03:34 2010-10-03 04:03:34 TE12.2: Results for SIFO-6Tap with DIF tests by Nokia and Qualcomm K. Ugur, J. Lainema (Nokia), R. Panchal, M. Karczewicz (Qualcomm) JCTVC-C198 m18236 2010-10-01 15:02:40 2010-10-02 12:24:43 2010-10-12 04:25:35 TE9: Simulation results for various max. number of transform quadtree depth J. Chen, T. Lee, W.-J. Han (Samsung) JCTVC-C199 m18237 2010-10-01 15:06:05 2010-10-02 12:28:22 2010-10-02 12:33:06 TE11: Report on experiment 3.2.c: Check skip and merge together I.-K. Kim, T. Lee, W.-J. Han (Samsung) JCTVC-C200 m18238 2010-10-01 15:10:38 2010-10-02 12:30:10 2010-10-06 18:44:34 TE12.1: Experimental results of transform unit quadtree/2-level test T. Lee, J. Chen, W.-J. Han (Samsung) JCTVC-C201 m18239 2010-10-01 15:16:37 2010-10-02 12:38:06 2010-10-06 19:01:07 TE12.2: Experimental results of SIFO (DIF) / DCT-IF E. Alshina, J. Chen, N. Shlyakhov, A. Alshin, W.-J. Han (Samsung) JCTVC-C202 m18240 2010-10-01 15:19:08 2010-10-02 13:24:16 2010-10-06 19:14:19 TE12.4: Experimental results of MDDT and ROT by Samsung and Qualcomm E. Alshina, A. Alshin, W.-J. Han (Samsung), R. Joshi, M. Coban (Qualcomm) JCTVC-C203 m18241 2010-10-01 15:26:15 2010-10-02 12:41:06 2010-10-02 12:41:06 TE12.4: Experimental results of transform coefficient coding J. Chen, V. Seregin, W.-J. Han (Samsung) JCTVC-C204 m18242 2010-10-01 15:27:44 2010-10-03 12:30:27 2010-10-05 13:52:24 Bi-directional optical flow A. Alshin, E. Alshina (Samsung) JCTVC-C205 m18243 2010-10-01 15:29:50 2010-10-02 12:43:04 2010-10-07 19:03:35 Low-complexity adaptive coefficient scanning V. Seregin, J. Chen (Samsung) JCTVC-C206 m18244 2010-10-01 2010-10-02 2010-10-11 Chroma intra prediction by scaled luma samples using J. Chen, V. Seregin 250 15:36:38 12:44:14 05:59:20 integer operations (Samsung) JCTVC-C207 m18245 2010-10-01 15:39:22 2010-10-02 12:47:53 2010-10-07 19:06:02 Encoder improvement of unified intra prediction Y. Piao, J.-H. Min, J. Chen (Samsung) JCTVC-C208 m18246 2010-10-01 15:40:37 2010-10-02 12:53:45 2010-10-08 19:35:36 Simulation results of motion partition, block merging and motion competition schemes I.-K. Kim, T. Lee (Samsung) JCTVC-C209 m18247 2010-10-01 15:43:38 2010-10-02 12:56:31 2010-10-08 02:30:43 Low-complexity 16x16 and 32x32 transforms and partial frequency transform Y.-M. Hong, M.-S. Cheon, I.K. Kim (Samsung) JCTVC-C210 m18248 2010-10-01 15:45:42 2010-10-02 12:57:59 2010-10-05 14:06:21 Efficient coefficient coding method for large transform in VLC mode S. Lee, M.-S. Cheon, I.-K. Kim (Samsung) JCTVC-C211 m18249 2010-10-01 15:49:19 2010-10-02 13:00:37 2010-10-05 14:40:31 Inner block oriented ALF processing for memory & bandwidth reduction Y. Sohn, K.-H. Lee, B.-K. Lee, I.-K. Kim (Samsung) JCTVC-C212 m18250 2010-10-01 15:50:17 2010-10-02 12:25:22 2010-10-04 08:16:35 Cross-verification of JCTVC-C086: Experimental results of ALF on low complexity W.-J. Han (Samsung) JCTVC-C213 m18251 2010-10-01 16:11:41 2010-10-03 05:41:28 2010-10-07 06:56:55 Additional results for combined intra prediction M. Mrak, T.Davies, D. Flynn, A. Gabriellini (BBC) JCTVC-C214 m18254 2010-10-01 16:37:58 2010-10-01 19:03:24 2010-10-07 06:48:05 TE12.5: Results for adaptive loop filter using prediction and residual M. Narroschke (Panasonic) JCTVC-C215 m18255 2010-10-01 17:33:56 2010-10-03 04:05:02 2010-10-09 05:16:40 Adaptive MV resolution with directional filters K. Ugur, J. Lainema (Nokia) JCTVC-C216 m18256 2010-10-01 17:36:31 2010-10-01 18:48:44 2010-10-07 08:51:39 TE12.3: Results for edge based prediction V. Drugeon (Panasonic) JCTVC-C217 m18257 2010-10-01 17:51:11 2010-10-01 18:52:49 2010-10-08 18:20:18 TE06.b: Cross verification of differential coding of intra modes (DCIM) V. Drugeon (Panasonic) JCTVC-C218 m18258 2010-10-01 18:15:34 2010-10-01 20:36:37 2010-10-11 11:31:48 Encoding complexity reduction for intra prediction by disabling NxN partition J. Kim, Y.Jeon, B. Jeon (LG Electronics) JCTVC-C219 m18259 2010-10-01 18:37:06 2010-10-01 19:25:33 2010-10-10 09:36:01 In-loop reference frame denoising P. Amon, A. Hutter, E. Wige, A. Kaup (Siemens) JCTVC-C220 m18260 2010-10-01 19:00:04 2010-10-03 04:06:06 2010-10-14 05:41:05 TE3: Summary report for inter prediction in HEVC A. Krutz, T. Sikora (Tech. Univ. Berlin) JCTVC-C221 m18261 2010-10-01 19:03:33 2010-10-03 04:06:56 2010-10-03 04:06:56 TE3: Adaptive global motion temporal prediction for HEVC [Withdrawn] A. Glantz, A. Krutz, T. Sikora (Tech. Univ. Berlin) JCTVC-C222 m18262 2010-10-01 19:08:17 2010-10-03 04:08:16 2010-10-03 04:08:16 In-loop filtering using temporal pixel trajectories [Withdrawn] M. Esche, A. Krutz, A. Glantz, T. Sikora (Tech. Univ. Berlin) 251 JCTVC-C223 m18263 2010-10-01 19:57:24 2010-10-02 08:42:57 2010-10-09 07:19:50 TE8 report G. Martin-Cocher (RIM), M. Budagavi (TI) JCTVC-C224 m18264 2010-10-01 20:18:27 2010-10-02 00:45:40 2010-10-02 00:45:40 Frame coding in vertical raster scan order D.-K. Kwon, H. Kim (TI) JCTVC-C225 m18265 2010-10-01 20:20:29 2010-10-07 04:44:52 2010-10-07 04:44:52 TE12: Summary of evaluation of TMuC tools in TE12 K. McCann (Samsung/Zetacast) JCTVC-C226 m18266 2010-10-01 20:56:12 2010-10-02 02:53:22 2010-10-12 02:54:47 Low-complexity configurable transform architecture for HEVC M. Sadafale, M. Budagavi (TI) JCTVC-C227 m18267 2010-10-01 20:58:29 2010-10-02 02:59:47 2010-10-05 11:52:28 Parallelization of HHI_TRANSFORM_CODING V. Sze, M. Budagavi (TI) JCTVC-C228 m18268 2010-10-01 20:59:54 2010-10-02 03:00:47 2010-10-02 03:00:47 A simple cache model for measuring motion compensation bandwidth M. Budagavi (TI) JCTVC-C229 m18269 2010-10-01 21:05:44 2010-10-01 23:49:57 2010-10-02 08:18:00 TE10 subset 2: Cross check result of MediaTek ALF I. S. Chong, M. Karczewicz (Qualcomm) JCTVC-C230 m18270 2010-10-01 21:14:28 2010-10-02 08:18:44 2010-10-06 17:07:18 TE12: Cross check result of Panasonic's 3-input-ALF I. S. Chong, M. Karczewicz (Qualcomm) JCTVC-C231 m18271 2010-10-01 21:26:12 2010-10-06 16:05:38 2010-10-12 17:30:27 TE12.1: Results for PU based merging Y. Suzuki, T.K. Tan, F. Bossen (NTT DOCOMO) JCTVC-C232 m18272 2010-10-01 21:39:50 2010-10-01 21:41:43 2010-10-01 21:41:43 TE12.7: Verification of JCTVC-C041 (partition based illumination compensation - PBIC) N. Sprljan, S. Paschalakis, P. Wu (Mitsubishi Electric) JCTVC-C233 m18273 2010-10-01 21:44:07 2010-10-02 01:51:49 2010-10-02 01:51:49 TE3 subtest 3: Local intensity compensation (LIC) for inter prediction N. Sprljan, S. Paschalakis, P. Wu (Mitsubishi Electric) JCTVC-C234 m18274 2010-10-01 22:10:09 2010-10-02 10:26:57 2010-10-07 07:11:51 Simplified intra smoothing Y. Zheng, M. Coban, M. Karczewicz (Qualcomm) JCTVC-C235 m18276 2010-10-01 22:40:05 2010-10-01 22:59:24 2010-10-02 08:19:17 TE12: Evaluation of adptive in-loop filter I. S. Chong, M. Karczewicz (Qualcomm) JCTVC-C236 m18277 2010-10-01 22:49:09 2010-10-01 22:51:19 2010-10-04 10:31:49 Report of complexity analysis of geometric partitioning K. Vermeirsch, J. De Cock, R. Van de Walle (Ghent Univ. - IBBT) JCTVC-C237 m18278 2010-10-01 22:51:51 2010-10-02 07:54:20 2010-10-02 07:54:20 Reduced complexity 32×32 transform by coefficient zero-out J. Sole, R. Joshi, M. Karczewicz (Qualcomm) JCTVC-C238 m18279 2010-10-01 22:53:59 2010-10-02 01:19:02 2010-10-03 00:49:23 TE12.8: Results on RDOQ in high efficiency settings J. Zan, J. Meng, M. T. Islam, D. He (RIM) JCTVC-C239 m18280 2010-10-01 22:57:22 2010-10-02 01:58:44 2010-10-02 01:58:44 TE12: Report on interpolation filter: SIFO/DIF R. Panchal, M. Karczewicz (Qualcomm) 252 JCTVC-C240 m18281 2010-10-01 22:57:27 2010-10-03 05:43:21 2010-10-09 14:40:22 TE3: Cross-check results of local intensity compensation D. Karwowski (Poznan Univ. tool from Mitsubishi Electric Tech.) JCTVC-C241 m18282 2010-10-01 22:58:59 2010-10-02 01:56:58 2010-10-02 01:56:58 TE12: Crosscheck on interpolation filter: SIFO(DIF)/MOMS JCTVC-C242 m18283 2010-10-01 23:01:38 JCTVC-C243 2010-10-01 m18284 23:03:08 2011-01-01 23:30:18 (post meeting) 2011-01-01 23:30:18 (post meeting) Cross check result of JCTVC-C084 I. S. Chong, M. Karczewicz (Qualcomm) JCTVC-C244 m18285 2010-10-01 23:06:40 2010-10-07 07:06:47 2010-10-07 07:10:22 TE12.3: Report on planar intra prediction D. Flynn (BBC) JCTVC-C245 m18286 2010-10-01 23:07:35 2010-10-02 01:19:38 2010-10-09 05:55:24 TE 8: Cross verification of TI-PCP proposal for significance map J. Zan, J. Meng, M. T. Islam, D. He (RIM) JCTVC-C246 m18287 2010-10-01 23:25:04 2010-10-02 10:22:51 2010-10-02 10:22:51 TE12.8: Cross-verification of partition-based illumination compensation (PBIC) P. Onno (Canon) JCTVC-C247 m18288 2010-10-01 23:27:35 2010-10-05 13:23:28 2010-10-05 13:23:28 TE3 subset 4: Cross-verification on bi/single filter switching in FIF T. Chujoh, K. Kanou, T. Yamakage (Toshiba) JCTVC-C248 m18289 2010-10-01 23:29:03 2010-10-05 03:00:20 2010-10-05 03:00:20 TE12.2: Cross verification on DCT-IF for chroma T. Chujoh, K. Kanou, T. Yamakage (Toshiba) JCTVC-C249 m18290 2010-10-01 23:35:31 2010-10-03 00:51:58 2010-10-09 11:58:04 TE 8: Reports on V2V coding and context modeling by RIM J. Zan, G. Korodi, J. Meng, M. T. Islam, D. He (RIM) JCTVC-C250 m18291 2010-10-02 00:30:27 2010-10-02 09:50:47 2010-10-08 09:16:04 Low complexity adaptive coefficient scanning M. Coban, R. Joshi, M. Karczewicz (Qualcomm) JCTVC-C251 m18292 2010-10-02 00:37:22 2010-10-02 10:53:45 2010-10-12 03:39:56 Overlapped block motion compensation in TMuC P. Chen, R. Panchal, W.-J. Chien, M. Karczewicz (Qualcomm) JCTVC-C252 m18293 2010-10-02 00:47:24 2010-10-03 03:54:18 2010-10-03 03:54:18 TE 12: Report on evaluation of internal bit depth increase R. Joshi, M. Karczewicz and transform precision extension (Qualcomm) JCTVC-C253 m18294 2010-10-02 01:39:31 2010-10-02 09:24:39 2010-10-06 22:38:38 Encoder speedup for bidirectional averaging with rounding control R. Panchal, M. Karczewicz, P. Chen (Qualcomm) JCTVC-C254 m18295 2010-10-02 02:00:27 2010-10-02 09:25:30 2010-10-02 09:25:30 Test result of SIFO for sequencs with illumination change R. Panchal, M. Karczewicz (Qualcomm) JCTVC-C255 m18296 2010-10-02 02:28:42 2010-10-02 03:02:13 2010-10-12 04:50:35 DCT+Hadamard low complexity large transform for Inter coding M. Budagavi, A. Gupte (TI) R. Panchal, M. Karczewicz (Qualcomm) Document registration withdrawn [Withdrawn] 253 JCTVC-C256 m18297 2010-10-02 02:45:47 2010-10-02 09:51:24 2010-10-02 09:51:24 New results for entropy slices for highly parallel coding K. Misra, J. Zhao, A. Segall (SHARP) JCTVC-C257 m18298 2010-10-02 02:46:40 2010-10-02 10:57:46 2010-10-11 06:11:14 On motion vector competition Y. Su, A. Segall (SHARP) JCTVC-C258 m18299 2010-10-02 03:04:08 2010-10-02 03:31:04 2010-10-06 11:36:01 TE 12: Evaluation of transform skip flag (RQT_ROOT) W.-J. Chien, P. Chen, M. Karczewicz (Qualcomm) JCTVC-C259 m18300 2010-10-02 03:05:04 2010-10-02 03:13:09 2010-10-02 03:13:09 TE12: Results for experiments on Max CU size Y. Yu, K. Panusopone, L. Wang, S.-T. Hsiang, F. Ishtiaq (Motorola) JCTVC-C260 m18301 2010-10-02 03:06:18 2010-10-02 03:14:03 2010-10-10 11:29:44 Flexible picture partitioning K. Panusopone, X. Fang, L. Wang (Motorola) JCTVC-C261 m18302 2010-10-02 03:06:32 2010-10-03 05:44:33 2010-10-03 05:44:33 TE4: Summary of TE4 on variable length coding X. Wang (Qualcomm) JCTVC-C262 m18303 2010-10-02 03:08:46 2010-10-03 05:45:53 2010-10-09 04:54:19 TE4: Report on VLC for coded block flag X. Wang, M. Karczewicz, W.-J. Chien (Qualcomm) JCTVC-C263 m18304 2010-10-02 03:09:58 2010-10-03 05:47:05 2010-10-06 17:21:25 Improvements on VLC M. Karczewicz, W.-J. Chien, X. Wang (Qualcomm) JCTVC-C264 m18305 2010-10-02 03:16:48 2010-10-02 03:29:46 2010-10-02 10:34:33 TE 12: Evaluation of block merging (MRG) W.-J. Chien, P. Chen, M. Karczewicz (Qualcomm) JCTVC-C265 m18306 2010-10-02 03:19:13 2010-10-02 03:29:23 2010-10-02 10:34:04 TE 12: Evaluation of adaptive motion vector resolution (AMVRES) W.-J. Chien, P. Chen, M. Karczewicz (Qualcomm) JCTVC-C266 m18307 2010-10-02 03:21:29 2010-10-02 03:31:41 2010-10-06 10:47:35 TE1: Cross-check result of DMVD proposal from JCTVC- W.-J. Chien, P. Chen, M. C138 (JVC/Kenwood) Karczewicz (Qualcomm) JCTVC-C267 m18308 2010-10-02 03:24:29 2010-10-02 03:28:08 2010-10-02 03:28:08 Test results of transform skip flag and phase 2 VLC integration W.-J. Chien, P. Chen, X. Wang, M. Karczewicz (Qualcomm) JCTVC-C268 m18310 2010-10-02 04:32:25 2010-10-02 07:50:50 2010-10-02 07:50:50 TE12: Report on evaluation of MDDT and ROT R. Joshi, M. Coban, M. Karczewicz (Qualcomm) JCTVC-C269 m18312 2010-10-02 04:45:57 2010-10-03 05:48:12 2010-10-10 11:34:29 A study on the impact of intra smoothing A. Tabatabai, C. Auyeung, E. Maani, T. Suzuki (Sony) JCTVC-C270 m18313 2010-10-02 04:46:40 2010-10-02 04:54:43 2010-10-13 18:24:09 TE6.a: Report of line-based coding X. Peng, J. Xu, F. Wu (Microsoft) JCTVC-C271 m18314 2010-10-02 04:47:48 2010-10-02 06:15:55 2010-10-02 06:15:55 TE6: Cross-verification of HiSilicon's short distance intra prediction by Microsoft X. Peng, J. Xu (Microsoft) JCTVC-C272 m18315 2010-10-02 2010-10-09 2010-10-09 TE6: Cross-verification of MediaTek's overlapped block X. Peng, J. Xu (Microsoft) 254 04:48:39 03:24:49 03:24:49 intra prediction by Microsoft JCTVC-C273 m18316 2010-10-02 04:49:20 2010-10-02 05:11:08 2010-10-13 18:19:13 TE10 subset 1: Report of content-adaptive de-blocking Z. Xiong, X. Sun, J. Xu (Microsoft) JCTVC-C274 m18317 2010-10-02 04:49:37 2010-10-02 05:17:32 2010-10-09 03:37:03 TE10 subset 1: Cross- verification of NEC's joint deblocking-debanding filter by Microsoft X. Xiong, J. Xu (Microsoft) JCTVC-C275 m18318 2010-10-02 04:49:53 2010-10-02 05:20:47 2010-10-13 19:28:30 Improve intra frame coding by PU/TU reordering X. Peng, J. Xu, F. Wu (Microsoft) JCTVC-C276 m18319 2010-10-02 04:50:06 2010-10-02 08:12:20 2010-10-13 18:15:04 Screen content coding results using TMuC C. Lan, J. Xu, F. Wu (Microsoft), G. Shi (Xidian Univ.) JCTVC-C277 m18320 2010-10-02 04:50:20 2010-10-02 05:34:44 2010-10-13 19:33:57 Redundancy reduction in Cbf and merge coding B. Li, J. Xu, F. Wu, G. J. Sullivan (Microsoft), H. Li (Univ. Sci. Tech China) JCTVC-C278 m18321 2010-10-02 04:50:32 2010-10-02 05:38:36 2010-10-13 19:34:27 Redundancy reduction in B-frame coding at temporal level zero B. Li, J. Xu, G. J. Sullivan, F. Wu (Microsoft), H. Li (Univ. Sci. Tech China) JCTVC-C279 m18322 2010-10-02 05:48:31 2010-10-03 00:53:15 2010-10-03 00:53:15 Opportunistic parallel V2V decoding D. He, G. Korodi, E.-h. Yang, G. Martin-Cocher (RIM) JCTVC-C280 m18323 2010-10-02 06:52:21 2010-10-02 07:00:55 2010-10-05 08:52:33 TE8: Crosscheck result of the transcoder for JCTVCB034 source selection for V2V entropy coding in HEVC Y.-L. Chang, Y.-W. Huang, S. Lei (MediaTek) JCTVC-C281 m18324 2010-10-02 07:32:11 2010-10-02 10:29:00 2010-10-07 01:38:00 TE12: Evaluation of fast adaptive intra smoothing M. Coban, Y. Zheng, M. Karczewicz (Qualcomm) JCTVC-C282 m18325 2010-10-02 08:15:22 2010-10-03 05:49:29 2010-10-08 05:51:11 TE3 subtest 1: Cross-check of results from LG M. Tok, A. Glantz, A. Krutz, T. Sikora (Tech. Univ. Berlin) JCTVC-C283 m18326 2010-10-02 09:14:33 2010-10-02 10:03:32 2010-10-02 10:03:32 Region of block based dynamic video processing for HEVC A. Paul (Hanyang Univ. / NCKU) JCTVC-C284 m18327 2010-10-02 09:18:13 2010-10-08 14:05:26 2010-10-08 14:05:26 TE9-2.1 Report on forced RQT split according to PUs (Sharp, Microsoft) A. Segall (Sharp), J. Xu (Microsoft) JCTVC-C285 m18328 2010-10-02 09:32:20 2010-10-02 10:31:28 2010-10-02 10:31:28 Modified uni-directional inter prediction in generalized P and B pictures W.-J. Chien, P. Chen, X. Wang, M. Karczewicz (Qualcomm) JCTVC-C286 m18329 2010-10-02 10:20:23 2010-10-02 10:23:03 2010-10-02 10:23:03 TE12.2: Technicolor & INRIA report on AMP evaluation E. Francois (Technicolor), L. Guillo (INRIA) JCTVC-C287 m18330 2010-10-02 12:56:27 2010-10-04 12:56:12 2010-10-08 08:37:15 Cross-check of encoder speedup for bidirectional averaging with rounding control B. Bross (Fraunhofer HHI) 255 JCTVC-C288 m18331 2010-10-02 13:14:36 2010-10-04 10:50:06 2010-10-04 10:50:06 TE4: Verification results of JCTVC-C262 T. Yamakage (Toshiba) JCTVC-C289 m18332 2010-10-02 13:40:03 2010-10-02 13:43:12 2010-10-07 03:38:46 TE11: Report on experiment 3.1.a: Disabling MV competition J. Jung, G. Clare, S. Pateux (Orange Labs) JCTVC-C290 m18333 2010-10-02 13:48:00 2010-10-02 13:49:40 2010-10-07 03:41:06 TE11: Report on experiment 3.2.b: MV-Competition on Inter, Skip and Direct modes J. Jung, G. Clare, S. Pateux (Orange Labs) JCTVC-C291 m18334 2010-10-02 13:53:14 2010-10-02 13:55:12 2010-10-07 07:52:39 TE11: Report on experiment 3.3.b: ‘temporally oriented' set of predictors for MV-Competition J. Jung, G. Clare, S. Pateux (Orange Labs) JCTVC-C292 m18335 2010-10-02 13:57:10 2010-10-05 15:20:38 2010-10-05 15:20:38 TE11: Verification of experiment 3.2.d by NTT DOCOMO J. Jung, G. Clare, S. Pateux (Orange Labs) JCTVC-C293 m18336 2010-10-02 13:59:55 2010-10-02 14:08:42 2010-10-08 13:25:52 TE11: Summary report for TE11 on motion vector coding J. Jung (Orange Labs) JCTVC-C294 m18337 2010-10-03 03:59:52 JCTVC-C295 m18340 2010-10-03 11:46:12 2010-10-11 17:43:31 JCTVC-C296 m18341 2010-10-03 11:47:37 JCTVC-C297 m18344 JCTVC-C298 Level limits for Progressive High Profile [Withdrawn from JCT-VC] A. Wells (Ambarella), C. Fogg (Harmonic) 2010-10-11 17:44:37 Re-compensation based on partial coefficients H. Zhu (Zhu) 2010-10-05 14:10:07 2010-10-07 20:00:59 Decoding improvement on the PA-Coder H. Zhu (Zhu) 2010-10-03 14:49:12 2010-10-06 19:26:48 2010-10-11 05:21:20 Cross-verification of JCTVC-C213: additional results for combined intra prediction W.-J. Han (Samsung) m18411 2010-10-04 16:37:29 2010-10-07 17:04:02 2010-10-07 17:04:02 TE12.4 Cross-check of unified MDDT/ROT T. Davies, D. Flynn (BBC) JCTVC-C299 m18423 2010-10-04 17:11:53 2010-10-04 17:17:59 2010-10-04 17:17:59 UK National Body comments on HEVC Timescales UKNB of WG11 JCTVC-C300 m18491 2010-10-05 01:59:03 2010-10-07 02:13:29 2010-10-08 22:39:09 High-efficiency entropy coding simplifications V. Sze, M. Budagavi (TI) JCTVC-C301 m18516 2010-10-05 11:55:43 2010-10-05 13:14:49 2010-10-05 13:14:49 Geometry motion partition P. Chen, L. Guo, M. Karczewicz (Qualcomm) JCTVC-C302 m18517 2010-10-05 12:12:15 2010-10-05 18:35:46 2010-10-12 03:33:48 Improved intra smoothing for UDI and new AIS fast mode T. Nguyen (Fraunhofer HHI) JCTVC-C303 m18522 2010-10-05 15:04:43 2010-10-06 03:23:58 2010-10-06 03:23:58 Geometry adaptive block partitioning (GEO) cross-check E. Francois, P. Bordes (Technicolor) JCTVC-C304 m18537 2010-10-06 10:03:34 2010-10-06 10:41:23 2010-10-12 04:52:10 Showing the possibility of fast CABAC H.-Joong Kim, X. Qu (Korea Univ.), W.-J. Han (Samsung) 256 JCTVC-C305 m18544 2010-10-06 16:54:17 2010-10-08 09:09:59 2010-10-08 09:09:59 TE7: Results for Simplification of MDDT Transform W. Ding, Y. Shi, B. Yin (Beijing Univ. Tech) JCTVC-C306 m18546 2010-10-06 22:48:03 2010-10-06 22:53:07 2010-10-06 22:53:07 Investigations for representing rectangular blocks using the merging concept H. Schwarz, D. Marpe, T. Wiegand (Fraunhofer HHI) JCTVC-C307 m18548 2010-10-07 00:59:51 2010-10-07 01:05:51 2010-10-10 11:18:03 Cross Verification of Low Complexity Adaptive Coefficient J. Zan, J. Meng, M. T. Islam, Scanning D. He (RIM) JCTVC-C308 2010-10-08 m18556 10:46:20 2010-12-03 09:28:17 (post meeting) 2010-12-03 09:28:17 (post meeting) Cross verification of MV Coding Proposal by JCTVCC293 K. Sato (Sony) JCTVC-C309 2010-10-08 m18557 10:49:19 2010-12-03 09:28:59 (post meeting) 2010-12-03 09:28:59 (post meeting) Cross verification of DCT-IF for chroma T. Suzuki (Sony) JCTVC-C310 m18568 2010-10-09 05:01:43 2010-10-09 05:19:21 2010-10-09 05:19:21 TE7: Summary report for MDDT Simplification R. Cohen (Merl), C. Yeo (I2R), R. Joshi (Qualcomm) JCTVC-C311 m18569 2010-10-09 06:24:29 2010-10-09 07:20:20 2010-10-12 03:11:10 B. Bross, H. Kirchhoffer, H. Fast intra encoding for fixed maximum depth of transform Schwarz, T. Wiegand quadtree (Fraunhofer HHI) JCTVC-C312 m18571 2010-10-09 12:04:06 2010-10-09 13:07:43 2010-10-13 11:16:05 Cross-verification of JCTVC-C311: Fast intra encoding for fixed maximum depth of transform quadtree W.-J. Han (Samsung) JCTVC-C313 m18572 2010-10-09 19:30:20 2010-10-09 21:10:33 2010-10-09 21:10:33 TE8: Crosscheck Result of the Transcoder of JCTVCB034 for V2V Entropy Coding in HEVC Y. Zheng, R. Joshi, M. Coban, M. Karczewicz (Qualcomm) JCTVC-C314 m18573 2010-10-10 02:39:08 2010-10-11 09:51:21 2010-10-11 11:00:22 Verification results of Samsung's proposal JCTVC-C206 J. Kim, J. Park, B. Jeon (LG Electronics) JCTVC-C315 m18581 2010-10-11 06:39:54 2010-10-11 06:43:38 2010-10-11 06:43:38 On the computation of PSNR and BD-Rate D. Hoang (Zenverge) JCTVC-C316 m18583 2010-10-11 12:55:37 2010-10-11 18:26:13 2010-10-11 18:26:13 Verification Results of Simplified Intra Smoothing (JCTVC-C234) T. Yamakage (Toshiba) JCTVC-C317 m18584 2010-10-12 02:53:18 2010-10-12 03:05:50 2010-10-12 03:06:10 Breakout Report on Interpolation Filters K. Ugur (BoG coordinator) JCTVC-C318 m18586 2010-10-12 15:15:33 2010-10-12 15:19:57 2010-10-12 15:19:57 TMuC text on max CU size seems broken R. Sjöberg (Ericsson) JCTVC-C319 m18590 2010-10-14 2010-10-14 2010-10-14 BoG report: Residual quadtree structure T. Wiegand, H. Schwarz, B. 257 03:28:32 03:41:23 19:05:29 2010-10-14 10:42:10 2010-10-14 10:42:10 JCTVC-C320 m18597 2010-10-14 10:34:43 JCTVC-C400 m18617 2010-10-21 07:41:14 JCTVC-C401 m18589 2010-10-14 02:42:48 JCTVC-C402 m18618 JCTVC-C403 Bross, A. Fuldseth, X. Wang, W.-J. Han (BoG coordinators) TE12: Summary of results of evaluation of TMuC tools in K. McCann (TE coordinator) TE12 Meeting report of the third meeting of the Joint Collaborative Team on Video Coding (JCT-VC), Guangzhou, CN, 7-28 October, 2010 G. J. Sullivan, J.-R. Ohm (chairs) The process of Test Model development for the HEVC initiative G. Sullivan, J.-R. Ohm (chairs) 2010-10-21 07:42:48 High Efficiency Video Coding (HEVC) Test Model 1 (HM 1) Encoder Description K. McCann, B. Bross, S. Sekiguchi (editors) m18619 2010-10-21 07:44:39 High Efficiency Video Coding (HEVC) text specification Working Draft 1 T. Wiegand, W.-J. Han, J.-R. Ohm, G. J. Sullivan (editors) JCTVC-C404 m18594 2010-10-14 06:58:49 2010-10-15 06:12:59 2010-10-15 17:10:22 HEVC software guidelines K. Sühring, D. Flynn, F. Bossen (on behalf of JCTVC) JCTVC-C405 m18616 2010-10-15 08:08:05 2010-10-15 08:35:05 2010-11-01 09:34:49 T. K. Tan, G. J. Sullivan, J.Summary of HEVC working draft 1 and HEVC test model R. Ohm (on behalf of JCT(HM) VC) JCTVC-C500 m18607 2010-10-15 02:59:12 2010-10-15 03:08:44 2010-10-15 03:08:44 Common test conditions and software reference configurations F. Bossen (on behalf of JCTVC) JCTVC-C501 m18604 2010-10-15 02:36:59 2010-10-15 02:38:30 2010-10-30 06:18:41 Description of Core Experiment 1: Decoder-Side Motion Vector Derivation Y.-J. Chiu, H. Yu, Y.-W. Huang, S. Sekiguchi (CE coordinators) JCTVC-C502 m18602 2010-10-15 01:57:36 2010-10-15 01:59:01 2010-10-30 16:03:26 Description of Core Experiment 2: Flexible Motion Partitioning E. Francois, P. Chen, X. Zheng (CE coordinators) JCTVC-C503 m18613 2010-10-15 03:43:20 2010-10-15 03:45:12 2010-10-30 04:44:05 Description of Core Experiment 3: Interpolation Filtering for MC (Luma) T. Chujoh (CE coordinator) JCTVC-C504 m18603 2010-10-15 02:20:32 2010-10-15 02:20:46 2010-11-12 06:03:36 Description of Core Experiment 4: Interpolation for MC (Chroma) E. Alshina, Y.-L. Lee, P. Chen, H. Lakshman (CE coordinators) JCTVC-C505 m18609 2010-10-15 03:21:14 2010-10-15 03:25:06 2010-11-02 00:25:34 Description of Core Experiment 5: Low Complexity Entropy Coding Improvements X. Wang (CE coordinator) JCTVC-C506 m18608 2010-10-15 2010-10-15 2010-11-01 Description of Core Experiment 6: Intra Prediction A Tabatabai (CE 2010-10-14 02:46:52 2010-10-15 04:39:15 258 JCTVC-C507 m18601 03:20:50 03:22:43 23:01:48 Improvement coordinator) 2010-10-14 20:02:08 2010-10-14 20:05:45 2010-11-02 21:27:32 Description of Core Experiment 7: Alternative Transforms R. Cohen, C. Yeo, R. Joshi (CE coordinators) JCTVC-C508 m18612 2010-10-15 03:43:14 2010-10-15 03:44:26 2010-10-30 04:12:21 Description of Core Experiment 8: In-loop filtering T. Yamakage, T. Chujoh, Y. W. Huang, K. Chono, I.S. Chong, T. Yamamoto, J. Lim, B. Jeon, J. Xu, M. Narroschke (CE coordinators) JCTVC-C509 m18600 2010-10-14 17:55:12 2010-10-14 17:57:17 2010-11-19 15:51:40 Description of Core Experiment 9: Motion Vector Coding J. Jung, B. Bross (CE coordinators) JCTVC-C510 m18606 2010-10-15 02:58:50 2010-10-15 03:08:27 2010-11-01 11:10:43 Description of Core Experiment 10: Number of Intra Prediction Directions K. Sugimoto (CE coordinator) JCTVC-C511 m18610 2010-10-15 03:35:54 2010-10-15 03:37:03 2010-11-03 22:40:47 V. Sze, K. Panusopone, J. Description of Core Experiment 11: Coefficient Scanning Chen, T. Nguyen, M. Coban and Coding (CE coordinators) JCTVC-C512 m18605 2010-10-15 02:37:32 2010-10-15 02:38:58 2010-10-30 01:59:59 Description of Core Experiment 12: Adaptive Motion Vector Resolution JCTVC-C513 m18611 2010-10-15 03:41:16 2010-10-15 03:47:00 2010-11-05 22:06:15 Description of Core Experiment 13: Intra Smoothing 259 W.-J. Chien (CE coordinator) M.Coban, B. Bross, J. Chen (CE coordinators) Annex B to JCT-VC report: List of meeting participants The participants of the third meeting of the JCT-VC, according to a sign-in sheet circulated during the meeting (approximately 244 in total), were as follows: 1. Alshina, Elena (Samsung) 2. Amon, Peter (Siemens) 3. Andersson, Kenneth (Ericsson) 4. Aoki, Hirofumi (NEC) 5. Asai, Kohtaro (Mitsubishi Electric) 6. Au, Jicheng (MediaTek) 7. Au, Oscar (Hong Kong Univ. Sci. & Tech.) 8. Auyeung, Cheung (Sony) 9. Bandoh, Yukihiro (NTT) 10. Bang, Gun (ETRI) 11. Barbarien, Joeri (Vrije Univ. Brussels) 12. Bici, Oguz (Nokia) 13. Bivolarsky, Lazar (Skype Technologies) 14. Boitard, Ronan (INRIA) 15. Bopardikar, Rajendra (Intel) 16. Bossen, Frank (DoCoMo USA Labs) 17. Boyce, Jill (Vidyo) 18. Bross, Benjamin (Fraunhofer HHI) 19. Brylants, Tim (Vrije Univ. Brussel) 20. Budagavi, Madhukar (Texas Instruments Inc) 21. Cao, Xiaoran (Tsinghua Univ.) 22. Chen, Jianle (Samsung) 23. Chen, Lulin (Harmonic) 24. Chen, Sijia (Cisco Systems) 25. Chien, Wei-Jung (Qualcomm) 26. Chiu, Yi-Jen (Intel) 27. Cho, In Joon (KBS) 28. Cho, Jaehee (Sejong Univ.) 29. Choi, Haechul (Hanbat Univ.) 30. Choi, Jin Soo (ETRI) 31. Choi, Kiho (Hanyang Univ.) 32. Chong, In Suk (Qualcomm) 33. Chono, Keiichi (NEC) 34. Chujoh, Takeshi (Toshiba) 35. Clare, Gordon (Orange Labs) 36. Coban, Muhammed (Qualcomm) 37. Cohen, Robert (Mitsubishi Electric) 38. Davies, Thomas (BBC R&D) 39. De Cock, Jan (Ghent Univ. - IBBT) 260 40. Ding, Wenping (Beijing Univ. Tech.) 41. Dong, Jie (Chinese Univ. Hong Kong) 42. Drugeon, Virginie (Panasonic) 43. Fan, Xiaopeng (Harbin / HKUST) 44. Fernandes, Felix (Samsung) 45. Flynn, David (BBC) 46. Fogg, Chad (Harmonic) 47. François, Edouard (Technicolor) 48. Fujibayashi, Akira (NTT DoCoMo) 49. Fukushima, Shigeru (JVC Kenwood) 50. Fuldseth, Arild (Tandberg) 51. Glantz, Alexander (Tech. Univ. Berlin) 52. Gu, Chenchen (Tencent) 53. Guillo, Laurent (INRIA) 54. Guo, Xun (MediaTek) 55. Han, Jong-Ki (Sejong Univ.) 56. Han, Woo-Jim (Samsung) 57. He, Dake (RIM) 58. He, Yun (Tsinghua Univ.) 59. Heo, Youngsu (Kyunghee Univ.) 60. Ho, Yo-Sung (GIST) 61. Hoang, Dzung (Zenverge, Inc.) 62. Hong, Soongi (Yonsei Univ.) 63. Hong, Sung-Wook (Sejong Univ.) 64. Hong, Yingjie (ZTE) 65. Horowitz, Michael (Vidyo, Inc.) 66. Hsiang, Shih-Ta (Motorola, Inc.) 67. Hsu, Chih-Wei (MediaTek) 68. Huang, Yu-Wen (MediaTek) 69. Huo, Junyan (Xi'dian Univ.) 70. Ichigaya, Atsuro (NHK) 71. Ishtiaq, Faisal (Motorola Inc.) 72. Jacobs, Marc (Vrije Univ. Brussel) 73. Jang, Euy-Doc (Korea Aerospace Univ.) 74. Jeon, Byeong Moon (LG) 75. Jeon, Byeungwoo (Sungkyunkwan Univ.) 76. Jeon, Yong-Joon (LG Electronics) 77. Jeong, Seyoon (ETRI) 78. Jin, Xin (Waseda Univ.) 79. Jung, Ji Wook (LG) 80. Jung, Joël (Orange Labs) 81. Jung, Tae-Young (Hanyang Univ.) 82. Kang, Jung Won (ETRI) 83. Karczewicz, Marta (Qualcomm) 84. Kazui, Kimihiko (Fujitsu) 261 85. Kim, Hae Kwang (Sejong Univ.) 86. Kim, Hui Yong (ETRI) 87. Kim, Hyun-Dong (Sejong Univ.) 88. Kim, Il Koo (Samsung) 89. Kim, In Kwon (Galaxia Commun.) 90. Kim, Jae-Gon (Korea Aerosp. Univ.) 91. Kim, Jeong-Pil (Sejong Univ.) 92. Kim, Jongho (Yonsei Univ.) 93. Kim, Kibaek (Hanyang Univ.) 94. Kim, Kyungyong (Kwangwoon Univ.) 95. Kim, Munchurl (KAIST) 96. Kim, Seong Hoon (Galaxia Commun.) 97. Kim, Seongwan (Yonsei Univ.) 98. Klomp, Sven (Leibniz Univ. Hannover) 99. Kogure, Takuyo (Panasonic ) 100. Kondo, Kenji (Sony) 101. Koyama, Jumpei (Fujitsu) 102. Krutz, Andreas (Tech. Univ. Berlin) 103. Lai, Polin (Wang) (Samsung) 104. Lainema, Jani (Nokia) 105. Lakshman, Haricharan (Fraunhofer HHI) 106. Laroche, Guillaume (Canon) 107. Lee, Ju Ock (Sejong Univ.) 108. Lee, Sangyoun (Yonsei Univ.) 109. Lee, Shanfu (Huawei) 110. Lee, Yoomjim (Kyunghee Univ.) 111. Lee, Yung-Lyul (Sejong Univ.) 112. Lei, Shawmin (MediaTek) 113. Li, Guichun (Huawei) 114. Li, Junlin (Cisco Systems) 115. Li, Ming (ZTE) 116. Li, Wei (Xi'dian Univ.) 117. Li, Zhengguo (I2R) 118. Lim, Chongsoon (Panasonic Singapore Labs) 119. Lim, Jaehyun (LG Electronics) 120. Lim, Jeongyeon (SK Telecom) 121. Lim, Sung-Chang (ETRI) 122. Lim, Woong (Kwangwoon Univ.) 123. Lin, Sixin (Huawei) 124. Lin, Yongbing (Huawei) 125. Ling, Nam (Santa Clara Univ.) 126. Liu, Hui (Cisco Systems) 127. Liu, Lingzhi (Huawei / Hisilicon) 128. Liu, Yu (ASTRI) 129. Lopez, Patrick (Technicolor) 262 130. Luthra, Ajay (Motorola) 131. Ma, Ran (Shanghai Univ.) 132. Ma, Siwei (Peking Univ.) 133. Mao, Xunan (Tencent) 134. Marpe, Detlev (Fraunhofer HHI) 135. Martin-Cocher, Gaëlle (RIM) 136. Matsuo, Shohei (NTT) 137. McCann, Ken (Zetacast / Samsung) 138. Mochizuki, Seiji (Renesas) 139. Moon, Joo-Hee (Sejong Univ.) 140. Mrak, Marta (BBC) 141. Murakami, Tokumichi (Mitsubishi Electric) 142. Murakami, Tomokazu (Hitachi) 143. Narroschke, Matthias (Panasonic R&D Germany) 144. Nishi, Takahiro (Panasonic) 145. Oh, Seoung-Jun (Kwangwoon Univ.) 146. Ohm, Jens-Rainer (RWTH Aachen Univ.) 147. O'loughlin, Daniel (Research in Motion) 148. Onno, Patrice (Canon) 149. Panusopone, Krit (Motorola) 150. Park, Gwang Hoon (Kyunghee Univ.) 151. Park, Hyoungmee (Sejong Univ.) 152. Park, Jeonghoon (Samsung) 153. Park, Seungwook (LG) 154. Park, Un-Ki (Korea Aerospace Univ.) 155. Paschalakis, Stavros (Mitsubishi Electric R&D Europe B.V.) 156. Peng, Wen-Hsiao (NCTU/ITRI) 157. Qu, Xiaochao (Korea Univ.) 158. Raad, Mohamad (RaadTech Consulting) 159. Ralston, John (Droplet Tech) 160. Ridge, Justin (Nokia) 161. Sakaida, Shinichi (NHK) 162. Sampedro, Jesus (Polycom) 163. Sasai, Hisao (Panasonic) 164. Sato, Kazushi (Sony) 165. Saxena, Ankur (Samsung Telecom. America) 166. Schelkens, Peter (Vrije Univ. Brussel) 167. Segall, Andrew (Sharp) 168. Sekiguchi, Shun-ichi (Mitsubishi Electric) 169. Senzaki, Kenta (NEC) 170. Seo, Chan-Won (Sejong Univ.) 171. Shen, Bazhong (Broadcom) 172. Shibahara, Youji (Panasonic) 173. Shima, Masato (Canon) 174. Shimizu, Shinya (NTT) 263 175. Shiodera, Taichiro (Toshiba) 176. Sim, Donggyu (Kwangwoon Univ.) 177. Sjöberg, Rickard (Ericsson) 178. Song, Jin (Huawei) 179. Sprljan, Nikola (Mitsubishi Electric R&D Centre Europe) 180. Sühring, Karsten (Fraunhofer HHI) 181. Sugimoto, Kazuo (Mitsubishi Electric) 182. Sugio, Toshiyasu (Panasonic) 183. Suh, Jung Suk (Samsung) 184. Sullivan, Gary (Microsoft) 185. Sun, Huifang (Mitsubishi Electric) 186. Suzuki, Teruhiko (Sony) 187. Sze, Vivienne (Texas Instruments) 188. Tabatabai, Ali (Sony Electronics) 189. Takamura, Seishi (NTT) 190. Tan, Thiow Keng (NTT DoCoMo) 191. Tan, Yih Han (I2R) 192. Tanizawa, Akiyuki (Toshiba) 193. Temmermom, Frederik (Vrije Univ. Brussel) 194. Tonomura, Yoshide (NTT) 195. Topiwala, Pankaj (FastVDO) 196. Tsukagushi, Ikuo (Sony) 197. Tung, Yi-Shin (MStar Semiconductor) 198. Ueda, Motoharu (JVC Kenwood) 199. Ugur, Kemal (Nokia) 200. Vermeirsch, Kenneth (Ghent Univ. -- IBBT) 201. Wahadaniah, Viktor (Panasonic Singapore Labs) 202. Wan, Wade (Broadcom) 203. Wang, David (Polycom) 204. Wang, Pulin (Huawei) 205. Wang, Xianglin (Qualcomm) 206. Wang, Ye-Kui (Huawei) 207. Wedi, Thomas (Panasonic) 208. Wegner, Krzysztof (Poznań Univ. Tech.) 209. Wen, Xing (Hong Kong Univ. Sci. & Tech.) 210. Weng, Ying (BBC) 211. Wiegand, Thomas (Fraunhofer HHI) 212. Winken, Martin (Fraunhofer HHI) 213. Won, Kwanghyun (Sungkyunkwan Univ.) 214. Wu, Ping (Mitsubishi Electric) 215. Wu, Zhixiong (OKI) 216. Xu, Jizheng (Microsoft) 217. Xu, Lidong (Intel) 218. Xu, Yian (Cisco Systems) 219. Yamakage, Tomoo (Toshiba) 264 220. Yamamoto, Tomoyuki (Sharp) 221. Yang, Fuzheng (Xi'dian Univ.) 222. Yang, Haitao (Huawei) 223. Yang, Hongzhang (Freescale Semiconductor) 224. Yang, Jungyoup (Sungkyunkwan Univ.) 225. Yang, Ming Yuan (Huawei) 226. Yang, Zhijie (Broadcom) 227. Ye, Yan (Dolby Labs) 228. Yea, Sehoon (LG) 229. Yeo, Chuohao (I2R) 230. Yoon, Daeil (Sejong Univ.) 231. Yoshino, Tomonobu (KDDI) 232. Yu, Haoping (Huawei) 233. Yu, Lu (Zhejiang Univ.) 234. Yu, Yong (Broadcom) 235. Yuan, Yuan (Tsinghua Univ.) 236. Zhang, Li (Peking Univ.) 237. Zhang, Louis (AMD) 238. Zhang, Wen (ZTE) 239. Zhao, Haiwu (Shanghai Univ.) 240. Zhao, Qing Yi (Tencent) 241. Zheng, Jianhua (Huawei) 242. Zheng, Xiaozhen (Huawei) 243. Zhou, Minhua (TI) 244. Zou, Bill (DTS) 265 Annex I – Audio report Source: Schuyler Quackenbush, Chair, Audio Subgroup 1 2 Opening Audio Plenary ......................................................................................................... 267 Administrative matters .......................................................................................................... 267 2.1 Communications from the Chair 267 2.2 Approval of agenda and allocation of contributions 267 2.3 Creation of Task Groups 267 2.4 Approval of previous meeting report 267 2.5 Review of AHG reports 267 2.6 Joint meetings 267 2.7 Received National Body Comments and Liaison matters 267 2.8 Plenary Discussion 267 3 Record of AhG meetings ....................................................................................................... 267 3.1 AhG Meeting on USAC -- Sunday 1000-1800 267 4 Task group activities ............................................................................................................. 275 4.1 Joint Meeting 275 4.1.1 With Requirements on Audio for HEVC (Wed 1400-1500)....................................... 275 4.2 Task Group discussions 275 4.2.1 MPEG-2, MPEG-4, MPEG-7, Audio Conformance, Reference Software, MPEG Surround ............................................................................................................................... 275 4.2.2 MPEG-D Spatial Audio Object Coding ...................................................................... 276 4.2.3 MPEG-D Unified Speech and Audio Coding ............................................................. 277 5 Closing Audio Plenary and meeting deliverables ................................................................. 289 5.1 Plenary discussions 289 5.2 Responses to Liaison and NB comments 289 5.3 Recommendations for final plenary 289 5.4 Establishment of Ad-hoc Groups 290 5.5 Approval of output documents 290 5.6 Press statement 290 5.7 Agenda for next meeting 290 5.8 All other business 290 5.9 Closing of the meeting 290 Annex A Participants ............................................................................................................... 291 Annex B Audio Contributions and Schedule .......................................................................... 292 Annex C Task Groups ............................................................................................................. 298 Annex D Output Documents ................................................................................................... 299 Annex E Agenda for the 95th MPEG Audio Meeting ............................................................. 299 266 1 Opening Audio Plenary The MPEG Audio Subgroup meeting was held during the 94th meeting of WG11, October 11-15, 2010, Guangzhou, China. The list of participants is given in 1. 2 2.1 Administrative matters Communications from the Chair The Chair summarised the issues raised at the Sunday evening Chair’s meeting, proposed task groups for the week, and proposed agenda items for discussion in Audio plenary. 2.2 Approval of agenda and allocation of contributions The agenda and schedule for the meeting was discussed, edited and approved. It shows the documents contributed to this meeting and presented to the Audio Subgroup, either in the task groups or in Audio plenary. The Chair brought relevant documents from Requirements, Systems to the attention of the group. It was revised in the course of the week to reflect the progress of the meeting, and the final version is shown in 2. 2.3 Creation of Task Groups Task groups were convened for the duration of the MPEG meeting, as shown in 3. Results of task group activities are reported below. 2.4 Approval of previous meeting report The Chair asked for approval of the 93rd Audio Subgroup meeting report, which was registered as a contribution. There was some discussion and wording concerning discussion in the Audio closing plenary was modified, and the revised report was approved. 2.5 Review of AHG reports There were no requests to review any of the AHG reports. 2.6 Joint meetings Who Req, Audio 2.7 What Audio for HEVC 3 3.1 When Wed, 14001500 Received National Body Comments and Liaison matters Num. M17985 2.8 Where Audio Source ITU-T SG 16 to SC 29/WG 11 Approval of a New Question on Telepresence systems Respond S. Quackenbush Plenary Discussion Record of AhG meetings AhG Meeting on USAC -- Sunday 1000-1800 USAC Text and Reference Software corrections Max Neuendorf, FhG, presented m18456 Corrections to Reference Software and CD of USAC 267 Max Neuendorf This contribution proposes changed to the USAC CD text and Reference Software. The various proposed changes are categorized into the following three categories: Changes to the CD text (which are only editorial) Description of spectral noiseless coding o Consistent use of variables o Clarification of arithmetic coding context o Correction to numerical constants Description of overlap-add for synthesis of output from adjacent coded frames Clarifications to MDCT-based TCX description Decoding coefficents for complex stereo prediction in MDCT Combination of SBR and MPS 212 Re-organization of CD text to conform to ISO directive Changes to the Reference Software Recalculation of pitch gain for use in the bass postfilter in TCX to ACELP transitions Corrections to handling higher bands in 32-band QMF analysis filterbank o Should not initialize high QMF bands to zero in unified stereo coding Changes to both CD text and Reference Software Simplification of AVQ bit stream syntax. This re-organizes the bit stream but does not affect the decoded waveform o Make all integer serialization msb first o Remove interleaving of AVQ coefficients The AhG recommends to Adopt the proposed changes to the CD text (which are only editorial) into a “Study on USAC CD” output document. Make the proposed changes to the USAC reference software Adopt the proposed changes to the CD text and make the associated changes to the USAC reference software Schuyler Quackenbush, ARL, presented m18077 Draft USAC CE Status and Workplan S. Quackenbush The presenter asked USAC CE proponents to please check the CE status tables and to bring them up to date so that producing the output document on Friday is that much easier. Improved bass- post filter Philippe Gournay , VoiceAge, presented m18435 VoiceAge listening test results for the Philippe Gournay, Roch enhanced bass-postfilter CE Lefebvre The contribution presented 16 kb/s listening test results for this CE. Systems under test are RM7 and RM7+CE. Seven items containing singing voice with music background were used as items in addition to the CfP test items. There was no difference for absolute scores. When considering differential scores, 4 of the 19 items are better and the mean is better at 95% level of significance. Max Neuendorf, FhG, presented m18451 FhG listening test report on CE on improving Guillaume Fuchs, Max the USAC bass post-filter Neuendorf The contribution presented 16 kb/s mono listening test results for this CE. There is no difference for absolute scores. When considering differential scores, 2 items are better, 1 item worse at the 95% level of significance. The contribution also reports that the CE software, as integrated into the decoder, was run and decoded the listening test bit streams to produce exactly the listening test decoded waveforms. David Virette, Huawei, presented 268 m18467 Report on cross-check listening test for the David Virette CE on improved Bass-post filter for USAC The contribution presented 16 kb/s mono listening test results for this CE. There is no difference for absolute scores. When considering differential scores, 1 item worse (es01) at the 95% level of significance. Kristofer Kjörling, Dolby, presented m18379 Finalization of CE on an improved bass- post Barbara Resch, Leif filter operation for the ACELP of USAC Sehlström, Heiko Purnhagen, Lars Villemoes, Kristofer Kjörling, Bruno Bessette, Philippe Gournay The contribution presented 16 kb/s mono listening test results for this CE. There is no difference for absolute scores. When considering differential scores, 4 items better, the mean better, none worse at the 95% level of significance. The contribution also presents a overview of the technology and a summary of the cross-check results. It notes that the current bass post-filter, used in ACELP coding mode, helps enhance the vocal signal. However in the case that there is background music with strong low-frequency harmonics, when the coder switches between ACELP, TXCX and FD modes, the suppression of the musical low harmonics comes and goes and is quite audible. When considering individual sites, for 2 items and for the mean, 2 of 4 sites agree on improvement. When all data is pooled, there are a total of 32 listeners. When considering differential scores, 4 items better, the mean better and none worse at the 95% level of significance. The AhG recommends adopting the CE technology into the “Study on USAC CD” output document. Complexity reduction for time warping Takeshi Norimatsu, Panasonic, presented m18360 Panasonic cross check report on complexity reduction for time warping in USAC Takeshi Norimatsu, Tomokazu Ishikawa, Haishan Zhong, Dan Zhao, Kok Seng Chong The contribution reports results for a 64 kb/s stereo listening test. It showed no significant differences in either absolute or differential scores, for any item or for the mean. Heiko Purnhagen, Dolby, presented m18375 Dolby listening test results for CE on Heiko Purnhagen, Kristofer complexity reduction for time warping in Kjörling USAC The contribution reports results for a 64 kb/s stereo listening test. It showed no significant differences for absolute scores. For differential scores, 1 item was better. Markus Multrus, FhG, presented m18452 Completion of the Core Experiment on Stefan Bayer Reducing the Complexity of the USAC Time Warping The contribution gives an overview of the CE technology and a summary of the listening test results. The technology uses time warping to reduce the pitch interval variations over a frame such that the pitch epochs have a more nearly harmonic frequency representation and thus can be more efficiently coded by the FD coder mode. The following table presents computational complexity. All numbers are WMOPS. RM8 RM8+CE TW tool 19.6 9.5 only Full 33.4 23.3 269 Decoder The contribution reports results for a 64 kb/s stereo listening test conducted at FhG. It showed no significant differences in either absolute or differential scores, for any item or for the mean. When listening test data are pooled over all test sites (25 listeners), it showed no significant differences in either absolute or differential scores, for any item or for the mean. Heiko Purnhagen, Dolby, reported that he decoded the listening test bitstreams (which were the RM8 bitstreams) using the CE proponent decoder and produced the waveforms that were used in the listening test. He then decoded the bitstreams using the RM8 decoder and produced the reference RM8 decoded waveforms. The AhG recommends adopting the CE technology into the “Study on USAC CD” output document. Improved SBR in USAC Toru Chinen, Sony, presented m18398 Sony listening test report on improved SBR in Toru Chinen, Masayuki USAC Nishiguchi The contribution reports the results of a 12 kb/s mono listening test. It showed no significant differences in either absolute or differential scores, for any item or for the mean. Max Neuendorf, FhG, presented m18431 FhG Listening Test Report – improved SBR Stephan Wilde, Max Neuendorf The contribution reports the results of a 12 kb/s mono listening test. It showed no significant differences in absolute scores. For differential scores, 2 items were better at the 95% level of significance. The contribution also reports on software verification: It confirms that the listening test bitstreams did decode to the listening test waveforms. Furthermore, it confirms that WD7 bitstreams decoded to the WD7 reference waveforms. Kristofer Kjörling, Dolby, presented m18378 Finalization of CE on improved SBR Kristofer Kjörling, Leif Sehlström The contribution gives an overview of the CE technology, presents new listening test results and presents all listening test results as pooled data. In RM7, SBR uses much of the machinery present in MPEG-4 SBR. One shortcoming present in SBR is that the copy-up envelope can have considerable discontinuities (as, for example, shown in es01), which might be so large that adjustment limiters prevent a target envelope from being achieved. The CE technology is a “pre-adjustment” gain stage which insures that the high-band envelope adjuster is able to make the adjustments needed in the high bank. Computational complexity is quite low, approximately 0.1 WMPOS. A control bit in the SBR header signals that the tool should be used from that point forward. Toru Chinen, Sony, noted that ARIB in Japan are currently using HE-AAC and send an SBR header very 500 ms. Kristofer Kjörling, Dolby, noted that the SBR encoder would usually send the SBR header whenever it was advantageous to change SBR configuration (which is specified in the SBR header). For the pooled data, two items are better at the 95% level of significance. The AhG recommends adopting the CE technology into the “Study on USAC CD” output document. Harmonic transposer Kihyun Choo, Samsung, presented m18372 Crosscheck report on harmonic transposer Kihyun Choo, Miyoung Kim, CEs Eunmi Oh The contribution reports the results of a 16 kb/s mono listening test that compares all configurations of the two CEs. There were no differences with respect to the absolute scores. 270 With respect to differential scores, it reports the following: QMF – FFT: 1 better, 1 worse. QMFxp – QMF: 1 worse. FFTxp – FFT: 1 worse Kimitaka Tsutsumi, NTT DOCOMO, presented m18466 NTT DOCOMO Cross-check Report on Kimitaka Tsutsumi, Kei Improved Harmonic Transposer in USAC Kikuiri, Nobuhiko Naka The contribution reports the results of a 16 kb/s mono listening test that compares all configurations of the two CEs. There were no differences with respect to the absolute scores. With respect to differential scores, it reports the following: QMF – FFT: 3 better QMFxp – QMF: no difference. FFTxp – FFT: 3 better, mean better In addition, the contribution reports that NTT DOCOMO confirms that the listening test bitstreams decode exactly to the listening test decoded waveforms. Jeff Huang, Qualcomm, presented m18459 Crosscheck listening test report for USAC on Jeff Huang FFT and QMF harmonic transposers The contribution reports the results of a 16 kb/s mono listening test that compares all configurations of the two CEs. There were no differences with respect to the absolute scores. With respect to differential scores, it reports the following: QMF – FFT: no difference David Virette, Huawei, presented m18468 Report on cross-check listening test for the David Virette Ces on QMF based harmonic transposer and improved harmonic transposer in USAC The contribution reports the results of a 16 kb/s mono listening test that compares all configurations of the two CEs. There were no differences with respect to the absolute scores. With respect to differential scores, it reports the following: QMF – FFT: 1 better QMFxp – QMF: no difference FFTxp – FFT: no difference In addition, the contribution reports that Huawei confirms that the listening test bitstreams decode exactly to the listening test decoded waveforms. Zhong Haishan, Panasonic, presented m18501 Panasonic crosscheck report on improved Zhong Haishan, Chong Kok harmonic transposer Seng, Zhao Dan, Takeshi Norimatsu, Tomokazu Ishikawa, Neo Sua Hong The contribution reports the results of a 16 kb/s mono listening test that compares all configurations of the two CEs. There were no differences with respect to the absolute scores. With respect to differential scores, it reports the following: QMFxp – QMF: 2 better FFTxp – FFT: 5 better QMFxp – FFTxp: no difference Zhong Haishan, Panasonic, presented m18386 Finalization of CE on QMF based harmonic 271 Haishan Zhong, Kok Seng transposer Chong, Takeshi Norimatsu, Tomokazu Ishikawa, Lars Villemoes, Per Ekstrand, Kristofer Kjörling, Stephan Wilde, Sascha Disch, Frederik Nagel, Max Neuendorf , The contribution gives an overview of the CE technology, including complexity information, and also presents a summary of all listening test results. The FFT transposer has high frequency resolution but also high complexity, while the QMF transposer has much lower complexity, as shown in the following table: Total WMOPS WMOPS WMOPS percentage Total WMOPS Configuration percentage Transposer Transposer Decoder Decoder only FFT based Harmonic Transposer with 10% oversampling frames 5.79 100% 9.42 100% (WD7) QMF based harmonic transposer 0.86 14.8% 4.49 47.7% When all listening test data is combined, analysis of differential MUSHRA scores shows: QMF – FFT: no difference Harmonic transposer – Cross products technology Kristofer Kjörling, Dolby, presented m18384 Finalization of CE on improved harmonic Lars Villemoes, Per Ekstrand, transposer in USAC Sascha Disch, Frederik Nagel The contribution presented an overview of the RM8 transposer technology as compared to the CE transposer technologies. It makes the point that the RM8 transposer results in many missing harmonics which are perceived as “ghost” higher fundamentals. The proposed cross-product technology permits “filling-in” the missing fundamentals via construction filterbank signals as a sum of adjacent low-band filter signals. QMF suffers when signals have very low fundamentals, since its low frequency resolution results in many distinct fundamentals mapping to the same quantized representation. When all listening test data is combined, analysis of differential MUSHRA scores shows: QMFxp – QMF: 4 better, mean better FFTxp – FFT: 4 better, mean better When looking at individual test sites: QMFxp – QMF: no strong consensus FFTxp – FFT: 3 items for which at least 3 of 6 agree The AhG recommends adopting the cross-product technology into the “Study on USAC CD” output document. Harmonic transposer – QMF technology Kristofer Kjörling, Dolby, presented m18389 Overview of performance of transposer proposals, and suggested decoding modes 272 Kristofer Kjörling, Haishan Zhong, Kok Seng Chong, Takeshi Norimatsu, Tomokazu Ishikawa, Lars Villemoes, Per Ekstrand, Stephan Wilde, Sascha Disch, Frederik Nagel, Max Neuendorf , The contribution presents an overview of the proposed transposer technology. There are two CEs which propose the following replacements or additions to the current RM7 FFT transposer technology: QMF (replacement or addition) Cross products (addition) It focuses on the 16 kb/s mono and stereo operating point because here the transposer requires the greatest fraction of decoder resources. For mono: QMF – 4.5 MOPS (reduces total decoder complexity to 48% of RM7) Cross products o FFT – 7.4 MOPS (reduces total decoder complexity to 78% of RM7) o QMF – 4.7 MOPS (reduces total decoder complexity to 50% of RM7) For stereo: QMF – 8.6 MOPS (reduces total decoder complexity to 63% of RM7) Cross products o FFT – 11.5 MOPS (reduces total decoder complexity to 85% of RM7) o QMF – 8.8 MOPS (reduces total decoder complexity to 65% of RM7) It notes that the quality of the QMF transposer is comparable to that of the FFT transposer. It further notes that incorporating cross products into the transposer provides a significant increase in quality while requiring either a decrease in complexity (FFT) or a modest increase in complexity (QMF). If a decoder with FFT and cross products is used as a baseline of 100%, then the QMF transproser with cross products results in a decoder that is 65% of the baseline complexity. In terms of quality, when all data is pooled, 1 item is better for the differential score QMFxp – FFTxp, however there is not strong agreement amongst the results from individual test sites. The contribution proposes that there be a “Low Power” and a “High Quality” decoding modes, where a single bitstream syntax can be decoded in either decoding mode. For each decoding mode, the transposers in each would be: Low Power: QMFxp High Quality: FFTxp Discussion The Chair noted that, in USAC, there is no “low power” mode defined in USAC. Hence, the Chair feels that the proposal is whether to have a FFTxp transposer (low complexity) or a QMFxp transposer (very low complexity) or both. Max Neuendorf, FhG, notes that when the individual test sites compare QMFxp versus FFTxp, for many sites there was judged to be 1 or more item worse (i.e. QMFxp worse than FFTxp). Werner Oomen, Philips, felt that there should be only one transposer in USAC and that the group should pick one. The Chair felt that there was not consensus in the group to make a decision at this time. The topic will be brought up again later in the MPEG week. T/F domain post-processing Kihyun Choo, Samsung, presented m18371 Crosscheck report on adaptive T/F domain Kihyun Choo, Miyoung Kim, post-processing for USAC Eunmi Oh The contribution presents results of a listening test comparing WD6+CE and WD6. The operating points and test results for the statistic (WD6+CE - WD6) were 12 kb/s mono 273 o Absolute scores: no difference o Differential scores: 1 better 8 kb/s mono o Absolute scores: no difference o Differential scores: 2 better Heiko Purnhagen, Dolby, presented m18373 Dolby listening test results for CE on T/F Heiko Purnhagen, Kristofer post-processing in USAC Kjörling The contribution presents results of a listening test comparing WD6+CE and WD6. The operating points and test results for the statistic (WD6+CE - WD6) were 12 kb/s mono o Absolute scores: no difference o Differential scores: no difference 8 kb/s mono o Absolute scores: no difference o Differential scores: 2 better, 1 worse (Normal distribution) or 1 better (Student t distribution) Jeff Huang, Qualcomm, presented m18461 Crosscheck listening test report for USAC on Jeff Huang time frequency domain post-processing The contribution presents results of a listening test comparing WD6+CE and WD6. The operating points and test results for the statistic (WD6+CE - WD6) were 12 kb/s mono o Absolute scores: no difference o Differential scores: : 3 better, 1 worse David Virette, Huawei, presented m18471 Finalization of CE on adaptive T/F domain David Virette, Wei Xiao post-processing for USAC The contribution presents results of a listening test comparing WD6+CE and WD6. The operating points and test results for the statistic (WD6+CE - WD6) were 12 kb/s mono o Absolute scores: no difference o Differential scores: 3 better, mean better 8 kb/s mono o Absolute scores: no difference o Differential scores: 5 better, mean better When data from all test sites is pooled 12 kb/s mono o Absolute scores: no difference o Differential scores: 4 better, mean better 8 kb/s mono o Absolute scores: no difference o Differential scores: 5 better, mean better It reviewed the complexity of the CE technology, which is shown in the following table: Average Maximum RM6 PCU PCU mono@8kbps 0.24 0.56 mono@12kbps 0.31 0.73 8 274 The presenter noted that the post-processing control bits are transmitted in the bit stream only if the coding mode is LP. Kristofer Kjörling, Dolby, noted that the Spectrum Flattening Post Processing seems similar to the “Improved SBR” tool, which also helps to flatten the SBR HF envelope. The Chair asked how this post processor compares to the “Improved Bass Post-Filter” Philippe Gournay , VoiceAge noted that the Bass Post Filter was limited to processing the signal below 500 Hz. The presenter noted that this technology does noise shaping across the spectrum. The Chair felt that there was not consensus in the group to make a decision at this time. The topic will be brought up again later in the MPEG week. 4 Task group activities 4.1 Joint Meeting 4.1.1 With Requirements on Audio for HEVC (Wed 1400-1500) The Requirements and Audio Chairs reviewed a document that presented a number of application scenarios for new audio work, the most prominent of which was audio coding and presentation for visual display systems that make use of High Efficiency Video Coding (HEVC). Such visual displays might be Ultra-HD (UHD) devices, such as 4K x 2K displays. In such displays, a much closer viewing distance is feasible and perhaps desirable. At such viewing distances there is both visual and audio envelopment. This could have significant impact on audio presentation such that much more accurate sound localization in terms of direction and distance is desirable. If there is only one viewer, it might be that aspects of the audio presentation might be individualized in some meaningful way. The Chairs captured comments from the group, with respect to relevant application scenarios, relevant technology and, most importantly, relevant requirements. The results of the joint meeting are captured in the output document “Audio for HEVC.” 4.2 Task Group discussions 4.2.1 MPEG-2, MPEG-4, MPEG-7, Audio Conformance, Reference Software, MPEG Surround Ferenc Kraemer, Dolby, presented m18406 Proposed addition to Intensity Stereo in Ferenc Kraemer, Heiko ISO/IEC 14496-26:2009, Audio conformance Purnhagen The contribution notes that many AAC encoders and decoders have a bug in the intensity stereo processing. The bug entails the interaction between the intensity codebooks indicated (INTENSITY_HCB and INTENSITY_HCB2, numbers 14 and 15) and the ms_mask bit for a scale factor band. The correct behaviour is shown in the following table: Intensity in-phase IS Codebook M/S Bitmask Field Phase Position of Left and Right Intensity out-of-phase INTENSITY_HCB INTENSITY_HCB INTENSITY_HCB2 INTENSITY_HCB2 (#15) (#15) (#14) (#14) 0 1 0 1 0° 180° 180° 0° 275 The conbribution notes that the ms_mask_present flag can be used to “preset” the values of the ms_mask array. A buggy encoder or decoder ignores the role that the “preset” or “implicit” values of the ms_mask bit plays when constructing the desired intensity phase. The Chair asked the group to consider whether additional text in the informative part of the MPEG4 AAC and MPEG-4 AAC specifications to clarify the new encoder behaviour might be an appropriate additional action. It was the consensus of the Audio Subgroup to issue two DCOR, one against MPEG-2 Conformance and one against MPEG-4 Audio Conformance. Ferenc Kraemer, Dolby, presented m18426 Additional information on MPEG Surround Andreas Hölzer, Christian conformance testing Ertel, Markus Lohwasser, Michael Härtl, Ferenc Kraemer, Frans de Bont The contribution presents data on the output of the MPEG Surround conformance tool. It report on the decoding performance, in terms of LSB’s of difference relative to a given reference, of a number of important platforms, e.g. floating point and fixed point. The presenter notes that MPEG Surround conformance criteria must be defined and expects to bring more information to the next meeting. 4.2.2 MPEG-D Spatial Audio Object Coding Leon Terentiv, FhG, presented m18436 Report on corrections for MPEG SAOC Jonas Engdegård, Heiko Purnhagen, Oliver Hellmuth, Jürgen Herre, Cornelia Falch, Leon Terentiv, Maria Luis Valero, Johannes Hilpert, Andreas Hölzer, Werner Oomen The contribution proposes two small corrections to the Low Delay MPEG Surround part of the SAOC specification. It was the consensus of the Audio Subgroup to incorporate the proposed changes into a revised Defect Report that is an output of this meeting. Oliver Hellmuth, FhG, presented m18437 Revised draft of SAOC verification test report Jonas Engdegård, Heiko Purnhagen, Oliver Hellmuth, Jürgen Herre, Cornelia Falch, Leon Terentiv, Maria Luis Valero, Johannes Hilpert, Andreas Hölzer, Werner Oomen The contribution is a draft of the SAOC test report. The presenter reviewed the document. It was noted that perhas results for individual items should be added to the report, but as an Annex. The Chair noted that figures that graphically It is the consensus of the ASG to o Issue the SAOC verification test report at this meeting, with an editing period. This will be a public document. o Look forward to the “Karaoke” solution at the 95th meeting. It is expected to be added to the SAOC Defect Report (from this meeting) and to issue as a DCOR at the 95th meeting. o Based on info at the 95th meeting, decide on whether to update the VRT with a Karaoke test. Such a revised VTR would issue at the 96th meeting. 276 4.2.3 MPEG-D Unified Speech and Audio Coding CE Educational Material Markus Multrus, FhG, presented m18454 Informative Encoder Description for USAC Vignesh Subbaraman, Markus Improved Noiseless Coding CE Multrus, Kihyun Choo The contribution contains text for the informative annex of the USAC specification that describes how to build the spectral arithmetic coding tool for the USAC encoder. The presenter verified that the MPEG Reference Encoder has full source code for the USAC Improved Noiseless Coding CE functionality. Kei Kikuiri, NTT DOCOMO, presented m18473 Educational Information on Encoder Kei Kikuiri, Atsushi Implementation of inter-TES tool in USAC Yamaguchi, Nobuhiko Naka The contribution contains text for the informative annex of the USAC specification which describes how to build the inter-TES tool in the USAC encoder. The presenter verified that the MPEG Reference Encoder has only bitstream syntax support for the USAC inter-TES tool. It was the consensus of the Audio Subgroup that the contribution provides sufficient educational information. It was noted that Markus Multrus, FhG, reported in an email (Sep 24, 2010, 16:35 CEST) that the inter-TES normative decoder software has been integrated into the USAC reference decoder. Hence this CE is concluded. USAC Performance Gregory Pallone, Orange Labs, presented m18434 Report on USAC performance Gregory Pallone, Pierrick Philippe The contribution reports on a listening test that compares the performance of USAC, MPEG-4 AAC and MPEG-4 HE-AAC. Systems under test were: aac+64: HE AAC encoder @ 64kbps aac96: AAC encoder @ 96kbps aac128: AAC encoder @ 128kbps usac64: reference quality bitstream of WD7 @ 64 kbps usac96: reference quality bitstream of WD7+CE (improved stereo) @ 96 kbps There were ten subjects after post-screening. When averaged over all listeners and all test items, USAC demonstrates a slight increase in performance (in the mean) for the range of bit rates tested. USAC Configuration, Payload and Transport Issues Markus Multrus, FhG, presented m18480 Comments on further USAC investigation Max Neuendorf, Markus Multrus, Stefan Doehla, Werner Oomen, Heiko Purnhagen This contribution presents a more detailed list of requirements that should be addressed in the final steps of the USAC development work. It divides the requirements into three broad categories: Configuration of decoder Payload (i.e. Access Unit) Transport (of payloads) Contributions on this topic should refer to what aspect of the “requirements” (as put forth in this document) is addressed. It was agreed to make the body of the contribution a section of the USAC CE Status and Workplan document. 277 MPEG Reference Encoder David Virette, Huawei, presented m18470 A new signal classifier for USAC reference David Virette, Lijing Xu, Wei encoder Xiao The contribution describes a new software module for the USAC MPEG Reference Encoder that analyzes the input signal and determines which coding mode should be applied. The performance of this classifier was compared to both the current MPEG Reference Encoder and the RM7 bitstream classification decisions. It was the consensus of the Audio Subgroup to incorporate this new module into the SVN trunk of the USAC Reference Encoder. Jeongook Song, Yonsei University, presented m18429 Status report on USAC Reference Software Jeongook Song, Henney Oh, JAME Hong-Goo Kang The contribution reports on the status of the JAME USAC encoder project. In the AhG period running up to the 94th MPEG meeting, work was done to revise the psychoacoustic model tool and corresponding bit-allocation loop tool, but that work is not yet complete and so no new code release is available at this MPEG meeting. The Chair hopes that, at the 95th MPEG meeting, a workplan can be created that coordinates a listening test done by interested companies in the MPEG Audio Subgroup that will benchmark the performance of the JAME software encoder against the Reference Quality Encoder. Additional bandwidth extension Henney Oh, Yonsei University, presented m18428 Yonsei Crosscheck listening test report on Jeongook Song, Eunwoo additional bandwidth extension CE for USAC Song, Henney Oh The contribution presents listening test results for USAC operating at 8 kb/s for mono signals. It compares RM7+CE and RM7. For absolute scores there were no differences at the 95% level of significance. When analyzing differential scores, 3 items were worse and the mean was worse at the 95% level of significance assuming a Normal distribution. When analyzed with Student-t distribution, 1 worse mean worse at the 95% level of significance. David Virette, Huawei, presented m18472 Progress report on additional bandwidth David Virette, Wei Xiao extension CE for USAC at low bit rates The contribution gave an overview of the CE technology. This experiment had a bitrate of 8 kb/s and a sampling rate of 19200 Hz. The technology extends the bandwidth up to 9.6 kHz. Two architecutes were presented: one is “outside” of the SBR framework, which is appropriate for SBR extension that covers nearly up to the Nyquist frequency (typically when the internal sampling rate is adjusted lower for coding efficiency). The other is “inside” the SBR framework, which is appropriate when the SBR extension is far short of the Nyquist frequency. The contribution presents results of a listening test of RM7+CE and RM7. For absolute scores there were no differences at the 95% level of significance. When analyzing differential scores, 1 item was at the 95% level of significance Anecdotally, the presenter noted that listeners had to weigh the perceived effects of greater bandwidth and additional noise. Next steps for this CE will be supported by the USAC CE Workplan. Increased structural flexibility in SBR 278 Max Neuendorf, FhG, presented m18432 Status update of CE proposal on increased structural flexibility in SBR Stephan Wilde, Markus Multrus, Max Neuendorf, Kristofer Kjörling, Heiko Purnhagen The contribution presented an overview of the proposed technology. The basic idea is to extend the SBR functionality from 1:2 extension to 1:4 extension. The 1:4 extension mode is appropriate for very low bit rate systems (e.g. 12 kb/s) which may have very low internal sampling rates in the encoder/decoder framework. The 1:4 extension is achieved by using only 16 of the existing 64 bands of the QMF filterbank for the core coder signal. It notes that a similar technology has already been specified in the DRM system. The contribution also presents results of a listening test for RM7+CE versus RM7 for the 8 kb/s mono operating mode. For absolute scores, 1 item is better and the mean is better. For differential scores, 9 items are better and the mean is better. The CE technology brings 4 MUSHRA points improvement for the mean and up to 7 points for Music3. The presenter noted that RM7 and RM7+CE both uses an core coder sampling rate of 9.6 kHz, so that the RM7 output sampling rate is 19.2 kHz and the RM7+CE output sampling rate is 2*19.2 kHz. The presenter expects to bring a complete CE proposal to the 96th MPEG meeting. Enhanced long term predictor (eLTP) Henney Oh, Yonsei University, presented m18430 Updated CE on enhanced long term predictor Jeongook Song ,Henney (eLTP) for USAC Oh ,Hong-Goo Kang The contribution presents an enhanced functionality, in that the eLTP tool now operates for all USAC core coding modes. The Audio Subgroup looks forward to more information at the next meeting. TCX windowing Seungkwon Beack, ETRI, presented m18234 Progress report on the TCX windowing CE Taejin Lee, Seungkwon Beack, for USAC Minje Kim, Kyeongok Kang The contribution noted that there was not sufficient time in the AhG period to execute all aspects of the workplan, so that there are no new results to report at this meeting. The Audio Subgroup looks forward to more information at the next meeting. Time warping Zhao Dan, Panasonic, presented m18419 Status report of time warping CE in USAC Zhao Dan, Zhong Haishan, Chong Kok Seng, Takeshi Norimatsu, Tomokazu Ishikawa, Neo Sua Hong The contribution presented an overview of the time warping technology. It also presented results of a listening test for the operating modes 24 kb/s mono and 20 kb/s mono. Systems under test were: No_TW (WD6 no time warping) WDQ (WD6 time warping forced to be active) TW_CE (WD6+CE) Statistic Mode Performance (differential score analysis) 279 TW_CE – No_TW TW_CE – WDQ TW_CE – No_TW TW_CE – WDQ 24 kb/s 24 kb/s 20 kb/s 20 kb/s 3 items better, mean better 3 items better, mean better 8 better, mean better 4 items better, mean better The contribution notes that the CE technology provides some bit savings (up to 2%) and also accommodates a higher dynamic range in the time warping parameter (via Huffman compression). The Audio Subgroup looks forward to more information at the next meeting. Pulse Indexing Toru Chinen, Sony, presented m18396 Sony listening test report on pulse indexing in Toru Chinen, Masayuki USAC Nishiguchi The contribution presents results of a listening test which evaluated the RM7+CE versus RM7 at 20 kb/s for mono signals. All 15 test items were clean speech. It showed no differences when analyzing either absolute or differential scores at the 95% level of significance using Student-t distribution. David Virette, Huawei, presented m18469 Finalization of Enhanced Pulse Indexing CE David Virette, Dejun Zhang, for ACELP in USAC Fuwei Ma The contribution summarizes the Enhanced Pulse Indexing CE technology. A theoretical analysis showed that it is able to provide the following additional compression efficiency: ACELP Bit savings Mode per frame 18k4 8 or 16 16k4 8 to 16 14k4 4 to 8 The contribution also presents the results of a listening test. When analyzing absolute scores, there were no differences at the 95% level of significance. When analyzing differential scores, 2 items were better at the 95% level of significance using Normal distribution. The Sony and Huawei listening data were pooled and an analysis of the differential scores showed 1 item better (nadib2) and 1 worse (nadib1). This was discussed later in the MPEG week (see later in this section). PVC for SBR Takeshi Norimatsu, Panasonic, presented m18361 Panasonic cross check report on PVC for SBR Takeshi Norimatsu, Tomokazu envelope coding in USAC Ishikawa, Haishan Zhong, Dan Zhao, Kok Seng Chong The contribution gave results of a listening test which evaluated the RM7+CE versus RM7 at 12 kb/s for mono signals. It showed no differences when analyzing absolute scores at the 95% level of significance using Normal distribution. When analyzing differential scores, 3 items were better at the 95% level of significance using Normal distribution, 2 items better when using the Student-t distribution. The contribution also reports on verification of the RM7+CE decoder: all listening test bitstreams were decoded to produce exactly the WAV files used in the listening test. Heiko Purnhagen, Dolby, presented m18374 Dolby listening test results for CE on PVC for Heiko Purnhagen, Kristofer SBR in USAC Kjörling 280 The contribution gave results of a listening test which evaluated the RM7+CE versus RM7 at 20 kb/s for mono signals. It showed no differences when analyzing absolute scores at the 95% level of significance using Normal distribution. When analyzing differential scores, 2 items were better and 1 worse at the 95% level of significance using Normal distribution, 1 items better and 1 worse when using the Student-t distribution. Markus Multrus, FhG, presented m18455 FhG Listening Test Report – PVC for SBR Frederik Nagel envelope coding The contribution gave results of a listening test which evaluated the RM7+CE versus RM7 at 20 kb/s for mono signals. It showed no differences when analyzing absolute scores at the 95% level of significance using Normal distribution. When analyzing differential scores, 3 items were better and 2 worse at the 95% level of significance using Normal distribution and 2 better, 2 worse if Studentt distribution is used. Toru Chinen, Sony, presented m18399 Report on PVC CE for SBR in USAC Toru Chinen, Yuki Yamamoto, Mitsuyuki Hatanaka, Masayuki Nishiguchi The contribution gave an overview of the Predictive Vector Coding (PVC). The CE aims at reducing encoding bit rate and increasing the subjective quality by modifying the delta coding scheme of SBR envelope scalefactors. The proposed scheme is based on the prediction of SBR envelope scalefactors by using the energy of QMF subband samples below SBR range. The indices of prediction coefficient matrices are coded by using vector quantization. PVC increases the complexity of the SBR tool by 0.4 WMOPS (peak load) and requires 1096 bytes of additional table storage. It provides 1.44% additional compression efficiency as averaged over all CfP test items (as compared to RM7 reference bit streams). The contribution gave results of a listening test at Sony. When analyzing differential scores, 6 items were better and the mean was better. When all data is pooled, 5 items are better and the mean is better. Of these, 1 item shows an improvement of 8 MUSHRA points, 2 items have show an improvement of 5 points. When looking at individual test site results, 2 items are better for 2 of 4 sites, 2 items are better for 3 of 4 sites. This was discussed later in the MPEG week (see later in this section). Tonal component coding in eSBR Marek Domanski, Poznan University, presented m18532 Telcordia and PUT listening test results for Tomasz Zernicki ,Maciej CE on improved tonal component coding in Bartkowiak ,Marek Domanski eSBR (USAC) The contribution reviewed the CE technology. It proposes a separate sinusoidal encoder/decoder and separate bit stream of sinusoidal model parameters, as shown in the following figure. The sinusoidal model parameter bit stream is typically 2 kb/s. The sinusoidal modelling is only run for the SBR high-band region. 281 It presents results of a listening test at 16 kb/s and 20 kb/s. When analyzing differential scores of RM7+CE – RM7, at 16 kb/s 3 of the new items with many high-frequency tonal components showed improvement (e.g. saxophone, violin, accordion). None of the CfP items showed any degradation. 20 kb/s 4 of the new items showed improvement. The presenter clarified that the RM7 encoder was the MPEG Reference Encoder compiled and run on the CfP items and the new CE items. As a next step, it was agreed to draft a workplan in which the CE listening test experiment is repeated, but using the Reference Quality Encoder. New CEs New excitation coding for LPD mode Roch Lefebvre, VoiceAge, presented m18481 Proposed CE for extending the LPD mode in Bruno Bessette ,Philippe USAC Gournay, Roch Lefebvre The contribution proposes to add an additional excitation coding method for use in ACEP coding mode. Memory for next frame adaptive codebook decoded pitch parameter s shift for next frame new decoded prequantizer parameter s decoded ACELP parameter s excitation signal LPC Synthesis filter inverse DCT AVQ decoder … ACELP codebook The additional tool gives the following advantages: Reduce the time and frequency dynamice of the residual (LPC excitation) Better control of bit allocation for LPC excitation More explicit control of coding noise in ACELP frame. Re-use existing tools Achieve higher quality for the ACELP coding mode at higher bit rates (i.e. above 32 kb/s) The contribution presents the results of a listening test for stereo signals. The following Systems or Codec under Test (CuT) were tested: USAC WD6 at 32 kbps stereo (switched FD/LPD) Modified USAC WD6 (CuT) at 48 kbps stereo (LPD only) Modified USAC WD6 (CuT) at 64 kbps stereo (LPD only) USAC WD6 at 64 kbps stereo (FD only) 282 The results show that for the differential statistics (CuT at 48 kbps – USAC WD6 at 32 kbps) – all items and the mean showed improvement at the 95% level of significance. (CuT at 64 kbps – USAC WD6 at 64 kbps) – 5 better, 1 worse, mean better, at the 95% level of significance. The proponent expects to bring a complete CE to the next meeting. It was agreed to draft a workplan to coordinate cross-check efforts. Enhanced Performance at Mid Bitrates Markus Multrus, FhG, presented m18479 Proposed Core-Experiment on Enhancing the Markus Multrus, Philippe USAC Codec at Mid Bitrates Gournay, Nikolaus Rettelbach, Bruno Bessette, Bernhard Grill The contribution notes that USAC can switch between a block-based transform coder and a LP coder. The transform coder has a fixed block length and cannot adjust its time resolution (e.g. via internal sampling rate conversion) when switching to LP mode must be accommodated. It further notes that temporal resolution can be achieved via either of two means: Shorten transform frame size Increase sampling rate The proposed technology used both methods to achieve higher time resolution at mid bit rates (e.g. 24 kb/s). The proposed new operating mode is shown in the following figure: USAC, proposed extra setting: f = 3/8 fout f =3/8 fout Core-coder framelength: Analysis QMF 24 bands f = f out SBR Upsamling Factor 8/3 768 samples f = f out MPS Decoder f = f out Synthesis QMF 64 bands 2048 samples The computational complexity of the proposed system is comparable to a system run at 44.1 kHz sampling rate (with a 1:2 SBR configuration). The memory complexity, comprised of tables for the new transform block lengths, is approximately 960 32-bit works. The contribution presents results of a listening test of 24 kb/s mono. When differential scores are analyzed: (RM7+CE – RM7), 4 items are better at the 95% level of significance. The presenter requests that a workplan that will specify an experimental setup and coordinate crosschecks, and expects to bring a complete CE to the next meeting. Improved stereo coding David Virette, Huawei, presented m18474 CE proposal on improved stereo coding at David Virette low bit rates The contribution describes the problem in USAC stereo coding that is addressed by the CE technology. For low bit rate stereo coding, when only CLD and ICC parameters are used, the upmix matrix may have a wrong behaviour for negative ICC parameters. The proposed technology uses an IPD estimation to better reconstruct the stereo signal in case of negative ICC. The contribution reports the results of a listening test at 20 kb/s stereo. With differential analysis of the statistic (H6+CE – H6), there is 1 item better and mean better at the 95% level of significance, where “H6” is Huawei’s internal USAC encoder which employs the same the tools as in RM6. Werner Oomen, Philis, noted that there are already means used in USAC to neutralize the antiphase, and it might be interesting to compare the CE technology with current USAC tools. 283 The presenter requests that a workplan that will specify an experimental setup and coordinate crosschecks, and expects to bring a complete CE to the next meeting. Lower-Complexity Decorrelator Julien Robilliard, FhG, presented m18433 Proposed decorrelator improvements in Julien Robilliard, Matthias USAC Neusinger, Johannes Hilpert, Erik Schuijers, Bert den Brinker, Werner Oomen The contribution presented results of an investigation on reducing the complexity of the MPS 212 decorrelator without compromising audio quality. A complexity analysis of the current and proposed decorrelator is shown in the table below. The main point is that the decorrelator complexity is reduced by more than 50%, and that the complete decoder complexity is reduced by nearly 15%. Complexity measure (MOPS) RM8 Complexity Proposed filter filter reduction Decorrelator tool 1.7 3.6 52.8% Complete USAC decoder 11.2 13.1 14.5% The contribution presents results of a listening test for 32 kb/s stereo, showing no significant differences for any item at the 95% level of significance. The presenter requests that a workplan that will specify an experimental setup and coordinate crosschecks, and expects to bring a complete CE to the next meeting. Improved applause coding Julien Robilliard, FhG, presented m18413 CE proposal on improved applause coding in Achim Kuntz, Sascha Disch, USAC Erik Schuijers, Werner Oomen The contribution is a joint proposal from FhG-IIS and Philips on Transient Steering Decorrelator (TSD). It notes that the parametric stereo technology inherited from MPEG Surround has a number of shortcomings: Narrowed sound stage Lack of envelopment Sound coloration The solution proposed it to add a block that separates out transient components and passes them through a new block especially adapted for decorrelating transient signals. This is shown in the following block diagram: 284 Results of a listening test at 32 kb/s stereo were presented. Analysis of absolute MUSHRA scores showed that 3 “applause” items showed significant improvement (more than 10 MUSHRA points) and the 95% level of significance. Analysis of differential scores showed that all items are improved at the 95% level of significance. The presenter requests that a workplan that will specify an experimental setup and coordinate crosschecks, and expects to bring a complete CE to the next meeting. Enhanced Mode Transitions Sangoh Jeong, LG, presented m18393 Core Experiment on Enhanced Mode Kiho Cho, Nam Soo Kim, Transitions in USAC Sungyong Yoon, Sangoh Jeong The contribution proposes a technology for transitions between TCX and ACELP frames and from FD to ACELP frames. The main idea is shown in the following figures. It notes that the proposed technology can save approximately 1% of the RM7 bitrate, depending on the operating mode. The complexity of the proposal is less than the FAC operation. The results of a listening test are reported for 20 kb/s mono and 12 kb/s mono. It shows no differences for either absolute scores or differential scores. Philippe Gournay, VoiceAge, noted that the proposed technology has two potential shortcomings: Zero the filter state on the MDCT (i.e. TXC, FD) to ACELP transition 285 Base the TDAC on ACELP encoded signal Max Neuendorf, FhG, agreed with Philippe Gournay in that FhG experimented with a LP filter state reset and found that it had a negative impact on audio quality. He further noted that the adaptive codebook in ACELP assumes a “time forward” sequence in the residual, while the folding operation results in a “time reversed” interval that will not be efficiently encoded. The presenter requests that a workplan that will specify an experimental setup and coordinate crosschecks, and expects to bring a complete CE to the next meeting. Unified global gain syntax Max Neuendorf, FhG, presented m18450 Proposed unified global gain syntax element Guillaume Fuchs, Markus in USAC Multrus, Max Neuendorf The contribution proposes to modify syntax elements of the USAC bit stream to support a means to change the of the USAC output level via changing a single global gain element. The current gain elements are shown here: Mode Gain element FD 8-bit “Global gain” per 1024 length blocks TCX 7-bit “TXC global gain” per 256 to 1024 length blocks ACELP 2-bit “Mean energy” per 256 length block SBR Codes absolute envelope energy. (This does not automatically scale when global gain of core coder changes.) The proposed modifications are shown here: Mode Gain element FD Stays the same: 8-bit “Global gain” per 1024 length blocks TCX 1 bit refinement: 7-bit “TXC global gain” per 256 to 1024 length blocks ACELP Remove “Mean energy” SBR Code absolute envelope energy realative to the core codec energy. This is shown graphically in the following figure: 286 Since the proposed change is not “noiseless,” the contribution presents the results of a listening test at 16 kb/s mono. One system under test was the proposed technology, another was the proposed technology, but decreased in level via the 8-bit Global gain element and then increased in level in the decoded PCM domain. When examining absolute MUSHRA scores, there is no difference at the 95% level of significance. When examining differential scores, there is no difference at the 95% level of significance. Heiko Purnhagen, Dolby, asked what requirement is addressed with the proposal. In particular, he noted that the SBR high-band level adjustment is only responsive to an SBR header. Hence, adjustments in global gain will not impact the SBR high-band until a SBR header occurs in the bitstream. The presenter requests that a workplan that will specify an experimental setup and coordinate crosschecks, and expects to bring a complete CE to the next meeting. Additional discussion on PVC for SBR Toru Chinen, Sony, made a presentation on a proposed workplan. This was discussed and it was the consensus of the Audio subgroup draft a workplan to execute the points: Gather new evidence based on RM7 code base. Bring evidence of performance, as a listening test, at one additional operating point, e.g. 8kb/s. If the PVC bit stream and decoded output does not change for the 12 kb/s operating point, then that test does not have to be repeated. Additional discussion on T/F domain post processing David Virette, Huawei, made a presentation on the performance of T/F domain post processing at 8 kb/s and 12 kb/s mono. The new information pertains to calculating the 95% confidence intervals using the Student-t distribution. In addition, he presented information on the complexity of the two modules proposed in the CE. He presented a comparison of the role of the eTES processing and the T/F post processing. He noted that The eTES processing sharply adjusted the time-varing energy envelope The T/F domain post processing applies a smooth emphasis to the spectral peaks and smooth de-emphasis to the spectral valleys, as in classical speech post processing. He presented a graphic showing per-item listening test results, and the extent that the two modules (flattening and post-processing) were active in the frames of each item. He concluded by noting that post-filtering is well known in the speech coding community and has been shown to provide performance improvements at low bit rates. Finally, he requests that the post-processing tool be incorporated, and to rely on the Dolby improved SBR envelope flattening to perform the spectral flattening function. There was considerable discussion. In particular, it was noted that it is not clear Which flattening technique is best (CE from Dolby or CE from Huawei) Whether the T/F post processing provides merit in addition to the spectral flattening The workplan will conduct additional listening tests at 12 kb/s mono and 8 kb/s mono operating points. The systems under test will be: 1. RM7+iSBR 2. RM7+iSBR+T/Fpp 3. RM7+CEfull (flattening always active) where iSBR is “Improved SBR” technology, which flattens the high-band spectral envelope 287 T/Fpp is the Time/Frequency post processing filter CEfull is both the spectral flattening module and the Time/Frequency post processing module proposed in the CE In system 3, above, the spectral flattening module will be tuned so that it is active for all content categories, namely speech, speech+music and music. Based on the MUSHRA scores, the following difference statistics would be analyzed (where numbers refer to systems, as labelled above). For each diff score, if there is significant improvement, then an action is indicated. Diff Scor e 2-1 3-2 Conclusion if significant positive value Incorporate iSBR + T/Fpp Replace iSBR with CEflattening and incorporate T/Fpp 3-1 Additional discussion on QMF transposer KK, Dolby, made a presentation on next steps on QMF transposer. The decisions are Keep both QMF and FFT transposer Replace FFT transposer by QMF transposer The workplan will conduct additional listening tests at 12 kb/s mono and 8 kb/s mono operating points. The baseline software will be RM7 plus the transposer cross-products technology. RM8 Reference Software Markus Multrus, FhG, discussed an issue pertaining to the RM8 reference software. A bug was discovered in the decoder when it operates at 64 kb/s and with the time warping tool active. The bug pertained to the window size that was a float, was stored as an int but then used again as a float. The result was that, on second use, the window size was off by one. This bug resulted in a maximum difference of 42 in the decoded waveform, although the RMS error was much smaller. It was the consensus of the Audio Subgroup to: Correct the RM8 Reference Software Correct the RM8 decoded waveforms An email to the mpeg-audio-call reflector reported that this update was done during the MPEG meeting. Performance of MPEG Reference Encoder Software The Chair stated that it would be very desirable for the Audio Subgroup to conduct a small-scale but well controlled listening test to measure the performance of the MPEG Reference Encoder software. There was good discussion on this proposal and the following issues were identified. It is believed that the JAME software project represent the best version of the MPEG Reference Encoder software. The Yonsei experts verified that the JAME software is not encumbered by external (i.e. non-MPEG) copyright, such that all source code files can have the MPEG copyright header. The Chair noted that such a listening test could be conducted in April, 2011, in which case it will not conflict with the USAC Verification Test work, which is envisioned to have listening tests in June, 2011. A test in the April, 2011 time frame could use the Reference Quality bit streams made available from the March MPEG meeting, which should be identical or at least very close in performance to the Reference Quality bit streams used in the USAC Verification Test. USAC Timeline and CE Logistics 288 The Chair presented a timeline for USAC activities, shown in the table below. This was discussed and audio experts agreed to this schedule. However, the Chair acknowledged that the new ballot intervals under which USAC must operate plus DIS editing period and ballot processing could mean that the DIS ballot results are not available at the 97th meeting, thus making it impossible to progress to FDIS in July 2010. Meeting Date Activity 94th Oct Study on CD 2010 95th Jan The default position is that all CEs 2011 must be complete at this meeting. Exceptions will be discussed on a case by case basis. DoC on CD DIS text 96th Mar Workplan for Verification Test 2011 May Integrate CE reference software into 2011 normative decoder Jun, Conduct Verification Test listening 2011 tests and make results available as Excel spreadsheet th 97 Jul DoC on DIS 2011 FDIS text ProduceVerification Test Report Sep IS 2011 Name for USAC specification The Chair discussed the need for a name for the USAC specification, and expressed a strong interest in having that name at the 95th MPEG meeting. The Chair suggested the following name and acronym, and asked audio experts to please come to the next meeting with something much better: eXtented Audio Coding – XAC During the meeting, other experts suggested: NSA – Next Stage Audio FEAC – Future Entertainment Audio Coder FRAC – Full Range Audio Coder C-AAC – Content Agnostic Audio Coder MMAC – MPEG Media Audio Coder 5 5.1 Closing Audio Plenary and meeting deliverables Plenary discussions There were none. 5.2 Responses to Liaison and NB comments Liaison statement and NB comment responses generated by Audio were presented and approved. 5.3 Recommendations for final plenary The Audio recommendations were presented and approved. 289 5.4 Establishment of Ad-hoc Groups The ad-hoc groups shown in the following table were established by the Audio subgroup. Unless otherwise indicated, Ad Hoc group meetings will be held at the location of the next MPEG meeting on the weekend preceding that meeting. No. Title Mtg 11662 AHG on MPEG Surround, SAOC, USAC 0900-1800 on Sunday prior to meeting 5.5 Approval of output documents All output documents, shown in 4, were presented in Audio plenary and were approved. 5.6 Press statement The Audio contribution to the press statement was presented and approved. 5.7 Agenda for next meeting The agenda for the next MPEG meeting is shown in Annex F. 5.8 All other business There was none. 5.9 Closing of the meeting The 94th Audio Subgroup meeting was adjourned Friday at 1:15 pm. 290 1. Participants There were 39 participants in the Audio subgroup meeting, as shown in the following table. First Name Last Name Country Affiliation Bruno Bessette CA Voiceage Corporation Ti Eu Chan SG I2R Toru Chinen JP Sony Kihyun Choo KR Samsung Philippe Gournay Canada VoiceAge Corp. / Univ. of Sherbrooke Oliver Hellmuth DE Fraunhofer IIS Jeff Huang USA Qualcomm Inc. Sangoh Jeong KR LG Electronics Kyeong Ok Kang Korea ETRI Kei Kikuiri JP NTT DOCOMO Korea Advanced Institute of Science and Kwangki Kim KR Technology. Kristofer Kjörling SE Dolby Ferenc Krämer DE Dolby Jaeseong Lee KR Yonsei University Roch Lefebvre Canada VoiceAge Corp. / Univ. of Sherbrooke Takehiro Moriya JP NTT Markus Multrus DE Fraunhofer IIS Max Neuendorf DE Fraunhofer IIS Masayuki Nishiguchi JP Sony Takeshi Norimatsu JP Panasonic Henney Oh KR Yonsei University Werner Oomen NL Philips Applied Technologies Gregory Pallone FR Orange Labs Hochong Park KR Kwangwoon University Pierrick Philippe FR Orange Labs Heiko Purnhagen SE Dolby Schuyler Quackenbush USA ARL Mohamad Raad Australia RaadTech Consulting Julien Robilliard DE Fraunhofer IIS Jeongook Song KR Yonsei University Leon Terentiv DE Fraunhofer IIS Kimitaka Tsutsumi JP NTT DOCOMO David Virette DE Huawei Technologies Oliver Wuebbolt DE Technicolor Wei Xiao CN Huawei Technologies Jianxin Yan CN NELA Sungyong Yoon KR LG Electronics Yunxuan Zhao CN Huawei Hai Shan Zhong Singapore Panasonic Singapore Laboratories 291 2. No Sunday 1000-1800 m18456 m18077 m18435 m18451 m18467 m18379 Audio Contributions and Schedule Title AhG – USAC Corrections to Reference Software and CD of USAC Draft USAC CE Status and Workplan USAC Ces – possible decisions VoiceAge listening test results for the enhanced bass-postfilter CE FhG listening test report on CE on improving the USAC bass post-filter Report on cross-check listening test for the CE on improved Bass-post filter for USAC Finalization of CE on an improved bass- post filter operation for the ACELP of USAC m18360 Panasonic cross check report on complexity reduction for time warping in USAC m18375 Dolby listening test results for CE on complexity reduction for time warping in USAC Completion of the Core Experiment on Reducing the Complexity of the USAC Time Warping Sony listening test report on improved SBR in USAC FhG Listening Test Report – improved SBR Finalization of CE on improved SBR m18452 m18398 m18431 m18378 1300-1400 m18372 m18386 m18466 Lunch Crosscheck report on harmonic transposer Ces Finalization of CE on QMF based harmonic transposer NTT DOCOMO Cross-check Report on Improved Harmonic Transposer in 292 Authors X Max Neuendorf X S. Quackenbush X Philippe Gournay ,Roch Lefebvre Guillaume Fuchs, Max Neuendorf David Virette X Barbara Resch, Leif Sehlström, Heiko Purnhagen, Lars Villemoes, Kristofer Kjörling, Bruno Bessette, Philippe Gournay Takeshi Norimatsu, Tomokazu Ishikawa, Haishan Zhong, Dan Zhao, Kok Seng Chong Heiko Purnhagen, Kristofer Kjörling X Stefan Bayer X Toru Chinen, Masayuki Nishiguchi Stephan Wilde, Max Neuendorf Kristofer Kjörling, Leif Sehlström X Kihyun Choo, Miyoung Kim, Eunmi Oh Haishan Zhong, Kok Seng Chong, Takeshi Norimatsu, Tomokazu Ishikawa, Lars Villemoes, Per Ekstrand, Kristofer Kjörling, Stephan Wilde, Sascha Disch, Frederik Nagel, Max Neuendorf , Kimitaka Tsutsumi, Kei Kikuiri, Nobuhiko Naka X X X X X X X X X m18501 m18459 m18468 m18384 USAC Panasonic crosscheck report on improved harmonic transposer Crosscheck listening test report for USAC on FFT and QMF harmonic transposers Report on cross-check listening test for the Ces on QMF based harmonic transposer and improved harmonic transposer in USAC Finalization of CE on improved harmonic transposer in USAC m18389 Overview of performance of transposer proposals, and suggested decoding modes m18371 Crosscheck report on adaptive T/F domain post-processing for USAC Dolby listening test results for CE on T/F post-processing in USAC Crosscheck listening test report for USAC on time frequency domain post-processing Finalization of CE on adaptive T/F domain post-processing for USAC Review of AhG report and plenary presentation m18373 m18461 m18471 Monday 0900-1300 1300-1400 1400-1500 MPEG Plenary Lunch Audio Plenary Welcome and Remarks Report on Sunday Chairs meeting Review main tasks for the week SAOC Verification Test USAC Ongoing and new CEs Verification Test USAC Reference Encoder Maintenance New work 293 Zhong Haishan, Chong Kok Seng, Zhao Dan, Takeshi Norimatsu, Tomokazu Ishikawa, Neo Sua Hong Jeff Huang X David Virette X Lars Villemoes, Per Ekstrand, Sascha Disch, Frederik Nagel Kristofer Kjörling, Haishan Zhong, Kok Seng Chong, Takeshi Norimatsu, Tomokazu Ishikawa, Lars Villemoes, Per Ekstrand, Stephan Wilde, Sascha Disch, Frederik Nagel, Max Neuendorf , Kihyun Choo, Miyoung Kim, Eunmi Oh Heiko Purnhagen, Kristofer Kjörling Jeff Huang X David Virette, Wei Xiao X X X X X X X m17971 m17972 m18076 m18454 m18473 m18434 m18480 m18470 m18429 1630-1800 m18428 m18472 m18432 m18430 m18234 General Ad Hoc on Audio Standards Maintenance Ad Hoc on SAOC, USAC, and Audio and Systems Interactions 93rd MPEG Audio Subgroup Report USAC – AhG Recommendations USAC – CE Integration Phase Informative Encoder Description for USAC Improved Noiseless Coding CE Educational Information on Encoder Implementation of inter-TES tool in USAC USAC Report on USAC performance Comments on further USAC investigation USAC Reference Encoder A new signal classifier for USAC reference encoder Status report on USAC Reference Software JAME USAC CE progress reports Yonsei Crosscheck listening test report on additional bandwidth extension CE for USAC Progress report on additional bandwidth extension CE for USAC at low bit rates Status update of CE proposal on increased structural flexibility in SBR Updated CE on enhanced long term predictor (eLTP) for USAC Progress report on the TCX windowing CE for USAC m18419 Status report of time warping CE in USAC 1800- HOD Tuesday 294 S. Quackenbush X S. Quackenbush X S. Quackenbush X X Vignesh Subbaraman, Markus Multrus, Kihyun Choo Kei Kikuiri, Atsushi Yamaguchi, Nobuhiko Naka X Gregory Pallone, Pierrick Philippe Max Neuendorf, Markus Multrus, Stefan Doehla, Werner Oomen, Heiko Purnhagen X David Virette, Lijing Xu, Wei Xiao Jeongook Song ,Henney Oh ,Hong-Goo Kang X Jeongook Song ,Eunwoo Song, Henney Oh X David Virette, Wei Xiao X Stephan Wilde, Markus Multrus, Max Neuendorf, Kristofer Kjörling, Heiko Purnhagen Jeongook Song ,Henney Oh ,Hong-Goo Kang Taejin Lee, Seungkwon Beack, Minje Kim, Kyeongok Kang Zhao Dan, Zhong Haishan, Chong Kok Seng, Takeshi Norimatsu, Tomokazu Ishikawa, Neo Sua Hong X X X X X X X 0900-1300 m18396 m18469 m18361 m18374 m18455 m18399 m18532 1300-1400 1400-1500 m18406 m18426 USAC – Ongoing CEs Sony listening test report on pulse indexing in USAC Finalization of Enhanced Pulse Indexing CE for ACELP in USAC Panasonic cross check report on PVC for SBR envelope coding in USAC Dolby listening test results for CE on PVC for SBR in USAC FhG Listening Test Report – PVC for SBR envelope coding Report on PVC CE for SBR in USAC Telcordia and PUT listening test results for CE on improved tonal component coding in eSBR (USAC) Lunch MPEG-2, MPEG-4, MPEG Surround, SAOC Proposed addition to Intensity Stereo in ISO/IEC 14496-26:2009, Audio conformance Additional information on MPEG Surround conformance testing m18436 Report on corrections for MPEG SAOC m18437 Revised draft of SAOC verification test report 1500-1800 m18481 USAC – New Ces Proposed CE for extending the LPD mode in USAC Proposed Core-Experiment on Enhancing the USAC Codec at Mid m18479 295 Toru Chinen, Masayuki Nishiguchi David Virette, Dejun Zhang, Fuwei Ma Takeshi Norimatsu, Tomokazu Ishikawa, Haishan Zhong, Dan Zhao, Kok Seng Chong Heiko Purnhagen, Kristofer Kjörling Frederik Nagel X Toru Chinen, Yuki Yamamoto, Mitsuyuki Hatanaka, Masayuki Nishiguchi Tomasz Zernicki ,Maciej Bartkowiak ,Marek Domanski X Ferenc Kraemer, Heiko Purnhagen X Andreas Hölzer, Christian Ertel, Markus Lohwasser, Michael Härtl, Ferenc Kraemer, Frans de Bont Jonas Engdegård, Heiko Purnhagen, Oliver Hellmuth, Jürgen Herre, Cornelia Falch, Leon Terentiv, Maria Luis Valero, Johannes Hilpert, Andreas Hölzer, Werner Oomen Jonas Engdegård, Heiko Purnhagen, Oliver Hellmuth, Jürgen Herre, Cornelia Falch, Leon Terentiv, Maria Luis Valero, Johannes Hilpert, Andreas Hölzer, Werner Oomen X Bruno Bessette ,Philippe Gournay, Roch Lefebvre Markus Multrus, Philippe Gournay, Nikolaus X X X X X X X X X Bitrates m18474 m18433 1800Wednesday 0900-1100 1130-1200 m18413 CE proposal on improved stereo coding at low bit rates Proposed decorrelator improvements in USAC MPEG Plenary Break-out on PVC Break-out on Sinusoidal Coding CE proposal on improved applause coding in USAC Core Experiment on Enhanced Mode Transitions in USAC m18450 Proposed unified global gain syntax element in USAC Lunch Joint meeting with Requirements at Audio for HEVC, at Audio Joint Type I licensing USAC Status and Workplan preparation Social: Meet at main entrance, Bldg 3 (HOD, Chair’s bus leaves at 1650) Bus leaves at 1700 1500-1630 1500-1600 1630- Thursday 0900- 1000-1100 1300-1400 1500-1600 14001800Friday 0900-1300 X Julien Robilliard, Matthias Neusinger, Johannes Hilpert, Erik Schuijers, Bert den Brinker, Werner Oomen X Achim Kuntz, Sascha Disch, Erik Schuijers, Werner Oomen Kiho Cho, Nam Soo Kim, Sungyong Yoon, Sangoh Jeong Guillaume Fuchs, Markus Multrus, Max Neuendorf X Chairs m18393 1300-1400 1400-1500 Rettelbach, Bruno Bessette, Bernhard Grill David Virette USAC timeline and CE interaction USAC Status and Workplan USAC name Type I Licensing Lunch Type I Licensing USAC Status and Workplan Resolutions Chairs Audio Plenary Report on Thursday Chairs meeting Recommendations for final plenary Establishment Ad-hoc groups and 296 X X 1000 1030 1300-1400 1400- review AhG Mandates Get document numbers Submit AhG Mandates and Resolutions Approve Responses to NB comments and Liaison Approval of output documents: Title: N10xx (short title).doc (NOT *.docx!) File w10xx.zip Agenda for next meeting Review of Audio presentation to MPEG plenary A.O.B. Closing of the Audio meeting Lunch MPEG Plenary 297 3. Task Groups 1. MPEG-2 and MPEG-4 Audio, MPEG Audio Conformance, MPEG reference software, MPEG Surround 2. MPEG-D Spatial Audio Object Coding 3. MPEG-D Unified Speech and Audio Coding 4. Audio Exploration: Audio for HEVC 298 4. No. 11654 11655 11656 11657 11658 11659 11660 11661 5. Output Documents Title 13818-4 – Conformance testing ISO/IEC 13818-4:2004 DCOR 2, AAC Intensity Stereo 14496-26 – Audio Conformance ISO/IEC14496-26:2010/Cor 2 BSAC Corrections ISO/IEC 14496-26:2010/DCOR 3, Intensity Stereo 23003-2 – SAOC Report on Spatial Audio Object Coding Verification Tests Defect Report of ISO/IEC 23003-2:2010, Spatial Audio Object Coding 23003-3 – Unified Speech and Audio Coding Study on ISO/IEC 23003-3:201x/CD of Unified Speech and Audio Coding Status and Workplan for USAC CEs Exploration – Future Audio Audio for HEVC Agenda for the 95th MPEG Audio Meeting Agenda Item 1. Opening of the meeting 2. Administrative matters 2.1. Communications from the Chair 2.2. Approval of agenda and allocation of contributions 2.3. Review of task groups and mandates 2.4. Approval of previous meeting report 2.5. Review of AhG reports 2.6. Joint meetings 2.7. Received national body comments and liaison matters 3. Plenary issues 4. Task group activities 4.1. MPEG-1, MPEG-2, MPEG-4, and MPEG Surround 4.2. Spatial Audio Object Coding 4.3. Unified Speech and Audio Coding 4.4. Exploration: Audio for HEVC 5. Discussion of unallocated contributions 6. Meeting deliverables 6.1. Responses to Liaison and NB comments 6.2. Recommendations for final plenary 6.3. Establishment of new Ad-hoc groups 6.4. Approval of output documents 6.5. Press statement 7. Future activities 8. Agenda for next meeting 9. A.O.B 10. Closing of the meeting 299 TBP Available No 10/10/15 No No 10/10/15 10/10/15 YES 10/11/15 No 10/11/15 No 10/11/15 No 10/10/15 No 10/10/15 Annex J – 3DG report Source: Marius Preda, Chair 1 Opening of the meeting 1.1 Approval of the agenda The agenda is approved. 1.2 Goals for the week The goals of this week are: Discuss the software and conformance status for SC-3DMC in 3DGCM (MP25 AMD1) Status of conformance for MP25 AMD1 Review contributions related to 3D graphics engines in MXM Review contributions related to 3D graphics aspects of MPEG-V Review contributions related to Representation of meshes with multiple attributes Review contributions related to Multi-resolution 3D mesh coding Review RGC (reconfigurable graphics codec) contribution Investigate future developments of MPEG 3D Graphics Compression Review Liaisons Review the votes Web-site 1.3 Standards from 3DGC Std Pt Edit. Project Description 4 5 2001 Amd.26 RS for scalable complexity 3D mesh coding in 3DG compression model 4 16 201x Amd.1 Efficient representation of 3D meshes with multiple attributes 4 16 201x Amd.2 Multiresolution mesh coding 4 25 2009 Amd.1 Scalable complexity 3D mesh coding for 3DG compression model 4 27 2009 Amd.3 Conformance for scalable complexity 3D mesh coding in 3DG compression 300 CfP WD CD FCD FDIS Gr. 09/10 10/04 11/01 3 09/10 10/07 11/01 11/07 3 10/07 11/01 11/07 12/01 3 09/07 09/10 10/04 10/10 3 10/01 10/04 11/01 3 model 1.4 Room allocation 3DGC: 3415 301 1.5 Allocation of contributions Note: due to the absence of several delegates the discussions took place by email and audio conferences. Several contributions (indicated in the table below) were postponed for being discussed during the next meeting. N° D1 Title Schedule Monday D1 09:00~11:30 13:00~14:00 MPEG Plenary Lunch Break 3DG Plenary Roll call, Agenda, Goals, FAQ, Awareness Event etc., m17969 Report of AHG on 3DGC documents, experiments and software maintenance Summary: The IRS is not yet connected to SC3DMC. This should be done before the next meeting. AFX 4th edition is in late. Take AP during the week. Marius Preda m17970 Ad Hoc Group on Multi-Resolution 3D mesh Coding Summary: WD1.0 does not contain the integrated schema. Issue on sharing the source code. Results of voting No voting m18555 Notification of the change of Forum Name of Category C liaison partner Resolution: send the information to Yukiko, she updated the Liaison document Khaled Mamou, Minsu 14:00~15:00 Ahn, Xiangyang Ji Performance compression between 3DMeshCodec (Technicolor) and 3DMCE A codec that uses the redundancy of repetitive structures and may bring benefits of up to 92% compression improvement MultiResolution 3D Resolution: Integrate the codec on the benchmarking platform to Mesh Coding compare with recent 3DG codecs. (MR-3DMC) m18424 Initial ideas for texturing support in MR3DMC Summary: Automatic generate and encode textures in a multiresolution way. There are two modules: texture adaptation and skin transplant. Similar as for mesh, a corse texture is obtained. However, there is a conversion of a texture into a fine 302 Francisco Moran Marius Preda Seung Wook Lee, ChangHwa Lyou, SangKwon Jeong Mary-Luc Champel 15:00 – 16:00 David Fuentes Sánchez , Francisco Morán Burgos Observation tune representation. There is a specific unwrapping method that has the following proprieties: it preserves the angles (there is no deformation of the triangles), therefore the unwrapping is unique and deterministic. The skin transplant allows to choose the pixel of the coarse texture by projecting each 3D point of the course mesh (not only the mesh vertices) to the fine mesh and taking the color from the projection. Resolution: Continue the investigation, obtain the coarse texture, investigate on coding of textures. Revision of WD1.0 Unified Model The definition of the bitstream structure (this structure is not supporting the view dependent). The partition consists in spatial Minsu Ahn LODs that contain Quality LODs. The header contains information related to the mesh size (no of cords, normals, … for the fine resolution one) and quantization steps. D2 Tuesday Reconfigurable Graphics Coding 17:25 – 18:00 D2 m18225, Contribution of 3DMC FUs for RGC framework Summary: the FUs for 3DMC + definition of TOKENs + design of networks. The FUs + def of TOKEN should go in GTL. Resolution: issue the WD 3.0 for RGC containing the following sections: 1. GTL: definition of FUs + TOKEN 2. FU network description for different tools (3DMCe) 3. Implementation (attached to the document as source code as well) Seungwook Lee, Bon Ki Koo, Minsoo Park, Sinwook Lee, Sowon Kim, Euee S. Jang m18224, Contribution to RGC tutorial Summary: The document is based on the existent RVC tutorial. The XML FU network schema is defined in MPEG B. There are several tools that allow to specify a decoder. A codec development process is given as example, proposing to start from a top level diagram. One idea to design FUs is to start from the RefSoft. The bitstream parser FU is implementing the Minsoo Park , Seungwook Lee , Bon Ki Koo , Hyunok Oh , Sinwook Lee , Sowon 09:00~10:30 Kim , Taehee Lim , Hyungyu Kim , Euee S. Jang 303 10:30~11:00 Joint with V entropy decoder. The FUs are described textually. One C/C++ example of a FU is provided. RSM is the RVC Simulation model. Case study: 3DMC: the triangle decoder is inside the parser FU. There are additional 5 algorithmic FUs. Supporting tools for RGC framework: ORCC, Hopes. MultiResolution 3D Mesh Coding (MR-3DMC) WD2.0 Supported features and additional needs 4 of 6 requirements are now supported: lossless connectivity, spatial scalability, quality scalability and near lossless encoding. The remaining two are view dependent scalability and support for multi-attributes per vertex. Resolution: add in the CE the description of what should be done to investigate the support for the two remaining requirements. Minsu Ahn 12:00~14:00 Lunch Break MPEG-V 11:30 – 12:00 m18449 Proposed changes for validation rules of MPEG-V Part 7 update the error message, accepted m18585 Additional input for Sensor / Actuator Integration use case Presence sensor and proximity sensor These are some user characteristics in DIA (MobilityCharacteristics, Destination). There is a PlaceType already in MPEG-7 that can be used as a basis for DevicePositionType. Resolution: The position sensors can be used and the engine will make the connection between the sensors and the real users. There is no need to define the presence and proximity sensors. AhG mandate: Identify what it can be reused from MPEG-7 and MPEG-21 DIA for location (this was transformed in a MPEG 304 Markus Waltl , Christian Timmerer Jean H.A. Gelissen , Herman Tuininga 14:00-14:10 14:10-15:10 resolution). FDIS(s) review FCD for RefSoft and Conformance will be generated by Marcus from the schemas. Technical corrections on MPEG-V A new schema is proposed for VWOEventType. Biosensors. Sanghyun Joo Jae Joon Han Multi-pointing, Gaze, Wind. All the three sensors were discussed. Resolution: add the capabilities for all of them Descriptions for social network services Summary: discussions on the need of having user characteristics in MPEG=V. Resolution: this is an important issue for MPEG but in the past both MPEG-7 and MPEG-21 already specified related data format. Contributions on evaluating them for the scope of MPEG-V are encouraged. D3 15:10-15:30 Joint with Req Sanghyun Joo Wednesday D3 09:00~11:00 MPEG Plenary RGC 17:30 – 18:00 An animated meeting on the RVC/RGC meeting related to the specification language. The outcome was that there is a need to document the requirements of such a language. Issue re-discussed in a meeting hold Friday morning and documented below. Marius Preda 11:00 – 13:00 13:00~14:00 Lunch Break Deadline for announcing the participation: 2010/10/29 Flyer draft at the end of the meeting / Final version in 2 weeks MPEG-V with S 14:00 – 15:30 Date: January 27, 2011, 10h00 to 17h00 Demonstrations: 305 Samsung (intelligent camera, bringing sensibility in a virtual world, 3D demo), ETRI (motion chair, multi-sensory experience, 3D demo), GIST (The Haptic Experience) Technicolor (Dynamic Multiresolution Rendering) Institut TELECOM (Remote gaming & Virtual worlds, mobile 3D graphics) Myongji University (Authoring tools of Sensory Effects) Videos: (Metaverse1) Virtual Travel & SoundScape (Metaverse1) Virtual Presence (Metaverse1) Collaborative Working (Metaverse1) Dynamic control of Virtual Humans Face 2 Face virtual chat Christian’s movie Workshop Program: - Introduction (Marius) 10m - Architecture and use case (Jean) 15m - Capturing and controlling the real world (Part 2 & 5) (Kyongro) 20m - Sensory effects (Part 3) (Christian) 20m - Interoperability with virtual worlds (Part 4) (Jae Joon) 20m - Importance of interoperability (invited speaker Yesha, Metaverse1) 15 m No registration fee is needed to participate to the meeting. We’ll have a web page to register the presence. 306 AFX D4 4th Edition Only small progress because of lack of time. Homework for the editors. Marius Preda 15:30 – 16:00 Marius Preda 9:00~9:30 Thursday Review all the action points Output documents: MPEG-V FDIS for Part 2, 3, 4, 5, 6 OK FCD for Part 7 Accepted. WDs 2.0: (editing period of 4 weeks, keep inside only what exists today) 1. MPEGV & MPEG-U 3DG A way to declare an interface to have communication between the scene and outside. Regardless on what is inside the scene, an interface is declared for communication, with the possibility to reply. The problem formulation: the VO should be able to carry behavior information from one virtual world to another. This behavior may be represented by using the scripting language. MPEG-U part 1 is not the answer to the problem since it is specifying the communication interface. 2. Why MPEG-U Part 2 is not belonging to MPEG-V? a. In the use case of gesture recognition, there is a basic sensor of detecting the feature points (MPEG-V Part 5) followed by an interpretation of it (such as recognizing a circle), that is in MPEG-U part 2 Multiresolution WD2.0 review AP: investigate on the integration between the cascaded quantization and the bitplane coding of the KLT method. Finalize during the editing period the QualityLODData class Editing period of 4 weeks. Minsu 11:00~12:00 12:00~14:00 Lunch Break 3DG Plenary 9:30~10:00 RGC core experiment edition (documented in an output document) 14:00~15:00 Review of RGC WD3.0 (2 weeks editing period) 3DG core experiments D5 15:00~16:00 D5 Friday 307 3DG Plenary RGC: update of the core experiment description but accepted almost in the form that was proposed yesterday. RGC/RVC a document in the requirements that the specification language should fulfill was presented, discussed and accepted. The document is registered as an input contribution m18614 “Proposed requirements for RVC and RGC specification language and solution” A resolution asking NBs to comment on it was issued. 10:00~11:00 12:00~14:00 14:00~ Lunch Break MPEG Plenary 308 2 General issues 2.1 General discussion 2.1.1 Reference Software It is recalled that the source code of both decoder AND encoder should be provided as part of the Reference Software for all technologies to be adopted in MPEG standards. Moreover, not providing the complete software for a published technology shall conduct to the removal of the corresponding technical specification from the standard. Currently all the AFX tools published in the third edition are supported by both encoder and decoder implementation. 2.1.2 Web site OrangeLabs announced interrupting the maintenance of the group web-site. A call for volunteers is now issued. In the meantime 3DGC contributors are asked to check the web-site and provide comments on the current version of the web-site. 3 Current Voting Document title none 4 General 3DG related activities 4.1 AhG on 3DG activities DoC Editor of DoC Report of AHG on 3DGC documents, experiments and software maintenance Title Authors Francisco Moran Resolution accepted Ad Hoc Group on Multi-Resolution 3D mesh Coding Title Authors Khaled Mamou, Minsu Ahn, Xiangyang Ji Resolution accepted 309 4.2 Promotions 4.2.1 Web Site Title Status of www.mpeg-3dgc.com Authors Summary The web site is not more maintained by OrangeLab. Action Point: Transfer the web-site to other location and call for volunteers for Resolution maintenance. 4.3 Joint activities 4.3.1 MPEG-V - Information Exchange with Virtual Worlds (formally Metaverse) Additional Use m18253 Cases for MPEG-V jean.gelissen@philips.com Proposal of GPS Sensor Capability Type, Altitude Sensor Kyoungro Yoon , Doohyun m18025 Capability Type, Kim , Min-Uk Kim and Global Position Capability Type Additional type m18507 for Virtual Object Multimodel interfaces for Virtual Humans and behavior markup, propose to adopt part from BML, examples of how this can be used, Second: the TV platform as a Virtual World I/O device: use of jointSPACE: remote rendering: the content is streamed as video. (it is an ETSI initiative as well) Third: Hybrid communication technology: Interfacing the virtual world with Internet world Resolution: add the use cases in Part 1. Check is new requirements are needed (for the next meeting) These sensors are already in ADM1, here there are presented the capability type for each of them. Resolution: Change the name in MobileDevicePotisionCapability Use the bounding box for defining the 3D space. Add the VOVisual in the common characteristic type that may describe the light, shadow and collision effect. A complete description of light, shadow and collision. Resolution: there is no need to go this level because these features are Sanghyun Joo 310 implemented in the graphics engine. m18509 Add Wheel Button in the mouse. Add the combination of different input elements. The “Click definition” is proposed. Resolution: change the Event schema for allowing combination of the Mouse/Keyboard/UserDefinedInput. Accept the wheel button. Issue an internal document called “Technical corrections on MPEG-V”. Resolution: update the FDIS for adding the maxOccurent= unbound for the “choice” in the VWOEventType and adding keyCode AMD1: Add the mousewheel events in the MouseEventCS Sanghyun Joo, Minor changes in Bas du formulaire MPEG-V Part 4 Update of Reference SW m18023 for MPEG-V Part 2 Bum Suk Choi , Eunseo Reflects the changes that are done in FDIS Lee , Jong Hyun Jang , Resolution: accepted in a study of FCD Kyoungro Yoon , Jonghyung Lee , Ying Ying Chen Update of Reference SW m18024 for MPEG-V Part 5 Eunseo Lee , Bum Seok Reflects the changes that are done in FDIS Choi, Kwang Roh Park, Resolution: accepted in a study of FCD Kyoungro Yoon , Jonghyung Lee , Ying Ying Chen Proposed changes for Markus Waltl , Christian m18449 validation rules Timmerer of MPEG-V Part 7 Proposal of Header information for m18027 binary transmission of MPEG-V information postponed A common header for binary representation for Part 2 to 5. It indicates that the stream is MPEG-V, the mode, the parts and the year of the version. Resolution: to generate a table with all the types, ordered per Parts and to allocate Kyoungro Yoon , Soohan unique identifiers to all the types. The Kim , Woo Chool Park , Hae types from Part 1 are 1xx, the ones from Moon Seo Part 2 are 2xx, … Accepted for AMD1 for Part 6 TODO: review the binsarisation of the elements in each part to remove the identifier that is now present in the common header. 311 m18252 Robot Integration m18352 Path Finding Data format for sensed m18365 information on Electrograph sensor m18377 Tactile Information Sensed Information for m18410 gas and dust sensors acordova @tue.nl , jean.gelissen @philips.com Extension for controlling the robot from the virtual world. The paper proposes some new types for controlling the robot. Resolution: compare the proposed types with the ones already existent in MPEg-V and identify what is missing for covering the functionality. Try to extent to generic tools for controlling the robots and not only one robot. In the application of virtual tourism, it is needed to compute the path between two points. The requirement is that MPEG-V should be able to specify the path. jean.gelissen @philips.com , We have currently a type called roland @cs.uu.nl , GlobalPositionCommand that can define real positions. We have to add a possible name and description for each point as well as a way to combine several potions to obtain a trajectory. SeungJu Han , Jae Joon Han , Won-Chul Bang , James D.K. Kim , SangKyun Kim The Electrograph sensors have only usage recommendation in other standards but not a formalized data format. The locations for sensors were investigated because the sensors measure the electrical activity as differences between two locations. Waveform Pattern: Maximum amplitude of wave To describe the electrogram it is needed to specify the waveform, the frequency, the amplitude, phase, coherence and location Proposal: label (EEG, ECG, EMG), placement, timeseries, pattern, unit and max amplitude Resolution: prepare an Electrograph base type and extend for the three types: EEG, EMG, ECG. Dissociate the raw data from metadata. jean.gelissen @philips.com , Editorial points for Part 3 esko.dijk @philips.com Jae Joon Han , SeungJu Han , Won-Chul Bang , James D.K. Kim , Sang-kyun Kim , YongSoo Joo 312 A study on the different types of gas that may be sensed. A set of types are proposed in a classification scheme that can be extended. The unit is ppm but may be also something else. DustType is also proposed that extend the basetype with value and unit. Resolution: add gas and dust in AMD1. Add use cases in Part 1. Definition and Seong Yong Lim , Jihun application m18487 Cha , Injae Lee , Youngdomains of new kwon Lim , Joong Yun Lee sensors Multipointing, Gaze tracking, Wind sensor, Add in Part 1 some clarifications, application domain. Resolution: add in AMD 1 of Part 1 Sensor capability Seong Yong Lim , Jihun m18488 and preference of Cha , Injae Lee , Youngnew sensors kwon Lim , Joong Yun Lee Present the capabilities of the proposed sensors as composition of existent capabilities and additional data. Resolution: accepted partially. Change the blinking in a duration (ms) and not an on/off effect. Wind Adaptation preference not accepted. Binary Seong Yong Lim , Jihun m18489 representation of Cha , Injae Lee , Youngnew sensors kwon Lim , Joong Yun Lee Present the binarisation of the proposed sensors. Resolution: Change the synthax of GazeTrackingSensorType to only have binary type for blink. Remove the position component from the Wind sensor Descriptions for m18508 social network Sanghyun Joo services Add the userProfile and communication type for users. Resolution: investigate on the possibilities to integrate multi-user worlds in MPEG-V Extends the base capability type. Resolution1. Accepted. Jae Joon Han , SeungJu Resolution2: Make clear that in the case of Sensor capability Han , Won-Chul Bang , multidimentional sensors the min/max, m18510 for gas and dust James D.K. Kim , Sang-kyun intervals , … from sensors Kim , YongSoo Joo SensorCapabilityBaseType are overwritten and not used. 4.3.2 Reconfigurable Graphics Coding Several joint meetings with Video RVC group took place to review the related contributions. The resolutions are reported in the table above. 313 4.4 Explorations none 5 Liaison none 6 1 Output documents and Resolutions of 3DGC Part 4 Conformance 1.1 2 The 3DG subgroup recommends to appoint Ralph Sperschneider and Francisco Morán Burgos as editors of 14496-4:2004/Cor.7:2010 Part 16 Animation Framework eXtension (AFX) 2.1 The 3DG subgroup recommends approval of the following documents No. 11665 11666 11667 11668 2.2 3 3.1 Title 14496-16 – Animation Framework eXtension (AFX) Description of 3DG Core Experiments Working Draft 3.0 on Multiresolution Mesh Compression Working Draft 3.0 on Reconfigurable Graphics Coding Reconfigurable graphics coding tutorial v2.0 TBP Available No No No No 10/10/15 10/11/15 10/10/30 10/10/15 The 3DG subgroup recommends NBs to study the input contribution m18614 “Proposed requirements for RVC and RGC specification language and solution” and provide comments by the next MPEG meeting Part 25 3DGCM The 3DG subgroup recommends approval of the following documents No. Title 14496-25– 3DG Compression Model Text of 2nd Edition of ISO/IEC 14496-25 FDIS 3DG compression 11669 model TBP Available No 10/10/15 3.2 The 3DG subgroup recommends to integrate 14496-25:2009/FDAM 1 into 14496-25 2nd Edition and issue FDIS 14496-25 2nd Edition. 7 Establishment of 3DG Ad-Hoc Groups 314 N11670 AHG on 3DG documents, software maintenance and core experiments Mandate: 1. Conduct the experiments on efficient representation of 3D meshes with multiple attributes 2. Conduct the experiments on Reconfigurable Graphics Coding 3. Maintain and edit 3DG documents 4. Coordinate 3DG related conformance and reference software 5. Coordinate editing of the www.mpeg-3dgc.com web site Chairmen: Francisco Morán Burgos, Seung Wook Lee Until 95th meeting Duration: Sunday before 95th meeting Meetings: mpeg-3dgc AT gti. ssr. upm. es Reflector: Subscribe: https://mx.gti.ssr.upm.es/mailman/listinfo/mpeg-3dgc N11671 AHG on Multi-Resolution 3D mesh Coding Mandate: 1. Design an integrated approach combining the PTFAN and KLPMC techniques 2. Provide a first version of the verification model software 3. Continue the evaluation experiments 4. Update the testing platform (MMW.com) Chairmen: Khaled Mamou, Minsu Ahn, Xiangyang Ji Until 95th meeting Duration: Sunday before 95th meeting Meetings: mpeg-3dgc AT gti. ssr. upm. es Reflector: Subscribe: https://mx.gti.ssr.upm.es/mailman/listinfo/mpeg-3dgc 8 Closing of the Meeting See you in Daegu. 315