INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC 1/SC 29/WG 11 CODING OF MOVING PICTURES AND AUDIO ISO/IEC JTC 1/SC 29/WG 11 N10314 Lausanne, CH – February 2008 Source: Leonardo Chiariglione Title: Report of 87th meeting Status Report of 87th meeting .......................................................................................................................... 1 Annex A – Attendance list .................................................................................................................. 18 Annex B – Agenda .............................................................................................................................. 23 Annex C – Input contributions ............................................................................................................ 26 Annex D – Output documents............................................................................................................. 46 Annex E – Requirements report .......................................................................................................... 54 Annex F – Systems report ................................................................................................................... 58 Annex G – Video report ...................................................................................................................... 97 Annex I – Audio report ..................................................................................................................... 115 Annex J – 3DG report ....................................................................................................................... 148 Report of 87th meeting 1 Opening The 87th MPEG Meeting was held from 2nd to 6th February 2009 at Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland. 2 Roll call of participants The attendance list is given in Annex 1. 3 Approval of agenda The agenda is given in Annex 2. 4 Allocation of contributions The input contribution are listed in Annex 3. 5 Communications from Convenor There was no specific communication. 1 6 Report of previous meeting This was approved 7 Processing of NB Position Papers Input documents from National Bodies were presented, discussed and a response provided, as appropriate. 8 Work plan management 8.1 Media coding 8.1.1 HD-AAC Profile The following document was approved 10385 8.1.2 ISO/IEC 14496-3:2009/FPDAM 1:200X HD-AAC Profile 960 frame length in MPEG-4 AAC The following document was approved 10434 8.1.3 Issues concerning frame lengths in the AAC family profiles Constrained Baseline Profile The following documents were approved 10341 10342 8.1.4 Disposition of Comments on ISO/IEC 14496-10:200X/PDAM 1 Text of ISO/IEC 14496-10:200X/FPDAM 1 Constrained Baseline Profile and supplemental enhancement information Multiview Field High Profile The following document was approved 10344 8.1.5 Working Draft 1 of ISO/IEC 14496-10:200X/Amd.2 Multiview Field High Profile AFX 3rd edition The following document was approved 10331 8.1.6 WD of ISO/IEC 14496-16 3rd Edition Multiresolution profile The following documents were approved 10329 10330 Text of ISO/IEC 14496-16:2006/PDAM4 (Scalable Complexity 3D Mesh Compression) CE on Scalable Complexity 3D Mesh Compression 2 8.1.7 Open Font Format extensions The following documents were approved 10450 10451 8.1.8 DoC on ISO/IEC FCD 14496-22 2nd Edition Text of ISO/IEC FDIS 14496-22 2nd Edition Media Value Chain Ontology The following documents were approved 10454 10455 8.1.9 Draft DoC on ISO/IEC CD 21000-19 Media Value Chain Ontology Draft Text of ISO/IEC FCD 21000-19 Media Value Chain Ontology Codec Configuration Representation The following documents were approved 10348 10349 Disposition of Comments on ISO/IEC FCD 23001-4 Text of ISO/IEC FDIS 23001-4 Codec Configuration Representation 8.1.10 Video Tool Library The following documents were approved 10350 10351 10352 10354 10355 10356 Disposition of Comments on ISO/IEC FCD 23002-4 Text of ISO/IEC FDIS 23002-4 Video Tool Library Request for ISO/IEC 23002-4/Amd.1 WD 4 of ISO/IEC 23002-4/Amd.2 (Tools for MPEG-2 MP, MPEG-4 ASP, AVC HP and SVC) RVC Work Plan and FU Development Status Description of Core Experiments in RVC 8.1.11 MPEG Surround The following document was approved 10386 Thoughts on MPEG Surround Signaling 8.1.12 Spatial Audio Object Coding The following documents were approved 10416 10417 Study on ISO/IEC FCD 23003-2:200x, Spatial Audio Object Coding Status and Workplan on SAOC Core Experiments 8.1.13 Unified Speech and Audio Coding The following documents were approved 10418 10419 WD2 of USAC Workplan for USAC CEs 3 10420 10421 10422 10423 MPEG Reference Encoder and the Audio CE Process Workplan on MPEG Reference Encoder Draft Revisions to MPEG Audio CE methodology Thoughts on Efficient Bitstream Syntax 8.1.14 Interfaces with virtual worlds The following documents were approved 10498 10474 10475 10476 10477 Requirements for MPEG-V Version 3.2 WD of Architecture WD of Sensory Information WD of Avatar Information WD of Control Information 8.1.15 3D Video Coding The following documents were approved 10357 10358 10359 10360 Vision on 3D Video Coding Applications and Requirements of 3D Video Coding Call for 3D Test Material: Depth Maps & Supplementary Information Description of Exploration Experiments in 3D Video Coding 8.1.16 High-Performance Video Coding The following documents were approved 10361 10362 10363 Vision and Requirements for High-Performance Video Coding (HVC) Call for Test Materials for High-Performance Video Coding Standardisation Draft Call for Evidence on High-Performance Video Coding 8.2 Composition coding 8.2.1 General The following document was approved 10449 8.2.2 Clarification on the usage of ISO/IEC 14496-20 by other standardization bodies Interactive Digital Radio The following documents were approved 10503 10439 8.2.3 Requirements v2.0 for a new BIFS profile to support Interactive Digital Radio WD 1.0 of ISO/IEC 14496-11:2002/AMD 7 New BIFS profile LASeR Adaptation The following documents were approved 4 10446 10447 10448 8.2.4 DoC on ISO/EC 14496-20:2008/PDAM 2 Adaptation Text of ISO/EC 14496-20:2008/FPDAM 2 Adaptation Workplan for service example of LASeR Adaptation & PSI Presentation of Structured Information The following document was approved 10453 WD2.0 of ISO/IEC 21000-2 AMD PSI 8.3 Description coding 8.3.1 Video Signature Tools The following document was approved 10345 8.3.2 Description of Core Experiments in Video Signature Description development Audio description coding standards The following documents were approved 10399 10413 DoC on ISO/IEC TR 15938-8:2002/PDAM 4, Extraction of audio features from compressed formats ISO/IEC TR 15938-8:2002/DAM 4, Extraction of audio features from compressed formats 8.4 Transport and File formats 8.4.1 Carriage of MVC in MPEG-2 Systems The following documents were approved 10435 10436 8.4.2 DoC on ISO/IEC 13818-1:2007/PDAM4 Transport of MVC Text of ISO/IEC 13818-1:2007/FPDAM4 Transport of MVC Miscellaneous additions to File Format The following document was approved 10442 8.4.3 Study of ISO/IEC 14496-12:200X/FPDAM 1 General Improvements Handling of MPEG-4 Audio enhancement layers The following documents were approved 10500 10501 Request for ISO/IEC 14496-14:2003/PDAM1 Handling of MPEG-4 Audio enhancement layers Text of ISO/IEC 14496-14:2003/PDAM1 Handling of MPEG-4 Audio enhancement layers 5 8.4.4 AVC File Format extensions for MVC The following documents were approved 10444 10445 8.4.5 DoC on ISO/IEC 14496-15:2004/PDAM 3 MVC File Format Text of ISO/IEC 14496-15:2004/FPDAM 3 MVC File Format Modern MPEG Transport The following document was approved 10496 MPEG Modern Transport (MMT) over Networks 8.5 Multimedia architecture 8.5.1 MXM general The following document was approved 10469 Proposal for new work item 8.5.2 MXM Architecture and Technologies The following document was approved 10470 8.5.3 Text of ISO/IEC CD 23006-1 MXM Architecture and Technologies MXM API The following document was approved 10471 8.5.4 Text of ISO/IEC CD 23006-1 MXM APIs Advanced IPTV Terminal The following documents were approved 10497 10478 10479 Draft Advanced IPTV Terminal (AIT) Requirements Ideas on the new AIT project Ideas on How to Implement Collaboration Between MPEG and ITU-T Q.13/SG16 on the Advanced IPTV Terminal Standardisation 8.6 Application formats 8.6.1 DMB AF Harmonization of MPEG-2 TS storage The following documents were approved 10461 10462 Request for ISO/IEC 23000-9:2008/PDAM2 DMB AF Harmonization of MPEG-2 TS storage Text of ISO/IEC 23000-9:2008/PDAM2 DMB AF Harmonization of MPEG-2 TS storage 6 8.6.2 Interactive Music Application Format The following document was approved 10468 Text of ISO/IEC CD 23000-12 Interactive Music AF 8.7 Protocols 8.7.1 MXM Protocols The following document was approved 10473 Text of ISO/IEC CD 29116-1 2nd edition MXM Protocols 8.8 Reference implementation 8.8.1 MVC Reference Software The following documents were approved 10339 10340 8.8.2 Disposition of Comments on ISO/IEC 14496-5:2001/PDAM 15 Text of ISO/IEC 14496-5:2001/FPDAM 15 Reference Software for Multiview Video Coding 3D Graphics Compression Model Reference Software The following documents were approved 10323 10324 8.8.3 DOCR on ISO/IEC 14496-5:2001/FPDAM 22 (3DGCM Reference Software) Text of ISO/IEC 14496-5:2001/FDAM 22 (3DGCM Reference Software) SC3DMC Reference Software The following documents were approved 10326 10327 8.8.4 Request for Amendment: 14496-5:2001/PDAM27 Text of ISO/IEC 14496-5:2001/PDAM27 (SC3DMC RefSoft) Scene Partitioning Reference Software The following document was approved 10325 8.8.5 Text ISO/IEC 14496-5:2001/FPDAM 25 (Scene Partitioning Reference Software) Professional Archival MAF Reference Software The following documents were approved 10456 10457 10458 DoC on ISO/IEC 23000-6 PA-AF/PDAM 1 Conformance and Reference Software Text of ISO/IEC 23000-6 PA-AF/FPDAM 1 Conformance and Reference Software Workplan for ISO/IEC 23000-6 PA-AF Conformance and Reference Software 7 8.8.6 DMB Application Format The following documents were approved 10459 10460 8.8.7 Text of ISO/IEC 23000-9:2008/FPDAM1 DMB AF Conf. And Ref. Soft. Workplan for DMB AF Conf. And Ref. Soft. Video Surveillance AF Reference Software The following documents were approved 10463 10464 8.8.8 DoC on ISO/IEC 23000-10 PDAM1 Video Surveillance Application Format Conf. & Ref. SW. Text of ISO/IEC 23000-10 FPDAM1 Video Surveillance Application Format Cof. & Ref. SW. Stereoscopic video AF Reference Software The following document was approved 10466 8.8.9 WD 1.0 of ISO/IEC 23000-11/AMD1 Stereoscopic Video Application Format Conf. & Ref. SW. Video Tool Library Reference Software The following document was approved 10353 Text of ISO/IEC 23002-4/PDAM1 Video Tool Library Conformance and Reference Software 8.8.10 MXM Reference Software The following documents were approved 10507 10472 List of identified non MPEG members to be allowed to access MPEG SVN repository Text of ISO/IEC CD 23006-1 MXM Conf. & Ref. SW 8.9 Conformance 8.9.1 MPEG-4 Audio Conformance The following documents were approved 10394 10395 10397 10398 Request for Subdivision of 14496, Audio Conformance ISO/IEC 14496-26:2009, Audio Conformance ISO/IEC 14496-26:2009/FPDAM 1, AAC-ELD, OAFI, additional AAC and MPEG1/2 on MPEG-4 Conformance WD on additional BSAC conformance streams for broadcasting 8 8.9.2 AAC-ELD, OAFI and additional AAC Conformance The following document was approved 10391 8.9.3 DoC on ISO/IEC 14496-4:2004/PDAM 36, AAC-ELD, OAFI and additional AAC Conformance MVC Conformance The following documents were approved 10337 10338 8.9.4 Disposition of Comments on ISO/IEC 14496-4:2004/PDAM 38 Text of ISO/IEC 14496-4:2004/FPDAM 38 Multiview Video Coding Conformance Testing File Format Conformance The following document was approved 10438 8.9.5 Text of ISO/IEC 14496-4:2004/FPDAM 37 File Format Conformance Improvements 3DG conformance The following documents were approved 10332 10333 10334 10433 10335 8.9.6 Request for subdivision of ISO/IEC 14496-27 Text of ISO/IEC 14496-27:2009/FDIS (3DG Conformance) Text of ISO/IEC 14496-27:2009/FPDAM1 (Scene partitioning conformance) Request for Amendment: 14496-27:2009/PDAM2 (SC3DMC Conformance) Text of ISO/IEC 14496-27:2009/PDAM2 (SC3DMC Conformance) MultiResolution Profile Conformance The following document was approved 10320 8.9.7 DOCR on ISO/IEC 14496-4:2004/FPDAM 33 (Multi Resolution Profile Conformance) 3D Graphics Compression Model Conformance The following document was approved 10321 8.9.8 DOCR on ISO/IEC 14496-4:2004/FPDAM 34 (3D Graphics Model Conformance) 3D Graphics Conformance The following document was approved 10322 Text of ISO/IEC 14496-4:200x/DCOR 8 (Removal of 3DG Conformance) 9 8.9.9 Photo Player MAF Conformance The following documents were approved 10346 10347 Request for ISO/IEC 23000-3/Amd.2 Text of ISO/IEC 23000-3/PDAM2 Conformance Testing for Photo Player MAF 8.9.10 Professional Archival MAF Conformance The following documents were approved 10456 10457 10458 DoC on ISO/IEC 23000-6 PA-AF/PDAM 1 Conformance and Reference Software Text of ISO/IEC 23000-6 PA-AF/FPDAM 1 Conformance and Reference Software Workplan for ISO/IEC 23000-6 PA-AF Conformance and Reference Software 8.9.11 DMB Application Format The following documents were approved 10459 10460 Text of ISO/IEC 23000-9:2008/FPDAM1 DMB AF Conf. And Ref. Soft. Workplan for DMB AF Conf. And Ref. Soft. 8.9.12 Video Surveillance AF Conformance The following documents were approved 10463 10464 DoC on ISO/IEC 23000-10 PDAM1 Video Surveillance Application Format Conf. & Ref. SW. Text of ISO/IEC 23000-10 FPDAM1 Video Surveillance Application Format Cof. & Ref. SW. 8.9.13 Stereoscopic video AF Conformance The following document was approved 10466 WD 1.0 of ISO/IEC 23000-11/AMD1 Stereoscopic Video Application Format Conf. & Ref. SW. 8.9.14 Video Tool Library Conformance The following document was approved 10353 Text of ISO/IEC 23002-4/PDAM1 Video Tool Library Conformance and Reference Software 8.9.15 MXM Conformance The following document was approved 10472 Text of ISO/IEC CD 23006-1 MXM Conf. & Ref. SW 10 8.10 Maintenance 8.10.1 Systems coding standards The following documents were approved 10437 10440 10441 10499 10443 WD 1.0 of ISO/IEC 13818-1:2007 DCOR X DoC on ISO/IEC 14496-12:200X/DCOR 2 Usage of brands and box order in sample entry Text of ISO/IEC 14496-12:200X/COR 2 Usage of brands and box order in sample entry Text of ISO/IEC 14496-12:2003/DCOR 3 Text of ISO/IEC 14496-15:2004/COR3 8.10.2 Video coding standards The following document was approved 10343 Defect Report on ISO/IEC 14496-10:200X 8.10.3 Audio coding standards The following documents were approved 10373 10374 10375 10376 10377 10378 10379 10380 10381 10382 10383 10384 10387 10388 10389 10390 10392 10393 10396 10414 10415 DoC on ISO/IEC 13818-4:2004/AMD 2:2005/DCOR 2, AAC Conformance ISO/IEC 13818-4:2004/AMD 2:2005/Cor 2, AAC Conformance DoC on ISO/IEC 13818-7:2006/DCOR 1, AAC ISO/IEC 13818-7:2006/Cor. 1, AAC DoC on ISO/IEC 14496-3:2005/DCOR. 6, AAC ISO/IEC 14496-3:2005/Cor. 6, AAC DoC on ISO/IEC 14496-3:2005/AMD 2:2006/DCOR 4, HE-AAC V2 Profile and ALS ISO/IEC 14496-3:2005/AMD 2:2006/Cor. 4, HE-AAC V2 Profile and ALS DoC on ISO/IEC 14496-3:2005/AMD 3:2006/ DCOR 2, SLS ISO/IEC 14496-3:2005/AMD 3:2006/Cor. 2, SLS DoC on ISO/IEC 14496-3:2005/AMD 9:2008/DCor. 1, AAC-ELD ISO/IEC 14496-3:2005/AMD 9:2008/Cor. 1, AAC-ELD ISO/IEC 14496-4:2004/Cor. 6, AAC-LD ISO/IEC 14496-4:2004/DCOR 7, Removal of Audio Conformance DoC on ISO/IEC 14496-4:2004/AMD13:200x/DCOR 2, AAC-LD bitstreams ISO/IEC 14496-4:2004/AMD13:200x/Cor. 2, AAC-LD bitstreams DoC on ISO/IEC 14496-5:2001/Amd.10:2007/DCOR 3, ALS and SLS ISO/IEC 14496-5:2001/Amd.10:2007/COR 3, ALS and SLS ISO/IEC 14496-26:2009/DCOR 1, ALS and SLS updates ISO/IEC 23003-1:2007/DCOR 2, Misc. Corrections ISO/IEC 23003-1:2007/AMD 2:2008/DCOR 1, Ref. Sw. Update 8.10.4 Other MPEG-7 standards The following document was approved 11 10452 Text of ISO/IEC 15938-12:2008 /COR 1 8.10.5 MPEG-A standards The following document was approved 10467 Text of ISO/IEC 23000-11/DCOR1 (SVAF signalling of voice codecs) 8.11 Work plan and time line The following documents were approved 10401 10402 10403 9 MPEG Standards Table of unpublished FDISs Work plan and time line Organisation of this meeting 9.1 Tasks for subgroups The following tasks were assigned Requirements Std 4 A V Systems Std 2 4 Pt Amd 10 11 Y 10 Pt Amd 1 4 4 11 12 15 20 7 22 12 37 23 ? ? 1 Cor.2 3 2 3 2nd Ed Cor1 MVC Profile Laser-BIFS integration User Interface framework Advanced Video Surveillance Responses to CfP for Interfaces with virtual worlds Loudness metadata Advanced IPTV Terminal Contribution to press release HVC - CfE 3DV – Vision, Applications, Requirements New standard areas Audio Carriage of MVC RA FF conformance Synthesised texture RS SVC FF RS Laser-BIFS integration Miscellanea Usage of brands etc. MVC File Format Adaptation technologies for Laser Presentation of Structured Information Open Font Format 12 21 A B M 19 4 1 2 5 2nd Ed 6 1 9 Cor 1 9 1 9 2 10 ? 1 11 Cor1 1 12 2 1 1 2 3 V MVCO Musical Slide Show MAF RS & C Protected Musical Slide Show MAF RS & C Media Streaming MAF Professional Archival AF RS & C DMB MAF DMB MAF RS & C DMB MPEG-2 TS storage Video Surveillance AF Video Surveillance AF RS & C Stereoscopic video AF Stereoscopic video AF RS & C Interactive music AF Fragment Request Unit RS & C MXM Architecture API RS & C Information exchange with virtual worlds Representation of sensory effects information Advanced IPTV Terminal Update MPEG technology web page MAFs Contribution to press release AIT? MXM Interactive music AF Video 7 A B C JVT 4 3 4 6 7 8 3 2 4 4 4 1 2 4 38 15 10 Cor 1 1 Video Signature Tools Image Signature Tools RS Image Signature Tools C Image Signature Tools Matching and feature extraction Photo Player Conformance Codec Configuration Description Video Tool Library Video Tool Library Conformance & RS Video Tool Library extensions 3DV/FTV HVC Update MPEG technology web page Contribution to press release RVC HVC CfE Emmy MVC Conformance MVC RS Miscellanea AVC Constrained Baseline Profile 13 Audio 4 D 4E 1 4 36 5 24 26 2 3 Description of work items MPEG-4 Audio SLS profile AAC-ELD conformance AAC-ELD Reference Software Conformance Spatial Audio Object Coding USAC New audio issues (HVC) Contribution to press release MPEG reference encoder 3DG 4 27 1 5 22 25 27 16 3rd Ed 4 V 3DG Conformance 3D Graphics Compression model Conformance 3D Graphics Compression model Reference Software Scene partitioning RS Scalability complexity 3DMC RS Scalable complexity 3DMC Information exchange with virtual worlds Contribution to press release 9.2 Joint meetings The following joint meetings were held Groups Sys, 3dg Sys, 3dg Sys, aud Sys, req Sys, req Vid, req 3dg, vid Aud, req What Day MXM Tue MPEG-V Tue FF Wed AIT Wed BIFS Thu Interl. MVC, HVC, 3DV Wed RVC Wed Audio for HVC Thu Time 14:00-15:00 15:00-16:00 14:00-15:00 12:00-13:00 14:00-15:00 15:30-17:30 14:00-15:00 15:00-15:30 Where 3dg 3dg aud sys sys vid 3dg aud 10 WG management 10.1 WG organisation 10.2 Terms of reference Recently the amount of activities handled by JVT group has been constantly reducing and, as a consequence, its attendance has also been reducing. Because the JVT is a joint group with ITU-T SG 16 the JVT entails a significant organisational budget. For a group like MPEG that operates on a voluntary basis continuing the JVT must be balanced by significant benefits from its existence. 14 WG11 has decided to discontinue its support for continuing the JVT. This is reflected in the following document approved at the meeting. 10400 Terms of reference 10.3 Editors The following document was approved 10404 Editors of MPEG standards 10.4 Liaisons The following liaison statements were issued 10319 10482 10483 10484 10485 10486 10487 10488 10489 10490 10491 10492 10493 10494 10495 10502 10504 10505 10506 10364 10366 10424 10425 10426 10427 10328 Liaison statement to ITU-T SG 16 Liaison statement to W3C on MXM Liaison statement to WG 1 on PA-AF Liaison statement to ITU-T SG16 on IPTV Liaison statement to OMA BCAST on ISO/IEC 14496-20 Liaison statement to IEC TC 100 on HD Recorder/Receiver Interface Liaison statement to IEC TC 100 on IP & TS based service access Liaison statement to IEC TC 100 on digital right permission code Liaison statement to JTC 1 Study Group on Sensor Network Liaison statement to ISO TC 223 on Video Surveillance Liaison statement to FNB on Informal workshop on Video Surveillance Liaison statement to SC27 on new work item regarding digital evidence Liaison statement to EDItEUR on MVCO Liaison statement to IPFI on MVCO Liaison statement to DOI on MVCO Liaison statement to SMPTE 23B/Container on package formats Liaison statement to WorldDMB on new BIFS profile Liaison statement to GRN on new BIFS profile Liaison statement to TTA on new BIFS profile Liaison statement to ITU-R SG6 re Multi-view Video Coding Liaison statement to ITU-T SG16 re interlaced Multi-view Video Coding Response to DRM on MPEG-4 AAC Technology and Profiles Response to ETSI/EBU/CENELEC JTC on MPEG-4 AAC Technology and Profiles Response to WorldDMB Forum on MPEG-4 AAC Technology and Profiles Response to IEC TC100/TA4 on IEC CDV 61937-11 and 60958-3/Amd.1 Liaison statement to SC 24 The following document was approved 10411 List of Organisations with which MPEG entertains liaisons 10.5 Ad hoc groups The following ad hoc groups were established 15 10370 10336 10514 10510 10431 10372 10513 10371 10367 10509 10369 10512 10515 10368 10432 10508 AHG on 3D Video Coding AHG on 3DGC documents, software maintenance and core experiments AHG on Advanced IPTV Terminal AHG on Application Format AHG on Audio Standards Maintenance AHG on AVC Development AHG on Font Format Representation AHG on High-Performance Video Coding AHG on Maintenance of MPEG-4 Visual related Documents, Reference Software and Conformance AHG on MPEG File Formats AHG on MPEG-7 Visual AHG on MPEG-V (including previous RoSE activities) AHG on MXM AHG on Reconfigurable Video Coding AHG on SAOC, USAC and MetaData AHG on Scene Representation 10.6 Asset management The following documents were approved 10405 10406 10407 10408 10409 Schema assets Software assets Conformance assets Content assets URI assets 10.7 IPR management The following document was approved 10410 Standards under development for which a call for patent statements is issued 11 Administrative matters 11.1 Responses to National Bodies The following responses to national Bodies were approved 10365 10428 10429 10430 Responses to National Bodies Response to AUNB Comments on USAC Response to AUNB Comments on MetaData Response to FR, FI and CN NB Comments on USAC 11.2 Schedule of future MPEG meetings The following schedule was approved 16 # 87 88 89 90 91 92 93 94 City Country yy mm Lausanne CH 09 02 Maui, HI US 09 04 London UK 09 06-07 Xian CN 09 10 Kyoto JP 10 01 Melbourne AU 10 04 Torino IT 10 07 ? ? 10 10 11.3 Promotional activities The following documents were approved 10412 10357 10316 The MPEG Vision Vision on 3D Video Coding Lausanne press release 12 Resolutions of this meeting 13 A.O.B 14 Closing 17 dd-dd 02-06 20-24 29-03 26-30 18-22 12-16 19-23 11-15 Annex A – Attendance list LASTNAME AHN ASAI BARONCINI BÄSE BOBER BOEHM BOURGE BRASNETT BRULS CABRERA QUESADA FirstName Jeong-Hwan Kohtaro Vittorio Gero Miroslaw Johannes Arnaud Paul Fons Affiliation SAMSUNG Electronics Mitsubishi Electric Corporation Fondazione Ugo Bordoni Siemens Mitsubishi Electric Deutsche Thomson OHG ST-NXP Wireless Mitsubishi Electric R&D Centre Europe Philips Country KR JP IT DE UK DE FR UK NL Julián ES CARBALLEIRA CHAISORN CHAN CHEN CHENG CHEON CHIARIGLIONE CHIARIGLIONE CHOI CHOI CHOI CHONO CHUJOH CIEPLINSKI CONCOLATO CORDARA CORVAGLIA DAI YONG DAVIES DELGADO DENIS DIVORRA ESCODA DÖHLA DUN FRANCOIS FRÖJDH GAUVIN GEIGER GELISSEN GERKE GIOIA GOURNAY GRANT GRANT Pablo Lekha Ti Eu Ying Ka Man Carmen Lee Filippo Leonardo Bumsuk Kiho Miran Keiichi Takeshi Leszek Cyril Giovanni Marzia Kim Thomas Jaime Leon Universidad Politécnica de Madrid Grupo de Tratamiento de Imagenes Universidad Politecnica de Madrid Institute for Infocomm Research Institute For Infocomm Research (A*STAR) Tampere University of Technology MPEG-CHINA Mr. CEDEO.net CEDEO.net ETRI Hanyang University ETRI NEC Corporation Toshiba Corporation Mitsubishi Electric R&D Centre Europe Telecom ParisTech Telecom Italia Lab CNIT - Univ. Brescia Hanyang University BBC Universitat Politècnica de Catalunya Vrije Universiteit Brussel - ETRO dept. Oscar Stephan Yujie Edouard Per Marc Ralf Jean H.A. Sebastian Patrick Philippe John Kate Telefonica Research Germany Xian Jiaotong University thomson Ericsson sDae Germany Philips Research Laboratories Fraunhofer HHI Orange Labs VoiceAge Corporation Nine Tiles Nine Tiles ES DE CN FR SE ES DE NL DE FR CA UK UK 18 ES SG SG FI HK KR IT IT KR KR KR JP JP UK FR IT IT KR UK ES BE GRILL GRÜNEBERG GUEZ VUCHER GUN HAECHUL HANNUKSELA HARADA HELLMUTH HENNEY HERRE HONG HUANG HUANG HUI YONG HUSAK HWA SEON HWANG ISHTIAQ ITARU ITO ITO IWAMOTO JANG JANG JEON JEONG JIN JUNG KALVA KANG KAZUI KEILER KIKUIRI KIM KIM KIM KIM KIM KIM KIM KIM KIM KIM KIMATA KITAMURA KJOERLING KLOMP KOGURE KUDUMAKIS LAGADEC LE FEUVRE LEE Bernhard Karsten Marc Bang Choi Miska Noboru Oliver Oh Juergen Jin Woo Pengjun Tiejun Kim Walt Shin Seo-Young Faisal Kaneko Satoshi Takashi Kota Euee S. Inseon Byeungwoo Dong-Seok Jukyong Yang-Won Hari Kyeongok Kimihiko Florian Kei Dongwon Hae Kwang Hyungyu Jin-Seo JungHoe Kwangki Sang-Kyun Kim Seonghoon Yeongmi Yong-Goo Hideaki Masatsugu SE Sven Takuyo Panos Owen Jean Chung Hee Germany Fraunhofer HHI IFPI ETRI ETRI Nokia Corporation NTT Germany LG Electronics Fraunhofer IIS ETRI Qualcomm Inc. Peking University ETRI US - SMPTE KETI Samsung Electronics Co. Ltd Motorola Inc. TOKYO POLYTECHNIC UNIVERSITY TOSHIBA CORPORATION Fujitsu Laboratories Ltd. NEC Corporation Hanyang University ETRI Sungkyunkwan University Inha University inha university LG Electronics Florida Atlantic University ETRI Fujitsu Laboratories Ltd. Thomson NTT DOCOMO, INC. Sejong Univ. Sejong University Hanyang University ETRI Samsung Electronics Co. Ltd Information and Communications University Myongji University VAROVISION Gwangju Institute of Science and Technology Yonsei Univ. NTT Corporation For more convenient AV life Dolby Sweden AB Institut für Informationsverarbeitung Panasonic Queen Mary University of London AFNOR Telecom ParisTech ETRI 19 DE DE FR KR KR FI JP DE KR DE KR US CN KR US KR KR US JP JP JP JP KR KR KR KR KR KR US KR JP DE JP KR KR KR KR KR KR KR KR KR KR JP JP Sweden DE JP UK FR FR KR LEE LEE LEE LEE LEE LEE LEE LEFEBVRE LEVANTOVSKY LI LIEBCHEN LIM LIM LIM LIM LOPEZ LUTHRA MASASHI MATSUO MATTAVELLI MCCANN MOON MORÁN BURGOS MORIYA MOSCHETTI MOTTA MÜLLER MULTRUS MURAKAMI NA NAKACHI NAKAYAMA NARASIMHAN NARROSCHKE NEUENDORF NISHI NOMURA NORIMATSU OAMI OGURA OH OH OHM OOMEN OSTERMANN PARK PASCHALAKIS PATEUX Gwo Giun Hyunkook Jaejoon Kangchan Seung Wook Taejin Wonsuk Roch Vladimir Zhengguo Tilman ChongSoon Jung Eun Sung-Chang Youngkwon Patrick Ajay Takahashi Shohei Marco Ken Joohee Francisco Takehiro Fulvio Giovanni Karsten Markus Tokumichi Sang-Il Takayuki Yasushige Sam Matthias Max Takahiro Toshiyuki Takeshi Ryoma Yukiko Eunmi Weongeun Jens-Rainer Werner Joern Kyungmo Stavros Stéphane PENG PHILIPPE PREDA PRETEUX Zhang Pierrick Marius Francoise National Cheng Kung University LG electronics Samsung Electronics ETRI ETRI ETRI ETRI Université de Sherbrooke Monotype Imaging Inc. Dr. LG Electronics Singapore National Body LG Electronics ETRI net&tv Inc. THOMSON Motorola Hitachi, Ltd. NTT Corporation EPFL ZetaCast, representing Samsung Sejong Univ. Universidad Politécnica de Madrid NTT EPO Qualcomm Inc. Fraunhofer HHI Germany Mitsubishi Electric Corporation ETRI NTT NHK US National Body Panasonic Germany Panasonic NEC Panasonic NEC Corporation IPSJ/ITSCJ Samsung Electronics ETRI RWTH Aachen University Philips Applied Technologies Leibniz Universität Hannover Samsung Electonics Co., Ltd. Mitsubishi Electric R&D Centre Europe Orange Labs Core Network Research Department, Huawei Technologies Co., Ltd Orange Labs Institut TELECOM Institut TELECOM 20 CN KR KR KR KR KR KR CA US SG DE SG KR KR KR FR US JP JP CH UK KR ES JP DE US DE DE JP KR JP JP US DE DE JP JP JP JP JP KR KR DE NL DE KR UK FR CN FR FR FR PRIMAUX PURNHAGEN QUACKENBUSH RAAD RADULOVIC RAULET RIDGE RODRIGUEZ RODRIGUEZDONCEL ROSSIER RYU SABIRIN SAKAZUME SAMPAIO LOBO SANGHYUN SATTI SCHNEIDER SCHREINER SEGALL SEKIGUCHI SEO SHEEN WOOK SHIMIZU SHIMOR SHU SINGER SMYTH SONG SPERSCHNEIDE R STANKIEWICZ SUGIMOTO SUH SUN SUNG SUZUKI SUZUKI SUZUKI TADDEI TAKANORI TAN TANIMOTO TANIZAWA TERENTIEV TESCHER THOMA TIAN TIMMERER TOKUMO TRIMECHE TUNG Laurent Heiko Schuyler Mohamad Ivana Mickael Justin Eva AFNOR Dolby Sweden AB Audio Research Labs RaadTech Consulting Ericsson IETR/INSA of Rennes Nokia UPC FR SE US AU SE FR US ES Victor Jean Jeha Muhammad Syah Houari Satoru Lincoln Joo Shahid Andreas Stephan Andrew Shun-ichi Jeongil Lee Shinya Avraham Haiyan David Neil Jaeyeon Universitat Politècnica de Catalunya (UPC) Nagravision Gwangju Institute of Science and Technology ES CH KR Information and Communications University Victor Company of Japan, Limited Philips ETRI Vrije Universiteit Brussel - ETRO dept. Dolby Germany GmbH Germany Sharp Mitsubishi Electric Corporation ETRI Hanyang Univeristy NTT SanDisk Corporation Institute for Infocomm Research USA IST/37 Samsung KR JP NL KR BE DE DE US JP KR KR JP Israel SG US UK KR Ralph Olgierd Kazuo Jung Suk Huifang Jaewon Kazuyoshi Teruhiko Yoshinori Hervé Senoh Thiow Keng Masayuki Akiyuki Leonid Andrew Herbert Dong Christian Yasuaki Mejdi Yi Shin Fraunhofer IIS Mr Mitsubishi Electric Corporation Samsung Electronics Co.,Ltd Mitsubishi Electric Research Labs LG Electornics Nagoya University Sony Corp. NTT DOCOMO, INC Huawei Technologies National Institute of Info & Comm tech NTT DoCoMo, Inc. Nagoya University TOSHIBA Corporation Fraunhofer Institut Microsoft germany Thomson Inc Klagenfurt University Sharp Corporation Nokia Research Center MStar Semiconductor DE PL JP KR US KR JP JP JP DE JP JP JP JP DE US DE US AT JP FI CN 21 UGUR UM VAN DER AUWERA VERMEIRSCH VETRO WANG WITTMANN WOO-JIN XIE XIONG YAMAKAGE YAMAMOTO YASHIMA YE YOO YOSHINO YU YU Kemal Gi-Mun Nokia ETRI FI KR Geert Kenneth Anthony Xin Steffen Han Minjie Lianhuan Tomoo Tomoyuki Yoshiyuki Yan Jeong-Ju Tomonobu Haoping Lu US BE US US DE KR US CN JP JP JP US KR JP US CN YUN ZHAO ZHONG ZHOU ZHU Kugjin Yin Hai Shan Huan Yongwei Samsung Information Systems Ghent University/MMLab Mitsubishi Electric ContentGuard, Inc. Panasonic Samsung Electronics Huawei Technologies (USA) Huawei Technologies Co., Ltd. TOSHIBA Corporation Sharp NTT Qualcomm Inc ETRI KDDI Huawei Technologies (USA) Zhejiang University ETRI(Electronics and Telecommunications Research Institute) Zhejiang University Panasonic Singapore Laboratories Pte Ltd Panasonic Singapore Laboratories Pte Ltd Institute for Infocomm Research 22 KR CN SG SG SG Annex B – Agenda Item 1 2 3 4 5 6 7 8 1 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 2 1 2 4 3 1 2 3 4 4 5 6 7 1 2 3 4 8 Opening Roll call of participants Approval of agenda Allocation of contributions Communications from Convenor Report of previous meeting Processing of NB Position Papers Work plan management Media coding HD-AAC Profile New Profile for ALS 960 frame length in MPEG-4 AAC AFX 3rd edition Multiresolution profile Scalable-complexity 3D mesh compression Open Font Format extensions Media Value Chain Ontology Codec Configuration Representation Video Tool Library Spatial Audio Object Coding Unified Speech and Audio Coding Representation of Sensory Experience 3D Video Coding High-Performance Video Coding New directions in future audio coding Composition coding Interactive Digital Radio LASeR Adaptation Presentation of Structured Information Description coding Video Signature Tools Metadata driven post processing of audio signals Audio description coding standards Extraction and Matching of Image Signature Tools Systems support IPMP Digital Item Transport and File formats Carriage of SVC in MPEG-2 Systems Carriage of MVC in MPEG-2 Systems Miscellaneous additions to File Format AVC File Format extensions for MVC Multimedia architecture 23 1 2 3 4 5 9 1 10 1 11 1 2 3 4 5 6 7 8 9 10 11 12 12 1 2 3 4 5 6 7 8 9 10 11 12 13 14 13 1 2 3 4 5 6 7 8 14 9 Interfaces with virtual worlds MXM Architecture and API MXM API Advanced IPTV Terminal Rich Media UI Framework Application formats Interactive Music Application Format Protocols MXM Protocols Reference implementation AAC-ELD Reference Software MVC Reference Software File Format Reference Software Geometry and Shadow Reference Software 3D Graphics Compression Model Reference Software Scene Partitioning Reference Software Image Signature Tools Reference Software Protected Musical Slide Show MAF Reference Software Musical Slide Show MAF Reference Software Professional Archival MAF Reference Software Video Surveillance MAF Reference Software MXM Reference Software Conformance MVC Conformance MPEG-4 Audio Conformance AAC-ELD, OAFI and additional AAC Conformance File Format Conformance Scene Partitioning Conformance MultiResolution Profile Conformance 3D Graphics Compression Model Conformance Image Signature Tools Conformance Photo Player MAF Conformance Musical Slide Show MAF Conformance Professional Archival MAF Conformance Video Surveillance MAF Conformance Video Tool Library Conformance MXM Conformance Maintenance Systems coding standards Video coding standards Audio coding standards 3DG coding standards Visual description coding standards Audio description coding standards MPEG-21 standards MPEG-A standards Work plan and time line Organisation of this meeting 24 1 2 10 1 2 3 4 5 6 7 1 2 3 4 8 9 11 1 2 3 12 13 14 Tasks for subgroups Joint meetings WG management Terms of reference Officers Editors Liaisons Work item assignment Ad hoc groups Asset management Reference software Conformance Test material URI IPR management Work plan Administrative matters Responses to National Bodies Schedule of future MPEG meetings Promotional activities Resolutions of this meeting A.O.B Closing 25 Annex C – Input contributions Number Source m15944 Webmaster m15945 Francisco Morán Burgos, Patrick Gioia m15946 Yi-Shin Tung, Teruhiko Suzuki Title Lausanne document register Ad Hoc Group on 3DGC documents, software maintenance and core experiments Ad Hoc Group on Maintenance of MPEG-4 Visual related Documents, Reference Software and Conformance m15947 Euee S. Jang, Marco Mattavelli, Ad Hoc Group on Reconfigurable Video Coding Kazuo Sugimoto m15948 Miroslaw Bober, Paul Brasnett, Ryoma Oami Ad Hoc Group on MPEG-7 Visual m15949 Hideaki Kimata, Aljoscha Smoli, Anthony Vetro Ad Hoc Group on 3D Video and FTV Coding Jens-Rainer Ohm, Jörn m15950 Ostermann, Ajay Luthra, Jason Suh, T.K. Tan m15951 Filippo Chiariglione, Marius Preda Ad Hoc Group on High-Performance Video Coding Ad Hoc Group on MxM m15952 R. Sperschneider Ad Hoc Group on Audio Standards Maintenance m15953 S. Quackenbush, P. Philippe Ad Hoc Group on SAOC, USAC m15954 Jean Gelissen, Marius Preda, Keiji Mitsubuchi Ad Hoc Group on Information Exchange with Virtual Worlds m15955 Young-Kwon Lim, Jaeyeon Song, Cyril Concolato Ad Hoc Group on Scene Representation m15956 David Singer Ad Hoc Group on MPEG File Formats Kyuheon Kim, Hui Yong Kim, m15957 Jean Cha, Noboru Harada, Hendry Ad Hoc Group on Application Format m15958 Sanghyun Joo, Jean Gelissen, Christian Timmerer Ad Hoc Group on the RoSE Framework m15959 Xin Wang, Young Kwon Lim Ad Hoc Group on Advanced IPTV Terminal m15960 Vladimir Levantovsky Ad Hoc Group on Font Format Representation m15961 SC 29 Secretariat Summary of Voting on ISO/IEC 138184:2004/Amd.2:2005/DCOR 2 [SC 29 N 9874] m15962 SC 29 Secretariat Summary of Voting on ISO/IEC 13818-7:2006/DCOR 1 [SC 29 N 9875] 26 m15963 SC 29 Secretariat Summary of Voting on ISO/IEC 144963:2005/Amd.2:2006/DCOR 4 [SC 29 N 9876] m15964 SC 29 Secretariat Summary of Voting on ISO/IEC 144963:2005/Amd.3:2006/DCOR 2 [SC 29 N 9877] m15965 SC 29 Secretariat Summary of Voting on ISO/IEC 14496-4:2004/DCOR 6 [SC 29 N 9878] m15966 SC 29 Secretariat Summary of Voting on ISO/IEC 144964:2004/Amd.13:2007/DCOR 2 [SC 29 N 9879] m15967 SC 29 Secretariat Summary of Voting on ISO/IEC 14496-15:2004/DCOR 3 [SC 29 N 9880] m15968 SC 29 Secretariat Summary of Voting on ISO/IEC 14496-20:200X 2nd Edition/PDAM 2 [SC29 N 9881] m15969 SC 29 Secretariat Summary of Voting on ISO/IEC 23000-9:2008/PDAM 1 [SC 29 N 9882] m15970 ITTF via SC 29 Secretariat Table of Replies on ISO/IEC FDIS 23000-8 [SC 29 N 9910] m15971 ITTF via SC 29 Secretariat Table of Replies on ISO/IEC FDIS 15938-12 [SC 29 N 9911] m15972 ITTF via SC 29 Secretariat Table of Replies on ISO/IEC 21000-7:2007/FDAM 1 [SC 29 N 9912] m15973 SC 29 Secretariat Summary of Voting on ISO/IEC 14496-3:2005/DCOR 6 [SC 29 N 9913] m15974 SC 29 Secretariat Summary of Voting on ISO/IEC 144963:2005/Amd.9:2008/DCOR 1 [SC 29 N 9914] m15975 IEC TC 100 via SC 29 Secretariat IEC CDV 62546 [SC 29 N 9917] m15976 ITTF via SC 29 Secretariat Table of Replies on ISO/IEC 14496-5:2001/FDAM 18 [SC 29 N 9928] m15977 SC 29 Secretariat Summary of Voting on ISO/IEC 2300010:200X/PDAM 1 [SC 29 N 9929] m15978 ITU-R SG 6 via SC 29 Secretariat Liaison Statement from ITU-R SG 6 [SC 29 N 9930] m15979 ITU-R SG 6 via SC 29 Secretariat Liaison Statement from ITU-R SG 6 [SC 29 N 9931] m15980 SC 29 Secretariat Summary of Voting on ISO/IEC 14496-4:2004/FPDAM 34 m15981 SC 29 Secretariat Summary of Voting on ISO/IEC 14496-5:2001/PDAM 15 m15982 SC 29 Secretariat Summary of Voting on ISO/IEC 14496-4:2004/FPDAM 33 [SC 29 N 9948] 27 m15983 SC 29 Secretariat Summary of Voting on ISO/IEC 14496-5:2001/FPDAM 22 [SC 29 N 9949] m15984 SC 29 Secretariat Summary of Voting on ISO/IEC FCD 14496-22 [2nd Edition] m15985 ITTF via SC 29 Secretariat Table of Replies on ISO/IEC 13818-1:2007/FDAM 3 m15986 ITTF via SC 29 Secretariat Table of Replies on ISO/IEC 14496-4:2004/FDAM 30 m15987 ITTF via SC 29 Secretariat Table of Replies on ISO/IEC 14496-5:2001/FDAM 20 m15988 SC 29 Secretariat Summary of Voting on ISO/IEC 13818-1:2007/PDAM 4 m15989 SC 29 Secretariat Summary of Voting on ISO/IEC 14496-3:2005/PDAM 10 m15990 SC 29 Secretariat Summary of Voting on ISO/IEC 14496-4:2004/PDAM 37 m15991 SC 29 Secretariat Summary of Voting on ISO/IEC 14496-4:2004/PDAM 39 m15992 SC 29 Secretariat Summary of Voting on ISO/IEC 14496-5:2001/PDAM 25 m15993 SC 29 Secretariat Summary of Voting on ISO/IEC 14496-15:2004/PDAM 3 m15994 SC 29 Secretariat Summary of Voting on ISO/IEC 15938-12:2008/DCOR 1 m15995 SC 29 Secretariat Summary of Voting on ISO/IEC CD 21000-19 m15996 SC 29 Secretariat Summary of Voting on ISO/IEC 23000-6:200X/PDAM 1 m15997 ITTF via SC 29 Secretariat Table of Replies on ISO/IEC 14496-16:2006/FDAM 2 m15998 SC 29 Secretariat Summary of Voting on ISO/IEC 14496-4:2004/PDAM 38 m15999 SC 29 Secretariat Summary of Voting on ISO/IEC 144965:2001/Amd.10:2007/DCOR 3 m16000 SC 29 Secretariat Summary of Voting on ISO/IEC 1449620:200X/PDAM 3 m16001 SC 29 Secretariat Summary of Voting on ISO/IEC 14496-12:200X/DCOR 2 & ISO/IEC 15444-12:200X/DCOR 2 m16002 SC 29 Secretariat Summary of Voting on ISO/IEC 1449610:200X/PDAM 1 m16003 W3C via SC 29 Secretariat Liaison Statement from W3C [SC 29 N 9930] m16004 IEC TC 100 via SC 29 Secretariat m16005 TTA via SC 29 Secretariat IEC CDV 61937-11 [SC 29 N 9951] Liaison Statement from TTA [SC 29 N 9959] 28 m16006 SC 24 via SC 29 Secretariat m16007 IEC TC 100 via SC 29 Secretariat ISO/IEC FCD 19775-2.2 2nd Edition [SC 29 N 9958] IEC CD 62455 [SC 29 N 9960] m16008 Leonardo Chiariglione Use cases for consideration by Ad Hoc Group on Advanced IPTV Terminal m16009 Leonardo Chiariglione Technologies for consideration by Ad Hoc Group on Advanced IPTV Terminal m16010 Walter Allasia Peer-to-Peer iDRM m16011 Filippo Chiariglione ,Tiejun Huang Web, Internet and Mobile TV m16012 Leonardo Chiariglione The Digital Media in Italia proposal m16013 Lucia Marchisio Open IPTV Platform For an Open Content Market m16014 Young-Kwon LIM Approaching the Zettabyte Era m16015 Young-Kwon LIM Contribution to the scope of the planned Advanced IPTV Terminal standard m16016 SC 29 Secretariat Establishment of JTC 1/Study Group on Digital Content Management and Protection m16017 Leonardo Chiariglione The MPEG Vision Shun-ichi Sekiguchi Yoshihisa Yamada m16018 Yoshiaki Kato Kohtaro Asai Tokumichi Murakami Response to call for test materials for HVC study Shun-ichi Sekiguchi Shuichi Yamagishi Yoshihisa Yamada m16019 Yoshiaki Kato Kohtaro Asai Tokumichi Murakami On coding efficiency with extended block size for UHDTV m16020 IFPI via SC 29 Secretariat Liaison Statement from IFPI [SC 29 N 9995] m16021 Gangyi Jiang Depth Map Compression for View Synthesis in FTV m16022 Japan National Body JNB comment on the resolution 3.5.4 m16023 Simon Daniels Vladimir Levantovsky m16024 Christian Timmerer m16025 Sergio Arnaldo Francisco Morán Burgos Olgierd Stankiewicz m16026 Krzysztof Wegner Krzysztof Klimaszewski Proposal for a new work item for ISO/IEC 14496-22 MPEG Representation of Sensory Effects Vision Corrections to "WD3.0 of ISO/IEC 14496-16 AMD4, Scalable Complexity 3D Mesh Coding" Results of 3DV/FTV Exploration Experiments, described in w10173, for Alt Moabit sequence. 29 m16027 Krzysztof Wegner Olgierd Stankiewicz Analysis of sub-pixel precision in Depth Estimation Reference Software and View Synthesis Reference Software m16028 Olgierd Stankiewicz Krzysztof Wegner Application of Middle Level Hypothesis algorithm for improvement of depth maps produced by Depth Estimation Reference Software. m16029 IEC TC 100 via SC 29 Secretariat Liaison Statement from IEC TC 100 [SC 29 N 10025] m16030 Yi-Shin Tung Hwa Seon Shin Editor's Input on Study Text of ISO/IEC FCD 23002-4 Hwa Seon Shin Sowon Kim Minsoo Park m16031 Hyungyu Kim Sinwook Lee Byeongho Choi Euee S. Jang Revised FU Network and Tokens for MPEG-4 SP m16032 Singapore National Body SGNB Comments on Multiview Video Coding Profile Gwo Giun Lee Jia-wei Liang m16033 He-Yuan Lin Ming-Jiun Wang Functional unit of AVC deblocking filter with MBAFF Fons Bruls Lincoln Lobo m16034 Yin Zhao Lu Yu Basic LDV view-synthesis/renderer SW : LDVS m16035 TK Tan Yoshinori Suzuki Noboru Harada Tilman Liebchen m16036 Takehiro Moriya Yutaka Kamamoto m16037 Kangchan Lee Seungyun Lee Kyungmo Park Cyril Concolato m16038 Jean Le Feuvre Giovanni Cordara m16039 Wonsuk Lee Seungyun Lee Yin Zhao m16040 Deliang Fu Lu Yu Response to Call for Test Materials for HighPerformance Video Coding Standards Development Proposed Text of ISO/IEC 144963:2005/Amd.2:2006/DCOR4 Proposal of MXM Ontology for inter-MXM communication protocols Items under considerations in Rich UI Framework Proposal for new APIs of video metadata on MXM APIs LDV Virtual View Rendering Software 30 Fons Bruls Lincoln Lobo Yin Zhao Deliang Fu m16041 Lu Yu Lianhuan Xiong Temporal Improvement Method in View Synthesis Yin Zhao m16042 Deliang Fu Lu Yu 3DV EE3 Report on Champagne_tower Sequences m16043 Yin Zhao Lu Yu 3DV EE4 Report on Dog Sequences m16044 Mohamad Raad Comment on the unified speech and audio coding activity m16045 Mohamad Raad comment on the exploration on metadata driven postprocessing of audio Carmen CHENG m16046 Yan HUO Yu LIU 3DV EE3 results on Dog sequence Carmen CHENG m16047 Yan HUO Yu LIU 3DV EE4 results on Dog sequence Hui Yuan Yilin Chang Haitao Yang m16048 Xiaoxian Liu Sixin Lin Lianhuan Xiong Depth Estimation Improvement for Depth Discontinuity Areas and Temporal Consistency Preserving Xiaoxian Liu Yingying Guo Haitao Yang m16049 Junyan Huo Yilin Chang Sixin Lin Lianhuan Xiong 3DV/FTV EE3/EE4 Results on Alt Moabit sequence Siping Tao Ying Chen m16050 Miska M. Hannuksela Houqiang Li Depth Map Coding Quality Analysis for View Synthesis Seungju Han Hyunjeong Lee m16051 Jae-Joon Han jeong-hwan ahn Full motion control and navigation of avatar/object with multi-input sources in MPEG-V m16052 Woo-Jin Han JeongHoon Park Samsung's response to Call for Test Materials for MPEG HVC standardization 31 IlKoo Kim Tammy Lee Ken McCann m16053 Ivana Radulovic Per Fröjdh 3DTV Exploration Experiments on Pantomime sequence m16054 Stefan Doehla Scalable Audio and MP4 Kihyun Choo m16055 Junghoe Kim Eunmi Oh Comments on WD of Unified Speech and Audio Coding m16056 Markus Schnell Per Ekstrand Proposed Draft Corrigendum on AAC-ELD Novel approaches to remote display representations: BiFS-based solution and its deployment within the FP7 MobiThin project m16057 Françoise PRETEUX Mihai MITREA Pieter SIMOENS Bojan JOVESKI m16058 Bert VANKEIRSBILCK Abdeslam TAGUENGAYTE Françoise PRETEUX Novel approaches to remote display representations: BiFS-based solution and its deployment within the FP7 MobiThin project Sehoon Yea m16059 Zafer Arican Anthony Vetro Results of Exploration Experiments in 3D Video for Lovebird2 m16060 Cheon Lee Yo-Sung Ho EE1: Depth Estimation Results on 'Pantomime? Sequence m16061 Cheon Lee Yo-Sung Ho EE2: View Synthesis Results on 'Pantomime? Sequence m16062 Cheon Lee Yo-Sung Ho EE4: Coding Results on 'Pantomime? Sequence Sang-Beom Lee m16063 Cheon Lee Yo-Sung Ho m16064 Cheon Lee Yo-Sung Ho Cheon Lee Jae-Il Jung m16065 Yun-Suk Kang Yo-Sung Ho m16066 Takanori Senoh Kenji Yamamoto Experimental Results on Improved Temporal Consistency Enhancement Implementation of Boundary Noise Removal for View Synthesis Additional Test Sequence for 3D Video Report of 3DV/FTV Exploration E xperiments with Champagne Tower 32 Ryutaro Oi Tomoyuki Mishina Makoto Okui Gun Bang Gi Mun Um m16067 Namho Hur Jinwoong Kim 3DV/FTV EE results of Depth Estimantion and View Synthesis on "lovebird1" sequence Gun Bang Gwang sin Cho m16068 Namho Hur Jinwoong Kim Donggyu Sim 3DV/FTV EE4 result of Coding Experiment on "Dog" sequence m16069 Steffen Kamp Mathias Wien Fast Decoder Side Motion Vector Derivation with Candidate Scaling for Improving AVC Compression Performance Gun Bang Jaeho Lee m16070 Namho Hur Jinwoong Kim The consideration of the imrpoved depth estimation algorithm m16071 Andy Tescher for USNB Response to resolution 3.5.4 of 86-th WG 11 meeting m16072 Jean H.A. Gelissen (ed) MPEG-V CfP Response Seo-Young Hwang Jaeyeon Song m16073 Young-Kwon Lim Jean Le Feuvre Comment on Study text of 14496-20 PDAM2 Seo-Young Hwang m16074 Jaeyeon Song Young-Kwon Lim Improvement of parsingSwitch on 14496-20 PDAM2 Seo-Young Hwang m16075 Jaeyeon Song Young-Kwon Lim Service Scenario examples on 14496-20 PDAM2 Hyungyu Kim Sinwook Lee Hwa Seon Shin m16076 Sowon Kim Minsoo Park Euee S. Jang Comments on ISO/IEC 23001-4 FCD 2 Hyungyu Kim m16077 Sinwook Lee Euee S. Jang Update proposal on the Vision of RVC Hui Yong Kim Myung Seok Ki m16078 HanKyu Lee Houari Sabirin Updated text, conf. files, and ref. sw for ISO/IEC 23000-9 (DMB-AF) 33 Munchurl Kim Jung Soo Lee Yong Han Kim Herve Taddei m16079 Minjie Xie Qing Zhang Discussion on the Unified Speech and Audio Coding Activity m16080 Jean H.A. Gelissen (ed) MPEG-V CfP Response Inseon Jang m16081 Huiyong Kim Jeongil Seo Study text of ISO/IEC 23000-12 WD Interactive music application format Tomonobu Yoshino m16082 Sei Naito Shigeyuki Sakazawa Preliminary response for Draft Call for Evidence on High Performance Video Coding m16083 WG 1 via SC 29 Secretariat Liaison Statement from SC 29/WG 1 Kwang-Ki Kim Jeongil Seo m16084 Seungkwon Beack Kyeongok Kang MinsooHahn CE on Residual Coding Process for Post Downmix Gain m16085 jean Le Feuvre Cyril Concolato Zhong Haishan Zhou Huan m16086 Chong Kok Seng Tomokazu Ishikawa Takeshi Norimatsu Comments on LASeR PDAM2 Efficient inter-object relation indicator for SAOC m16087 Patrick Lopez Dong Tian 3DV/FTV EE3 : LeavingLaptop and Lovebird1 m16088 Patrick Lopez Dong Tian 3DV/FTV EE4 : Dog sequence m16089 Teruhiko Suzuki Comments on 14496-12:200X FPDAM1 Masayuki Tanimoto m16090 Toshiaki Fujii Kazuyoshi Suzuki View Synthesis Algorithm in View Synthesis Reference Software 2.0 (VSRS2.0) Masayuki Tanimoto m16091 Toshiaki Fujii Kazuyoshi Suzuki View Synthesis Method without Blending Masayuki Tanimoto m16092 Toshiaki Fujii Kazuyoshi Suzuki Depth Estimation Reference Software (DERS) with Image Segmentation and Block Matching m16093 Masayuki Tanimoto Toshiaki Fujii Data Format for FTV 34 Kazuyoshi Suzuki m16094 Mejdi Trimeche Miska M Hannuksela Results of 3D Video Coding Experiments EE1 and EE2 for Dog Data Set Jonas Engdegård Heiko Purnhagen Oliver Hellmuth Johannes Hilpert Maria Luis Valero m16095 Andreas Hölzer Markus Schnell Leonid Terentiev Erik Schuijers Per Ekstrand Information regarding CE on Low Delay MPEG SAOC Jonas Engdegård Heiko Purnhagen m16096 Oliver Hellmuth Leonid Terentiev Erik Schuijers Information regarding CE on Low Power MPEG SAOC Cornelia Falch Leonid Terentiev m16097 Johannes Hilpert Oliver Hellmuth Information regarding mixing mode for the enhanced Karaoke/Solo processing Leonid Terentiev m16098 Cornelia Falch Oliver Hellmuth Proposal for MCU functionality extension for the MPEG SAOC Jonas Engdegård Heiko Purnhagen Cornelia Falch Leonid Terentiev Andreas Hölzer m16099 Oliver Hellmuth Johannes Hilpert Yang-Won Jung Henney Oh Jeroen Koppens Report on corrections for the MPEG SAOC FCD text and RM software Heiko Purnhagen Cornelia Falch Leonid Terentiev Oliver Hellmuth m16100 Johannes Hilpert Yang-Won Jung Henney Oh Jeroen Koppens Proposal for dynamic preset extension for the MPEG SAOC m16101 Shinya Shimizu Hideaki Kimata m16102 Zhuangfei Wu 3DV/FTV EE Report on Doorflower sequence Updates to the MVC File Format 35 Per Fröjdh m16103 Yang-Won Jung Henney Oh Consideration on User Interface in SAOC m16104 Yang-Won Jung Henney Oh Proposal for adding information on object characteristics in SAOC m16105 Yang-Won Jung Henney Oh Proposal for including guideline information on the rendering parameters in SAOC m16106 Yang-Won Jung Henney Oh Comments on the enhanced karaoke mode in SAOC Pierrick Philippe m16107 Gregory Pallone Marc Emerit Proposed Audio Sequences for MPEG-D SAOC m16108 Matthias Gruhne Study on ISO/IEC TR 15938-8:2002/FPDAM 4 Shangwen Li m16109 Lu Yu Lianhuan Xiong Second Order Prediction of Video Coding m16110 Werner Oomen Comment on the unified speech and audio coding activity m16111 Tomoo Yamakage Potential Corrigendum Items for MPEG-2 Systems m16112 Sangil Na DongSeok Jeong Proposal to remoe pair in comparison pair list for indpendence test m16113 WonGeun Oh JuKyong Jin Ground true table & incorrect video query clips of AVC, CC m16114 Miska M. Hannuksela Ying Chen On MVC File Format m16115 Miska M. Hannuksela On FPDAM1 of ISO Base Media File Format m16116 S. Quackenbush 86th MPEG Audio Report Frans de Bont Stefan Döhla m16117 Heiko Purnhagen Alexander Gröschel Thoughts on MPEG Surround signaling Hyun-Kook Lee Dong Soo Kim m16118 Sungyong Yoon Jaehyun Lim Considerations on the development of common USAC reference encoder Dong Soo Kim Sungyong Yoon m16119 Hyun-Kook Lee Jaehyun Lim Proposed syntax revision on USAC RM0 m16120 Dong Soo Kim Sungyong Yoon Proposed syntax revision regarding SBR bitstream on USAC RM0 36 Hyun-Kook Lee Jaehyun Lim Heiko Purnhagen m16121 Jeroen Koppens Matthias Neusinger Further corrections to MPEG Surround text Dong Soo Kim Sungyong Yoon m16122 Hyun-Kook Lee Jaehyun Lim Efficient signaling for FD frame on USAC RM0 Dong Soo Kim Sungyong Yoon m16123 Hyun-Kook Lee Jaehyun Lim Comment on random access issue on USAC RM0 Heiko Purnhagen Jeroen Koppens m16124 Claus-Christian Spenger Matthias Neusinger Corrections to MPEG Surround reference software Dong Soo Kim Sungyong Yoon m16125 Hyun-Kook Lee Jaehyun Lim Proposed syntax revision regarding window sequence on USAC RM0 Jaime Delgado Eva Rodríguez Víctor Rodríguez-Doncel m16126 Silvia Llorente Rubén Barrio Víctor Torres DMAG-UPC Comments on WD2.0 of MXM API Ralf Geiger Fabian Haussel m16127 Michael Haertl Virgilio Bacigalupo Proposed Corrigendum on MPEG-4 SLS Conformance Victor Rodriguez-Doncel m16128 Jaime Delgado Ruben Tous Presentation of the W3C MAWG Activities Y. Wang K. Müller m16129 P. Merkle A. Smolic Results of Exploration Experiments in 3D Video Coding for Dog Data Set Aljoscha Smolic Karsten Mueller m16130 Peter Kauff Thomas Wiegand Considerations about the Vision of a 3D Video Standard Laurent Primaux m16131 Owen Lagadec Emmanuel Bouix Report of Mini Experiment on IM AF Constraints representation 37 Fabien Gallot Inseon Jang Hui Yong Kim Jeongil Seo Kyeongok Kang Laurent Primaux Owen Lagadec m16132 Emmanuel Bouix Fabien Gallot Constraints Specifications for IM AF Next generation Broadcasting m16133 Forum(Korea) Proposed Text for WD of ISO/IEC 23000-11 Stereoscopic Video AF Conformance and Reference Software Laurent Primaux Owen Lagadec Emmanuel Bouix Fabien Gallot m16134 Inseon Jang Hui Yong Kim Jeongil Seo Kyeongok Kang Constraints representation method for IM AF m16135 Fons Bruls Lincoln Lobo delete Hussein Aman-Allah m16136 Ihab Amer Marco Mattavelli An AVC Entropy Coding Module for the MPEG RVC VTL Ehab Asaad Hanna m16137 Ihab Amer Marco Mattavelli An AVC Motion estimation Module for the MPEG RVC VTL Karim Maarouf m16138 Ihab Amer Marco Mattavelli An AVC Intra Prediction Module for the MPEG RVC VTL m16139 Fons Bruls Lincoln Lobo Philips 3DV EE1,2,3,4 results m16140 Kristofer Kjörling Heiko Purnhagen Core Experiment procedures and MPEG reference software encoder m16141 Kristofer Kjörling Andreas Schneider Proposal for splitting the current AAC family profiles into two Lars Villemoes m16142 Per Ekstrand Kristofer Kjörling Core experiment proposal on the USAC eSBR module m16143 Pierrick Philippe Proposed improvements for MPEG Audio Core Experiment Methodology and Reference Software Development 38 Stephan Schreiner m16144 Wolfgang Fiesel Akshaya Thippur Perspectives on Application Scenarios for PostProcessing Audio Metadata Matthieu Wipliez m16145 Mickael Raulet Jean-François Nezan Proposed changes for RVC-CAL annex A of ISO-IEC 23001-4 Roch Lefebvre m16146 Philippe Gournay Redwan Salami Comments on Core Experiments methodology for MPEG USAC standardisation Philippe Gournay Bruno Bessette m16147 Roch Lefebvre Redwan Salami Proposed Core Experiment on LPC Quantization for USAC Khaled Mamou Titus Zaharia m16148 Marius Preda Françoise Preteux Attributes Encoding for TFAN Benoit Le Bonhomme m16149 marius.preda@int-evry.fr Françoise Preteux Scalable Complexity Mesh Coding Benchmark Benoit Le Bonhomme m16150 marius.preda@int-evry.fr Françoise Preteux MMW.com API extension for 3D graphics attributes Ivica Arsov m16151 marius.preda@int-evry.fr Françoise Preteux MXM API for 3D Graphics content creation Blagica Jovanova m16152 marius.preda@int-evry.fr Françoise Preteux Selecting elementary streams in MP25 RefSoft Max Neuendorf Philippe Gournay Jérémie Lecomte Markus Multrus m16153 Stefan Bayer Guillaume Fuchs Ralf Geiger Frederik Nagel Proposed Corrections to WD and Reference Software on Unified Speech and Audio Coding Jérémie Lecomte Max Neuendorf m16154 Ralf Geiger Markus Multrus Proposed Update on USAC Bitstream Syntax m16155 Christian Timmerer On WD 1.0 of ISO/IEC 21000-2:2005 AMD1 (PSI) m16156 Taejin Lee Max Neuendorf Progress of Technology Merge Between System 2 and USAC RM 39 Jeremie Lecomte Kyeongok Kang Bernhard Grill m16157 Markus Waltl Christian Timmerer Minor Corrections to RoSE WD 2.0 XML Schema m16158 Christian Timmerer Updates for the MPEG Extensible Middleware Maria Teresa Andrade Vítor Barbosa Anna Carreras Pedro Carvalho Giovanni Cordara Jaime Delgado Safak Dogan Frederic Dufaux Touradj Ebrahimi Gianluca Francini Isabel Gallego m16159 Lutz Goldmann Ivan Ivanov Thien Ha Minh Shan Jin Hemantha Kodikara Arachchi Gelareh Mohammadi Marta Mrak Adam Pietrowcew Toni Rama Eva Rodríguez Thomas Sikora Rubén Tous Extended template for the Advanced Surveillance AF Bernhard Grill Jürgen Herre m16160 Ralf Geiger Max Neuendorf Markus Multrus Thoughts on Core-Experiment Methodology Maria Teresa Andrade Vítor Barbosa Anna Carreras Pedro Carvalho Giovanni Cordara Jaime Delgado Safak Dogan m16161 Frederic Dufaux Touradj Ebrahimi Gianluca Francini Isabel Gallego Lutz Goldmann Ivan Ivanov Thien Ha Minh Contribution to the Advanced Surveillance AF 40 Shan Jin Hemantha Kodikara Arachchi Gelareh Mohammadi Marta Mrak Adam Pietrowcew Toni Rama Eva Rodríguez Thomas Sikora Rubén Tous m16162 Guillaume Fuchs Markus Multrus Eva Rodríguez m16163 Jaime Delgado Isabel Gallego m16164 Ihab Amer Marco Mattavelli Proposed Update of Arithmetic Coder Tables for USAC Fragments governance for the Advanced Surveillance AF An MPEG Fixed Point IDCT Module for the RVC VTL Fons Bruls m16165 Lincoln Lobo Wiebe de Haan On addressing market 3D developments, Stereo & MPEG 3DV activity. m16166 Mauri Väänänen On the Unified Speech and Audio Coding Activity m16167 Markus Waltl Christian Timmerer Updates and Additional Tools for MPEG RoSE Christian Timmerer Mark Stuard m16168 Franc Kozamernik Jari Ahola Use cases and Requirements for Advanced Internet TV Terminals Jin-Seo Kim Maeng-Sub Cho m16169 Bon-Ki Koo Yong Soo Joo Sang-Kyun Kim A simple RoSE system implementation including SDC, USP, and SDCom Jin-Seo Kim Maeng-Sub Cho m16170 Bon-Ki Koo Yong Soo Joo Sang-Kyun Kim A demonstration for reference color type and its parameters in RoSE m16171 Kota Iwamoto Ryoma Oami Response to the Call for Proposals on Video Signature Tools Paul Brasnett m16172 Stavros Paschalakis Miroslaw Bober Response to the Call for Proposals on Video Signature Tools JungHoe Kim m16173 Julien Robilliard Eunmi Oh Progress report on phase experiment for USAC 41 Bernhard Grill m16174 Jean H. A. Gelissen (Ed) MPEG-V CfP Response Dong Tian m16175 Po-Lin Lai Patrick Lopez 3DV EE1 & EE2 on Leaving_Laptop and Improvements in ViSBD 2.1 Houari Sabirin Hendry m16176 Noboru Harada Munchurl Kim Status report on ISO/IEC 23000-6 Professional Archival Application Format Reference Software and Conformance files m16177 Hosang Sung Eunmi Oh Progress report on unvoiced speech coding m16178 Jens-Rainer Ohm Received responses to CfP on Video Signature Tools Cyril Concolato m16179 Jean Le Feuvre Benoit Pellan Comments on requirements for a new BIFS profile m16180 David Singer On the MVC File format (14496-15 amendment) m16181 David Singer Errata report for 14496-12:2005 (ISO Base Media File Format) m16182 David Singer On Movie Fragments, Edit Lists, and other timing questions, for 14496-12 (ISO Base Media File Format) m16183 Filippo Chiariglione Proposed WD3.0 of MxM Architecture and Technologies m16184 Filippo Chiariglione Proposed WD3.0 of MxM APIs m16185 Filippo Chiariglione Proposed WD2.0 of MxM Ref. SW. and Conf. m16186 Filippo Chiariglione Proposed WD2.0 of 2nd edition of ISO/IEC 29116-1 (MXM Protocols) m16187 Patrick Gioia MXM use-case proposals for 3D services m16188 Leonardo Chiariglione Proposal of Advanced IPTV Terminal (AIT) requirements Jaewon Sung m16189 Yong-Joon Jeon Byeong-Moon Jeon 3DV EE1 and EE2 Results on Newspaper Sequence Jaewon Sung m16190 Yong-Joon Jeon Byeong-Moon Jeon 3DV EE4 Results on Pantomime Sequence m16191 Yasuaki Tokumo Shin-ya Hasegawa Yasuaki Tokumo m16192 Shin-ya Hasegawa Takuya Iwanami Study on Sensory Effect Metadata Proposal for Sensory Effect Metadata 42 m16193 IDF via SC 29 Secretariat m16194 World DMB Forum via SC 29 Secretariat Liaison Statement from the International DOI Foundation (IDF) Liaison Statement from the World DMB Forum Kyoungsoo Son Seungwook Lee m16195 Bonki Koo Daiyong Kim Euee S. Jang An Explanation of SVA and QBCR En-Decoding Algorithm Seungwook Lee Bonki Koo m16196 Daiyong Kim Kyoungsoo Son Euee S. Jang Bitstream Syntax and Semantics for QBCR and SVA Seungwook Lee Bonki Koo m16197 Daiyong Kim Kyoungsoo Son Euee S. Jang CE Report Version 3 on the SC3DMC Daiyong Kim Seungwook Lee m16198 Kyoungsu Son Preda Marius A Report on the Conformance Test of 3D Graphics Group Seungwook Lee Bonki Koo m16199 Daiyong Kim Kyoungsoo Son Euee S. Jang A Report on the Reference Software of SC3DMC Miyoung Kim m16200 Junghoe Kim Eunmi Oh Proposed BSAC Conformance Bitstreams for Terrestrial DMB B. S. Choi SangHyun Joo m16201 HaeRyong Lee KwangRo Park Comments and Proposal for Sensory Effect Metadata Kei Kikuiri m16202 Nobuhiko Naka Kousuke Tsujino Comments on USAC Standardization Activities m16203 Sanghyun Joo A proposal for RoSE system architecture m16204 Aljoscha Smolic delete m16205 Aljoscha Smolic delete m16206 Aljoscha Smolic delete m16207 Kemal Ugur Requirements for high-performance video standards 43 Justin Ridge m16208 Mohamad Raad Proposal for the development of a common MPEG Audio encoder for use in the CE phase Jungyoup Yang Kwanghyun Won m16209 Byeungwoo Jeon Su Nyeon Kim Motion Vector Coding with Optimal Predictor m16210 Pierrick Philippe on behalf of the FRNB FRNB Comment on Video Coding Sinwook Lee Sowon Kim m16211 Jeonghwan Ahn Euee S. Jang Source code for Interpolation Compression for MPEP-4 part 25 m16212 Thomas Davies BBC 1080p50 test materials for HVC study m16213 Korea National Body Late KNB comment on 14496-20 PDAM2 Yongwei Zhu Susanto Rahardja m16214 Te Li Haibin Huang A proposal for streaming support for IM AF m16215 Japan National Body JNB Comment on High Performance Video Coding Activity m16216 P. Philippe Subjective Evaluation of Low Delay MPEG SAOC m16217 GRN Consortium via SC 29 Secretariat Liaison Statement from GRN Consortium m16218 IEC TC 100 via SC 29 Secretariat IEC CDV 60958-3/Amd.1 Kohtaro Asai m16219 Ryuta Suzuki Shun-ichi Sekiguchi Status of potential test materials for HVC with 4K or higher resolutions werner bailer peter shallauer m16220 mathias lux walter allasia francesco gallo Proposal for APIs of image and video metadata on MXM APIs m16221 Sanghyun Joo Disposition of MPEG RoSE within MPEG-V m16222 Next generation Broadcasting Forum(Korea) Proposed Corrigendum on ISO/IEC 23000-11 Stereoscopic Video Application Format m16223 JTC 1 SGSN via SC29 Secretariat Liaison Statement from JTC 1 Study Group on Sensor Network to SC 29 m16224 Ken McCann Woo-Jin Han Proposal on Focus for MPEG HVC standard development 44 Jason Suh m16225 Miska M. Hannuksela Stefan Döhla Miscellaneous comments on ISO/IEC 14496-12:2008 FPDAM1 m16226 ISO TC 223 via SC 29 Secretariat Liaison Statement from ISO TC 223 m16227 SC 29 Secretariat Informal Workshop on Videosurveillance m16228 Mohamad Raad for AUNB Comment on the discontinuation of JVT Kohtaro Asai m16229 Takuyo Kogure Hiroshi Yasuda Report of MPEG 20th anniversary commemoration event m16230 Jean Le Feuvre FNB Comment on LASeR PDAM2 m16231 EDItEUR via SC 29 Secretariat Liaison Statement from EDItEUR Leon Denis m16232 Dan Cernea Munteanu Updates concerning the MeshGrid compression software m16233 ITU-T SG 16 via SC 29 Secretariat Liaison Statement from ITU-T SG 16 m16234 Gero Bäse for GNB GNB on JVT matters Gary J. Sullivan Jens-Rainer Ohm m16235 Thomas Wiegand Ajay Luthra Meeting Report of the 30th JVT Meeting (29 January 2 February 2009, Geneva, CH) Gary J. Sullivan Jens-Rainer Ohm m16236 Thomas Wiegand Ajay Luthra Revised meeting report of the 27th JVT meeting (April 2008) 45 Annex D – Output documents Number Source Title w10311 Convener List of Documents from the 87th Meeting in Lausanne, Switzerland w10312 Convener Resolutions of the 87th Meeting in Lausanne, Switzerland w10313 Convener List of AHGs Established at the 87th Meeting in Lausanne, Switzerland w10314 Convener Report of the 87th Meeting in Lausanne, Switzerland w10315 Convener Guidelines for Electronic Distribution of MPEG M and N Documents w10316 Convener Press Release of the 87th Meeting in Lausanne, Switzerland w10317 Convener Meeting Notice of the 88th Meeting in Maui, US w10318 Convener Guide for WG 11 Meeting Hosts w10319 Convener Liaison Statement to ITU-T SG 16 w10320 3DGC DOCR on ISO/IEC 14496-4:2004/FPDAM 33 (Multi Resolution Profile Conformance) w10321 3DGC DOCR on ISO/IEC 14496-4:2004/FPDAM 34 (3D Graphics Model Conformance) w10323 3DGC DOCR on ISO/IEC 14496-5:2001/FPDAM 22 (3DGCM Reference Software) w10324 3DGC Text of ISO/IEC 14496-5:2001/FDAM 22 (3DGCM Reference Software) w10325 3DGC Text ISO/IEC 14496-5:2001/FPDAM 25 (Scene Partitioning Reference Software) w10326 3DGC Request for AMD w10327 3DGC WD1.0 of ISO/IEC 14496-5:2001/AMD 27 (SC3DMC RefSoft) w10328 3DGC Answer to liaison from W3D w10329 3DGC Text of ISO/IEC 14496-16:2006/PDAM4 (Scalable Complexity 3D Mesh Compression) w10330 3DGC CE on Scalable Complexity 3D Mesh Compression w10331 3DGC WD of ISO/IEC 14496-16 3rd Edition w10332 3DGC Request for subdivision of ISO/IEC 14496-27 w10333 3DGC Text of ISO/IEC 14496-27:200x/FDIS (3DG Conformance) w10334 3DGC Text of ISO/IEC 14496-27:200x/FPDAM1 (Scene partitioning conformance) w10335 3DGC Text of ISO/IEC 14496-27:2009/PDAM2 (SC3DMC Conformance) w10336 Convener AHG on 3DGC documents, software maintenance and core experiments w10337 Video Disposition of Comments on ISO/IEC 14496-4:2004/PDAM38 46 w10338 Video Text of ISO/IEC 14496-4:2004/FPDAM38 Multiview Video Coding Conformance Testing w10339 Video Disposition of Comments on ISO/IEC 14496-5:2001/PDAM 15 w10340 Video Text of ISO/IEC 14496-5:2001/FPDAM 15 Reference Software for Multiview Video Coding w10341 Video Disposition of Comments on ISO/IEC 14496-10:200X/PDAM 1 w10342 Video Text of ISO/IEC 14496-10:200X/FPDAM 1 Constrained Baseline Profile and supplemental enhancement information w10343 Video Defect Report on ISO/IEC 14496-10:200X w10344 Video Working Draft 1 of ISO/IEC 14496-10:200X/Amd.2 Multiview Field High Profile w10345 Video Description of Core Experiments in Video Signature Description development w10346 Video Request for ISO/IEC 23000-3/Amd.2 w10347 Video Text of ISO/IEC 23000-3/PDAM2 Conformance Testing for Photo Player MAF w10348 Video Disposition of Comments on ISO/IEC FCD 23001-4 w10349 Video Text of ISO/IEC FDIS 23001-4 Codec Configuration Representation w10350 Video Disposition of Comments on ISO/IEC FCD 23002-4 w10351 Video Text of ISO/IEC FDIS 23002-4 Video Tool Library w10352 Video Request for ISO/IEC 23002-4/Amd.1 w10353 Video Text of ISO/IEC 23002-4/PDAM1 Video Tool Library Conformance and Reference Software w10354 Video WD 4 of ISO/IEC 23002-4/Amd.2 (Tools for MPEG-2 MP, MPEG-4 ASP, AVC HP and SVC) w10355 Video RVC Work Plan and FU Development Status w10356 Video Description of Core Experiments in RVC w10357 Video Vision on 3D Video Coding w10358 Video Applications and Requirements of 3D Video Coding w10359 Video Call for 3D Test Material: Depth Maps & Supplementary Information w10360 Video Description of Exploration Experiments in 3D Video Coding w10361 Video Vision and Requirements for High-Performance Video Coding (HVC) w10362 Video Call for Test Materials for High-Performance Video Coding Standardisation w10363 Video Draft Call for Evidence on High-Performance Video Coding w10364 Convener Liaison statement to ITU-R SG6 re Multi-view Video Coding 47 w10365 Convener Response to National Bodies w10366 Convener Liaison statement to ITU-T SG16 re interlaced Multi-view Video Coding w10367 Convener AHG on Maintenance of MPEG-4 Visual related Documents, Reference Software and Conformance w10368 Convener AHG on Reconfigurable Video Coding w10369 Convener AHG on MPEG-7 Visual w10370 Convener AHG on 3D Video Coding w10371 Convener AHG on High-Performance Video Coding w10372 Convener AHG on AVC Development w10373 Audio DoC on ISO/IEC 13818-4:2004/AMD 2:2005/DCOR 2, AAC Conformance w10374 Audio ISO/IEC 13818-4:2004/AMD 2:2005/Cor 2, AAC Conformance w10375 Audio DoC on ISO/IEC 13818-7:2006/DCOR 1, AAC w10376 Audio ISO/IEC 13818-7:2006/Cor. 1, AAC w10377 Audio DoC on ISO/IEC 14496-3:2005/DCOR. 6, AAC w10378 Audio ISO/IEC 14496-3:2005/Cor. 6, AAC w10379 Audio DoC on ISO/IEC 14496-3:2005/AMD 2:2006/DCOR 4, HE-AAC V2 Profile and ALS w10380 Audio ISO/IEC 14496-3:2005/AMD 2:2006/Cor. 4, HE-AAC V2 Profile and ALS w10381 Audio DoC on ISO/IEC 14496-3:2005/AMD 3:2006/ DCOR 2, SLS w10382 Audio ISO/IEC 14496-3:2005/AMD 3:2006/Cor. 2, SLS w10383 Audio DoC on ISO/IEC 14496-3:2005/AMD 9:2008/DCor. 1, AAC-ELD w10384 Audio ISO/IEC 14496-3:2005/AMD 9:2008/Cor. 1, AAC-ELD w10385 Audio ISO/IEC 14496-3:2009/FPDAM 1:200X HD-AAC Profile w10386 Audio Thoughts on MPEG Surround Signaling w10387 Audio ISO/IEC 14496-4:2004/Cor. 6, AAC-LD w10388 Audio ISO/IEC 14496-4:2004/DCOR 7, Removal of Audio and 3DG Conformance w10389 Audio DoC on ISO/IEC 14496-4:2004/AMD13:200x/DCOR 2, AAC-LD bitstreams w10390 Audio ISO/IEC 14496-4:2004/AMD13:200x/Cor. 2, AAC-LD bitstreams w10391 Audio DoC on ISO/IEC 14496-4:2004/PDAM 36, AAC-ELD, OAFI and additional AAC Conformance w10392 Audio DoC on ISO/IEC 14496-5:2001/Amd.10:2007/DCOR 3, ALS and SLS 48 w10393 Audio ISO/IEC 14496-5:2001/Amd.10:2007/COR 3, ALS and SLS w10394 Audio Request for Subdivision of 14496, Audio Conformance w10395 Audio ISO/IEC 14496-26:2009, Audio Conformance w10396 Audio ISO/IEC 14496-26:2009/DCOR 1, ALS and SLS updates w10397 Audio ISO/IEC 14496-26:2009/FPDAM 1, AAC-ELD, OAFI, additional AAC and MPEG 1/2 on MPEG-4 Conformance w10398 Audio WD on additional BSAC conformance streams for broadcasting w10399 Audio DoC on ISO/IEC TR 15938-8:2002/PDAM 4, Extraction of audio features from compressed formats w10400 Convener Terms of reference w10401 Convener MPEG Standards w10402 Convener Unpublished standards at FDIS level w10403 Convener MPEG work plan and time line w10404 Convener MPEG Standard Editors w10405 Convener Schema assets updates w10406 Convener Software assets w10407 Convener Conformance assets w10408 Convener Content assets w10409 Convener URI assets w10410 Convener Standards under development for which a call for patent statements is issued w10411 Convener List of Organisations with which MPEG entertains liaisons w10412 Convener The MPEG Vision w10413 Audio ISO/IEC TR 15938-8:2002/DAM 4, Extraction of audio features from compressed formats w10414 Audio ISO/IEC 23003-1:2007/DCOR 2, Misc. Corrections w10415 Audio ISO/IEC 23003-1:2007/AMD 2:2008/DCOR 1, Ref. Sw. Update w10416 Audio Study on ISO/IEC FCD 23003-2:200x, Spatial Audio Object Coding w10417 Audio Status and Workplan on SAOC Core Experiments w10418 Audio WD2 of USAC w10419 Audio Workplan for USAC CEs w10420 Audio MPEG Reference Encoder and the Audio CE Process w10421 Audio Workplan on MPEG Reference Encoder w10422 Audio Draft Revisions to MPEG Audio CE methodology 49 w10423 Audio Thoughts on Efficient Bitstream Syntax w10424 Convener Response to DRM on MPEG-4 AAC Technology and Profiles w10425 Convener Response to ETSI/EBU/CENELEC JTC on MPEG-4 AAC Technology and Profiles w10426 Convener Response to WorldDMB Forum on MPEG-4 AAC Technology and Profiles w10427 Convener Response to IEC TC100/TA4 on IEC CDV 61937-11 and 60958-3/Amd.1 w10428 Convener Response to AUNB Comments on USAC w10429 Convener Response to AUNB Comments on MetaData w10430 Convener Response to FR, FI and CN NB Comments on USAC w10431 Convener AHG on Audio Standards Maintenance w10432 Convener AHG on SAOC, USAC and MetaData w10433 3DGC Request for Amendment: 14496-27:2009/PDAM2 (SC3DMC Conformance) w10434 Audio Issues concerning frame lengths in the AAC family profiles w10435 Systems DoC on ISO/IEC 13818-1:2007/PDAM4 Transport of MVC w10436 Systems Text of ISO/IEC 13818-1:2007/FPDAM4 Transport of MVC w10437 Systems WD 1.0 of ISO/IEC 13818-1:2007 DCOR X w10438 Systems Text of ISO/IEC 14496-4:2004/FPDAM 37 File Format Conformance Improvements w10439 Systems WD 1.0 of ISO/IEC 14496-11:2002/AMD X New BIFS profile w10440 Systems DoC on ISO/IEC 14496-12:200X/DCOR 2 Usage of brands and box order in sample entry w10441 Systems Text of ISO/IEC 14496-12:200X/COR 2 Usage of brands and box order in sample entry w10442 Systems Study of ISO/IEC 14496-12:200X/FPDAM 1 General Improvements w10443 Systems Text of ISO/IEC 14496-15:2004/COR3 w10444 Systems DoC on ISO/IEC 14496-15:2004/PDAM 3 MVC File Format w10445 Systems Text of ISO/IEC 14496-15:2004/FPDAM 3 MVC File Format w10446 Systems DoC on ISO/EC 14496-20:2008/PDAM 2 Adaptation w10447 Systems Text of ISO/EC 14496-20:2008/FPDAM 2 Adaptation w10448 Systems Workplan for service example of LASeR Adaptation & PSI w10449 Systems Clarification on the usage of ISO/IEC 14496-20 by other standardization bodies w10450 Systems DoC on ISO/IEC 14496-22 FCD 2nd Edition 50 w10451 Systems Text of ISO/IEC 14496-22 FDIS 2nd Edition w10452 Systems Text of ISO/IEC 15938-12:2008 /COR 1 w10453 Systems WD2.0 of ISO/IEC 21000-2 AMD PSI w10454 Systems Draft DoC on ISO/IEC CD 21000-19 Media Value Chain Ontology w10455 Systems Draft Text of ISO/IEC FCD 21000-19 Media Value Chain Ontology w10456 Systems DoC on ISO/IEC 23000-6 PA-AF/PDAM 1 Conformance and Reference Software w10457 Systems Text of ISO/IEC 23000-6 PA-AF/FPDAM 1 Conformance and Reference Software w10458 Systems Workplan for ISO/IEC 23000-6 PA-AF Conformance and Reference Software w10459 Systems Text of ISO/IEC 23000-9:2008/FPDAM1 DMB AF Conf. And Ref. Soft. w10460 Systems Workplan for DMB AF Conf. And Ref. Soft. w10461 Systems Request for ISO/IEC 23000-9:2008/PDAM2 DMB AF Harmonization of MPEG-2 TS storage w10462 Systems Text of ISO/IEC 23000-9:2008/PDAM2 DMB AF Harmonization of MPEG-2 TS storage w10463 Systems DoC on ISO/IEC 23000-10 PDAM1 Video Surveillance Application Format Conf. & Ref. SW. w10464 Systems Text of ISO/IEC 23000-10 FPDAM1 Video Surveillance Application Format Conf. & Ref. SW. w10466 Systems WD 1.0 of ISO/IEC 23000-11/AMD1 Stereoscopic Video Application Format Conf. & Ref. SW. w10467 Systems Text of ISO/IEC 23000-11/DCOR1 (SVAF signalling of voice codecs) w10468 Systems Text of ISO/IEC 23000-12 CD Interactive Music AF w10469 Systems Proposal for new work item w10470 Systems Text of ISO/IEC 23006-1 CD MxM Architecture and Technologies w10471 Systems Text of ISO/IEC 23006-2 CD MXM APIs w10472 Systems Text of ISO/IEC 23006-3 CD MXM Conf. & Ref. SW w10473 Systems Text of ISO/IEC 29116-1 2nd edition MXM Protocols w10474 Systems WD of Architecture w10475 Systems WD of Sensory Information w10476 Systems WD of Avatar Information w10477 Systems WD of Control Information w10478 Systems Ideas on the new AIT project w10479 Systems Ideas on How to Implement Collaboration Between MPEG and ITU-T 51 Q.13/SG16 on the Advanced IPTV Terminal Standardisation w10480 Systems MPEG Schema Assets Updates w10481 Systems MPEG URIs and MIME Types w10482 Convener Liaison statement to W3C on MXM w10483 Convener Liaison statement to WG 1 on PA-AF w10484 Convener Liaison statement to ITU-T SG16 on IPTV w10485 Convener Liaison statement to OMA BCAST on ISO/IEC 14496-20 w10486 Convener Liaison statement to IEC TC 100 on HD Recorder/Receiver Interface w10487 Convener Liaison statement to IEC TC 100 on IP & TS based service acess w10488 Convener Liaison statement to IEC TC 100 on digital right permission code w10489 Convener Liaison statement to JTC 1 Study Group on Sensor Network w10490 Convener Liaison statement to ISO TC 223 on Video Surveillance w10491 Convener Liaison statement to FNB on Informal workshop on Video Surveillance w10492 Convener Liaison statement to SC27 on new work item regarding digital evidence w10493 Convener Liaison statement to EDItEUR on MVCO w10494 Convener Liaison statement to IPFI on MVCO w10495 Convener Liaison statement to DOI on MVCO w10496 Requirements MPEG Modern Transport (MMT) over Networks w10497 Requirements Draft Advanced IPTV Terminal (AIT) Requirements w10498 Requirements Requirements for MPEG-V Version 3.2 w10499 Systems Text of ISO/IEC 14496-12:2008/DCOR 3 w10500 Systems Request for ISO/IEC 14496-14:2003/PDAM1 Handling of MPEG-4 Audio Enhancement Layers w10501 Systems Text of ISO/IEC 14496-14:2003/PDAM1 Handling of MPEG-4 Audio Enhancement Layers w10502 Convener Liaison Statement to SMPTE 23B/Container on Package Formats w10503 Requirements Requirements v2.0 for a new BIFS profile to support Interactive Digital Radio w10504 Convener Liaison statement to WorldDMB on new BIFS profile w10505 Convener Liaison statement to GRN on new BIFS profile w10506 Convener Liaison statement to TTA on new BIFS profile w10507 Convener List of identified non MPEG members to be allowed to access MPEG SVN repository w10508 Convener AHG on Scene Representation 52 w10509 Convener AHG on MPEG File Formats w10510 Convener AHG on Application Format w10511 Convener AHG on MVCO - ignore w10512 Convener AHG on MPEG-V (including previous RoSE activities) w10513 Convener AHG on Font Format Representation w10514 Convener AHG on Advanced IPTV Terminal w10515 Convener AHG on MXM w10516 Convener Liaison Statement: frame lengths in the AAC family profiles 53 Annex E – Requirements report Source: Jörn Ostermann (Leibniz Universität Hannover) 1 Requirements documents approved at this meeting No. 10357 10358 10359 10360 10361 10362 10363 10496 10497 10498 10503 2 Title Vision on 3D Video Coding Applications and Requirements of 3D Video Coding Call for 3D Test Material: Depth Maps & Supplementary Information Description of Exploration Experiments in 3D Video Coding Vision and Requirements for High-Performance Video Coding (HVC) Call for Test Materials for High-Performance Video Coding Standardisation Draft Call for Evidence on High-Performance Video Coding MPEG Modern Transport (MMT) over Networks Draft Advanced IPTV Terminal (AIT) Requirements Requirements for MPEG-V Version 3.2 Requirements v2.0 for a new BIFS profile to support Interactive Digital Radio Multiview Video Coding (MVC) In response to resolution 3.5.4 of the 86th WG11 meeting, MPEG received input from three National Bodies requesting support for interlaced video in multiview video coding. Since companies also promised to provide software and to support the necessary experiments, a WD for a new profile MVC interlaced was started. 3 MPEG-V: Information exchange with virtual worlds In response to the Call for Proposals (N10239), MPEG received four contributions. One contribution provided significant technical detail to start the work. There was no input related to command and control for MPEG-V. However, a new call for proposals is not necessary and MPEG started defining the working drafts of the four parts of MPEG-V. Requirements for MPEG-V Version 3.2 were established (N10498). 4 MAF The main goal of MAFs is to define the collection of MPEG standards that enables the deployment of an application. As such, a MAF needs to be focused, should show a demonstration of the application, requires continuous input over the development time of the MAF and strong industry support. The MAF Overview document (N10233/N10234) has not been updated. After more than two years of inactivity, MPEG received again input on the topic of Advanced Surveillance AF. Input document M16159 Extended template for the Advanced Surveillance AF with more than 20 authors from mainly academic organisations provides clarified requirements with reference to an old version of the MAF Overview document. Technology from MPEG-4, MPEG-7, and MPEG-21 is required. The proposers are encouraged to show a demonstration and more support from industry. 54 5 Video Signature Tools In response to the call for video signature tools, MPEG received satisfactory input from five organisations. The core experiment process was started by the video group. 6 Explorations 6.1 Exploration on MPEG User Interface Framework The response to the call for proposals N10232 is expected for the 88th meeting. 6.2 Interactive Radio Requirements for Interactive Radio were updated as captured in N10503 Requirements v2.0 for a new BIFS profile to support Interactive Digital Radio. Furthermore, input for Requirements M16179 Comments on requirements for a new BIFS profile was processed. It appears that there was sufficient technology available to MPEG such that there was no need to issue a Call for Proposals at the 87th meeting. Instead, a working draft (N10439) was issued. 6.3 3D Video Coding Following the vision to enable both advanced stereoscopic display processing and improved support for auto-stereoscopic N-view displays as outlined in N10357 Vision on 3D Video Coding, MPEG issued N10359 Call for 3D Test Material Depth Maps & Supplementary Information and N10360 Description of Exploration Experiments in 3D Video Coding. Work on applications and requirements for 3D Video coding started (N10358). Applications are to be defined at the 88th meeting. The vision does not outline a time line since the time line depends on the technology available to MPEG. 6.4 High Performance Video Coding (HVC) HVC targets mobile services, IPTV, and Ultra High Definition (UHD) displays with a focus on coding efficiency considering codec complexity as well. The current target is to increase coding efficiency by 25% at low complexity and 50% at full complexity. MPEG foresees that the reduction of complexity will be achieved by turning off some tools required to reach the full performance in terms of coding efficiency. Another Call for high quality test material (N10362) and a Draft call for evidence on high performance video coding (N10363) was issued. Evaluation will take place prior to the 89th meeting. Draft requirements are captured in Vision and Requirements for High-Performance Video Coding (HVC) (N10361) which was further developed using input from M16207 Requirements fir highperformance video standards. 55 Bit Rate Simulcast 3DV should be compatible with: • existing standards • mono and stereo devices • existing or planned infrastructure MVC 3DV 2D+Depth 2D 3D Rendering Capability Figure 1: Envisioned performance of 3D Video Coding with respect so existing solutions. Figure 2: Use case for Ultra High Definition Displays targeted by HVC. 6.5 Audio coding for HVC With the use of ultra high definition displays, the appropriate audio environment has to be considered. Shared (Figure 2) and individual UHD (Figure 3)experiences with a viewing distance of 50 cm should be considered. The audio group is encouraged to evaluate current standards and their suitability for HVC which will require the precise localisation of sound sources by the listeners and might have to consider the current head position of the listener as well. 56 Figure 3: Displays with associated speakers 6.6 Advanced IPTV Terminal Based on M16188 Proposal of Advanced IPTV Terminal (AIT) requirements, N10497 Draft Advanced IPTV Terminal (AIT) Requirements was developed. It is envisioned that a joint project with ITU-T Q13./SG16 could be started to define the architecture and technologies for an IPTV terminal. 6.7 Modern MPEG Transport Currently, MPEG standardizes bitstreams and IETF provides protocols for their transport. While MPEG develops standards having error resilience and IP-networks in mind, a joint optimization of coding and transport for IP networks is not done jointly by experts of MPEG and IETF. According to N10496 MPEG Modern Transport (MMT) over Networks there is a need for a transport and file format friendly stream format. Error resilience of current MPEG streams might not be optimal. The potential gains of joint optimization of coding and transport are not known. Conversions between different transport mechanisms like from MPEG-2 Transport Stream to MPEG Program Stream are not straight forward or defined. Furthermore, MPEG does not provide any hint on how to adapt content to different networks. An extension and review of the issues is sought in order to determine a potential need for contributions of MPEG and an adaptation of its internal standards development process. 57 Annex F – Systems report Source: Young-Kwon Lim, Chair 1 List of output documents The main outputs of the meeting from the Systems Sub-group perspective are: No. Title X 10435 10436 10437 13818-1 MPEG-2 Systems DoC on ISO/IEC 13818-1:2007/PDAM4 Transport of MVC Text of ISO/IEC 13818-1:2007/FPDAM4 Transport of MVC WD 1.0 of ISO/IEC 13818-1:2007 DCOR X 14496-4 Conformance Text of ISO/IEC 14496-4:2004/FPDAM 37 File Format Conformance Improvements 14496-11 BIFS WD 1.0 of ISO/IEC 14496-11:2002/AMD X New BIFS profile 14496-12 ISO File Format DoC on ISO/IEC 14496-12:200X/DCOR 2 Usage of brands and box order in sample entry Text of ISO/IEC 14496-12:200X/COR 2 Usage of brands and box order in sample entry Study of ISO/IEC 14496-12:200X/FPDAM 1 General Improvements 10438 X 10439 X 10440 10441 10442 10499 Text of ISO/IEC 14496-12:2008/DCOR 3 14496-14 MP4 File Format 10500 Request for ISO/IEC 14496-14:2003/PDAM1 Handling of MPEG-4 Audio enhancement layers 10501 Text of ISO/IEC 14496-14:2003/PDAM1 Handling of MPEG-4 Audio enhancement layers 10443 10444 14496-15 AVC File Format Text of ISO/IEC 14496-15:2004/COR3 DoC on ISO/IEC 14496-15:2004/PDAM 3 MVC File Format 10445 Text of ISO/IEC 14496-15:2004/FPDAM 3 MVC File Format X 14496-20 LASeR 10446 10447 10448 10449 DoC on ISO/EC 14496-20:2008/PDAM 2 Adaptation Text of ISO/EC 14496-20:2008/FPDAM 2 Adaptation Workplan for service example of LASeR Adaptation & PSI Clarification on the usage of ISO/IEC 14496-20 by other standardization bodies X 14496-22 Open Font Format 10450 DoC on ISO/IEC FCD 14496-22 2nd Edition 10451 Text of ISO/IEC FDIS 14496-22 2nd Edition X Available No No No 06/02/09 20/02/09 06/02/09 No 06/02/09 No 06/02/09 No 06/02/09 No 06/02/09 No No 23/02/09 06/02/09 No 06/02/09 No 06/02/09 No No 06/02/09 06/02/09 No 23/02/09 No Yes No Yes 06/02/09 27/02/09 06/02/09 06/02/09 No No 06/02/09 27/03/09 No 06/02/09 No 06/03/09 No No 06/02/09 06/03/09 15938-12 MPEG Query Format 10452 Text of ISO/IEC 15938-12:2008 /COR 1 X 10453 X 10454 10455 X TBP 21000-2 Digital Item Declaration WD2.0 of ISO/IEC 21000-2 AMD PSI 21000-19 Media Value Chain Ontology Draft DoC on ISO/IEC CD 21000-19 Media Value Chain Ontology Draft Text of ISO/IEC FCD 21000-19 Media Value Chain Ontology 23000-6 Professional Archival Application Format 58 10456 DoC on ISO/IEC 23000-6 PA-AF/PDAM 1 Conformance and Reference Software 10457 Text of ISO/IEC 23000-6 PA-AF/FPDAM 1 Conformance and Reference Software No 06/02/09 No 06/02/09 10458 X 10459 10460 10461 No 06/02/09 No No No 06/02/09 06/02/09 06/02/09 No 06/02/09 No 06/02/09 No 06/16/09 No 06/02/09 No 06/02/09 No 06/02/09 No No No No 06/02/09 27/02/09 27/02/09 27/02/09 No No No No No 06/02/09 20/03/09 20/03/09 20/03/09 06/02/09 No 27/03/09 Ideas on the new project Ideas on How to Implement Collaboration Between MPEG and ITU-T Q.13/SG16 on the Advanced IPTV Terminal Standardisation Assets and Standing Documents MPEG Schema Assets Updates MPEG URIs and MIME Types Liaison No No 06/02/09 06/02/09 No No 06/02/09 06/02/09 Liaison statement to W3C on MXM Liaison statement to WG 1 on PA-AF Liaison statement to ITU-T SG16 on IPTV Liaison statement to OMA BCAST on ISO/IEC 14496-20 Liaison statement to IEC TC 100 on HD Recorder/Receiver Interface Liaison statement to IEC TC 100 on IP & TS based service acess Liaison statement to IEC TC 100 on digital right permission code Liaison statement to JTC 1 Study Group on Sensor Network Liaison statement to ISO TC 223 on Video Surveillance No No No No No 06/02/09 06/02/09 06/02/09 06/02/09 06/02/09 No No No No 06/02/09 06/02/09 06/02/09 06/02/09 10462 X 10463 10464 X 10466 10467 X 10468 X 10474 10475 10476 10477 X 10469 10470 10471 10472 10507 X 10473 X 10478 10479 X 10405 10409 X 10482 10483 10484 10485 10486 10487 10488 10489 10490 Workplan for ISO/IEC 23000-6 PA-AF Conformance and Reference Software 23000-9 Digital Multimedia Broadcasting Application Format Text of ISO/IEC 23000-9:2008/FPDAM1 DMB AF Conf. And Ref. Soft. Workplan for ISO/IEC 23000-9:2008/FPDAM1 DMB AF Conf. And Ref. Soft. Request for ISO/IEC 23000-9:2008/PDAM2 DMB AF Harmonization of MPEG-2 TS storage Text of ISO/IEC 23000-9:2008/PDAM2 DMB AF Harmonization of MPEG-2 TS storage 23000-10 Video Surveillance Application Format DoC on ISO/IEC 23000-10 PDAM1 Video Surveillance Application Format Cof. & Ref. SW. Text of ISO/IEC 23000-10 FPDAM1 Video Surveillance Application Format Cof. & Ref. SW. 23000-11 Stereoscopic Video Application Format WD 1.0 of ISO/IEC 23000-11/AMD1 Stereoscopic Video Application Format Conf. & Ref. SW. Text of ISO/IEC 23000-11/DCOR1 SVAF signalling of voice codecs 23000-12 Interactive Music AF Text of ISO/IEC CD 23000-12 Interactive Music AF 23005 – MPEG-V WD of Architecture WD of Sensory Information WD of Avatar Information WD of Control Information 23006 – MPEG eXtensible Middleware Proposal for new work item Text of ISO/IEC CD 23006-1 MxM Architecture and Technologies Text of ISO/IEC CD 23006-1 MXM APIs Text of ISO/IEC CD 23006-1 MXM Conf. & Ref. SW List of identified non MPEG members to be allowed to access MPEG SVN repository Supplemental Media Technologies – MPEG eXtensible Middleware Text of ISO/IEC CD 29116-1 2nd edition MXM Protocols Exploration – Advanced IPTV Terminal 59 10491 10492 10493 10494 10495 10502 10504 10505 10506 Liaison statement to FNB on Informal workshop on Video Surveillance Liaison statement to SC27 on new work item regarding digital evidence Liaison statement to EDItEUR on MVCO Liaison statement to IPFI on MVCO Liaison statement to DOI on MVCO Liaison statement to SMPTE 23B/Container on package formats Liaison statement to WorldDMB on new BIFS profile Liaison statement to GRN on new BIFS profile Liaison statement to TTA on new BIFS profile 60 No 06/02/09 No 06/02/09 No No No No No No No 06/02/09 06/02/09 06/02/09 06/02/09 06/02/09 06/02/09 06/02/09 2 General issues 2.1 List of standards under development Pr Pt 2 1 4 1 4 4 4 4 4 4 4 4 4 4 4 7 21 A A A A A A A A A A B E M M M V V V V Edit. Project 2007 AMD4 200x AMD4 4 2004 AMD37 5 2007 AMDxx 5 2007 AMDxx 5 2007 AMD23 12 200x AMD1 12 200x COR2 15 200x COR3 15 200x AMD3 20 200X AMD2 20 200X AMD3 22 2008 2nd Ed. 12 2008 COR1. 19 200x 1st Ed. 4 200x AMD2 5 200x 2nd Ed. 6 200x AMD1 8 200x AMD1 9 200x AMD1 9 200x AMD2 10 200x AMD1 11 200x 1st Ed. 11 200x COR1 12 200x 1st Ed. 2 200x AMD1 1 200x 2nd Ed. 8 200x 1st Ed. 1 200x 1st ed. 2 200x 200x 3 200x 200x 1 200x 200x 2 200x 200x 3 200x 200x 4 200x 200x Description Transport of MVC -RA -Use of LASeR in MPEG-2 & MPEG-4 Systems File Format Conf. AVC File Format Ref. Soft SVC File Format Ref. Soft Synth. Texture Ref. Soft Misc. Addition to FF Brands & box orders Minor corrections to AVC FF MVC File Format Scene Adaptation PSI Open Font Format MPQF minor corrections Media Value Chain Onto. Prot. MSSAF Conf. & Soft MS AF PA-AF Conf. & Ref. SW PVP AF Soft. And Conf. DMB AF Soft. And Conf. DMB AF MPEG-2 Storage VS Conf. & Ref. SW SVAF Ref. Soft. And Conf. SVAF Voice codec signaling Interactive Music AF FRU Ref. Soft. And Conf. MXM Protocols Ref. Soft. and Conformance MxM Architecture MxM APIs MxM Conf. & Ref. SW Architecture Sensory Information Avatar Information Control Interface 61 CfP WD CD FCD FDIS 08/10 09/02 09/07 07/10 08/04 08/10 TBS TBS 08/04 08/04 08/10 08/07 08/04 08/10 08/07 08/07 08/10 08/01 08/01 08/07 08/10 08/04 08/01 09/02 08/07 09/02 09/07 08/10 09/04 08/10 09/04 09/02 09/02 09/02 09/07 09/02 09/07 09/02 09/07 08/07 09/02 08/07 09/02 09/02 09/07 08/10 09/04 09/04 09/10 09/02 09/07 TBS 09/02 09/07 10/01 08/07 09/02 09/07 08/07 09/02 09/07 09/02 09/02 09/07 10/01 09/02 09/07 10/01 07/01 08/07 08/07 08/07 09/02 09/02 09/02 09/02 09/02 07/07 09/02 09/02 09/02 09/07 09/07 09/07 09/07 09/10 08/04 09/10 09/10 09/10 09/10 09/10 09/10 09/10 10/04 08/10 10/04 10/04 10/04 10/04 10/04 10/04 10/04 2.2 Standing Documents Pr Pt Documents 1 1 MPEG-1 White Paper – Multiplex Format 1 1 MPEG-1 White Paper – Terminal Architecture 1 1 MPEG-1 White Paper – Multiplexing and Synchronization 2 1 MPEG-2 White Paper – Multiplex Format 2 1 MPEG-2 White Paper – Terminal Architecture 2 1 MPEG-2 White Paper – Multiplexing and Synchronization 2 11 MPEG-2 White Paper – MPEG-2 IPMP 4 1 MPEG-4 White Paper – MPEG-4 Systems 4 1 MPEG-4 White Paper – Terminal Architecture 4 1 MPEG-4 White Paper – M4MuX 4 1 MPEG-4 White Paper – OCI 4 6 MPEG-4 White Paper – DMIF 4 11 MPEG-4 White Paper – BIFS 4 12 MPEG-4 White Paper – ISO File Format 4 14 MPEG-4 White Paper – MP4 File Format 4 15 MPEG-4 White Paper – AVC FF 4 4 4 4 4 4 4 7 7 21 A A A B E E E No. Meeting N7675 05/07 Nice N7676 05/07 Nice N7677 05/07 Nice N7678 05/07 Nice N7679 05/07 Nice N7680 05/07 Nice N7503 N7504 N7610 N7921 N8148 N8149 N7608 N8150 N7923 N7924 05/07 Poznan 05/07 Poznan 05/10 Nice 06/01 Bangkok 06/04 Montreux 06/04 Montreux 05/10 Nice 06/04 Montreux 06/01 Bangkok 06/01 Bangkok 13 13 17 18 20 White Paper on MPEG-4 IPMP MPEG IPMP Extensions Overview White Paper on Streaming Text White Paper on Font Compression and Streaming Presentation Material on LASER N7505 N6338 N7515 N7508 N6969 20 22 1 1 9 White Paper on LASeR White Paper on Open Font Format MPEG-7 White Paper - MPEG-7 Systems MPEG-7 White Paper – Terminal Architecture MPEG-21 White Paper – MPEG-21 File Format N7507 N7519 N7509 N8151 N7925 05/07 Poznan 04/03 München 05/07 Poznan 05/07 Poznan 05/01 HongKong 05/07 Poznan 05/07 Poznan 05/07 Poznan 06/04 Montreux 06/01 Bangkok X X X X X MPEG Application Format Overview MAF Overview Document MAF Overview Presentation MPEG-B White Paper – BinXML MPEG Multimedia Middleware Context and Objectives 1rst M3W White paper 2nd M3W White Paper : Architecture N9421 N9840 N9841 N7922 N6335 07/10 Shenzhen 08/04 Archamps 08/04 Archamps 06/01 Bangkok 04/03 München X X 62 N7510 05/07 Poznan N8152 06/04 Montreux E X Tutorial on M3W E X E X X X M3W White Paper : Multimedia Middleware Architecture M3W White Paper : Multimedia API M3W White Paper : Component Model M3W White Paper : Resource and Quality Management M3W White Paper : Component Download M3W White Paper : Fault Management M3W White Paper : System Integrity Management E E E E E X X X 63 N8153 06/04 Monreux N8687 06/10 Hanzhou N8688 06/10 Hanzhou N8689 06/10 Hanzhou N8690 06/10 Hanzhou N8691 06/10 Hanzhou N8692 06/10 Hanzhou N8693 06/10 Hanzhou 2.3 Mailing Lists Reminder Topic General Systems List File Format LASeR MAF ISO File Format Transport AIT Metaverse RoSE MXM Information Reflector : gen-sys@lists.uni-klu.ac.at Subscribe: http://lists.uniklu.ac.at/mailman/listinfo/gen-sys Archive: http://lists.uniklu.ac.at/mailman/private/gen-sys/ Reflector : mp4-sys@lists.uni-klu.ac.at Subscribe: http://lists.uniklu.ac.at/mailman/listinfo/mp4-sys Archive: http://lists.uniklu.ac.at/mailman/private/mp4-sys/ Reflector : mpeg-laser@lists.uni-klu.ac.at Subscribe: http://lists.uniklu.ac.at/mailman/listinfo/mpeg-laser Archive: http://lists.uni-klu.ac.at/pipermail/mpeglaser/ Reflector : maf-sys@lists.uni-klu.ac.at Subscribe: http://lists.uniklu.ac.at/mailman/listinfo/maf-sys Archive: http://lists.uniklu.ac.at/mailman/private/maf-sys/ Reflector: isoff-transport@lists.uni-klu.ac.at Subscribe: http://lists.uniklu.ac.at/mailman/listinfo/isoff-transport Archive: http://lists.uniklu.ac.at/mailman/private/isoff-transport/ Reflector: jiptv@lists.uni-klu.ac.at Subscribe: http://lists.uniklu.ac.at/mailman/listinfo/jiptv Archive: http://lists.uniklu.ac.at/mailman/private/jiptv/ Reflector: metaverse@lists.uni-klu.ac.at Subscribe: http://lists.uniklu.ac.at/mailman/listinfo/metaverse Archive: http://lists.uniklu.ac.at/mailman/private/metaverse/ Reflector: rose@lists.uni-klu.ac.at Subscribe: http://lists.uniklu.ac.at/mailman/listinfo/rose Archive: http://lists.uniklu.ac.at/mailman/private/rose/ Reflector: mxm@lists.uni-klu.ac.at Subscribe: http://lists.uniklu.ac.at/mailman/listinfo/mxm Archive: http://lists.uniklu.ac.at/mailman/listinfo/mxm 64 Kindly Hosted by Klagenfurt University Klagenfurt University Klagenfurt University Klagenfurt University Klagenfurt University Klagenfurt University Klagenfurt University Klagenfurt University Klagenfurt University 2.4 2.4.1 Session General General Technical Issues Contributions Numbe Title r m1595 Ad Hoc Group on Scene Representation 5 General m1595 7 Ad Hoc Group on Application Format General m1595 1 m1595 8 Ad Hoc Group on MxM m1595 4 m1595 9 m1595 6 m1596 0 m1602 9 m1597 0 m1597 2 m1597 5 m1600 7 M1622 3 Ad Hoc Group on Information Exchange with Virtual Worlds Ad Hoc Group on Advanced IPTV Terminal Ad Hoc Group on MPEG File Formats General General General General General General General General General General General General General M1622 6 M1622 7 Ad Hoc Group on the RoSE Framework Ad Hoc Group on Font Format Representation Liaison Statement from IEC TC 100 [SC 29 N 10025] Table of Replies on ISO/IEC FDIS 230008 [SC 29 N 9910] Table of Replies on ISO/IEC 210007:2007/FDAM 1 [SC 29 N 9912] IEC CDV 62546 [SC 29 N 9917] (HD Recorder/receiver interface) IEC CD 62455 [SC 29 N 9960] (IP & TS based service access) Liaison Statement from JTC 1 Study Group on Sensor Network to SC 29 Liaison Statement from ISO TC 223 (Video surveillance) Informal Workshop on Video Surveillance Authors Young-Kwon Lim, Jaeyeon Song, Cyril Concolato Kyuheon Kim, Hui Yong Kim, Jean Cha, Noboru Harada, Hendry Filippo Chiariglione, Marius Preda Sanghyun Joo, Jean Gelissen, Christian Timmerer Jean Gelissen, Marius Preda, Keiji Mitsubuchi Xin Wang, Young Kwon Lim David Singer Vladimir Levantovsky IEC TC 100 via SC 29 Secretariat ITTF via SC 29 Secretariat ITTF via SC 29 Secretariat IEC TC 100 IEC TC 100 JTC1 SGSN TC 223 FNB m15955, m15957, m15951, m15958, m15954, m15959, m15956, m15960 All AHG reports are reviewed. No specific comments are made. m16007 Liaison informing CD of IEC 62455: Internet protocol (IP) and transport stream (TS) based service access (TA1). Reply to thank you for the information and to provide MXM standards. 65 m16029 Liaison informing the publication of the new IEC International Standard 62227 (IEC6222) Digital Rights Permission Code. Reply to thank you and to provide information about REL. m15975 Liaison informing CD of IEC 62546: HD Recording Link Guidelines. Reply to thank you. m16223 Liaison requesting contributions from us with information on your current work (including the scope of your current projects) and the potential new areas for standardization related to sensor networks in our field. Reply to inform the context and objectives of MPEG-V and RoSE. m16226 Liaison informing the proposed new work item regarding video surveillance format interoperability. Reply to inform ISO base file format and Video Surveillance Application Format m16227 Liaison to invite to a Workshop on Digital Videosurveillance Format. Reply to thank you. 2.4.2 Others 2.5 Demonstrations None. 2.6 FAQ The FAQ were updated as needed. 2.7 AOB None. 66 3 3.1 MPEG-2 Systems (13818-1) 13818-1:2007 Amd.3 Carriage of SVC 3.1.1 Topics 1. 3.1.2 Sessio n Genera l Transport of Scalable Video Coding Contributions Numbe Title r m1598 Table of Replies on ISO/IEC 138185 1:2007/FDAM 3 Authors ITTF via SC 29 Secretariat m15985 ISO/IEC 13818-1:2007 / FDAM3 is approved. . Work Completed 3.2 13818-1:2007 Amd.4 Transport of MVC 3.2.1 Topics 1. 3.2.2 Sessio n Genera l Transport of Multiview Video Coding Contributions Numbe Title r m1598 Summary of Voting on ISO/IEC 138188 1:2007/PDAM 4 Authors SC 29 Secretariat m15988 One disapproval with comments from USNB. Technical Work in Progress. 3.3 13818-1:2007 Cor.X 3.3.1 Topics 1. 3.3.2 Sessio n Genera l Potential Corrigendum Contributions Numbe Title r m1611 Potential Corrigendum Items for MPEG-2 1 Systems 67 Authors Tomoo Yamakage m16111 This contribution proposes two potential corrigendum items regarding removal rate from transport buffer and unit of VBV buffers size. Agreed to start a working draft at this meeting Technical Work in Progress. 4 4.1 MPEG-4 Conformance (14496-4) 14496-4 Amd.37 File Format Conformance Improvements 4.1.1 Topics 1. 4.1.2 Sessio n FF File Format Conformance Contributions Numbe Title r m1599 Summary of Voting on ISO/IEC 144960 4:2004/PDAM 37 Authors SC 29 Secretariat m15990 All approved. Technical Work in Progress. 5 5.1 MPEG-4 BIFS (14496-11) 14496-11/AMD X Digital Radio Profile 5.1.1 Topics 1. 5.1.2 Sessio n Scene Scene Scene Scene Scene Digital Radio Profile Contributions Numbe r m1619 4 m1600 5 m1621 7 m1617 9 m1605 Title Authors Liaison Statement from the World DMB Forum Liaison Statement from TTA [SC 29 N 9959] Liaison Statement from GRN Consortium Comments on requirements for a new BIFS profile Novel approaches to remote display 68 World DMB Forum via SC 29 Secretariat TTA via SC 29 Secretariat GRN Consortium via SC 29 Secretariat Cyril Concolato Jean Le Feuvre Benoit Pellan Mihai MITREA 8 representations: BiFS-based solution and its deployment within the FP7 MobiThin project Pieter SIMOENS Bojan JOVESKI Bert VANKEIRSBILCK Abdeslam TAGUENGAYTE Françoise PRETEUX m16058 This contribute introduces preliminary results of converting X11 commands to BIFS commands. This can be used to people can access the screen of their computer from mobile phone if the computer uses X11 commands. This is part of EU FP7 projects. m16179 This contribution reports draft analysis on the list technologies to fulfill the requirement prepared at the Busan meeting to support Interactive Digital Radio applications. No solutions to support focus navigation, caching and scene state management are found. Probably this might need verification experiment to measure required complexity to implement. m16194 This liaison proposes addition requirements to new BIFS profile under consideration as follows: no modification to underlying delivery layers supporting all requirements that aim at extending the MPEG-4 BIFS Core2D@Level1 functionalities to either improve BIFS coding efficiency or minimize the use of coded images (JPEG or PNG) by replacing them with graphics elements introducing mechanisms allowing the integration of DAB applications Agreed to reply with the latest requirement document and working draft containing proposed new BIFS profile. m16005 This liaison supports creation of new BIFS profile and propose continuous information exchange. Agreed to reply with the latest requirement document and working draft containing proposed new BIFS profile. m16217 This liaison supports creation of new BIFS profile and the list of requirements identified. Agreed to reply with the latest requirement document and working draft containing proposed new BIFS profile Technical Work In Progress. 6 6.1 MPEG-4 ISO Base File Format (14496-12) 14496-12/COR 2 Usage of brands and box order in sample entry 6.1.1 Topics 1. 6.1.2 Sessio n Corrigendum Items Contributions Numbe r Title Authors 69 FF m1600 1 Summary of Voting on ISO/IEC 1449612:200X/DCOR 2 & ISO/IEC 1544412:200X/DCOR 2 SC 29 Secretariat m16001 The comment is indeed spot on; we clarified this in an amendment to ISO2. However, it did change reader behavior. We feel that some review and comment on this correction should be permitted, so we will issue a new Cor using the proposed text, and inviting comment Technical Work In Progress. 6.2 14496-12:200X/AMD 1 General improvements 6.2.1 Topics 1. 6.2.2 Sessio n FF FF FF FF FF Amendment Contributions Numbe r m1618 1 m1618 2 m1611 5 m1608 9 m1622 5 Title Authors Errata report for 14496-12:2005 (ISO Base Media File Format) On Movie Fragments, Edit Lists, and other timing questions, for 14496-12 (ISO Base Media File Format) On FPDAM1 of ISO Base Media File Format Comments on 14496-12:200X FPDAM1 David Singer Miscellaneous comments on ISO/IEC 14496-12:2008 FPDAM1 Miska M. Hannuksela Stefan Döhla David Singer Miska M. Hannuksela Teruhiko Suzuki m16181 Editors to integrate into DCOR, or work with the secretariat, as appropriate. m16182 Curious. Do we prefer replacement edit lists, which allow the writer to simplify life for the reader (no calculations to see if edits should be merged, the writer did it), but also raise the possibility of ‘re-defining the past’, or elf’s, which have the opposite characteristics? There are questions of complexity here, of course. To think about, more comment welcome. m16115 Oh dear. We have never said whether track references are always ‘strong’ (though most, if not all, are today). We might want to clarify the status of references while we are at it. Is a referencing track still useful if the target is removed? Do we need a better mechanism to indicate track semantic grouping? Perhaps a track-grouping box, adjacent to track-reference, that has a grouping-type and a group-id, and/or a track pointer? Perhaps also a strong/weak grouping indicator? We’ll put something in the study and encourage NB review. The startup sample group seems cool on first look. It could do with an example worked through, and we like the idea of using the grouping_parameter to indicate multiple different startup rolls. Does this go in the base spec. or the AVC/SVC/MVC spec.? Roll sample groups are part 12 – but they work for audio, these don’t. Into the study. 70 m16089 Yes, we should restrict ctts version 1 to iso4-branded files, in the annex and probably in the ctts section. The tutorial section should be updated “if you want composition to start at time 0, you can…”. Thank you. m16225 “There shall be at most one of each of …”. “If it is expected that the RTP … will be used…”. OK, editors to integrate in the study. Technical Work in Progress. 7 7.1 MPEG-4 File Format (14496-14) General 7.1.1 Topics 1. 7.1.2 Sessio n FF New proposal Contributions Numbe Title r m1605 Scalable Audio and MP4 4 Authors Stefan Doehla m16054 We need an amendment to Part14 to introduce the codec (sample entry) type ‘m4ae’ (MPEG-4 Audio Extension), and document its relationship with ‘dpnd’ track dependencies. Mr Döhla to supply the PDAM text and the request for amendment. Supporting NBs: Germany, France, Sweden, Singapore, USA. 7.1.3 Others Thoughts on MPEG Surround signaling (m16117) This was handled in Audio, joint session. We agreed on the direction and Audio will deal with it in the audio specs. No action here. 8 MPEG-4 AVC Base File Format (14496-15) 71 8.1 14496-15:2004/COR.3 8.1.1 Topics 2. Corrigendum 8.1.2 Sessio n FF Contributions Numbe Title r m1596 Summary of Voting on ISO/IEC 144967 15:2004/DCOR 3 [SC 29 N 9880] Authors SC 29 Secretariat m15967 All approved, thank you. 8.2 14496-15:2004/AMD 3 8.2.1 Topics 1. 8.2.2 Sessio n FF FF FF FF MVC File Format Contributions Numbe r m1599 3 m1610 2 m1611 4 m1618 0 Title Authors Summary of Voting on ISO/IEC 1449615:2004/PDAM 3 Updates to the MVC File Format On MVC File Format On the MVC File format (14496-15 amendment) SC 29 Secretariat Zhuangfei Wu Per Fröjdh Miska M. Hannuksela Ying Chen David Singer m15993 Thank you for the comments. We are concerned, but do not know how to resolve, about the question of track_ids. Some comments are not completely resolved to our satisfaction; we may need to revisit some of these issues. m16102 Thank you. We wonder about the view priority: only sample entry (now), both entry and group box (as resolved), only the multiview group box (possible), a separate sample group (as proposed) – or solve the whole problem with timed meta-data (like SVC)? We allow it near view identifier for now. The simplified view information is nice, but we had a lot of discussion on whether “adjacent” is always well defined, and so on. (We prefer 0=undefined, 1=left, 2=right, 3=ordered linearly but left/right not well defined.) We wonder if we need both local and global simple view ordering information; we keep the global but delete the local. We wonder if we are clear that lines are straight; what about (inflexion-point-free curved) arcs? We will include that case also. m16114 On the editorial and alignment material, accepted, thank you. 72 On the priority question, this is a-temporal, which worries us, but related to groups rather than views, which is attractive. The question above on the whole treatment of priority also applies. For now, we don’t put in the paragraph on ordering multiview groups. We remove view association. On the view relations, we might need a ‘differs in geometry differentiating code point’ (geom), e.g. that group is planar and this one is spherical. We put these code-points in, but note that it provides some overlap with the information in the global view information (which, for now, we retain). We agree to delete the local supplementary view information completely (F.6.3.1.5 to 9). The idea of ordering of views is interesting, but the MVCG doesn’t strictly list views, but tiers. To consider further. Thank you for the disparity improvements. m16180 Thank you. We will attempt to write a clear introduction, and a failure would indicate that we need to simplify more! 9 LASeR (14496-20) 9.1 14496-20:200X/AMD 2 Adaptation 9.1.1 Topics 1. 9.1.2 Sessio n Scene Scene Scene Adaptation Contributions Numbe Title r m1596 Summary of Voting on ISO/IEC 144968 20:200X 2nd Edition/PDAM 2 [SC29 N 9881] m1621 Late KNB comment on 14496-20 PDAM2 3 m1607 Study text on 14496-20 PDAM2 3 Scene m1607 4 Improvement of parsingSwitch on 14496-20 PDAM2 Scene m1607 5 Service Scenario examples on 14496-20 PDAM2 Scene m1608 5 m1623 0 Comments on LASeR PDAM2 Scene FNB Comments on LASeR PDAM2 m15968 Approved without comments 73 Authors SC 29 Secretariat Korea National Body Seo-Young Hwang Jaeyeon Song Young-Kwon Lim Seo-Young Hwang Jaeyeon Song Young-Kwon Lim Seo-Young Hwang Jaeyeon Song Young-Kwon Lim jean Le Feuvre Cyril Concolato jean Le Feuvre Cyril Concolato m16213 Late KNB comments asking to accept the changes made in Study text from Busan meeting and proposed technical modifications in m16073 and 16074 m16073 This contribution proposed solutions to several open questions Proposed to add new attribute “minResolution” in DPI for screen size adaptation accepted with a note that square pixel is assumed Proposed to add new attribute “minSize” for text adaptation accepted with modification that option will be the separate attribute Proposed to add parameter for MemoryStatus event, the number of points, the number of Unicode characters, and the size of composition buffer accepted m16074 This contribution proposed additional attributes for parsingSwitch Proposed to add new attribute “mode” & “removable” accepted with modification, adding one more mode, “ascending” and removing “removable” m16075 This contribution illustrate the example use case of scene adaptation under the IPTV service environment agreed to add an AHG mandate to create demonstration content based on this service scenario m16085 This contribution proposes Simple media object filtering accepted to put into TuC (need more investigation on the relevance of categorization and completeness of lists) User input filtering accepted to put into TuC (need more investigation on completeness of lists) Advanced content constraints accepted to put into FPDAM m16230 This is official FNB comment version of m16085 Technical Work in Progress. 9.2 14496-20:200X/AMD 3 PSI 9.2.1 Topics 1. 9.2.2 Sessio n Scene Presentation of Structured Information Contributions Numbe Title r m1600 Summary of Voting on ISO/IEC 144960 20:200X/PDAM 3 Authors SC 29 Secretariat m16000 Approved with one comment from KNB. Unfortunately no experts to answer this comment attend this meeting. So, disposition is delayed until the next meeting. 74 10 Open Font Format (14496-22) 10.1 14496-22:200X 2nd edition 10.1.1 Topics 1. 10.1.2 Sessio n Genera l Open Font Format Contributions Numbe Title r m1598 Summary of Voting on ISO/IEC FCD 4 14496-22 [2nd Edition] Authors SC 29 Secretariat m15984 Comments from JNB and USNB are all accepted. Need editing period until the end of March. Technical Work in Progress. 10.2 14496-22:200X Amd.X 10.2.1 Topics 1. Open Font Format Extension 10.2.2 Contributions Sessio Numbe Title Authors n r Genera m1602 Proposal for a new work item for ISO/IEC Simon Daniels l 3 14496-22 Vladimir Levantovsky m16023 This contribution propose to establish an AHG with the mandate to explore possible ways to overcome the 64K limit of the existing font format specification, which would allow supporting full Unicode character repertoire without an adverse effect on existing implementations. Agreed to establish AHG with proposed mandate. Technical Work in Progress. 11 11.1 MPEG-7 15938-12 MPEG Query Format 11.1.1 Topics 1. Corrigendum 75 11.1.2 Sessio n Genera l Genera l Contributions Numbe r m1597 1 m1599 4 Title Authors Table of Replies on ISO/IEC FDIS 1593812 [SC 29 N 9911] Summary of Voting on ISO/IEC 1593812:2008/DCOR 1 ITTF via SC 29 Secretariat SC 29 Secretariat m15971 ISO/IEC FDIS 15938-12 is approved. m15994 ISO/IEC 15938-12:2008/DCOR1 is approved without any comments. Technical Work in Progress. 12 12.1 21000 MPEG-21 21000-2 DID 12.1.1 Topics 1. 12.1.2 Sessio n Scene PSI Contributions Numbe Title r m1615 On WD 1.0 of ISO/IEC 21000-2:2005 5 AMD1 (PSI) Authors Christian Timmerer m16155 Raising open questions regarding the place to embed presentation element and proposing to include example. Accept to be included in new Working Draft. Technical Work In Progress. 12.2 21000-19 Media Value Chain Ontology 12.2.1 Topics 1. Media Value Chain Ontology 12.2.2 Contributions Sessio Numbe Title n r MVCO m1599 Summary of Voting on ISO/IEC CD 210005 19 MVCO m1602 Liaison Statement from IFPI [SC 29 N 0 9995] MVCO m1623 Liaison Statement from EDItEUR 1 76 Authors SC 29 Secretariat IFPI via SC 29 Secretariat MVCO m1619 3 Liaison Statement from the International DOI Foundation (IDF) IDF via SC 29 Secretariat m15995 One approval with comment from Spain, Three disapproval with comments from Japan, Germany and USA. Unfortunately comments are not finally disposed during the meeting because of lack of participants from certain Nation Body and lack of time to discuss the issues. Final disposition is delayed until next meeting. m16020, m16193, and m16231 Similar liaisons raising confusion about the purpose of MVCO and its relationship with RDD. Agreed to send replies explaining that the MVCO is intended as a fully machine readable ontology with a core model centered on the value chain while the RDD was designed to be implemented as a rights dictionary for referencing terms and their human definitions. Technical Work In Progress. 13 MPEG-A MAF (23000) 13.1 23000-6 Professional Archival AF 13.1.1 Topics 1. Professional Archival AF 13.1.2 Sessio n MAF MAF MAF Contributions Numbe r m1599 6 m1608 3 m1617 6 Title Authors Summary of Voting on ISO/IEC 230006:200X/PDAM 1 Liaison Statement from SC 29/WG 1 Status report on ISO/IEC 23000-6 Professional Archival Application Format Reference Software and Conformance files SC 29 Secretariat WG 1 via SC 29 Secretariat Houari Sabirin Hendry Noboru Harada Munchurl Kim m15996 PDAM registration is approved without any comment. PDAM text is approved with one comment from JNB asking conf. sw. and bitstream as soon as possible. This is accepted. m16083 JPEG is working on a Digital Cinema archival format. The packaging format will be based on MPEG-21 File Format and willing to use PA AF as a base and extend it to fulfill their requirements. We will inform the progress of PA AF conf. and ref. sw. and invite to join the AHG reflector for further discussion. m16176 The first version of PA AF packager and extractor are developed. Only one conformance point can be checked with this software. API will be developed further. This work will be completed by July 2009. 77 Technical Work in Progress. 13.2 23000-9 DMB Application Format 13.2.1 Topics 1. 13.2.2 Sessio n MAF MAF DMB Application Format Contributions Numbe r m1596 9 m1607 8 Title Authors Summary of Voting on ISO/IEC 230009:2008/PDAM 1 [SC 29 N 9882] Updated text, conf. files, and ref. sw for ISO/IEC 23000-9 (DMB-AF) SC 29 Secretariat Hui Yong Kim Myung Seok Ki HanKyu Lee Houari Sabirin Munchurl Kim Jung Soo Lee Yong Han Kim m15969 PDAM has been approved without any comments m16078 No major changes to the workplan. The softwares and bitstream will be ready by July 2009 Technical Work in Progress. 13.3 23000-10 Video Surveillance MAF 13.3.1 Topics 1. 13.3.2 Sessio n MAF Video Surveillance MAF Contributions Numbe Title r m1597 Summary of Voting on ISO/IEC 230007 10:200X/PDAM 1 [SC 29 N 9929] Authors SC 29 Secretariat m15977 Editorial comments from Japan, US, Germany are all aceepted. One major technical comment from UK is also accepted but needs two weeks of editing period to implement. Technical Work in Progress. 78 13.4 23000-11 Stereoscopic Video AF 13.4.1 Topics 1. 13.4.2 Sessio n MAF MAF Stereoscopic Video AF Contributions Numbe Title r m1613 Proposed Text for WD of ISO/IEC 230003 11 Stereoscopic Video AF Reference Software m1622 Proposed Corrigendum on ISO/IEC 230002 11 Stereoscopic Video Application Format Authors Next generation Broadcasting Forum(Korea) Next generation Broadcasting Forum(Korea) m16133 The firs version of SSVAF player has been implemented. The final version will be developed by July 2009. m16222 It was identified during the editing of the final text of FDIS that the descriptor to identify the type of voice codec used is missing. Editors agreed to issue a DCOR at this meeting by adding Sample Entry Boxes as used in 3GP file format standard. Technical Work in Progress. 13.5 23000-12 Interactive Music AF 13.5.1 Topics 1. 13.5.2 Sessio n MAF Interactive Music AF Contributions Numbe Title r m1608 Study text of ISO/IEC 23000-12 WD 1 Interactive music application format MAF m1613 1 Report of Mini Experiment on IM AF Constraints representation MAF m1613 2 Constraints Specifications for IM AF MAF m1613 Constraints representation method for IM 79 Authors Inseon Jang Huiyong Kim Jeongil Seo Laurent Primaux Owen Lagadec Emmanuel Bouix Fabien Gallot Inseon Jang Hui Yong Kim Jeongil Seo Kyeongok Kang Laurent Primaux Owen Lagadec Emmanuel Bouix Fabien Gallot Laurent Primaux 4 MAF AF m1621 4 A proposal for streaming support for IM AF Owen Lagadec Emmanuel Bouix Fabien Gallot Inseon Jang Hui Yong Kim Jeongil Seo Kyeongok Kang Yongwei Zhu Susanto Rahardja Te Li Haibin Huang m16132 Two constrains, Selection Constraint and Mixing Constraint have been described. This has been used as a base of mini experiment m16131 Two method, ISO File Format and MPEG-21 DIA UCD were compared to verify which represents constraints better. ISO File Format fulfills all requirements more efficiently than MPEG-21 DIA. ISO File Format is recommended to be used. m16134 This contribution proposes Constraints representation method based on ISO File Format. Accepted to be included in the CD. m16081 Improved text of previous WD. m16214 Propose to include SLS codec to support scalability. However, scalability does not seem to be an appropriate requirement for IMAF. 14 Project Started 14.1 MPEG eXtensible Middleware 14.1.1 Topics 1. 2. 14.1.2 Sessio n MxM MxM MxM MxM MPEG eXtensible Middleware Architecture and Technology MXM APIs Contributions Numbe r m1618 3 m1618 4 m1603 9 m1612 6 Title Authors Proposed WD3.0 of MxM Architecture and Technologies Proposed WD3.0 of MxM APIs Filippo Chiariglione Proposal for new APIs of video metadata on MXM APIs DMAG-UPC Comments on WD2.0 of MXM API Wonsuk Lee Seungyun Lee Jaime Delgado Eva Rodríguez Víctor Rodríguez-Doncel Silvia Llorente 80 Filippo Chiariglione MxM MxM MxM MxM MxM MxM MxM MxM m1618 5 m1618 6 m1603 7 m1615 8 m1618 7 m1615 1 Proposed WD2.0 of MxM Ref. SW. and Conf. Proposed WD2.0 of 2nd edition of ISO/IEC 29116-1 (MXM Protocols) Proposal of MXM Ontology for inter-MXM communication protocols Updates for the MPEG Extensible Middleware MXM use-case proposals for 3D services m1600 3 m1612 8 Liaison Statement from W3C [SC 29 N 9930] Presentation of the W3C MAWG Activities MXM API for 3D Graphics content creation Rubén Barrio Víctor Torres Filippo Chiariglione Filippo Chiariglione Kangchan Lee Seungyun Lee Christian Timmerer Patrick Gioia Ivica Arsov marius.preda@int-evry.fr Françoise Preteux W3C via SC 29 Secretariat Victor Rodriguez-Doncel Jaime Delgado Ruben Tous m16183 - Not many big changes compared to the version approved in Busan m16184 - Recommendation: Use Doxygen or similar tools to generate API specification automatically. m16037 - We expect further contributions in terms of use cases and requirements that MXM currently does not support in order to understand whether an MXM ontology is needed, and if so which would be the best place to accommodate it. m16039 - Having a generic API to set/get typical metadata fields not only MPEG-7 based would be desirable. Recommendation: add generic API next to more MPEG-7 oriented API to support different types of metadata within MXM m16126 - All proposed additions were accepted; we will leave the specific work on the API to the editing period m16128 - A liaison statement was written. m16151 - Very good progress on 3D Graphics part of MediaFramework engine, now having both creation and access API. We need a uniform approach for dealing with file format issues in MXM m16158 - DIA APIs are in a very good shape. Reference software available in Java m16187 - Interesting use cases to be considered within MXM Protocols m16220 - Recommendation: add the caliph and the MPEG7 JRS to the set of MXM APIS; consider harmonisation with MXM style - harmonise the documentation and the code style Technical Work in Progress. 14.2 Representation of Sensory Effects 14.2.1 Topics 1. Representation of Sensory Effects 81 14.2.2 Sessio n RoSE Contributions Numbe r m1602 4 m1615 7 m1616 7 m1616 9 MPEG Representation of Sensory Effects Vision Minor Corrections to RoSE WD 2.0 XML Schema Updates and Additional Tools for MPEG RoSE A simple RoSE system implementation including SDC, USP, and SDCom RoSE m1617 0 A demonstration for reference color type and its parameters in RoSE RoSE m1619 1 m1619 2 Study on Sensory Effect Metadata RoSE m1620 1 Comments and Proposal for Sensory Effect Metadata RoSE m1620 3 A proposal for RoSE system architecture RoSE RoSE RoSE RoSE Title Authors Proposal for Sensory Effect Metadata Christian Timmerer Markus Waltl Christian Timmerer Markus Waltl Christian Timmerer Jin-Seo Kim Maeng-Sub Cho Bon-Ki Koo Yong Soo Joo Sang-Kyun Kim Jin-Seo Kim Maeng-Sub Cho Bon-Ki Koo Yong Soo Joo Sang-Kyun Kim Yasuaki Tokumo Shin-ya Hasegawa Yasuaki Tokumo Shin-ya Hasegawa Takuya Iwanami B. S. Choi SangHyun Joo HaeRyong Lee KwangRo Park Sanghyun Joo m16157 Summary: - Minor corrections for XML schema in WD2.0 Evaluations: - Agreed. m16167 Summary: - Classification schemes for human readability, extensibility - Sensory effect pattern - New sensory effect types for water sprayer, perfumer, fog, blind, and sound Evaluations: - CS: ok, adopted - Sensory effect pattern: not adopted - Water sprayer: ok, adopted - Perfumer: ok, adopted - Fog: ok, adopted 82 - Window blind: basically adopted but needs to be aligned m16201 (shading); add’l attributes might be ‘range’, ‘speed’, … - Sound: not sure because this could be also integrated into the audio channel(s) of the movie. Needs further evidence before being included as a new sensory effect type. m16169 Summary: - Demonstration of implementation of RoSE system applied the “wind” sensory effect Evaluations: - Informative report for demonstration purposes only. - The BoG thanks the authors for this invaluable contribution and solicits further contributions like that. m16170 Summary: - New sensory effect type for color correction effect - ReferenceColorParameterType as extension of the ParameterBaseType - ReferenceColor Effect: basically turn on/off Evaluations: - Reference color effect can be applied to certain scenes only (e.g., advertisement, coloured scenes in a black/white movie) - Size of the parameters is constant and independent of the number of scenes where the correction should be applied - Name should be changed to “color correction” m16191 Summary: - Study on TuC regarding representation of position - Clarification of WD 2.0 regarding SEM Adaptability - Abstract Position Table (APT) + Specific Location Table (SLT) Evaluations: - Harmonize with m16201, e.g., adopt a classification scheme with the positions in a hierarchical way that may be grouped according to a certain criteria (see also below). - Proposed changes for ‘adaptType’ as informative note m16192 Summary: - Hint information for fragmentation - Improvement for time model - Hint information for automatic extraction Evaluations: - Hint information for seamless play: interesting, but this is considered something to be handled by the RoSE engine and does not require explicit signalling in metadata - DTS: interesting, but warm-up/initialization time is an issue for the device capabilities but not for the sensory effect metadata because ‘dts’ cannot be known at authoring stage - Fade-in/-out-value: interesting but possible with existing tools, i.e., (1) start up to intensity Ii and (2) fade-in from Ii to I. We should consider including this as an example. - Hint information for automatic extraction: adopt ‘autoExtraction’ but signal this at the beginning in the ‘header’ of the sensory effect metadata 83 m16201 Summary: - Comments on TuC - New sensory effect types for Flash, Color Light, heating/cooling, shading Evaluations: - position: we that but we’ll adopt a classification scheme approach and develop a wild-card mechanism for, e.g., addressing all effects on the left-hand side - direction: withdrawn - namedColor & colorInRGB: adopt both, i.e., “Standard Named Colours” and RGB with hexadecimal representation - intensity in general: keep as it is but may need to define a range for Celsius, Beaufort, Richter. - intensity for heating/cooling: seems to be a general issue as it is not clear how temperature is perceived by individual persons – there’s a need for (room) temperature classification like it is done for http://en.wikipedia.org/wiki/Lux - FlashType: ok, adopted but frequency need not be restricted - ColorLightType: merged to LightType - Shading vs. Window Blind: range [closed, opened], for speed the scale is not clear and we’ll adopt only ‘slow’ and ‘fast’ for the moment m16204 Summary: - Concept, Scope of standard Evaluations: - Agreed through e-mail reflector. m16203 Summary: - Detail information for implementation Evaluations: - Need more discussion in BoG meeting Technical Work in Progress. 14.3 MPEG-V 14.3.1 Topics 1. 14.3.2 Information exchange with Virtual Worlds General Issues Harmonization with RoSE o Agreed on the structure of harmonized standard. There will be no differentiation of schemas for “information exchange between virtual worlds” and “information exchange between virtual worlds and real worlds” o Proposed structure Part 1 Architecture Part 2 Control Information Part 3 Sensory Information 84 Part 4 Avatar Information (TBD) o Schemas to represent virtual world information will be categorized based on the characteristics of what is described and will form a separate parts 14.3.3 Session MPEGV MPEGV MPEGV MPEGV Contributions Numbe r m1607 2 m1608 0 m1617 4 m1605 1 Title Authors MPEG-V CfP Response Jean H.A. Gelissen (ed) MPEG-V CfP Response Jean H.A. Gelissen (ed) MPEG-V CfP Response Jean H. A. Gelissen (Ed) Full motion control and navigation of avatar/object with multi-input sources in MPEG-V jeong-hwan ahn The first conclusions from the evaluation of these contributions are that they address the following requirement areas of MPEG-V as indicated in the contributions: m16051: Requirements related to ‘Data representations between virtual worlds and the real world’ m16072: Requirements related to ‘Data representations between virtual worlds’ m16080: Requirements related to ‘Data representations between virtual worlds and the real world’ m16174: Requirements related to ‘Data representations between virtual worlds and the real world’ As mentioned in the contributions m16072, m16080, and m16051are in an early stage, in the case of m16174 even at the stage of a more detailed description of the requirements, due to the early stage of the related activities. They all also mention however that more detailed contributions will be submitted for next MPEG meetings. 85 15 Exploration 15.1 Richmedia UI Framework 15.1.1 Topics 1. Richmedia UI Frameworks 15.1.2 Sessio n Scene Contributions Numbe Title r m1603 Items under considerations in Rich UI 8 Framework Authors Kyungmo Park Cyril Concolato Jean Le Feuvre Giovanni Cordara m16038 Prposed update to Context and Objectives of RMUIF. the goal of this activity is to integrate the MPEG RMUIF with the existing MPEG delivery systems: the ISO base media file format, the MPEG-2 TS and MPEG-4 Part 8 (4onIP). The MPEG framework should be divided in two parts: the first part agnostic of the presentation and focused on the delivery and the second part agnostic of the delivery and focused on the presentation. 15.2 Advanced IPTV Terminal 15.2.1 Topics 1. 15.2.2 Sessio n AIT AIT AIT AIT AIT AIT AIT Advanced IPTV Terminal Contributions Numbe r m1600 8 m1600 9 m1601 0 m1601 1 m1601 2 m1601 3 m1601 4 Title Authors Use cases for consideration by Ad Hoc Group on Advanced IPTV Terminal Technologies for consideration by Ad Hoc Group on Advanced IPTV Terminal Peer-to-Peer iDRM Leonardo Chiariglione Web, Internet and Mobile TV Filippo Chiariglione ,Tiejun Huang Leonardo Chiariglione The Digital Media in Italia proposal Open IPTV Platform For an Open Content Market Approaching the Zettabyte Era 86 Leonardo Chiariglione Walter Allasia Lucia Marchisio Young-Kwon LIM AIT AIT AIT m1601 5 m1616 8 Contribution to the scope of the planned Advanced IPTV Terminal standard Use cases and Requirements for Advanced Internet TV Terminals m1618 8 Proposal of Advanced IPTV Terminal (AIT) requirements Young-Kwon LIM Christian Timmerer Mark Stuard Franc Kozamernik Jari Ahola Leonardo Chiariglione m16008, m16009, m16010, m16011, m16012, m16013 and m16014 were reviewed at Torino AHG meeting and integrated to produce m16015 proposing more detailed scope of AIT project m16168 This contribution provides commercial use cases and requirements derived from them especially in a Peer-to-Peer environment. Agreed to adopt some use cases as starting points to develop AIT use cases. m16188 This contribution proposed refined and restructured requirements on AIT. 15.3 AOB 15.3.1 Joint meeting with Video & Requirement Question about the modern way of transportation of multimedia content is raised. Five possible areas might be useful to explorer are identified. This seems to be HVC System work item… Transport- and file format friendly stream format (David Singer, Apple) Cross layer optimization between video and transport layer (Young-Kwon) Error resilience for MPEG streams (Joern) 4th bullet (Friendliness with other transport mechanism (Leonardo) Content adaptation to different networks (Christian) 16 Liaison Cf. Liaison output. 87 17 Latest References and Publication Status Reference on the ISO Web Site : http://www.itscj.ipsj.or.jp/sc29/open/29view/29n9270c.htm Pr Pt 2 1 1 1 1 1 ISO/IEC 13818-1/Amd.7 1 1 ISO/IEC 13818-1:2000/Amd.2 (Support for IPMP on 2) 1 1 ISO/IEC 13818-1:2000/Amd.4 (Metadata Application CP) 1 1 1 ISO/IEC 13818-1:2000/COR3 (Correction for Field Picture) ISO/IEC 13818-1:2006 (MPEG-2 Systems 3rd Edition) 2 1 1 ISO/IEC 13818-1:2006/Amd.1 (Transport of Streaming text) N8369 2 1 ISO/IEC 13818-1:2006/Amd.2 (Carriage of Auxialiry Video Data) N8798 2 2 2 2 2 2 2 2 2 2 2 2 Standard No. Issue ISO/IEC 13818-1:2000 (MPEG-2 Systems 2nd Edition) ISO/IEC 13818-1:2000/COR1 (FlexMux Descr.) ISO/IEC 13818-1:2000/COR2 (FlexMuxTiming_ descriptor) ISO/IEC 13818-1:2000/Amd.1 (Metadata on 2) & COR1 on Amd.1 ISO/IEC 13818-1:2000/Amd.3 (AVC Carriage on MPEG-2) ISO/IEC 13818-1:2000/Amd.5 (New Audio P&L Sig.) ISO/IEC 13818-1:2000/COR4 (M4MUX Code Point) ISO/IEC 13818-1:2000/COR5 (Corrections related to 3rd Ed.) 00/12 N3844 N4404 N5867 N5604 N5771 N6847 N6585 N6845 N7469 N7895 01/01 Pisa 01/12 Pattaya 03/07 Trondheim 03/03 Pattaya 03/07 Trondheim 04/10 Palma 04/07 Redmond 04/10 Palma 05/07 Poznan 06/01 Bangkok 06/xx 88 06/07 Klagenfurt 07/01 Marrakech Status Doc. With Purpose Published Published Published Published Published 2000/12 2000/12 2002/03 2002/12 2003/12 ISO Award Done Proposed N/A N/A Proposed Published Published 2004/03 XXXX N/A Proposed FDAM FDAM ITTF ITTF to be published to be published N/A N/A COR COR COR ITTF ITTF ITTF to be published to be published to be published N/A N/A N/A Published FDAM ITTF ITTF to be published TBP TBP FDAM ITTF to be published TBP 2 1 ISO/IEC 13818-1:2006/Cor.1.2 (Reference to AVC Specification) N9365 2 1 ISO/IEC 13818-1:2006/Amd.3 (SVC in MPEG-2 Systems) 2 1 ISO/IEC 13818-1:2006/Cor.2 (Corrections to SVC in MPEG-2) 2 11 1 N1005 8 N1024 0 N5607 N2501 4 4 ISO/IEC 13818-1:2003 (IPMP on 2) st ISO/IEC 14496-1 (MPEG-4 Systems 1 Ed.) 07/10 Shenzhen 08/07 Hannover 08/10 Busan FDAM ITTF to be published TBP FDAM ITTF to be published TBP COR ITTF to be published TBP 03/03 Pattaya 98/10 Atl. City Published Published 2003/12 1999/12 Proposed Done Published Published 2001/11 2001/11 Done N/A Published Published COR COR 2001/11 2002/10 ITTF ITTF N/A Done N/A N/A COR ITTF N/A AMD ITTF N/A Published 2004-05 N/A Published Published 2003/12 2004-08 N/A N/A AMD PDAM ITTF ITTF PDAM IS ITTF ITTF 1 1 ISO/IEC 14496-1/Amd.1 (MP4, MPEG-J) ISO/IEC 14496-1/Cor.1 N3054 N3278 ISO/IEC 14496-1:2001 (MPEG-4 Systems 2nd Ed.) N3850 4 1 1 1 1 99/12 Hawaii 00/03 Noordwijk. 01/01 Pisa ISO/IEC 14496-1:2001/Cor.2 N4264 N5275 01/07 Sydney 02/10 Shangai 4 1 ISO/IEC 14496-1:2001/Cor.3 N6587 4 1 ISO/IEC 14496-1:2001/Amd.2 (Textual Format) N4698 4 1 ISO/IEC 14496-1:2001/Amd.3 (IPMP Extensions) N5282 4 1 1 ISO/IEC 14496-1:2001/Amd.4 (SL Extension) N5471 N5976 1 1 ISO/IEC 14496-1:2001/Amd.8 (ObjectType Code Points) N6202 N7229 04/07 Redmond 02/03 Jeju Island 02/10 Shanghai 02/12 Awaji 03/10 Brisbanne 03/12 Hawaii 05/04 Busan 1 1 ISO/IEC 14496-1:200x/Cor4 (Node Coding Table) N7473 N5277 05/07 Poznan 02/10 4 4 4 4 4 4 4 4 4 ISO/IEC 14496-1:2001/Amd.1 (Flextime) ISO/IEC 14496-1:2001/Cor.1 ISO/IEC 14496-1:2001/Amd.7 (AVC on 4) ISO/IEC 14496-1:200x/Amd.1 (Text Profile Descriptors) ISO/IEC 14496-1 (MPEG-4 Systems 3rd Ed.) 89 to be published Final Text Editing to be published to be published N/A N/A N/A Proposed 4 1 ISO/IEC 14496-1:200x/Amd.1 (Text Profile Descriptors) N7229 4 1 ISO/IEC 14496-1:200x/Cor.1 (Clarif. On audio codec behavior) N8117 4 1 ISO/IEC 14496-1:200x/Amd.2 (3D Profile Descriptor Extensions) N8372 4 1 ISO/IEC 14496-1:200x/Cor.2 (OD Dependencies) N8646 4 1 ISO/IEC 14496-1:200x/Amd.3 (JPEG 2000 support in Systems) N8860 4 4 ISO/IEC 14496-1:200x/Amd.17 (ATG Conformance) N8861 4 4 ISO/IEC 14496-1:200x/Amd.22 (AudioBIFS v3 conformance) N9295 4 4 ISO/IEC 14496-1:200x/Amd.23 (Synthesized Texture conformance) N9369 4 4 ISO/IEC 14496-1:200x/Amd.24 (File Format Conformance) N9370 4 4 ISO/IEC 14496-1:200x/Amd.25 (LASeR V1 Conformance) N9372 4 4 ISO/IEC 14496-1:200x/Amd.26 (Open Font Format Conf.) N9815 4 4 ISO/IEC 14496-1:200x/Amd.27 (LASeR Amd.1 Conformance) N9816 4 5 ISO/IEC 14496-1:200x/Amd.12 (File Format) N9020 4 5 ISO/IEC 14496-1:200x/Amd.14 (Open Font Format) 4 5 5 6 ISO/IEC 14496-1:200x/Amd.16 (SMR Ref. Soft) N1024 6 N9672 N9674 4 4 ISO/IEC 14496-1:200x/Amd.17 (LASeR Ref. Soft) ISO/IEC 14496-6:2000 90 Shanghai 05/04 Busan 06/04 Montreux 06/07 Klagenfurt 06/10 Hangzhou 07/01 Marrakech 07/01 Marrakech 07/07 Lausanne 07/10 Shenzhen 07/10 Shenzhen 07/10 Shenzhen 08/04 Archamps 08/04 Archamps 07/04 San Jose 08/10 Busan 08/01 Antalya 08/01 Antalya N/A ITTF Final Text Editing Final Text Editing to be published COR ITTF to be published N/A PDAM ITTF to be published N/A PDAM ITTF to be published N/A PDAM ITTF to be published N/A PDAM ITTF to be published N/A PDAM ITTF to be published N/A PDAM ITTF to be published N/A PDAM ITTF to be published N/A PDAM ITTF to be published N/A PDAM ITTF to be published N/A PDAM ITTF to be published N/A PDAM PDAM Published ITTF ITTF 2000/12 to be published to be published N/A N/A N/A PDAM ITTF COR ITTF PDAM N/A N/A 4 ISO/IEC 14496-8 (MPEG-4 on IP Framework) 4 8 11 4 11 ISO/IEC 14496-11/Amd.1 (AFX) 4 11 4 11 Published FDIS 2004-05 SC29 N5480 02/03 Jeju 05/01 HongKong 02/12 Awaji FDAM ITTF ISO/IEC 14496-11/Amd.2 (Advanced Text and Graphics) N6205 03/12 Hawaii FDAM ITTF ISO/IEC 14496-11/Cor.1 N6203 03/12 Hawaii COR SC29 ISO/IEC 14496-11 (MPEG-4 Scene Description 3rd Edition) N4712 N6960 4 11 ISO/IEC 14496-11/Cor.3 Valuator/AFX related correction N6594 4 11 ISO/IEC 14496-11/Amd.3 Audio BIFS Extensions N6591 4 11 ISO/IEC 14496-11/Amd.4 XMT and MPEG-J Extensions N6959 4 11 ISO/IEC 14496-11/Cor.3 (Audio BIFS Integrated in 3rd Edition) N7230 4 11 ISO/IEC 14496-11/Cor.5 (Misc Corrigendum) N8383 4 11 ISO/IEC 14496-11/Amd.5 Symbolic Music Representation N8657 4 ISO/IEC 14496-11/Cor.6 (AudioFx Correction) 4 11 12 ISO/IEC 14496-12 (ISO Base Media File Format) N9021 N5295 4 12 ISO/IEC 14496-12/Amd.1 ISO FF Extension N6596 4 12 N7232 4 12 ISO/IEC 14496-12/Cor.1 (Correction on File Type Box) ISO/IEC 14496-12/Cor.2 (Miscellanea) 4 12 ISO/IEC 14496-12/Amd.1 (Description of timed metadata) N8659 N7901 91 04/07 Redmond 04/07 Redmond 05/01 HongKong 05/04 Busan 06/07 Klagenfurt 06/10 Hangzhou 07/04 San Jose 02/10 Shanghai 04/07 Redmond 05/04 Busan 06/01 Bangkok 06/10 Hangzhou Final Text Editing Integration in 1st Ed. Integration in 1st Ed. Proposed Proposed N/A N/A N/A st Integration in 1 Ed. Integration in 1st Ed. Integration in 1st Ed. Final Text Editing N/A COR ITTF FDAM ITTF FDAM ITTF COR ITTF COR SC29 N/A FDAM ITTF TBP COR Published SC29 2004-02 N/A Proposed FDAM ITTF FDAM 04/11/30 N/A COR ITTF N/A COR ITTF Final Text Editing Final Text Editing FDAM ITTF Proposed N/A N/A N/A N/A 4 12 ISO/IEC 14496-12/Cor.3 (Miscellanea) N9024 07/04 San Jose COR ITTF 4 07/04 San Jose FDAM ITTF N/A 4 12 COR ITTF N/A 13 N1025 0 N5284 08/10 Busan 4 ISO/IEC 14496-12/Amd.2 (Flute Hint Track) ISO/IEC 14496-12:2008 (ISO Base Media File Format 2nd ed.) ISO/IEC 14496-12:2008/Cor.1 (Corrections to Flute Hint Track) ISO/IEC 14496-13 (IPMP-X) N9023 4 12 12 IS ITTF 4 14 ISO/IEC 14496-14 (MP4 File Format) N5298 Published 2003-11 4 14 ISO/IEC 14496-14/Cor.1 (Audio P&L Indication) N7903 COR ITTF 4 15 ISO/IEC 14496-15 (AVC File Format) N5780 Published 2004-04 4 15 ISO/IEC 14496-15/Amd.1 (Support for FREXT) N7585 02/10 Shanghai 02/10 Shanghai 06/01 Bangkok 03/07 Trondheim 05/10 Nice FDAM ITTF 4 4 15 15 ISO/IEC 14496-15/Cor.1 ISO/IEC 14496-15/Cor.2 (NAL Unit Restriction) N7575 N8387 COR COR ITTF ITTF N/A N/A 4 15 N9682 FDAM ITTF N/A 4 17 18 18 ISO/IEC 14496-15/Amd.2 (SVC File Format Extension) ISO/IEC 14496-17 (Streaming Text) ISO/IEC 14496-18 (Font Compression and Streaming) ISO/IEC 14496-18/Cor.1 (Misc. corrigenda and clarification) ISO/IEC 14496-19 (Synthesized Texture Stream) ISO/IEC 14496-20 (LASeR) ISO/IEC 14496-20/Cor.1 (Misc. corrigenda and clarification) FDAM Published COR ITTF 2004-07 ITTF TBP Proposed N/A Published FDAM COR 2004-07 Editor ITTF Proposed TBP N/A 4 4 4 4 4 19 20 20 N7479 N6215 N8664 N6217 N7588 N8666 92 05/10 Nice 06/07 Klagenfurt 08/01 Antalya 05/07 Poznan 03/12 Hawaii 06/10 Hangzhou 03/12 Hawaii 05/10 Nice 06/10 Hangzhou Final Text Editing to be published N/A Proposed Proposed Final Text Editing N/A Proposed Final Text Editing N/A 4 4 20 20 ISO/IEC 14496-20/Amd.1 (LASeR Extension) ISO/IEC 14496-20/Cor.2 (Profile Removal) N9029 N9381 4 20 ISO/IEC 14496-20/Amd.2 (SVGT1.2 Support) N9384 4 22 ISO/IEC 14496-22 (Open Font Format) N8395 7 1 ISO/IEC 15938-1 (MPEG-7 Systems) N4285 7 ISO/IEC 15938-1/Amd.1 (MPEG-7 Systems Extensions) 7 1 1 1 1 2 7 ISO/IEC 15938-7/Amd.2 (Fast Access Ext. Conformance) N6326 N6328 N7490 N7532 N4288 N8672 7 12 ISO/IEC 15938-12 MPEG Query Format N9830 21 7 21 7 ISO/IEC 21000-7 AMD 1 (DIA Query Format Capability) ISO/IEC 21000-7 COR 1 (Corrections) 21 8 ISO/IEC 21000-8 AMD 1 (Extra Ref. SW) 21 9 ISO/IEC 21000-9 (MPEG-21 File Format) N1026 0 N1026 2 N6975 21 9 ISO/IEC 21000-9/Amd.1 (MPEG-21 Mime Type) N9837 21 15 ISO/IEC 21000-15 (Security in Event Reporting) N9839 7 7 7 7 ISO/IEC 15938-1/Cor.1 (MPEG-7 Systems Corrigendum) ISO/IEC 15938-1/Cor.2 (MPEG-7 Systems Corrigendum) ISO/IEC 15938-1/Amd.2 (BiM extension) ISO/IEC 15938-2 (MPEG-7 DDL) 93 07/04 San Jose 07/10 Shenzhen 07/10 Shenzhen 06/07 Klagenfurt 01/07 Sydney FDAM FDAM ITTF ITTF N/A N/A FDAM ITTF N/A FDAM Editor Published 2002/07 FDAM COR COR FDAM Published FDAM ITTF Editor ITTF ITTF 2002/02 ITTF FDIS ITTF FDAM ITTF 08/10 Busan COR ITTF 08/10 Busan FDAM ITTF 05/01 HongKong 08/04 Archamps 08/04 Archamps FDIS ITTF FDAM ITTF Done FDIS ITTF TBP 04/03 Munich 04/03 Munich 05/07 Poznan 05/10 Nice 01/07 Sydney 06/10 Hangzhou 08/04 Archamps Final Text Editing TBP Done FDAM 04/11/28 N/A N/A N/A N/A Done N/A N/A FDIS 05/01/21 Done 21 A 16 5 4 4 ISO/IEC 21000-16 (MPEG-21 Binary Format) ISO/IEC 21000-5 (Open Release Content Profile) ISO/IEC 23000-4 (Musical Slide Show MAF) ISO/IEC 23000-4 (Musical Slide Show MAF 2nd Ed.) N7247 N9687 N9037 N9843 A 4 ISO/IEC 23000-4 AMD 1 (MSS AF Conf. & Ref. SW) A 6 ISO/IEC 23000-6 (Professional AF) A A 7 7 ISO/IEC 23000-7 (Open Access MAF) ISO/IEC 23000-7 AMD 1 (OA AF Conf. & Ref. SW) A 8 ISO/IEC 23000-8 (Portabe Video AF) N1026 7 N1026 9 N9698 N1027 4 N9853 A 9 ISO/IEC 23000-9 (Digital Multi. Broadcasting MAF) N9397 A 9 N9854 A 10 ISO/IEC 23000-9/Cor.1 (Digital Multi. Broadcasting MAF) ISO/IEC 23000-10 (Video Surveillance AF) A 11 ISO/IEC 23000-11 (Stereoscopic Video AF) B B 1 1 B 1 B 1 ISO/IEC 23001-1 (XML Binary Format) ISO/IEC 23001-1/Cor.1 (Misc. Editorial and technical clar.) ISO/IEC 23001-1/Cor.2 (Misc. Editorial and technical clar.) ISO/IEC 23001-1/Amd.1 (Reference Soft. & Conf.) B 1 ISO/IEC 23001-1/Amd.1 (Exten. On encoding of wild 21 A N1027 9 N1028 3 N7597 N8680 N9049 N8886 N9296 94 05/04 Busan 08/01 Antalya 07/04 San Jose 08/04 Archamps 08/10 Busan FDIS FDAM FDIS FDIS ITTF ITTF ITTF ITTF FDAM ITTF TBP FDIS ITTF TBP FDIS FDAM ITTF ITTF TBP TBP 08/04 Archamps 07/10 Shenzhen 08/04 Archamps 08/10 Busan FDIS ITTF TBP FDIS ITTF TBP COR ITTF TBP FDIS ITTF TBP 08/10 Busan FDIS ITTF TBP 05/10 Nice 06/10 Hangzhou 07/04 San Jose FDIS COR ITTF ITTF TBP N/A COR ITTF N/A FDAM ITTF N/A PDAM ITTF 08/10 Busan 08/01 Antalya 08/10 Busan 07/01 Marrakech 07/07 FDIS 05/04/22 to be published TBP TBP TBP TBP N/A E 2 3 1 cards) ISO/IEC 23001-2 (Fragment Request Unit) ISO/IEC 23001-3 (IPMP XML Messages) ISO/IEC 23008-1 Architecture N9051 N9416 N8892 E 2 ISO/IEC 23008-2 Multimedia API N8893 E 3 ISO/IEC 23008-3 Component Model N8894 E 4 ISO/IEC 23008-4 Ressource & Quality Management N8895 E 5 6 7 1 ISO/IEC 23008-5 Component Download ISO/IEC 23008-6 Fault Management ISO/IEC 23008-7 System Integrity Management ISO/IEC 29116 Media Streaming MAF Protocols N9053 N9054 N9055 N9420 B B E E 29116 95 Lausanne 07/04 San Jose 07/04 San Jose 07/01 Marrakech 07/01 Marrakech 07/01 Marrakech 07/01 Marrakech 07/04 San Jose 07/04 San Jose 07/04 San Jose 07/10 Shenzhen FDIS FDIS FDAM ITTF ITTF ITTF TBP TBP N/A FDAM ITTF N/A FDAM ITTF N/A FDAM ITTF N/A FDAM FDAM FDAM FDAM ITTF ITTF ITTF ITTF N/A N/A N/A N/A 18 Resolutions of Systems Cf. WG11 resolution. 96 Annex G – Video report Sources: Jens-Rainer Ohm, Gary Sullivan, Paul Brasnett, Euee S. Jang 1 Development of AVC The video subgroup jointly approved the output documents relating to ISO/IEC approval process milestones that were produced during the 30th JVT meeting which was held in Geneva (2009-0130/02-03. Important work items in this context were – Work on MVC software and conformance (both reaching FPDAM at this meeting) – Work towards a new corrigendum (defect report issued on miscellaneous issues related to MVC and SVC – Work towards a new amendment containing Constrained Baseline Profile and a new SEI message defining various interleaving methods for left/right stereo views in a conventional (monoscopic) video (reaching FPDAM at this meeting). Further discussion was performed jointly with the Requirements subgroup on possible definition of a new profile that would allow usage of MVC for stereo video captured by interlaced cameras or encoded in interlaced mode, as requested by the National Bodies of Japan, Singapore and the U.S. This topic had also been discussed in the 30th JVT meeting, and the JVT had issued a "profile under consideration" description document on the subject. The current multiview high profile does not support so-called interlaced coding tools. It was asserted by some participants that there is need for such support because interlaced video camera capture will still continue to be used, including for stereoscopic applications. Technically, it is certainly feasible to include interlaced coding tools such as MBAFF, because MVC does not change the AVC operation below the slice header level. Orally, the companies Panasonic, Motorola and Mitsubishi expressed support for such a possible profile. To guarantee a bug-free definition, it will be necessary to implement software. This is expected to be submitted by proponents by the April 2009 meeting, such that a PDAM could be started by that time. A WD was issued on this to allow further study. Documents reviewed: m15976 m15981 m15986 m15998 m16002 M16022 M16032 M16071 Table of Replies on ISO/IEC 14496-5:2001/FDAM 18 [SC 29 N 9928] Summary of Voting on ISO/IEC 14496-5:2001/PDAM 15 Table of Replies on ISO/IEC 14496-4:2004/FDAM 30 Summary of Voting on ISO/IEC 14496-4:2004/PDAM 38 Summary of Voting on ISO/IEC 14496-10:200X/PDAM 1 JNB comment on the resolution 3.5.4 SGNB Comments on Multiview Video Coding Profile Response to resolution 3.5.4 of 86-th WG 11 meeting ITTF via SC 29 Secretariat SC 29 Secretariat ITTF via SC 29 Secretariat SC 29 Secretariat SC 29 Secretariat Japan National Body Singapore National Body Andy Tescher for USNB Documents approved: No. Title 10337 Disposition of Comments on ISO/IEC 14496-4:2004/PDAM 38 10338 Text of ISO/IEC 14496-4:2004/FPDAM 38 Multiview Video Coding Conformance Testing 10339 Disposition of Comments on ISO/IEC 14496-5:2001/PDAM 15 10340 Text of ISO/IEC 14496-5:2001/FPDAM 15 Reference Software for Multiview Video Coding 10341 Disposition of Comments on ISO/IEC 14496-10:200X/PDAM 1 97 TBP Available N 09/02/06 N 09/02/20 N N 09/02/06 09/02/20 N 09/02/06 10342 10343 10344 Text of ISO/IEC 14496-10:200X/FPDAM 1 Constrained Baseline Profile and supplemental enhancement information Defect Report on ISO/IEC 14496-10:200X Working Draft 1 of ISO/IEC 14496-10:200X/Amd.2 Multiview Field High Profile N 09/02/20 N Y 09/02/06 09/02/06 2 MPEG-7 Visual and Photo Player MAF 2.1 MPEG-7 Visual related work The MPEG-7 breakout group was active during the whole week. Input documents related to the Visual part in 15938-3 and Photo Player MAF 23003-3 are listed in the table below. m16112 m16113 Proposal on removing comparison pair in comparison list for independence test Ground true table & incorrect video query clips of AVC, CC m16171 Response to the Call for Proposals on Video Signature Tools m16172 Response to the Call for Proposals on Video Signature Tools m16178 Received responses to CfP on Video Signature Tools Sangil Na DongSeok Jeong WonGeun Oh JuKyong Jin Kota Iwamoto Ryoma Oami Paul Brasnett Stavros Paschalakis Miroslaw Bober Jens-Rainer Ohm For the Photo Player MAF, the conformance testing amendment was prepared (PDAM2). The main activity during the week was the evaluation of the responses received after the Call for Proposals on Video Signature Tools, and preparation of the related Core experiments. A summary of the responses received is given in the subsequent subsections. 2.1.1 Proposal 1: Video signature based on feature difference between various pairs of sub-regions (NEC) Frame signature representing relations between values of various pairs of sub-regions Mean Success Ratio = 93.24% (92.32%)(<5ppm) More than 93% SR for all modifications at “light” level for direct and partial matching of D=10sec Descriptor size = 14.64 kbps Matching speed = 1,774 clips/sec (Partial Matching) dimension 1 dimension 2 dimension 3 dimension 4 dimension 5 dimension 6 dimension 7 dimension 8 dimension 9 dimension 10 1. Resizing (to 320x240) 2. Mean value calculation of sub-regions 98 3. Comparison & Quantization 4. Encoding frame #1 header frame signature (60 bytes) frame #2 confidence (1 byte) frame signature (60 bytes) confidence (1 byte) … sequence1 sequence 2 Strengths Highest reported performance. Only complete set of results. Consistent performance across all modifications. Ternary values – concept of zero – additional robustness. Weaknesses Descriptor training – part of test data used, overfitting? Some test conditions used, overfitted? Assumes framerate. Potentially fragile regions. 2.1.2 Proposal 2: Video Signature based on Video Tomography (Florida Atlantic University) Extraction: On Intel Core 2, 2.4 GHz, 65 milli sec for 180 frames Descriptor Size: 6.144 Kbps The proposed approach is based on video tomography - NTT Research Labs, 1994 A tomographic image is created from a set of video frames, e.g, a shot in a movie Captures spatio-temporal changed in a shot. FRAME I + 2 FRAME Frame i I+1 HT TH[I][J] = FI[HT][J] 0≤J<W 0≤I<S 3 FRAME I H/2 S VIDEO TOMOGRAPH H - HT WT W - WT 5 W TV[I][J] = FI[J][WT] 0≤J<H 0≤I<S S H Edge Map of Tomography 99 4 1 2 6 • • 360 pixels are samples along each line Only luminance is used Video Clip Pattern 1 Tomograph Pattern 2 Tomograph Pattern 3 Tomograph Pattern 4 Tomograph Pattern 5 Tomograph Pattern 6 Tomograph Edge Detection Edge Detection Edge Detection Edge Detection Edge Detection Edge Detection Composite Edge Image Composite Edge Image Composite Edge Image 16-byte Sub-Signature 16-byte Sub-Signature 16-byte Sub-Signature 48-byte Signature • Matching in two stages: – Matching closest shot (segment) using shot level signatures – Matching closest frame using frame signatures Strengths Weaknesses Exploits temporal information. Incomplete performance results. Efficient descriptor extraction. Limited region of frame considered – may be simple to Initial results suggest good attack? robustness – CIVR 2007 dataset. Assumes frame rate. 2.1.3 Proposal 3: Video Signature based on Multi-resolution decomposition (Mitsubishi Electric) Descriptor Extraction: ~1 ms per frame Descriptor Size: 8.96 kbps Matching: ~2,500 per second Detection: ~ 75.21% for 5 s at 5 ppm - Direct matching A frame is divided into 2x2 neighbourhoods In each 2x2 neighbourhood, the relation between the descriptor elements {a’, b’, c’, d’} and the pixel values {a, b, c, d} of the neighbourhood is given by (1)-(4) a’ = (a+b+c+d)/4 (1) b’ = (a-b)/2 (2) c’= (b-d)/2 (3) d’ = (d-c)/2 (4) Multi-resolution scheme – applied at 4 scales 2x2, 4x4, 8x8 and 16x16 Element binarisation by keeping MSB A word is formed by projecting descriptor to lower dimension space Multiple projections → multiple words, each from a different vocabulary 100 ~ di ~ di bits at 2x2 resolution ~ di bits at 4x4 resolution ~ di bits at 8x8 resolution bits at 16x16 resolution 0 1 4 8 5 11 20 36 21 39 22 42 23 45 84 3 2 10 9 13 12 38 37 41 40 44 43 47 46 150 149 153 152 156 155 159 158 162 161 165 164 168 167 171 170 148 6 14 7 17 24 48 25 51 26 54 27 57 92 16 15 19 18 50 49 53 52 56 55 59 58 174 173 177 176 180 179 183 182 186 185 189 188 192 191 195 194 28 60 29 63 30 66 31 69 100 196 101 199 102 202 103 205 104 208 105 211 106 214 107 217 62 61 65 64 68 67 71 70 198 197 201 200 204 203 207 206 210 209 213 212 216 215 219 218 32 72 33 75 34 78 35 81 108 220 109 223 110 226 111 229 112 232 113 235 114 238 115 241 74 73 77 76 80 79 83 82 222 221 225 224 228 227 231 230 234 233 237 236 240 239 243 242 172 85 93 151 175 86 94 154 178 87 95 157 181 88 96 160 184 89 97 163 187 90 98 166 190 91 99 169 193 116 244 117 247 118 250 119 253 120 256 121 259 122 262 123 265 Word 1 Word 2 246 245 249 248 252 251 255 254 258 257 261 260 264 263 267 266 Word 3 Word 4 124 268 125 271 126 274 127 277 128 280 129 283 130 286 131 289 Word 5 270 269 273 272 276 275 279 278 282 281 285 284 288 287 291 290 132 292 133 295 134 298 135 301 136 304 137 307 138 310 139 313 In each 2x2 neighbourhood, the relation between the ~ d ielements and the pixel values 294 293 297 296 300 299 303 302 306 305 309 308 312 311 315 314 a, b, c, d of the neighbourhood is given by eq. (1)-(4), i.e 140 316 141 319 142 322 143 325 144 328 145 331 146 334 147 337 318 317 321 320 324 323 327 326 330 329 333 332 336 335 339 338 a b (a+b+ (a-b)/2 c+d)/4 c d (d-c)/2 (b-d)/2 1 stage matching – Word occurrence 2nd stage matching – Temporal ordering of words 3rd stage matching – full descriptor matching Frame Rate Ratio & Frame Displacement estimation Strengths Weaknesses Efficient descriptor extraction. No partial matching results. Multi-scale representation Current bit selection of the words provides robustness could impact performance. Does not assume frame rate. Weakness on frame rate reduction and camera capturing. Efficient search with words. st 2.1.4 N.B. No assumption of frame rate makes matching problem harder for all conditions. 2.1.5 Proposal 4: Hierarchical video signature description using existing MPEG-7 features and motion activity (University of Brescia) Descriptor Size: Dependent on descriptors used e.g. 4.96 kbps for 4 proposed descriptors Extraction & Matching Complexity: Dependent on descriptors used Description Scheme Hierarchical Temporal Structure Flexibility Possibility to choose and use many descriptors e.g. Standard - Dominant Color, Color Layout Non standard - Motion Activity Map (MAM), Direction of Motion Activity (DMA) header 101 data VS: Matching o The proposed method is exhaustive o To speed up the comparison Pre-processing: clustering of D associated to each segment Strengths Flexibility in implementation and use of descriptors. Hierarchical Temporal Structure – supports granular searching. Schema is compatible with existing MPEG-7 standard. 2.1.6 Weaknesses Incomplete performance results. Possibility of no common descriptors between sequences → not possible to judge match. Variable extraction cost. Proposal 5: Video signature based on saliency map (Peking University) Descriptor Size: 16bits/frame, 10FPS → 0.160 kbps Independence rates: 8.65% for direct 2 seconds condition, 2.05% for direct 5 seconds, 0.5% for direct 10 seconds condition, 0.02% for partial condition (30 seconds) Robustness: from 80-99% @ ~10,000 ppm Saliency map – combines colour, intensity & orientation information. Saliency map simulates the attention model of the human visual system. Extraction Process Input video frames Pre-processing Partition the saliency map to M×N blocks Strengths Very compact descriptor. Very fast matching. HVS model may provide robustness. Extract visual features maps Sum saliency values within each block, and adaptively threshold Construct a saliency map Video fingerprints Weaknesses Sensitive to text/logo overlay & widescreen bars etc. High complexity extraction. High false alarm rates. 102 2.2 Conclusions and next actions With the current information, comparison between the methods that were submitted as responses to the CfP is difficult. Only one proposal came with complete results, while all others came with incomplete results due to late availability of the extremely large data set. For the one proposal with complete results, it was not fully clear by how much the fact that queries were partially used to train the descriptor would affect the performance. Further, there were some small problems with the ground truth definition, and it was only detected during the experimentation that some of the data contain black sequences which could bias the results both in correct and false matching. Therefore, it was decided to not decide for one particular algorithm by adoption into the XM, but rather continue with core experiments, using corrected testing conditions, such that it will be possible to achieve a more clear evidence about the benefits of the various technologies by the next meeting (see N10345). Some issues were further discussed, but for the current time were not taken up into the CE definition (may be in future CEs): New robustness conditions needed (instead of consecutive independence/robustness tests which give only one data point on the RoC for precision/ recall)? Matching scenario used (only a→a’ or also a→a’…z’). Localisation accuracy? (1s – can this be relaxed in some scenarios?) Scalability & complexity of querying large datasets – specific CE needed? 2.3 Output documents related to MPEG-7 Visual No. 10345 10346 10347 Title 15938-3 Visual Description of Core Experiments in Video Signature Description development 23000-3 Photo Player Application Format Request for ISO/IEC 23000-3/Amd.2 Text of ISO/IEC 23000-3/PDAM2 Conformance Testing for Photo Player MAF TBP Available N 09/02/06 N N 09/02/06 09/02/06 3 23002 MPEG-C Video Technologies 3.1 23001-4 and 23002-4 Reconfigurable Video Coding (RVC) 3.1.1 General status of work The two parts related to RVC (ISO/IEC 23001-4 Codec Configuration Representation in MPEG-B and ISO/IEC FCD 23002-4 Video Tool Library in MPEG-C) were progressed into FDIS status, and most of the work during the week was dedicated to this. • In ISO/IEC FDIS 23001-4 Codec Configuration Representation (N10349), mainly the parts describing languages and their usage (FNL, BSDL) have been enhanced. • In ISO/IEC FDIS 23002-4 Video Tool Library (N10351), lots of detailed changes were made, in particular more precise description of FUs, aligning text with software etc. was provided. 103 No fundamental technical issues have been changed relative to the “Study of FPDAM” texts. However, a serious check of the current software status has indicated that MPEG-2 main profile is not fully supported yet. AVC baseline profile is supported, but currently the parser is not fully automatically generated from BSDL (instead, it is a hand-coded functional unit). There is however no doubt that fully automatic generation of the parser is possible in principle, and this implementation will not affect the description in the FDIS text. Whereas the text of the 23002-4 standard is relevant to describe the current content of the video tool library, it does not give a normative reference to FU behaviour. To make the specification “complete” with this regard, amendment 1 (software and conformance related to the suite of tools in the current standard text) was issued. As decided by the October 2008 meeting, the normative reference of input/output behaviour of FUs will be described by the input/output of the related software module(s). It is clear, however, that different implementations could be made in principle with identical I/O behaviour; the software modules as such are therefore not normative. It was further discussed and clarified that RVC is still in "phase 1", which is re-implementation of existing decoder conformance points • MPEG-4 SP and MPEG-4 AVC CBP currently • MPEG-2 MP, MPEG-4 ASP, AVC HP and SVC to follow soon in Amd.2. "Phase 2" could open up more options that were previously discussed in the RVC context, in particular • Possible simplification of standards development by adding new FUs. Whereas this approach sounds attractive, it appears clear that a real simplification (by adding / exchanging single FU modules) would only be given for cases where a preceding standard is fully implemented in RVC schema – it needs to be clarified whether RVC-based encoders are necessary in this context as well. • downloadable (on the fly) decoder solutions More of this will be discussed and provided in a future update of the vision document, for which not sufficient time was available during the Lausanne meeting. 3.2 Assignment of editors Documents FDIS of 23001-4 (MPEG-B CCR) FDIS of 23002-4 (MPEG-C VTL) Workplan Conformance & RSM WD Extension to VTL RVC Vision RVC Core Experiments Editors Gwo Giun Hwa Seon Marco Gwo Giun, Christophe Mikael Euee Ihab 3.3 Allocation of input contributions MPEG-B FDIS Preparation Doc. Category Title No. Authors MPEG-B m16076 Video Hyungyu Kim, Sinwook Lee, Hwa Seon Shin, Sowon Kim, Minsoo Park, Euee S. Jang Comments on ISO/IEC 23001-4 FCD 2 104 Recommendation m16145 Matthieu Wipliez Proposed changes for RVC-CAL Mickael Raulet annex A of ISO-IEC 23001-4 Jean-François Nezan All General/All MPEG-C FDIS preparation Doc. Category Title Authors No. MPEG-C An MPEG Fixed Point IDCT Ihab Amer, Marco Mattavelli Video Module for the RVC VTL m16164 Fixed point 8x8 IDCT cal module is proposed with an Recommendation enhanced throughput capability [Recommendation] To put into RSM MPEG-C Editor's Input on Study Text of Yi-Shin Tung, Hwa Seon Shin ISO/IEC FCD 23002-4 m16030 Video Recommendation Hwa Seon Shin, Sowon Kim, MPEG-C Revised FU Network and m16031 Minsoo Park, Byeongho Choi, Video Tokens for MPEG-4 SP Chungku Yie, Euee S. Jang Recommendation Discussed during the FDIS preparation. RVC Vision Doc. Category No. MPEG-B m16077 Video Title Authors Update proposal on the Vision of RVC Hyungyu Kim, Sinwook Lee, Euee S. Jang 2nd Phase Work m16033 MPEG-C Video Summary & Recommendation m16136 MPEG-C Video Summary & Recommendation m16137 MPEG-C Video Summary & Recommendation m16138 MPEG-C Video Summary & Recommendation Functional unit of AVC Gwo Giun Lee, Jia-wei Liang, deblocking filter with He-Yuan Lin, Ming-Jiun Wang MBAFF Deblocking filter FU for AVC is proposed. Accepted the FU in VTL extension and RSM. An AVC Entropy Coding Hussein Aman-Allah, Ihab Amer, Module for the MPEG RVC Marco Mattavelli VTL Entropy coding FU for encoding tools is proposed. To start a CE 3 on developing encoding tools. An AVC Motion estimation Ehab Asaad Hanna, Ihab Amer, Module for the MPEG RVC Marco Mattavelli VTL ME FU for encoding tools is proposed. To start a CE 3 on developing encoding tools. An AVC Intra Prediction Karim Maarouf, Ihab Amer, Module for the MPEG RVC Marco Mattavelli VTL Intra prediction FU for encoding tools is proposed. To start a CE 3 on developing encoding tools. 105 Other Documents reviewed: m15947 Ad Hoc Group on Reconfigurable Video Coding m15781 m15782 Summary of Voting on ISO/IEC FCD 23001-4 Summary of Voting on ISO/IEC FCD 23002-4 Euee S. Jang, Marco Mattavelli, Kazuo Sugimoto SC 29 Secretariat SC 29 Secretariat Output Documents: No. 10348 10349 10350 10351 10352 10353 10354 10355 10356 Title 23001-4 Codec Configuration Representation Disposition of Comments on ISO/IEC FCD 23001-4 Text of ISO/IEC FDIS 23001-4 Codec Configuration Representation 23002-4 Video Tool Library Disposition of Comments on ISO/IEC FCD 23002-4 Text of ISO/IEC FDIS 23002-4 Video Tool Library Request for ISO/IEC 23002-4/Amd.1 Text of ISO/IEC 23002-4/PDAM1 Video Tool Library Conformance and Reference Software WD 4 of ISO/IEC 23002-4/Amd.2 (Tools for MPEG-2 MP, MPEG-4 ASP, AVC HP and SVC) RVC Work Plan and FU Development Status Description of Core Experiments in RVC TBP Available N N 09/02/06 09/03/31 N N N N 09/02/06 09/03/31 09/02/06 09/02/25 N 09/02/13 N N 09/02/06 09/02/06 4 Explorations – 3D Video The goal of 3D video, as a first step towards a broader range of free-viewpoint (FTV) applications, is to generate interpolated views from available videos of multiview camera configurations. The target application is mostly seen for upcoming generations of (auto-) stereoscopic displays, for which only a low number (1) of video sequences will be transmitted, but rendering of additional views will be enabled by associated depth information. The main achievements in this activity have been review of technical developments from the exploration experiments, further clarification about the vision, applications and requirements, and planning of next steps. The vision is definition of a new 3D Video (3DV) format that goes beyond the capabilities of existing standards to enable both advanced stereoscopic display processing and improved support for auto-stereoscopic N-view displays, while enabling interoperable 3D services. This is illustrated in the following Figure. It is assumed that only limited camera inputs and constrained rate transmission would be available according to a distribution environment. The 3DV data format aims to be capable of rendering a large number of output views for auto-stereoscopic N-view displays and support advanced stereoscopic processing. 106 Stereoscopic displays • Variable stereo baseline • Adjust depth perception Left Right Limited Camera Inputs Data Format Data Format Constrained Rate (based on distribution) Auto-stereoscopic N-view displays • Wide viewing angle • Large number of output views Compared to the existing coding formats, the 3DV format would have several advantages in terms of bit rate and 3D rendering capabilities, which is illustrated in the Figure below: 2D+Depth, as specified by MPEG-C Part 3, supports the inclusion of depth for generation of an increased number of views. While it has the advantage of being backward compatible with legacy devices and is agnostic of coding formats, it is only capable of rendering a limited depth range since it does not directly handle occlusions. The 3DV format expects to enhance the 3D rendering capabilities beyond this format. Multiview Video Coding (MVC) supports the direct coding of multiple views and exploits inter-camera redundancy to reduce the bit rate. Although MVC is more efficient than simulcast, the rate of MVC encoded video is proportional to the number of views. The 3DV format is expected to significantly reduce the bit rate needed to generate the required views at the receiver. Bit Rate Simulcast 3DV should be compatible with: • existing standards • mono and stereo devices • existing or planned infrastructure MVC 3DV 2D+Depth 2D 3D Rendering Capability To make a 3D video system operational, it is necessary to provide view synthesis (interpolation) with sufficient quality. To achieve this, and produce anchor encodings for depth maps and video data based on currently available compression technology (AVC / MVC / MPEG-C part 3) is the main objective of current exploration experiments. Judging of results was performed by experts 107 viewing using stereoscopic and autostereoscopic displays. The main conclusions drawn are the following: – Good progress has been made in view synthesis, appropriate methods for hole filling and depthdiscontinuity boundary processing have been implemented. – Depth estimation was also improved (e.g. by implementing temporal consistency and spatial smoothing), and now gives better results for sequences that were of unacceptable quality before (e.g. Champagne Tower); however, apparently the generated depth maps are still considerably wrong, in particular in cases of small structures, larger depth ranges and more complicated occlusion areas (e.g. the Newspaper and Leaving Laptop sequences). – It was discussed that the unavailability of reliable depth maps currently is the main factor that delays the development. To make progress, it was decided to consider using hand-tuned or semi-automatically generated depth maps, in particular for the more challenging sequences. A Call for depth maps and supplementary information (e.g. segmentation masks) was therefore issued (N10359). Exploration Experiments in 3D Video Coding are continued as follows (N10360): • EE1: Improvement of depth estimation – Including one semi-automatic method • EE2: Improvement of view interpolation – Including combination of "best known" methods • EE3: Investigation of alternative method for representation: Layered depth video (LDV) Usage of hand-tuned depth maps appears also realistic for application scenarios of 3D Video, in particular could such data be provided during the production phase (not for real-time applications). Once the data are available and proven to provide sufficient quality in combination with the available view synthesis, the next step towards a CfP will be made by defining rate points and coding conditions for the anchors. The previous EE4, which was related to first findings about coding of video with associated depth maps, was judged to have been not suitable for useful conclusions, because the currently available quality of depth maps could have significant effects on the necessary data rate, and the quality seemed to be more affected by the defects in the original depth maps themselves than by deviations introduced by coding. Documents reviewed in AHG (see AHG report) m16021 m16026 Depth Map Compression for View Synthesis in FTV Results of 3DV/FTV Exploration Experiments, described in w10173, for Alt Moabit sequence. m16027 m16034 Analysis of sub-pixel precision in Depth Estimation Reference Software and View Synthesis Reference Software Application of Middle Level Hypothesis algorithm for improvement of depth maps produced by Depth Estimation Reference Software. Basic LDV view-synthesis/renderer SW : LDVS m16040 LDV Virtual View Rendering Software m16041 Temporal Improvement Method in View Synthesis m16042 3DV EE3 Report on Champagne_tower Sequences m16043 3DV EE4 Report on Dog Sequences m16046 3DV EE3 results on Dog sequence m16028 108 Gangyi Jiang Olgierd Stankiewicz Krzysztof Wegner Krzysztof Klimaszewski Krzysztof Wegner Olgierd Stankiewicz Olgierd Stankiewicz Krzysztof Wegner Fons Bruls Lincoln Lobo Yin Zhao Lu Yu Yin Zhao Deliang Fu Lu Yu Fons Bruls Lincoln Lobo Yin Zhao Deliang Fu Lu Yu Lianhuan Xiong Yin Zhao Deliang Fu Lu Yu Yin Zhao Lu Yu Carmen CHENG Yan HUO m16047 3DV EE4 results on Dog sequence m16048 Depth Estimation Improvement for Depth Discontinuity Areas and Temporal Consistency Preserving m16049 3DV/FTV EE3/EE4 Results on Alt Moabit sequence m16050 Depth Map Coding Quality Analysis for View Synthesis m16053 3DTV Exploration Experiments on Pantomime sequence m16059 Results of Exploration Experiments in 3D Video for Lovebird2 m16060 EE1: Depth Estimation Results on 'Pantomime? Sequence m16061 EE2: View Synthesis Results on 'Pantomime? Sequence m16062 EE4: Coding Results on 'Pantomime? Sequence m16063 Experimental Results on Improved Temporal Consistency Enhancement m16064 Implementation of Boundary Noise Removal for View Synthesis m16066 Report of 3DV/FTV Exploration E xperiments with Champagne Tower m16067 3DV/FTV EE results of Depth Estimantion and View Synthesis on "lovebird1" sequence m16068 3DV/FTV EE4 result of Coding Experiment on "Dog" sequence m16070 The consideration of the imrpoved depth estimation algorithm m16087 3DV/FTV EE3 : LeavingLaptop and Lovebird1 m16088 3DV/FTV EE4 : Dog sequence m16090 View Synthesis Algorithm in View Synthesis Reference Software 2.0 (VSRS2.0) m16091 View Synthesis Method without Blending m16092 Depth Estimation Reference Software (DERS) with Image Segmentation and Block Matching m16094 Results of 3D Video Coding Experiments EE1 and EE2 for Dog Data Set 3DV/FTV EE Report on Doorflower sequence m16101 109 Yu LIU Carmen CHENG Yan HUO Yu LIU Hui Yuan Yilin Chang Haitao Yang Xiaoxian Liu Sixin Lin Lianhuan Xiong Xiaoxian Liu Yingying Guo Haitao Yang Junyan Huo Yilin Chang Sixin Lin Lianhuan Xiong Siping Tao Ying Chen Miska M. Hannuksela Houqiang Li Ivana Radulovic Per Fröjdh Sehoon Yea Zafer Arican Anthony Vetro Cheon Lee Yo-Sung Ho Cheon Lee Yo-Sung Ho Cheon Lee Yo-Sung Ho Sang-Beom Lee Cheon Lee Yo-Sung Ho Cheon Lee Yo-Sung Ho Takanori Senoh Kenji Yamamoto Ryutaro Oi Tomoyuki Mishina Makoto Okui Gun Bang Gi Mun Um Namho Hur Jinwoong Kim Gun Bang Gwang sin Cho Namho Hur Jinwoong Kim Donggyu Sim Gun Bang Jaeho Lee Namho Hur Jinwoong Kim Patrick Lopez Dong Tian Patrick Lopez Dong Tian Masayuki Tanimoto Toshiaki Fujii Kazuyoshi Suzuki Masayuki Tanimoto Toshiaki Fujii Kazuyoshi Suzuki Masayuki Tanimoto Toshiaki Fujii Kazuyoshi Suzuki Mejdi Trimeche Miska M Hannuksela Shinya Shimizu m16129 Results of Exploration Experiments in 3D Video Coding for Dog Data Set m16135 Philips 3DV EE2,EE4 results m16139 Philips 3DV EE1,2,3,4 results m16175 3DV EE1 & EE2 on Leaving_Laptop and Improvements in ViSBD 2.1 m16189 3DV EE1 and EE2 Results on Newspaper Sequence m16190 3DV EE4 Results on Pantomime Sequence Hideaki Kimata Y. Wang K. Müller P. Merkle A. Smolic Fons Bruls Lincoln Lobo Fons Bruls Lincoln Lobo Dong Tian Po-Lin Lai Patrick Lopez Jaewon Sung Yong-Joon Jeon Byeong-Moon Jeon Jaewon Sung Yong-Joon Jeon Byeong-Moon Jeon Documents reviewed in Video m16065 m16093 m16130 Additional Test Sequence for 3D Video Propose test sequence „study“ instead of „newspaper“ (similar scene, but without large depth contrast at the newspaper). No synthesis results provided, but proponents say that results are better. Question to be discussed: Is it appropriate to replace „difficult“ sequences? 3DV must work for any. Thank for the contribution. Data Format for FTV "FDU" (FTV data unit) consists of depth data and synthesis error for sythesized views relative to the center view. This is rather an alternative representation method than a data format. It is claimed that the residual for the views in between center and left/right view is correlated with the residual of the left/right views, and therefore can be beneficial for synthesis. Could also be beneficial for coding when a wider range of views is required. All this is still to be proven. No action to be taken at this point. Considerations about the vision on 3D Video Enable 3D video on 3D displays such that observer gets a good depth impression. Both for stereoscopic and multi-view (autostereoscopic) displays. Cheon Lee Jae-Il Jung Yun-Suk Kang Yo-Sung Ho Masayuki Tanimoto Toshiaki Fujii Kazuyoshi Suzuki Aljoscha Smolic Karsten Mueller, Peter Kauff, Thomas Wiegand Compatibility with mono and stereo (i.e. that the uncompressed format includes a mono or stereo view) as „may“ Complete 3DV solutions should be compared and be judged by final outcome: How good does it look on stereo and N-view displays? Same input should be used for all proposals. m16165 Reference solution should be developed by MPEG On addressing market 3D developments, Stereo & MPEG 3DV activity. Lack of one clear recognized 3D standard – this could cause confusion in the market. Applications could e.g. include autostereoscopic, variable-stereo Fons Bruls Lincoln Lobo Wiebe de Haan Typical requirements of view synthesis range is up to 4 baseline distances, where "1 BD" would be the configuration of "best parallax distance" on a stereo display. Currently, the experiments use up to 3BD. Stereo baseline adjustment (for current stereo displays) could be a near-term issue, before the market introduction of autostereoscopic displays. Question is raised on whether the encoded format of 3DV should be display unaware and backward compatible. This would be a big advantage compared to current stereo, where everything (starting from camera settings) is often fully tuned to one target display type. Output documents: No. Title TBP Available 110 10357 Vision on 3D Video Coding 10358 Applications and Requirements of 3D Video Coding 10359 Call for 3D Test Material: Depth Maps & Supplementary Information 10360 Description of Exploration Experiments in 3D Video Coding Y N Y 09/02/06 09/02/06 09/02/06 N 09/02/06 5 Explorations – High-Performance Video Coding Following the workshop in Busan, the main focus of the HVC activity currently is to prepare a more solid assessment of the availability of improved-compression technology, that would in particular fulfil the needs of high-resolution, high-quality video applications. In Busan, a Call for test materials had been issued, as it was recognized that the currently available materials would not fulfil this purpose, in particular for HD and Ultra HD scenarios. The following responses were received (see more detailed assessment in the AHG report M15950): m16018 Response to call for test materials for HVC study m16035 Response to Call for Test Materials for High-Performance Video Coding Standards Development Samsung response to Call for Test Materials for MPEG HVC standardization m16052 m16212 M16219 BBC 1080p50 test materials for HVC study Status of potential test materials for HVC with 4K or higher resolutions Shun-ichi Sekiguchi Yoshihisa Yamada Yoshiaki Kato Kohtaro Asai Tokumichi Murakami TK Tan Yoshinori Suzuki Woo-Jin Han JeongHoon Park IlKoo Kim Tammy Lee Thomas Davies Kohtaro Asai, Ryuta Suzuki, Shun-ichi Sekiguchi After extensive viewing sessions, a set of 4 sequences for the HD/UHD range of sizes (including newly proposed and 2 from the available SVT set), and another 4 sequences for the WVGA range (as expected to become important in mobile applications) were selected for the upcoming Call for Evidence. However, it was assessed that expecting responses to the Call by April would result in a very short timeline and probably a lower number of responses. More specifically, after distributing and converting the new test sequences, AVC anchor encoding that is necessary to define the rate points would hardly be available before end of April. It was therefore decided to issue again a Draft Call (N10363), substantially refined as compared to the initial version of the previous (Busan) meeting. A final Call for Evidence is planned to be issued in April, with responses expected for July. Once evidence would be available, a Call for proposals could be issued immediately, with possible responses for January 2010. It must also be observed that certainly still better test material would be desirable for a CfP and development of an HVC standard. Therefore, an updated Call for Test materials was also issued (N10362). Further, the expert viewing to be performed in the context of the CfE is challenging in terms of necessary testing equipment (in particular for UHD resolutions), and must be carefully further explored and prepared. Materials to be provided within the responses to the Call for Evidence will be decoded results, bitstreams and a binary decoder. It will not be necessary to submit for all classes (ranging from WQVGA up to UHD), but a submission must be complete for each class that is chosen. 111 The following input contributions, reporting about methods for improved compression, were reviewed: m16019 On coding efficiency with extended block size for UHDTV Setting only IPPP: Usage of extended macroblock size 32x32: 7.8% BR reduction for 1 4Kx2K, approx.4.5% for 1080p, 7.9% for 720p.Performance improvement mainlyby reduction of MV bits. No experiments on B pictures yet. m16069 Fast Decoder Side Motion Vector Derivation with Candidate Scaling for Improving AVC Compression Performance Additional MB types where the decoder can derive MVs. Encoder decides which mode is used. In contrast to previous contribution from Archamps, reduction of complexity is main target. Use left and top-right neighbors as candidates. Options with or without sub-pel refinement. In case of multi-hypothesis prediction, scaling of candidates is used. Results are 9%/8% for HD, and 5.8%/5% for subpel/no subpel refinement, respectively. When modified rounding (not standard AVC) is used, gain is lowered. Results only for IPPP, but contributors expect gain for B pictures as well. Preliminary response for Draft Call for Evidence on High Performance Video Coding Extended macroblock size to 32x32 with same sub-partitions as in AVC. IPPP coding structure. Use 8Kx4K sequences, but only cropped area of size 1920x1080 out of these. Average around 15% over 6 sequences, each with „best“and „worst“cropped area. Second Order Prediction of Video Coding Inter prediction followed by intra prediction (i.e. intra prediction of residuals) as additional mode. To generate the residual at the boundary, the MV of the current block is used (i.e. not the actual prediction signal of the neighbored block). Bitrate reduction with CABAC around 4.5% similar for IPPP, IBBP and IBbBP. Reduces to around 3% with WP on. CAVLC slightly higher gains. Encoder complexity increase less than 10% on average, decoder less than 4% average (considering runtime). Selection of mode varies, but could be up to 40% for some cases. Motion Vector Coding with Optimal Predictor Extension of motion vector competition. Decoder could find the best MV predictor among candidates. If certain conditions are violated, the encoder signals that the median or another candidate shall be used. Candidate is determined by template matching. Gain up to 6.8% BR reduction for 720p, 3.6% for CIF. Interpretation: Number of skip modes is increased. No investigation about error propagation of the scheme yet. Temporal neighbor used as in MVC. m16082 m16109 m16209 Shun-ichi Sekiguchi Shuichi Yamagishi Yoshihisa Yamada Yoshiaki Kato Kohtaro Asai Tokumichi Murakami Steffen Kamp Mathias Wien Tomonobu Yoshino Sei Naito Shigeyuki Sakazawa Shangwen Li Lu Yu Lianhuan Xiong Jungyoup Yang Kwanghyun Won Byeungwoo Jeon Su Nyeon Kim The following input documents were discussed jointly with the Requirements subgroup, and were taken as an initial point for further updates of the vision, applications and requirements document (N10361): m16207 m16224 Requirements for high-performance video standards Mobile devices should be capable to decode content from video databases and home servers, which are going to 720p or 1080p. Main target of the contribution is to include tradeoff between power efficiency / complexity such as "half complexity as AVC gives 25% coding efficiency improvement, while 2-3x complexity as AVC gives 50% coding efficiency". Could be structured in different profiles. Also request shorter timeline for the lowcomplexity case. Conclusion: Include relationship between complexity and compression efficiency in A&R doc. Consider also relationship with delay. Proposal on Focus for MPEG HVC standard development Requirements can differ between different applications, e.g. mobile and professional. "Most practical way to start with high-efficiency solution and strip off tools to arrive at lower complexity". Complexity can hardly be quantified. Propose onion-shaped profile structure. Output documents: No. Title Kemal Ugur Justin Ridge Ken McCann Woo-Jin Han Jason Suh TBP Available 112 10361 10362 10363 Vision and Requirements for High-Performance Video Coding (HVC) Call for Test Materials for High-Performance Video Coding Standards Development Draft Call for Evidence on High-Performance Video Coding 113 Y 09/02/06 Y 09/02/06 N 09/02/06 Annex H – JVT report Source: Jens Ohm and Gary Sullivan, Chairs 114 Annex I – Audio report Source: Schuyler Quackenbush, Chair Audio Subgroup Report for the 87th MPEG Meeting Source: Schuyler Quackenbush, Chair, Audio Subgroup 1 2 Opening of the meeting ......................................................................................................... 117 Administrative matters .......................................................................................................... 117 2.1 Communications from the Chair 117 2.2 Approval of agenda and allocation of contributions 117 2.3 Creation of Task Groups 117 2.4 Approval of previous meeting report 117 2.5 Review of AHG reports 117 2.6 Joint meetings 117 2.7 Received National Body Comments and Liaison matters 117 2.8 Plenary Discussion 118 3 Record of AhG meetings ....................................................................................................... 118 3.1 AhG Meeting on USAC Sunday 1000-1700 118 4 Task group activities ............................................................................................................. 123 4.1 Joint meetings 123 4.1.1 MPEG Surround Signalling and MP4 FF issues (with Systems) ................................ 123 4.1.2 High-Performance Video Coding (HVC) and Audio (with Requirements) ................ 124 4.2 Task Group discussions 124 4.2.1 MPEG-2, MPEG-4, MPEG-7 Audio and MPEG Surround, conformance, reference software ................................................................................................................................ 124 4.2.2 MPEG-D Spatial Audio Object Coding ...................................................................... 125 4.2.3 MPEG-D Unified Speech and Audio .......................................................................... 130 4.2.4 Exploration: Meta-Data ............................................................................................... 135 5 Audio closing plenary discussions ........................................................................................ 136 6 Meeting deliverables ............................................................................................................. 136 6.1 Responses to Liaison and NB comments 136 6.2 Recommendations for final plenary 136 6.3 Establishment of Ad-hoc Groups 136 6.4 Approval of output documents 137 6.5 Press statement 137 7 Future activities ..................................................................................................................... 137 7.1 Schedule of future meetings 137 7.2 Agenda for next meeting 137 7.3 All other business 137 7.4 Closing of the meeting 137 Annex A Participants ............................................................................................................... 138 Annex B Audio Contributions and Schedule .......................................................................... 139 115 Annex C Annex D Annex E Task Groups ............................................................................................................. 144 Output Documents ................................................................................................... 145 Agenda for the 88th MPEG Audio Meeting ............................................................. 147 116 1 Opening of the meeting The MPEG Audio Subgroup meeting was held during the 87th meeting of WG11, February 2-6, 2009, Lausanne, CH. The list of participants is given in Annex A. 2 2.1 Administrative matters Communications from the Chair The Chair summarised the issues raised at the Sunday evening Chair’s meeting, proposed task groups for the week, and proposed agenda items for discussion in Audio plenary. 2.2 Approval of agenda and allocation of contributions The agenda and schedule for the meeting was discussed, edited and approved. It shows the documents contributed to this meeting and presented to the Audio Subgroup, either in the task groups or in Audio plenary. The Chair brought relevant documents from Requirements, Systems to the attention of the group. It was revised in the course of the week to reflect the progress of the meeting, and the final version is shown in Annex B. 2.3 Creation of Task Groups Task groups were convened for the duration of the MPEG meeting, as shown in Annex C. Results of task group activities are reported below. 2.4 Approval of previous meeting report The 86th 2.5 Audio Subgroup meeting report was registered as a contribution, and was approved. Review of AHG reports There were no requests to review any of the AHG reports. 2.6 Joint meetings The joint meetings with Audio for the week are shown below: Groups Audio, Systems Audio, Req 2.7 What File Format Audio for HVC Where Audio Audio Day Wed Thu Time 1400-1500 1500-1530 Received National Body Comments and Liaison matters The NB Comments and Liaison documents for the meeting that require a response are as shown below. No. m16044 From AU NB m16045 AU NB m15911 m15879 m15914 M15908 CN NB FI NB FR NB Liaison Statement from DRM m15916 Liaison Statement from "ETSI/EBU/CENELEC JTC on Broadcast" Audio Liaison Statement from WorldDMB Forum via SC 29 Secretariat m15919 m15978 Liaison Statement from ITU-R SG 6 to SC 29/WG 11 Title Comment on the unified speech and audio coding activity Comment on the exploration on metadata driven post-processing of audio Comment on USAC Comment on USAC Comment on USAC on 960 frame length in the MPEG-4 AAC family of profiles on 960 frame length in the MPEG-4 AAC family of profiles on Proposal to remove 960 transform from the AAC, HE AAC and HE AAC v2 profiles on Extension of ITU-R BS.1387 and Call for Proposal 117 Comment (from 86th mtg) (from 86th mtg) (from 86th mtg) (from 86th mtg) (from 86th mtg) (from 86th mtg) Respond at next meeting m16004 IEC TC100/TA4 m16218 IEC TC100/TA4 2.8 IEC CDV 61937-11: Digital audio -Interface for non-linear PCM encoded audio bitstreams applying IEC 60958 - Part 11: MPEG-4 AAC and its extensions in LATM/LOAS IEC CDV 60958-3/Amd.1 Plenary Discussion It was communicated to the Chair via informal channels that the use of the complexity metrics PCU and MCU in MPEG Audio specifications is not clear to technical experts from outside MPEG. The Chair attempted to clarify this via informal channels, but noted that Audio experts might revise documents on complexity to address this lack of clarity. 3 3.1 Record of AhG meetings AhG Meeting on USAC Sunday 1000-1700 USAC Core Experiments Taejin Lee, ETRI, presented m16156 Progress of Technology Merge Between System 2 and USAC RM Taejin Lee Max Neuendorf Jeremie Lecomte Kyeongok Kang Bernhard Grill This contribution discusses the merge of Sys2 SBR technology into RM0. Listening tests show promise, but did not demonstrate improvement at the 95% level of significance. The SBR technology merge will continue to be explored. In addition, aspects of TCX will be explored to see if a technology merge can result in a gain in performance. ETRI anticipates reporting additional information at the next meeting. Several experts noted that the rules for the “technology merge” were not clear, nor were the specific tools under consideration clear. It was the consensus of the AhG that the “technology merge” obey the elements of the Core Experiment process. Eunmi Oh, Samsung, presented m16177 Progress report on unvoiced speech coding Hosang Sung Eunmi Oh Eunmi Oh noted that the Sys4/RM0 “technology merge” work started in October, after the RM0 source code was available to MPEG. She also noted that in exploring the possibilities for technology merge, the following were unexpectedly identified as candidates for the technology merge: coding of phase in MPEG Surround coding of unvoiced speech segments along with variable bit rate encoding. The contribution presented details on a technology merge/CE on Voiced/Unvoiced/Silence plus Variable Bit Rate. It presents theoretical bitrate savings for the Voiced/Unvoiced/Silence detection and coding mode switching. Samsung anticipates a workplan at this meeting and to report additional information at the next meeting. JungHoe Kim, Samsung, presented m16173 Progress report on phase experiment for USAC JungHoe Kim Julien Robilliard Eunmi Oh Bernhard Grill This contribution presented results for a technology merge/CE on enhancing the phase encoding in the MPEG Surround tool for the purpose of coding stereo signals. Included are all elements required for a CE proposal, including an overview of the technology and listening test showing performance. Kristofer Kjörling, Dolby, noted that there are test items for which the MPEG-4 PS tool was not able to sufficiently accurately estimating phase differences such that the proposed technology would 118 work. Both Dolby and Philips experts offered to do cross-check using additional test material. This will be captured in a workplan and additional information is anticipated at the next meeting. Max Neuendorf, FhG, presented m16153 Proposed Corrections to WD and Reference Software on Unified Speech and Audio Coding Max Neuendorf Philippe Gournay Jérémie Lecomte Markus Multrus Stefan Bayer Guillaume Fuchs Ralf Geiger Frederik Nagel This contribution presented several categories of proposed fixed or changes to the WD text and Reference Software. The categories are Editorial changes, which include: o Incorrect fonts, formulas and references. o Clarifications to ambiguous text. o Obvious bugfixes in the Reference Software o Changes to the text so that it is aligned to the Reference Software Bugfixes in software o Incorrect handling of window transitions in decoder software and clarification of associated text o Incorrect time alignment of some parameters when core coder mode is switched o Incorrect Harmonic SBR transitions o Incorrect Harmonic SBR Stretching factor and clarification of associated text o Incorrect Time-Warped SBR resampling buffer length. The presenter estimated that this bug did not impact the CfP waveforms when they are quantized to 16-bit word lengths. o Extend MPEG Surround to lower sampling rates. This does not affect the CfP waveforms as MPEG Surround was never active at the lower sampling rates. o Other miscellaneous software bug fixes Proposal presented as Open Issues o Encoder and Decoder block diagram o Decoding of innovation sequence The proposals were discussed as they were presented. The presenter noted that all software changes except for those listed as “Open Issues” have already been incorporated into a revised version of the software which can be found in the zip archive of the contribution. Eunmi Oh, Samsung, requested a “sanity-check” listening test to verify that none of the reference software changes have an impact on audio quality. Max Neuendorf will get back to the group during the week with a proposal for such a listening test (i.e. at which operating modes will it be evaluated). It was the consensus of the AhG to adopt the Editorial and Bugfix changes (with the exception of the window sequence signalling, which might conflict with another proposal in a contribution still to be presented and in anticipation of a positive outcome of the “sanity-check” listening test). The Open Issue items will be discussed during the MPEG week. The Chair noted that contributions not presented in Sunday’s AhG meeting will be presented in the Audio Subgroup. Reference Encoder Software Mohamad Raad, RaadTech Consulting, presented m16044 Comment on the unified speech and audio coding activity Mohamad Raad The presenter read the AU NB comment. The NB comment asks for a common encoder/decoder pair for use in the CE process. The Chair stated that, in his view, the request for a single encoder code base for use in CEs might represent a viewpoint on which there is lack of consensus in the Audio Subgroup. 119 Herve Taddei, Huawei Technologies, presented m16079 Discussion on the Unified Speech and Audio Coding Activity Herve Taddei Minjie Xie Qing Zhang The contribution noted that A reference encoder of high quality that is fully in source code would be of great benefit to MPEG. A workplan should be maintained to organize the work of creating such a reference encoder. This reference encode could be used as CE proponent shows merit of tool using the MPEG Reference Encoder The cross-check shows the merit of the tool using the Reference Quality Encoder A successful CE obligated the proponent to integrate a (perhaps sub-optimal) version of the tool into the MPEG Reference Encoder, and to provide evidence (e.g. listening test) that the quality of the MPEG Reference Encoder is improved when the tool is incorporated. The contribution included a listening test result showing that Huawei was able to, internally, increase the quality of the current MPEG Reference Encoder at all bitrates. If, in Huawei’s view, the Audio Subgroup consensus position on this matter is in line with Huawei’s view, then Huawei would be willing to contribute the source code on which their listening tests are based. Open issues are How would the Audio Subgroup determine that the MPEG Reference Encoder is of quality “good enough” for use in meaningful CE work. Kristofer Kjörling, Dolby, noted that when the MPEG Reference Encoder quality is quite low and has many missing modules, then showing an improvement in the MPEG Reference Encode might be quite difficult. He noted that a CE proponent might additional be obligated to provide rudimentary versions of the missing modules. Markus Multrus, FhG, noted that many encoder modules were inherited from the MPEG-4 Audio Reference Software such that if modules were missing in the MPEG-4 Audio Reference software, they would be missing in the USAC Reference Software. Chair proposed break-out group to categorize MPEG Reference Encoder module into “signal flow” and “control” and to investigate and verify what modules are present in the MPEG Reference Encoder. Werner Oomen, Philips, presented m16110 Comment on the unified speech and audio coding activity Werner Oomen This contribution presents the following definitions: Informative Encoder – the source code available from ISO Reference Quality Encoder – the best quality encoder(s) within MPEG member companies Reference Encoder – the source code base proposed to be the “open source” project. (Note that there may be reasons for NOT making this part of the source code available from ISO, e.g. copyright and patent issues.) The contribution observes MPEG CE methodology has produced outstanding technology that has been widely adopted by industry. The CE methodology had evolved through numerous revisions There are typically three categories of CE o Overwhelming clear winner o Demonstrated clear merit o Has ambiguous demonstration of merit A competitive CE process if beneficial 120 Concerning the Reference Encoder, Philips very much endorses creation of a high-quality Reference Encoder, and if this Reference Encoder is also the Informative Encoder, then this speeds adoption of the standard. However, they feel quite strongly that the Reference Encoder should not be mandated for use in the CE process. Rather, a CE proponent should be free to use any encoder in their CE proposal. However, it is quite likely that those with a Reference Quality Encoder will try the CE tool in their Reference Quality Encoder code base to cross-check whether the tool provides a comparable improvement. The contribution provides figures that show example CE listening test results might drive CE results. Mohamad Raad, RaadTech Consulting, asked what would occur if a CE was based on a low-quality Reference Encoder, was accepted by the group, but the holder(s) of the Reference Quality Encoder did not wish to incorporate the tool. The Chair responded that, first, the refusal to adopt the tool by the Reference Quality Encoder proponent would in no way preclude the adoption of the CE tool. But, second, it is the obligation of the Reference Quality Encoder proponent to incorporate the tool and run a cross-check at the request of the Audio Subgroup. Hyun-Kook Lee, LG, presented m16118 Considerations on the development of common USAC reference encoder Dong Soo Kim Sungyong Yoon Hyun-Kook Lee Jaehyun Lim The contribution makes the following points: That there should be a common encoder for use in the CE process. This could be the Reference Quality Encoder or a Reference Encoder that achieves comparable quality. However, they stress the importance of a common encoder for all CEs. In order to create a Reference Encoder that becomes a Reference Quality Encoder, requirements for development and a method for performance assessment must be agreed to. The results from successful CE should be incorporated into the Reference Encode. The Chair noted that the current CE methodology does not mandate one and only one encoder for use in CEs. Further, the Chair noted that the Audio Subgroup must balance potentially competing objectives: that there be broad participation in the CE process, for example by the development of an “open-source” Reference Encoder, although it may be that it does not have quality equal to that of the best-known Reference Quality Encoder. That the standard uses the best known Reference Quality Encode in the CE process to insure that only the best technology is adopted into the specification. Kristofer Kjörling, Dolby, presented m16140 Core Experiment procedures and MPEG reference software Kristofer Kjörling encoder Heiko Purnhagen This contribution makes the following points MPEG has a long history of creating successful and widely adopted standards using the existing CE methodology MPEG specifications leave the encoder as informative so that o encoding performance can increase after the specification has issued o different encoder tradeoffs (e.g. performance/complexity) are possible The MPEG process must use the best encoder available in the CE process in order to assure that the highest quality specification is produced. Since the encoder is informative, MPEG cannot mandate that a proponent submit encoder source code that may reveal their proprietary know-how. The Chair noted that there may be some discussion on what level of informative source code must be submitted. For example, if the decoder has as block-switching filterbank, then the encoder 121 should also have such a filterbank. The Chair proposed that it may be a role of the Audio Subgroup to render an opinion as to whether the submitted code is “complete.” Pierrick Philippe, Orange Labs, presented m16143 Proposed improvements for MPEG Audio Core Experiment Methodology and Reference Software Development Pierrick Philippe The contribution makes the following points: A poor quality Informative Encoder is useless for the purposes of standardization. It has the danger of being used by outside parties as representative of MPEG Reference Quality Encoder A good quality Reference Encoder permits interested parties to quickly do a “trial” CE in their own lab. A high quality Reference Encoder permits easier participation in the MPEG process A good quality Reference Encoder clearly need not be computationally efficient. This contribution also proposes a way to collaboratively build a Reference MPEG Encoder. It gives several examples of how performance results can be interpreted in the CE process. It also comments on the current CE methodology: That the CE process use the CMOS test (e.g. 7-level, “A better than B”, A much better than B”, …), and that the comparison RQ vs. AE+CE be made. That the CE uses a set of items that represent the three sets of categories: speech, speech and music, and music. However, additions to the common set of items can be entertained for specific tools. There was considerable discussion on whether any conclusion could be drawn from a comparison of RQ vs AE+CE and whether it is “good engineering practice” to compare one system against another system plus modification, in that one cannot, in general, conclude that an observed difference in quality is due to the new tool as opposed to a difference between the two encoders RQ and AE. Kristofer Kjörling, Dolby, and Juergen Herre, FhG, noted that an MPEG Reference Quality Encoder that was once labelled as “good” or “high” quality might be a disservice to MPEG years later when commercial encoders far surpass the quality of the MPEG Reference Quality Encoder. Roch Lefebvre, VoiceAge, presented m16146 Comments on Core Experiments methodology for MPEG USAC standardisation Roch Lefebvre Philippe Gournay Redwan Salami This contribution makes the following points A CE methodology exists, and the MPEG CE process is open to all experts, which is not the case in some other standardization bodies. The CE process must balance o Protection of RM0 and CE proponent encoder know-how. o Making the CE process accessible to all MPEG experts. Markus Multrus, FhG, presented m16160 Thoughts on Core-Experiment Methodology Bernhard Grill Jürgen Herre Ralf Geiger Max Neuendorf Markus Multrus This contribution raised the following points: A Reference Quality Encoder should be used in CEs It is an unproductive diversion of resources to create an “open-source” Reference Encoder and to maintain the code and verify its quality. The goal of CEs is to achieve a strictly monotonic improvement of the performance of the specification. This can be assured if the CEs only use a Reference Quality Encoder. There need not be one and only one Reference Quality Encoder. The current CE process has worked well, leading to several widely adopted specifications. 122 The current CE process is still in force. With regard to the Encoder Software The Reference Quality Encoder is available to all interested parties via e.g. a customdesigned object-code interface to RM0. The goal is to create a high-quality specification via continuously improving the normative technology. Imre Varga, Siemens, noted that it should not be the role of MPEG to mandate how the resources of member companies should be spent. Kei Kikuiri, NTT DoCoMo, presented m16202 Comments on USAC Standardization Activities Kei Kikuiri Nobuhiko Naka Kousuke Tsujino This contribution makes the following points Common encoder used by CE proponents That the common encoder be of good to high quality Results of CE process be incorporated in the common encoder. The Chair noted that contributions not presented in Sunday’s AhG meeting will be presented in the Audio Subgroup. 4 Task group activities 4.1 Joint meetings 4.1.1 MPEG Surround Signalling and MP4 FF issues (with Systems) Heiko Purnhagen, Dolby, presented m16117 Thoughts on MPEG Surround signaling Frans de Bont Stefan Döhla Heiko Purnhagen Alexander Gröschel The contribution presents a method for backwards-compatible signalling of MPEG Surround. David Singer, Apple, noted that this “single-stream” MPEG Surround solution would also be compatible with 3GPP File Format and IETF RTP streaming solutions. The presenter confirmed that there is software that generates the bitstream and decodes the bitstream to produce an MPEG Surround output. Furthermore the bitstreams with MPEG Surround signalling was fed to numerous AAC or HE-AAC decoders and not “crashed” on the bistream and most played the core codec signal. It was the consensus of Audio Subgroup to have a “Thoughts on MPEG Surround Signaling” output document that NBs should consider when balloting. Stefan Doehla, FhG, presented m16054 Scalable Audio and MP4 Stefan Doehla The contribution notes that a single-stream solution for SLS is not feasible for exact reconstruction, as the SLS side information would overflow the AAC core input buffer constraints, and hence, a two-stream solution is necessary. However, multiple occurrences of “mp4a” tracks cause problems for many existing implementations, especially when such tracks are not alternate encodings rather where one depends on the other (e.g. one is a “base” AAC stream and the other is an SLS enhancement stream). Some implementations only parse the first audio track encountered and ignore any additional tracks while others do nothing even though they could decode the base track. The contribution proposes an extension to the MP4FF specification that would support the notion of “base” and “dependent” streams in the context of audio. mp4a is audio base track 123 m4ae is audio enhancement track MP4FF supports a track reference structure (dpng) that might be used to infer base and extension streams. Not all implementations reference this information, but they could. New audio codecs could explicitly indicate that this should be done, e.g. in an informative section. Systems proposed to start an amendment to 14496-14 (MP4 FF) to implement the “mp4ae” box type. Discussion of carriage of Audio Profile indication Dave Singer noted that 14496-14/Cor1 already supports carriage of an Audio Profile indication within the ES_Descriptor. 4.1.2 High-Performance Video Coding (HVC) and Audio (with Requirements) The MPEG Video Subgroup is considering video coding for devices whose capabilities are beyond those of current HD. These might be Ultra-HD (UHD) devices, such as 4K x 2K displays. We note that for such UHD displays, a much closer viewing distance is feasible and perhaps desirable. This could be envisioned as a “personal UHD” experience in which there is both visual and audio envelopment. This could have significant impact on audio presentation such that accurate sound localization can occur. If there is truly only one viewer, it might be that aspects of the audio presentation might be individualized in some meaningful way. Audio experts can track documents in the “HVC” work item of the WG11 resolutions. 4.2 Task Group discussions 4.2.1 MPEG-2, MPEG-4, MPEG-7 Audio and MPEG Surround, conformance, reference software Tilman Liebchen, LG, presented m16036 Proposed Text of ISO/IEC 144963:2005/Amd.2:2006/DCOR4 Noboru Harada Tilman Liebchen Takehiro Moriya Yutaka Kamamoto This contribution is candidate corrigendum text that: corrects syntax of pseudo-code clarify mathematical operations in pseudo-code such that they are never ambiguous clarify use of terms (i.e. layer vs. stage) Kristofer Kjörling, Dolby, presented m16056 Proposed Draft Corrigendum on AAC-ELD Markus Schnell Per Ekstrand This contribution is candidate DCOR text that preserves text describing how an implementer can alter the complex-exponential phaseshifts in the Complex QMF low delay filter bank based on implementation goals provides PCU and RCU complexity figures Pierrick Phillipe, Orange Labs, asked to clarify the definition of PCU and RCU and how they were derived. The Chair noted that, by his understanding, 1 PCU is defined as 1 Million signal processing operations per second (e.g. multiply-accumulate is one operation) and 1 RCU is defined as 1 K-Word of storage, where a “Word” is the storage required for an arithmetic value. Kristofer Kjörling m16141 Proposal for splitting the current AAC family profiles into two Kristofer Kjörling Andreas Schneider This topic has been raised at previous MPEG meetings. This contribution reviews the timeline and history of the issue and the status of support for 960 transform length with respect to reference software and conformance. As of the date of this MPEG meeting, all conformance sequences for 960 block length are available and that the reference software (mp4mcDec) has been extended to support 960 block length. It notes that most implementations that use the AAC or HE-AAC v2 124 profile use and support exclusively 1024 block length and only DAB+ uses HE-AAC V2 with 960 block length. DRM uses HE-AAC v2, but uses a ER Scalable AAC core. The contribution proposes the following changes to the profiles: AAC profile, HE-AAC profile and HE-AAC v2 profile be restricted to 1024 block length. HE-AAC v2 960 profile be created, and that it be restricted to 960 block length. There was considerable discussion concerning the impact of the proposed change on current licensing programs for the AAC family of technology. David Singer, Apple, recommended that the MP4FF be extended to signal the audio profile. Bernhard Grill, FhG, suggested that there be a break-out to discuss the future direction of audio profiles, e.g. in what market applications 1024 or 960 should be supported. The Chair scheduled the break-out for 2PM Tuesday, after which this discussion will be resumed. Ralf Geiger, FhG, presented m16127 Proposed Corrigendum on MPEG-4 SLS Conformance Ralf Geiger The contribution proposes that conformance information be updated to fix the truncated bitstream bug present in the arithmetic coding tool have a consistent delay within the SLS conformance It was the consensus of the ASG to issue this as a DCOR at this meeting. The Chair noted that there is a Systems/MP4FF mechanism to eliminate the decoder “state-variable zero output block” and urged that the USAC decoder software support this mechanism. Junghoe Kim M16200 Proposed BSAC Conformance Bitstreams for Terrestrial DMB Miyoung Kim Junghoe Kim Eunmi Oh The contribution proposes a definition for conformance streams that specifically support the BSAC configuration used in the Terrestrial DMB (T-DMB) ETSI specification. It is the consensus of the ASG to incorporate these definitions and conformance streams into MPEG-4 Audio Conformance. On behalf of Matthias Gruhne, FhG, the Audio Chair presented m16108 Study on ISO/IEC TR 15938-8:2002/FPDAM 4 Matthias Gruhne The contribution supplies extensive additional information concerning how STFFT coefficients are derived from the MDCT coefficients in the audio coded representation. This includes addition explanatory text and Matlab and C source code that provides an example implementation. It was the consensus of the ASG to issue this as ISO/IEC TR 15938-8:2002/DAM 4. Heiko Purnhagen, Dolby, presented m16121 Further corrections to MPEG Surround text Heiko Purnhagen Jeroen Koppens Matthias Neusinger This contribution proposed various editorial corrections and clarifications. It was the consensus of the ASG to issue this as a DCOR to MPEG Surround Heiko Purnhagen, Dolby, presented m16124 Corrections to MPEG Surround reference software Heiko Purnhagen Jeroen Koppens Claus-Christian Spenger Matthias Neusinger It was the consensus of the ASG to issue this as a DCOR to MPEG Surround Reference Software 4.2.2 MPEG-D Spatial Audio Object Coding Leonid Terentiev, FhG, presented m16099 Report on corrections for the MPEG SAOC FCD text and RM software 125 Jonas Engdegård Heiko Purnhagen Cornelia Falch Leonid Terentiev Andreas Hölzer Oliver Hellmuth Johannes Hilpert Yang-Won Jung Henney Oh Jeroen Koppens This contribution presents Editorial changes to the SAOC text consisting of correcting formatting and equation style, spelling and abbreviation errors. In addition, mathematical expressions were corrected, clarified and simplified. Bugfixes to the SAOC reference software. It was the consensus of the ASG to incorporate the proposed changes in to a Study on the FCD. Heiko Purnhagen, Dolby, presented m16095 Information regarding CE on Low Delay MPEG SAOC Jonas Engdegård Heiko Purnhagen Oliver Hellmuth Johannes Hilpert Maria Luis Valero Andreas Hölzer Markus Schnell Leonid Terentiev Erik Schuijers Per Ekstrand The contribution notes that adopting the AAC-ELD low-delay filterbank resulted in audible artefacts in a number of identified critical items. Hence it proposes to use a filterbank with slightly larger delay (from 1.3 ms to 5.3 ms), in which case artefacts are no longer audible. However this new filterbank reduces the SAOC system delay from 26.7 ms to 5.3 ms (i.e. assuming a zero-delay core “coder”). Various other changes are proposed in order to optimize the low delay operating mode (parameter band grouping, decorrelators, and bit that signals low delay mode). The proposed low-delay SAOC system achieved the following delay: Core Coder Total one-way system delay AAC LD 26.6 ms AAC ELD 21.3 ms AAC ELD with SBR 39.0 ms Listening test results were shown for the AAC ELD core coder using an operating mode that results in a 21.3 ms one-way delay (as in the table above). Generally, the results showed that the low delay mode provides audio quality that is comparable to the regular mode, in the mean and for all individual items. It was the consensus of the Audio Subgroup to incorporate this technology into the FCD text via a “Study on ISO/IEC FDIS 23003-2:200x, Spatial Audio Object Coding.” The Chair noted that the FCD ballot closes on 2009-03-14. Hence the final low delay filterbank coefficients and a “sanity check” listening test on that final filterbank must be available sufficiently in advance of that date so that National Bodies can take that information into account when casting their ballots. It was agreed that the following information will be available prior to the close of the ballot Frequency response of interim and final prototype filters “Sanity check” listening test, e.g. 8 listeners for most critical items This will be documented in the workplan for SAOC. Finally, it was noted that the SAOC low-delay filterbank is not compatible with MPEG Surround, however there was no reason to expect that MPEG Surround could not be extended, if desired, to have a low-delay mode using the exactly the SAOC low-delay filterbank. Pierrick Philippe, Orange Labs, presented m16216 Subjective Evaluation of Low Delay MPEG SAOC P. Philippe This contribution presented listening test information that was very similar to the test results provided in the previous contribution (m16095). 126 Heiko Purnhagen, Dolby, presented m16096 Information regarding CE on Low Power MPEG SAOC Jonas Engdegård Heiko Purnhagen Oliver Hellmuth Leonid Terentiev Erik Schuijers This contribution provides addition information on the CE on low power SAOC. The final configuration of the technology was clarified and a comprehensive set of listening test results provided. Relative to the “high-quality” mode, the low power mode most prominently has A mixed real-valued/complex-valued filterbank (first 8 bands are complex), as is used in low-power MPEG Surround. Low-complexity decorrelator for mono downmix to stereo output, as is used in low-power MPEG Surround. No decorrelator for stereo downmix to stereo output. Reduced-bandwidth residual, as is used in low-power MPEG Surround. A 50 % reduction in computational complexity with respect to “normal” SAOC. Overall, the performance of the low-power mode demonstrated a slightly lower mean performance, but not at the 95% level of significance. It was noted that the “reduced-bandwidth residual” was not tested for the advanced karaoke application. A Workplan will coordinate the tasks for FhG to create test waveforms and LG to conducting a listening test. Interoperability and naming of the various “flavors” of SAOC is summarized in the following table, where an “X” indicates that a given bitstream can be decoded by a given decoder. Note that HighQuality and Low-Power bitstreams are identical. Decoder Bitstream HQ LP LD HQ/LP X X LD X Kwang-Ki Kim, ETRI, presented m16084 CE on Residual Coding Process for Post Downmix Gain Kwang-Ki Kim Jeongil Seo Seungkwon Beack Kyeongok Kang MinsooHahn This contribution provides listening test data that shows that a residual signal with a bandwidth equal to the first 2 QMF bands improves audio quality at the 95% level of significance in this application area. The contribution also reports on a cross-check listening test done at LG. The Chair noted that the SAOC processing will remove the mastering “feel” even before any sound stage re-mixing is applied, so that the user will experience a very different feel as compared to the stereo mastered mix. There was considerable discussion as to what the user experience would be when using the proposed technology and whether the proposed technology would bring the expected value to the consumer experience. There was further discussion as to the increase in complexity, if any, that the proposed technology would bring. The Chair suggested that ETRI bring additional information later in the week that would clearly indicate the evolution of technology and performance over the course of the CE, and this will be discussed later in the week. Zhong Haishan, Panasonic, presented m16086 Efficient inter-object relation indicator for SAOC Zhong Haishan Zhou Huan Chong Kok Seng Tomokazu Ishikawa Takeshi Norimatsu The contribution proposes a syntax that supports both a “flexible” and an “efficient” way of indicating inter-object relation. Leondid Terentiev, FhG, noted that the contribution does not take 127 into account the fact that the Config structure is always padded out to an integer number of bits (i.e. via byte_align()), which may significantly change the bit savings count. Heiko Purnhagen, Dolby, noted that it is informative to measure bit savings WRT an entire bitstream under the assumption of a single Config (i.e. file-based decoding) or periodic Config (i.e. transmission-based decoding supporting break-in). Panasonic experts will bring additional information later in the week that addresses the bytealignment issue and the periodic transmission issue. Leonid Terentiev, FhG, presented m16097 Information regarding mixing mode for the enhanced Karaoke/Solo processing Cornelia Falch Leonid Terentiev Johannes Hilpert Oliver Hellmuth The contribution proposes an extension to the Karaoke/Solo mode that allows for more efficient processing in the case that an arbitrary mix is used for the Karaoke/Solo separation. It was the consensus of Audio Subgroup to include this technology into the Study on SAOC FCD text. Leonid Terentiev, FhG, presented m16098 Proposal for MCU functionality extension for the MPEG SAOC Leonid Terentiev Cornelia Falch Oliver Hellmuth This contribution provides test clarifying MCU functionalities MCU operations (i.e. as mathematical equations) MCU control interface as was requested at the 86th MPEG meeting. It was the consensus of the Audio Subgroup to incorporate this additional information into the Study on SAOC FCD text. Yang-Won Jung, LG, presented m16100 Proposal for dynamic preset extension for the MPEG SAOC Heiko Purnhagen Cornelia Falch Leonid Terentiev Oliver Hellmuth Johannes Hilpert Yang-Won Jung Henney Oh Jeroen Koppens This contribution proposed dynamic presets. Since presents are uniquely identified by a label string, it is proposed that when a decoder receives a preset with an already known label, it overwrites the currently stored preset settings with the new information. Heiko Purnhagen, Dolby, noted that there are a few details still to be specified in order to have a rational and deterministic system. He will lead a break-out group to study these issues and report back to the group. It was the consensus of Audio Subgroup to adopt this technology into the Study on SAOC FCD text, subject to a positive report from Heiko Purnhagen. Yang-Won Jung, LG, presented m16103 Consideration on User Interface in SAOC Yang-Won Jung Henney Oh This contribution describes several possible user interface controls that an SAOC decoder might use in different application scenarios. There was considerable discussion, in which it was pointed out that MPEG might offer tutorial examples of how to determine a rendering matrix, i.e. as mathematical expressions. LG will draft candidate informative text for review later in the week. Yang-Won Jung, LG, presented m16104 Proposal for adding information on object characteristics in SAOC 128 Yang-Won Jung Henney Oh The contribution noted that it may be valuable for the decoder to have some knowledge of the nature of the objects contained in the downmix signal. It proposes to add syntax constructs in the Config() structure to carry such information. It was noted that the proposal does not seem to have a computational-based algorithm to deduce the information and has no normative decoding aspect. The Chair suggested that this matter requires more discussion in a break-out group, and asked that group to report after lunch Thursday. Yang-Won Jung, LG, presented m16105 Proposal for including guideline information on the rendering Yang-Won Jung parameters in SAOC Henney Oh The contribution proposes an informative method to influence, or even limit, the user’s choice of rendering matrix. Informative guide would take the form of mathematical expressions. The Chair noted that a possible mechanism to support this need and that expressed in the previous contribution is to define a User XML Data box that could contain arbitrary information, perhaps based on a normative set of tags. The Chair suggested that this matter requires more discussion in a break-out group, and asked that group to report after lunch Thursday. Yang-Won Jung, LG, presented m16106 Comments on the enhanced karaoke mode in SAOC Yang-Won Jung Henney Oh This contribution notes that in “normal” SAOC mode it is possible to apply a small amount of control to every object via a small rate of additional side information per object. In order to achieve Karaoke/Solo function, a residual signal is required for “foreground” and “background” objects. The contribution proposes to unify the normal mode and EKS mode transmit information in the bitsream that indicates which object is matched to which residual signal. This proposal was discussed, particularly the validity of the assumptions taken by the contribution It was clarified that ENG mode and residuals typically are used together, with residual being bandlimited (e.g. first 8 bands) and ENG being used in the higher bands (which there is no residual signal). The Chair suggested that this matter requires more discussion in a break-out group, and asked that group to report after lunch Thursday. Pierrick Philippe, Orange Labs, presented m16107 Proposed Audio Sequences for MPEG-D SAOC Pierrick Philippe Gregory Pallone Marc Emerit The contribution reports that Orange Labs is making available to MPEG, for the purpose of developing MPEG standards, audio signals that are appropriate for assessing performance of SAOC in a teleconferencing scenario. These include signals with up to 5 talkers, each recorded individually in an acoustically isolated environment background music talkers are involved in an actual task, which might optionally involve the background music item (e.g. interactively guess the song title, or conduct a technical meeting). Items are available on a USB stick (some 80 Mbytes). Break-Out on SAOC (Oliver Hellmuth) These contributions have already been presented, and addition discussion occurred in the break-out group, and the recommendations of the break-out are reported here: m16084, ETRI, CE on Residual Coding Process for Post Downmix Gain Value not clear. Proposal: no action 129 m16103, LG, Consideration on user interface in SAOC Proposal: Put guideline into informative annex m16104, LG, Proposal for adding information on object characteristics in SAOC Proposal: Use existing BsRelatedTo + additional informative text (to give an idea of the functionality) m16105l, LG, Proposal for including guideline information on the rendering parameters in SAOC Problem understood, but not solved with the proposed solution. Proposal: no action m16106, LG, Comments on the enhanced karaoke mode in SAOC Proposal: FhG to provide input to the next meeting: Clarification of the "Study On FCD" text based on questions raised in this input contribution m16086, Panasonic, Proposal on efficient inter-object relation indicator for SAOC Minimal saving relative to overall bit rate. Proposal: no action m16100, Dolby, FhG, Philips, Proposal for dynamic preset extension for the MPEG SAOC Proposal: Include necessary clarifications of the input contribution and the related discussion in the "Study On FCD" text within the editing period 4.2.3 MPEG-D Unified Speech and Audio USAC Core Experiments Kristofer Kjörling, Dolby, presented m16142 Core experiment proposal on the USAC eSBR module Lars Villemoes Per Ekstrand Kristofer Kjörling The contribution notes that there is audible pre-echo when using the SBR tool at 16 kb/s for some mono signals, e.g. “castanets.” It gives an overview of the harmonic transposition algorithm as used in USAC SBR, and notes that when the signals are pulse-like rather than sinusoidal and harmonic, care must be taken when doing the time-frequency transforms that are part of constructing the harmonic extension. Spectrograms where presented that showed the presence of pre-echo in the current processing mode and their absence in the proposed mode. Listening test results were presented for an extended test set (adding castanets and harpsichord). Results suggest that the technology has merit, but may have a significant increase in complexity, with the complexity increase depending on how one arranges the alignment of windows and perhaps other factors. Dolby experts will work with the RM0 proponents to refine this proposal and bring additional information to the next MPEG meeting. Philippe Gournay, VoiceAge, presented m16147 Proposed Core Experiment on LPC Quantization for USAC Philippe Gournay Bruno Bessette Roch Lefebvre Redwan Salami The contribution reviewed the RM0 LPC vector quantizer, and proposes an alternative solution which has significantly lower table storage requirements. In addition, the proposed solution has a more flexible bit allocation scheme that permits using more bits when needed such that there are fewer quantized LPC models large spectral distortion (relative to the unquantized model). Listening test results were presented at the 16 kb/s mono operating mode, showing that the proposed system had quality that was not different from the RM0 system. 130 It was suggested that additional information be provided to aid the Audio Subgroup in making a decision. This could be: cross-check listening test at 16 kb/s for mono signals proponent and cross-check listening test at some additional operating mode(s) listening test methodology might be MUSHRA and also CMOS spectral distortion information at all operating points Markus Multrus, FhG, presented m16162 Proposed Update of Arithmetic Coder Tables for USAC Guillaume Fuchs Markus Multrus The contribution reviews the current context-adaptive arithmetic coder, which requires 105K words of storage (75% of total table storage in decoder). It proposes to replace the existing tables with new tables whose total size is 15 K words, which is a reduction by a factor of 7. This is achieved by reducing the number of probability distributions from 256 to 32. It is possible to create an “encoder” via a lossless transcoding of the RM0 bitstreams to the proposed bitstream format and to decode these by the proposed decoder. It was verified that both decoders outputs were bit-identical waveforms. When averaged over all test items and all operating points, the new arithmetic coding scheme had the same bitrate (identical to 2 decimal digits). It was further noted that computational complexity is comparable. Bernhard Grill, FhG, noted that the RM0 arithmetic coding design was crude and preliminary, and this proposal is based on a careful analysis of the problem and subsequent optimization. Samsung offered to conduct a cross-check of the decoded waveforms and the bitrate changes as presented in the contribution. The Chair suggested that, if that cross-check is available one week after the close of the MPEG meeting, and if it is positive, then the ASG could agree at this meeting to incorporate the proposed technology into the USAC WD and RM. Audio experts will draft a Workplan for USAC CEs that coordinates work for the following CEs: Company CE Samsung Phase in stereo coding Samsung Additional mode of unvoiced speech coding ETRI SBR ETRI TCX Dolby SBR harmonic extension VoiceAge LPC VQ FhG Arithmetic coding tables MPEG Encoder and the CE process This was a continued discussion of how the USAC source code encoder could be used in the CE process. Mohamad Raad, RaadTech Consulting, made a presentation on this topic. The proposal emphasises the following point: An additional component of the CE integration phase, the proponent must provide sufficient support, as educational text and/or exemplary source code so as to enable others to implement the tool in the MPEG Reference Encoder. The “support” component must be accepted by the consensus of the Audio Subgroup. In addition, the proposal requests that the RM0 proponents enhance the current Informative Encoder source code such that there is code for all modules in the encoder block diagram in the informative part of the WD. The ASG agrees with the following plan of action: Draft Workplan to define all modules categorize modules as signal flow or control 131 identify what modules are present or missing in the Informative Encoder source evaluate the informative encoder description in the WD and judge if it provides sufficient “support” Update CE methodology to incorporate An additional component of the CE integration phase, the proponent must provide sufficient support, as educational text and/or exemplary source code so as to enable others to implement the tool in the MPEG Reference Encoder. The “support” component must be accepted by the consensus of the Audio Subgroup. The Chair requests that RM0 proponents submit contribution that specifies the areas, if any, that they could provide missing or improved source code. Junghoe Kim, Samsung, presented m16055 Comments on WD of Unified Speech and Audio Coding Kihyun Choo Junghoe Kim Eunmi Oh This contribution suggested editorial corrections and clarifications alignment of text to ref sw for noise filling tool minor technical changes to “acelp_core_mode” signalling that slightly reduce bitrate It further notes that the RM0 CfP bitstreams have undefined bits at the end of usac_raw_data_block(). This appears to be an extension payload element. It is the consensus of the ASG to incorporate the following into the USAC WD text: editorial corrections and clarifications alignment of text to ref sw for noise filling tool Hyun-Kook Lee, LG, presented m16119 Proposed syntax revision on USAC RM0 Dong Soo Kim Sungyong Yoon Hyun-Kook Lee Jaehyun Lim This contribution suggested minor technical changes to “acelp_core_mode” signalling that slightly reduce bitrate It is the consensus of the ASG to incorporate the comments from Samsung and LG on “acelp_core_mode” signalling into a document titled “Thoughts on Efficient Bitstream Syntax.” The ASG will consider the syntax change proposals in this document no later than the meeting at which USAC progresses to CD stage. This does not imply that the ASG will take any action on the proposals, but rather than each will be considered on the basis of its technical merit. USAC CE Break-out (Markus Multrus) Hyun-Kook Lee, LG, continued his presentation of m16119 Proposed syntax revision on USAC RM0 Dong Soo Kim Sungyong Yoon Hyun-Kook Lee Jaehyun Lim The main points of this contribution were: 1st Issue on acelp_core_mode and lpd_mode already talked through before the break-out 2nd issue on the transmission of noise_offset parameter in case noise_level equals 0 (in fd_channel_streams) o In case noise_level == 0, noise_offset is not used o Max Neuendorf, FhG proposed, to collect this proposal in the "Thoughts on Efficient Bitstream Syntax" document. There was no agreement on this so the decision was postponed. 132 o Roch Lefebvre, VoiceAge, suggested that the group concentrate on other CEs that have a more significant impact on the performance of the work item. It was the consensus of the Audio Subgroup that this technology would be included in the “Thoughts on Efficient Bitstream Syntax” document. Hyun-Kook Lee, LG, presented m16122 Efficient signaling for FD frame on USAC RM0 Dong Soo Kim Sungyong Yoon Hyun-Kook Lee Jaehyun Lim The main points of this contribution were: Proposal to apply differential coding for global_gain, max_sfb. Difference should be Huffman-coded Syntax revision in single_channel_elemnt(), channel_pair_element(), ics_info() Differential coding could be turned on/off. This is signalled by an additional bit in the bitstream Bitrate savings for 2 scenarios were presented: Storage scenario (up 0.81% saving), streaming scenario (up to 0.78%) Ralf Geiger pointed out that this proposal infringes the random-access of AUs: parsing of syntax is infringed Bernhard Grill pointed out the importance of the correct global_gain for decoding and level adjustment: increament/decreament by 1 issues a level difference for 1.5 dB Ralf Geiger also pointed out that the correct decoding of max_sfb is important for paring of the frame Tilman Liebchen pointed out the bitrate savings. Werner Oomen proposed to include this issue into the "Thoughts on Efficient Bitstream Syntax" document Max Neuendorf repeated the importance of the self-containedness for the bitstream parsing There was no consensus to put issue into the "Thoughts on Efficient Bitstream Syntax" document Advantages and disadvantages were listed by the presenter: o Advantage: In average bitrate saving o Disadvantge: Break up self-containedness (ER) o Disadvantage: Bitrate saving not guaranteed o For Random-Access: Even bitrate increment o Disadvantage: Loss of functionality (level adjustment) It was the consensus of the Audio Subgroup that, for now, this technology would not be included in the “Thoughts on Efficient Bitstream Syntax” document. Hyun-Kook Lee, LG, presented m16125 Proposed syntax revision regarding window sequence on USAC RM0 Dong Soo Kim Sungyong Yoon Hyun-Kook Lee Jaehyun Lim The main points of this contribution were: Proposal to entropy code window_sequence for USAC In RM0: fixed 2 bits In proposal: 1..2 bits Conflicts with m16153, m16154 Bitsavings 0.060% … 0.099% for all FD frames, 0.032%..0.067% in total Proposal in case the proposed corrections to WD are applied, proposal not possible any more (more than 3 possibilities): 4 transitions present in bitstreams Conflicts with m16154 133 Problems with Random-Access: if last frame is lost, not clear if long/short frame: Frame not parsable It was the consensus of the Audio Subgroup that, for now, this technology would not be included in the “Thoughts on Efficient Bitstream Syntax” document. Hyun-Kook Lee, LG, presented m16123 Comment on random access issue on USAC RM0 Dong Soo Kim Sungyong Yoon Hyun-Kook Lee Jaehyun Lim The main points of this contribution were: In current RM0: Frame depends on previous frame Correct Decoding of frame not possible in case last frame is lost Proposal: include last_core_mode, last_window_sequence in header (USACSpecificConfig()) Werner Oomen pointed out that decoding of 1st frame is not completely possible , because e.g. filterbank states are missing Problem: Start-up only in case header is received Previous frame needed to decode current frame Take discussion offline Ralf Geiger, FhG, presented m16154 Proposed Update on USAC Bitstream Syntax Jérémie Lecomte Max Neuendorf Ralf Geiger Markus Multrus The main points of this contribution were: Alternative signalling for window_sequence, which simplifies, saves bitrates In RM0: window_sequence coded by 2 bits, info from previous frame needed to determine correct seuqnece Proposed sequence: 1..2 bits, indicates transform length, right window slope length; length of left window slope derived from previous window, incorporated in ics_info() Proposed update: No restrictions to window signalling as in RM0 Bitrate reduction between 4.41 … 22.41 bit/second Additional presentation on "Requirements for window signaling": o Random access: Bitstream must be parsable, right window half must be preserved o Efficient syntax with low redundancy Comparison of RM0, m16123, m16125, m16154 wrt random-access: a) bitstream syntax self contained, b) random-access, c) perfect reconstruction, d) efficient syntax, easy to understand o RM0: a), b) o M16123: a), b) (b with restrictions) o M16125: c) o M16154: a), b), c), d) Discussion on need of this proposal: Kristofer Kjörling pointed out the agreement not to go after maximum bitrate saving on current syntax; Max Neuendorf explained the proposed scheme as new concept of window signalling: Treat every proposal equal Take discussion offline After offline discussion between Ralf Geiger, FhG, and Hyun-Kook Lee, LG, it was agreed to include FhG’s m16154 proposal rather than LG's m16123 proposal in the “Thoughts on Efficient Bitstream Syntax” document. This was approved by the Audio Subgroup upon presentation of the “Thoughts on Efficient Bitstream Syntax” document in the Friday Audio Plenary. Hyun-Kook Lee, LG, presented 134 m16120 Proposed syntax revision regarding SBR bitstream on USAC RM0 Dong Soo Kim Sungyong Yoon Hyun-Kook Lee Jaehyun Lim The main points of this contribution were: Propose to reduce redundancy in bs_frame_class, bs_var_bord In RM0: bs_frame_class: 2bits Proposal 1: replace it by 1 bit, dependent on frame_class of previous frame Proposal 2: bs_var_bord_0: same as bs_var_bord_1 of previous frame (indicates the borders of SBR framing), could so omitted transmit both values "self-containing" in case of (frame_refresh != 0); frame_refersh is set in case of header transmission Discussion by Kristofer Kjörling: Not all frame class combinations can be signalled by proposal 1, e.g. not VARVAR succeeding FIXFIX, which are valid in SBR, proposal 2 prevents check for transmission errors on decoder side No further actions on this. There was a proposal that Kristofer Kjörling should come up with a tutorial on SBR frame class transitions. Reference Encoder Software (continued from AhG meeting) Mohamad Raad, RaadTech Consulting, presented M16208 Proposal for the development of a common MPEG Audio encoder for use in the CE phase Mohamad Raad The contribution notes that there should be a common USAC encoder that will be used for the CE process. To this end, it presents a detailed process on how to organize and manage the project of creating such software. Justin Ridge, Nokia, presented m16166 On the Unified Speech and Audio Coding Activity Mauri Väänänen This contribution made the following points: That the USAC work item should have the goal of significant improvement of performance, and that improvement of worst-case performance is most important (i.e. so that consistent performance is achieved). That complexity is important (i.e. so that implementations for battery-powered devices are possible) That it is more important to achieve significant improvement of performance than achieve quick time to market. It further notes that a high-quality standard is more assured if There is a low threshold of participation There is a high threshold of acceptance (for tools proposed in CEs) The contribution presents an example process for how the USAC CE process might proceed using both proprietary Reference Quality Encoder and “open-source” Reference Encoder. The Chair welcomed the opinions of Nokia as a major device manufacturer, and urged audio experts to “aim high” in order to be successful in the marketplace. 4.2.4 Exploration: Meta-Data Stephan Schreiner, FhG, presented m16144 Perspectives on Application Scenarios for Post-Processing Audio Metadata Stephan Schreiner Wolfgang Fiesel Akshaya Thippur This contribution presents possible application scenarios for the use of metadata in the context of audio post-processing. It noted that, historically, there may have been a well-defined and wellcontrolled closed environment in which a content producer could exactly predict and control the quality of the user experience. As content production, program assembly for transmission and user 135 listening environments becomes much more dynamic and heterogeneous, the assumption that a content producer can predict and precisely control the quality of the user experience is no longer true. It specifically discussed “Comfort Zone” adjustment, in which the dynamic range of an audio program is controlled and adjusted “Clean Audio” enhancement, in which the dialog component of an audio program can be boosted relative to the remaining program audio elements. It proposes that there is now the possibility to serve the discussed applications using a unique and efficient manner based on the concept of audio objects and associated parametric representations. Bernhard Grill, FhG, suggested that a liaison statement to DVB groups that could solicit their vision for future-looking system functionality that might be enabled by audio-related metadata. The Chair suggested that there should be an AhG mandate to craft candidate liaison text and/or “vision” document that could be attached to liaison statement. Kristofer Kjörling, Dolby, noted that often the dialog is often not available as a distinct program component, and so computing the dialog metadata can be a difficult problem. Bernhard Grill, FhG, noted that the MPEG experts should think about the technology that could be used to support the envisoned funcitonality 5 Audio closing plenary discussions Max Neuendorf, FhG, presented additional information on proposed changed to the USAC WD TW- MDCT o Segmental SNR plots show that change does not affect a 16-bit reconstruction of the output waveform For all proposed changes taken together o SNR and SegSNR show reasonable performance o Listening tests show no difference between RM with and without the proposed changes Additional proposed changes to the mathematical formulas were proposed Add additional details to Figure 1.1, Encoder Block Diagram. The Chair requested that the new Figure 1.1 be posted to the USAC email reflector and that it be used in the Workplan for developing the MPEG Reference Encoder. It was the consensus of the Audio Subgroup to incorporate these proposed changes into USAC WD2. 6 6.1 Meeting deliverables Responses to Liaison and NB comments The responses to Liaison and NB comments were prepared and approved. 6.2 Recommendations for final plenary The Audio recommendations were presented and approved. 6.3 Establishment of Ad-hoc Groups The following ad-hoc groups were established by the Audio subgroup: No. Title AHG on Audio Standards Maintenance AHG on Unified Speech and Audio Coding and Spatial Audio Object Coding 136 Mtg No Yes 6.4 Approval of output documents All output documents, shown in Annex D, were presented in Audio plenary and were approved. 6.5 Press statement The Audio contribution to the press statement was presented. Editing and further review will be done via email. 7 7.1 Future activities Schedule of future meetings Ad Hoc group meetings are indicated in Section 6.3. Unless otherwise indicated, Ad Hoc group meetings will be held at the location of the next MPEG meeting on the weekend preceding that meeting. 7.2 Agenda for next meeting The agenda for the next MPEG meeting is shown in Annex F. 7.3 All other business There was none. 7.4 The 87th Closing of the meeting Audio Subgroup meeting was adjourned Friday at 13:45. 137 Annex A Participants First Name Last Name Country Affiliation Johannes Boehm DE Thomson Ti Eu Chan SG I2R Yujie Dun CN XJTU Ralf Geiger DE Fraunhofer IIS Philippe Gournay Canada Bernhard Grill DE VoiceAge Corp. / Univ. of Sherbrooke Fraunhofer IIS Oliver Hellmuth DE Fraunhofer IIS Jürgen Herre DE Fraunhofer IIS Jeff Huang USA Qualcomm Inc. Yang-Won Jung KR LG Electronics Kyeong Ok Kang Korea ETRI Florian Keiler Germany Thomson Kei Kikuiri JP NTT DOCOMO Junghoe Kim KR Samsung AIT Kwangki Kim KR Kristofer Kjörling SE Information and Communications Univ. Dolby Hyunkook Lee KR LG electronics Taejin Lee KR ETRI Roch Lefebvre Canada Tilman Liebchen DE VoiceAge Corp. / Univ. of Sherbrooke LG Electronics Takehiro Moriya JP NTT Markus Multrus DE Fraunhofer IIS Yasushige Nakayama JP NHK Max Neuendorf Germany Fraunhofer IIS Toshiyuki Nomura JP NEC Takeshi Norimatsu JP Panasonic Eunmi Oh KR Samsung Henney Oh KR LG Electronics Werner Oomen NL Philips Applied Technologies Pierrick Philippe FR France Telecom R&D Heiko Purnhagen SE Dolby Schuyler Quackenbush USA ARL Mohamad Raad Australia RaadTech Consulting Andreas Schneider DE Dolby Stephan Schreiner Germany Fraunhofer IIS Jeongil Seo KR ETRI Haiyan SHU Singapore I2R Ralph Sperschneider DE Fraunhofer IIS Herve Taddei DE Huawei Technologies Leonid Terentiev DE Fraunhofer IIS Mauri Vaananen FIN Nokia Res. Center David Virette FR France Telecom R&D Minjie Xie USA Huawei Hai Shan Zhong Singapore Panasonic Singapore Laboratories Huan Zhou SG Panasonic Singapore Laboratories Yongwei Zhu Singapore Institute for Infcomm Resarch 138 Annex B Audio Contributions and Schedule Day / Time Task Group Sunday 1000-1700 AhG: USAC m16156 Progress of Technology Merge Between System 2 and USAC RM Taejin Lee Max Neuendorf Jeremie Lecomte Kyeongok Kang Bernhard Grill X m16173 Progress report on phase experiment for USAC JungHoe Kim Julien Robilliard Eunmi Oh Bernhard Grill X m16177 Progress report on unvoiced speech coding Hosang Sung Eunmi Oh X m16153 Proposed Corrections to WD and Reference Software on Unified Speech and Audio Coding Max Neuendorf Philippe Gournay Jérémie Lecomte Markus Multrus Stefan Bayer Guillaume Fuchs Ralf Geiger Frederik Nagel X 13001400 Lunch m16079 Discussion on the Unified Speech and Audio Coding Activity Herve Taddei Minjie Xie Qing Zhang X m16110 Comment on the unified speech and audio coding activity Werner Oomen X m16118 Considerations on the development of common USAC reference encoder Dong Soo Kim Sungyong Yoon Hyun-Kook Lee Jaehyun Lim X m16140 Core Experiment procedures and MPEG reference software encoder Kristofer Kjörling Heiko Purnhagen X m16143 Proposed improvements for MPEG Audio Core Experiment Methodology and Reference Software Development Pierrick Philippe X m16146 Comments on Core Experiments methodology for MPEG USAC standardisation Roch Lefebvre Philippe Gournay Redwan Salami X m16160 Thoughts on Core-Experiment Methodology Bernhard Grill Jürgen Herre Ralf Geiger Max Neuendorf Markus Multrus X m16202 Comments on USAC Standardization Activities Kei Kikuiri Nobuhiko Naka Kousuke Tsujino X 1800- Chairs Meeting Monday 0900-1230 MPEG Plenary 1300-1400 Lunch 1400-1430 Audio Plenary Welcome 139 Report on Sunday Chairs meeting Review main tasks for the week General documents m16116 86th MPEG Audio Report S. Quackenbush X m15952 Ad Hoc Group on Audio Standards Maintenance R. Sperschneider X m15953 Ad Hoc Group on SAOC, USAC S. Quackenbush X NB Position Papers m16044 AUNB Comment on the unified speech and audio coding activity AUNB X m16045 AUNB Comment on the exploration on metadata driven AUNB post-processing of audio X m15914 FNB Comment (from Busan) on USAC FNB m15879 Fin NB m15911 CNB 1430- USAC Reference Software M16208 Proposal for the development of a common MPEG Audio encoder for use in the CE phase Mohamad Raad X m16166 On the Unified Speech and Audio Coding Activity Mauri Väänänen X Discussion 1630-1800 MPEG-2, MPEG-4 and MPEG-7 m16036 Proposed Text of ISO/IEC 144963:2005/Amd.2:2006/DCOR4 Noboru Harada Tilman Liebchen Takehiro Moriya Yutaka Kamamoto X m16056 Proposed Draft Corrigendum on AAC-ELD Markus Schnell Per Ekstrand X m16127 Proposed Corrigendum on MPEG-4 SLS Conformance Ralf Geiger X M16200 Proposed BSAC Conformance Bitstreams for Terrestrial Miyoung Kim DMB Junghoe Kim Eunmi Oh X m16141 Proposal for splitting the current AAC family profiles into Kristofer Kjörling two Andreas Schneider X 1800- HoD Meeting Tuesday 0900-1300 MPEG-7 m16108 Study on ISO/IEC TR 15938-8:2002/FPDAM 4 0930-1300 SAOC m16099 m16095 Matthias Gruhne X Report on corrections for the MPEG SAOC FCD text and RM software Jonas Engdegård Heiko Purnhagen Cornelia Falch Leonid Terentiev Andreas Hölzer Oliver Hellmuth Johannes Hilpert Yang-Won Jung Henney Oh Jeroen Koppens X Information regarding CE on Low Delay MPEG SAOC Jonas Engdegård Heiko Purnhagen X 140 Oliver Hellmuth Johannes Hilpert Maria Luis Valero Andreas Hölzer Markus Schnell Leonid Terentiev Erik Schuijers Per Ekstrand m16216 Subjective Evaluation of Low Delay MPEG SAOC P. Philippe X m16096 Information regarding CE on Low Power MPEG SAOC Jonas Engdegård Heiko Purnhagen Oliver Hellmuth Leonid Terentiev Erik Schuijers X m16084 CE on Residual Coding Process for Post Downmix Gain Kwang-Ki Kim Jeongil Seo Seungkwon Beack Kyeongok Kang MinsooHahn X m16086 Efficient inter-object relation indicator for SAOC Zhong Haishan Zhou Huan Chong Kok Seng Tomokazu Ishikawa Takeshi Norimatsu X m16097 Information regarding mixing mode for the enhanced Karaoke/Solo processing Cornelia Falch Leonid Terentiev Johannes Hilpert Oliver Hellmuth X 1300-1400 Lunch 1400-1500 Future directions in Audio profiles 1400-1600 SAOC m16098 Proposal for MCU functionality extension for the MPEG Leonid Terentiev SAOC Cornelia Falch Oliver Hellmuth X m16100 Proposal for dynamic preset extension for the MPEG SAOC Heiko Purnhagen Cornelia Falch Leonid Terentiev Oliver Hellmuth Johannes Hilpert Yang-Won Jung Henney Oh Jeroen Koppens X m16103 Consideration on User Interface in SAOC Yang-Won Jung Henney Oh X 1600-1800 USAC Reference Encoder and the CE process 1800- Chairs Meeting Stephan Schreiner Wolfgang Fiesel Akshaya Thippur X Frans de Bont Stefan Döhla X Wednesday 0900-1100 MPEG Plenary 1200-1300 Exploration: Metadata m16144 Perspectives on Application Scenarios for PostProcessing Audio Metadata 1300-1400 Lunch 1400-1500 MPEG Surround – Joint with Systems m16117 Thoughts on MPEG Surround signaling 141 Heiko Purnhagen Alexander Gröschel m16054 Scalable Audio and MP4 Stefan Doehla Discussion: signaling audio profile in MP4FF X X 1500-1530 MPEG Surround m16121 Further corrections to MPEG Surround text Heiko Purnhagen Jeroen Koppens Matthias Neusinger X m16124 Corrections to MPEG Surround reference software Heiko Purnhagen Jeroen Koppens Claus-Christian Spenger Matthias Neusinger X 1400-1500 SAOC m16104 Proposal for adding information on object characteristics in SAOC Yang-Won Jung Henney Oh X m16105 Proposal for including guideline information on the rendering parameters in SAOC Yang-Won Jung Henney Oh X m16106 Comments on the enhanced karaoke mode in SAOC Yang-Won Jung Henney Oh X m16107 Proposed Audio Sequences for MPEG-D SAOC Pierrick Philippe Gregory Pallone Marc Emerit X 1600-1700 USAC m16142 Core experiment proposal on the USAC eSBR module Lars Villemoes Per Ekstrand Kristofer Kjörling X 1900 Social (don’t be late!) Thursday 0900-1300 USAC m16147 Proposed Core Experiment on LPC Quantization for USAC Philippe Gournay Bruno Bessette Roch Lefebvre Redwan Salami X m16162 Proposed Update of Arithmetic Coder Tables for USAC Guillaume Fuchs Markus Multrus x m16055 Comments on WD of Unified Speech and Audio Coding Kihyun Choo Junghoe Kim Eunmi Oh X USAC MPEG Reference Encoder discussion m16119 Proposed syntax revision on USAC RM0 Dong Soo Kim Sungyong Yoon Hyun-Kook Lee Jaehyun Lim X m16120 Proposed syntax revision regarding SBR bitstream on USAC RM0 Dong Soo Kim Sungyong Yoon Hyun-Kook Lee Jaehyun Lim X m16122 Efficient signaling for FD frame on USAC RM0 Dong Soo Kim Sungyong Yoon Hyun-Kook Lee Jaehyun Lim X m16123 Comment on random access issue on USAC RM0 Dong Soo Kim Sungyong Yoon Hyun-Kook Lee Jaehyun Lim X m16125 Proposed syntax revision regarding window sequence on USAC RM0 Dong Soo Kim Sungyong Yoon Hyun-Kook Lee X 142 Jaehyun Lim m16154 Proposed Update on USAC Bitstream Syntax 1300-1400 Lunch 1400-1800 TBD 1500-1530 Audio for HVC 1800- Chairs Meeting Audio plenary Remarks on Thursday Chairs meeting Recommendations for final plenary Establishment of new Ad-hoc groups AhG Mandates Get document numbers 1000 Approve Responses to NB comments and Liaison 1030 Approval of output documents Title: N10xxx File: w10xxx (short title).doc (NOT *.docx!) Zip: w10xxx.zip Review of Audio presentation to MPEG plenary Agenda for next meeting A.O.B. Closing of the Audio meeting 1300-1400 Lunch 1400- MPEG Plenary X X Friday 0730-1300 Jérémie Lecomte Max Neuendorf Ralf Geiger Markus Multrus 143 Annex C Task Groups 1. 2. 3. 4. MPEG-2 and MPEG-4 audio, conformance, reference software MPEG-D Spatial Audio Object Coding MPEG-D Unified Speech and Audio Coding Exploration: Meta-Data 144 Annex D Output Documents No. 10373 10374 10375 10376 10377 10378 10379 10380 10381 10382 10383 10384 10385 10386 10434 10387 10388 10389 10390 10391 10392 10393 10394 10395 10396 10397 10398 10399 10413 10414 10415 Title 13818-4 Conformance DoC on ISO/IEC 13818-4:2004/AMD 2:2005/DCOR 2, AAC Conformance ISO/IEC 13818-4:2004/AMD 2:2005/Cor 2, AAC Conformance 13818-7 Advanced Audio Coding DoC on ISO/IEC 13818-7:2006/DCOR 1, AAC ISO/IEC 13818-7:2006/Cor. 1, AAC 14496-3 Audio DoC on ISO/IEC 14496-3:2005/DCOR. 6, AAC ISO/IEC 14496-3:2005/Cor. 6, AAC DoC on ISO/IEC 14496-3:2005/AMD 2:2006/DCOR 4, HE-AAC V2 Profile and ALS ISO/IEC 14496-3:2005/AMD 2:2006/Cor. 4, HE-AAC V2 Profile and ALS DoC on ISO/IEC 14496-3:2005/AMD 3:2006/ DCOR 2, SLS ISO/IEC 14496-3:2005/AMD 3:2006/Cor. 2, SLS DoC on ISO/IEC 14496-3:2005/AMD 9:2008/DCOR 1, AAC-ELD ISO/IEC 14496-3:2005/AMD 9:2008/Cor. 1, AAC-ELD ISO/IEC 14496-3:2009/FPDAM 1:200X HD-AAC Profile Thoughts on MPEG Surround Signaling Issues concerning frame lengths in the AAC family profiles 14496-4 Conformance testing ISO/IEC 14496-4:2004/Cor. 6, AAC-LD ISO/IEC 14496-4:2004/DCOR 7, Removal of Audio Conformance DoC on ISO/IEC 14496-4:2004/AMD13:200x/DCOR 2, AAC-LD bitstreams ISO/IEC 14496-4:2004/AMD13:200x/Cor. 2, AAC-LD bitstreams DoC on ISO/IEC 14496-4:2004/PDAM 36, AAC-ELD, OAFI and additional AAC Conformance 14496-5 Reference Software DoC on ISO/IEC 14496-5:2001/Amd.10:2007/COR 3, ALS and SLS ISO/IEC 14496-5:2001/Amd.10:2007/COR 3, ALS and SLS Request for Subdivision of 14496, Audio Conformance ISO/IEC 14496-26:2009, Audio Conformance ISO/IEC 14496-26:2009/DCOR 1, ALS and SLS updates ISO/IEC 14496-26:2009/FPDAM 1, AAC-ELD, OAFI additional AAC and MPEG-1/2 on MPEG-4 Conformance WD on additional BSAC conformance streams for T-DMB 15938-8 Extraction and Use of MPEG-7 Descriptions DoC on ISO/IEC TR 15938-8:2002/PDAM 4, Extraction of audio features from compressed formats ISO/IEC TR 15938-8:2002/DAM 4, Extraction of audio features from compressed formats 23003-1 MPEG Surround ISO/IEC 23003-1:2007/DCOR 2, Misc. Corrections ISO/IEC 23003-1:2007/AMD 2:2008/DCOR 1, Ref. Sw. Update 145 TBP Available No 09/02/06 No 09/02/06 No No 09/02/06 09/02/06 No No No 09/02/06 09/02/06 09/02/06 No 09/02/06 No No No No No No YES 09/02/06 09/02/06 09/02/06 09/02/06 09/02/06 09/02/06 09/02/06 No No No 09/02/06 09/02/06 09/02/06 No No 09/02/06 09/02/06 No 09/02/06 No No No No No 09/03/06 09/02/06 09/03/06 09/03/06 09/02/20 No 09/02/06 No 09/02/06 No 09/02/06 No No 09/02/06 09/02/06 23003-2 SAOC 10416 Study on ISO/IEC FCD 23003-2:200x, Spatial Audio Object Coding 10417 Status and Workplan on SAOC Core Experiments 23003-3 Unified Speech and Audio Coding 10418 WD2 of USAC 10419 Workplan for USAC CEs 10420 MPEG Reference Encoder and the Audio CE Process 10421 Workplan on MPEG Reference Encoder 10422 Draft Revisions to MPEG Audio CE methodology 10423 Thoughts on Efficient Bitstream Syntax Liaison Statements 10424 Response to DRM on MPEG-4 AAC Technology and Profiles 10425 Response to ETSI/EBU/CENELEC JTC on MPEG-4 AAC Technology and Profiles 10426 Response to WorldDMB Forum on MPEG-4 AAC Technology and Profiles 10427 Response to IEC TC100/TA4 on IEC CDV 61937-11 and 609583/Amd.1 Responses to National Bodies 10428 Response to AUNB Comments on USAC 10429 Response to AUNB Comments on MetaData 10430 Response to FR, FI and CN NB Comments on USAC 146 No 09/02/20 No 09/02/06 No No No No No No 09/03/06 09/02/06 09/02/06 09/02/06 09/02/06 09/02/06 No No 09/02/06 09/02/06 No 09/02/06 No 09/02/06 No No No 09/02/06 09/02/06 09/02/06 Annex E Agenda for the 88th MPEG Audio Meeting Agenda Item 1. Opening of the meeting 2. Administrative matters 2.1. Communications from the Chair 2.2. Approval of agenda and allocation of contributions 2.3. Review of task groups and mandates 2.4. Approval of previous meeting report 2.5. Review of AhG reports 2.6. Joint meetings 2.7. Received national body comments and liaison matters 3. Plenary issues 4. Task group activities 4.1. MPEG-1, MPEG-2, MPEG-4, and MPEG-7 4.2. Spatial Audio Object Coding 4.3. Unified Speech and Audio Coding 4.4. Exploration: Meta-Data 5. Discussion of unallocated contributions 6. Meeting deliverables 6.1. Responses to Liaison and NB comments 6.2. Recommendations for final plenary 6.3. Establishment of new Ad-hoc groups 6.4. Approval of output documents 6.5. Press statement 7. Future activities 8. Agenda for next meeting 9. A.O.B 10. Closing of the meeting 147 Annex J – 3DG report Source: Marius Preda, Chair 1 Opening of the meeting 1.1 Approval of the agenda The agenda is approved. 1.2 Goals for the week The goals of this week are: Review SC-3DMC contributions and issue the associated CD and CE Discuss the software status for SC-3DMC Review the votes Discuss FAMC, Scene Partitioning RefSoftware and Conformance Status of software implementation in MP25 (especially the IC integration issues) Compile and test reference software Check the validity and re-generate when necessary conformance data for 3DGC Issue a new part of 14496 containing only 3DGC conformance Investigate future developments of MPEG 3D Graphics Compression Review Liaisons 1.3 Standards from 3DGC 4 4 2004 Amd.33 4 4 2004 Amd.34 4 4 2004 Amd.39 4 5 2001 Amd.22 4 5 2001 Amd.25 4 16 2006 Amd.4 4 16 200x 3rd Ed. 1.4 Multiresolution profile conformance 3DGC Model Conformance Scene partitioning conformance 3DG Compr. Model RefSof scene partitioning RefSof Scalable complexity 3D mesh coding AFX Room allocation 3DGC: CM100 148 07/04 06/07 08/01 07/10 07/10 08/07 09/02 3 08/01 08/07 09/02 3 08/10 09/02 09/07 3 08/01 08/07 09/02 3 08/10 09/02 09/07 3 09/02 09/07 10/01 3 09/0 3 1.5 Allocation of contributions N° D1 m15945 m16151 m16187 Title Schedule D1 09:00~11:30 13:00~14:00 14:00~15:30 Monday MPEG Plenary Lunch Break 3DG Plenary Roll call, Agenda, Goals, FAQ, etc., Report of AHG on 3DGC documents, experiments and software maintenance Results of voting Liaison Report on MXM latest developments MXM API for 3D Graphics content creation MXM use-case proposals for 3D services Marius Preda Patrick Gioia, Francisco Moran Marius Preda Francisco Moran Marius Preda Ivica Arsov, Marius Preda Francoise Prêteux Patrick Gioia 15:30~16:00 16:00 – 18:00 Coffee Break Scalable Complexity 3D Mesh Encoding (SC-3DMC) m16196 Bitstream Syntax and Semantics for QBCR and SVA m16149 Scalable Complexity Mesh Coding Benchmark m16195 An Explanation of SVA and QBCR En-Decoding Algorithm 149 Seungwook Lee Bonki Koo Daiyong Kim Kyoungsoo Son Euee S. Jang Benoit Le Bonhomme, Marius Preda, Françoise Preteux Kyoungsoo Son Seungwook Lee Bonki Koo Daiyong Kim Euee S. Jang m16025 D2 Corrections to "WD3.0 of ISO/IEC 14496-16 AMD4, Scalable Complexity 3D Mesh Coding" Sergio Arnaldo Francisco Morán Burgos D2 09:00~12:00 Tuesday Scalable Complexity 3D Mesh Encoding (SC-3DMC) m16197 CE Report Version 3 on the SC3DMC m16148 Attributes Encoding for TFAN m16150 MMW.com API extension for 3D graphics attributes Seungwook Lee Bonki Koo Daiyong Kim Kyoungsoo Son Euee S. Jang Khaled Mamou Titus Zaharia Marius Preda Françoise Preteux Benoit le Bonhomme Marius Preda Francoise Prêteux Marius Preda SC-3DMC Editing Plan Lunch Break ok Joint with System on MXM Joint with System on MPEG-V AFX Conformance and RefSoft Filippo Chiariglione Jean Gelissen m16198 A Report on the Conformance Test of 3D Graphics Group m16199 A Report on the Reference Software of SC3DMC Daiyong Kim Seungwook Lee Kyoungsu Son Preda Marius A Report on the Reference Software of SC3DMC 17:00~18:00 MP25 m16211 12:00~14:00 14:00 – 15:00 15:00 – 16:00 16:00~17:00 Source code for Interpolation Compression for MPEP-4 part 25 150 Sinwook Lee Sowon Kim Jeonghwan Ahn Euee S. Jang m16152 D3 Blagica Jovanova Marius Preda Françoise Preteux Selecting elementary streams in MP25 RefSoft Wednesday MPEG Plenary Lunch Break Joint with Video on RVC AFX: Mesh Grid SC-3DMC Editing of 14496-16 AMD 4 (check with Sergio if the updates are considered) AFX Editing of AFX 3rd Edition D4 Marco Mattavelli, Euee S. Jang D3 09:00~11:00 13:00~14:00 14:00 – 15:00 15:00~16:00 16:00~17:00 all 17:00~18:00 Marius Preda Thursday Editing of 14496-27 (3DGC Conformance) Avatar characteristics Editing of 14496-16 AMD 4 Seungwook Lee Jeong-Hwan Ahn D4 9:00 – 12:00 Lunch Break 14:00 – 18:00 AFX Issues (Ref Soft) 3DGCM issues (IC in RefSoft) Ref soft for SC3DMC Liaison with X3D 3 DoC D5 Friday 3DG output documents preparation Liaison statements review AhGs and resolutions Lunch Break MPEG Plenary all D5 09:00~12:00 12:00~14:00 14:00~ 151 152 1.6 Attendance list Name Marius Preda Francoise Preteux Francisco Morán Burgos Seung Wook Lee Euee S. Jang Byoungjun Kim Mingxiao Chen Jeong-Hwan Ahn Country France France Spain Korea Korea Korea Korea Korea 2 General issues 2.1 General discussion Company Institut TELECOM Institut TELECOM UPM ETRI Hanyang Univ. Hanyang Univ. Hanyang Univ. Samsung 2.1.1 Reference Software It is recalled that the source code of both decoder AND encoder should be provided as part of the Reference Software for all technologies to be adopted in MPEG standards. Moreover, not providing the complete software for a published technology shall conduct to the removal of the corresponding technical specification from the standard. Currently almost all the AFX tools published in the second edition are supported by both encoder and decoder implementation. Only exception is the MeshGrid tool; however commitment was renewed by VUB. 2.1.2 Web site OrangeLabs proposed a new version of the web site, now available at www.mpeg-3dgc.com. The goal of the web site is to disseminate the group activities (documents, software and demonstration), to maintain the FAQ and to be active in providing answers through the use of the Forum. 3DGC contributors are kindly asked to check the web-site and provide comments. 3 Current Voting Document title ISO/IEC JTC 1/SC 29 N 9642 :ISO/IEC 144965:2001/FPDAM 22: Information technology -Coding of audio-visual objects -- Part 5: Reference software AMENDMENT 22: Reference software for 3D Graphics Compression Model (3DGCM) DoC yes Editor of DoC Marius Preda ISO/IEC JTC 1/SC 29 N 9640 :ISO/IEC 144964:2004/FPDAM 34: Information technology -Coding of audio-visual objects --Part 4: yes Marius Preda 153 Conformance testing AMENDMENT 34: Conformance for 3D Graphics Compression Model (3DGCM) ISO/IEC JTC 1/SC 29 N 9638 :ISO/IEC 14496Yes 4:2004/FPDAM 33: Information technology -Coding of audio-visual objects -- Part 4: Conformance testingAMENDMENT 33: Multiresolution profile conformance ISO/IEC JTC 1/SC 29 N 9817 :Combined PDAM Non Registration and PDAM Consideration Ballot onISO/IEC 14496-4:2004/PDAM 39:Information technology --Coding of audio-visual objects -- Part 4: Conformance testing,AMENDMENT 39: Conformance testing for scene partitioning ISO/IEC JTC 1/SC 29 N 9818 : Combined PDAM Non Registration and PDAM Consideration Ballot onISO/IEC 14496-5:2001/PDAM 25:Information technology --Coding of audio-visual objects -- Part 5: Reference software,AMENDMENT 25: Reference software for scene partitioning 4 AFX (14496-16) related activities 4.1 AhG on AFX activities Patrick Gioia Report of AHG on 3DGC documents, experiments and software maintenance Title Authors Patrick Gioia, Francisco Moran Summary See m15945 - use the reflector for exchanges on technology development - Ivica Arsov is responsible for maintaining the 3DG reference software Resolution - Seungwook Lee is responsible for regenerating conformance and maintaining it. 4.2 Scalable Complexity 3D Mesh Compression (14496-16 Amd.4) Title Authors Bitstream Syntax and Semantics for QBCR and SVA Seungwook Lee, Bonki Koo, Daiyong Kim, Kyoungsoo Son, Euee S. Jang - Common header for the SC3DMC + specific header for each of the bitstreams - Open issue: why is OptimizedforParallelDecoding should be signalized in the bitstream Summary - FDmode for SVA should go in the payload - proposes the two manners of encoding the range: case 1 common range for X, Y, Z; case 2 different ranges. Recommendation: use only case 1 Resolution Build a common header for SC3DMC. Title Scalable Complexity Mesh Coding Benchmark 154 Benoit le Bonhomme, Marius Preda, Francoise Prêteux Results for all the SC3DMC tools (TFAN, QBCR and SVA) as well as 3DMC Summary and TG are presented. Resolution Accepted. Authors Title Authors MMW.com API extension for 3D graphics attributes Benoit le Bonhomme, Marius Preda, Francoise Prêteux A new version of the API is available allowing to communicate the attributes Summary between encoding libraries and MMW.com Resolution Accepted. This version should be used in the future experiments Title Authors Summary Resolution Attributes Encoding for TFAN Khaled Mamou, Titus Zaharia, Marius Preda, Françoise Prêteux A syntax is proposed for encoding attributes in TFAN. Accepted and updated as specified in the PDAM (output document) An Explanation of SVA and QBCR En-Decoding Algorithm, CE Report Version 3 on the SC3DMC Authors Kyoungsoo Son, Seungwook Lee, Bonki Koo, Daiyong Kim, Euee S. Jang The current benchmark results for QBCR do not emphasize the fact that QBCR is a low complexity algorithm (the execution time for encoding and decoding Summary are similar to the one of other methods). The contribution presents an analysis of the number of operations for QBCR and SVA. Accepted. However the current implementation of the QBCR decoder should be Resolution revisited. Title Corrections to "WD3.0 of ISO/IEC 14496-16 AMD4, Scalable Complexity 3D Mesh Coding" Authors Sergio Arnaldo, Francisco Morán Burgos This contributions reports on several editorial and technical problems in the Summary current WD. Resolution All the comments were addressed and solved. Title 4.2.1 Scene partitioning (14496-11 Amd.6) SP is followed as a joint activity between Systems and 3DGC. The technology is integrated in Part 11. There was no joint meeting with Systems on this topic during this meeting. SP activity on conformance and reference software continued. 4.3 Maintenance 4.3.1 FAMC Conformance and Reference Software FNB reports on a problem related to FAMC reference software, namely the usage of little endian convention when writing the bitstream. This conducts to errors in parsing the FAMC bitstream when encapsulated in MP4. Resolution: issue a corrigendum on FAMC ref soft and conformance and ask the contributors to update the software and regenerate the bitstreams. 155 AFX 3rd Edition 4.3.2 The document was updated during the week. The final publication is delayed for April 2009 in order to include current corrigendums. 4.4 Dataset and benchmarking For Scalable Complexity 3D Mesh Coding, the www.MyMultimediaWorld.com will be used for benchmarking. 4.5 Software Title Authors Current status of MeshGrid compression software A presentation of the current implementation (including a GUI) of MeshGrid was demonstrated by VUB representatives. Some bugs still occur. Submit the current version (command line) on the SVN and fix the bugs before Resolution the next meeting. Summary Title Authors A Report on the Conformance Test of 3D Graphics Group Daiyong Kim, Seungwook Lee, Kyoungsu Son, Preda Marius All the documents describing the conformance are identified as well as the Summary associated bitstreams Accepted. Collect all the conformance documents within a new part of MPEG-4 Resolution (MPEG-4 Part 27), edit the resuest for subdivision and corrigendum for removing the 3DG conformance from MPEG-4 Part 5. Title Authors A Report on the Reference Software of SC3DMC Seungwook Lee, Bonki Koo, Daiyong Kim, Kyoungsoo Son, Euee S. Jang Current version of the IM1 available on the SVN is not complete (some projects Summary such as AACDecode) are missing. A new version of the software was build during the week. This version should Resolution be commited to SVN and used for checking the conformance bitstreams. 4.6 Promotions 4.6.1 Title Authors Web Site Status of www.mpeg-3dgc.com Patrick Gioia The web site is in beta version but no improvement was done since the last Summary meeting Action Point: Resolution Patrick Gioia will ask more actively contributions for demos from individual parties. 156 4.7 Future 4.7.1 Metaverse) MPEG-V - Information Exchange with Virtual Worlds (formally Title Joint meeting with Systems on MPEG-V Authors Jean Gelissen Summary The MPEG-V WD was reviewed. Action Point: Resolution The avatar information part should only integrate metadata and do links to the media layer already specified by MPEG-4 tools (mesh, animation, texture). Title Authors Avatar Characteristics Jeong-Hwan Ahn This contribution presents a model for representing avatar characteristics of Summary various nature: skeleton configuration, animation types, mental state. The definition is not complete but only at a concept level. Action Point: Resolution Review the existent literature and available systems (VHML, SL, IMVU, …) for avatar metadata. Continue the discussions on the reflector. 4.7.2 MXM Title Authors Report on MXM latest developments Marius Preda Informal discussion on possible impact of MXM activities on technologies Summary developed by 3DG group Action Point: Resolution Actively participate in proposing a complete API for accessing 3D graphics tools. Title Authors MXM use-case proposals for 3D services Patrick Gioia Three applications (Virtual Worlds, 3D GPS and 3D Yellow pages) were Summary presented as well as their requirements with respect to the communication protocol between client and server. It was identified that MXM already proposes some communication protocols. Resolution This should be extended to support communication needed for the above mentioned applications. Title Authors MXM API for 3D Graphics content creation Ivica Arsov, Marius Preda, Francoise Prêteux A complete implementation of the MXM 3D graphics engine was presented. It Summary covers the encoding and decoding API. The documentation was done with Doxygene. Resolution Accepted. 157 4.7.3 Future directions of 3D Graphics Compression Title Authors Joint meeting with video on RVC Marco Mattavelli This is a first presentation of the RVC framework in the 3DG group. The architecture of the framework is generic (being applicable to all kind of media Summary compression), the functional units implemented currently refer for video coding (in a Video Tool Library). Start investigation on a Video Graphics Library. Resolution Make a collection of documentation related to RVC (tutorials, papers, software) Contact person is (Shin Hwaseon, Ketu) L544@keti.re.kr 5 3D Graphics Compression Model (14496-25) activities 5.1 Software and conformance Title Authors Selecting elementary streams in MP25 RefSoft Blagica Jovanova, Marius Preda; Françoise Preteux The selection of the XML elements in COLLADA for transmission to the Summary corresponding encoders is explained. A new GUI is proposed as a wrapper for MP25 encoder and decoder software. Resolution Accepted, the GUI is considered as an utility part of the RefSoft. Commit the new GUI on the SVN Title Authors Source code for Interpolation Compression for MPEP-4 part 25 Sinwook Lee, Sowon Kim, Jeonghwan Ahn, Euee S. Jang Initial implementation for the IC encoder and decoder is available as part of Summary MPEG-4 Part 25 reference software. Some bugs are identified. Resolution Accept the reference software with the condition to fix the bugs. 6 Liaison Title Authors Answer to liaison statement of SC24 All SC24 informs SC29 on the FCD of the Second Edition of ISO/IEC 19775-2, Summary i.e., Part 2 (Scene Access Interface – SAI) of your Extensible 3D (X3D) Include in the answer the fact that ISO/IEC 14496-11:2005, i.e., Part 11 (Scene Resolution description and application engine) includes some components addressed by the new SC24 standard. 158 7 Output documents and Resolutions of 3DGC 7.1 Part 4 7.1.1 Conformance testing The 3DG subgroup recommends approval of the following documents No. Title 14496-4 Conformance testing 10320 DOCR on ISO/IEC 14496-4:2004/FPDAM 33 (Multi Resolution Profile Conformance) 10321 DOCR on ISO/IEC 14496-4:2004/FPDAM 34 (3D Graphics Model Conformance) 10322 Text of ISO/IEC 14496-4:200x/DCOR 7 (Removal of 3DG Conformance) 7.2 Part 5 7.2.1 No. 10323 10324 10325 10326 10327 7.3 09/02/06 No 09/02/06 No 09/02/06 The 3DG subgroup recommends approval of the following documents TBP Available No 09/02/06 Yes 09/02/20 No 09/02/20 No No 09/02/06 09/02/20 The 3DG subgroup recommends nominating Seung Wook Lee (ETRI) and Khaled Mammou (Institut TELECOM) as editors of 14496-5:2001 AMD 27. Management/Liaison 7.3.1 The 3DG subgroup recommends approval of the following documents No. Title 14496-16 Animation Framework eXtension (AFX) 10328 Answer to liaison from W3D 7.4 No Reference Software Title 14496-4 Reference Software DOCR on ISO/IEC 14496-5:2001/FPDAM 22 (3DGCM Reference Software) Text of ISO/IEC 14496-5:2001/FDAM 22 (3DGCM Reference Software) Text ISO/IEC 14496-5:2001/FPDAM 25 (Scene Partitioning Reference Software) Request for Amendment: 14496-5:2001/PDAM27 Text of ISO/IEC 14496-5:2001/PDAM27 (SC3DMC RefSoft) 7.2.2 TBP Available TBP Available No 09/02/06 Part 16 Animation Framework eXtension (AFX) 7.4.1 The 3DG subgroup recommends approval of the following documents No. Title 14496-16 Animation Framework eXtension (AFX) 10329 Text of ISO/IEC 14496-16:2006/PDAM4 (Scalable Complexity 3D Mesh Compression) 10330 CE on Scalable Complexity 3D Mesh Coding 159 TBP Available No 09/02/20 No 09/02/06 10331 WD of ISO/IEC 14496-16 3rd Edition 7.4.2 7.5 Part 27 7.5.1 No. 10332 10333 10334 10433 10335 7.6 No 09/02/06 The 3DG subgroup recommends to add Khaled Mammou (Institut TELECOM) to the editor list of 14496-16:2006 AMD 4. 3D Graphics Conformance The 3DG subgroup recommends approval of the following documents Title 14496-27 Conformance testing Request for subdivision of ISO/IEC 14496-27 Text of ISO/IEC 14496-27:2009/FDIS (3DG Conformance) Text of ISO/IEC 14496-27:2009/FPDAM1 (Scene partitioning conformance) Request for Amendment: 14496-27:2009/PDAM2 (SC3DMC Conformance) Text of ISO/IEC 14496-27:2009/PDAM2 (SC3DMC Conformance) TBP Available No No No 09/02/06 09/02/13 09/02/06 No 09/02/06 No 09/02/20 7.5.2 The 3DG subgroup recommends nominating Daiyong Kim (HYU) and Francisco Morán (UPM) as editors of 14496-27:2009. 7.5.3 The 3DG subgroup recommends nominating Seung Wook Lee (ETRI) and Khaled Mammou (Institut TELECOM) as editors of 14496-27:2009 AMD 2. Establishment of 3DGC Ad-Hoc Groups 10336 Mandate: AHG on 3DGC documents, software maintenance and core experiments 1. Conduct the experiments in Scalable Complexity Mesh Compression 2. Coordinate 3DGC related conformance and reference software 3. Maintain and edit 3DGC documents 4. Coordinate editing of the www.mpeg-3dgc.com web site Chairmen: Francisco Morán Burgos Patrick Gioia Duration: Until 88th Meeting Sunday before 88th meeting Meetings Reflector: mpeg-3dgc AT gti. ssr. upm. es Subscribe: https://mx.gti.ssr.upm.es/mailman/listinfo/mpeg-3dgc 8 Closing of the Meeting See you in Maui. 160